Wang, Juan; Xue, Dong-Xiu; Zhang, Bai-Dong; Li, Yu-Long; Liu, Bing-Jian; Liu, Jin-Xian
2016-01-01
Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs) allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus) is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE) for 30 individuals from two populations. The nucleotide diversity (π) for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001) and the putatively neutral SNPs (FST = 0.0347, P < 0.001). However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001). Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40%) significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus. PMID:27336696
Wang, Juan; Xue, Dong-Xiu; Zhang, Bai-Dong; Li, Yu-Long; Liu, Bing-Jian; Liu, Jin-Xian
2016-01-01
Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs) allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus) is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE) for 30 individuals from two populations. The nucleotide diversity (π) for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001) and the putatively neutral SNPs (FST = 0.0347, P < 0.001). However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001). Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40%) significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus.
Bu, Huajie; Narisu, Narisu; Schlick, Bettina; Rainer, Johannes; Manke, Thomas; Schäfer, Georg; Pasqualini, Lorenza; Chines, Peter; Schweiger, Michal R.; Fuchsberger, Christian
2015-01-01
ABSTRACT Genome‐wide association studies have identified genomic loci, whose single‐nucleotide polymorphisms (SNPs) predispose to prostate cancer (PCa). However, the mechanisms of most of these variants are largely unknown. We integrated chromatin‐immunoprecipitation‐coupled sequencing and microarray expression profiling in TMPRSS2‐ERG gene rearrangement positive DUCaP cells with the GWAS PCa risk SNPs catalog to identify disease susceptibility SNPs localized within functional androgen receptor‐binding sites (ARBSs). Among the 48 GWAS index risk SNPs and 3,917 linked SNPs, 80 were found located in ARBSs. Of these, rs11891426:T>G in an intron of the melanophilin gene (MLPH) was within a novel putative auxiliary AR‐binding motif, which is enriched in the neighborhood of canonical androgen‐responsive elements. T→G exchange attenuated the transcriptional activity of the ARBS in an AR reporter gene assay. The expression of MLPH in primary prostate tumors was significantly lower in those with the G compared with the T allele and correlated significantly with AR protein. Higher melanophilin level in prostate tissue of patients with a favorable PCa risk profile points out a tumor‐suppressive effect. These results unravel a hidden link between AR and a functional putative PCa risk SNP, whose allele alteration affects androgen regulation of its host gene MLPH. PMID:26411452
DOE Office of Scientific and Technical Information (OSTI.GOV)
Banerjee, Poulabi; Bahlo, Melanie; Schwartz, Jody R.
2002-01-01
Genome wide disease association analysis using SNPs is being explored as a method for dissecting complex genetic traits and a vast number of SNPs have been generated for this purpose. As there are cost and throughput limitations of genotyping large numbers of SNPs and statistical issues regarding the large number of dependent tests on the same data set, to make association analysis practical it has been proposed that SNPs should be prioritized based on likely functional importance. The most easily identifiable functional SNPs are coding SNPs (cSNPs) and accordingly cSNPs have been screened in a number of studies. SNPs inmore » gene regulatory sequences embedded in noncoding DNA are another class of SNPs suggested for prioritization due to their predicted quantitative impact on gene expression. The main challenge in evaluating these SNPs, in contrast to cSNPs is a lack of robust algorithms and databases for recognizing regulatory sequences in noncoding DNA. Approaches that have been previously used to delineate noncoding sequences with gene regulatory activity include cross-species sequence comparisons and the search for sequences recognized by transcription factors. We combined these two methods to sift through mouse human genomic sequences to identify putative gene regulatory elements and subsequently localized SNPs within these sequences in a 1 Megabase (Mb) region of human chromosome 5q31, orthologous to mouse chromosome 11 containing the Interleukin cluster.« less
Patterns of population differentiation of candidate genes for cardiovascular disease.
Kullo, Iftikhar J; Ding, Keyue
2007-07-12
The basis for ethnic differences in cardiovascular disease (CVD) susceptibility is not fully understood. We investigated patterns of population differentiation (FST) of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI), Utah residents with European ancestry (CEU), and Han Chinese (CHB) + Japanese (JPT). We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism). Genotype data were obtained from the HapMap database. We calculated FST for 15,559 common SNPs (minor allele frequency > or = 0.10 in at least one population) in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions) or non-functional (intronic and synonymous sites). Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. These results suggest a possible basis for varying susceptibility to CVD among ethnic groups.
Al-Tobasei, Rafet; Ali, Ali; Leeds, Timothy D; Liu, Sixin; Palti, Yniv; Kenney, Brett; Salem, Mohamed
2017-08-07
Coding/functional SNPs change the biological function of a gene and, therefore, could serve as "large-effect" genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait). GATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7-93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways. These results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout.
Patterns of population differentiation of candidate genes for cardiovascular disease
Kullo, Iftikhar J; Ding, Keyue
2007-01-01
Background The basis for ethnic differences in cardiovascular disease (CVD) susceptibility is not fully understood. We investigated patterns of population differentiation (FST) of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI), Utah residents with European ancestry (CEU), and Han Chinese (CHB) + Japanese (JPT). We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism). Genotype data were obtained from the HapMap database. Results We calculated FST for 15,559 common SNPs (minor allele frequency ≥ 0.10 in at least one population) in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions) or non-functional (intronic and synonymous sites). Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. Conclusion These results suggest a possible basis for varying susceptibility to CVD among ethnic groups. PMID:17626638
Localization of Allotetraploid Gossypium SNPs Using Physical Mapping Resources
USDA-ARS?s Scientific Manuscript database
Recent efforts in Gossypium SNP development have produced thousands of putative SNPs for G. barbadense, G. mustelinum, and G. tomentosum relative to G. hirsutum. Here we report on current efforts to localize putative SNPs using physical mapping resources. Recent advances in physical mapping resour...
Leal-Gutiérrez, Joel D.; Elzo, Mauricio A.; Johnson, Dwain D.; Scheffler, Tracy L.; Scheffler, Jason M.; Mateescu, Raluca G.
2018-01-01
Autogenous proteolytic enzymes of the calpain family are implicated in myofibrillar protein degradation. As a result, the μ-calpain gene and its specific inhibitor, calpastatin, have been repeatedly investigated for their association with meat quality traits in cattle; however, no functional mutation has been identified for these two genes. The objectives of this study were: (1) to assess breed composition effect on tenderness; (2) to perform a linkage disequilibrium (LD) analysis in μ-calpain and calpastatin genes as well as an association analyses with tenderness; and (3) to analyze putative functional SNPs inside the significant LD block for an effect on tenderness. Tenderness measurements and genotypes for 16 SNPs in μ-calpain gene and 28 SNPs in calpastatin gene from 673 steers were analyzed. A bioinformatic analysis identified “putative functional SNPs” inside the associated LD block – polymorphisms able to produce a physical and/or chemical change in the DNA, mRNA, or translated protein in silico. Breed composition had a significant (P < 0.0001) effect on tenderness where animals with more than 80% Angus composition had the most tender meat. One 11-kb LD-block and three LD-blocks of 37, 17, and 14 kb in length were identified in the μ-calpain and calpastatin genes, respectively. Out of these, the LD-block 3 in calpastatin, tagged by SNPs located at 7-98566391 and 7-98581038, had a significant effect on tenderness with the TG-CG diplotype being approximately 1 kg more tender than the toughest diplotype, TG-CG. A total of 768 SNPs in the LD-block 3 of calpastatin were included in the bioinformatic analysis, and 28 markers were selected as putative functional SNPs inside the LD-block 3 of calpastatin; however, none of them were polymorphic in this population. Out of 15 initial polymorphisms segregating inside the LD-block 3 of calpastatin in this population, markers ARSUSMARC116, Cast5, rs730723459, and rs210861835 were found to be significantly associated with tenderness. PMID:29520298
Regulatory single nucleotide polymorphisms (rSNPs) at the promoters 1A and 1B of the human APC gene.
Matveeva, Marina Yu; Kashina, Elena V; Reshetnikov, Vasily V; Bryzgalov, Leonid O; Antontseva, Elena V; Bondar, Natalia P; Merkulova, Tatiana I
2016-12-22
Germline mutations in the coding sequence of the tumour suppressor APC gene give rise to familial adenomatous polyposis (which leads to colorectal cancer) and are associated with many other oncopathologies. The loss of APC function because of deletion of putative promoter 1A or 1B also results in the development of colorectal cancer. Since the regions of promoters 1A and 1B contain many single nucleotide polymorphisms (SNPs), the aim of this study was to perform functional analysis of some of these SNPs by means of an electrophoretic mobility shift assay (EMSA) and a luciferase reporter assay. First, it was shown that both putative promoters of APC (1A and 1B) drive transcription in an in vitro reporter experiment. From eleven randomly selected SNPs of promoter 1A and four SNPs of promoter 1B, nine and two respectively showed differential patterns of binding of nuclear proteins to oligonucleotide probes corresponding to alternative alleles. The luciferase reporter assay showed that among the six SNPs tested, the rs75612255 C allele and rs113017087 C allele in promoter 1A as well as the rs138386816 T allele and rs115658307 T allele in promoter 1B significantly increased luciferase activity in the human erythromyeloblastoid leukaemia cell line K562. In human colorectal cancer HCT-116 cells, none of the substitutions under study had any effect, with the exception of minor allele G of rs79896135 in promoter 1B. This allele significantly decreased the luciferase reporter's activity CONCLUSION: Our results indicate that many SNPs in APC promoters 1A and 1B are functionally relevant and that allele G of rs79896135 may be associated with the predisposition to colorectal cancer.
Meng, Xiang-He; Shen, Hui; Chen, Xiang-Ding; Xiao, Hong-Mei; Deng, Hong-Wen
2018-03-01
Genome-wide association studies (GWAS) have successfully identified numerous genetic variants associated with diverse complex phenotypes and diseases, and provided tremendous opportunities for further analyses using summary association statistics. Recently, Pickrell et al. developed a robust method for causal inference using independent putative causal SNPs. However, this method may fail to infer the causal relationship between two phenotypes when only a limited number of independent putative causal SNPs identified. Here, we extended Pickrell's method to make it more applicable for the general situations. We extended the causal inference method by replacing the putative causal SNPs with the lead SNPs (the set of the most significant SNPs in each independent locus) and tested the performance of our extended method using both simulation and empirical data. Simulations suggested that when the same number of genetic variants is used, our extended method had similar distribution of test statistic under the null model as well as comparable power under the causal model compared with the original method by Pickrell et al. But in practice, our extended method would generally be more powerful because the number of independent lead SNPs was often larger than the number of independent putative causal SNPs. And including more SNPs, on the other hand, would not cause more false positives. By applying our extended method to summary statistics from GWAS for blood metabolites and femoral neck bone mineral density (FN-BMD), we successfully identified ten blood metabolites that may causally influence FN-BMD. We extended a causal inference method for inferring putative causal relationship between two phenotypes using summary statistics from GWAS, and identified a number of potential causal metabolites for FN-BMD, which may provide novel insights into the pathophysiological mechanisms underlying osteoporosis.
Bruque, Carlos D; Delea, Marisol; Fernández, Cecilia S; Orza, Juan V; Taboas, Melisa; Buzzalino, Noemí; Espeche, Lucía D; Solari, Andrea; Luccerini, Verónica; Alba, Liliana; Nadra, Alejandro D; Dain, Liliana
2016-12-14
Congenital adrenal hyperplasia due to 21-hydroxylase deficiency accounts for 90-95% of CAH cases. In this work we performed an extensive survey of mutations and SNPs modifying the coding sequence of the CYP21A2 gene. Using bioinformatic tools and two plausible CYP21A2 structures as templates, we initially classified all known mutants (n = 343) according to their putative functional impacts, which were either reported in the literature or inferred from structural models. We then performed a detailed analysis on the subset of mutations believed to exclusively impact protein stability. For those mutants, the predicted stability was calculated and correlated with the variant's expected activity. A high concordance was obtained when comparing our predictions with available in vitro residual activities and/or the patient's phenotype. The predicted stability and derived activity of all reported mutations and SNPs lacking functional assays (n = 108) were assessed. As expected, most of the SNPs (52/76) showed no biological implications. Moreover, this approach was applied to evaluate the putative synergy that could emerge when two mutations occurred in cis. In addition, we propose a putative pathogenic effect of five novel mutations, p.L107Q, p.L122R, p.R132H, p.P335L and p.H466fs, found in 21-hydroxylase deficient patients of our cohort.
Bruque, Carlos D.; Delea, Marisol; Fernández, Cecilia S.; Orza, Juan V.; Taboas, Melisa; Buzzalino, Noemí; Espeche, Lucía D.; Solari, Andrea; Luccerini, Verónica; Alba, Liliana; Nadra, Alejandro D.; Dain, Liliana
2016-01-01
Congenital adrenal hyperplasia due to 21-hydroxylase deficiency accounts for 90–95% of CAH cases. In this work we performed an extensive survey of mutations and SNPs modifying the coding sequence of the CYP21A2 gene. Using bioinformatic tools and two plausible CYP21A2 structures as templates, we initially classified all known mutants (n = 343) according to their putative functional impacts, which were either reported in the literature or inferred from structural models. We then performed a detailed analysis on the subset of mutations believed to exclusively impact protein stability. For those mutants, the predicted stability was calculated and correlated with the variant’s expected activity. A high concordance was obtained when comparing our predictions with available in vitro residual activities and/or the patient’s phenotype. The predicted stability and derived activity of all reported mutations and SNPs lacking functional assays (n = 108) were assessed. As expected, most of the SNPs (52/76) showed no biological implications. Moreover, this approach was applied to evaluate the putative synergy that could emerge when two mutations occurred in cis. In addition, we propose a putative pathogenic effect of five novel mutations, p.L107Q, p.L122R, p.R132H, p.P335L and p.H466fs, found in 21-hydroxylase deficient patients of our cohort. PMID:27966633
Reid, Brett M; Permuth, Jennifer B; Chen, Y Ann; Teer, Jamie K; Monteiro, Alvaro N A; Chen, Zhihua; Tyrer, Jonathan; Berchuck, Andrew; Chenevix-Trench, Georgia; Doherty, Jennifer A; Goode, Ellen L; Iverson, Edwin S; Lawrenson, Kate; Pearce, Celeste L; Pharoah, Paul D; Phelan, Catherine M; Ramus, Susan J; Rossing, Mary Anne; Schildkraut, Joellen M; Cheng, Jin Q; Gayther, Simon A; Sellers, Thomas A
2017-01-01
Genome-wide association studies (GWAS) have identified multiple loci associated with epithelial ovarian cancer (EOC) susceptibility, but further progress requires integration of epidemiology and biology to illuminate true risk loci below genome-wide significance levels (P < 5 × 10 -8 ). Most risk SNPs lie within non-protein-encoding regions, and we hypothesize that long noncoding RNA (lncRNA) genes are enriched at EOC risk regions and represent biologically relevant functional targets. Using imputed GWAS data from about 18,000 invasive EOC cases and 34,000 controls of European ancestry, the GENCODE (v19) lncRNA database was used to annotate SNPs from 13,442 lncRNAs for permutation-based enrichment analysis. Tumor expression quantitative trait locus (eQTL) analysis was performed for sub-genome-wide regions (1 × 10 -5 > P > 5 × 10 -8 ) overlapping lncRNAs. Of 5,294 EOC-associated SNPs (P < 1.0 × 10 -5 ), 1,464 (28%) mapped within 53 unique lncRNAs and an additional 3,484 (66%) SNPs were correlated (r 2 > 0.2) with SNPs within 115 lncRNAs. EOC-associated SNPs comprised 130 independent regions, of which 72 (55%) overlapped with lncRNAs, representing a significant enrichment (P = 5.0 × 10 -4 ) that was more pronounced among a subset of 5,401 lncRNAs with active epigenetic regulation in normal ovarian tissue. EOC-associated lncRNAs and their putative promoters and transcription factors were enriched for biologically relevant pathways and eQTL analysis identified five novel putative risk regions with allele-specific effects on lncRNA gene expression. lncRNAs are significantly enriched at EOC risk regions, suggesting a mechanistic role for lncRNAs in driving predisposition to EOC. lncRNAs represent key candidates for integrative epidemiologic and functional studies. Further research on their biologic role in ovarian cancer is indicated. Cancer Epidemiol Biomarkers Prev; 26(1); 116-25. ©2016 AACR. ©2016 American Association for Cancer Research.
Tissue-Specific Enrichment of Lymphoma Risk Loci in Regulatory Elements
Hayes, James E.; Trynka, Gosia; Vijai, Joseph; Offit, Kenneth; Raychaudhuri, Soumya; Klein, Robert J.
2015-01-01
Though numerous polymorphisms have been associated with risk of developing lymphoma, how these variants function to promote tumorigenesis is poorly understood. Here, we report that lymphoma risk SNPs, especially in the non-Hodgkin’s lymphoma subtype chronic lymphocytic leukemia, are significantly enriched for co-localization with epigenetic marks of active gene regulation. These enrichments were seen in a lymphoid-specific manner for numerous ENCODE datasets, including DNase-hypersensitivity as well as multiple segmentation-defined enhancer regions. Furthermore, we identify putatively functional SNPs that are both in regulatory elements in lymphocytes and are associated with gene expression changes in blood. We developed an algorithm, UES, that uses a Monte Carlo simulation approach to calculate the enrichment of previously identified risk SNPs in various functional elements. This multiscale approach integrating multiple datasets helps disentangle the underlying biology of lymphoma, and more broadly, is generally applicable to GWAS results from other diseases as well. PMID:26422229
2012-01-01
Background Single nucleotide polymorphism (SNP) validation and large-scale genotyping are required to maximize the use of DNA sequence variation and determine the functional relevance of candidate genes for complex stress tolerance traits through genetic association in rice. We used the bead array platform-based Illumina GoldenGate assay to validate and genotype SNPs in a select set of stress-responsive genes to understand their functional relevance and study the population structure in rice. Results Of the 384 putative SNPs assayed, we successfully validated and genotyped 362 (94.3%). Of these 325 (84.6%) showed polymorphism among the 91 rice genotypes examined. Physical distribution, degree of allele sharing, admixtures and introgression, and amino acid replacement of SNPs in 263 abiotic and 62 biotic stress-responsive genes provided clues for identification and targeted mapping of trait-associated genomic regions. We assessed the functional and adaptive significance of validated SNPs in a set of contrasting drought tolerant upland and sensitive lowland rice genotypes by correlating their allelic variation with amino acid sequence alterations in catalytic domains and three-dimensional secondary protein structure encoded by stress-responsive genes. We found a strong genetic association among SNPs in the nine stress-responsive genes with upland and lowland ecological adaptation. Higher nucleotide diversity was observed in indica accessions compared with other rice sub-populations based on different population genetic parameters. The inferred ancestry of 16% among rice genotypes was derived from admixed populations with the maximum between upland aus and wild Oryza species. Conclusions SNPs validated in biotic and abiotic stress-responsive rice genes can be used in association analyses to identify candidate genes and develop functional markers for stress tolerance in rice. PMID:22921105
Sánchez, Cecilia Castaño; Smith, Timothy P L; Wiedmann, Ralph T; Vallejo, Roger L; Salem, Mohamed; Yao, Jianbo; Rexroad, Caird E
2009-11-25
To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population. The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme HaeIII; sequencing produced 2,000,000 reads providing an average 6 fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends). Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183) of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In addition, 2% of the sequences from the validated markers were associated with rainbow trout transcripts. The use of reduced representation libraries and pyrosequencing technology proved to be an effective strategy for the discovery of a high number of putative SNPs in rainbow trout; however, modifications to the technique to decrease the false discovery rate resulting from the evolutionary recent genome duplication would be desirable.
2011-01-01
Background Many plants have large and complex genomes with an abundance of repeated sequences. Many plants are also polyploid. Both of these attributes typify the genome architecture in the tribe Triticeae, whose members include economically important wheat, rye and barley. Large genome sizes, an abundance of repeated sequences, and polyploidy present challenges to genome-wide SNP discovery using next-generation sequencing (NGS) of total genomic DNA by making alignment and clustering of short reads generated by the NGS platforms difficult, particularly in the absence of a reference genome sequence. Results An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions from repetitive sequences and sequences shared by paralogous genes. Multiple genome equivalents of shotgun reads of another genotype generated with SOLiD or Solexa are then mapped to the annotated Roche 454 reads to identify putative SNPs. A pipeline program package, AGSNP, was developed and used for genome-wide SNP discovery in Aegilops tauschii-the diploid source of the wheat D genome, and with a genome size of 4.02 Gb, of which 90% is repetitive sequences. Genomic DNA of Ae. tauschii accession AL8/78 was sequenced with the Roche 454 NGS platform. Genomic DNA and cDNA of Ae. tauschii accession AS75 was sequenced primarily with SOLiD, although some Solexa and Roche 454 genomic sequences were also generated. A total of 195,631 putative SNPs were discovered in gene sequences, 155,580 putative SNPs were discovered in uncharacterized single-copy regions, and another 145,907 putative SNPs were discovered in repeat junctions. These SNPs were dispersed across the entire Ae. tauschii genome. To assess the false positive SNP discovery rate, DNA containing putative SNPs was amplified by PCR from AL8/78 and AS75 and resequenced with the ABI 3730 xl. In a sample of 302 randomly selected putative SNPs, 84.0% in gene regions, 88.0% in repeat junctions, and 81.3% in uncharacterized regions were validated. Conclusion An annotation-based genome-wide SNP discovery pipeline for NGS platforms was developed. The pipeline is suitable for SNP discovery in genomic libraries of complex genomes and does not require a reference genome sequence. The pipeline is applicable to all current NGS platforms, provided that at least one such platform generates relatively long reads. The pipeline package, AGSNP, and the discovered 497,118 Ae. tauschii SNPs can be accessed at (http://avena.pw.usda.gov/wheatD/agsnp.shtml). PMID:21266061
Elastic-net regularization approaches for genome-wide association studies of rheumatoid arthritis.
Cho, Seoae; Kim, Haseong; Oh, Sohee; Kim, Kyunga; Park, Taesung
2009-12-15
The current trend in genome-wide association studies is to identify regions where the true disease-causing genes may lie by evaluating thousands of single-nucleotide polymorphisms (SNPs) across the whole genome. However, many challenges exist in detecting disease-causing genes among the thousands of SNPs. Examples include multicollinearity and multiple testing issues, especially when a large number of correlated SNPs are simultaneously tested. Multicollinearity can often occur when predictor variables in a multiple regression model are highly correlated, and can cause imprecise estimation of association. In this study, we propose a simple stepwise procedure that identifies disease-causing SNPs simultaneously by employing elastic-net regularization, a variable selection method that allows one to address multicollinearity. At Step 1, the single-marker association analysis was conducted to screen SNPs. At Step 2, the multiple-marker association was scanned based on the elastic-net regularization. The proposed approach was applied to the rheumatoid arthritis (RA) case-control data set of Genetic Analysis Workshop 16. While the selected SNPs at the screening step are located mostly on chromosome 6, the elastic-net approach identified putative RA-related SNPs on other chromosomes in an increased proportion. For some of those putative RA-related SNPs, we identified the interactions with sex, a well known factor affecting RA susceptibility.
On marker-based parentage verification via non-linear optimization.
Boerner, Vinzent
2017-06-15
Parentage verification by molecular markers is mainly based on short tandem repeat markers. Single nucleotide polymorphisms (SNPs) as bi-allelic markers have become the markers of choice for genotyping projects. Thus, the subsequent step is to use SNP genotypes for parentage verification as well. Recent developments of algorithms such as evaluating opposing homozygous SNP genotypes have drawbacks, for example the inability of rejecting all animals of a sample of potential parents. This paper describes an algorithm for parentage verification by constrained regression which overcomes the latter limitation and proves to be very fast and accurate even when the number of SNPs is as low as 50. The algorithm was tested on a sample of 14,816 animals with 50, 100 and 500 SNP genotypes randomly selected from 40k genotypes. The samples of putative parents of these animals contained either five random animals, or four random animals and the true sire. Parentage assignment was performed by ranking of regression coefficients, or by setting a minimum threshold for regression coefficients. The assignment quality was evaluated by the power of assignment (P[Formula: see text]) and the power of exclusion (P[Formula: see text]). If the sample of putative parents contained the true sire and parentage was assigned by coefficient ranking, P[Formula: see text] and P[Formula: see text] were both higher than 0.99 for the 500 and 100 SNP genotypes, and higher than 0.98 for the 50 SNP genotypes. When parentage was assigned by a coefficient threshold, P[Formula: see text] was higher than 0.99 regardless of the number of SNPs, but P[Formula: see text] decreased from 0.99 (500 SNPs) to 0.97 (100 SNPs) and 0.92 (50 SNPs). If the sample of putative parents did not contain the true sire and parentage was rejected using a coefficient threshold, the algorithm achieved a P[Formula: see text] of 1 (500 SNPs), 0.99 (100 SNPs) and 0.97 (50 SNPs). The algorithm described here is easy to implement, fast and accurate, and is able to assign parentage using genomic marker data with a size as low as 50 SNPs.
Report on the development of putative functional SSR and SNP markers in passion fruits.
da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro
2017-09-06
Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis, (the passion fruit's main bacterial pathogen that attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.
USDA-ARS?s Scientific Manuscript database
The objective of this study was to develop a canonical SNP panel for subtyping of Shiga-toxin producing Escherichia coli (STEC). To this purpose, 906 putative SNPs were identified using resequencing tiling arrays. A subset of 391 SNPs was further screened using high-throughput TaqMan PCR against a d...
SNP discovery by high-throughput sequencing in soybean
2010-01-01
Background With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD), have been widely applied to identify genome-wide sequence variations. However, it is still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology. Results A total of 39,022 putative SNPs were identified by the Illumina/Solexa sequencing system using a reduced representation DNA library of two parental lines of a mapping population. The validation rates of these putative SNPs predicted with low and high stringency were 72% and 85%, respectively. One hundred sixty four SNP markers resulted from the validation of putative SNPs and have been selectively chosen to target a known QTL, thereby increasing the marker density of the targeted region to one marker per 42 K bp. Conclusions We have demonstrated how to quickly identify large numbers of SNPs for fine mapping of QTL regions by applying massively parallel sequencing combined with genome complexity reduction techniques. This SNP discovery approach is more efficient for targeting multiple QTL regions in a same genetic population, which can be applied to other crops. PMID:20701770
Genome-wide association analysis for feed efficiency in Angus cattle.
Rolf, M M; Taylor, J F; Schnabel, R D; McKay, S D; McClure, M C; Northcutt, S L; Kerley, M S; Weaber, R L
2012-08-01
Estimated breeding values for average daily feed intake (AFI; kg/day), residual feed intake (RFI; kg/day) and average daily gain (ADG; kg/day) were generated using a mixed linear model incorporating genomic relationships for 698 Angus steers genotyped with the Illumina BovineSNP50 assay. Association analyses of estimated breeding values (EBVs) were performed for 41,028 single nucleotide polymorphisms (SNPs), and permutation analysis was used to empirically establish the genome-wide significance threshold (P < 0.05) for each trait. SNPs significantly associated with each trait were used in a forward selection algorithm to identify genomic regions putatively harbouring genes with effects on each trait. A total of 53, 66 and 68 SNPs explained 54.12% (24.10%), 62.69% (29.85%) and 55.13% (26.54%) of the additive genetic variation (when accounting for the genomic relationships) in steer breeding values for AFI, RFI and ADG, respectively, within this population. Evaluation by pathway analysis revealed that many of these SNPs are in genomic regions that harbour genes with metabolic functions. The presence of genetic correlations between traits resulted in 13.2% of SNPs selected for AFI and 4.5% of SNPs selected for RFI also being selected for ADG in the analysis of breeding values. While our study identifies panels of SNPs significant for efficiency traits in our population, validation of all SNPs in independent populations will be necessary before commercialization. © 2011 The Authors, Animal Genetics © 2011 Stichting International Foundation for Animal Genetics.
Yoneyama, Sachiko; Yao, Jie; Guo, Xiuqing; Fernandez-Rhodes, Lindsay; Lim, Unhee; Boston, Jonathan; Buzková, Petra; Carlson, Christopher S.; Cheng, Iona; Cochran, Barbara; Cooper, Richard; Ehret, Georg; Fornage, Myriam; Gong, Jian; Gross, Myron; Gu, C. Charles; Haessler, Jeff; Haiman, Christopher A.; Henderson, Brian; Hindorff, Lucia A.; Houston, Denise; Irvin, Marguerite R.; Jackson, Rebecca; Kuller, Lew; Leppert, Mark; Lewis, Cora E.; Li, Rongling; Le Marchand, Loic; Matise, Tara C.; Nguyen, Khanh-Dung H.; Chakravarti, Aravinda; Pankow, James S.; Pankratz, Nathan; Pooler, Loreall; Ritchie, Marylyn D.; Bien, Stephanie A.; Wassel, Christina L.; Chen, Yii-Der I.; Taylor, Kent D.; Allison, Matthew; Rotter, Jerome I.; Schreiner, Pamela J.; Schumacher, Fredrick; Wilkens, Lynne; Boerwinkle, Eric; Kooperberg, Charles; Peters, Ulrike; Buyske, Steven; Graff, Mariaelisa; North, Kari E.
2016-01-01
Background/Objectives Central adiposity measures such as waist circumference (WC) and waist-to-hip ratio (WHR) are associated with cardiometabolic disorders independently of BMI and are gaining clinically utility. Several studies report genetic variants associated with central adiposity, but most utilize only European ancestry populations. Understanding whether the genetic associations discovered among mainly European descendants are shared with African ancestry populations will help elucidate the biological underpinnings of abdominal fat deposition. Subjects/Methods To identify the underlying functional genetic determinants of body fat distribution, we conducted an array-wide association meta-analysis among persons of African ancestry across seven studies/consortia participating in the Population Architecture using Genomics and Epidemiology (PAGE) consortium. We used the Metabochip array, designed for fine mapping cardiovascular associated loci, to explore novel array-wide associations with WC and WHR among 15 945 African descendants using all and sex-stratified groups. We further interrogated 17 known WHR regions for African ancestry-specific variants. Results Of the 17 WHR loci, eight SNPs located in four loci were replicated in the sex-combined or sex-stratified meta-analyses. Two of these eight independently associated with WHR after conditioning on the known variant in European descendants (rs12096179 in TBX15-WARS2 and rs2059092 in ADAMTS9). In the fine mapping assessment, the putative functional region was reduced across all four loci but to varying degrees (average 40% drop in number of putative SNPs and 20% drop in genomic region). Similar to previous studies, the significant SNPs in the female stratified analysis were stronger than the significant SNPs from the sex-combined analysis. No novel associations were detected in the array-wide analyses. Conclusions Of 17 previously identified loci, four loci replicated in the African ancestry populations of this study. Utilizing different linkage disequilibrium patterns observed between European and African ancestries, we narrowed the suggestive region containing causative variants for all four loci. PMID:27867202
Lawrenson, Kate; Kar, Siddhartha; McCue, Karen; Kuchenbaeker, Karoline; Michailidou, Kyriaki; Tyrer, Jonathan; Beesley, Jonathan; Ramus, Susan J.; Li, Qiyuan; Delgado, Melissa K.; Lee, Janet M.; Aittomäki, Kristiina; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Arun, Banu K.; Arver, Brita; Bandera, Elisa V.; Barile, Monica; Barkardottir, Rosa B.; Barrowdale, Daniel; Beckmann, Matthias W.; Benitez, Javier; Berchuck, Andrew; Bisogna, Maria; Bjorge, Line; Blomqvist, Carl; Blot, William; Bogdanova, Natalia; Bojesen, Anders; Bojesen, Stig E.; Bolla, Manjeet K.; Bonanni, Bernardo; Børresen-Dale, Anne-Lise; Brauch, Hiltrud; Brennan, Paul; Brenner, Hermann; Bruinsma, Fiona; Brunet, Joan; Buhari, Shaik Ahmad; Burwinkel, Barbara; Butzow, Ralf; Buys, Saundra S.; Cai, Qiuyin; Caldes, Trinidad; Campbell, Ian; Canniotto, Rikki; Chang-Claude, Jenny; Chiquette, Jocelyne; Choi, Ji-Yeob; Claes, Kathleen B. M.; Collonge-Rame, Marie- Agnès; Damette, Alexandre; Barouk-Simonet, Emmanuelle; Bonnet, Françoise; Bubien, Virginie; Sevenet, Nicolas; Longy, Michel; Berthet, Pascaline; Vaur, Dominique; Castera, Laurent; Ferrer, Sandra Fert; Bignon, Yves-Jean; Uhrhammer, Nancy; Coron, Fanny; Faivre, Laurence; Baurand, Amandine; Jacquot, Caroline; Bertolone, Geoffrey; Lizard, Sarab; Leroux, Dominique; Dreyfus, Hélène; Rebischung, Christine; Peysselon, Magalie; Peyrat, Jean-Philippe; Fournier, Joëlle; Révillion, Françoise; Adenis, Claude; Vénat-Bouvet, Laurence; Léone, Mélanie; Boutry-Kryza, Nadia; Calender, Alain; Giraud, Sophie; Verny-Pierre, Carole; Lasset, Christine; Bonadona, Valérie; Barjhoux, Laure; Sobol, Hagay; Bourdon, Violaine; Noguchi, Tetsuro; Remenieras, Audrey; Coupier, Isabelle; Pujol, Pascal; Sokolowska, Johanna; Bronner, Myriam; Delnatte, Capucine; Bézieau, Stéphane; Mari, Véronique; Gauthier-Villars, Marion; Buecher, Bruno; Rouleau, Etienne; Golmard, Lisa; Moncoutier, Virginie; Belotti, Muriel; de Pauw, Antoine; Elan, Camille; Fourme, Emmanuelle; Birot, Anne-Marie; Saule, Claire; Laurent, Maïté; Houdayer, Claude; Lesueur, Fabienne; Mebirouk, Noura; Coulet, Florence; Colas, Chrystelle; Soubrier, Florent; Warcoin, Mathilde; Prieur, Fabienne; Lebrun, Marine; Kientz, Caroline; Muller, Danièle; Fricker, Jean-Pierre; Toulas, Christine; Guimbaud, Rosine; Gladieff, Laurence; Feillel, Viviane; Mortemousque, Isabelle; Bressac-de-Paillerets, Brigitte; Caron, Olivier; Guillaud-Bataille, Marine; Cook, Linda S.; Cox, Angela; Cramer, Daniel W.; Cross, Simon S.; Cybulski, Cezary; Czene, Kamila; Daly, Mary B.; Damiola, Francesca; Dansonka-Mieszkowska, Agnieszka; Darabi, Hatef; Dennis, Joe; Devilee, Peter; Diez, Orland; Doherty, Jennifer A.; Domchek, Susan M.; Dorfling, Cecilia M.; Dörk, Thilo; Dumont, Martine; Ehrencrona, Hans; Ejlertsen, Bent; Ellis, Steve; Gregory, Helen; Miedzybrodzka, Zosia; Morrison, Patrick J.; Donaldson, Alan; Rogers, Mark T.; Kennedy, M. John; Porteous, Mary E.; Brady, Angela; Barwell, Julian; Foo, Claire; Lalloo, Fiona; Side, Lucy E.; Eason, Jacqueline; Henderson, Alex; Walker, Lisa; Cook, Jackie; Snape, Katie; Murray, Alex; McCann, Emma; Engel, Christoph; Lee, Eunjung; Evans, D. Gareth; Fasching, Peter A.; Feliubadalo, Lidia; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Foretova, Lenka; Fostira, Florentia; Foulkes, William D.; Fridley, Brooke L.; Friedman, Eitan; Frost, Debra; Gambino, Gaetana; Ganz, Patricia A.; Garber, Judy; García-Closas, Montserrat; Gentry-Maharaj, Aleksandra; Ghoussaini, Maya; Giles, Graham G.; Glasspool, Rosalind; Godwin, Andrew K.; Goldberg, Mark S.; Goldgar, David E.; González-Neira, Anna; Goode, Ellen L.; Goodman, Marc T.; Greene, Mark H.; Gronwald, Jacek; Guénel, Pascal; Haiman, Christopher A.; Hall, Per; Hallberg, Emily; Hamann, Ute; Hansen, Thomas V. O.; Harrington, Patricia A.; Hartman, Mikael; Hassan, Norhashimah; Healey, Sue; Rookus, M. A.; van Leeuwen, F. E.; van der Kolk, L. E.; Schmidt, M. K.; Russell, N. S.; de Lange, J. L.; Wijnands, R.; Collée, J. M.; Hooning, M. J.; Seynaeve, C.; van Deurzen, C. H. M.; Obdeijn, I. M.; van Asperen, C. J.; Tollenaar, R. A. E. M.; van Cronenburg, T. C. T. E. F.; Kets, C. M.; Ausems, M. G. E. M.; van der Pol, C. C.; van Os, T. A. M.; Waisfisz, Q.; Meijers-Heijboer, H. E. J.; Gómez-Garcia, E. B.; Oosterwijk, J. C.; Mourits, M. J.; de Bock, G. H.; Vasen, H. F.; Siesling, S.; Verloop, J.; Overbeek, L. I. H.; Heitz, Florian; Herzog, Josef; Høgdall, Estrid; Høgdall, Claus K.; Hogervorst, Frans B. L.; Hollestelle, Antoinette; Hopper, John L.; Hulick, Peter J.; Huzarski, Tomasz; Imyanitov, Evgeny N.; Fox, Stephen; Kirk, Judy; Lindeman, Geoff; Price, Melanie; Bowtell, David; deFazio, Anna; Webb, Penny; Isaacs, Claudine; Ito, Hidemi; Jakubowska, Anna; Janavicius, Ramunas; Jensen, Allan; John, Esther M.; Johnson, Nichola; Kabisch, Maria; Kang, Daehee; Kapuscinski, Miroslav; Karlan, Beth Y.; Khan, Sofia; Kiemeney, Lambertus A.; Kjaer, Susanne Kruger; Knight, Julia A.; Konstantopoulou, Irene; Kosma, Veli-Matti; Kristensen, Vessela; Kupryjanczyk, Jolanta; Kwong, Ava; de la Hoya, Miguel; Laitman, Yael; Lambrechts, Diether; Le, Nhu; De Leeneer, Kim; Lester, Jenny; Levine, Douglas A.; Li, Jingmei; Lindblom, Annika; Long, Jirong; Lophatananon, Artitaya; Loud, Jennifer T.; Lu, Karen; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Le Marchand, Loic; Margolin, Sara; Marme, Frederik; Massuger, Leon F. A. G.; Matsuo, Keitaro; Mazoyer, Sylvie; McGuffog, Lesley; McLean, Catriona; McNeish, Iain; Meindl, Alfons; Menon, Usha; Mensenkamp, Arjen R.; Milne, Roger L.; Montagna, Marco; Moysich, Kirsten B.; Muir, Kenneth; Mulligan, Anna Marie; Nathanson, Katherine L.; Ness, Roberta B.; Neuhausen, Susan L.; Nevanlinna, Heli; Nord, Silje; Nussbaum, Robert L.; Odunsi, Kunle; Offit, Kenneth; Olah, Edith; Olopade, Olufunmilayo I.; Olson, Janet E.; Olswold, Curtis; O'Malley, David; Orlow, Irene; Orr, Nick; Osorio, Ana; Park, Sue Kyung; Pearce, Celeste L.; Pejovic, Tanja; Peterlongo, Paolo; Pfeiler, Georg; Phelan, Catherine M.; Poole, Elizabeth M.; Pylkäs, Katri; Radice, Paolo; Rantala, Johanna; Rashid, Muhammad Usman; Rennert, Gad; Rhenius, Valerie; Rhiem, Kerstin; Risch, Harvey A.; Rodriguez, Gus; Rossing, Mary Anne; Rudolph, Anja; Salvesen, Helga B.; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schildkraut, Joellen M.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Sellers, Thomas A.; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Sieh, Weiva; Singer, Christian F.; Sinilnikova, Olga M.; Slager, Susan; Song, Honglin; Soucy, Penny; Southey, Melissa C.; Stenmark-Askmalm, Marie; Stoppa-Lyonnet, Dominique; Sutter, Christian; Swerdlow, Anthony; Tchatchou, Sandrine; Teixeira, Manuel R.; Teo, Soo H.; Terry, Kathryn L.; Terry, Mary Beth; Thomassen, Mads; Tibiletti, Maria Grazia; Tihomirova, Laima; Tognazzo, Silvia; Toland, Amanda Ewart; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Tseng, Chiu-chen; Tung, Nadine; Tworoger, Shelley S.; Vachon, Celine; van den Ouweland, Ans M. W.; van Doorn, Helena C.; van Rensburg, Elizabeth J.; Van't Veer, Laura J.; Vanderstichele, Adriaan; Vergote, Ignace; Vijai, Joseph; Wang, Qin; Wang-Gohrke, Shan; Weitzel, Jeffrey N.; Wentzensen, Nicolas; Whittemore, Alice S.; Wildiers, Hans; Winqvist, Robert; Wu, Anna H.; Yannoukakos, Drakoulis; Yoon, Sook-Yee; Yu, Jyh-Cherng; Zheng, Wei; Zheng, Ying; Khanna, Kum Kum; Simard, Jacques; Monteiro, Alvaro N.; French, Juliet D.; Couch, Fergus J.; Freedman, Matthew L.; Easton, Douglas F.; Dunning, Alison M.; Pharoah, Paul D.; Edwards, Stacey L.; Chenevix-Trench, Georgia; Antoniou, Antonis C.; Gayther, Simon A.
2016-01-01
A locus at 19p13 is associated with breast cancer (BC) and ovarian cancer (OC) risk. Here we analyse 438 SNPs in this region in 46,451 BC and 15,438 OC cases, 15,252 BRCA1 mutation carriers and 73,444 controls and identify 13 candidate causal SNPs associated with serous OC (P=9.2 × 10−20), ER-negative BC (P=1.1 × 10−13), BRCA1-associated BC (P=7.7 × 10−16) and triple negative BC (P-diff=2 × 10−5). Genotype-gene expression associations are identified for candidate target genes ANKLE1 (P=2 × 10−3) and ABHD8 (P<2 × 10−3). Chromosome conformation capture identifies interactions between four candidate SNPs and ABHD8, and luciferase assays indicate six risk alleles increased transactivation of the ADHD8 promoter. Targeted deletion of a region containing risk SNP rs56069439 in a putative enhancer induces ANKLE1 downregulation; and mRNA stability assays indicate functional effects for an ANKLE1 3′-UTR SNP. Altogether, these data suggest that multiple SNPs at 19p13 regulate ABHD8 and perhaps ANKLE1 expression, and indicate common mechanisms underlying breast and ovarian cancer risk. PMID:27601076
Lawrenson, Kate; Kar, Siddhartha; McCue, Karen; Kuchenbaeker, Karoline; Michailidou, Kyriaki; Tyrer, Jonathan; Beesley, Jonathan; Ramus, Susan J; Li, Qiyuan; Delgado, Melissa K; Lee, Janet M; Aittomäki, Kristiina; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Arun, Banu K; Arver, Brita; Bandera, Elisa V; Barile, Monica; Barkardottir, Rosa B; Barrowdale, Daniel; Beckmann, Matthias W; Benitez, Javier; Berchuck, Andrew; Bisogna, Maria; Bjorge, Line; Blomqvist, Carl; Blot, William; Bogdanova, Natalia; Bojesen, Anders; Bojesen, Stig E; Bolla, Manjeet K; Bonanni, Bernardo; Børresen-Dale, Anne-Lise; Brauch, Hiltrud; Brennan, Paul; Brenner, Hermann; Bruinsma, Fiona; Brunet, Joan; Buhari, Shaik Ahmad; Burwinkel, Barbara; Butzow, Ralf; Buys, Saundra S; Cai, Qiuyin; Caldes, Trinidad; Campbell, Ian; Canniotto, Rikki; Chang-Claude, Jenny; Chiquette, Jocelyne; Choi, Ji-Yeob; Claes, Kathleen B M; Cook, Linda S; Cox, Angela; Cramer, Daniel W; Cross, Simon S; Cybulski, Cezary; Czene, Kamila; Daly, Mary B; Damiola, Francesca; Dansonka-Mieszkowska, Agnieszka; Darabi, Hatef; Dennis, Joe; Devilee, Peter; Diez, Orland; Doherty, Jennifer A; Domchek, Susan M; Dorfling, Cecilia M; Dörk, Thilo; Dumont, Martine; Ehrencrona, Hans; Ejlertsen, Bent; Ellis, Steve; Engel, Christoph; Lee, Eunjung; Evans, D Gareth; Fasching, Peter A; Feliubadalo, Lidia; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Foretova, Lenka; Fostira, Florentia; Foulkes, William D; Fridley, Brooke L; Friedman, Eitan; Frost, Debra; Gambino, Gaetana; Ganz, Patricia A; Garber, Judy; García-Closas, Montserrat; Gentry-Maharaj, Aleksandra; Ghoussaini, Maya; Giles, Graham G; Glasspool, Rosalind; Godwin, Andrew K; Goldberg, Mark S; Goldgar, David E; González-Neira, Anna; Goode, Ellen L; Goodman, Marc T; Greene, Mark H; Gronwald, Jacek; Guénel, Pascal; Haiman, Christopher A; Hall, Per; Hallberg, Emily; Hamann, Ute; Hansen, Thomas V O; Harrington, Patricia A; Hartman, Mikael; Hassan, Norhashimah; Healey, Sue; Heitz, Florian; Herzog, Josef; Høgdall, Estrid; Høgdall, Claus K; Hogervorst, Frans B L; Hollestelle, Antoinette; Hopper, John L; Hulick, Peter J; Huzarski, Tomasz; Imyanitov, Evgeny N; Isaacs, Claudine; Ito, Hidemi; Jakubowska, Anna; Janavicius, Ramunas; Jensen, Allan; John, Esther M; Johnson, Nichola; Kabisch, Maria; Kang, Daehee; Kapuscinski, Miroslav; Karlan, Beth Y; Khan, Sofia; Kiemeney, Lambertus A; Kjaer, Susanne Kruger; Knight, Julia A; Konstantopoulou, Irene; Kosma, Veli-Matti; Kristensen, Vessela; Kupryjanczyk, Jolanta; Kwong, Ava; de la Hoya, Miguel; Laitman, Yael; Lambrechts, Diether; Le, Nhu; De Leeneer, Kim; Lester, Jenny; Levine, Douglas A; Li, Jingmei; Lindblom, Annika; Long, Jirong; Lophatananon, Artitaya; Loud, Jennifer T; Lu, Karen; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Le Marchand, Loic; Margolin, Sara; Marme, Frederik; Massuger, Leon F A G; Matsuo, Keitaro; Mazoyer, Sylvie; McGuffog, Lesley; McLean, Catriona; McNeish, Iain; Meindl, Alfons; Menon, Usha; Mensenkamp, Arjen R; Milne, Roger L; Montagna, Marco; Moysich, Kirsten B; Muir, Kenneth; Mulligan, Anna Marie; Nathanson, Katherine L; Ness, Roberta B; Neuhausen, Susan L; Nevanlinna, Heli; Nord, Silje; Nussbaum, Robert L; Odunsi, Kunle; Offit, Kenneth; Olah, Edith; Olopade, Olufunmilayo I; Olson, Janet E; Olswold, Curtis; O'Malley, David; Orlow, Irene; Orr, Nick; Osorio, Ana; Park, Sue Kyung; Pearce, Celeste L; Pejovic, Tanja; Peterlongo, Paolo; Pfeiler, Georg; Phelan, Catherine M; Poole, Elizabeth M; Pylkäs, Katri; Radice, Paolo; Rantala, Johanna; Rashid, Muhammad Usman; Rennert, Gad; Rhenius, Valerie; Rhiem, Kerstin; Risch, Harvey A; Rodriguez, Gus; Rossing, Mary Anne; Rudolph, Anja; Salvesen, Helga B; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schildkraut, Joellen M; Schmidt, Marjanka K; Schmutzler, Rita K; Sellers, Thomas A; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Sieh, Weiva; Singer, Christian F; Sinilnikova, Olga M; Slager, Susan; Song, Honglin; Soucy, Penny; Southey, Melissa C; Stenmark-Askmalm, Marie; Stoppa-Lyonnet, Dominique; Sutter, Christian; Swerdlow, Anthony; Tchatchou, Sandrine; Teixeira, Manuel R; Teo, Soo H; Terry, Kathryn L; Terry, Mary Beth; Thomassen, Mads; Tibiletti, Maria Grazia; Tihomirova, Laima; Tognazzo, Silvia; Toland, Amanda Ewart; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Tseng, Chiu-Chen; Tung, Nadine; Tworoger, Shelley S; Vachon, Celine; van den Ouweland, Ans M W; van Doorn, Helena C; van Rensburg, Elizabeth J; Van't Veer, Laura J; Vanderstichele, Adriaan; Vergote, Ignace; Vijai, Joseph; Wang, Qin; Wang-Gohrke, Shan; Weitzel, Jeffrey N; Wentzensen, Nicolas; Whittemore, Alice S; Wildiers, Hans; Winqvist, Robert; Wu, Anna H; Yannoukakos, Drakoulis; Yoon, Sook-Yee; Yu, Jyh-Cherng; Zheng, Wei; Zheng, Ying; Khanna, Kum Kum; Simard, Jacques; Monteiro, Alvaro N; French, Juliet D; Couch, Fergus J; Freedman, Matthew L; Easton, Douglas F; Dunning, Alison M; Pharoah, Paul D; Edwards, Stacey L; Chenevix-Trench, Georgia; Antoniou, Antonis C; Gayther, Simon A
2016-09-07
A locus at 19p13 is associated with breast cancer (BC) and ovarian cancer (OC) risk. Here we analyse 438 SNPs in this region in 46,451 BC and 15,438 OC cases, 15,252 BRCA1 mutation carriers and 73,444 controls and identify 13 candidate causal SNPs associated with serous OC (P=9.2 × 10(-20)), ER-negative BC (P=1.1 × 10(-13)), BRCA1-associated BC (P=7.7 × 10(-16)) and triple negative BC (P-diff=2 × 10(-5)). Genotype-gene expression associations are identified for candidate target genes ANKLE1 (P=2 × 10(-3)) and ABHD8 (P<2 × 10(-3)). Chromosome conformation capture identifies interactions between four candidate SNPs and ABHD8, and luciferase assays indicate six risk alleles increased transactivation of the ADHD8 promoter. Targeted deletion of a region containing risk SNP rs56069439 in a putative enhancer induces ANKLE1 downregulation; and mRNA stability assays indicate functional effects for an ANKLE1 3'-UTR SNP. Altogether, these data suggest that multiple SNPs at 19p13 regulate ABHD8 and perhaps ANKLE1 expression, and indicate common mechanisms underlying breast and ovarian cancer risk.
Discovery, Validation and Characterization of 1039 Cattle Single Nucleotide Polymorphisms
USDA-ARS?s Scientific Manuscript database
We identified approximately 13000 putative single nucleotide polymorphisms (SNPs) by comparison of repeat-masked BAC-end sequences from the cattle RPCI-42 BAC library with whole-genome shotgun contigs of cattle genome assembly Btau 1.0. Genotyping of a subset of these SNPs was performed on a panel ...
USDA-ARS?s Scientific Manuscript database
Analysis of gene polymorphisms and disease association is essential for assessing putative candidate genes affecting susceptibility or resistance to disease. In this paper, we report the results of an association analysis between SNPs in common carp innate immune response genes and resistance to Cy...
Hamilton, Natasha A; Tammen, Imke; Raadsma, Herman W
2013-01-01
Angiotensin converting enzyme (ACE) is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D) polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs) discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism.
Hamilton, Natasha A.; Tammen, Imke; Raadsma, Herman W.
2013-01-01
Angiotensin converting enzyme (ACE) is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D) polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs) discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism. PMID:23408978
Bayer, Arnold S; Mishra, Nagendra N; Chen, Liang; Kreiswirth, Barry N; Rubio, Aileen; Yang, Soo-Jin
2015-08-01
MprF is responsible for the lysinylation of phosphatidylglycerol (PG) to synthesize the positively charged phospholipid (PL) species, lysyl-PG (L-PG). It has been proposed that the single-nucleotide polymorphisms (SNPs) within the mprF open reading frame (ORF) are associated with a gain-in-function phenotype in terms of daptomycin resistance in Staphylococcus aureus. (Note that although the official term is daptomycin nonsusceptibility, we use the term daptomycin resistance in this paper for ease of presentation.) Using 22 daptomycin-susceptible (DAP(s))/daptomycin-resistant (DAP(r)) clinical methicillin-resistant S. aureus (MRSA) strain pairs, we assessed (i) the frequencies and distribution of putative mprF gain-in-function SNPs, (ii) the relationships of the SNPs to both daptomycin resistance and cross-resistance to the prototypical endovascular host defense peptide (HDP) thrombin-induced platelet microbicidal protein (tPMP), and (iii) the impact of mprF SNPs on positive surface charge phenotype and modifications of membrane PL profiles. Most of the mprF SNPs identified in our DAP(r) strains were clustered within the two MprF loci, (i) the central bifunctional domain and (ii) the C-terminal synthase domain. Moreover, we were able to correlate the presence and location of mprF SNPs in DAP(r) strains with HDP cross-resistance, positive surface charge, and L-PG profiles. Although DAP(r) strains with mprF SNPs in the bifunctional domain showed higher resistance to tPMPs than DAP(r) strains with SNPs in the synthase domain, this relationship was not observed in positive surface charge assays. These results demonstrated that both charge-mediated and -unrelated mechanisms are involved in DAP resistance and HDP cross-resistance in S. aureus. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Katz, Lee S.; Sharma, Nitya V.; Harcourt, Brian H.; Thomas, Jennifer Dolan; Wang, Xin; Mayer, Leonard W.; Jordan, I. King
2011-01-01
Neisseria meningitidis is one of the main agents of bacterial meningitis, causing substantial morbidity and mortality worldwide. However, most of the time N. meningitidis is carried as a commensal not associated with invasive disease. The genomic basis of the difference between disease-associated and carried isolates of N. meningitidis may provide critical insight into mechanisms of virulence, yet it has remained elusive. Here, we have taken a comparative genomics approach to interrogate the difference between disease-associated and carried isolates of N. meningitidis at the level of individual nucleotide variations (i.e., single nucleotide polymorphisms [SNPs]). We aligned complete genome sequences of 8 disease-associated and 4 carried isolates of N. meningitidis to search for SNPs that show mutually exclusive patterns of variation between the two groups. We found 63 SNPs that distinguish the 8 disease-associated genomes from the 4 carried genomes of N. meningitidis, which is far more than can be expected by chance alone given the level of nucleotide variation among the genomes. The putative list of SNPs that discriminate between disease-associated and carriage genomes may be expected to change with increased sampling or changes in the identities of the isolates being compared. Nevertheless, we show that these discriminating SNPs are more likely to reflect phenotypic differences than shared evolutionary history. Discriminating SNPs were mapped to genes, and the functions of the genes were evaluated for possible connections to virulence mechanisms. A number of overrepresented functional categories related to virulence were uncovered among SNP-associated genes, including genes related to the category “symbiosis, encompassing mutualism through parasitism.” PMID:21622743
Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing
2014-01-01
Background Genetic linkage maps are useful tools for mapping quantitative trait loci (QTL) influencing variation in traits of interest in a population. Genotyping-by-sequencing approaches such as Restriction-site Associated DNA sequencing (RAD-Seq) now enable the rapid discovery and genotyping of genome-wide SNP markers suitable for the development of dense SNP linkage maps, including in non-model organisms such as Atlantic salmon (Salmo salar). This paper describes the development and characterisation of a high density SNP linkage map based on SbfI RAD-Seq SNP markers from two Atlantic salmon reference families. Results Approximately 6,000 SNPs were assigned to 29 linkage groups, utilising markers from known genomic locations as anchors. Linkage maps were then constructed for the four mapping parents separately. Overall map lengths were comparable between male and female parents, but the distribution of the SNPs showed sex-specific patterns with a greater degree of clustering of sire-segregating SNPs to single chromosome regions. The maps were integrated with the Atlantic salmon draft reference genome contigs, allowing the unique assignment of ~4,000 contigs to a linkage group. 112 genome contigs mapped to two or more linkage groups, highlighting regions of putative homeology within the salmon genome. A comparative genomics analysis with the stickleback reference genome identified putative genes closely linked to approximately half of the ordered SNPs and demonstrated blocks of orthology between the Atlantic salmon and stickleback genomes. A subset of 47 RAD-Seq SNPs were successfully validated using a high-throughput genotyping assay, with a correspondence of 97% between the two assays. Conclusions This Atlantic salmon RAD-Seq linkage map is a resource for salmonid genomics research as genotyping-by-sequencing becomes increasingly common. This is aided by the integration of the SbfI RAD-Seq SNPs with existing reference maps and the draft reference genome, as well as the identification of putative genes proximal to the SNPs. Differences in the distribution of recombination events between the sexes is evident, and regions of homeology have been identified which are reflective of the recent salmonid whole genome duplication. PMID:24571138
Pardo, Belén G; Fernández, Carlos; Millán, Adrián; Bouza, Carmen; Vázquez-López, Araceli; Vera, Manuel; Alvarez-Dios, José A; Calaza, Manuel; Gómez-Tato, Antonio; Vázquez, María; Cabaleiro, Santiago; Magariños, Beatriz; Lemos, Manuel L; Leiro, José M; Martínez, Paulino
2008-01-01
Background The turbot (Scophthalmus maximus; Scophthalmidae; Pleuronectiformes) is a flatfish species of great relevance for marine aquaculture in Europe. In contrast to other cultured flatfish, very few genomic resources are available in this species. Aeromonas salmonicida and Philasterides dicentrarchi are two pathogens that affect turbot culture causing serious economic losses to the turbot industry. Little is known about the molecular mechanisms for disease resistance and host-pathogen interactions in this species. In this work, thousands of ESTs for functional genomic studies and potential markers linked to ESTs for mapping (microsatellites and single nucleotide polymorphisms (SNPs)) are provided. This information enabled us to obtain a preliminary view of regulated genes in response to these pathogens and it constitutes the basis for subsequent and more accurate microarray analysis. Results A total of 12584 cDNAs partially sequenced from three different cDNA libraries of turbot (Scophthalmus maximus) infected with Aeromonas salmonicida, Philasterides dicentrarchi and from healthy fish were analyzed. Three immune-relevant tissues (liver, spleen and head kidney) were sampled at several time points in the infection process for library construction. The sequences were processed into 9256 high-quality sequences, which constituted the source for the turbot EST database. Clustering and assembly of these sequences, revealed 3482 different putative transcripts, 1073 contigs and 2409 singletons. BLAST searches with public databases detected significant similarity (e-value ≤ 1e-5) in 1766 (50.7%) sequences and 816 of them (23.4%) could be functionally annotated. Two hundred three of these genes (24.9%), encoding for defence/immune-related proteins, were mostly identified for the first time in turbot. Some ESTs showed significant differences in the number of transcripts when comparing the three libraries, suggesting regulation in response to these pathogens. A total of 191 microsatellites, with 104 having sufficient flanking sequences for primer design, and 1158 putative SNPs were identified from these EST resources in turbot. Conclusion A collection of 9256 high-quality ESTs was generated representing 3482 unique turbot sequences. A large proportion of defence/immune-related genes were identified, many of them regulated in response to specific pathogens. Putative microsatellites and SNPs were identified. These genome resources constitute the basis to develop a microarray for functional genomics studies and marker validation for genetic linkage and QTL analysis in turbot. PMID:18817567
Association Studies of 22 Candidate SNPs with Late-Onset Alzheimer's Disease
Figgins, Jessica A.; Minster, Ryan L.; Demirci, F. Yesim; DeKosky, Steven T.; Kamboh, M. Ilyas
2009-01-01
Alzheimer's disease (AD) is a complex and multifactorial disease with the possible involvement of several genes. With the exception of the APOE gene as a susceptibility marker, no other genes have been shown consistently to be associated with late-onset AD (LOAD). A recent genome-wide association study of 17,343 gene-based putative functional single nucleotide polymorphisms (SNPs) found 19 significant variants, including 3 linked to APOE, showing association with LOAD (Hum Mol Genet 2007; 16:865–873). We have set out to replicate the 16 new significant associations in a large case-control cohort of American Whites. Additionally, we examined six variants present in positional and/or biological candidate genes for AD. We genotyped the 22 SNPs in up to 1,009 Caucasian Americans with LOAD and up to 1,010 age-matched healthy Caucasian Americans, using 5′ nuclease assays. We did not observe a statistically significant association between the SNPs and the risk of AD, either individually or stratified by APOE. Our data suggest that the association of the studied variants with LOAD risk, if it exists, is not statistically significant in our sample. PMID:18780302
Single nucleotide polymorphism discovery in bovine liver using RNA-seq technology.
Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Pierzchała, Mariusz; Feng, Yaping; Kadarmideen, Haja N; Kumar, Dibyendu
2017-01-01
RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver tissue of young bulls of the Polish Red, Polish Holstein-Friesian (HF) and Hereford breeds, and to understand the genomic variation in the three cattle breeds that may reflect differences in production traits. The RNA-seq experiment on bovine liver produced 107,114,4072 raw paired-end reads, with an average of approximately 60 million paired-end reads per library. Breed-wise, a total of 345.06, 290.04 and 436.03 million paired-end reads were obtained from the Polish Red, Polish HF, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed that 81.35%, 82.81% and 84.21% of the mapped sequencing reads were properly paired to the Polish Red, Polish HF, and Hereford breeds, respectively. This study identified 5,641,401 SNPs and insertion and deletion (indel) positions expressed in the bovine liver with an average of 313,411 SNPs and indel per young bull. Following the removal of the indel mutations, a total of 195,3804, 152,7120 and 205,3184 raw SNPs expressed in bovine liver were identified for the Polish Red, Polish HF, and Hereford breeds, respectively. Breed-wise, three highly reliable breed-specific SNP-databases (SNP-dbs) with 31,562, 24,945 and 28,194 SNP records were constructed for the Polish Red, Polish HF, and Hereford breeds, respectively. Using a combination of stringent parameters of a minimum depth of ≥10 mapping reads that support the polymorphic nucleotide base and 100% SNP ratio, 4,368, 3,780 and 3,800 SNP records were detected in the Polish Red, Polish HF, and Hereford breeds, respectively. The SNP detections using RNA-seq data were successfully validated by kompetitive allele-specific PCR (KASPTM) SNP genotyping assay. The comprehensive QTL/CG analysis of 110 QTL/CG with RNA-seq data identified 20 monomorphic SNP hit loci (CARTPT, GAD1, GDF5, GHRH, GHRL, GRB10, IGFBPL1, IGFL1, LEP, LHX4, MC4R, MSTN, NKAIN1, PLAG1, POU1F1, SDR16C5, SH2B2, TOX, UCP3 and WNT10B) in all three cattle breeds. However, six SNP loci (CCSER1, GHR, KCNIP4, MTSS1, EGFR and NSMCE2) were identified as highly polymorphic among the cattle breeds. This study identified breed-specific SNPs with greater SNP ratio and excellent mapping coverage, as well as monomorphic and highly polymorphic putative SNP loci within QTL/CGs of bovine liver tissue. A breed-specific SNP-db constructed for bovine liver yielded nearly six million SNPs. In addition, a KASPTM SNP genotyping assay, as a reliable cost-effective method, successfully validated the breed-specific putative SNPs originating from the RNA-seq experiments.
Yucesoy, Berran; Kaufman, Kenneth M.; Lummus, Zana L.; Weirauch, Matthew T.; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M.; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B.; Bernstein, David I.
2015-01-01
Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10−14). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10−9 and rs2514805, p = 1.22 × 10−8, respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10−6). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies. PMID:25918132
Natarajan, Sathishkumar; Kim, Hoy-Taek; Thamilarasan, Senthil Kumar; Veerappan, Karpagam; Park, Jong-In; Nou, Ill-Sup
2016-01-01
Powdery mildew is one of the most common fungal diseases in the world. This disease frequently affects melon (Cucumis melo L.) and other Cucurbitaceous family crops in both open field and greenhouse cultivation. One of the goals of genomics is to identify the polymorphic loci responsible for variation in phenotypic traits. In this study, powdery mildew disease assessment scores were calculated for four melon accessions, 'SCNU1154', 'Edisto47', 'MR-1', and 'PMR5'. To investigate the genetic variation of these accessions, whole genome re-sequencing using the Illumina HiSeq 2000 platform was performed. A total of 754,759,704 quality-filtered reads were generated, with an average of 82.64% coverage relative to the reference genome. Comparisons of the sequences for the melon accessions revealed around 7.4 million single nucleotide polymorphisms (SNPs), 1.9 million InDels, and 182,398 putative structural variations (SVs). Functional enrichment analysis of detected variations classified them into biological process, cellular component and molecular function categories. Further, a disease-associated QTL map was constructed for 390 SNPs and 45 InDels identified as related to defense-response genes. Among them 112 SNPs and 12 InDels were observed in powdery mildew responsive chromosomes. Accordingly, this whole genome re-sequencing study identified SNPs and InDels associated with defense genes that will serve as candidate polymorphisms in the search for sources of resistance against powdery mildew disease and could accelerate marker-assisted breeding in melon.
Windelinckx, An; De Mars, Gunther; Huygens, Wim; Peeters, Maarten W.; Vincent, Barbara; Wijmenga, Cisca; Lambrechts, Diether; Aerssens, Jeroen; Vlietinck, Robert; Beunen, Gaston
2011-01-01
Muscle strength is an important determinant in elite sports performance as well as in the activities of daily living. Muscle metabolism also plays a role in the genesis, and therefore prevention, of common pathological conditions and chronic diseases. Even though heritability estimates between 31 and 78% suggest a significant genetic component in muscle strength, only a limited number of genes influencing muscle strength have been identified. This study aimed to identify and prioritize positional candidate genes within a skeletal muscle strength quantitative trait locus on chromosome 12q22-23 for follow-up. A two-staged gene-centered fine-mapping approach using 122 single nucleotide polymorphisms (SNPs) in stage 1 identified a familybased association (n = 500) between several tagSNPs located in the ATPase, Ca2+ transporting, cardiac muscle, slow twitch 2 (ATP2A2; rs3026468), the NUAK family, SNF1-like kinase, 1 (NUAK1; rs10861553 and rs3741886), and the protein phosphatase 1, catalytic subunit, gamma isoform (PPP1CC; rs1050587 and rs7901769) genes and knee torque production (P values up to 0.00092). In stage 2, family-based association tests on additional putatively functional SNPs (e.g., exonic SNPs, SNPs in transcription factor binding sites or in conserved regions) in an enlarged sample (n = 536; 464 individuals overlap with stage 1) did not identify additional associations with muscle strength characteristics. Further in-depth analyses will be necessary to elucidate the exact role of ATP2A2, PPP1CC, and NUAK1 in muscle strength and to find out which functional polymorphisms are at the base of the interindividual strength differences. PMID:21750233
Diopere, Eveline; Hellemans, Bart; Volckaert, Filip A M; Maes, Gregory E
2013-03-01
Genomic methodologies applied in evolutionary and fisheries research have been of great benefit to understand the marine ecosystem and the management of natural resources. Although single nucleotide polymorphisms (SNPs) are attractive for the study of local adaptation, spatial stock management and traceability, and investigating the effects of fisheries-induced selection, they have rarely been exploited in non-model organisms. This is partly due to difficulties in finding and validating SNPs in species with limited or no genomic resources. Complementary to random genome-scan approaches, a targeted candidate gene approach has the potential to unveil pre-selected functional diversity and provides more in depth information on the action of selection at specific genes. For example genes can be under selective pressure due to climate change and sustained periods of heavy fishing pressure. In this study, we applied a candidate gene approach in sole (Solea solea L.), an important member of the demersal ecosystem. As consumption flatfish it is heavy exploited and has experienced associated life-history changes over the last 60years. To discover novel genetic polymorphisms in or around genes linked to important life history traits in sole, we screened a total of 76 candidate genes related to growth and maturation using a targeted resequencing approach. We identified in total 86 putative SNPs in 22 genes and validated 29 SNPs using a multiplex single-base extension genotyping assay. We found 22 informative SNPs, of which two represent non-synonymous mutations, potentially of functional relevance. These novel markers should be rapidly and broadly applicable in analyses of natural sole populations, as a measure of the evolutionary signature of overfishing and for initiatives on marker assisted selection. Copyright © 2012 Elsevier B.V. All rights reserved.
Development of a set of SNP markers present in expressed genes of the apple.
Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S
2008-11-01
Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
Law, Amanda J.; Lipska, Barbara K.; Weickert, Cynthia Shannon; Hyde, Thomas M.; Straub, Richard E.; Hashimoto, Ryota; Harrison, Paul J.; Kleinman, Joel E.; Weinberger, Daniel R.
2006-01-01
Genetic variation in neuregulin 1 (NRG1) is associated with schizophrenia. The disease-associated SNPs are noncoding, and their functional implications remain unknown. We hypothesized that differential expression of the NRG1 gene explains its association to the disease. We examined four of the disease-associated SNPs that make up the original risk haplotype in the 5′ upstream region of the gene for their effects on mRNA abundance of NRG1 types I–IV in human postmortem hippocampus. Diagnostic comparisons revealed a 34% increase in type I mRNA in schizophrenia and an interaction of diagnosis and genotype (SNP8NRG221132) on this transcript. Of potentially greater interest, a single SNP within the risk haplotype (SNP8NRG243177) and a 22-kb block of this core haplotype are associated with mRNA expression for the novel type IV isoform in patients and controls. Bioinformatic promoter analyses indicate that both SNPs lead to a gain/loss of putative binding sites for three transcription factors, serum response factor, myelin transcription factor-1, and High Mobility Group Box Protein-1. These data implicate variation in isoform expression as a molecular mechanism for the genetic association of NRG1 with schizophrenia. PMID:16618933
Single nucleotide polymorphism discovery in bovine liver using RNA-seq technology
Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Pierzchała, Mariusz; Feng, Yaping; Kadarmideen, Haja N.; Kumar, Dibyendu
2017-01-01
Background RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver tissue of young bulls of the Polish Red, Polish Holstein-Friesian (HF) and Hereford breeds, and to understand the genomic variation in the three cattle breeds that may reflect differences in production traits. Results The RNA-seq experiment on bovine liver produced 107,114,4072 raw paired-end reads, with an average of approximately 60 million paired-end reads per library. Breed-wise, a total of 345.06, 290.04 and 436.03 million paired-end reads were obtained from the Polish Red, Polish HF, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed that 81.35%, 82.81% and 84.21% of the mapped sequencing reads were properly paired to the Polish Red, Polish HF, and Hereford breeds, respectively. This study identified 5,641,401 SNPs and insertion and deletion (indel) positions expressed in the bovine liver with an average of 313,411 SNPs and indel per young bull. Following the removal of the indel mutations, a total of 195,3804, 152,7120 and 205,3184 raw SNPs expressed in bovine liver were identified for the Polish Red, Polish HF, and Hereford breeds, respectively. Breed-wise, three highly reliable breed-specific SNP-databases (SNP-dbs) with 31,562, 24,945 and 28,194 SNP records were constructed for the Polish Red, Polish HF, and Hereford breeds, respectively. Using a combination of stringent parameters of a minimum depth of ≥10 mapping reads that support the polymorphic nucleotide base and 100% SNP ratio, 4,368, 3,780 and 3,800 SNP records were detected in the Polish Red, Polish HF, and Hereford breeds, respectively. The SNP detections using RNA-seq data were successfully validated by kompetitive allele-specific PCR (KASPTM) SNP genotyping assay. The comprehensive QTL/CG analysis of 110 QTL/CG with RNA-seq data identified 20 monomorphic SNP hit loci (CARTPT, GAD1, GDF5, GHRH, GHRL, GRB10, IGFBPL1, IGFL1, LEP, LHX4, MC4R, MSTN, NKAIN1, PLAG1, POU1F1, SDR16C5, SH2B2, TOX, UCP3 and WNT10B) in all three cattle breeds. However, six SNP loci (CCSER1, GHR, KCNIP4, MTSS1, EGFR and NSMCE2) were identified as highly polymorphic among the cattle breeds. Conclusions This study identified breed-specific SNPs with greater SNP ratio and excellent mapping coverage, as well as monomorphic and highly polymorphic putative SNP loci within QTL/CGs of bovine liver tissue. A breed-specific SNP-db constructed for bovine liver yielded nearly six million SNPs. In addition, a KASPTM SNP genotyping assay, as a reliable cost-effective method, successfully validated the breed-specific putative SNPs originating from the RNA-seq experiments. PMID:28234981
Effects of GWAS-Associated Genetic Variants on lncRNAs within IBD and T1D Candidate Loci
Brorsson, Caroline A.; Pociot, Flemming
2014-01-01
Long non-coding RNAs are a new class of non-coding RNAs that are at the crosshairs in many human diseases such as cancers, cardiovascular disorders, inflammatory and autoimmune disease like Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/−5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable information that could be potentially useful for future structure-function studies on lncRNAs. PMID:25144376
Walsh, Kyle M; Anderson, Erik; Hansen, Helen M; Decker, Paul A; Kosel, Matt L; Kollmeyer, Thomas; Rice, Terri; Zheng, Shichun; Xiao, Yuanyuan; Chang, Jeffrey S; McCoy, Lucie S; Bracci, Paige M; Wiemels, Joe L; Pico, Alexander R; Smirnov, Ivan; Lachance, Daniel H; Sicotte, Hugues; Eckel-Passow, Jeanette E; Wiencke, John K; Jenkins, Robert B; Wrensch, Margaret R
2013-02-01
Genomewide association studies (GWAS) and candidate-gene studies have implicated single-nucleotide polymorphisms (SNPs) in at least 45 different genes as putative glioma risk factors. Attempts to validate these associations have yielded variable results and few genetic risk factors have been consistently replicated. We conducted a case-control study of Caucasian glioma cases and controls from the University of California San Francisco (810 cases, 512 controls) and the Mayo Clinic (852 cases, 789 controls) in an attempt to replicate previously reported genetic risk factors for glioma. Sixty SNPs selected from the literature (eight from GWAS and 52 from candidate-gene studies) were successfully genotyped on an Illumina custom genotyping panel. Eight SNPs in/near seven different genes (TERT, EGFR, CCDC26, CDKN2A, PHLDB1, RTEL1, TP53) were significantly associated with glioma risk in the combined dataset (P < 0.05), with all associations in the same direction as in previous reports. Several SNP associations showed considerable differences across histologic subtype. All eight successfully replicated associations were first identified by GWAS, although none of the putative risk SNPs from candidate-gene studies was associated in the full case-control sample (all P values > 0.05). Although several confirmed associations are located near genes long known to be involved in gliomagenesis (e.g., EGFR, CDKN2A, TP53), these associations were first discovered by the GWAS approach and are in noncoding regions. These results highlight that the deficiencies of the candidate-gene approach lay in selecting both appropriate genes and relevant SNPs within these genes. © 2012 WILEY PERIODICALS, INC.
Batista, Philip D; Janes, Jasmine K; Boone, Celia K; Murray, Brent W; Sperling, Felix A H
2016-09-01
Assessments of population genetic structure and demographic history have traditionally been based on neutral markers while explicitly excluding adaptive markers. In this study, we compared the utility of putatively adaptive and neutral single-nucleotide polymorphisms (SNPs) for inferring mountain pine beetle population structure across its geographic range. Both adaptive and neutral SNPs, and their combination, allowed range-wide structure to be distinguished and delimited a population that has recently undergone range expansion across northern British Columbia and Alberta. Using an equal number of both adaptive and neutral SNPs revealed that adaptive SNPs resulted in a stronger correlation between sampled populations and inferred clustering. Our results suggest that adaptive SNPs should not be excluded prior to analysis from neutral SNPs as a combination of both marker sets resulted in better resolution of genetic differentiation between populations than either marker set alone. These results demonstrate the utility of adaptive loci for resolving population genetic structure in a nonmodel organism.
2013-01-01
Background Obesity, excess fat tissue in the body, can underlie a variety of medical complaints including heart disease, stroke and cancer. The pig is an excellent model organism for the study of various human disorders, including obesity, as well as being the foremost agricultural species. In order to identify genetic variants associated with fatness, we used a selective genomic approach sampling DNA from animals at the extreme ends of the fat and lean spectrum using estimated breeding values derived from a total population size of over 70,000 animals. DNA from 3 breeds (Sire Line Large White, Duroc and a white Pietrain composite line (Titan)) was used to interrogate the Illumina Porcine SNP60 Genotyping Beadchip in order to identify significant associations in terms of single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Results By sampling animals at each end of the fat/lean EBV (estimate breeding value) spectrum the whole population could be assessed using less than 300 animals, without losing statistical power. Indeed, several significant SNPs (at the 5% genome wide significance level) were discovered, 4 of these linked to genes with ontologies that had previously been correlated with fatness (NTS, FABP6, SST and NR3C2). Quantitative analysis of the data identified putative CNV regions containing genes whose ontology suggested fatness related functions (MCHR1, PPARα, SLC5A1 and SLC5A4). Conclusions Selective genotyping of EBVs at either end of the phenotypic spectrum proved to be a cost effective means of identifying SNPs and CNVs associated with fatness and with estimated major effects in a large population of animals. PMID:24225222
Mercenaro, Luca; Nieddu, Giovanni; Porceddu, Andrea; Pezzotti, Mario; Camiolo, Salvatore
2017-01-01
The genetic diversity among grapevine (Vitis vinifera L.) cultivars that underlies differences in agronomic performance and wine quality reflects the accumulation of single nucleotide polymorphisms (SNPs) and small indels as well as larger genomic variations. A combination of high throughput sequencing and mapping against the grapevine reference genome allows the creation of comprehensive sequence variation maps. We used next generation sequencing and bioinformatics to generate an inventory of SNPs and small indels in four widely cultivated Sardinian grape cultivars (Bovale sardo, Cannonau, Carignano and Vermentino). More than 3,200,000 SNPs were identified with high statistical confidence. Some of the SNPs caused the appearance of premature stop codons and thus identified putative pseudogenes. The analysis of SNP distribution along chromosomes led to the identification of large genomic regions with uninterrupted series of homozygous SNPs. We used a digital comparative genomic hybridization approach to identify 6526 genomic regions with significant differences in copy number among the four cultivars compared to the reference sequence, including 81 regions shared between all four cultivars and 4953 specific to single cultivars (representing 1.2 and 75.9% of total copy number variation, respectively). Reads mapping at a distance that was not compatible with the insert size were used to identify a dataset of putative large deletions with cultivar Cannonau revealing the highest number. The analysis of genes mapping to these regions provided a list of candidates that may explain some of the phenotypic differences among the Bovale sardo, Cannonau, Carignano and Vermentino cultivars. PMID:28775732
Calvo, J H; Serrano, M; Martinez-Royo, A; Lahoz, B; Sarto, P; Ibañez-Deler, A; Folch, J; Alabart, J L
2018-06-01
The aim of this study was to characterize and identify causative SNPs in the MTNR1A gene responsible for the reproductive seasonality traits in the Rasa aragonesa sheep breed. A total of 290 ewes (155, 84 and 51 mature, young and ewe lambs, respectively) from one flock were controlled from January to August. The following three reproductive seasonality traits were considered: the total days of anoestrus (TDA) and the progesterone cycling months (P4CM); both ovarian function seasonality traits based on blood progesterone levels; and the oestrus cycling months (OCM) based on oestrous detection, which indicate behavioural signs of oestrous. We have sequenced the total coding region plus 733 and 251 bp from the promoter and 3'-UTR regions, respectively, from the gene in 268 ewes. We found 9 and 4 SNPs associated with seasonality traits in the promoter (for TDA and P4CM) and exon 2 (for the three traits), respectively. The SNPs located in the gene promoter modify the putative binding sites for various trans-acting factors. In exon 2, two synonymous SNPs affect RFLP sites, rs406779174/RsaI (for the three traits) and rs430181568/MnlI (for OCM), and they have been related with seasonal reproductive activity in previous association studies with other breeds. SNP rs400830807, which is located in the 3'-UTR, was associated with the three traits, but this did not modify the putative target sites for ovine miRNAs according to in silico predictions. Finally, the SNP rs403212791 (NW_014639035.1: g.15099004G > A), which is also associated with the three seasonality phenotypes, was the most significant SNP detected in this study and was a non-synonymous polymorphism, leading a change from an Arginine to a Cysteine (R336C). Haplotype analyses confirmed the association results and showed that the effects found for the seasonality traits were caused by the SNPs located in exon 2. We have demonstrated that the T allele in the SNP rs403212791 in the MNTR1A gene is associated with a lower TDA and higher P4CM and OCM values in the Rasa Aragonesa breed. Copyright © 2018 Elsevier Inc. All rights reserved.
Han, Ying; Hazelett, Dennis J.; Wiklund, Fredrik; Schumacher, Fredrick R.; Stram, Daniel O.; Berndt, Sonja I.; Wang, Zhaoming; Rand, Kristin A.; Hoover, Robert N.; Machiela, Mitchell J.; Yeager, Merideth; Burdette, Laurie; Chung, Charles C.; Hutchinson, Amy; Yu, Kai; Xu, Jianfeng; Travis, Ruth C.; Key, Timothy J.; Siddiq, Afshan; Canzian, Federico; Takahashi, Atsushi; Kubo, Michiaki; Stanford, Janet L.; Kolb, Suzanne; Gapstur, Susan M.; Diver, W. Ryan; Stevens, Victoria L.; Strom, Sara S.; Pettaway, Curtis A.; Al Olama, Ali Amin; Kote-Jarai, Zsofia; Eeles, Rosalind A.; Yeboah, Edward D.; Tettey, Yao; Biritwum, Richard B.; Adjei, Andrew A.; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P.; Isaacs, William B.; Chen, Constance; Lindstrom, Sara; Le Marchand, Loic; Giovannucci, Edward L.; Pomerantz, Mark; Long, Henry; Li, Fugen; Ma, Jing; Stampfer, Meir; John, Esther M.; Ingles, Sue A.; Kittles, Rick A.; Murphy, Adam B.; Blot, William J.; Signorello, Lisa B.; Zheng, Wei; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Nemesure, Barbara; Carpten, John; Leske, M. Cristina; Wu, Suh-Yuh; Hennis, Anselm J. M.; Rybicki, Benjamin A.; Neslund-Dudas, Christine; Hsing, Ann W.; Chu, Lisa; Goodman, Phyllis J.; Klein, Eric A.; Zheng, S. Lilly; Witte, John S.; Casey, Graham; Riboli, Elio; Li, Qiyuan; Freedman, Matthew L.; Hunter, David J.; Gronberg, Henrik; Cook, Michael B.; Nakagawa, Hidewaki; Kraft, Peter; Chanock, Stephen J.; Easton, Douglas F.; Henderson, Brian E.; Coetzee, Gerhard A.; Conti, David V.; Haiman, Christopher A.
2015-01-01
Interpretation of biological mechanisms underlying genetic risk associations for prostate cancer is complicated by the relatively large number of risk variants (n = 100) and the thousands of surrogate SNPs in linkage disequilibrium. Here, we combined three distinct approaches: multiethnic fine-mapping, putative functional annotation (based upon epigenetic data and genome-encoded features), and expression quantitative trait loci (eQTL) analyses, in an attempt to reduce this complexity. We examined 67 risk regions using genotyping and imputation-based fine-mapping in populations of European (cases/controls: 8600/6946), African (cases/controls: 5327/5136), Japanese (cases/controls: 2563/4391) and Latino (cases/controls: 1034/1046) ancestry. Markers at 55 regions passed a region-specific significance threshold (P-value cutoff range: 3.9 × 10−4–5.6 × 10−3) and in 30 regions we identified markers that were more significantly associated with risk than the previously reported variants in the multiethnic sample. Novel secondary signals (P < 5.0 × 10−6) were also detected in two regions (rs13062436/3q21 and rs17181170/3p12). Among 666 variants in the 55 regions with P-values within one order of magnitude of the most-associated marker, 193 variants (29%) in 48 regions overlapped with epigenetic or other putative functional marks. In 11 of the 55 regions, cis-eQTLs were detected with nearby genes. For 12 of the 55 regions (22%), the most significant region-specific, prostate-cancer associated variant represented the strongest candidate functional variant based on our annotations; the number of regions increased to 20 (36%) and 27 (49%) when examining the 2 and 3 most significantly associated variants in each region, respectively. These results have prioritized subsets of candidate variants for downstream functional evaluation. PMID:26162851
Han, Ying; Hazelett, Dennis J; Wiklund, Fredrik; Schumacher, Fredrick R; Stram, Daniel O; Berndt, Sonja I; Wang, Zhaoming; Rand, Kristin A; Hoover, Robert N; Machiela, Mitchell J; Yeager, Merideth; Burdette, Laurie; Chung, Charles C; Hutchinson, Amy; Yu, Kai; Xu, Jianfeng; Travis, Ruth C; Key, Timothy J; Siddiq, Afshan; Canzian, Federico; Takahashi, Atsushi; Kubo, Michiaki; Stanford, Janet L; Kolb, Suzanne; Gapstur, Susan M; Diver, W Ryan; Stevens, Victoria L; Strom, Sara S; Pettaway, Curtis A; Al Olama, Ali Amin; Kote-Jarai, Zsofia; Eeles, Rosalind A; Yeboah, Edward D; Tettey, Yao; Biritwum, Richard B; Adjei, Andrew A; Tay, Evelyn; Truelove, Ann; Niwa, Shelley; Chokkalingam, Anand P; Isaacs, William B; Chen, Constance; Lindstrom, Sara; Le Marchand, Loic; Giovannucci, Edward L; Pomerantz, Mark; Long, Henry; Li, Fugen; Ma, Jing; Stampfer, Meir; John, Esther M; Ingles, Sue A; Kittles, Rick A; Murphy, Adam B; Blot, William J; Signorello, Lisa B; Zheng, Wei; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Nemesure, Barbara; Carpten, John; Leske, M Cristina; Wu, Suh-Yuh; Hennis, Anselm J M; Rybicki, Benjamin A; Neslund-Dudas, Christine; Hsing, Ann W; Chu, Lisa; Goodman, Phyllis J; Klein, Eric A; Zheng, S Lilly; Witte, John S; Casey, Graham; Riboli, Elio; Li, Qiyuan; Freedman, Matthew L; Hunter, David J; Gronberg, Henrik; Cook, Michael B; Nakagawa, Hidewaki; Kraft, Peter; Chanock, Stephen J; Easton, Douglas F; Henderson, Brian E; Coetzee, Gerhard A; Conti, David V; Haiman, Christopher A
2015-10-01
Interpretation of biological mechanisms underlying genetic risk associations for prostate cancer is complicated by the relatively large number of risk variants (n = 100) and the thousands of surrogate SNPs in linkage disequilibrium. Here, we combined three distinct approaches: multiethnic fine-mapping, putative functional annotation (based upon epigenetic data and genome-encoded features), and expression quantitative trait loci (eQTL) analyses, in an attempt to reduce this complexity. We examined 67 risk regions using genotyping and imputation-based fine-mapping in populations of European (cases/controls: 8600/6946), African (cases/controls: 5327/5136), Japanese (cases/controls: 2563/4391) and Latino (cases/controls: 1034/1046) ancestry. Markers at 55 regions passed a region-specific significance threshold (P-value cutoff range: 3.9 × 10(-4)-5.6 × 10(-3)) and in 30 regions we identified markers that were more significantly associated with risk than the previously reported variants in the multiethnic sample. Novel secondary signals (P < 5.0 × 10(-6)) were also detected in two regions (rs13062436/3q21 and rs17181170/3p12). Among 666 variants in the 55 regions with P-values within one order of magnitude of the most-associated marker, 193 variants (29%) in 48 regions overlapped with epigenetic or other putative functional marks. In 11 of the 55 regions, cis-eQTLs were detected with nearby genes. For 12 of the 55 regions (22%), the most significant region-specific, prostate-cancer associated variant represented the strongest candidate functional variant based on our annotations; the number of regions increased to 20 (36%) and 27 (49%) when examining the 2 and 3 most significantly associated variants in each region, respectively. These results have prioritized subsets of candidate variants for downstream functional evaluation. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Sork, Victoria L; Squire, Kevin; Gugger, Paul F; Steele, Stephanie E; Levy, Eric D; Eckert, Andrew J
2016-01-01
The ability of California tree populations to survive anthropogenic climate change will be shaped by the geographic structure of adaptive genetic variation. Our goal is to test whether climate-associated candidate genes show evidence of spatially divergent selection in natural populations of valley oak, Quercus lobata, as preliminary indication of local adaptation. Using DNA from 45 individuals from 13 localities across the species' range, we sequenced portions of 40 candidate genes related to budburst/flowering, growth, osmotic stress, and temperature stress. Using 195 single nucleotide polymorphisms (SNPs), we estimated genetic differentiation across populations and correlated allele frequencies with climate gradients using single-locus and multivariate models. The top 5% of FST estimates ranged from 0.25 to 0.68, yielding loci potentially under spatially divergent selection. Environmental analyses of SNP frequencies with climate gradients revealed three significantly correlated SNPs within budburst/flowering genes and two SNPs within temperature stress genes with mean annual precipitation, after controlling for multiple testing. A redundancy model showed a significant association between SNPs and climate variables and revealed a similar set of SNPs with high loadings on the first axis. In the RDA, climate accounted for 67% of the explained variation, when holding climate constant, in contrast to a putatively neutral SSR data set where climate accounted for only 33%. Population differentiation and geographic gradients of allele frequencies in climate-associated functional genes in Q. lobata provide initial evidence of adaptive genetic variation and background for predicting population response to climate change. © 2016 Botanical Society of America.
Fisher, Colleen A.; Bhattarai, Eric K.; Osterstock, Jason B.; Dowd, Scot E.; Seabury, Paul M.; Vikram, Meenu; Whitlock, Robert H.; Schukken, Ynte H.; Schnabel, Robert D.; Taylor, Jeremy F.; Womack, James E.; Seabury, Christopher M.
2011-01-01
Members of the Toll-like receptor (TLR) gene family occupy key roles in the mammalian innate immune system by functioning as sentries for the detection of invading pathogens, thereafter provoking host innate immune responses. We utilized a custom next-generation sequencing approach and allele-specific genotyping assays to detect and validate 280 biallelic variants across all 10 bovine TLR genes, including 71 nonsynonymous single nucleotide polymorphisms (SNPs) and one putative nonsense SNP. Bayesian haplotype reconstructions and median joining networks revealed haplotype sharing between Bos taurus taurus and Bos taurus indicus breeds at every locus, and specialized beef and dairy breeds could not be differentiated despite an average polymorphism density of 1 marker/158 bp. Collectively, 160 tagSNPs and two tag insertion-deletion mutations (indels) were sufficient to predict 100% of the variation at 280 variable sites for both Bos subspecies and their hybrids, whereas 118 tagSNPs and 1 tagIndel predictively captured 100% of the variation at 235 variable sites for B. t. taurus. Polyphen and SIFT analyses of amino acid (AA) replacements encoded by bovine TLR SNPs indicated that up to 32% of the AA substitutions were expected to impact protein function. Classical and newly developed tests of diversity provide strong support for balancing selection operating on TLR3 and TLR8, and purifying selection acting on TLR10. An investigation of the persistence and continuity of linkage disequilibrium (r2≥0.50) between adjacent variable sites also supported the presence of selection acting on TLR3 and TLR8. A case-control study employing validated variants from bovine TLR genes recognizing bacterial ligands revealed six SNPs potentially eliciting small effects on susceptibility to Mycobacterium avium spp paratuberculosis infection in dairy cattle. The results of this study will broadly impact domestic cattle research by providing the necessary foundation to explore several avenues of bovine translational genomics, and the potential for marker-assisted vaccination. PMID:22164200
Chen, Shanyuan; Gomes, Rui; Costa, Vânia; Santos, Pedro; Charneca, Rui; Zhang, Ya-ping; Liu, Xue-hong; Wang, Shao-qing; Bento, Pedro; Nunes, Jose-Luis; Buzgó, József; Varga, Gyula; Anton, István; Zsolnai, Attila; Beja-Pereira, Albano
2013-10-01
The coexistence of wild boars and domestic pigs across Eurasia makes it feasible to conduct comparative genetic or genomic analyses for addressing how genetically different a domestic species is from its wild ancestor. To test whether there are differences in patterns of genetic variability between wild and domestic pigs at immunity-related genes and to detect outlier loci putatively under selection that may underlie differences in immune responses, here we analyzed 54 single-nucleotide polymorphisms (SNPs) of 19 immunity-related candidate genes on 11 autosomes in three pairs of wild boar and domestic pig populations from China, Iberian Peninsula, and Hungary. Our results showed no statistically significant differences in allele frequency and heterozygosity across SNPs between three pairs of wild and domestic populations. This observation was more likely due to the widespread and long-lasting gene flow between wild boars and domestic pigs across Eurasia. In addition, we detected eight coding SNPs from six genes as outliers being under selection consistently by three outlier tests (BayeScan2.1, FDIST2, and Arlequin3.5). Among four non-synonymous outlier SNPs, one from TLR4 gene was identified as being subject to positive (diversifying) selection and three each from CD36, IFNW1, and IL1B genes were suggested as under balancing selection. All of these four non-synonymous variants were predicted as being benign by PolyPhen-2. Our results were supported by other independent lines of evidence for positive selection or balancing selection acting on these four immune genes (CD36, IFNW1, IL1B, and TLR4). Our study showed an example applying a candidate gene approach to identify functionally important mutations (i.e., outlier loci) in wild and domestic pigs for subsequent functional experiments.
Galvan, Antonella; Falvella, Felicia S; Frullanti, Elisa; Spinola, Monica; Incarbone, Matteo; Nosotti, Mario; Santambrogio, Luigi; Conti, Barbara; Pastorino, Ugo; Gonzalez-Neira, Anna; Dragani, Tommaso A
2010-03-01
We analyzed a series of young (median age = 52 years) non-smoker lung cancer patients and their unaffected siblings as controls, using a genome-wide 620 901 single-nucleotide polymorphism (SNP) array analysis and a case-control DNA pooling approach. We identified 82 putatively associated SNPs that were retested by individual genotyping followed by use of the sib transmission disequilibrium test, pointing to 36 SNPs associated with lung cancer risk in the discordant sibs series. Analysis of these 36 SNPs in a polygenic model characterized by additive and interchangeable effects of rare alleles revealed a highly statistically significant dosage-dependent association between risk allele carrier status and proportion of cancer cases. Replication of the same 36 SNPs in a population-based series confirmed the association with lung cancer for three SNPs, suggesting that phenocopies and genetic heterogeneity can play a major role in the complex genetics of lung cancer risk in the general population.
Pérez-Rubio, Gloria; Pérez-Rodríguez, Martha E; Fernández-López, Juan Carlos; Ramírez-Venegas, Alejandra; García-Colunga, Jesús; Ávila-Moreno, Federico; Camarena, Angel; Sansores, Raúl H; Falfán-Valencia, Ramcés
2016-07-01
To identify genetic variants associated with greater tobacco consumption in a Mexican population. Daily smokers were classified as light smokers (LS; n = 742), heavy smokers (HS; n = 601) and nonsmokers (NS; n = 606). In the first stage, a genotyping microarray that included 347 SNPs in CHRNA2-CHRNA7/CHRNA10, CHRNB2-CHRNB4 and NRXN1 genes and 37 ancestry-informative markers was used to analyze 707 samples (187 HS, 328 LS and 192 NS). In the second stage, 14 SNPs from stage 1 were validated in the remaining samples (HS, LS and NS; n = 414 in each group) using real-time PCR. To predict the role of the associated SNPs, an in silico analysis was performed. Two SNPs in NRXN1 and two in CHRNA5 were associated with cigarette consumption, while rs10865246/C (NRXN1) was associated with high nicotine addiction. The in silico analysis revealed that rs1882296/T had a high level of homology with Hsa-miR-6740-5p, which encodes a putative miRNA that targets glutamate receptor subunits (GRIA2, GRID2) and GABA receptor subunits (GABRG1, GABRA4, GABRB2), while rs1882296/C had a high level of homology with Hsa-miR-6866-5p, which encodes a different miRNA that targets GRID2 and GABRB2. In a Mexican Mestizo population, greater consumption of cigarettes was influenced by polymorphisms in the NRXN1 and CHRNA5 genes. We proposed new hypotheses regarding the putative roles of miRNAs that influence the GABAergic and glutamatergic pathways in smoking addiction.
Humble, E; Martinez-Barrio, A; Forcada, J; Trathan, P N; Thorne, M A S; Hoffmann, M; Wolf, J B W; Hoffman, J I
2016-07-01
Custom genotyping arrays provide a flexible and accurate means of genotyping single nucleotide polymorphisms (SNPs) in a large number of individuals of essentially any organism. However, validation rates, defined as the proportion of putative SNPs that are verified to be polymorphic in a population, are often very low. A number of potential causes of assay failure have been identified, but none have been explored systematically. In particular, as SNPs are often developed from transcriptomes, parameters relating to the genomic context are rarely taken into account. Here, we assembled a draft Antarctic fur seal (Arctocephalus gazella) genome (assembly size: 2.41 Gb; scaffold/contig N50 : 3.1 Mb/27.5 kb). We then used this resource to map the probe sequences of 144 putative SNPs genotyped in 480 individuals. The number of probe-to-genome mappings and alignment length together explained almost a third of the variation in validation success, indicating that sequence uniqueness and proximity to intron-exon boundaries play an important role. The same pattern was found after mapping the probe sequences to the Walrus and Weddell seal genomes, suggesting that the genomes of species divergent by as much as 23 million years can hold information relevant to SNP validation outcomes. Additionally, reanalysis of genotyping data from seven previous studies found the same two variables to be significantly associated with SNP validation success across a variety of taxa. Finally, our study reveals considerable scope for validation rates to be improved, either by simply filtering for SNPs whose flanking sequences align uniquely and completely to a reference genome, or through predictive modelling. © 2015 John Wiley & Sons Ltd.
Johns, N; Tan, B H; MacMillan, M; Solheim, T S; Ross, J A; Baracos, V E; Damaraju, S; Fearon, K C H
2014-12-01
Cancer cachexia is a complex and multifactorial disease. Evolving definitions highlight the fact that a diverse range of biological processes contribute to cancer cachexia. Part of the variation in who will and who will not develop cancer cachexia may be genetically determined. As new definitions, classifications and biological targets continue to evolve, there is a need for reappraisal of the literature for future candidate association studies. This review summarizes genes identified or implicated as well as putative candidate genes contributing to cachexia, identified through diverse technology platforms and model systems to further guide association studies. A systematic search covering 1986-2012 was performed for potential candidate genes / genetic polymorphisms relating to cancer cachexia. All candidate genes were reviewed for functional polymorphisms or clinically significant polymorphisms associated with cachexia using the OMIM and GeneRIF databases. Pathway analysis software was used to reveal possible network associations between genes. Functionality of SNPs/genes was explored based on published literature, algorithms for detecting putative deleterious SNPs and interrogating the database for expression of quantitative trait loci (eQTLs). A total of 154 genes associated with cancer cachexia were identified and explored for functional polymorphisms. Of these 154 genes, 119 had a combined total of 281 polymorphisms with functional and/or clinical significance in terms of cachexia associated with them. Of these, 80 polymorphisms (in 51 genes) were replicated in more than one study with 24 polymorphisms found to influence two or more hallmarks of cachexia (i.e., inflammation, loss of fat mass and/or lean mass and reduced survival). Selection of candidate genes and polymorphisms is a key element of multigene study design. The present study provides a contemporary basis to select genes and/or polymorphisms for further association studies in cancer cachexia, and to develop their potential as susceptibility biomarkers of cachexia.
Robinson, Nicholas A; Gopikrishna, Gopalapillay; Baranski, Matthew; Katneni, Vinaya Kumar; Shekhar, Mudagandur S; Shanmugakarthik, Jayakani; Jothivel, Sarangapani; Gopal, Chavali; Ravichandran, Pitchaiyappan; Gitterle, Thomas; Ponniah, Alphis G
2014-08-28
Shrimp culture is a fast growing aquaculture sector, but in recent years there has been a shift away from tiger shrimp Penaeus monodon to other species. This is largely due to the susceptibility of P. monodon to white spot syndrome virus disease (Whispovirus sp.) which has impacted production around the world. As female penaeid shrimp grow more rapidly than males, mono-sex production would be advantageous, however little is known about genes controlling or markers associated with sex determination in shrimp. In this study, a mapped set of 3959 transcribed single nucleotide polymorphisms were used to scan the P. monodon genome for loci associated with resistance to white-spot syndrome virus and sex in seven full-sibling tiger shrimp families challenged with white spot syndrome virus. Linkage groups 2, 3, 5, 6, 17, 18, 19, 22, 27 and 43 were found to contain quantitative trait loci significantly associated with hours of survival after white spot syndrome virus infection (P < 0.05 after Bonferroni correction). Nine QTL were significantly associated with hours of survival. Of the SNPs mapping to these and other regions with suggestive associations, many were found to occur in transcripts showing homology to genes with putative immune functions of interest, including genes affecting the action of the ubiquitin-proteasome pathway, lymphocyte-cell function, heat shock proteins, the TOLL pathway, protein kinase signal transduction pathways, mRNA binding proteins, lectins and genes affecting the development and differentiation of the immune system (eg. RUNT protein 1A). Several SNPs significantly associated with sex were mapped to linkage group 30, the strongest associations (P < 0.001 after Bonferroni correction) for 3 SNPs located in a 0.8 cM stretch between positions 43.5 and 44.3 cM where the feminisation gene (FEM-1, affecting sexual differentiation in Caenorhabditis elegans) mapped. The markers for disease resistance and sexual differentiation identified by this study could be useful for marker assisted selection to improve resistance to WSSV and for identifying homogametic female individuals for mono-sex (all female) production. The genes with putative functions affecting immunity and sexual differentiation that were found to closely map to these loci provide leads about the mechanisms affecting these important economic traits in shrimp.
Meyer, Kerstin B; O'Reilly, Martin; Michailidou, Kyriaki; Carlebur, Saskia; Edwards, Stacey L; French, Juliet D; Prathalingham, Radhika; Dennis, Joe; Bolla, Manjeet K; Wang, Qin; de Santiago, Ines; Hopper, John L; Tsimiklis, Helen; Apicella, Carmel; Southey, Melissa C; Schmidt, Marjanka K; Broeks, Annegien; Van 't Veer, Laura J; Hogervorst, Frans B; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Fasching, Peter A; Lux, Michael P; Ekici, Arif B; Beckmann, Matthias W; Peto, Julian; Dos Santos Silva, Isabel; Fletcher, Olivia; Johnson, Nichola; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Marme, Federick; Schneeweiss, Andreas; Sohn, Christof; Burwinkel, Barbara; Guénel, Pascal; Truong, Thérèse; Laurent-Puig, Pierre; Menegaux, Florence; Bojesen, Stig E; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Milne, Roger L; Zamora, M Pilar; Arias, Jose I; Benitez, Javier; Neuhausen, Susan; Anton-Culver, Hoda; Ziogas, Argyrios; Dur, Christina C; Brenner, Hermann; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Meindl, Alfons; Schmutzler, Rita K; Engel, Christoph; Ditsch, Nina; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Dörk, Thilo; Helbig, Sonja; Bogdanova, Natalia V; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Chenevix-Trench, Georgia; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Lambrechts, Diether; Thienpont, Bernard; Christiaens, Marie-Rose; Smeets, Ann; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Bernard, Loris; Couch, Fergus J; Olson, Janet E; Wang, Xianshu; Purrington, Kristen; Giles, Graham G; Severi, Gianluca; Baglietto, Laura; McLean, Catriona; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Teo, Soo-Hwang; Yip, Cheng-Har; Phuah, Sze-Yee; Kristensen, Vessela; Grenaker Alnæs, Grethe; Børresen-Dale, Anne-Lise; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline M; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J; Lissowska, Jolanta; Czene, Kamila; Darabi, Hartef; Eriksson, Kimael; Hooning, Maartje J; Martens, John W M; van den Ouweland, Ans M W; van Deurzen, Carolien H M; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Cox, Angela; Reed, Malcolm W R; Blot, William; Signorello, Lisa B; Cai, Qiuyin; Pharoah, Paul D P; Ghoussaini, Maya; Harrington, Patricia; Tyrer, Jonathan; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Noh, Dong-Young; Hartman, Mikael; Hui, Miao; Lim, Wei-Yen; Buhari, Shaik A; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Durda, Katarzyna; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Vachon, Celine; Slager, Susan; Fostira, Florentia; Pilarski, Robert; Shen, Chen-Yang; Hsiung, Chia-Ni; Wu, Pei-Ei; Hou, Ming-Feng; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J; Ponder, Bruce A J; Dunning, Alison M; Easton, Douglas F
2013-12-05
The 10q26 locus in the second intron of FGFR2 is the locus most strongly associated with estrogen-receptor-positive breast cancer in genome-wide association studies. We conducted fine-scale mapping in case-control studies genotyped with a custom chip (iCOGS), comprising 41 studies (n = 89,050) of European ancestry, 9 Asian ancestry studies (n = 13,983), and 2 African ancestry studies (n = 2,028) from the Breast Cancer Association Consortium. We identified three statistically independent risk signals within the locus. Within risk signals 1 and 3, genetic analysis identified five and two variants, respectively, highly correlated with the most strongly associated SNPs. By using a combination of genetic fine mapping, data on DNase hypersensitivity, and electrophoretic mobility shift assays to study protein-DNA binding, we identified rs35054928, rs2981578, and rs45631563 as putative functional SNPs. Chromatin immunoprecipitation showed that FOXA1 preferentially bound to the risk-associated allele (C) of rs2981578 and was able to recruit ERα to this site in an allele-specific manner, whereas E2F1 preferentially bound the risk variant of rs35054928. The risk alleles were preferentially found in open chromatin and bound by Ser5 phosphorylated RNA polymerase II, suggesting that the risk alleles are associated with changes in transcription. Chromatin conformation capture demonstrated that the risk region was able to interact with the promoter of FGFR2, the likely target gene of this risk region. A role for FOXA1 in mediating breast cancer susceptibility at this locus is consistent with the finding that the FGFR2 risk locus primarily predisposes to estrogen-receptor-positive disease. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Meyer, Kerstin B.; O’Reilly, Martin; Michailidou, Kyriaki; Carlebur, Saskia; Edwards, Stacey L.; French, Juliet D.; Prathalingham, Radhika; Dennis, Joe; Bolla, Manjeet K.; Wang, Qin; de Santiago, Ines; Hopper, John L.; Tsimiklis, Helen; Apicella, Carmel; Southey, Melissa C.; Schmidt, Marjanka K.; Broeks, Annegien; Van ’t Veer, Laura J.; Hogervorst, Frans B.; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Fasching, Peter A.; Lux, Michael P.; Ekici, Arif B.; Beckmann, Matthias W.; Peto, Julian; dos Santos Silva, Isabel; Fletcher, Olivia; Johnson, Nichola; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Marme, Federick; Schneeweiss, Andreas; Sohn, Christof; Burwinkel, Barbara; Guénel, Pascal; Truong, Thérèse; Laurent-Puig, Pierre; Menegaux, Florence; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; Milne, Roger L.; Zamora, M. Pilar; Arias, Jose I.; Benitez, Javier; Neuhausen, Susan; Anton-Culver, Hoda; Ziogas, Argyrios; Dur, Christina C.; Brenner, Hermann; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Meindl, Alfons; Schmutzler, Rita K.; Engel, Christoph; Ditsch, Nina; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Dörk, Thilo; Helbig, Sonja; Bogdanova, Natalia V.; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Chenevix-Trench, Georgia; Wu, Anna H.; Tseng, Chiu-chen; Van Den Berg, David; Stram, Daniel O.; Lambrechts, Diether; Thienpont, Bernard; Christiaens, Marie-Rose; Smeets, Ann; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Bernard, Loris; Couch, Fergus J.; Olson, Janet E.; Wang, Xianshu; Purrington, Kristen; Giles, Graham G.; Severi, Gianluca; Baglietto, Laura; McLean, Catriona; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Goldberg, Mark S.; Labrèche, France; Dumont, Martine; Teo, Soo-Hwang; Yip, Cheng-Har; Phuah, Sze-Yee; Kristensen, Vessela; Grenaker Alnæs, Grethe; Børresen-Dale, Anne-Lise; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Kauppila, Saila; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Tchatchou, Sandrine; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline M.; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J.; Lissowska, Jolanta; Czene, Kamila; Darabi, Hartef; Eriksson, Kimael; Hooning, Maartje J.; Martens, John W.M.; van den Ouweland, Ans M.W.; van Deurzen, Carolien H.M.; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Cox, Angela; Reed, Malcolm W.R.; Blot, William; Signorello, Lisa B.; Cai, Qiuyin; Pharoah, Paul D.P.; Ghoussaini, Maya; Harrington, Patricia; Tyrer, Jonathan; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K.; Noh, Dong-Young; Hartman, Mikael; Hui, Miao; Lim, Wei-Yen; Buhari, Shaik A.; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Durda, Katarzyna; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Vachon, Celine; Slager, Susan; Fostira, Florentia; Pilarski, Robert; Shen, Chen-Yang; Hsiung, Chia-Ni; Wu, Pei-Ei; Hou, Ming-Feng; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J.; Ponder, Bruce A.J.; Dunning, Alison M.; Easton, Douglas F.
2013-01-01
The 10q26 locus in the second intron of FGFR2 is the locus most strongly associated with estrogen-receptor-positive breast cancer in genome-wide association studies. We conducted fine-scale mapping in case-control studies genotyped with a custom chip (iCOGS), comprising 41 studies (n = 89,050) of European ancestry, 9 Asian ancestry studies (n = 13,983), and 2 African ancestry studies (n = 2,028) from the Breast Cancer Association Consortium. We identified three statistically independent risk signals within the locus. Within risk signals 1 and 3, genetic analysis identified five and two variants, respectively, highly correlated with the most strongly associated SNPs. By using a combination of genetic fine mapping, data on DNase hypersensitivity, and electrophoretic mobility shift assays to study protein-DNA binding, we identified rs35054928, rs2981578, and rs45631563 as putative functional SNPs. Chromatin immunoprecipitation showed that FOXA1 preferentially bound to the risk-associated allele (C) of rs2981578 and was able to recruit ERα to this site in an allele-specific manner, whereas E2F1 preferentially bound the risk variant of rs35054928. The risk alleles were preferentially found in open chromatin and bound by Ser5 phosphorylated RNA polymerase II, suggesting that the risk alleles are associated with changes in transcription. Chromatin conformation capture demonstrated that the risk region was able to interact with the promoter of FGFR2, the likely target gene of this risk region. A role for FOXA1 in mediating breast cancer susceptibility at this locus is consistent with the finding that the FGFR2 risk locus primarily predisposes to estrogen-receptor-positive disease. PMID:24290378
Liu, Jun-Jun; Sniezko, Richard; Murray, Michael; Wang, Ning; Chen, Hao; Zamany, Arezoo; Sturrock, Rona N.; Savin, Douglas; Kegley, Angelia
2016-01-01
Whitebark pine (WBP, Pinus albicaulis Engelm.) is an endangered conifer species due to heavy mortality from white pine blister rust (WPBR, caused by Cronartium ribicola) and mountain pine beetle (Dendroctonus ponderosae). Information about genetic diversity and population structure is of fundamental importance for its conservation and restoration. However, current knowledge on the genetic constitution and genomic variation is still limited for WBP. In this study, an integrated genomics approach was applied to characterize seed collections from WBP breeding programs in western North America. RNA-seq analysis was used for de novo assembly of the WBP needle transcriptome, which contains 97,447 protein-coding transcripts. Within the transcriptome, single nucleotide polymorphisms (SNPs) were discovered, and more than 22,000 of them were non-synonymous SNPs (ns-SNPs). Following the annotation of genes with ns-SNPs, 216 ns-SNPs within candidate genes with putative functions in disease resistance and plant defense were selected to design SNP arrays for high-throughput genotyping. Among these SNP loci, 71 were highly polymorphic, with sufficient variation to identify a unique genotype for each of the 371 individuals originating from British Columbia (Canada), Oregon and Washington (USA). A clear genetic differentiation was evident among seed families. Analyses of genetic spatial patterns revealed varying degrees of diversity and the existence of several genetic subgroups in the WBP breeding populations. Genetic components were associated with geographic variables and phenotypic rating of WPBR disease severity across landscapes, which may facilitate further identification of WBP genotypes and gene alleles contributing to local adaptation and quantitative resistance to WPBR. The WBP genomic resources developed here provide an invaluable tool for further studies and for exploitation and utilization of the genetic diversity preserved within this endangered conifer and other five-needle pines. PMID:27992468
Abdel-Ghani, Adel H; Kumar, Bharath; Pace, Jordon; Jansen, Constantin; Gonzalez-Portilla, Pedro J; Reyes-Matamoros, Jenaro; San Martin, Juan Pablo; Lee, Michael; Lübberstedt, Thomas
2015-05-01
A better understanding of the genetic control of root development might allow one to develop lines with root systems with the potential to adapt to soils with limited nutrient availability. For this purpose, an association study (AS) panel consisting of 74 diverse set of inbred maize lines were screened for seedling root traits and adult plant root traits under two contrasting nitrogen (N) levels (low and high N). Allele re-sequencing of RTCL, RTH3, RUM1, and RUL1 genes related to root development was carried out for AS panel lines. Association analysis was carried out between individual polymorphisms, and both seedling and adult plant traits, while controlling for spurious associations due to population structure and kinship relations. Based on the SNPs identified in RTCL, RTH3, RUM1, and RUL1, lines within the AS panel were grouped into 16, 9, 22, and 7 haplotypes, respectively. Association analysis revealed several polymorphisms within root genes putatively associated with the variability in seedling root and adult plant traits development under contrasting N levels. The highest number of significantly associated SNPs with seedling root traits were found in RTCL (19 SNPs) followed by RUM1 (4 SNPs) and in case of RTH3 and RUL1, two and three SNPs, respectively, were significantly associated with root traits. RTCL and RTH3 were also found to be associated with grain yield. Thus considerable allelic diversity is present within the candidate genes studied and can be utilized to develop functional markers that allow identification of maize lines with improved root architecture and yield under N stress conditions.
El-Halawany, Nermin; Abd-El-Monsif, Shawky A; Al-Tohamy Ahmed, F M; Hegazy, Lamees; Abdel-Shafy, Hamdy; Abdel-Latif, Magdy A; Ghazi, Yasser A; Neuhoff, Christiane; Salilew-Wondim, Dessie; Schellander, Karl
2017-03-01
Mastitis is an infectious disease of the mammary gland that leads to reduced milk production and change in milk composition. Complement component C3 plays a major role as a central molecule of the complement cascade involving in killing of microorganisms, either directly or in cooperation with phagocytic cells. C3 cDNA were isolated, from Egyptian buffalo and cattle, sequenced and characterized. The C3 cDNA sequences of buffalo and cattle consist of 5025 and 5019 bp, respectively. Buffalo and cattle C3 cDNAs share 99% of sequence identity with each other. The 4986 bp open reading frame in buffalo encodes a putative protein of 1661 amino acids-as in cattle-and includes all the functional domains. Further, analysis of the C3 cDNA sequences detected six novel single-nucleotide polymorphisms (SNPs) in buffalo and three novel SNPs in cattle. The association analysis of the detected SNPs with milk somatic cell score as an indicator of mastitis revealed that the most significant association in buffalo was found in the C>A substitution (ss: 1752816097) in exon 27, whereas in cattle it was in the C>T substitution (ss: 1752816085) in exon 12. Our findings provide preliminary information about the contribution of C3 polymorphisms to mastitis resistance in buffalo and cattle.
Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius
2013-05-01
Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.
Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple
Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron
2012-01-01
As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718
Skallerup, P; Thamsborg, S M; Jørgensen, C B; Mejer, H; Göring, H H H; Archibald, A L; Fredholm, M; Nejsum, P
2015-06-15
Whipworms (Trichuris spp.) infect a variety of hosts, including domestic animals and humans. Of considerable interest is the porcine whipworm, T. suis, which is particularly prevalent in outdoor production systems. High infection levels may cause growth retardation, anaemia and haemorrhagic diarrhoea. A significant proportion of the variation in Trichuris faecal egg count (FEC) has been attributed to the host's genetic make-up. The aim of the present study was to identify genetic loci associated with resistance to T. suis in pigs. We used single nucleotide polymorphism (SNP) markers to perform a whole-genome scan of an F1 resource population (n = 195) trickle-infected with T. suis. A measured genotype analysis revealed a putative quantitative trait locus (QTL) for T. suis FEC on chromosome 13 covering ∼ 4.5 Mbp, although none of the SNPs reached genome-wide significance. We tested the hypothesis that this region of SSC13 harboured genes with effects on T. suis burden by genotyping three SNPs within the putative QTL in unrelated pigs exposed to either experimental or natural T. suis infections and from which we had FEC (n = 113) or worm counts (n = 178). In these studies, two of the SNPs (rs55618716, ST) were associated with FEC (P < 0.01), thus confirming our initial findings. However, we did not find any of the SNPs to be associated with T. suis worm burden. In conclusion, our study demonstrates that genetic markers for resistance to T. suis as indicated by low FEC can be identified in pigs. Copyright © 2015 Elsevier B.V. All rights reserved.
Geographic differences in allele frequencies of susceptibility SNPs for cardiovascular disease
2011-01-01
Background We hypothesized that the frequencies of risk alleles of SNPs mediating susceptibility to cardiovascular diseases differ among populations of varying geographic origin and that population-specific selection has operated on some of these variants. Methods From the database of genome-wide association studies (GWAS), we selected 36 cardiovascular phenotypes including coronary heart disease, hypertension, and stroke, as well as related quantitative traits (eg, body mass index and plasma lipid levels). We identified 292 SNPs in 270 genes associated with a disease or trait at P < 5 × 10-8. As part of the Human Genome-Diversity Project (HGDP), 158 (54.1%) of these SNPs have been genotyped in 938 individuals belonging to 52 populations from seven geographic areas. A measure of population differentiation, FST, was calculated to quantify differences in risk allele frequencies (RAFs) among populations and geographic areas. Results Large differences in RAFs were noted in populations of Africa, East Asia, America and Oceania, when compared with other geographic regions. The mean global FST (0.1042) for 158 SNPs among the populations was not significantly higher than the mean global FST of 158 autosomal SNPs randomly sampled from the HGDP database. Significantly higher global FST (P < 0.05) was noted in eight SNPs, based on an empirical distribution of global FST of 2036 putatively neutral SNPs. For four of these SNPs, additional evidence of selection was noted based on the integrated Haplotype Score. Conclusion Large differences in RAFs for a set of common SNPs that influence risk of cardiovascular disease were noted between the major world populations. Pairwise comparisons revealed RAF differences for at least eight SNPs that might be due to population-specific selection or demographic factors. These findings are relevant to a better understanding of geographic variation in the prevalence of cardiovascular disease. PMID:21507254
Schönhals, Elske Maria; Ding, Jia; Ritter, Enrique; Paulo, Maria João; Cara, Nicolás; Tacke, Ekhard; Hofferbert, Hans-Reinhard; Lübeck, Jens; Strahwald, Josef; Gebhardt, Christiane
2017-08-22
Tuber yield and starch content of the cultivated potato are complex traits of decisive importance for breeding improved varieties. Natural variation of tuber yield and starch content depends on the environment and on multiple, mostly unknown genetic factors. Dissection and molecular identification of the genes and their natural allelic variants controlling these complex traits will lead to the development of diagnostic DNA-based markers, by which precision and efficiency of selection can be increased (precision breeding). Three case-control populations were assembled from tetraploid potato cultivars based on maximizing the differences between high and low tuber yield (TY), starch content (TSC) and starch yield (TSY, arithmetic product of TY and TSC). The case-control populations were genotyped by restriction-site associated DNA sequencing (RADseq) and the 8.3 k SolCAP SNP genotyping array. The allele frequencies of single nucleotide polymorphisms (SNPs) were compared between cases and controls. RADseq identified, depending on data filtering criteria, between 6664 and 450 genes with one or more differential SNPs for one, two or all three traits. Differential SNPs in 275 genes were detected using the SolCAP array. A genome wide association study using the SolCAP array on an independent, unselected population identified SNPs associated with tuber starch content in 117 genes. Physical mapping of the genes containing differential or associated SNPs, and comparisons between the two genome wide genotyping methods and two different populations identified genome segments on all twelve potato chromosomes harboring one or more quantitative trait loci (QTL) for TY, TSC and TSY. Several hundred genes control tuber yield and starch content in potato. They are unequally distributed on all potato chromosomes, forming clusters between 0.5-4 Mbp width. The largest fraction of these genes had unknown function, followed by genes with putative signalling and regulatory functions. The genetic control of tuber yield and starch content is interlinked. Most differential SNPs affecting both traits had antagonistic effects: The allele increasing TY decreased TSC and vice versa. Exceptions were 89 SNP alleles which had synergistic effects on TY, TSC and TSY. These and the corresponding genes are primary targets for developing diagnostic markers.
Deep Sequencing-Based Analysis of the Cymbidium ensifolium Floral Transcriptome
Li, Xiaobai; Luo, Jie; Yan, Tianlian; Xiang, Lin; Jin, Feng; Qin, Dehui; Sun, Chongbo; Xie, Ming
2013-01-01
Cymbidium ensifolium is a Chinese Cymbidium with an elegant shape, beautiful appearance, and a fragrant aroma. C. ensifolium has a long history of cultivation in China and it has excellent commercial value as a potted plant and cut flower. The development of C. ensifolium genomic resources has been delayed because of its large genome size. Taking advantage of technical and cost improvement of RNA-Seq, we extracted total mRNA from flower buds and mature flowers and obtained a total of 9.52 Gb of filtered nucleotides comprising 98,819,349 filtered reads. The filtered reads were assembled into 101,423 isotigs, representing 51,696 genes. Of the 101,423 isotigs, 41,873 were putative homologs of annotated sequences in the public databases, of which 158 were associated with floral development and 119 were associated with flowering. The isotigs were categorized according to their putative functions. In total, 10,212 of the isotigs were assigned into 25 eukaryotic orthologous groups (KOGs), 41,690 into 58 gene ontology (GO) terms, and 9,830 into 126 Arabidopsis Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, and 9,539 isotigs into 123 rice pathways. Comparison of the isotigs with those of the two related orchid species P. equestris and C. sinense showed that 17,906 isotigs are unique to C. ensifolium. In addition, a total of 7,936 SSRs and 16,676 putative SNPs were identified. To our knowledge, this transcriptome database is the first major genomic resource for C. ensifolium and the most comprehensive transcriptomic resource for genus Cymbidium. These sequences provide valuable information for understanding the molecular mechanisms of floral development and flowering. Sequences predicted to be unique to C. ensifolium would provide more insights into C. ensifolium gene diversity. The numerous SNPs and SSRs identified in the present study will contribute to marker development for C. ensifolium. PMID:24392013
Prospecting for pig single nucleotide polymorphisms in the human genome: have we struck gold?
Grapes, L; Rudd, S; Fernando, R L; Megy, K; Rocha, D; Rothschild, M F
2006-06-01
Gene-to-gene variation in the frequency of single nucleotide polymorphisms (SNPs) has been observed in humans, mice, rats, primates and pigs, but a relationship across species in this variation has not been described. Here, the frequency of porcine coding SNPs (cSNPs) identified by in silico methods, and the frequency of murine cSNPs, were compared with the frequency of human cSNPs across homologous genes. From 150,000 porcine expressed sequence tag (EST) sequences, a total of 452 SNP-containing sequence clusters were found, totalling 1394 putative SNPs. All the clustered porcine EST annotations and SNP data have been made publicly available at http://sputnik.btk.fi/project?name=swine. Human and murine cSNPs were identified from dbSNP and were characterized as either validated or total number of cSNPs (validated plus non-validated) for comparison purposes. The correlation between in silico pig cSNP and validated human cSNP densities was found to be 0.77 (p < 0.00001) for a set of 25 homologous genes, while a correlation of 0.48 (p < 0.0005) was found for a primarily random sample of 50 homologous human and mouse genes. This is the first evidence of conserved gene-to-gene variability in cSNP frequency across species and indicates that site-directed screening of porcine genes that are homologous to cSNP-rich human genes may rapidly advance cSNP discovery in pigs.
Hess, Jon E; Campbell, Nathan R; Docker, Margaret F; Baker, Cyndi; Jackson, Aaron; Lampman, Ralph; McIlraith, Brian; Moser, Mary L; Statler, David P; Young, William P; Wildbill, Andrew J; Narum, Shawn R
2015-01-01
Next-generation sequencing data can be mined for highly informative single nucleotide polymorphisms (SNPs) to develop high-throughput genomic assays for nonmodel organisms. However, choosing a set of SNPs to address a variety of objectives can be difficult because SNPs are often not equally informative. We developed an optimal combination of 96 high-throughput SNP assays from a total of 4439 SNPs identified in a previous study of Pacific lamprey (Entosphenus tridentatus) and used them to address four disparate objectives: parentage analysis, species identification and characterization of neutral and adaptive variation. Nine of these SNPs are FST outliers, and five of these outliers are localized within genes and significantly associated with geography, run-timing and dwarf life history. Two of the 96 SNPs were diagnostic for two other lamprey species that were morphologically indistinguishable at early larval stages and were sympatric in the Pacific Northwest. The majority (85) of SNPs in the panel were highly informative for parentage analysis, that is, putatively neutral with high minor allele frequency across the species' range. Results from three case studies are presented to demonstrate the broad utility of this panel of SNP markers in this species. As Pacific lamprey populations are undergoing rapid decline, these SNPs provide an important resource to address critical uncertainties associated with the conservation and recovery of this imperiled species. © 2014 John Wiley & Sons Ltd.
Hulse-Kemp, Amanda M.; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A.; Scheffler, Brian E.; Fang, David D.; Chen, Z. Jeffrey; Van Deynze, Allen; Stelly, David M.
2015-01-01
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. PMID:25858960
Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay
2013-09-23
Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.
RICD: a rice indica cDNA database resource for rice functional genomics.
Lu, Tingting; Huang, Xuehui; Zhu, Chuanrang; Huang, Tao; Zhao, Qiang; Xie, Kabing; Xiong, Lizhong; Zhang, Qifa; Han, Bin
2008-11-26
The Oryza sativa L. indica subspecies is the most widely cultivated rice. During the last few years, we have collected over 20,000 putative full-length cDNAs and over 40,000 ESTs isolated from various cDNA libraries of two indica varieties Guangluai 4 and Minghui 63. A database of the rice indica cDNAs was therefore built to provide a comprehensive web data source for searching and retrieving the indica cDNA clones. Rice Indica cDNA Database (RICD) is an online MySQL-PHP driven database with a user-friendly web interface. It allows investigators to query the cDNA clones by keyword, genome position, nucleotide or protein sequence, and putative function. It also provides a series of information, including sequences, protein domain annotations, similarity search results, SNPs and InDels information, and hyperlinks to gene annotation in both The Rice Annotation Project Database (RAP-DB) and The TIGR Rice Genome Annotation Resource, expression atlas in RiceGE and variation report in Gramene of each cDNA. The online rice indica cDNA database provides cDNA resource with comprehensive information to researchers for functional analysis of indica subspecies and for comparative genomics. The RICD database is available through our website http://www.ncgr.ac.cn/ricd.
Doyle, Jacqueline M.; Katzner, Todd E.; Roemer, Gary; Cain, James W.; Millsap, Brian; McIntyre, Carol; Sonsthagen, Sarah A.; Fernandez, Nadia B.; Wheeler, Maria; Bulut, Zafer; Bloom, Peter; DeWoody, J. Andrew
2016-01-01
Molecular markers can reveal interesting aspects of organismal ecology and evolution, especially when surveyed in rare or elusive species. Herein, we provide a preliminary assessment of golden eagle (Aquila chrysaetos) population structure in North America using novel single nucleotide polymorphisms (SNPs). These SNPs included one molecular sexing marker, two mitochondrial markers, 85 putatively neutral markers that were derived from noncoding regions within large intergenic intervals, and 74 putatively nonneutral markers found in or very near protein-coding genes. We genotyped 523 eagle samples at these 162 SNPs and quantified genotyping error rates and variability at each marker. Our samples corresponded to 344 individual golden eagles as assessed by unique multilocus genotypes. Observed heterozygosity of known adults was significantly higher than of chicks, as was the number of heterozygous loci, indicating that mean zygosity measured across all 159 autosomal markers was an indicator of fitness as it is associated with eagle survival to adulthood. Finally, we used chick samples of known provenance to test for population differentiation across portions of North America and found pronounced structure among geographic sampling sites. These data indicate that cryptic genetic population structure is likely widespread in the golden eagle gene pool, and that extensive field sampling and genotyping will be required to more clearly delineate management units within North America and elsewhere.
Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire
2012-01-01
Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Brütting, Christine; Emmer, Alexander; Kornhuber, Malte; Staege, Martin S
2016-08-01
Although multiple sclerosis (MS) is one of the most common central nervous system diseases in young adults, little is known about its etiology. Several human endogenous retroviruses (ERVs) are considered to play a role in MS. We are interested in which ERVs can be identified in the vicinity of MS associated genetic marker to find potential initiators of MS. We analysed the chromosomal regions surrounding 58 single nucleotide polymorphisms (SNPs) that are associated with MS identified in one of the last major genome wide association studies. We scanned these regions for putative endogenous retrovirus sequences with large open reading frames (ORFs). We observed that more retrovirus-related putative ORFs exist in the relatively close vicinity of SNP marker indices in multiple sclerosis compared to control SNPs. We found very high homologies to HERV-K, HCML-ARV, XMRV, Galidia ERV, HERV-H/env62 and XMRV-like mouse endogenous retrovirus mERV-XL. The associated genes (CYP27B1, CD6, CD58, MPV17L2, IL12RB1, CXCR5, PTGER4, TAGAP, TYK2, ICAM3, CD86, GALC, GPR65 as well as the HLA DRB1*1501) are mainly involved in the immune system, but also in vitamin D regulation. The most frequently detected ERV sequences are related to the multiple sclerosis-associated retrovirus, the human immunodeficiency virus 1, HERV-K, and the Simian foamy virus. Our data shows that there is a relation between MS associated SNPs and the number of retroviral elements compared to control. Our data identifies new ERV sequences that have not been associated with MS, so far.
Hulse-Kemp, Amanda M; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A; Scheffler, Brian E; Fang, David D; Chen, Z Jeffrey; Van Deynze, Allen; Stelly, David M
2015-04-09
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. Copyright © 2015 Hulse-Kemp et al.
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar).
Houston, Ross D; Taggart, John B; Cézard, Timothé; Bekaert, Michaël; Lowe, Natalie R; Downing, Alison; Talbot, Richard; Bishop, Stephen C; Archibald, Alan L; Bron, James E; Penman, David J; Davassi, Alessandro; Brew, Fiona; Tinch, Alan E; Gharbi, Karim; Hamilton, Alastair
2014-02-06
Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection.
Evolutionary Quantitative Genomics of Populus trichocarpa
McKown, Athena D.; La Mantia, Jonathan; Guy, Robert D.; Ingvarsson, Pär K.; Hamelin, Richard; Mansfield, Shawn D.; Ehlting, Jürgen; Douglas, Carl J.; El-Kassaby, Yousry A.
2015-01-01
Forest trees generally show high levels of local adaptation and efforts focusing on understanding adaptation to climate will be crucial for species survival and management. Here, we address fundamental questions regarding the molecular basis of adaptation in undomesticated forest tree populations to past climatic environments by employing an integrative quantitative genetics and landscape genomics approach. Using this comprehensive approach, we studied the molecular basis of climate adaptation in 433 Populus trichocarpa (black cottonwood) genotypes originating across western North America. Variation in 74 field-assessed traits (growth, ecophysiology, phenology, leaf stomata, wood, and disease resistance) was investigated for signatures of selection (comparing Q ST -F ST) using clustering of individuals by climate of origin (temperature and precipitation). 29,354 SNPs were investigated employing three different outlier detection methods and marker-inferred relatedness was estimated to obtain the narrow-sense estimate of population differentiation in wild populations. In addition, we compared our results with previously assessed selection of candidate SNPs using the 25 topographical units (drainages) across the P. trichocarpa sampling range as population groupings. Narrow-sense Q ST for 53% of distinct field traits was significantly divergent from expectations of neutrality (indicating adaptive trait variation); 2,855 SNPs showed signals of diversifying selection and of these, 118 SNPs (within 81 genes) were associated with adaptive traits (based on significant Q ST). Many SNPs were putatively pleiotropic for functionally uncorrelated adaptive traits, such as autumn phenology, height, and disease resistance. Evolutionary quantitative genomics in P. trichocarpa provides an enhanced understanding regarding the molecular basis of climate-driven selection in forest trees and we highlight that important loci underlying adaptive trait variation also show relationship to climate of origin. We consider our approach the most comprehensive, as it uncovers the molecular mechanisms of adaptation using multiple methods and tests. We also provide a detailed outline of the required analyses for studying adaptation to the environment in a population genomics context to better understand the species’ potential adaptive capacity to future climatic scenarios. PMID:26599762
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar)
2014-01-01
Background Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. Results SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. Conclusions This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection. PMID:24524230
Schjørring, Susanne; Stegger, Marc; Kjelsø, Charlotte; Lilje, Berit; Bangsborg, Jette M; Petersen, Randi F; David, Sophia; Uldum, Søren A
2017-01-01
Between July and November 2014, 15 community-acquired cases of Legionnaires´ disease (LD), including four with Legionella pneumophila serogroup 1 sequence type (ST) 82, were diagnosed in Northern Zealand, Denmark. An outbreak was suspected. No ST82 isolates were found in environmental samples and no external source was established. Four putative-outbreak ST82 isolates were retrospectively subjected to whole genome sequencing (WGS) followed by phylogenetic analyses with epidemiologically unrelated ST82 sequences. The four putative-outbreak ST82 sequences fell into two clades, the two clades were separated by ca 1,700 single nt polymorphisms (SNP)s when recombination regions were included but only by 12 to 21 SNPs when these were removed. A single putative-outbreak ST82 isolate sequence segregated in the first clade. The other three clustered in the second clade, where all included sequences had < 5 SNP differences between them. Intriguingly, this clade also comprised epidemiologically unrelated isolate sequences from the UK and Denmark dating back as early as 2011. The study confirms that recombination plays a major role in L. pneumophila evolution. On the other hand, strains belonging to the same ST can have only few SNP differences despite being sampled over both large timespans and geographic distances. These are two important factors to consider in outbreak investigations. PMID:28662761
Mutation Scanning in Wheat by Exon Capture and Next-Generation Sequencing.
King, Robert; Bird, Nicholas; Ramirez-Gonzalez, Ricardo; Coghill, Jane A; Patil, Archana; Hassani-Pak, Keywan; Uauy, Cristobal; Phillips, Andrew L
2015-01-01
Targeted Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach to identify novel sequence variation in genomes, with the aims of investigating gene function and/or developing useful alleles for breeding. Despite recent advances in wheat genomics, most current TILLING methods are low to medium in throughput, being based on PCR amplification of the target genes. We performed a pilot-scale evaluation of TILLING in wheat by next-generation sequencing through exon capture. An oligonucleotide-based enrichment array covering ~2 Mbp of wheat coding sequence was used to carry out exon capture and sequencing on three mutagenised lines of wheat containing previously-identified mutations in the TaGA20ox1 homoeologous genes. After testing different mapping algorithms and settings, candidate SNPs were identified by mapping to the IWGSC wheat Chromosome Survey Sequences. Where sequence data for all three homoeologues were found in the reference, mutant calls were unambiguous; however, where the reference lacked one or two of the homoeologues, captured reads from these genes were mis-mapped to other homoeologues, resulting either in dilution of the variant allele frequency or assignment of mutations to the wrong homoeologue. Competitive PCR assays were used to validate the putative SNPs and estimate cut-off levels for SNP filtering. At least 464 high-confidence SNPs were detected across the three mutagenized lines, including the three known alleles in TaGA20ox1, indicating a mutation rate of ~35 SNPs per Mb, similar to that estimated by PCR-based TILLING. This demonstrates the feasibility of using exon capture for genome re-sequencing as a method of mutation detection in polyploid wheat, but accurate mutation calling will require an improved genomic reference with more comprehensive coverage of homoeologues.
González-Mercado, A; Sánchez-López, J Y; Regla-Nava, J A; Gámez-Nava, J I; González-López, L; Duran-Gonzalez, J; Celis, A; Perea-Díaz, F J; Salazar-Páramo, M; Ibarra, B
2013-07-30
We investigated associations between vitamin D receptor (VDR) gene polymorphisms, FokI T>C (rs2228570), BsmI G>A (rs1544410), ApaI G>T (rs7975232), and TaqI T>C (rs731236), with bone mineral density (BMD) in postmenopausal Mexican-Mestizo women. Three hundred and twenty postmenopausal women participated, who were classified according to World Health Organization criteria as non-osteoporotic (Non-OP; N = 88), osteopenic (Opn; N = 144), and osteoporotic (OP; N = 88). BMD measurements at the lumbar (L1-L4) spine and at the left and right femoral neck were obtained by dual-energy X-ray absorptiometry. Single nucleotide polymorphisms (SNPs) were genotyped using real-time polymerase chain reaction and TaqMan probes. Genotype and allelic frequencies of the 4 VDR SNPs were similar among the 3 groups. Polymorphic allele frequencies were as follows: FokI (C) 0.53, 0.49, 0.56; BsmI (A) 0.26, 0.22, 0.23; ApaI (T) 0.43, 0.39, 0.44; TaqI (C) 0.27, 0.22, 0.23 for the Non-OP, Opn, and OP groups, respectively. Although no associations were found between the SNPs and BMD, based on the putative function of the FokI SNP, we constructed, for the first time, the haplotype with the 4 VDR SNPs, and found that the CGGT haplotype differed between the Non- OP and OP groups (21.8 vs 31.8%, P < 0.05). The risk analysis for this haplotype was nearly significant under the dominant model (OR = 1.783, 95%CI = 0.98-3.25, P = 0.058). This result suggests a possible susceptibility effect of the C allele of the FokI SNP for the development of osteoporosis in postmenopausal Mexican-Mestizo women.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, Shea; Slezak, Tom
With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs. The method is fast to compute, finding SNPs and building a SNP phylogeny in seconds to hours. We use it to identify thousands of putative SNPs from all publicly available Filoviridae, Poxviridae, foot-and-mouth disease virus, Bacillus, and Escherichia coli genomes and plasmids. Themore » SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle as input hundreds of gigabases of sequence in a single run. The algorithm is based on k-mer analysis using a suffix array, so we call it saSNP.« less
USDA-ARS?s Scientific Manuscript database
Salmonid genomes are considered to be in a pseudo-tetraploid state as a result of an evolutionarily recent genome duplication event. This situation complicates single nucleotide polymorphism (SNP) discovery in rainbow trout as many putative SNPs are actually paralogous sequence variants (PSVs) and ...
Wei, Dan-Dan; Chen, Er-Hu; Ding, Tian-Bo; Chen, Shi-Chun; Dou, Wei; Wang, Jin-Jun
2013-01-01
Background As a major stored-product pest insect, Liposcelis entomophila has developed high levels of resistance to various insecticides in grain storage systems. However, the molecular mechanisms underlying resistance and environmental stress have not been characterized. To date, there is a lack of genomic information for this species. Therefore, studies aimed at profiling the L. entomophila transcriptome would provide a better understanding of the biological functions at the molecular levels. Methodology/Principal Findings We applied Illumina sequencing technology to sequence the transcriptome of L. entomophila. A total of 54,406,328 clean reads were obtained and that de novo assembled into 54,220 unigenes, with an average length of 571 bp. Through a similarity search, 33,404 (61.61%) unigenes were matched to known proteins in the NCBI non-redundant (Nr) protein database. These unigenes were further functionally annotated with gene ontology (GO), cluster of orthologous groups of proteins (COG), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. A large number of genes potentially involved in insecticide resistance were manually curated, including 68 putative cytochrome P450 genes, 37 putative glutathione S-transferase (GST) genes, 19 putative carboxyl/cholinesterase (CCE) genes, and other 126 transcripts to contain target site sequences or encoding detoxification genes representing eight types of resistance enzymes. Furthermore, to gain insight into the molecular basis of the L. entomophila toward thermal stresses, 25 heat shock protein (Hsp) genes were identified. In addition, 1,100 SSRs and 57,757 SNPs were detected and 231 pairs of SSR primes were designed for investigating the genetic diversity in future. Conclusions/Significance We developed a comprehensive transcriptomic database for L. entomophila. These sequences and putative molecular markers would further promote our understanding of the molecular mechanisms underlying insecticide resistance or environmental stress, and will facilitate studies on population genetics for psocids, as well as providing useful information for functional genomic research in the future. PMID:24244605
Gaur, Pallavi; Chaturvedi, Anoop
2016-01-01
One of the newest and strongest members of intercellular communicators, the Extracellular vesicles (EVs) and their enclosed RNAs; Extracellular RNAs (exRNAs) have been acknowledged as putative biomarkers and therapeutic targets for various diseases. Although a very deep insight has not been possible into the physiology of these vesicles, they are believed to be involved in cell-to-cell communication and host-pathogen interactions. EVs might be significantly helpful in discovering biomarkers for possible target identification as well as prognostics, diagnostics and developing vaccines. In recent studies, highly bioactive EVs have drawn attention of parasitologists for being able to communicate between different cells and having likeliness of reflecting both source and target environments. Next-generation sequencing (NGS) has eased the way to have a deeper insight into these vesicles and their roles in various diseases. This article arises from bioinformatics-based analysis and predictive data mining of transcriptomic (RNA-Seq) data of EVs, derived from different life stages of Trypanosoma cruzi ; a causing agent of neglected Chagas disease. Variants (Single Nucleotide Polymorphisms (SNPs)) were mined from Extracellular vesicular transcriptomic data and functionally analyzed using different bioinformatics based approaches. Functional analysis showed the association of these variants with various important factors like Trans-Sialidase (TS), Alpha Tubulin, P-Type H+-ATPase, etc. which, in turn, are associated with disease in different ways. Some of the 'candidate SNPs' were found to be stage-specific, which strengthens the probability of finding stage-specific biomarkers. These results may lead to a better understanding of Chagas disease, and improved knowledge may provide further development of the biomarkers for prognosis, diagnosis and drug development for treating Chagas disease.
Fu, Yi-Ping; Yu, Jyh-Cherng; Cheng, Ting-Chih; Lou, M Ann; Hsu, Giu-Cheng; Wu, Chia-Yun; Chen, Sou-Tong; Wu, Hurng-Sheng; Wu, Pei-Ei; Shen, Chen-Yang
2003-05-15
The role of the familial breast cancer susceptibility genes, BRCA1 and BRCA2, in the homologous recombination pathway for DNA double-strand break (DSB) repair suggests that the mechanisms involved in DNA DSB repair are of particular etiological importance during breast tumorigenesis. However, there is currently no evidence for an association between breast cancer and the other DSB repair pathway, the nonhomologous end-joining (NHEJ) pathway. It is possible that, because this DNA repair pathway is so crucial for mammalian cells to maintain genomic stability, any severe defects in it would result in serious outcomes, such as genomic instability and cell death, and block subsequent cell outgrowth and tumor formation. Thus, only subtle defects arising from low-penetrance alleles would escape lethality accumulating essential genetic changes and be associated with cancer formation, and the tumorigenic contribution of these alleles would become more obvious if individual putative high-risk genotypes of each NHEJ gene act jointly. Furthermore, this joint effect might be modified by specific environmental factors, and we hypothesized that estrogen exposure might be one such factor because estrogen is suggested to cause DNA DSBs, triggering breast tumorigenesis. Because single nucleotide polymorphisms (SNPs) are the most subtle genetic variation in the genome, to examine these hypotheses, we have genotyped 30 SNPs in all five NHEJ genes (Ku70, Ku80, DNA-PKcs, Ligase IV, and XRCC4) in 254 primary breast cancer patients and 379 healthy controls. Support for these hypotheses came from the observations that (a) two SNPs in Ku70 and XRCC4 were associated with breast cancer risk (P < 0.05); (b) a trend toward increased risk of developing breast cancer was found in women harboring a greater number of putative high-risk genotypes of NHEJ genes (an adjusted odds ratio of 1.46 for having one additional putative high-risk genotype; 95% confidence interval, 1.19-1.80); (c) this association between risk and the number of putative high-risk genotypes was stronger and more significant in women thought to be more susceptible to estrogen, i.e., those with no history of full-term pregnancy; and (d) the protective effect conferred by a history of full-term pregnancy was only significant in women with a lower number of putative high-risk genotypes of NHEJ genes. Based on comprehensive NHEJ gene profiles, this study provides new insights to suggest the role of the NHEJ pathway in breast cancer development and supports the possibility that breast cancer is initiated by estrogen exposure, which causes DNA DSBs.
Hand, Brian K.; Hether, Tyler D; Kovach, Ryan P.; Muhlfeld, Clint C.; Amish, Stephen J.; Boyer, Matthew C.; O’Rourke, Sean M.; Miller, Michael R.; Lowe, Winsor H.; Hohenlohe, Paul A.; Luikart, Gordon
2015-01-01
Invasive hybridization and introgression pose a serious threat to the persistence of many native species. Understanding the effects of hybridization on native populations (e.g., fitness consequences) requires numerous species-diagnostic loci distributed genome-wide. Here we used RAD sequencing to discover thousands of single-nucleotide polymorphisms (SNPs) that are diagnostic between rainbow trout (RBT, Oncorhynchus mykiss), the world’s most widely introduced fish, and native westslope cutthroat trout (WCT, O. clarkii lewisi) in the northern Rocky Mountains, USA. We advanced previous work that identified 4,914 species-diagnostic loci by using longer sequence reads (100 bp vs. 60 bp) and a larger set of individuals (n = 84). We sequenced RAD libraries for individuals from diverse sampling sources, including native populations of WCT and hatchery broodstocks of WCT and RBT. We also took advantage of a newly released reference genome assembly for RBT to align our RAD loci. In total, we discovered 16,788 putatively diagnostic SNPs, 10,267 of which we mapped to anchored chromosome locations on the RBT genome. A small portion of previously discovered putative diagnostic loci (325 of 4,914) were no longer diagnostic (i.e., fixed between species) based on our wider survey of non-hybridized RBT and WCT individuals. Our study suggests that RAD loci mapped to a draft genome assembly could provide the marker density required to identify genes and chromosomal regions influencing selection in admixed populations of conservation concern and evolutionary interest.
Mining SNPs from EST sequences using filters and ensemble classifiers.
Wang, J; Zou, Q; Guo, M Z
2010-05-04
Abundant single nucleotide polymorphisms (SNPs) provide the most complete information for genome-wide association studies. However, due to the bottleneck of manual discovery of putative SNPs and the inaccessibility of the original sequencing reads, it is essential to develop a more efficient and accurate computational method for automated SNP detection. We propose a novel computational method to rapidly find true SNPs in public-available EST (expressed sequence tag) databases; this method is implemented as SNPDigger. EST sequences are clustered and aligned. SNP candidates are then obtained according to a measure of redundant frequency. Several new informative biological features, such as the structural neighbor profiles and the physical position of the SNP, were extracted from EST sequences, and the effectiveness of these features was demonstrated. An ensemble classifier, which employs a carefully selected feature set, was included for the imbalanced training data. The sensitivity and specificity of our method both exceeded 80% for human genetic data in the cross validation. Our method enables detection of SNPs from the user's own EST dataset and can be used on species for which there is no genome data. Our tests showed that this method can effectively guide SNP discovery in ESTs and will be useful to avoid and save the cost of biological analyses.
Madampage, Claudia Avis; Rawlyk, Neil; Crockford, Gordon; Van Donkersgoed, Joyce; Dorin, Craig; Potter, Andrew
2015-07-01
Histophilus somni, a causative agent of the bovine respiratory disease complex, can also cause a variety of systemic disorders, including bronchopneumonia, myocarditis, pericarditis, arthritis, pleuritis, and infectious thrombotic meningoencephalitis. The purpose of this study was to determine if currently circulating strains differ from those of the 1980s by identifying genomic changes. Single nucleotide polymorphisms (SNPs) and insertion and deletion (INDEL) sites were examined by whole-genome sequencing in 12 samples, 6 old and 6 new. The 31 028 SNP/INDELs recorded were compared against the reference genome sequence of the pathogenic H. somni strain 2336. The distribution of about 75% of these SNPs within a specified gene differed between old and new isolates and did not follow any particular pattern. The other 25% clustered into 2 groups containing the same SNPs in various genes: group I included 5 old isolates and 1 new isolate; group II included 5 new isolates and 1 old isolate. For putative virulence genes there were more SNPs in group I compared with strain 2336, itself an older isolate, than in group II. Although only 25% of all the SNPs formed 2 clusters, the results suggest some genetic difference in various genes between old and new strains.
Madampage, Claudia Avis; Rawlyk, Neil; Crockford, Gordon; Van Donkersgoed, Joyce; Dorin, Craig; Potter, Andrew
2015-01-01
Histophilus somni, a causative agent of the bovine respiratory disease complex, can also cause a variety of systemic disorders, including bronchopneumonia, myocarditis, pericarditis, arthritis, pleuritis, and infectious thrombotic meningoencephalitis. The purpose of this study was to determine if currently circulating strains differ from those of the 1980s by identifying genomic changes. Single nucleotide polymorphisms (SNPs) and insertion and deletion (INDEL) sites were examined by whole-genome sequencing in 12 samples, 6 old and 6 new. The 31 028 SNP/INDELs recorded were compared against the reference genome sequence of the pathogenic H. somni strain 2336. The distribution of about 75% of these SNPs within a specified gene differed between old and new isolates and did not follow any particular pattern. The other 25% clustered into 2 groups containing the same SNPs in various genes: group I included 5 old isolates and 1 new isolate; group II included 5 new isolates and 1 old isolate. For putative virulence genes there were more SNPs in group I compared with strain 2336, itself an older isolate, than in group II. Although only 25% of all the SNPs formed 2 clusters, the results suggest some genetic difference in various genes between old and new strains. PMID:26130851
Li, Jitao; Li, Jian; Chen, Ping; Liu, Ping; He, Yuying
2015-01-01
The ridgetail white prawn Exopalaemon carinicauda is one of major economic mariculture species in eastern China. The deficiency of genomic and transcriptomic data is becoming the bottleneck of further researches on its good traits. In the present study, 454 pyrosequencing was undertaken to investigate the transcriptome profiles of E. carinicauda. A collection of 1,028,710 sequence reads (459.59 Mb) obtained from cDNA prepared from eyestalk and hemocytes was assembled into 162,056 expressed sequence tags (ESTs). Of these, 29.88 % of 48,428 contigs and 70.12 % of 113,628 singlets possessed high similarities to sequences in the GenBank non-redundant database, with most significant (E value <1e(-10)) unigenes matches occurring with crustacean and insect sequences. KEGG analysis of unigenes identified putative members of biological pathways related to growth and immunity. In addition, we obtained a total of putative 125,112 SNPs and 13,467 microsatellites. These results will contribute to the understanding of the genome makeup and provide useful information for future functional genomic research in E. carinicauda.
French, Juliet D; Johnatty, Sharon E; Lu, Yi; Beesley, Jonathan; Gao, Bo; Kalimutho, Murugan; Henderson, Michelle J; Russell, Amanda J; Kar, Siddhartha; Chen, Xiaoqing; Hillman, Kristine M; Kaufmann, Susanne; Sivakumaran, Haran; O'Reilly, Martin; Wang, Chen; Korbie, Darren J; Lambrechts, Diether; Despierre, Evelyn; Van Nieuwenhuysen, Els; Lambrechts, Sandrina; Vergote, Ignace; Karlan, Beth; Lester, Jenny; Orsulic, Sandra; Walsh, Christine; Fasching, Peter A; Beckmann, Matthias W; Ekici, Arif B; Hein, Alexander; Matsuo, Keitaro; Hosono, Satoyo; Pisterer, Jacobus; Hillemanns, Peter; Nakanishi, Toru; Yatabe, Yasushi; Goodman, Marc T; Lurie, Galina; Matsuno, Rayna K; Thompson, Pamela J; Pejovic, Tanja; Bean, Yukie; Heitz, Florian; Harter, Philipp; du Bois, Andreas; Schwaab, Ira; Hogdall, Estrid; Kjaer, Susanne K; Jensen, Allan; Hogdall, Claus; Lundvall, Lene; Engelholm, Svend Aage; Brown, Bob; Flanagan, James M; Metcalf, Michelle D; Siddiqui, Nadeem; Sellers, Thomas; Fridley, Brooke; Cunningham, Julie; Schildkraut, Joellen M; Iversen, Ed; Weber, Rachel Palmieri; Brennan, Donal; Berchuck, Andrew; Pharoah, Paul; Harnett, Paul; Norris, Murray D; Haber, Michelle; Goode, Ellen L; Lee, Jason S; Khanna, Kum Kum; Meyer, Kerstin B; Chenevix-Trench, Georgia; deFazio, Anna; Edwards, Stacey L; MacGregor, Stuart
2016-02-09
Women with epithelial ovarian cancer (EOC) are usually treated with platinum/taxane therapy after cytoreductive surgery but there is considerable inter-individual variation in response. To identify germline single-nucleotide polymorphisms (SNPs) that contribute to variations in individual responses to chemotherapy, we carried out a multi-phase genome-wide association study (GWAS) in 1,244 women diagnosed with serous EOC who were treated with the same first-line chemotherapy, carboplatin and paclitaxel. We identified two SNPs (rs7874043 and rs72700653) in TTC39B (best P=7x10-5, HR=1.90, for rs7874043) associated with progression-free survival (PFS). Functional analyses show that both SNPs lie in a putative regulatory element (PRE) that physically interacts with the promoters of PSIP1, CCDC171 and an alternative promoter of TTC39B. The C allele of rs7874043 is associated with poor PFS and showed increased binding of the Sp1 transcription factor, which is critical for chromatin interactions with PSIP1. Silencing of PSIP1 significantly impaired DNA damage-induced Rad51 nuclear foci and reduced cell viability in ovarian cancer lines. PSIP1 (PC4 and SFRS1 Interacting Protein 1) is known to protect cells from stress-induced apoptosis, and high expression is associated with poor PFS in EOC patients. We therefore suggest that the minor allele of rs7874043 confers poor PFS by increasing PSIP1 expression.
French, Juliet D.; Johnatty, Sharon E.; Lu, Yi; Beesley, Jonathan; Gao, Bo; Kalimutho, Murugan; Henderson, Michelle J.; Russell, Amanda J.; Kar, Siddhartha; Chen, Xiaoqing; Hillman, Kristine M.; Kaufmann, Susanne; Sivakumaran, Haran; O'Reilly, Martin; Wang, Chen; Korbie, Darren J.; Lambrechts, Diether; Despierre, Evelyn; Van Nieuwenhuysen, Els; Lambrechts, Sandrina; Vergote, Ignace; Karlan, Beth; Lester, Jenny; Orsulic, Sandra; Walsh, Christine; Fasching, Peter A.; Beckmann, Matthias W.; Ekici, Arif B.; Hein, Alexander; Matsuo, Keitaro; Hosono, Satoyo; Pisterer, Jacobus; Hillemanns, Peter; Nakanishi, Toru; Yatabe, Yasushi; Goodman, Marc T.; Lurie, Galina; Matsuno, Rayna K.; Thompson, Pamela J.; Pejovic, Tanja; Bean, Yukie; Heitz, Florian; Harter, Philipp; du Bois, Andreas; Schwaab, Ira; Hogdall, Estrid; Kjaer, Susanne K.; Jensen, Allan; Hogdall, Claus; Lundvall, Lene; Engelholm, Svend Aage; Brown, Bob; Flanagan, James M.; Metcalf, Michelle D.; Siddiqui, Nadeem; Sellers, Thomas; Fridley, Brooke; Cunningham, Julie; Schildkraut, Joellen M.; Iversen, Ed; Weber, Rachel Palmieri; Brennan, Donal; Berchuck, Andrew; Pharoah, Paul; Harnett, Paul; Norris, Murray D.; Haber, Michelle; Goode, Ellen L.; Lee, Jason S.; Khanna, Kum Kum; Meyer, Kerstin B.; Chenevix-Trench, Georgia; deFazio, Anna; Edwards, Stacey L.; MacGregor, Stuart
2016-01-01
Women with epithelial ovarian cancer (EOC) are usually treated with platinum/taxane therapy after cytoreductive surgery but there is considerable inter-individual variation in response. To identify germline single-nucleotide polymorphisms (SNPs) that contribute to variations in individual responses to chemotherapy, we carried out a multi-phase genome-wide association study (GWAS) in 1,244 women diagnosed with serous EOC who were treated with the same first-line chemotherapy, carboplatin and paclitaxel. We identified two SNPs (rs7874043 and rs72700653) in TTC39B (best P=7×10−5, HR=1.90, for rs7874043) associated with progression-free survival (PFS). Functional analyses show that both SNPs lie in a putative regulatory element (PRE) that physically interacts with the promoters of PSIP1, CCDC171 and an alternative promoter of TTC39B. The C allele of rs7874043 is associated with poor PFS and showed increased binding of the Sp1 transcription factor, which is critical for chromatin interactions with PSIP1. Silencing of PSIP1 significantly impaired DNA damage-induced Rad51 nuclear foci and reduced cell viability in ovarian cancer lines. PSIP1 (PC4 and SFRS1 Interacting Protein 1) is known to protect cells from stress-induced apoptosis, and high expression is associated with poor PFS in EOC patients. We therefore suggest that the minor allele of rs7874043 confers poor PFS by increasing PSIP1 expression. PMID:26840454
Ashrafi, Hamid; Hill, Theresa; Stoffel, Kevin; Kozik, Alexander; Yao, Jiqiang; Chin-Wo, Sebastian Reyes; Van Deynze, Allen
2012-10-30
Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80-120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project.
Knecht, Carolin; Mort, Matthew; Junge, Olaf; Cooper, David N.; Krawczak, Michael
2017-01-01
Abstract The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/. PMID:28180317
Fine-scale mapping of 8q24 locus identifies multiple independent risk variants for breast cancer
Zheng, Wei; Michailidou, Kyriaki; Ghoussaini, Maya; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Lush, Michael; Milne, Roger L.; Shu, Xiao-Ou; Beesley, Jonathan; Kar, Siddhartha; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W.; Zhao, Zhiguo; Guo, Xingyi; Benitez, Javier; Beeghly-Fadiel, Alicia; Blot, William; Bogdanova, Natalia V.; Bojesen, Stig E.; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Cai, Hui; Canisius, Sander; Chang-Claude, Jenny; Choi, Ji-Yeob; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Czene, Kamila; Darabi, Hatef; Devilee, Peter; Droit, Arnaud; Dork, Thilo; Fasching, Peter A.; Fletcher, Olivia; Flyger, Henrik; Fostira, Florentia; Gaborieau, Valerie; García-Closas, Montserrat; Giles, Graham G.; Guenel, Pascal; Haiman, Christopher A.; Hamann, Ute; Hartman, Mikael; Miao, Hui; Hollestelle, Antoinette; Hopper, John L.; Hsiung, Chia-Ni; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Torres, Diana; Kabisch, Maria; Kang, Daehee; Khan, Sofia; Knight, Julia A.; Kosma, Veli-Matti; Lambrechts, Diether; Li, Jingmei; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Le Marchand, Loic; Margolin, Sara; Marme, Frederik; Matsuo, Keitaro; McLean, Catriona; Meindl, Alfons; Muir, Kenneth; Neuhausen, Susan L.; Nevanlinna, Heli; Nord, Silje; Børresen-Dale, Anne-Lise; Olson, Janet E.; Orr, Nick; van den Ouweland, Ans M.W.; Peterlongo, Paolo; Putti, Thomas Choudary; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Shen, Chen-Yang; Hou, Ming-Feng; Shrubsole, Matha J; Southey, Melissa C.; Swerdlow, Anthony; Teo, Soo Hwang; Thienpont, Bernard; Toland, Amanda E.; Tollenaar, Robert A.E.M.; Tomlinson, Ian; Truong, Therese; Tseng, Chiu-chen; Wen, Wanqing; Winqvist, Robert; Wu, Anna H.; Yip, Cheng Har; Zamora, Pilar M.; Zheng, Ying; Floris, Giuseppe; Cheng, Ching-Yu; Hooning, Maartje J.; Martens, John W.M.; Seynaeve, Caroline; Kristensen, Vessela N.; Hall, Per; Pharoah, Paul D.P.; Simard, Jacques; Chenevix-Trench, Georgia; Dunning, Alison M.; Antoniou, Antonis C.; Easton, Douglas F.; Cai, Qiuyin; Long, Jirong
2016-01-01
Previous genome-wide association studies among women of European ancestry identified two independent breast cancer susceptibility loci represented by single nucleotide polymorphisms (SNPs) rs13281615 and rs11780156 at 8q24. We conducted a fine-mapping study across 2.06 Mb (chr8:127,561,724 −129,624,067, hg19) in 55,540 breast cancer cases and 51,168 controls within the Breast Cancer Association Consortium. We found three additional independent association signals in women of European ancestry, represented by rs35961416 (OR = 0.95, 95% CI = 0.93-0.97, conditional P = 5.8 × 10−6), rs7815245 (OR = 0.94, 95% CI = 0.91-0.96, conditional P = 1.1 × 10−6), and rs2033101 (OR = 1.05, 95% CI = 1.02-1.07, conditional P = 1.1 × 10−4). Integrative analysis using functional genomic data from the Roadmap Epigenomics, the Encyclopedia of DNA Elements project, the Cancer Genome Atlas, and other public resources implied that SNPs rs7815245 in Signal 3, and rs1121948 in Signal 5 (in linkage disequilibrium with rs11780156, r2 = 0.77), were putatively functional variants for two of the five independent association signals. Our results highlight multiple 8q24 variants associated with breast cancer susceptibility in women of European ancestry. PMID:27087578
Ma, Keyi; Liao, Minghui; Liu, Feng; Ye, Baoqing; Sun, Fei; Yue, Gen Hua
2016-01-01
Zinc finger AN1-type domain 3 (ZFAND3) is essential for spermatogenesis in mice. However, its function in teleosts remains unclear. In this study, we characterized the ZFAND3 gene (termed as OsZFAND3) in an important food fish, tilapia. The OsZFAND3 cDNA sequence is 1,050 bp in length, containing an ORF of 615 bp, which encodes a putative peptide of 204 amino acid residues. Quantitative real-time PCR revealed that the OsZFAND3 transcripts were exclusively expressed in the testis and ovary. In situ hybridization showed that the high expression of OsZFAND3 transcripts was predominantly localized in the spermatocyte and spermatid. These results suggest that OsZFAND3 is involved in male germ cell maturation. Three single nucleotide polymorphisms (SNPs) were detected in the introns of OsZFAND3. The OsZFAND3 gene was mapped in the sex-determining locus on linkage group 1 (LG1). The three SNPs in the OsZFAND3 gene were strictly associated with sex phenotype, suggesting that the OsZFAND3 gene is tightly linked to the sex-determining locus. Our study provides new insights into the functions of the OsZFAND3 gene in tilapia and a foundation for further detailed analysis of the OsZFAND3 gene in sex determination and differentiation. PMID:27137111
Fine-scale mapping of 8q24 locus identifies multiple independent risk variants for breast cancer.
Shi, Jiajun; Zhang, Yanfeng; Zheng, Wei; Michailidou, Kyriaki; Ghoussaini, Maya; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Lush, Michael; Milne, Roger L; Shu, Xiao-Ou; Beesley, Jonathan; Kar, Siddhartha; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W; Zhao, Zhiguo; Guo, Xingyi; Benitez, Javier; Beeghly-Fadiel, Alicia; Blot, William; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Brinton, Louise; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Cai, Hui; Canisius, Sander; Chang-Claude, Jenny; Choi, Ji-Yeob; Couch, Fergus J; Cox, Angela; Cross, Simon S; Czene, Kamila; Darabi, Hatef; Devilee, Peter; Droit, Arnaud; Dork, Thilo; Fasching, Peter A; Fletcher, Olivia; Flyger, Henrik; Fostira, Florentia; Gaborieau, Valerie; García-Closas, Montserrat; Giles, Graham G; Guenel, Pascal; Haiman, Christopher A; Hamann, Ute; Hartman, Mikael; Miao, Hui; Hollestelle, Antoinette; Hopper, John L; Hsiung, Chia-Ni; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Torres, Diana; Kabisch, Maria; Kang, Daehee; Khan, Sofia; Knight, Julia A; Kosma, Veli-Matti; Lambrechts, Diether; Li, Jingmei; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Le Marchand, Loic; Margolin, Sara; Marme, Frederik; Matsuo, Keitaro; McLean, Catriona; Meindl, Alfons; Muir, Kenneth; Neuhausen, Susan L; Nevanlinna, Heli; Nord, Silje; Børresen-Dale, Anne-Lise; Olson, Janet E; Orr, Nick; van den Ouweland, Ans M W; Peterlongo, Paolo; Putti, Thomas Choudary; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Shen, Chen-Yang; Hou, Ming-Feng; Shrubsole, Matha J; Southey, Melissa C; Swerdlow, Anthony; Teo, Soo Hwang; Thienpont, Bernard; Toland, Amanda E; Tollenaar, Robert A E M; Tomlinson, Ian; Truong, Therese; Tseng, Chiu-Chen; Wen, Wanqing; Winqvist, Robert; Wu, Anna H; Yip, Cheng Har; Zamora, Pilar M; Zheng, Ying; Floris, Giuseppe; Cheng, Ching-Yu; Hooning, Maartje J; Martens, John W M; Seynaeve, Caroline; Kristensen, Vessela N; Hall, Per; Pharoah, Paul D P; Simard, Jacques; Chenevix-Trench, Georgia; Dunning, Alison M; Antoniou, Antonis C; Easton, Douglas F; Cai, Qiuyin; Long, Jirong
2016-09-15
Previous genome-wide association studies among women of European ancestry identified two independent breast cancer susceptibility loci represented by single nucleotide polymorphisms (SNPs) rs13281615 and rs11780156 at 8q24. A fine-mapping study across 2.06 Mb (chr8:127,561,724-129,624,067, hg19) in 55,540 breast cancer cases and 51,168 controls within the Breast Cancer Association Consortium was conducted. Three additional independent association signals in women of European ancestry, represented by rs35961416 (OR = 0.95, 95% CI = 0.93-0.97, conditional p = 5.8 × 10(-6) ), rs7815245 (OR = 0.94, 95% CI = 0.91-0.96, conditional p = 1.1 × 10(-6) ) and rs2033101 (OR = 1.05, 95% CI = 1.02-1.07, conditional p = 1.1 × 10(-4) ) were found. Integrative analysis using functional genomic data from the Roadmap Epigenomics, the Encyclopedia of DNA Elements project, the Cancer Genome Atlas and other public resources implied that SNPs rs7815245 in Signal 3, and rs1121948 in Signal 5 (in linkage disequilibrium with rs11780156, r(2) = 0.77), were putatively functional variants for two of the five independent association signals. The results highlighted multiple 8q24 variants associated with breast cancer susceptibility in women of European ancestry. © 2016 UICC.
Antonini, S R; N'Diaye, N; Baldacchino, V; Hamet, P; Tremblay, J; Lacroix, A
2004-07-01
Gastric inhibitory polypeptide (GIP)-dependent Cushing's syndrome (CS) results from the ectopic expression of non-mutated GIP receptor (hGIPR) in the adrenal cortex. We evaluated whether mutations or polymorphisms in the regulatory region of the GIPR gene could lead to this aberrant expression. We studied 9.0kb upstream and 1.3kb downstream of the GIPR gene putative promoter (pProm) by sequencing leukocyte DNA from controls and from adrenal tissues of GIP- and non-GIP-dependent CS patients. The putative proximal promoter region (800 bp) and the first exon and intron of the hGIPR gene were sequenced on adrenal DNA from nine GIP-dependent CS, as well as on leukocyte DNA of nine normal controls. Three variations found in this region were found in all patients and controls; at position -4/-5, an insertion of a T was seen in four out of nine patients and in five out of nine controls. Transient transfection studies conducted in rat GC and mouse Y1 cells showed that the TT allele confers loss of 40% in the promoter activity. The analysis of the 8-kb distal pProm region revealed eight distal single nucleotide polymorphisms (SNPs) without probable association with the disease, since frequencies in patients and controls were very similar. In conclusion, mutations or SNPs in the regulatory region of the GIPR gene are unlikely to underlie GIP-dependent CS. Copyright 2004 Elsevier Ltd.
LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.
Karchin, Rachel; Diekhans, Mark; Kelly, Libusha; Thomas, Daryl J; Pieper, Ursula; Eswar, Narayanan; Haussler, David; Sali, Andrej
2005-06-15
The NCBI dbSNP database lists over 9 million single nucleotide polymorphisms (SNPs) in the human genome, but currently contains limited annotation information. SNPs that result in amino acid residue changes (nsSNPs) are of critical importance in variation between individuals, including disease and drug sensitivity. We have developed LS-SNP, a genomic scale software pipeline to annotate nsSNPs. LS-SNP comprehensively maps nsSNPs onto protein sequences, functional pathways and comparative protein structure models, and predicts positions where nsSNPs destabilize proteins, interfere with the formation of domain-domain interfaces, have an effect on protein-ligand binding or severely impact human health. It currently annotates 28,043 validated SNPs that produce amino acid residue substitutions in human proteins from the SwissProt/TrEMBL database. Annotations can be viewed via a web interface either in the context of a genomic region or by selecting sets of SNPs, genes, proteins or pathways. These results are useful for identifying candidate functional SNPs within a gene, haplotype or pathway and in probing molecular mechanisms responsible for functional impacts of nsSNPs. http://www.salilab.org/LS-SNP CONTACT: rachelk@salilab.org http://salilab.org/LS-SNP/supp-info.pdf.
Ward, Lucas D; Kellis, Manolis
2016-01-04
More than 90% of common variants associated with complex traits do not affect proteins directly, but instead the circuits that control gene expression. This has increased the urgency of understanding the regulatory genome as a key component for translating genetic results into mechanistic insights and ultimately therapeutics. To address this challenge, we developed HaploReg (http://compbio.mit.edu/HaploReg) to aid the functional dissection of genome-wide association study (GWAS) results, the prediction of putative causal variants in haplotype blocks, the prediction of likely cell types of action, and the prediction of candidate target genes by systematic mining of comparative, epigenomic and regulatory annotations. Since first launching the website in 2011, we have greatly expanded HaploReg, increasing the number of chromatin state maps to 127 reference epigenomes from ENCODE 2012 and Roadmap Epigenomics, incorporating regulator binding data, expanding regulatory motif disruption annotations, and integrating expression quantitative trait locus (eQTL) variants and their tissue-specific target genes from GTEx, Geuvadis, and other recent studies. We present these updates as HaploReg v4, and illustrate a use case of HaploReg for attention deficit hyperactivity disorder (ADHD)-associated SNPs with putative brain regulatory mechanisms. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.
Peace, Cameron; Bassil, Nahla; Main, Dorrie; Ficklin, Stephen; Rosyara, Umesh R.; Stegmeir, Travis; Sebolt, Audrey; Gilmore, Barbara; Lawley, Cindy; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Iezzoni, Amy
2012-01-01
High-throughput genome scans are important tools for genetic studies and breeding applications. Here, a 6K SNP array for use with the Illumina Infinium® system was developed for diploid sweet cherry (Prunus avium) and allotetraploid sour cherry (P. cerasus). This effort was led by RosBREED, a community initiative to enable marker-assisted breeding for rosaceous crops. Next-generation sequencing in diverse breeding germplasm provided 25 billion basepairs (Gb) of cherry DNA sequence from which were identified genome-wide SNPs for sweet cherry and for the two sour cherry subgenomes derived from sweet cherry (avium subgenome) and P. fruticosa (fruticosa subgenome). Anchoring to the peach genome sequence, recently released by the International Peach Genome Initiative, predicted relative physical locations of the 1.9 million putative SNPs detected, preliminarily filtered to 368,943 SNPs. Further filtering was guided by results of a 144-SNP subset examined with the Illumina GoldenGate® assay on 160 accessions. A 6K Infinium® II array was designed with SNPs evenly spaced genetically across the sweet and sour cherry genomes. SNPs were developed for each sour cherry subgenome by using minor allele frequency in the sour cherry detection panel to enrich for subgenome-specific SNPs followed by targeting to either subgenome according to alleles observed in sweet cherry. The array was evaluated using panels of sweet (n = 269) and sour (n = 330) cherry breeding germplasm. Approximately one third of array SNPs were informative for each crop. A total of 1825 polymorphic SNPs were verified in sweet cherry, 13% of these originally developed for sour cherry. Allele dosage was resolved for 2058 polymorphic SNPs in sour cherry, one third of these being originally developed for sweet cherry. This publicly available genomics resource represents a significant advance in cherry genome-scanning capability that will accelerate marker-locus-trait association discovery, genome structure investigation, and genetic diversity assessment in this diploid-tetraploid crop group. PMID:23284615
Bashalkhanov, Stanislav; Eckert, Andrew J; Rajora, Om P
2013-12-01
One of the most important drivers of local adaptation for forest trees is climate. Coupled to these patterns, however, are human-induced disturbances through habitat modification and pollution. The confounded effects of climate and disturbance have rarely been investigated with regard to selective pressure on forest trees. Here, we have developed and used a population genetic approach to search for signals of selection within a set of 36 candidate genes chosen for their putative effects on adaptation to climate and human-induced air pollution within five populations of red spruce (Picea rubens Sarg.), distributed across its natural range and air pollution gradient in eastern North America. Specifically, we used FST outlier and environmental correlation analyses to highlight a set of seven single nucleotide polymorphisms (SNPs) that were overly correlated with climate and levels of sulphate pollution after correcting for the confounding effects of population history. Use of three age cohorts within each population allowed the effects of climate and pollution to be separated temporally, as climate-related SNPs (n = 7) showed the strongest signals in the oldest cohort, while pollution-related SNPs (n = 3) showed the strongest signals in the youngest cohorts. These results highlight the usefulness of population genetic scans for the identification of putatively nonneutral evolution within genomes of nonmodel forest tree species, but also highlight the need for the development and application of robust methodologies to deal with the inherent multivariate nature of the genetic and ecological data used in these types of analyses. © 2013 John Wiley & Sons Ltd.
Bipartite Community Structure of eQTLs.
Platig, John; Castaldi, Peter J; DeMeo, Dawn; Quackenbush, John
2016-09-01
Genome Wide Association Studies (GWAS) and expression quantitative trait locus (eQTL) analyses have identified genetic associations with a wide range of human phenotypes. However, many of these variants have weak effects and understanding their combined effect remains a challenge. One hypothesis is that multiple SNPs interact in complex networks to influence functional processes that ultimately lead to complex phenotypes, including disease states. Here we present CONDOR, a method that represents both cis- and trans-acting SNPs and the genes with which they are associated as a bipartite graph and then uses the modular structure of that graph to place SNPs into a functional context. In applying CONDOR to eQTLs in chronic obstructive pulmonary disease (COPD), we found the global network "hub" SNPs were devoid of disease associations through GWAS. However, the network was organized into 52 communities of SNPs and genes, many of which were enriched for genes in specific functional classes. We identified local hubs within each community ("core SNPs") and these were enriched for GWAS SNPs for COPD and many other diseases. These results speak to our intuition: rather than single SNPs influencing single genes, we see groups of SNPs associated with the expression of families of functionally related genes and that disease SNPs are associated with the perturbation of those functions. These methods are not limited in their application to COPD and can be used in the analysis of a wide variety of disease processes and other phenotypic traits.
High-throughput gene mapping in Caenorhabditis elegans.
Swan, Kathryn A; Curtis, Damian E; McKusick, Kathleen B; Voinov, Alexander V; Mapa, Felipa A; Cancilla, Michael R
2002-07-01
Positional cloning of mutations in model genetic systems is a powerful method for the identification of targets of medical and agricultural importance. To facilitate the high-throughput mapping of mutations in Caenorhabditis elegans, we have identified a further 9602 putative new single nucleotide polymorphisms (SNPs) between two C. elegans strains, Bristol N2 and the Hawaiian mapping strain CB4856, by sequencing inserts from a CB4856 genomic DNA library and using an informatics pipeline to compare sequences with the canonical N2 genomic sequence. When combined with data from other laboratories, our marker set of 17,189 SNPs provides even coverage of the complete worm genome. To date, we have confirmed >1099 evenly spaced SNPs (one every 91 +/- 56 kb) across the six chromosomes and validated the utility of our SNP marker set and new fluorescence polarization-based genotyping methods for systematic and high-throughput identification of genes in C. elegans by cloning several proprietary genes. We illustrate our approach by recombination mapping and confirmation of the mutation in the cloned gene, dpy-18.
Lee, Younghee; Han, Seonggyun; Kim, Dongwook; Kim, Dokyoon; Horgousluoglu, Emrin; Risacher, Shannon L; Saykin, Andrew J; Nho, Kwangsik
2018-01-01
Genetic variation in cis-regulatory elements related to splicing machinery and splicing regulatory elements (SREs) results in exon skipping and undesired protein products. We developed a splicing decision model to identify actionable loci among common SNPs for gene regulation. The splicing decision model identified SNPs affecting exon skipping by analyzing sequence-driven alternative splicing (AS) models and by scanning the genome for the regions with putative SRE motifs. We used non-Hispanic Caucasians with neuroimaging, and fluid biomarkers for Alzheimer's disease (AD) and identified 17,088 common exonic SNPs affecting exon skipping. GWAS identified one SNP (rs1140317) in HLA-DQB1 as significantly associated with entorhinal cortical thickness, AD neuroimaging biomarker, after controlling for multiple testing. Further analysis revealed that rs1140317 was significantly associated with brain amyloid-f deposition (PET and CSF). HLA-DQB1 is an essential immune gene and may regulate AS, thereby contributing to AD pathology. SRE may hold potential as novel therapeutic targets for AD.
Gu, Wanjun; Gurguis, Christopher I.; Zhou, Jin J.; Zhu, Yihua; Ko, Eun-A.; Ko, Jae-Hong; Wang, Ting; Zhou, Tong
2015-01-01
Genetic variation arising from single nucleotide polymorphisms (SNPs) is ubiquitously found among human populations. While disease-causing variants are known in some cases, identifying functional or causative variants for most human diseases remains a challenging task. Rare SNPs, rather than common ones, are thought to be more important in the pathology of most human diseases. We propose that rare SNPs should be divided into two categories dependent on whether the minor alleles are derived or ancestral. Derived alleles are less likely to have been purified by evolutionary processes and may be more likely to induce deleterious effects. We therefore hypothesized that the rare SNPs with derived minor alleles would be more important for human diseases and predicted that these variants would have larger functional or structural consequences relative to the rare variants for which the minor alleles are ancestral. We systematically investigated the consequences of the exonic SNPs on protein function, mRNA structure, and translation. We found that the functional and structural consequences are more significant for the rare exonic variants for which the minor alleles are derived. However, this pattern is reversed when the minor alleles are ancestral. Thus, the rare exonic SNPs with derived minor alleles are more likely to be deleterious. Age estimation of rare SNPs confirms that these potentially deleterious SNPs are recently evolved in the human population. These results have important implications for understanding the function of genetic variations in human exonic regions and for prioritizing functional SNPs in genome-wide association studies of human diseases. PMID:26454016
Dudley, Joel T.; Chen, Rong; Sanderford, Maxwell; Butte, Atul J.; Kumar, Sudhir
2012-01-01
Genome-wide disease association studies contrast genetic variation between disease cohorts and healthy populations to discover single nucleotide polymorphisms (SNPs) and other genetic markers revealing underlying genetic architectures of human diseases. Despite scores of efforts over the past decade, many reproducible genetic variants that explain substantial proportions of the heritable risk of common human diseases remain undiscovered. We have conducted a multispecies genomic analysis of 5,831 putative human risk variants for more than 230 disease phenotypes reported in 2,021 studies. We find that the current approaches show a propensity for discovering disease-associated SNPs (dSNPs) at conserved genomic positions because the effect size (odds ratio) and allelic P value of genetic association of an SNP relates strongly to the evolutionary conservation of their genomic position. We propose a new measure for ranking SNPs that integrates evolutionary conservation scores and the P value (E-rank). Using published data from a large case-control study, we demonstrate that E-rank method prioritizes SNPs with a greater likelihood of bona fide and reproducible genetic disease associations, many of which may explain greater proportions of genetic variance. Therefore, long-term evolutionary histories of genomic positions offer key practical utility in reassessing data from existing disease association studies, and in the design and analysis of future studies aimed at revealing the genetic basis of common human diseases. PMID:22389448
Genetic polymorphisms associated with increased risk of developing chronic myelogenous leukemia
Bruzzoni-Giovanelli, Heriberto; González, Juan R.; Sigaux, François; Villoutreix, Bruno O.; Cayuela, Jean Michel; Guilhot, Joëlle; Preudhomme, Claude; Guilhot, François; Poyet, Jean-Luc; Rousselot, Philippe
2015-01-01
Little is known about inherited factors associated with the risk of developing chronic myelogenous leukemia (CML). We used a dedicated DNA chip containing 16 561 single nucleotide polymorphisms (SNPs) covering 1 916 candidate genes to analyze 437 CML patients and 1 144 healthy control individuals. Single SNP association analysis identified 139 SNPs that passed multiple comparisons (1% false discovery rate). The HDAC9, AVEN, SEMA3C, IKBKB, GSTA3, RIPK1 and FGF2 genes were each represented by three SNPs, the PSM family by four SNPs and the SLC15A1 gene by six. Haplotype analysis showed that certain combinations of rare alleles of these genes increased the risk of developing CML by more than two or three-fold. A classification tree model identified five SNPs belonging to the genes PSMB10, TNFRSF10D, PSMB2, PPARD and CYP26B1, which were associated with CML predisposition. A CML-risk-allele score was created using these five SNPs. This score was accurate for discriminating CML status (AUC: 0.61, 95%CI: 0.58–0.64). Interestingly, the score was associated with age at diagnosis and the average number of risk alleles was significantly higher in younger patients. The risk-allele score showed the same distribution in the general population (HapMap CEU samples) as in our control individuals and was associated with differential gene expression patterns of two genes (VAPA and TDRKH). In conclusion, we describe haplotypes and a genetic score that are significantly associated with a predisposition to develop CML. The SNPs identified will also serve to drive fundamental research on the putative role of these genes in CML development. PMID:26474455
Single Nucleotide Polymorphism Discovery in Bovine Pituitary Gland Using RNA-Seq Technology
Pareek, Chandra Shekhar; Smoczyński, Rafał; Kadarmideen, Haja N.; Dziuba, Piotr; Błaszczyk, Paweł; Sikora, Marcin; Walendzik, Paulina; Grzybowski, Tomasz; Pierzchała, Mariusz; Horbańczuk, Jarosław; Szostak, Agnieszka; Ogluszka, Magdalena; Zwierzchowski, Lech; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Wąsowicz, Krzysztof; Gelfand, Brian; Feng, Yaping; Kumar, Dibyendu
2016-01-01
Examination of bovine pituitary gland transcriptome by strand-specific RNA-seq allows detection of putative single nucleotide polymorphisms (SNPs) within potential candidate genes (CGs) or QTLs regions as well as to understand the genomics variations that contribute to economic trait. Here we report a breed-specific model to successfully perform the detection of SNPs in the pituitary gland of young growing bulls representing Polish Holstein-Friesian (HF), Polish Red, and Hereford breeds at three developmental ages viz., six months, nine months, and twelve months. A total of 18 bovine pituitary gland polyA transcriptome libraries were prepared and sequenced using the Illumina NextSeq 500 platform. Sequenced FastQ databases of all 18 young bulls were submitted to NCBI-SRA database with NCBI-SRA accession numbers SRS1296732. For the investigated young bulls, a total of 113,882,3098 raw paired-end reads with a length of 156 bases were obtained, resulting in an approximately 63 million paired-end reads per library. Breed-wise, a total of 515.38, 215.39, and 408.04 million paired-end reads were obtained for Polish HF, Polish Red, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed 93.04%, 94.39%, and 83.46% of the mapped sequencing reads were properly paired to the Polish HF, Polish Red, and Hereford breeds, respectively. Constructed breed-specific SNP-db of three cattle breeds yielded at 13,775,885 SNPs. On an average 765,326 breed-specific SNPs per young bull were identified. Using two stringent filtering parameters, i.e., a minimum 10 SNP reads per base with an accuracy ≥ 90% and a minimum 10 SNP reads per base with an accuracy = 100%, SNP-db records were trimmed to construct a highly reliable SNP-db. This resulted in a reduction of 95,7% and 96,4% cut-off mark of constructed raw SNP-db. Finally, SNP discoveries using RNA-Seq data were validated by KASP™ SNP genotyping assay. The comprehensive QTLs/CGs analysis of 76 QTLs/CGs with RNA-seq data identified KCNIP4, CCSER1, DPP6, MAP3K5 and GHR CGs with highest SNPs hit loci in all three breeds and developmental ages. However, CAST CG with more than 100 SNPs hits were observed only in Polish HF and Hereford breeds.These findings are important for identification and construction of novel tissue specific SNP-db and breed specific SNP-db dataset by screening of putative SNPs according to QTL db and candidate genes for bovine growth and reproduction traits, one can develop genomic selection strategies for growth and reproductive traits. PMID:27606429
Single Nucleotide Polymorphism Discovery in Bovine Pituitary Gland Using RNA-Seq Technology.
Pareek, Chandra Shekhar; Smoczyński, Rafał; Kadarmideen, Haja N; Dziuba, Piotr; Błaszczyk, Paweł; Sikora, Marcin; Walendzik, Paulina; Grzybowski, Tomasz; Pierzchała, Mariusz; Horbańczuk, Jarosław; Szostak, Agnieszka; Ogluszka, Magdalena; Zwierzchowski, Lech; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Wąsowicz, Krzysztof; Gelfand, Brian; Feng, Yaping; Kumar, Dibyendu
2016-01-01
Examination of bovine pituitary gland transcriptome by strand-specific RNA-seq allows detection of putative single nucleotide polymorphisms (SNPs) within potential candidate genes (CGs) or QTLs regions as well as to understand the genomics variations that contribute to economic trait. Here we report a breed-specific model to successfully perform the detection of SNPs in the pituitary gland of young growing bulls representing Polish Holstein-Friesian (HF), Polish Red, and Hereford breeds at three developmental ages viz., six months, nine months, and twelve months. A total of 18 bovine pituitary gland polyA transcriptome libraries were prepared and sequenced using the Illumina NextSeq 500 platform. Sequenced FastQ databases of all 18 young bulls were submitted to NCBI-SRA database with NCBI-SRA accession numbers SRS1296732. For the investigated young bulls, a total of 113,882,3098 raw paired-end reads with a length of 156 bases were obtained, resulting in an approximately 63 million paired-end reads per library. Breed-wise, a total of 515.38, 215.39, and 408.04 million paired-end reads were obtained for Polish HF, Polish Red, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed 93.04%, 94.39%, and 83.46% of the mapped sequencing reads were properly paired to the Polish HF, Polish Red, and Hereford breeds, respectively. Constructed breed-specific SNP-db of three cattle breeds yielded at 13,775,885 SNPs. On an average 765,326 breed-specific SNPs per young bull were identified. Using two stringent filtering parameters, i.e., a minimum 10 SNP reads per base with an accuracy ≥ 90% and a minimum 10 SNP reads per base with an accuracy = 100%, SNP-db records were trimmed to construct a highly reliable SNP-db. This resulted in a reduction of 95,7% and 96,4% cut-off mark of constructed raw SNP-db. Finally, SNP discoveries using RNA-Seq data were validated by KASP™ SNP genotyping assay. The comprehensive QTLs/CGs analysis of 76 QTLs/CGs with RNA-seq data identified KCNIP4, CCSER1, DPP6, MAP3K5 and GHR CGs with highest SNPs hit loci in all three breeds and developmental ages. However, CAST CG with more than 100 SNPs hits were observed only in Polish HF and Hereford breeds.These findings are important for identification and construction of novel tissue specific SNP-db and breed specific SNP-db dataset by screening of putative SNPs according to QTL db and candidate genes for bovine growth and reproduction traits, one can develop genomic selection strategies for growth and reproductive traits.
A genome-wide 20 K citrus microarray for gene expression analysis
Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose
2008-01-01
Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with a high genome coverage. In the last years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA microarray that include 21,081 putative unigenes of citrus. As a functional companion to the microarray, a web-browsable database [1] was created and populated with information about the unigenes represented in the microarray, including cDNA libraries, isolated clones, raw and processed nucleotide and protein sequences, and results of all the structural and functional annotation of the unigenes, like general description, BLAST hits, putative Arabidopsis orthologs, microsatellites, putative SNPs, GO classification and PFAM domains. We have performed a Gene Ontology comparison with the full set of Arabidopsis proteins to estimate the genome coverage of the microarray. We have also performed microarray hybridizations to check its usability. Conclusion This new cDNA microarray replaces the first 7K microarray generated two years ago and allows gene expression analysis at a more global scale. We have followed a rational design to minimize cross-hybridization while maintaining its utility for different citrus species. Furthermore, we also provide access to a website with full structural and functional annotation of the unigenes represented in the microarray, along with the ability to use this site to directly perform gene expression analysis using standard tools at different publicly available servers. Furthermore, we show how this microarray offers a good representation of the citrus genome and present the usefulness of this genomic tool for global studies in citrus by using it to catalogue genes expressed in citrus globular embryos. PMID:18598343
Gao, Guangtu; Nome, Torfinn; Pearse, Devon E; Moen, Thomas; Naish, Kerry A; Thorgaard, Gary H; Lien, Sigbjørn; Palti, Yniv
2018-01-01
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout ( Oncorhynchus mykiss ), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup , followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity within each population. We also provide functional annotation based on the genome position of each SNP and evaluate the use of clonal lines for filtering of PSVs and MSVs. These SNPs form a new database, which provides an important resource for a new high density SNP array design and for other SNP genotyping platforms used for genetic and genomics studies of this iconic salmonid fish species.
Xu, Meixiang; Cross, Courtney E; Speidel, Jordan T; Abdel-Rahman, Sherif Z
2016-10-01
The O 6 -methylguanine-DNA methyltransferase (MGMT) protein removes O 6 -alkyl-guanine adducts from DNA. MGMT expression can thus alter the sensitivity of cells and tissues to environmental and chemotherapeutic alkylating agents. Previously, we defined the haplotype structure encompassing single nucleotide polymorphisms (SNPs) in the MGMT promoter/enhancer (P/E) region and found that haplotypes, rather than individual SNPs, alter MGMT promoter activity. The exact mechanism(s) by which these haplotypes exert their effect on MGMT promoter activity is currently unknown, but we noted that many of the SNPs comprising the MGMT P/E haplotypes are located within or in close proximity to putative transcription factor binding sites. Thus, these haplotypes could potentially affect transcription factor binding and, subsequently, alter MGMT promoter activity. In this study, we test the hypothesis that MGMT P/E haplotypes affect MGMT promoter activity by altering transcription factor (TF) binding to the P/E region. We used a promoter binding TF profiling array and a reporter assay to evaluate the effect of different P/E haplotypes on TF binding and MGMT expression, respectively. Our data revealed a significant difference in TF binding profiles between the different haplotypes evaluated. We identified TFs that consistently showed significant haplotype-dependent binding alterations (p ≤ 0.01) and revealed their role in regulating MGMT expression using siRNAs and a dual-luciferase reporter assay system. The data generated support our hypothesis that promoter haplotypes alter the binding of TFs to the MGMT P/E and, subsequently, affect their regulatory function on MGMT promoter activity and expression level.
Tsai, Hsin Y; Robledo, Diego; Lowe, Natalie R; Bekaert, Michael; Taggart, John B; Bron, James E; Houston, Ross D
2016-07-07
High density linkage maps are useful tools for fine-scale mapping of quantitative trait loci, and characterization of the recombination landscape of a species' genome. Genomic resources for Atlantic salmon (Salmo salar) include a well-assembled reference genome, and high density single nucleotide polymorphism (SNP) arrays. Our aim was to create a high density linkage map, and to align it with the reference genome assembly. Over 96,000 SNPs were mapped and ordered on the 29 salmon linkage groups using a pedigreed population comprising 622 fish from 60 nuclear families, all genotyped with the 'ssalar01' high density SNP array. The number of SNPs per group showed a high positive correlation with physical chromosome length (r = 0.95). While the order of markers on the genetic and physical maps was generally consistent, areas of discrepancy were identified. Approximately 6.5% of the previously unmapped reference genome sequence was assigned to chromosomes using the linkage map. Male recombination rate was lower than females across the vast majority of the genome, but with a notable peak in subtelomeric regions. Finally, using RNA-Seq data to annotate the reference genome, the mapped SNPs were categorized according to their predicted function, including annotation of ∼2500 putative nonsynonymous variants. The highest density SNP linkage map for any salmonid species has been created, annotated, and integrated with the Atlantic salmon reference genome assembly. This map highlights the marked heterochiasmy of salmon, and provides a useful resource for salmonid genetics and genomics research. Copyright © 2016 Tsai et al.
Valdisser, Paula Arielle M R; Pappas, Georgios J; de Menezes, Ivandilson P P; Müller, Bárbara S F; Pereira, Wendell J; Narciso, Marcelo G; Brondani, Claudio; Souza, Thiago L P O; Borba, Tereza C O; Vianello, Rosana P
2016-06-01
Researchers have made great advances into the development and application of genomic approaches for common beans, creating opportunities to driving more real and applicable strategies for sustainable management of the genetic resource towards plant breeding. This work provides useful polymorphic single-nucleotide polymorphisms (SNPs) for high-throughput common bean genotyping developed by RAD (restriction site-associated DNA) sequencing. The RAD tags were generated from DNA pooled from 12 common bean genotypes, including breeding lines of different gene pools and market classes. The aligned sequences identified 23,748 putative RAD-SNPs, of which 3357 were adequate for genotyping; 1032 RAD-SNPs with the highest ADT (assay design tool) score are presented in this article. The RAD-SNPs were structurally annotated in different coding (47.00 %) and non-coding (53.00 %) sequence components of genes. A subset of 384 RAD-SNPs with broad genome distribution was used to genotype a diverse panel of 95 common bean germplasms and revealed a successful amplification rate of 96.6 %, showing 73 % of polymorphic SNPs within the Andean group and 83 % in the Mesoamerican group. A slightly increased He (0.161, n = 21) value was estimated for the Andean gene pool, compared to the Mesoamerican group (0.156, n = 74). For the linkage disequilibrium (LD) analysis, from a group of 580 SNPs (289 RAD-SNPs and 291 BARC-SNPs) genotyped for the same set of genotypes, 70.2 % were in LD, decreasing to 0.10 %in the Andean group and 0.77 % in the Mesoamerican group. Haplotype patterns spanning 310 Mb of the genome (60 %) were characterized in samples from different origins. However, the haplotype frameworks were under-represented for the Andean (7.85 %) and Mesoamerican (5.55 %) gene pools separately. In conclusion, RAD sequencing allowed the discovery of hundreds of useful SNPs for broad genetic analysis of common bean germplasm. From now, this approach provides an excellent panel of molecular tools for whole genome analysis, allowing integrating and better exploring the common bean breeding practices.
2012-01-01
Background Molecular breeding of pepper (Capsicum spp.) can be accelerated by developing DNA markers associated with transcriptomes in breeding germplasm. Before the advent of next generation sequencing (NGS) technologies, the majority of sequencing data were generated by the Sanger sequencing method. By leveraging Sanger EST data, we have generated a wealth of genetic information for pepper including thousands of SNPs and Single Position Polymorphic (SPP) markers. To complement and enhance these resources, we applied NGS to three pepper genotypes: Maor, Early Jalapeño and Criollo de Morelos-334 (CM334) to identify SNPs and SSRs in the assembly of these three genotypes. Results Two pepper transcriptome assemblies were developed with different purposes. The first reference sequence, assembled by CAP3 software, comprises 31,196 contigs from >125,000 Sanger-EST sequences that were mainly derived from a Korean F1-hybrid line, Bukang. Overlapping probes were designed for 30,815 unigenes to construct a pepper Affymetrix GeneChip® microarray for whole genome analyses. In addition, custom Python scripts were used to identify 4,236 SNPs in contigs of the assembly. A total of 2,489 simple sequence repeats (SSRs) were identified from the assembly, and primers were designed for the SSRs. Annotation of contigs using Blast2GO software resulted in information for 60% of the unigenes in the assembly. The second transcriptome assembly was constructed from more than 200 million Illumina Genome Analyzer II reads (80–120 nt) using a combination of Velvet, CLC workbench and CAP3 software packages. BWA, SAMtools and in-house Perl scripts were used to identify SNPs among three pepper genotypes. The SNPs were filtered to be at least 50 bp from any intron-exon junctions as well as flanking SNPs. More than 22,000 high-quality putative SNPs were identified. Using the MISA software, 10,398 SSR markers were also identified within the Illumina transcriptome assembly and primers were designed for the identified markers. The assembly was annotated by Blast2GO and 14,740 (12%) of annotated contigs were associated with functional proteins. Conclusions Before availability of pepper genome sequence, assembling transcriptomes of this economically important crop was required to generate thousands of high-quality molecular markers that could be used in breeding programs. In order to have a better understanding of the assembled sequences and to identify candidate genes underlying QTLs, we annotated the contigs of Sanger-EST and Illumina transcriptome assemblies. These and other information have been curated in a database that we have dedicated for pepper project. PMID:23110314
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.
Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D
2013-02-28
Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation
2013-01-01
Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change. PMID:23445355
Batra, Jyotsna; Lose, Felicity; O'Mara, Tracy; Marquart, Louise; Stephens, Carson; Alexander, Kimberly; Srinivasan, Srilakshmi; Eeles, Rosalind A.; Easton, Douglas F.; Olama, Ali Amin Al; Kote-Jarai, Zsofia; Guy, Michelle; Muir, Kenneth; Lophatananon, Artitaya; Rahman, Aneela A.; Neal, David E.; Hamdy, Freddie C.; Donovan, Jenny L.; Chambers, Suzanne; Gardiner, Robert A.; Aitken, Joanne; Yaxley, John; Kedda, Mary-Anne
2011-01-01
Background Kallikrein 15 (KLK15)/Prostinogen is a plausible candidate for prostate cancer susceptibility. Elevated KLK15 expression has been reported in prostate cancer and it has been described as an unfavorable prognostic marker for the disease. Objectives We performed a comprehensive analysis of association of variants in the KLK15 gene with prostate cancer risk and aggressiveness by genotyping tagSNPs, as well as putative functional SNPs identified by extensive bioinformatics analysis. Methods and Data Sources Twelve out of 22 SNPs, selected on the basis of linkage disequilibrium pattern, were analyzed in an Australian sample of 1,011 histologically verified prostate cancer cases and 1,405 ethnically matched controls. Replication was sought from two existing genome wide association studies (GWAS): the Cancer Genetic Markers of Susceptibility (CGEMS) project and a UK GWAS study. Results Two KLK15 SNPs, rs2659053 and rs3745522, showed evidence of association (p<0.05) but were not present on the GWAS platforms. KLK15 SNP rs2659056 was found to be associated with prostate cancer aggressiveness and showed evidence of association in a replication cohort of 5,051 patients from the UK, Australia, and the CGEMS dataset of US samples. A highly significant association with Gleason score was observed when the data was combined from these three studies with an Odds Ratio (OR) of 0.85 (95% CI = 0.77–0.93; p = 2.7×10−4). The rs2659056 SNP is predicted to alter binding of the RORalpha transcription factor, which has a role in the control of cell growth and differentiation and has been suggested to control the metastatic behavior of prostate cancer cells. Conclusions Our findings suggest a role for KLK15 genetic variation in the etiology of prostate cancer among men of European ancestry, although further studies in very large sample sets are necessary to confirm effect sizes. PMID:22132073
Screening and Evaluation of Deleterious SNPs in APOE Gene of Alzheimer's Disease.
Masoodi, Tariq Ahmad; Al Shammari, Sulaiman A; Al-Muammar, May N; Alhamdan, Adel A
2012-01-01
Introduction. Apolipoprotein E (APOE) is an important risk factor for Alzheimer's disease (AD) and is present in 30-50% of patients who develop late-onset AD. Several single-nucleotide polymorphisms (SNPs) are present in APOE gene which act as the biomarkers for exploring the genetic basis of this disease. The objective of this study is to identify deleterious nsSNPs associated with APOE gene. Methods. The SNPs were retrieved from dbSNP. Using I-Mutant, protein stability change was calculated. The potentially functional nonsynonymous (ns) SNPs and their effect on protein was predicted by PolyPhen and SIFT, respectively. FASTSNP was used for functional analysis and estimation of risk score. The functional impact on the APOE protein was evaluated by using Swiss PDB viewer and NOMAD-Ref server. Results. Six nsSNPs were found to be least stable by I-Mutant 2.0 with DDG value of >-1.0. Four nsSNPs showed a highly deleterious tolerance index score of 0.00. Nine nsSNPs were found to be probably damaging with position-specific independent counts (PSICs) score of ≥2.0. Seven nsSNPs were found to be highly polymorphic with a risk score of 3-4. The total energies and root-mean-square deviation (RMSD) values were higher for three mutant-type structures compared to the native modeled structure. Conclusion. We concluded that three nsSNPs, namely, rs11542041, rs11542040, and rs11542034, to be potentially functional polymorphic.
Margam, Venu M.; Coates, Brad S.; Bayles, Darrell O.; Hellmich, Richard L.; Agunbiade, Tolulope; Seufferheld, Manfredo J.; Sun, Weilin; Kroemer, Jeremy A.; Ba, Malick N.; Binso-Dabire, Clementine L.; Baoua, Ibrahim; Ishiyaku, Mohammad F.; Covas, Fernando G.; Srinivasan, Ramasamy; Armstrong, Joel; Murdock, Larry L.; Pittendrigh, Barry R.
2011-01-01
The legume pod borer, Maruca vitrata (Lepidoptera: Crambidae), is an insect pest species of crops grown by subsistence farmers in tropical regions of Africa. We present the de novo assembly of 3729 contigs from 454- and Sanger-derived sequencing reads for midgut, salivary, and whole adult tissues of this non-model species. Functional annotation predicted that 1320 M. vitrata protein coding genes are present, of which 631 have orthologs within the Bombyx mori gene model. A homology-based analysis assigned M. vitrata genes into a group of paralogs, but these were subsequently partitioned into putative orthologs following phylogenetic analyses. Following sequence quality filtering, a total of 1542 putative single nucleotide polymorphisms (SNPs) were predicted within M. vitrata contig assemblies. Seventy one of 1078 designed molecular genetic markers were used to screen M. vitrata samples from five collection sites in West Africa. Population substructure may be present with significant implications in the insect resistance management recommendations pertaining to the release of biological control agents or transgenic cowpea that express Bacillus thuringiensis crystal toxins. Mutation data derived from transcriptome sequencing is an expeditious and economical source for genetic markers that allow evaluation of ecological differentiation. PMID:21754987
Computational Methods to Work as First-Pass Filter in Deleterious SNP Analysis of Alkaptonuria
Magesh, R.; George Priya Doss, C.
2012-01-01
A major challenge in the analysis of human genetic variation is to distinguish functional from nonfunctional SNPs. Discovering these functional SNPs is one of the main goals of modern genetics and genomics studies. There is a need to effectively and efficiently identify functionally important nsSNPs which may be deleterious or disease causing and to identify their molecular effects. The prediction of phenotype of nsSNPs by computational analysis may provide a good way to explore the function of nsSNPs and its relationship with susceptibility to disease. In this context, we surveyed and compared variation databases along with in silico prediction programs to assess the effects of deleterious functional variants on protein functions. In other respects, we attempted these methods to work as first-pass filter to identify the deleterious substitutions worth pursuing for further experimental research. In this analysis, we used the existing computational methods to explore the mutation-structure-function relationship in HGD gene causing alkaptonuria. PMID:22606059
Alsaif, Mohammed A.; Al Shammari, Sulaiman A.; Alhamdan, Adel A.
2012-01-01
Introduction Single-nucleotide polymorphisms (SNPs) are biomarkers for exploring the genetic basis of many complex human diseases. The prediction of SNPs is promising in modern genetic analysis but it is still a great challenge to identify the functional SNPs in a disease-related gene. The computational approach has overcome this challenge and an increase in the successful rate of genetic association studies and reduced cost of genotyping have been achieved. The objective of this study is to identify deleterious non-synonymous SNPs (nsSNPs) associated with the COL1A1 gene. Material and methods The SNPs were retrieved from the Single Nucleotide Polymorphism Database (dbSNP). Using I-Mutant, protein stability change was calculated. The potentially functional nsSNPs and their effect on proteins were predicted by PolyPhen and SIFT respectively. FASTSNP was used for estimation of risk score. Results Our analysis revealed 247 SNPs as non-synonymous, out of which 5 nsSNPs were found to be least stable by I-Mutant 2.0 with a DDG value of > –1.0. Four nsSNPs, namely rs17853657, rs17857117, rs57377812 and rs1059454, showed a highly deleterious tolerance index score of 0.00 with a change in their physicochemical properties by the SIFT server. Seven nsSNPs, namely rs1059454, rs8179178, rs17853657, rs17857117, rs72656340, rs72656344 and rs72656351, were found to be probably damaging with a PSIC score difference between 2.0 and 3.5 by the PolyPhen server. Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, were found to be highly polymorphic with a risk score of 3-4 with a possible effect of non-conservative change and splicing regulation by FASTSNP. Conclusions Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, are potential functional polymorphisms that are likely to have a functional impact on the COL1A1 gene. PMID:24273577
CerealsDB 3.0: expansion of resources and data integration.
Wilkinson, Paul A; Winfield, Mark O; Barker, Gary L A; Tyrrell, Simon; Bian, Xingdong; Allen, Alexandra M; Burridge, Amanda; Coghill, Jane A; Waterfall, Christy; Caccamo, Mario; Davey, Robert P; Edwards, Keith J
2016-06-24
The increase in human populations around the world has put pressure on resources, and as a consequence food security has become an important challenge for the 21st century. Wheat (Triticum aestivum) is one of the most important crops in human and livestock diets, and the development of wheat varieties that produce higher yields, combined with increased resistance to pests and resilience to changes in climate, has meant that wheat breeding has become an important focus of scientific research. In an attempt to facilitate these improvements in wheat, plant breeders have employed molecular tools to help them identify genes for important agronomic traits that can be bred into new varieties. Modern molecular techniques have ensured that the rapid and inexpensive characterisation of SNP markers and their validation with modern genotyping methods has produced a valuable resource that can be used in marker assisted selection. CerealsDB was created as a means of quickly disseminating this information to breeders and researchers around the globe. CerealsDB version 3.0 is an online resource that contains a wide range of genomic datasets for wheat that will assist plant breeders and scientists to select the most appropriate markers for use in marker assisted selection. CerealsDB includes a database which currently contains in excess of a million putative varietal SNPs, of which several hundreds of thousands have been experimentally validated. In addition, CerealsDB also contains new data on functional SNPs predicted to have a major effect on protein function and we have constructed a web service to encourage data integration and high-throughput programmatic access. CerealsDB is an open access website that hosts information on SNPs that are considered useful for both plant breeders and research scientists. The recent inclusion of web services designed to federate genomic data resources allows the information on CerealsDB to be more fully integrated with the WheatIS network and other biological databases.
Ferreira, Pedro Eduardo; Veiga, Maria Isabel; Cavaco, Isa; Martins, J Paulo; Andersson, Björn; Mushin, Shaliya; Ali, Abullah S; Bhattarai, Achuyt; Ribeiro, Vera; Björkman, Anders; Gil, José Pedro
2008-02-01
Artemisinin-based combination therapy is a main strategy for malaria control in Africa. Zanzibar introduced this new treatment policy in 2003. The authors have studied the prevalence of a number of functional single nucleotide polymorphisms (SNPs) in genes associated with the elimination of the artemisinin-based combination therapy compounds in use in Zanzibar to investigate the frequencies of subgroups potentially at higher drug exposure and therefore possible higher risk of toxicity. One hundred three unrelated children with uncomplicated malaria from the Unguja and Pemba islands of Zanzibar were enrolled. With use of polymerase chain reaction (PCR)-restriction fragment length polymorphism and real-time PCR-based allele discrimination methods, the CYP2B6 (G15631T), CYP3A4 (A-392G), CYP3A5 (A6986G, G14690A, 27131-132 insT, C3699T) SNPs and MDR1 SNPs C3435T, G2677T/A, and T-129C were analyzed. PCR product sequencing was applied to regulatory regions of MDR1, the CYP3A4 proximal promoter, and to exons 2 and 5 of PXR, a gene coding for a nuclear factor activated by artemisinin antimalarials and associated with the transcription induction of most of the studied genes. Homozygous subjects for alleles coding for low activity proteins were found at the following frequencies: 1) MDR1: 2.9%; 2) CYP2B6: 9.7%; 3) CYP3A5: 14.1%; and 4) CYP3A4: 49.5%. No functionally relevant allele was found in the analyzed regions of PXR. A new MDR1 SNP was found (T-158C), located in a putative antigen recognition element. Ten (10.1%) subjects were predicted to be low metabolizers simultaneously for CYP3A4 and CYP3A5. This fraction of the population is suggested to be under higher exposure to certain antimalarials, including lumefantrine and quinine.
Development of genetic markers in abalone through construction of a SNP database.
Kang, J-H; Appleyard, S A; Elliott, N G; Jee, Y-J; Lee, J B; Kang, S W; Baek, M K; Han, Y S; Choi, T-J; Lee, Y S
2011-06-01
In the absence of a reference genome, single-nucleotide polymorphisms (SNP) discovery in a group of abalone species was undertaken by random sequence assembly. A web-based interface was constructed, and 11 932 DNA sequences from the genus Haliotis were assembled, with 1321 contigs built. Of these, 118 contigs that consisted of at least ten annotation groups were selected. The 1577 putative SNPs were identified from the 118 contigs, with SNPs in several HSP70 gene contigs confirmed by PCR amplification of an 809-bp DNA fragment. SNPs in the HSP70 gene were compared across eight abalone species. A total of 129 polymorphic sites, including heterozygote sites within and among species, were observed. Phylogenetic analysis of the partial HSP70 gene region showed separation of the tested abalone into two groups, one reflecting the southern hemisphere species and the other the northern hemisphere species. Interestingly, Haliotis iris from New Zealand showed a closer relationship to species distributed in the northern Pacific region. Although HSP genes are known to be highly conserved among taxa, the validation of polymorphic SNPs from HSP70 in this mollusc demonstrates the applicability of cross-species SNP markers in abalone and the first step towards universal nuclear markers in Haliotis. © 2010 NFRDI, Animal Genetics © 2010 Stichting International Foundation for Animal Genetics.
USDA-ARS?s Scientific Manuscript database
Coding/functional SNPs change the biological function of a gene and, therefore, could serve as “large-effect” genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, mus...
Humble, Emily; Thorne, Michael A S; Forcada, Jaume; Hoffman, Joseph I
2016-08-26
Single nucleotide polymorphism (SNP) discovery is an important goal of many studies. However, the number of 'putative' SNPs discovered from a sequence resource may not provide a reliable indication of the number that will successfully validate with a given genotyping technology. For this it may be necessary to account for factors such as the method used for SNP discovery and the type of sequence data from which it originates, suitability of the SNP flanking sequences for probe design, and genomic context. To explore the relative importance of these and other factors, we used Illumina sequencing to augment an existing Roche 454 transcriptome assembly for the Antarctic fur seal (Arctocephalus gazella). We then mapped the raw Illumina reads to the new hybrid transcriptome using BWA and BOWTIE2 before calling SNPs with GATK. The resulting markers were pooled with two existing sets of SNPs called from the original 454 assembly using NEWBLER and SWAP454. Finally, we explored the extent to which SNPs discovered using these four methods overlapped and predicted the corresponding validation outcomes for both Illumina Infinium iSelect HD and Affymetrix Axiom arrays. Collating markers across all discovery methods resulted in a global list of 34,718 SNPs. However, concordance between the methods was surprisingly poor, with only 51.0 % of SNPs being discovered by more than one method and 13.5 % being called from both the 454 and Illumina datasets. Using a predictive modeling approach, we could also show that SNPs called from the Illumina data were on average more likely to successfully validate, as were SNPs called by more than one method. Above and beyond this pattern, predicted validation outcomes were also consistently better for Affymetrix Axiom arrays. Our results suggest that focusing on SNPs called by more than one method could potentially improve validation outcomes. They also highlight possible differences between alternative genotyping technologies that could be explored in future studies of non-model organisms.
Nkhoma, Standwell; Nair, Shalini; Mukaka, Mavuto; Molyneux, Malcolm E; Ward, Stephen A; Anderson, Timothy J C
2009-07-01
We genotyped 160 P. falciparum infections from Malawi for pfmdr-1 copy number changes and SNPs associated with in vivo tolerance and poor in vitro sensitivity to the component drugs of Coartem. We also measured in vitro susceptibility of 49 of these isolates to a variety of drugs in clinical use or with a potential for use in Africa. All 160 infections carried a single copy of pfmdr-1 but 34% exhibited sequence variation at 4 of the 5 polymorphic sites in pfmdr-1. Isolates carrying 86-Asn and 184-Tyr pfmdr-1 alleles were significantly less sensitive (p<0.001) to mefloquine, lumefantrine, artemether and dihydroartemisinin compared with those bearing 86-Tyr and 184-Phe polymorphisms. This study provides baseline measures prior to policy change: continued surveillance for changes in baseline drug susceptibility, pfmdr-1 copy number and SNPs, and other putative Coartem resistance loci will be necessary to provide an early warning of emerging Coartem resistance in this setting.
Moore, Jean-Sébastien; Bourret, Vincent; Dionne, Mélanie; Bradbury, Ian; O'Reilly, Patrick; Kent, Matthew; Chaput, Gérald; Bernatchez, Louis
2014-12-01
Anadromous Atlantic salmon (Salmo salar) is a species of major conservation and management concern in North America, where population abundance has been declining over the past 30 years. Effective conservation actions require the delineation of conservation units to appropriately reflect the spatial scale of intraspecific variation and local adaptation. Towards this goal, we used the most comprehensive genetic and genomic database for Atlantic salmon to date, covering the entire North American range of the species. The database included microsatellite data from 9142 individuals from 149 sampling locations and data from a medium-density SNP array providing genotypes for >3000 SNPs for 50 sampling locations. We used neutral and putatively selected loci to integrate adaptive information in the definition of conservation units. Bayesian clustering with the microsatellite data set and with neutral SNPs identified regional groupings largely consistent with previously published regional assessments. The use of outlier SNPs did not result in major differences in the regional groupings, suggesting that neutral markers can reflect the geographic scale of local adaptation despite not being under selection. We also performed assignment tests to compare power obtained from microsatellites, neutral SNPs and outlier SNPs. Using SNP data substantially improved power compared to microsatellites, and an assignment success of 97% to the population of origin and of 100% to the region of origin was achieved when all SNP loci were used. Using outlier SNPs only resulted in minor improvements to assignment success to the population of origin but improved regional assignment. We discuss the implications of these new genetic resources for the conservation and management of Atlantic salmon in North America. © 2014 John Wiley & Sons Ltd.
FGWAS: Functional genome wide association analysis.
Huang, Chao; Thompson, Paul; Wang, Yalin; Yu, Yang; Zhang, Jingwen; Kong, Dehan; Colen, Rivka R; Knickmeyer, Rebecca C; Zhu, Hongtu
2017-10-01
Functional phenotypes (e.g., subcortical surface representation), which commonly arise in imaging genetic studies, have been used to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. However, existing statistical methods largely ignore the functional features (e.g., functional smoothness and correlation). The aim of this paper is to develop a functional genome-wide association analysis (FGWAS) framework to efficiently carry out whole-genome analyses of functional phenotypes. FGWAS consists of three components: a multivariate varying coefficient model, a global sure independence screening procedure, and a test procedure. Compared with the standard multivariate regression model, the multivariate varying coefficient model explicitly models the functional features of functional phenotypes through the integration of smooth coefficient functions and functional principal component analysis. Statistically, compared with existing methods for genome-wide association studies (GWAS), FGWAS can substantially boost the detection power for discovering important genetic variants influencing brain structure and function. Simulation studies show that FGWAS outperforms existing GWAS methods for searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. We have successfully applied FGWAS to large-scale analysis of data from the Alzheimer's Disease Neuroimaging Initiative for 708 subjects, 30,000 vertices on the left and right hippocampal surfaces, and 501,584 SNPs. Copyright © 2017 Elsevier Inc. All rights reserved.
Liu, Li; Venkatesh, Jelli; Jo, Yeong Deuk; Koeda, Sota; Hosokawa, Munetaka; Kang, Jin-Ho; Goritschnig, Sandra; Kang, Byoung-Cheorl
2016-08-01
The sy - 2 temperature-sensitive gene from Capsicum chinense was fine mapped to a 138.8-kb region at the distal portion of pepper chromosome 1. Based on expression analyses, two putative F-box genes were identified as sy - 2 candidate genes. Seychelles-2 ('sy-2') is a temperature-sensitive natural mutant of Capsicum chinense, which exhibits an abnormal leaf phenotype when grown at temperatures below 24 °C. We previously showed that the sy-2 phenotype is controlled by a single recessive gene, sy-2, located on pepper chromosome 1. In this study, a high-resolution genetic and physical map for the sy-2 locus was constructed using two individual F2 mapping populations derived from a cross between C. chinense mutant 'sy-2' and wild-type 'No. 3341'. The sy-2 gene was fine mapped to a 138.8-kb region between markers SNP 5-5 and SNP 3-8 at the distal portion of chromosome 1, based on comparative genomic analysis and genomic information from pepper. The sy-2 target region was predicted to contain 27 genes. Expression analysis of these predicted genes showed a differential expression pattern for ORF10 and ORF20 between mutant and wild-type plants; with both having significantly lower expression in 'sy-2' than in wild-type plants. In addition, the coding sequences of both ORF10 and ORF20 contained single nucleotide polymorphisms (SNPs) causing amino acid changes, which may have important functional consequences. ORF10 and ORF20 are predicted to encode F-box proteins, which are components of the SCF complex. Based on the differential expression pattern and the presence of nonsynonymous SNPs, we suggest that these two putative F-box genes are most likely responsible for the temperature-sensitive phenotypes in pepper. Further investigation of these genes may enable a better understanding of the molecular mechanisms of low temperature sensitivity in plants.
Kuniholm, Mark H; Liang, Hua; Anastos, Kathryn; Gustafson, Deborah; Kassaye, Seble; Nowicki, Marek; Sha, Beverly E; Pawlowski, Emilia J; Gange, Stephen J; Aouizerat, Bradley E; Pushkarsky, Tatiana; Bukrinsky, Michael I; Prasad, Vinayaka R
2017-11-28
To assess variation in genes that regulate cholesterol metabolism in relation to the natural history of HIV infection. Cross-sectional and longitudinal analysis of the Women's Interagency HIV Study. We examined 2050 single nucleotide polymorphisms (SNPs) in 19 genes known to regulate cholesterol metabolism in relation to HIV viral load and CD4 T-cell levels in a multiracial cohort of 1066 antiretroviral therapy-naive women. Six SNPs were associated with both HIV viral load and CD4 T-cell levels at a false discovery rate of 0.01. Bioinformatics tools did not predict functional activity for five SNPs, located in introns of nuclear receptor corepressor 2, retinoid X receptor alpha (RXRA), and tetratricopeptide repeat domain 39B. Rs17111557 located in the 3' untranslated region of proprotein convertase subtilisin/kexin type 9 (PCSK9) putatively affects binding of hsa-miR-548t-5p and hsa-miR-4796-3p, which could regulate PCSK9 expression levels. Interrogation of rs17111557 revealed stronger associations in the subset of women with HIV/hepatitis C virus (HCV) coinfection (n = 408, 38% of women). Rs17111557 was also associated with low-density lipoprotein cholesterol levels in HIV/HCV coinfected (β: -10.4; 95% confidence interval: -17.9, -2.9; P = 0.007), but not in HIV monoinfected (β:1.2; 95% confidence interval: -6.3, 8.6; P = 0.76) women in adjusted analysis. PCSK9 polymorphism may affect HIV pathogenesis, particularly in HIV/HCV coinfected women. A likely mechanism for this effect is PCSK9-mediated regulation of cholesterol metabolism. Replication in independent cohorts is needed to clarify the generalizability of the observed associations.
Zhang, Xu; Chu, Qin; Guo, Gang; Dong, Ganghui; Li, Xizhi; Zhang, Qin; Zhang, Shengli; Zhang, Zhiwu; Wang, Yachun
2017-01-01
The growth and maturity of cattle body size affect not only feed efficiency, but also productivity and longevity. Dissecting the genetic architecture of body size is critical for cattle breeding to improve both efficiency and productivity. The volume and weight of body size are indicated by several measurements. Among them, Heart Girth (HG) and Hip Height (HH) are the most important traits. They are widely used as predictors of body weight (BW). Few association studies have been conducted for HG and HH in cattle focusing on single growth stage. In this study, we extended the Genome-wide association studies to a full spectrum of four growth stages (6-, 12-, 18-, and 24-months after birth) in Chinese Holstein heifers. The whole genomic single nucleotide polymorphisms (SNPs) were obtained from the Illumina BovineSNP50 v2 BeadChip genotyped on 3,325 individuals. Estimated breeding values (EBVs) were derived for both HG and HH at the four different ages and analyzed separately for GWAS by using the Fixed and random model Circuitous Probability Unification (FarmCPU) method. In total, 27 SNPs were identified to be significantly associated with HG and HH at different growth stages. We found 66 candidate genes located nearby the associated SNPs, including nine genes that were known as highly related to development and skeletal and muscular growth. In addition, biological function analysis was performed by Ingenuity Pathway Analysis and an interaction network related to development was obtained, which contained 16 genes out of the 66 candidates. The set of putative genes provided valuable resources and can help elucidate the genomic architecture and mechanisms underlying growth traits in dairy cattle.
Exome-wide DNA capture and next generation sequencing in domestic and wild species.
Cosart, Ted; Beja-Pereira, Albano; Chen, Shanyuan; Ng, Sarah B; Shendure, Jay; Luikart, Gordon
2011-07-05
Gene-targeted and genome-wide markers are crucial to advance evolutionary biology, agriculture, and biodiversity conservation by improving our understanding of genetic processes underlying adaptation and speciation. Unfortunately, for eukaryotic species with large genomes it remains costly to obtain genome sequences and to develop genome resources such as genome-wide SNPs. A method is needed to allow gene-targeted, next-generation sequencing that is flexible enough to include any gene or number of genes, unlike transcriptome sequencing. Such a method would allow sequencing of many individuals, avoiding ascertainment bias in subsequent population genetic analyses.We demonstrate the usefulness of a recent technology, exon capture, for genome-wide, gene-targeted marker discovery in species with no genome resources. We use coding gene sequences from the domestic cow genome sequence (Bos taurus) to capture (enrich for), and subsequently sequence, thousands of exons of B. taurus, B. indicus, and Bison bison (wild bison). Our capture array has probes for 16,131 exons in 2,570 genes, including 203 candidate genes with known function and of interest for their association with disease and other fitness traits. We successfully sequenced and mapped exon sequences from across the 29 autosomes and X chromosome in the B. taurus genome sequence. Exon capture and high-throughput sequencing identified thousands of putative SNPs spread evenly across all reference chromosomes, in all three individuals, including hundreds of SNPs in our targeted candidate genes. This study shows exon capture can be customized for SNP discovery in many individuals and for non-model species without genomic resources. Our captured exome subset was small enough for affordable next-generation sequencing, and successfully captured exons from a divergent wild species using the domestic cow genome as reference.
Evidence for pleiotropism and recent selection in the PLAG1 region in Australian Beef cattle.
Fortes, M R S; Kemper, K; Sasazaki, S; Reverter, A; Pryce, J E; Barendse, W; Bunch, R; McCulloch, R; Harrison, B; Bolormaa, S; Zhang, Y D; Hawken, R J; Goddard, M E; Lehnert, S A
2013-12-01
A putative functional mutation (rs109231213) near PLAG1 (BTA14) associated with stature was studied in beef cattle. Data from 8199 Bos taurus, Bos indicus and Tropical Composite cattle were used to test the associations between rs109231213 and various phenotypes. Further, 23 496 SNPs located on BTA14 were tested for association with these phenotypes, both independently and fitted together with rs109231213. The C allele of rs109231213 significantly increased hip height, weight, net food intake, age at puberty in males and females and decreased IGF-I concentration in blood and fat depth. When rs109231213 was fitted as a fixed effect in the model, there was an overall reduction in associations between other SNPs and these traits but some SNPs remained associated (P < 10(-4) ). Frequency of the mutant C allele of rs109231213 differed among B. indicus (0.52), B. taurus (0.96) and Tropical Composite (0.68). Most chromosomes carrying the C allele had the same surrounding 10 SNP haplotype, probably because the C allele was introgressed into Brahman from B. taurus cattle. A region of reduced heterozygosity surrounds the C allele; this is small in B. taurus but 20 Mb long in Brahmans, indicating recent and strong selection for the mutant allele. Thus, the C allele appears to mark a mutation that has been selected almost to fixation in the B. taurus breeds studied here and introduced into Brahman cattle during grading up and selected to a frequency of 0.52 despite its negative effects on fertility. © 2013 The Authors, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.
Hafer, Cory; Lin, Ying; Kornblum, John; Lowy, Franklin D.
2012-01-01
Infections with vancomycin-intermediate Staphylococcus aureus (VISA) have been associated with vancomycin treatment failures and poor clinical outcomes. Routine identification of clinical isolates with increased vancomycin MICs remains challenging, and no molecular marker exists to aid in diagnosis of VISA strains. We tested vancomycin susceptibilities by using microscan, Etest, and population analyses in a collection of putative VISA, methicillin-resistant S. aureus, and methicillin-sensitive S. aureus (VSSA) infectious isolates from community- or hospital-associated S. aureus infections (n = 77) and identified 22 VISA and 9 heterogeneous VISA (hVISA) isolates. Sequencing of VISA candidate loci vraS, vraR, yvqF, graR, graS, walR, walK, and rpoB revealed a high diversity of nonsynonymous single-nucleotide polymorphisms (SNPs). For vraS, vraR, yvqF, walK, and rpoB, SNPs were more frequently present in VISA and hVISA than in VSSA isolates, whereas mutations in graR, graS, and walR were exclusively detected in VISA isolates. For each of the individual loci, SNPs were only detected in about half of the VISA isolates. All but one VISA isolate had at least one SNP in any of the genes sequenced, and isolates with an MIC of 6 or 8 μg/ml harbored at least 2 SNPs. Overall, increasing vancomycin MICs were paralleled by a higher proportion of isolates with SNPs. Depending on the clonal background, SNPs appeared to preferentially accumulate in vraS and vraR for sequence type 8 (ST8) and in walK and walR for ST5 isolates. Taken together, by comparing VISA, hVISA, and VSSA controls, we observed preferential clustering of SNPs in VISA candidate genes, with an unexpectedly high diversity across these loci. Our results support a polygenetic etiology of VISA. PMID:22948864
Machiela, Mitchell J; Chanock, Stephen J
2015-11-01
Assessing linkage disequilibrium (LD) across ancestral populations is a powerful approach for investigating population-specific genetic structure as well as functionally mapping regions of disease susceptibility. Here, we present LDlink, a web-based collection of bioinformatic modules that query single nucleotide polymorphisms (SNPs) in population groups of interest to generate haplotype tables and interactive plots. Modules are designed with an emphasis on ease of use, query flexibility, and interactive visualization of results. Phase 3 haplotype data from the 1000 Genomes Project are referenced for calculating pairwise metrics of LD, searching for proxies in high LD, and enumerating all observed haplotypes. LDlink is tailored for investigators interested in mapping common and uncommon disease susceptibility loci by focusing on output linking correlated alleles and highlighting putative functional variants. LDlink is a free and publically available web tool which can be accessed at http://analysistools.nci.nih.gov/LDlink/. mitchell.machiela@nih.gov. Published by Oxford University Press 2015. This work is written by US Government employees and is in the public domain in the US.
De Kort, Hanne; Mergeay, Joachim; Jacquemyn, Hans; Honnay, Olivier
2016-01-01
Background and Aims Many invasive species severely threaten native biodiversity and ecosystem functioning. One of the most prominent questions in invasion genetics is how invasive populations can overcome genetic founder effects to establish stable populations after colonization of new habitats. High native genetic diversity and multiple introductions are expected to increase genetic diversity and adaptive potential in the invasive range. Our aim was to identify the European source populations of Frangula alnus (glossy buckthorn), an ornamental and highly invasive woody species that was deliberately introduced into North America at the end of the 18th century. A second aim of this study was to assess the adaptive potential as an explanation for the invasion success of this species. Methods Using a set of annotated single-nucleotide polymorphisms (SNPs) that were assigned a putative function based on sequence comparison with model species, a total of 38 native European and 21 invasive North American populations were subjected to distance-based structure and assignment analyses combined with population genomic tools. Genetic diversity at SNPs with ecologically relevant functions was considered as a proxy for adaptive potential. Key Results Patterns of invasion coincided with early modern transatlantic trading routes. Multiple introductions through transatlantic trade from a limited number of European port regions to American urban areas led to the establishment of bridgehead populations with high allelic richness and expected heterozygosity, allowing continuous secondary migration to natural areas. Conclusions Targeted eradication of the urban populations, where the highest genetic diversity and adaptive potential were observed, offers a promising strategy to arrest further invasion of native American prairies and forests. PMID:27539599
Singh, Kh Dhanachandra; Karthikeyan, Muthusamy
2014-12-01
The renin-angiotensin-aldosterone system (RAAS) plays a key role in the regulation of blood pressure (BP). Mutations on the genes that encode components of the RAAS have played a significant role in genetic susceptibility to hypertension and have been intensively scrutinized. The identification of such probably causal mutations not only provides insight into the RAAS but may also serve as antihypertensive therapeutic targets and diagnostic markers. The methods for analyzing the SNPs from the huge dataset of SNPs, containing both functional and neutral SNPs is challenging by the experimental approach on every SNPs to determine their biological significance. To explore the functional significance of genetic mutation (SNPs), we adopted combined sequence and sequence-structure-based SNP analysis algorithm. Out of 3864 SNPs reported in dbSNP, we found 108 missense SNPs in the coding region and remaining in the non-coding region. In this study, we are reporting only those SNPs in coding region to be deleterious when three or more tools are predicted to be deleterious and which have high RMSD from the native structure. Based on these analyses, we have identified two SNPs of REN gene, eight SNPs of AGT gene, three SNPs of ACE gene, two SNPs of AT1R gene, three SNPs of CYP11B2 gene and three SNPs of CMA1 gene in the coding region were found to be deleterious. Further this type of study will be helpful in reducing the cost and time for identification of potential SNP and also helpful in selecting potential SNP for experimental study out of SNP pool.
Li, Qianqian; Liu, Jianguo; Zhang, Litao; Liu, Qian
2014-01-01
Background Algae in the order Trentepohliales have a broad geographic distribution and are generally characterized by the presence of abundant β-carotene. The many monographs published to date have mainly focused on their morphology, taxonomy, phylogeny, distribution and reproduction; molecular studies of this order are still rare. High-throughput RNA sequencing (RNA-Seq) technology provides a powerful and efficient method for transcript analysis and gene discovery in Trentepohlia jolithus. Methods/Principal Findings Illumina HiSeq 2000 sequencing generated 55,007,830 Illumina PE raw reads, which were assembled into 41,328 assembled unigenes. Based on NR annotation, 53.28% of the unigenes (22,018) could be assigned to gene ontology classes with 54 subcategories and 161,451 functional terms. A total of 26,217 (63.44%) assembled unigenes were mapped to 128 KEGG pathways. Furthermore, a set of 5,798 SSRs in 5,206 unigenes and 131,478 putative SNPs were identified. Moreover, the fact that all of the C4 photosynthesis genes exist in T. jolithus suggests a complex carbon acquisition and fixation system. Similarities and differences between T. jolithus and other algae in carotenoid biosynthesis are also described in depth. Conclusions/Significance This is the first broad transcriptome survey for T. jolithus, increasing the amount of molecular data available for the class Ulvophyceae. As well as providing resources for functional genomics studies, the functional genes and putative pathways identified here will contribute to a better understanding of carbon fixation and fatty acid and carotenoid biosynthesis in T. jolithus. PMID:25254555
The search for loci under selection: trends, biases and progress.
Ahrens, Collin W; Rymer, Paul D; Stow, Adam; Bragg, Jason; Dillon, Shannon; Umbers, Kate D L; Dudaniec, Rachael Y
2018-03-01
Detecting genetic variants under selection using F ST outlier analysis (OA) and environmental association analyses (EAAs) are popular approaches that provide insight into the genetic basis of local adaptation. Despite the frequent use of OA and EAA approaches and their increasing attractiveness for detecting signatures of selection, their application to field-based empirical data have not been synthesized. Here, we review 66 empirical studies that use Single Nucleotide Polymorphisms (SNPs) in OA and EAA. We report trends and biases across biological systems, sequencing methods, approaches, parameters, environmental variables and their influence on detecting signatures of selection. We found striking variability in both the use and reporting of environmental data and statistical parameters. For example, linkage disequilibrium among SNPs and numbers of unique SNP associations identified with EAA were rarely reported. The proportion of putatively adaptive SNPs detected varied widely among studies, and decreased with the number of SNPs analysed. We found that genomic sampling effort had a greater impact than biological sampling effort on the proportion of identified SNPs under selection. OA identified a higher proportion of outliers when more individuals were sampled, but this was not the case for EAA. To facilitate repeatability, interpretation and synthesis of studies detecting selection, we recommend that future studies consistently report geographical coordinates, environmental data, model parameters, linkage disequilibrium, and measures of genetic structure. Identifying standards for how OA and EAA studies are designed and reported will aid future transparency and comparability of SNP-based selection studies and help to progress landscape and evolutionary genomics. © 2018 John Wiley & Sons Ltd.
Whole genome SNP discovery and analysis of genetic diversity in Turkey (Meleagris gallopavo)
2012-01-01
Background The turkey (Meleagris gallopavo) is an important agricultural species and the second largest contributor to the world’s poultry meat production. Genetic improvement is attributed largely to selective breeding programs that rely on highly heritable phenotypic traits, such as body size and breast muscle development. Commercial breeding with small effective population sizes and epistasis can result in loss of genetic diversity, which in turn can lead to reduced individual fitness and reduced response to selection. The presence of genomic diversity in domestic livestock species therefore, is of great importance and a prerequisite for rapid and accurate genetic improvement of selected breeds in various environments, as well as to facilitate rapid adaptation to potential changes in breeding goals. Genomic selection requires a large number of genetic markers such as e.g. single nucleotide polymorphisms (SNPs) the most abundant source of genetic variation within the genome. Results Alignment of next generation sequencing data of 32 individual turkeys from different populations was used for the discovery of 5.49 million SNPs, which subsequently were used for the analysis of genetic diversity among the different populations. All of the commercial lines branched from a single node relative to the heritage varieties and the South Mexican turkey population. Heterozygosity of all individuals from the different turkey populations ranged from 0.17-2.73 SNPs/Kb, while heterozygosity of populations ranged from 0.73-1.64 SNPs/Kb. The average frequency of heterozygous SNPs in individual turkeys was 1.07 SNPs/Kb. Five genomic regions with very low nucleotide variation were identified in domestic turkeys that showed state of fixation towards alleles different than wild alleles. Conclusion The turkey genome is much less diverse with a relatively low frequency of heterozygous SNPs as compared to other livestock species like chicken and pig. The whole genome SNP discovery study in turkey resulted in the detection of 5.49 million putative SNPs compared to the reference genome. All commercial lines appear to share a common origin. Presence of different alleles/haplotypes in the SM population highlights that specific haplotypes have been selected in the modern domesticated turkey. PMID:22891612
Genomic Characterization of Phenylalanine Ammonia Lyase Gene in Buckwheat
Thiyagarajan, Karthikeyan; Vitali, Fabio; Tolaini, Valentina; Galeffi, Patrizia; Cantale, Cristina; Vikram, Prashant; Singh, Sukhwinder; De Rossi, Patrizia; Nobili, Chiara; Procacci, Silvia; Del Fiore, Antonella; Antonini, Alessandro; Presenti, Ombretta; Brunori, Andrea
2016-01-01
Phenylalanine Ammonia Lyase (PAL) gene which plays a key role in bio-synthesis of medicinally important compounds, Rutin/quercetin was sequence characterized for its efficient genomics application. These compounds possessing anti-diabetic and anti-cancer properties and are predominantly produced by Fagopyrum spp. In the present study, PAL gene was sequenced from three Fagopyrum spp. (F. tataricum, F. esculentum and F. dibotrys) and showed the presence of three SNPs and four insertion/deletions at intra and inter specific level. Among them, the potential SNP (position 949th bp G>C) with Parsimony Informative Site was selected and successfully utilised to individuate the zygosity/allelic variation of 16 F. tataricum varieties. Insertion mutations were identified in coding region, which resulted the change of a stretch of 39 amino acids on the putative protein. Our Study revealed that autogamous species (F. tataricum) has lower frequency of observed SNPs as compared to allogamous species (F. dibotrys and F. esculentum). The identified SNPs in F. tataricum didn’t result to amino acid change, while in other two species it caused both conservative and non-conservative variations. Consistent pattern of SNPs across the species revealed their phylogenetic importance. We found two groups of F. tataricum and one of them was closely related with F. dibotrys. Sequence characterization information of PAL gene reported in present investigation can be utilized in genetic improvement of buckwheat in reference to its medicinal value. PMID:26990297
Yang, Hua; Wei, Chao-Ling; Liu, Hong-Wei; Wu, Jun-Lan; Li, Zheng-Guo; Zhang, Liang; Jian, Jian-Bo; Li, Ye-Yun; Tai, Yu-Ling; Zhang, Jing; Zhang, Zheng-Zhu; Jiang, Chang-Jun; Xia, Tao; Wan, Xiao-Chun
2016-01-01
Tea is one of the most popular beverages across the world and is made exclusively from cultivars of Camellia sinensis. Many wild relatives of the genus Camellia that are closely related to C. sinensis are native to Southwest China. In this study, we first identified the distinct genetic divergence between C. sinensis and its wild relatives and provided a glimpse into the artificial selection of tea plants at a genome-wide level by analyzing 15,444 genomic SNPs that were identified from 18 cultivated and wild tea accessions using a high-throughput genome-wide restriction site-associated DNA sequencing (RAD-Seq) approach. Six distinct clusters were detected by phylogeny inferrence and principal component and genetic structural analyses, and these clusters corresponded to six Camellia species/varieties. Genetic divergence apparently indicated that C. taliensis var. bangwei is a semi-wild or transient landrace occupying a phylogenetic position between those wild and cultivated tea plants. Cultivated accessions exhibited greater heterozygosity than wild accessions, with the exception of C. taliensis var. bangwei. Thirteen genes with non-synonymous SNPs exhibited strong selective signals that were suggestive of putative artificial selective footprints for tea plants during domestication. The genome-wide SNPs provide a fundamental data resource for assessing genetic relationships, characterizing complex traits, comparing heterozygosity and analyzing putatitve artificial selection in tea plants.
Qi, Jie; Liu, Xudong; Liu, Jinxiang; Yu, Haiyang; Wang, Wenji; Wang, Zhigang; Zhang, Quanqi
2014-08-01
Ambient temperature is one of the major abiotic environmental factors determining the main parameters of fish vital activity. HSP70 plays an essential role in heat response. In this investigation, the promoter and structure of Paralichthys olivaceus hsp70 (Pohsp70) gene was cloned and predicted. 2558 bp upstream regulatory region of Pohsp70 was annotated with four potential promoter elements and four putative binding sites of transcription factors heat shock elements (HSE, nGAAn) in the upstream of the transcription start site. In addition, one intron with 454 bp in the 5'-noncoding region was found. Quantitative Real Time PCR analysis indicated that the transcript level of Pohsp70 was raised markedly after 1 h by heat shocked. Furthermore, 25 SNPs were identified in Pohsp70 by resequencing, seven of which was associated with heat resistance. In addition, two of the seven SNPs, namely SNP14 and SNP16, were observed in strong linkage disequilibrium. The haplotype with association analysis showed TAGGAG haplotype was more represented in heat susceptible group while (DEL/T) GAATA haplotype was more frequent in heat resistant group. The heat resistant SNPs and haplotype could be candidate markers potentially serving for selective breeding programs of Japanese flounder aimed at improving anti-stress and production. Copyright © 2014 Elsevier Ltd. All rights reserved.
Haplotypes and Sequence Variation in the Ovine Adiponectin Gene (ADIPOQ)
An, Qing-Ming; Zhou, Hui-Tong; Hu, Jiang; Luo, Yu-Zhu; Hickford, Jon G. H.
2015-01-01
The adiponectin gene (ADIPOQ) plays an important role in energy homeostasis. In this study five separate regions (regions 1 to 5) of ovine ADIPOQ were analysed using PCR-SSCP. Four different PCR-SSCP patterns (A1-D1, A2-D2) were detected in region-1 and region-2, respectively, with seven and six SNPs being revealed. In region-3, three different patterns (A3-C3) and three SNPs were observed. Two patterns (A4-B4, A5-B5) and two and one SNPs were observed in region-4 and region-5, respectively. In total, nineteen SNPs were detected, with five of them in the coding region and two (c.46T/C and c.515G/A) putatively resulting in amino acid changes (p.Tyr16His and p.Lys172Arg). In region-1, -2 and -3 of 316 sheep from eight New Zealand breeds, variants A1, A2 and A3 were the most common, although variant frequencies differed in the eight breeds. Across region-1 and region-3, nine haplotypes were identified and haplotypes A1-A3, A1-C3, B1-A3 and B1-C3 were most common. These results indicate that the ADIPOQ gene is polymorphic and suggest that further analysis is required to see if the variation in the gene is associated with animal production traits. PMID:26610572
Wilson, Ian M.; Vucic, Emily A.; Enfield, Katey S.S.; Thu, Kelsie L.; Zhang, Yu-An; Chari, Raj; Lockwood, William W.; Radulovich, Niki; Starczynowski, Daniel T.; Banáth, Judit P.; Zhang, May; Pusic, Andrea; Fuller, Megan; Lonergan, Kim M.; Rowbotham, David; Yee, John; English, John C.; Buys, Timon P.H.; Selamat, Suhaida A.; Laird-Offringa, Ite A.; Liu, Pengyuan; Anderson, Marshall; You, Ming; Tsao, Ming-Sound; Brown, Carolyn J.; Bennewith, Kevin L.; MacAulay, Calum E.; Karsan, Aly; Gazdar, Adi F.; Lam, Stephen; Lam, Wan L.
2015-01-01
In an effort to identify novel biallelically inactivated tumor suppressor genes (TSG) in sporadic invasive and pre-invasive non-small cell lung cancer (NSCLC) genomes, we applied a comprehensive integrated multi-‘omics approach to investigate patient matched, paired NSCLC tumor and non-malignant parenchymal tissues. By surveying lung tumor genomes for genes concomitantly inactivated within individual tumors by multiple mechanisms, and by the frequency of disruption in tumors across multiple cohorts, we have identified a putative lung cancer TSG, Eyes Absent 4 (EYA4). EYA4 is frequently and concomitantly deleted, hypermethylated and underexpressed in multiple independent lung tumor data sets, in both major NSCLC subtypes, and in the earliest stages of lung cancer. We find not only that decreased EYA4 expression is associated with poor survival in sporadic lung cancers, but EYA4 SNPs are associated with increased familial cancer risk, consistent with EYA4’s proximity to the previously reported lung cancer susceptibility locus on 6q. Functionally, we find that EYA4 displays TSG-like properties with a role in modulating apoptosis and DNA repair. Cross examination of EYA4 expression across multiple tumor types suggests a cell type-specific tumorigenic role for EYA4, consistent with a tumor suppressor function in cancers of epithelial origin. This work shows a clear role for EYA4 as a putative TSG in NSCLC. PMID:24096489
Explaining the disease phenotype of intergenic SNP through predicted long range regulation
Chen, Jingqi; Tian, Weidong
2016-01-01
Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or approximate to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it's likely that IGR daSNPs may contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. PMID:27280978
Chatsuriyawong, Siriporn; Gozal, David; Kheirandish-Gozal, Leila; Bhattacharjee, Rakesh; Khalyfa, Ahamed A; Wang, Yang; Sukhumsirichart, Wasana; Khalyfa, Abdelnaby
2013-09-06
Obstructive sleep apnea (OSA) is associated with adverse and interdependent cognitive and cardiovascular consequences. Increasing evidence suggests that nitric oxide synthase (NOS) and endothelin family (EDN) genes underlie mechanistic aspects of OSA-associated morbidities. We aimed to identify single nucleotide polymorphisms (SNPs) in the NOS family (3 isoforms), and EDN family (3 isoforms) to identify potential associations of these SNPs in children with OSA. A pediatric community cohort (ages 5-10 years) enriched for snoring underwent overnight polysomnographic (NPSG) and a fasting morning blood draw. The diagnostic criteria for OSA were an obstructive apnea-hypopnea Index (AHI) >2/h total sleep time (TST), snoring during the night, and a nadir oxyhemoglobin saturation <92%. Control children were defined as non-snoring children with AHI <2/h TST (NOSA). Endothelial function was assessed using a modified post-occlusive hyperemic test. The time to peak reperfusion (Tmax) was considered as the indicator for normal endothelial function (NEF; Tmax<45 sec), or ED (Tmax ≥ 45 sec). Genomic DNA from peripheral blood was extracted and allelic frequencies were assessed for, NOS1 (209 SNPs), NOS2 (122 SNPs), NOS3 (50 SNPs), EDN1 (43 SNPs), EDN2 (48 SNPs), EDN3 (14 SNPs), endothelin receptor A, EDNRA, (27 SNPs), and endothelin receptor B, EDNRB (23 SNPs) using a custom SNPs array. The relative frequencies of NOS-1,-2, and -3, and EDN-1,-2,-3,-EDNRA, and-EDNRB genotypes were evaluated in 608 subjects [128 with OSA, and 480 without OSA (NOSA)]. Furthermore, subjects with OSA were divided into 2 subgroups: OSA with normal endothelial function (OSA-NEF), and OSA with endothelial dysfunction (OSA-ED). Linkage disequilibrium was analyzed using Haploview version 4.2 software. For NOSA vs. OSA groups, 15 differentially distributed SNPs for NOS1 gene, and 1 SNP for NOS3 emerged, while 4 SNPs for EDN1 and 1 SNP for both EDN2 and EDN3 were identified. However, in the smaller sub-group for whom endothelial function was available, none of the significant SNPs was retained due to lack of statistical power. Differences in the distribution of polymorphisms among NOS and EDN gene families suggest that these SNPs could play a contributory role in the pathophysiology and risk of OSA-induced cardiovascular morbidity. Thus, analysis of genotype-phenotype interactions in children with OSA may assist in the formulation of categorical risk estimates.
Genomic Regions Associated with Root Traits under Drought Stress in Tropical Maize (Zea mays L.)
Zaidi, P. H.; Krishna, Girish; Krishnamurthy, L.; Gajanan, S.; Babu, Raman; Zerka, M.; Vinayan, M. T.; Vivek, B. S.
2016-01-01
An association mapping panel, named as CIMMYT Asia association mapping (CAAM) panel, involving 396 diverse tropical maize lines were phenotyped for various structural and functional traits of roots under drought and well-watered conditions. The experiment was conducted during Kharif (summer-rainy) season of 2012 and 2013 in root phenotyping facility at CIMMYT-Hyderabad, India. The CAAM panel was genotyped to generate 955, 690 SNPs through GBS v2.7 using Illumina Hi-seq 2000/2500 at Institute for Genomic Diversity, Cornell University, Ithaca, NY, USA. GWAS analysis was carried out using 331,390 SNPs filtered from the entire set of SNPs revealed a total of 50 and 67 SNPs significantly associated for root functional (transpiration efficiency, flowering period water use) and structural traits (rooting depth, root dry weight, root length, root volume, root surface area and root length density), respectively. In addition to this, 37 SNPs were identified for grain yield and shoot biomass under well-watered and drought stress. Though many SNPs were found to have significant association with the traits under study, SNPs that were common for more than one trait were discussed in detail. A total 18 SNPs were found to have common association with more than one trait, out of which 12 SNPs were found within or near the various gene functional regions. In this study we attempted to identify the trait specific maize lines based on the presence of favorable alleles for the SNPs associated with multiple traits. Two SNPs S3_128533512 and S7_151238865 were associated with transpiration efficiency, shoot biomass and grain yield under well-watered condition. Based on favorable allele for these SNPs seven inbred lines were identified. Similarly, four lines were identified for transpiration efficiency and shoot biomass under drought stress based on the presence of favorable allele for the common SNPs S1_211520521, S2_20017716, S3_57210184 and S7_130878458 and three lines were identified for flowering period water-use, transpiration efficiency, root dry weight and root volume based on the presence of favorable allele for the common SNPs S3_162065732 and S3_225760139. PMID:27768702
2013-01-01
Background SNPs&GO is a method for the prediction of deleterious Single Amino acid Polymorphisms (SAPs) using protein functional annotation. In this work, we present the web server implementation of SNPs&GO (WS-SNPs&GO). The server is based on Support Vector Machines (SVM) and for a given protein, its input comprises: the sequence and/or its three-dimensional structure (when available), a set of target variations and its functional Gene Ontology (GO) terms. The output of the server provides, for each protein variation, the probabilities to be associated to human diseases. Results The server consists of two main components, including updated versions of the sequence-based SNPs&GO (recently scored as one of the best algorithms for predicting deleterious SAPs) and of the structure-based SNPs&GO3d programs. Sequence and structure based algorithms are extensively tested on a large set of annotated variations extracted from the SwissVar database. Selecting a balanced dataset with more than 38,000 SAPs, the sequence-based approach achieves 81% overall accuracy, 0.61 correlation coefficient and an Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve of 0.88. For the subset of ~6,600 variations mapped on protein structures available at the Protein Data Bank (PDB), the structure-based method scores with 84% overall accuracy, 0.68 correlation coefficient, and 0.91 AUC. When tested on a new blind set of variations, the results of the server are 79% and 83% overall accuracy for the sequence-based and structure-based inputs, respectively. Conclusions WS-SNPs&GO is a valuable tool that includes in a unique framework information derived from protein sequence, structure, evolutionary profile, and protein function. WS-SNPs&GO is freely available at http://snps.biofold.org/snps-and-go. PMID:23819482
Genetic variants in Fanconi Anemia Pathway Genes BRCA2 and FANCA Predict Melanoma Survival
Liu, Zhensheng; Wang, Li-E; Chen, Wei V.; Zhu, Dakai; Amos, Christopher I.; Fang, Shenying; Lee, Jeffrey E.; Wei, Qingyi
2014-01-01
Cutaneous melanoma (CM) is the most lethal skin cancer. The Fanconi Anemia (FA) pathway involved in DNA crosslinks repair may affect CM susceptibility and prognosis. Using data derived from published genome-wide association study, we comprehensively analyzed the associations of 2339 common single nucleotide polymorphisms (SNPs) in 14 autosomal FA genes with overall survival (OS) in 858 CM patients. By performing false-positive report probability corrections and stepwise Cox proportional hazards regression analyses, we identified significant associations between CM OS and four putatively functional SNPs: BRCA2 rs10492396 [AG vs. GG: adjusted hazard ratio (adjHR)=1.85, 95% confident interval (CI)=1.16-2.95, P=0.010], rs206118 (CC vs. TT+TC: adjHR=2.44, 95% CI=1.27-4.67, P=0.007), rs3752447 (CC vs. TT+TC: adjHR=2.10, 95% CI=1.38-3.18, P=0.0005), and FANCA rs62068372 (TT vs. CC+CT: adjHR=1.85, 95% CI=1.27-2.69, P=0.001). Moreover, patients with an increasing number of unfavorable genotypes (NUG) of these loci had markedly reduced OS and melanoma-specific survival (MSS). The final model incorporating with NUG, tumor stage and Breslow thickness showed an improved discriminatory ability to classify both 5-year OS and 5-year MSS. Additional investigations, preferably prospective studies, are needed to validate our findings. PMID:25243787
Genetic variants in fanconi anemia pathway genes BRCA2 and FANCA predict melanoma survival.
Yin, Jieyun; Liu, Hongliang; Liu, Zhensheng; Wang, Li-E; Chen, Wei V; Zhu, Dakai; Amos, Christopher I; Fang, Shenying; Lee, Jeffrey E; Wei, Qingyi
2015-02-01
Cutaneous melanoma (CM) is the most lethal skin cancer. The Fanconi anemia (FA) pathway involved in DNA crosslink repair may affect CM susceptibility and prognosis. Using data derived from published genome-wide association study, we comprehensively analyzed the associations of 2,339 common single-nucleotide polymorphisms (SNPs) in 14 autosomal FA genes with overall survival (OS) in 858 CM patients. By performing false-positive report probability corrections and stepwise Cox proportional hazards regression analyses, we identified significant associations between CM OS and four putatively functional SNPs: BRCA2 rs10492396 (AG vs. GG: adjusted hazard ratio (adjHR)=1.85, 95% confidence interval (CI)=1.16-2.95, P=0.010), rs206118 (CC vs. TT+TC: adjHR=2.44, 95% CI=1.27-4.67, P=0.007), rs3752447 (CC vs. TT+TC: adjHR=2.10, 95% CI=1.38-3.18, P=0.0005), and FANCA rs62068372 (TT vs. CC+CT: adjHR=1.85, 95% CI=1.27-2.69, P=0.001). Moreover, patients with an increasing number of unfavorable genotypes (NUG) of these loci had markedly reduced OS and melanoma-specific survival (MSS). The final model incorporating with NUG, tumor stage, and Breslow thickness showed an improved discriminatory ability to classify both 5-year OS and 5-year MSS. Additional investigations, preferably prospective studies, are needed to validate our findings.
Transcriptome sequencing and de novo analysis of the copepod Calanus sinicus using 454 GS FLX.
Ning, Juan; Wang, Minxiao; Li, Chaolun; Sun, Song
2013-01-01
Despite their species abundance and primary economic importance, genomic information about copepods is still limited. In particular, genomic resources are lacking for the copepod Calanus sinicus, which is a dominant species in the coastal waters of East Asia. In this study, we performed de novo transcriptome sequencing to produce a large number of expressed sequence tags for the copepod C. sinicus. Copepodid larvae and adults were used as the basic material for transcriptome sequencing. Using 454 pyrosequencing, a total of 1,470,799 reads were obtained, which were assembled into 56,809 high quality expressed sequence tags. Based on their sequence similarity to known proteins, about 14,000 different genes were identified, including members of all major conserved signaling pathways. Transcripts that were putatively involved with growth, lipid metabolism, molting, and diapause were also identified among these genes. Differentially expressed genes related to several processes were found in C. sinicus copepodid larvae and adults. We detected 284,154 single nucleotide polymorphisms (SNPs) that provide a resource for gene function studies. Our data provide the most comprehensive transcriptome resource available for C. sinicus. This resource allowed us to identify genes associated with primary physiological processes and SNPs in coding regions, which facilitated the quantitative analysis of differential gene expression. These data should provide foundation for future genetic and genomic studies of this and related species.
Genetic dissection of host immune response in pneumonia development and progression.
Smelaya, Tamara V; Belopolskaya, Olesya B; Smirnova, Svetlana V; Kuzovlev, Artem N; Moroz, Viktor V; Golubev, Arkadiy M; Pabalan, Noel A; Salnikova, Lyubov E
2016-10-11
The role of host genetic variation in pneumonia development and outcome is poorly understood. We studied common polymorphisms in the genes of proinflammatory cytokines (IL6 rs1800795, IL8 rs4073, IL1B rs16944), anti-inflammatory cytokines (IL10 rs1800896, IL4 rs2243250, IL13 rs20541) and toll-like receptors (TLR2 rs5743708 and rs4696480, TLR4 rs4986791, TLR9 rs352139, rs5743836 and rs187084) in patients with community-acquired pneumonia (CAP) (390 cases, 203 controls) and nosocomial pneumonia (355 cases, 216 controls). Experimental data were included in a series of 11 meta-analyses and eight subset analyses related to pneumonia susceptibility and outcome. TLR2 rs5743708 minor genotype appeared to be associated with CAP/Legionnaires' disease/pneumococcal disease. In CAP patients, the IL6 rs1800795-C allele was associated with severe sepsis/septic shock/severe systemic inflammatory response, while the IL10 rs1800896-A allele protected against the development of these critical conditions. To contribute to deciphering of the above results, we performed an in silico analysis and a qualitative synthesis of literature data addressing basal and stimulated genotype-specific expression level. This data together with database information on transcription factors' affinity changes caused by SNPs in putative promoter regions, the results of linkage disequilibrium analysis along with SNPs functional annotations supported assumptions about the complexity underlying the revealed associations.
Pooled Genome-Wide Analysis to Identify Novel Risk Loci for Pediatric Allergic Asthma
Ricci, Giampaolo; Astolfi, Annalisa; Remondini, Daniel; Cipriani, Francesca; Formica, Serena; Dondi, Arianna; Pession, Andrea
2011-01-01
Background Genome-wide association studies of pooled DNA samples were shown to be a valuable tool to identify candidate SNPs associated to a phenotype. No such study was up to now applied to childhood allergic asthma, even if the very high complexity of asthma genetics is an appropriate field to explore the potential of pooled GWAS approach. Methodology/Principal Findings We performed a pooled GWAS and individual genotyping in 269 children with allergic respiratory diseases comparing allergic children with and without asthma. We used a modular approach to identify the most significant loci associated with asthma by combining silhouette statistics and physical distance method with cluster-adapted thresholding. We found 97% concordance between pooled GWAS and individual genotyping, with 36 out of 37 top-scoring SNPs significant at individual genotyping level. The most significant SNP is located inside the coding sequence of C5, an already identified asthma susceptibility gene, while the other loci regulate functions that are relevant to bronchial physiopathology, as immune- or inflammation-mediated mechanisms and airway smooth muscle contraction. Integration with gene expression data showed that almost half of the putative susceptibility genes are differentially expressed in experimental asthma mouse models. Conclusion/Significance Combined silhouette statistics and cluster-adapted physical distance threshold analysis of pooled GWAS data is an efficient method to identify candidate SNP associated to asthma development in an allergic pediatric population. PMID:21359210
GENETIC VARIANTS, IMMUNE FUNCTION AND RISK OF PRE-ECLAMPSIA AMONG AMERICAN INDIANS
Best, Lyle G.; Nadeau, Melanie; Davis, Kylie; Lamb, Felicia; Bercier, Shellee; Anderson, Cindy M.
2011-01-01
Objective To determine the prevalence in an American Indian population of genetic variants with putative effects on immune function and determine if they are associated with pre-eclampsia. Methods In a study of 66 cases and 130 matched controls, six single nucleotide polymorphisms (SNP) with either previously demonstrated or postulated modulating effects on the immune system were genotyped. Allele frequencies and various genetic models were evaluated by conditional logistic regression in both univariate and multiply adjusted models. Results Although most genetic variants lacked evidence of association with pre-eclampsia, the minor allele of the CRP related, rs1205 SNP in a dominant model with adjustment for age at delivery, nulliparity and body mass index, exhibited an odds ratio of 0.259 (95% CI of 0.08 – 0.81, p=0.020) in relation to severe pre-eclampsia (48 cases). The allelic prevalence of this variant was 46.1% in this population. Conclusion Of the six SNPs related to immune function in this study, a functional variant in the 3'UTR of the CRP gene was shown to be associated with severe pre-eclampsia in an American Indian population. PMID:22004660
Matimba, Alice; Li, Fang; Livshits, Alina; Cartwright, Cher S; Scully, Stephen; Fridley, Brooke L; Jenkins, Gregory; Batzler, Anthony; Wang, Liewei; Weinshilboum, Richard; Lennard, Lynne
2014-01-01
Aim We investigated candidate genes associated with thiopurine metabolism and clinical response in childhood acute lymphoblastic leukemia. Materials & methods We performed genome-wide SNP association studies of 6-thioguanine and 6-mercaptopurine cytotoxicity using lymphoblastoid cell lines. We then genotyped the top SNPs associated with lymphoblastoid cell line cytotoxicity, together with tagSNPs for genes in the ‘thiopurine pathway’ (686 total SNPs), in DNA from 589 Caucasian UK ALL97 patients. Functional validation studies were performed by siRNA knockdown in cancer cell lines. Results SNPs in the thiopurine pathway genes ABCC4, ABCC5, IMPDH1, ITPA, SLC28A3 and XDH, and SNPs located within or near ATP6AP2, FRMD4B, GNG2, KCNMA1 and NME1, were associated with clinical response and measures of thiopurine metabolism. Functional validation showed shifts in cytotoxicity for these genes. Conclusion The clinical response to thiopurines may be regulated by variation in known thiopurine pathway genes and additional novel genes outside of the thiopurine pathway. PMID:24624911
Richardson, Kris; Schnitzler, Gavin R; Lai, Chao-Qiang; Ordovas, Jose M
2015-12-01
Cardiovascular disease and type 2 diabetes mellitus represent overlapping diseases where a large portion of the variation attributable to genetics remains unexplained. An important player in their pathogenesis is peroxisome proliferator-activated receptor γ (PPARγ) that is involved in lipid and glucose metabolism and maintenance of metabolic homeostasis. We used a functional genomics methodology to interrogate human chromatin immunoprecipitation-sequencing, genome-wide association studies, and expression quantitative trait locus data to inform selection of candidate functional single nucleotide polymorphisms (SNPs) falling in PPARγ motifs. We derived 27 328 chromatin immunoprecipitation-sequencing peaks for PPARγ in human adipocytes through meta-analysis of 3 data sets. The PPARγ consensus motif showed greatest enrichment and mapped to 8637 peaks. We identified 146 SNPs in these motifs. This number was significantly less than would be expected by chance, and Inference of Natural Selection from Interspersed Genomically coHerent elemenTs analysis indicated that these motifs are under weak negative selection. A screen of these SNPs against genome-wide association studies for cardiometabolic traits revealed significant enrichment with 16 SNPs. A screen against the MuTHER expression quantitative trait locus data revealed 8 of these were significantly associated with altered gene expression in human adipose, more than would be expected by chance. Several SNPs fall close, or are linked by expression quantitative trait locus to lipid-metabolism loci including CYP26A1. We demonstrated the use of functional genomics to identify SNPs of potential function. Specifically, that SNPs within PPARγ motifs that bind PPARγ in adipocytes are significantly associated with cardiometabolic disease and with the regulation of transcription in adipose. This method may be used to uncover functional SNPs that do not reach significance thresholds in the agnostic approach of genome-wide association studies. © 2015 American Heart Association, Inc.
Saavedra-Rodriguez, Karla; Beaty, Meaghan; Lozano-Fuentes, Saul; Denham, Steven; Garcia-Rejon, Julian; Reyes-Solis, Guadalupe; Machain-Williams, Carlos; Loroño-Pino, Maria Alba; Flores-Suarez, Adriana; Ponce-Garcia, Gustavo; Beaty, Barry; Eisen, Lars; Black, William C
2015-01-01
The mosquito Aedes aegypti is the major vector of the four serotypes of dengue virus (DENV1-4). Previous studies have shown that Ae. aegypti in Mexico have a high effective migration rate and that gene flow occurs among populations that are up to 150 km apart. Since 2000, pyrethroids have been widely used for suppression of Ae. aegypti in cities in Mexico. In Yucatan State in particular, pyrethroids have been applied in and around dengue case households creating an opportunity for local selection and evolution of resistance. Herein, we test for evidence of local adaptation by comparing patterns of variation among 27 Ae. aegypti collections at 13 single nucleotide polymorphisms (SNPs): two in the voltage-gated sodium channel gene para known to confer knockdown resistance, three in detoxification genes previously associated with pyrethroid resistance, and eight in putatively neutral loci. The SNPs in para varied greatly in frequency among collections, whereas SNPs at the remaining 11 loci showed little variation supporting previous evidence for extensive local gene flow. Among Ae. aegypti in Yucatan State, Mexico, local adaptation to pyrethroids appears to offset the homogenizing effects of gene flow. © The American Society of Tropical Medicine and Hygiene.
Demirci, F Yesim; Wang, Xingbin; Morris, David L; Feingold, Eleanor; Bernatsky, Sasha; Pineau, Christian; Clarke, Ann; Ramsey-Goldman, Rosalind; Manzi, Susan; Vyse, Timothy J; Kamboh, M I
2017-06-01
A major systemic lupus erythematosus (SLE) susceptibility locus lies within a common inversion polymorphism region (encompassing 3.8 - 4.5 Mb) located at 8p23. Initially implicated genes included FAM167A-BLK and XKR6 , of which BLK received major attention due to its known role in B-cell biology. Recently, additional SLE risk carried in non-inverted background was also reported. In this case -control study, we further investigated the 'extended' 8p23 locus (~ 4 Mb) where we observed multiple SLE signals and assessed these signals for their relation to the inversion affecting this region. The study involved a North American discovery data set ( ~ 1200 subjects) and a replication data set (> 10 000 subjects) comprising European-descent individuals. Meta-analysis of 8p23 SNPs, with p < 0.05 in both data sets, identified 51 genome-wide significant SNPs (p < 5.0 × 10 -8 ). While most of these SNPs were related to previously implicated signals ( XKR6-FAM167A-BLK subregion), our results also revealed two 'new' SLE signals, including SGK223-CLDN23-MFHAS1 (6.06 × 10 -9 ≤ meta p ≤ 4.88 × 10 -8 ) and CTSB (meta p = 4.87 × 10 -8 ) subregions that are located > 2 Mb upstream and ~ 0.3 Mb downstream from previously reported signals. Functional assessment of relevant SNPs indicated putative cis -effects on the expression of various genes at 8p23. Additional analyses in discovery sample, where the inversion genotypes were inferred, replicated the association of non-inverted status with SLE risk and suggested that a number of SLE risk alleles are predominantly carried in non-inverted background. Our results implicate multiple (known+novel) SLE signals/genes at the extended 8p23 locus, beyond previously reported signals/genes, and suggest that this broad locus contributes to SLE risk through the effects of multiple genes/pathways. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Li, Wan; Zhu, Lina; Huang, Hao; He, Yuehan; Lv, Junjie; Li, Weimin; Chen, Lina; He, Weiming
2017-10-01
Complex chronic diseases are caused by the effects of genetic and environmental factors. Single nucleotide polymorphisms (SNPs), one common type of genetic variations, played vital roles in diseases. We hypothesized that disease risk functional SNPs in coding regions and protein interaction network modules were more likely to contribute to the identification of disease susceptible genes for complex chronic diseases. This could help to further reveal the pathogenesis of complex chronic diseases. Disease risk SNPs were first recognized from public SNP data for coronary heart disease (CHD), hypertension (HT) and type 2 diabetes (T2D). SNPs in coding regions that were classified into nonsense and missense by integrating several SNP functional annotation databases were treated as functional SNPs. Then, regions significantly associated with each disease were screened using random permutations for disease risk functional SNPs. Corresponding to these regions, 155, 169 and 173 potential disease susceptible genes were identified for CHD, HT and T2D, respectively. A disease-related gene product interaction network in environmental context was constructed for interacting gene products of both disease genes and potential disease susceptible genes for these diseases. After functional enrichment analysis for disease associated modules, 5 CHD susceptible genes, 7 HT susceptible genes and 3 T2D susceptible genes were finally identified, some of which had pleiotropic effects. Most of these genes were verified to be related to these diseases in literature. This was similar for disease genes identified from another method proposed by Lee et al. from a different aspect. This research could provide novel perspectives for diagnosis and treatment of complex chronic diseases and susceptible genes identification for other diseases. Copyright © 2017 Elsevier Inc. All rights reserved.
Explaining the disease phenotype of intergenic SNP through predicted long range regulation.
Chen, Jingqi; Tian, Weidong
2016-10-14
Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or approximate to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it's likely that IGR daSNPs may contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Markunas, Christina A; Johnson, Eric O; Hancock, Dana B
2017-07-01
Genome-wide association study (GWAS)-identified variants are enriched for functional elements. However, we have limited knowledge of how functional enrichment may differ by disease/trait and tissue type. We tested a broad set of eight functional elements for enrichment among GWAS-identified SNPs (p < 5×10 -8 ) from the NHGRI-EBI Catalog across seven disease/trait categories: cancer, cardiovascular disease, diabetes, autoimmune disease, psychiatric disease, neurological disease, and anthropometric traits. SNPs were annotated using HaploReg for the eight functional elements across any tissue: DNase sites, expression quantitative trait loci (eQTL), sequence conservation, enhancers, promoters, missense variants, sequence motifs, and protein binding sites. In addition, tissue-specific annotations were considered for brain vs. blood. Disease/trait SNPs were compared to a control set of 4809 SNPs matched to the GWAS SNPs (N = 1639) on allele frequency, gene density, distance to nearest gene, and linkage disequilibrium at ~3:1 ratio. Enrichment analyses were conducted using logistic regression, with Bonferroni correction. Overall, a significant enrichment was observed for all functional elements, except sequence motifs. Missense SNPs showed the strongest magnitude of enrichment. eQTLs were the only functional element significantly enriched across all diseases/traits. Magnitudes of enrichment were generally similar across diseases/traits, where enrichment was statistically significant. Blood vs. brain tissue effects on enrichment were dependent on disease/trait and functional element (e.g., cardiovascular disease: eQTLs P TissueDifference = 1.28 × 10 -6 vs. enhancers P TissueDifference = 0.94). Identifying disease/trait-relevant functional elements and tissue types could provide new insight into the underlying biology, by guiding a priori GWAS analyses (e.g., brain enhancer elements for psychiatric disease) or facilitating post hoc interpretation.
Development of a single nucleotide polymorphism barcode to genotype Plasmodium vivax infections.
Baniecki, Mary Lynn; Faust, Aubrey L; Schaffner, Stephen F; Park, Daniel J; Galinsky, Kevin; Daniels, Rachel F; Hamilton, Elizabeth; Ferreira, Marcelo U; Karunaweera, Nadira D; Serre, David; Zimmerman, Peter A; Sá, Juliana M; Wellems, Thomas E; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E; Volkman, Sarah K; Wirth, Dyann F; Sabeti, Pardis C
2015-03-01
Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25-40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections.
Development of a Single Nucleotide Polymorphism Barcode to Genotype Plasmodium vivax Infections
Baniecki, Mary Lynn; Faust, Aubrey L.; Schaffner, Stephen F.; Park, Daniel J.; Galinsky, Kevin; Daniels, Rachel F.; Hamilton, Elizabeth; Ferreira, Marcelo U.; Karunaweera, Nadira D.; Serre, David; Zimmerman, Peter A.; Sá, Juliana M.; Wellems, Thomas E.; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E.; Volkman, Sarah K.; Wirth, Dyann F.; Sabeti, Pardis C.
2015-01-01
Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25–40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890
Identification, validation and high-throughput genotyping of transcribed gene SNPs in cassava.
Ferguson, Morag E; Hearne, Sarah J; Close, Timothy J; Wanamaker, Steve; Moskal, William A; Town, Christopher D; de Young, Joe; Marri, Pradeep Reddy; Rabbi, Ismail Yusuf; de Villiers, Etienne P
2012-03-01
The availability of genomic resources can facilitate progress in plant breeding through the application of advanced molecular technologies for crop improvement. This is particularly important in the case of less researched crops such as cassava, a staple and food security crop for more than 800 million people. Here, expressed sequence tags (ESTs) were generated from five drought stressed and well-watered cassava varieties. Two cDNA libraries were developed: one from root tissue (CASR), the other from leaf, stem and stem meristem tissue (CASL). Sequencing generated 706 contigs and 3,430 singletons. These sequences were combined with those from two other EST sequencing initiatives and filtered based on the sequence quality. Quality sequences were aligned using CAP3 and embedded in a Windows browser called HarvEST:Cassava which is made available. HarvEST:Cassava consists of a Unigene set of 22,903 quality sequences. A total of 2,954 putative SNPs were identified. Of these 1,536 SNPs from 1,170 contigs and 53 cassava genotypes were selected for SNP validation using Illumina's GoldenGate assay. As a result 1,190 SNPs were validated technically and biologically. The location of validated SNPs on scaffolds of the cassava genome sequence (v.4.1) is provided. A diversity assessment of 53 cassava varieties reveals some sub-structure based on the geographical origin, greater diversity in the Americas as opposed to Africa, and similar levels of diversity in West Africa and southern, eastern and central Africa. The resources presented allow for improved genetic dissection of economically important traits and the application of modern genomics-based approaches to cassava breeding and conservation.
Li, Shengjie; Liu, Hao; Bai, Junjie; Zhu, Xinping
2017-04-01
Growth is one of the most crucial economic traits of all aquaculture species, but the molecular mechanisms involved in growth of largemouth bass (Micropterus salmoides) are poorly understood. The objective of this study was to screen growth-related genes of M. salmoides by RNA sequencing and identify growth-related single-nucleotide polymorphism (SNP) markers through a growth association study. The muscle transcriptomes of fast- and slow-growing largemouth bass were obtained using the RNA-Seq technique. A total of 54,058,178 and 54,742,444 qualified Illumina read pairs were obtained for the fast-growing and slow-growing groups, respectively, giving rise to 4,865,236,020 and 4,926,819,960 total clean bases, respectively. Gene expression profiling showed that 3,530 unigenes were differentially expressed between the fast-growing and slow-growing phenotypes (false discovery rate ≤0.001, the absolute value of log 2 (fold change) ≥1), including 1,441 up-regulated and 2,889 down-regulated unigenes in the fast-growing largemouth bass. Analysis of these genes revealed that several signalling pathways, including the growth hormone-insulin-like growth factor 1 axis and signalling pathway, the glycolysis pathway, and the myostatin/transforming growth factor beta signalling pathway, as well as heat shock protein, cytoskeleton, and myofibril component genes might be associated with muscle growth. From these genes, 10 genes with putative SNPs were selected, and 17 SNPs were genotyped successfully. Marker-trait analysis in 340 individuals of Youlu No. 1 largemouth bass revealed three SNPs associated with growth in key genes (phosphoenolpyruvate carboxykinase 1, FOXO3b, and heat shock protein beta-1). This research provides information about key genes and SNPs related to growth, providing new clues to understanding the molecular basis of largemouth bass growth.
Genome Features of “Dark-Fly”, a Drosophila Line Reared Long-Term in a Dark Environment
Zhou, Jun; Sugiyama, Yuzo; Nishimura, Osamu; Aizu, Tomoyuki; Toyoda, Atsushi; Fujiyama, Asao; Agata, Kiyokazu
2012-01-01
Organisms are remarkably adapted to diverse environments by specialized metabolisms, morphology, or behaviors. To address the molecular mechanisms underlying environmental adaptation, we have utilized a Drosophila melanogaster line, termed “Dark-fly”, which has been maintained in constant dark conditions for 57 years (1400 generations). We found that Dark-fly exhibited higher fecundity in dark than in light conditions, indicating that Dark-fly possesses some traits advantageous in darkness. Using next-generation sequencing technology, we determined the whole genome sequence of Dark-fly and identified approximately 220,000 single nucleotide polymorphisms (SNPs) and 4,700 insertions or deletions (InDels) in the Dark-fly genome compared to the genome of the Oregon-R-S strain, a control strain. 1.8% of SNPs were classified as non-synonymous SNPs (nsSNPs: i.e., they alter the amino acid sequence of gene products). Among them, we detected 28 nonsense mutations (i.e., they produce a stop codon in the protein sequence) in the Dark-fly genome. These included genes encoding an olfactory receptor and a light receptor. We also searched runs of homozygosity (ROH) regions as putative regions selected during the population history, and found 21 ROH regions in the Dark-fly genome. We identified 241 genes carrying nsSNPs or InDels in the ROH regions. These include a cluster of alpha-esterase genes that are involved in detoxification processes. Furthermore, analysis of structural variants in the Dark-fly genome showed the deletion of a gene related to fatty acid metabolism. Our results revealed unique features of the Dark-fly genome and provided a list of potential candidate genes involved in environmental adaptation. PMID:22432011
González-Martínez, Santiago C; Ersoz, Elhan; Brown, Garth R; Wheeler, Nicholas C; Neale, David B
2006-03-01
Genetic association studies are rapidly becoming the experimental approach of choice to dissect complex traits, including tolerance to drought stress, which is the most common cause of mortality and yield losses in forest trees. Optimization of association mapping requires knowledge of the patterns of nucleotide diversity and linkage disequilibrium and the selection of suitable polymorphisms for genotyping. Moreover, standard neutrality tests applied to DNA sequence variation data can be used to select candidate genes or amino acid sites that are putatively under selection for association mapping. In this article, we study the pattern of polymorphism of 18 candidate genes for drought-stress response in Pinus taeda L., an important tree crop. Data analyses based on a set of 21 putatively neutral nuclear microsatellites did not show population genetic structure or genomewide departures from neutrality. Candidate genes had moderate average nucleotide diversity at silent sites (pi(sil) = 0.00853), varying 100-fold among single genes. The level of within-gene LD was low, with an average pairwise r2 of 0.30, decaying rapidly from approximately 0.50 to approximately 0.20 at 800 bp. No apparent LD among genes was found. A selective sweep may have occurred at the early-response-to-drought-3 (erd3) gene, although population expansion can also explain our results and evidence for selection was not conclusive. One other gene, ccoaomt-1, a methylating enzyme involved in lignification, showed dimorphism (i.e., two highly divergent haplotype lineages at equal frequency), which is commonly associated with the long-term action of balancing selection. Finally, a set of haplotype-tagging SNPs (htSNPs) was selected. Using htSNPs, a reduction of genotyping effort of approximately 30-40%, while sampling most common allelic variants, can be gained in our ongoing association studies for drought tolerance in pine.
Casimiro-Soriguer, Inés; Narbona, Eduardo; Buide, M. L.; del Valle, José C.; Whittall, Justen B.
2016-01-01
Flower color polymorphisms are widely used as model traits from genetics to ecology, yet determining the biochemical and molecular basis can be challenging. Anthocyanin-based flower color variations can be caused by at least 12 structural and three regulatory genes in the anthocyanin biosynthetic pathway (ABP). We use mRNA-Seq to simultaneously sequence and estimate expression of these candidate genes in nine samples of Silene littorea representing three color morphs (dark pink, light pink and white) across three developmental stages in hopes of identifying the cause of flower color variation. We identified 29 putative paralogs for the 15 candidate genes in the ABP. We assembled complete coding sequences for 16 structural loci and nine of ten regulatory loci. Among these 29 putative paralogs, we identified 622 SNPs, yet only nine synonymous SNPs in Ans had allele frequencies that differentiated pigmented petals (dark pink and light pink) from white petals. These Ans allele frequency differences were further investigated with an expanded sequencing survey of 38 individuals, yet no SNPs consistently differentiated the color morphs. We also found one locus, F3h1, with strong differential expression between pigmented and white samples (>42x). This may be caused by decreased expression of Myb1a in white petal buds. Myb1a in S. littorea is a regulatory locus closely related to Subgroup 7 Mybs known to regulate F3h and other loci in the first half of the ABP in model species. We then compare the mRNA-Seq results with petal biochemistry which revealed cyanidin as the primary anthocyanin and five flavonoid intermediates. Concentrations of three of the flavonoid intermediates were significantly lower in white petals than in pigmented petals (rutin, quercetin and isovitexin). The biochemistry results for rutin, quercetin, luteolin and apigenin are consistent with the transcriptome results suggesting a blockage at F3h, possibly caused by downregulation of Myb1a. PMID:26973662
Chen, N B; Ma, Y; Yang, T; Lin, F; Fu, W W; Xu, Y J; Li, F; Li, J Y; Gao, S X
2015-08-01
Angiopoietin-like protein 3 (ANGPTL3) is a secreted protein that regulates lipid, glucose and energy metabolism. This study was conducted to better understand the effect of ANGPTL3 on important economic traits in cattle. First, transcript profiles for ANGPTL3 were measured in nine different Jiaxian cattle tissues. Second, polymorphisms were identified in the complete coding region and promoter region of the bovine ANGPTL3 gene in 707 cattle samples. Finally, an association study was carried out utilizing these single nucleotide polymorphisms (SNPs) to determine the effect of these SNPs on the growth and meat quality traits. Quantitative real-time PCR analysis showed that ANGPTL3 was mainly expressed in the liver. The promoter of the bovine ANGPTL3 contained several putative transcription factor binding sites (SF1, HNF-1, LXRα, NFκβ, HNF-3 and C/EBP). In total, four SNPs of the bovine ANGPTL3 gene were identified by direct sequencing. SNP1 (rs469906272: g.-38T>C) was identified in the promoter, SNP2 (rs451104723:g.104A>T) and SNP3 (rs482516226: g.509A>G) were identified in exon 1, and SNP4 (rs477165942: g.8661T>C) was identified in exon 6. Changes in predicted protein structures due to non-synonymous SNPs were analyzed. Haplotype frequencies and linkage disequilibrium were also investigated. Analysis of four SNPs in cattle from different native Chinese breeds (Nanyang (NY) and Jiaxian (JX)) and commercial breeds (Angus (AG), Hereford (HF), Limousin (LM), Luxi (LX), Simmental (ST) and Jinnan (JN)) revealed a significant association with growth traits (including: BW and hipbone width) and meat quality traits (including: Warner-Bratzler shear force and ribeye area). Therefore, implementation of these four mutations in selection indices in the beef industry may be beneficial in selecting individuals with superior growth and meat quality traits.
Genome-Wide Association Mapping of Flowering and Ripening Periods in Apple.
Urrestarazu, Jorge; Muranty, Hélène; Denancé, Caroline; Leforestier, Diane; Ravon, Elisa; Guyader, Arnaud; Guisnel, Rémi; Feugey, Laurence; Aubourg, Sébastien; Celton, Jean-Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick; Ordidge, Matthew; Paprstein, Frantisek; Sedlak, Jiri; Nybom, Hilde; Garkava-Gustavsson, Larisa; Troggio, Michela; Bianco, Luca; Velasco, Riccardo; Poncet, Charles; Théron, Anthony; Moriya, Shigeki; Bink, Marco C A M; Laurens, François; Tartarini, Stefano; Durel, Charles-Eric
2017-01-01
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM), which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16) which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe), and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated regions identified by linkage mapping approaches. Our findings can be used for the improvement of apple through marker-assisted breeding strategies that take advantage of the accumulating additive effects of the identified SNPs.
2010-01-01
Background The Fagaceae family comprises about 1,000 woody species worldwide. About half belong to the Quercus family. These oaks are often a source of raw material for biomass wood and fiber. Pedunculate and sessile oaks, are among the most important deciduous forest tree species in Europe. Despite their ecological and economical importance, very few genomic resources have yet been generated for these species. Here, we describe the development of an EST catalogue that will support ecosystem genomics studies, where geneticists, ecophysiologists, molecular biologists and ecologists join their efforts for understanding, monitoring and predicting functional genetic diversity. Results We generated 145,827 sequence reads from 20 cDNA libraries using the Sanger method. Unexploitable chromatograms and quality checking lead us to eliminate 19,941 sequences. Finally a total of 125,925 ESTs were retained from 111,361 cDNA clones. Pyrosequencing was also conducted for 14 libraries, generating 1,948,579 reads, from which 370,566 sequences (19.0%) were eliminated, resulting in 1,578,192 sequences. Following clustering and assembly using TGICL pipeline, 1,704,117 EST sequences collapsed into 69,154 tentative contigs and 153,517 singletons, providing 222,671 non-redundant sequences (including alternative transcripts). We also assembled the sequences using MIRA and PartiGene software and compared the three unigene sets. Gene ontology annotation was then assigned to 29,303 unigene elements. Blast search against the SWISS-PROT database revealed putative homologs for 32,810 (14.7%) unigene elements, but more extensive search with Pfam, Refseq_protein, Refseq_RNA and eight gene indices revealed homology for 67.4% of them. The EST catalogue was examined for putative homologs of candidate genes involved in bud phenology, cuticle formation, phenylpropanoids biosynthesis and cell wall formation. Our results suggest a good coverage of genes involved in these traits. Comparative orthologous sequences (COS) with other plant gene models were identified and allow to unravel the oak paleo-history. Simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were searched, resulting in 52,834 SSRs and 36,411 SNPs. All of these are available through the Oak Contig Browser http://genotoul-contigbrowser.toulouse.inra.fr:9092/Quercus_robur/index.html. Conclusions This genomic resource provides a unique tool to discover genes of interest, study the oak transcriptome, and develop new markers to investigate functional diversity in natural populations. PMID:21092232
Rheumatoid arthritis association at 6q23
Thomson, Wendy; Barton, Anne; Ke, Xiayi; Eyre, Steve; Hinks, Anne; Bowes, John; Donn, Rachelle; Symmons, Deborah; Hider, Samantha; Bruce, Ian N; Wilson, Anthony G; Marinou, Ioanna; Morgan, Ann; Emery, Paul; Carter, Angela; Steer, Sophia; Hocking, Lynne; Reid, David M; Wordsworth, Paul; Harrison, Pille; Strachan, David; Worthington, Jane
2009-01-01
The Wellcome Trust Case Control Consortium (WTCCC) identified nine single SNPs putatively associated with rheumatoid arthritis at P = 1 × 10 -5 - 5 × 10-7 in a genome-wide association screen. One, rs6920220, was unequivocally replicated (trend P = 1.1 × 10-8) in a validation study, as described here. This SNP maps to 6q23, between the genes oligodendrocyte lineage transcription factor 3 (OLIG3) and tumor necrosis factor-α-induced protein 3 (TNFAIP3). PMID:17982455
HLA-DQA1 and PLA2R1 polymorphisms and risk of idiopathic membranous nephropathy.
Bullich, Gemma; Ballarín, José; Oliver, Artur; Ayasreh, Nadia; Silva, Irene; Santín, Sheila; Díaz-Encarnación, Montserrat M; Torra, Roser; Ars, Elisabet
2014-02-01
Single nucleotide polymorphisms (SNPs) within HLA complex class II HLA-DQ α-chain 1 (HLA-DQA1) and M-type phospholipase A2 receptor (PLA2R1) genes were identified as strong risk factors for idiopathic membranous nephropathy (IMN) development in a recent genome-wide association study. Copy number variants (CNVs) within the Fc gamma receptor III (FCGR3) locus have been associated with several autoimmune diseases, but their role in IMN has not been studied. This study aimed to validate the association of HLA-DQA1 and PLA2R1 risk alleles with IMN in a Spanish cohort, test the putative association of FCGR3A and FCGR3B CNVs with IMN, and assess the use of these genetic factors to predict the clinical outcome of the disease. A Spanish cohort of 89 IMN patients and 286 matched controls without nephropathy was recruited between October of 2009 and July of 2012. Case-control studies for SNPs within HLA-DQA1 (rs2187668) and PLA2R1 (rs4664308) genes and CNVs for FCGR3A and FCGR3B genes were performed. The contribution of these polymorphisms to predict clinical outcome and renal function decline was analyzed. This study validated the association of these HLA-DQA1 and PLA2R1 SNPs with IMN in a Spanish cohort and its increased risk when combining both risk genotypes. No significant association was found between FCGR3 CNVs and IMN. These results revealed that HLA-DQA1 and PLA2R1 genotype combination adjusted for baseline proteinuria strongly predicted response to immunosuppressive therapy. HLA-DQA1 genotype adjusted for proteinuria was also linked with renal function decline. This study confirms that HLA-DQA1 and PLA2R1 genotypes are risk factors for IMN, whereas no association was identified for FCGR3 CNVs. This study provides, for the first time, evidence of the contribution of these HLA-DQA1 and PLA2R1 polymorphisms in predicting IMN response to immunosuppressors and disease progression. Future studies are needed to validate and identify prognostic markers.
Anderson, Eric C
2012-11-08
Advances in genotyping that allow tens of thousands of individuals to be genotyped at a moderate number of single nucleotide polymorphisms (SNPs) permit parentage inference to be pursued on a very large scale. The intergenerational tagging this capacity allows is revolutionizing the management of cultured organisms (cows, salmon, etc.) and is poised to do the same for scientific studies of natural populations. Currently, however, there are no likelihood-based methods of parentage inference which are implemented in a manner that allows them to quickly handle a very large number of potential parents or parent pairs. Here we introduce an efficient likelihood-based method applicable to the specialized case of cultured organisms in which both parents can be reliably sampled. We develop a Markov chain representation for the cumulative number of Mendelian incompatibilities between an offspring and its putative parents and we exploit it to develop a fast algorithm for simulation-based estimates of statistical confidence in SNP-based assignments of offspring to pairs of parents. The method is implemented in the freely available software SNPPIT. We describe the method in detail, then assess its performance in a large simulation study using known allele frequencies at 96 SNPs from ten hatchery salmon populations. The simulations verify that the method is fast and accurate and that 96 well-chosen SNPs can provide sufficient power to identify the correct pair of parents from amongst millions of candidate pairs.
Marlow, Gareth; Han, Dug Yeo; Wickens, Kristin; Stanley, Thorsten; Crane, Julian; Mitchell, Edwin A; Dekker, James; Barthow, Christine; Fitzharris, Penny; Ferguson, Lynnette R; Morgan, Angharad R
2015-05-01
There is strong evidence to support a genetic predisposition to eczema and more recently studies have suggested that probiotics might be used to prevent eczema by modifying the expression of putative allergy-associated genes. The aim of this present study was to investigate whether two probiotics, Lactobacillus rhamnosus HN001 (HN001) and Bifidobacterium animalis subsp. lactis HN019 (HN019), can modify the known genetic predisposition to eczema conferred by genetic variation in the Toll-like receptor (TLR) genes in a high-risk infant population. We selected 54 SNPs in the Toll-like receptor genes. These SNPs were analysed in 331 children of sole European ancestry as part of a double-blind, randomized, placebo-controlled trial examining the effects of HN001 and HN019 supplementation on eczema development and atopic sensitization. The data showed that 26 TLR SNPs interacted with HN001 resulting in a significantly reduced risk of eczema, 18 for eczema severity as defined by SCORAD ≥ 10 and 20 for atopic sensitization compared to placebo. There were only two SNPs that interacted with HN019 resulting in a reduced risk of eczema, eczema severity or atopy. This is the first study to show that the negative impact of specific TLR genotypes may be positively affected by probiotic supplementation. HN001 exhibits a much stronger effect than HN019 in this respect. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Song, Peng; Zhu, Haixia; Zhang, Dong; Chu, Haiyan; Wu, Dongmei; Kang, Meiyun; Wang, Meilin; Gong, Weida; Zhou, Jianwei; Zhang, Zhengdong; Zhao, Qinghong
2014-11-17
Single nucleotide polymorphisms (SNPs) in the 3'-untranslated regions targeted by putative mircoRNA can change its binding strength, affecting the susceptibility and prognosis of cancer. We aimed to investigate the associations between SNPs within miR-148a binding sites and gastric cancer (GC) risk and prognosis. Using bioinformatics tools, we selected two SNPs (SCRN1 rs6976789 and PDYN rs2235749) located in miR-148a target sites. We genotyped the two SNPs in a case-control study comprising 753 GC patients and 949 cancer-free subjects. We found a significantly increased risk of GC associated with the SCRN1 rs6976789 C>T polymorphism [adjusted OR = 1.25, 95% confidence interval (CI) = 1.02-1.53; CT/TT vs. CC]. However, no significant association was found between the PDYN rs2235749 and GC risk in all genetic models. Furthermore, we evaluated whether SCRN1 rs6976789 affected the survival of GC patients. Results showed that individuals with SCRN1 rs6976789 TT genotype had poorer overall survival compared with those carried CC/CT genotypes in intestinal-type GC (adjusted HR = 2.47, 95% CI = 1.21-5.05). Luciferase report assay showed that the rs6976789 variant T allele influenced the binding ability of miR-148a. Our results suggested that the SCRN1 rs6976789 polymorphism may play an important role in the GC development and progression.
Bahri, Bochra A; Daverdin, Guillaume; Xu, Xiangyang; Cheng, Jan-Fang; Barry, Kerrie W; Brummer, E Charles; Devos, Katrien M
2018-06-14
Advances in genomic technologies have expanded our ability to accurately and exhaustively detect natural genomic variants that can be applied in crop improvement and to increase our knowledge of plant evolution and adaptation. Switchgrass (Panicum virgatum L.), an allotetraploid (2n = 4× = 36) perennial C4 grass (Poaceae family) native to North America and a feedstock crop for cellulosic biofuel production, has a large potential for genetic improvement due to its high genotypic and phenotypic variation. In this study, we analyzed single nucleotide polymorphism (SNP) variation in 372 switchgrass genotypes belonging to 36 accessions for 12 genes putatively involved in biomass production to investigate signatures of selection that could have led to ecotype differentiation and to population adaptation to geographic zones. A total of 11,682 SNPs were mined from ~ 15 Gb of sequence data, out of which 251 SNPs were retained after filtering. Population structure analysis largely grouped upland accessions into one subpopulation and lowland accessions into two additional subpopulations. The most frequent SNPs were in homozygous state within accessions. Sixty percent of the exonic SNPs were non-synonymous and, of these, 45% led to non-conservative amino acid changes. The non-conservative SNPs were largely in linkage disequilibrium with one haplotype being predominantly present in upland accessions while the other haplotype was commonly present in lowland accessions. Tajima's test of neutrality indicated that PHYB, a gene involved in photoperiod response, was under positive selection in the switchgrass population. PHYB carried a SNP leading to a non-conservative amino acid change in the PAS domain, a region that acts as a sensor for light and oxygen in signal transduction. Several non-conservative SNPs in genes potentially involved in plant architecture and adaptation have been identified and led to population structure and genetic differentiation of ecotypes in switchgrass. We suggest here that PHYB is a key gene involved in switchgrass natural selection. Further analyses are needed to determine whether any of the non-conservative SNPs identified play a role in the differential adaptation of upland and lowland switchgrass.
Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus
2015-01-01
Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. PMID:26019233
SNPit: a federated data integration system for the purpose of functional SNP annotation.
Shen, Terry H; Carlson, Christopher S; Tarczy-Hornoch, Peter
2009-08-01
Genome wide association studies can potentially identify the genetic causes behind the majority of human diseases. With the advent of more advanced genotyping techniques, there is now an explosion of data gathered on single nucleotide polymorphisms (SNPs). The need exists for an integrated system that can provide up-to-date functional annotation information on SNPs. We have developed the SNP Integration Tool (SNPit) system to address this need. Built upon a federated data integration system, SNPit provides current information on a comprehensive list of SNP data sources. Additional logical inference analysis was included through an inference engine plug in. The SNPit web servlet is available online for use. SNPit allows users to go to one source for up-to-date information on the functional annotation of SNPs. A tool that can help to integrate and analyze the potential functional significance of SNPs is important for understanding the results from genome wide association studies.
In silico SNP analysis of the breast cancer antigen NY-BR-1.
Kosaloglu, Zeynep; Bitzer, Julia; Halama, Niels; Huang, Zhiqin; Zapatka, Marc; Schneeweiss, Andreas; Jäger, Dirk; Zörnig, Inka
2016-11-18
Breast cancer is one of the most common malignancies with increasing incidences every year and a leading cause of death among women. Although early stage breast cancer can be effectively treated, there are limited numbers of treatment options available for patients with advanced and metastatic disease. The novel breast cancer associated antigen NY-BR-1 was identified by SEREX analysis and is expressed in the majority (>70%) of breast tumors as well as metastases, in normal breast tissue, in testis and occasionally in prostate tissue. The biological function and regulation of NY-BR-1 is up to date unknown. We performed an in silico analysis on the genetic variations of the NY-BR-1 gene using data available in public SNP databases and the tools SIFT, Polyphen and Provean to find possible functional SNPs. Additionally, we considered the allele frequency of the found damaging SNPs and also analyzed data from an in-house sequencing project of 55 breast cancer samples for recurring SNPs, recorded in dbSNP. Over 2800 SNPs are recorded in the dbSNP and NHLBI ESP databases for the NY-BR-1 gene. Of these, 65 (2.07%) are synonymous SNPs, 191 (6.09%) are non-synoymous SNPs, and 2430 (77.48%) are noncoding intronic SNPs. As a result, 69 non-synoymous SNPs were predicted to be damaging by at least two, and 16 SNPs were predicted as damaging by all three of the used tools. The SNPs rs200639888, rs367841401 and rs377750885 were categorized as highly damaging by all three tools. Eight damaging SNPs are located in the ankyrin repeat domain (ANK), a domain known for its frequent involvement in protein-protein interactions. No distinctive features could be observed in the allele frequency of the analyzed SNPs. Considering these results we expect to gain more insights into the variations of the NY-BR-1 gene and their possible impact on giving rise to splice variants and therefore influence the function of NY-BR-1 in healthy tissue as well as in breast cancer.
Landscape genomics reveal signatures of local adaptation in barley (Hordeum vulgare L.)
Abebe, Tiegist D.; Naz, Ali A.; Léon, Jens
2015-01-01
Land plants are sessile organisms that cannot escape the adverse climatic conditions of a given environment. Hence, adaptation is one of the solutions to surviving in a challenging environment. This study was aimed at detecting adaptive loci in barley landraces that are affected by selection. To that end, a diverse population of barley landraces was analyzed using the genotyping by sequencing approach. Climatic data for altitude, rainfall and temperature were collected from 61 weather sites near the origin of selected landraces across Ethiopia. Population structure analysis revealed three groups whereas spatial analysis accounted significant similarities at shorter geographic distances (< 40 Km) among barley landraces. Partitioning the variance between climate variables and geographic distances indicated that climate variables accounted for most of the explainable genetic variation. Markers by climatic variables association analysis resulted in altogether 18 and 62 putative adaptive loci using Bayenv and latent factor mixed model (LFMM), respectively. Subsequent analysis of the associated SNPs revealed putative candidate genes for plant adaptation. This study highlights the presence of putative adaptive loci among barley landraces representing original gene pool of the farming communities. PMID:26483825
Safari, Roghaiyeh; Tunca, Zeliha; Özerdem, Ayşegül; Ceylan, Deniz; Yalçın, Yaprak; Sakizli, Meral
2017-01-01
Glial cell-derived neurotrophic factor and other neurotrophins have important role in the development of mental disorders. Here, we aimed to assess the effects of Single nucleotide polymorphisms at potentially regulated regions of GDNF on severity and functionality of bipolar disorder and GDNF serum levels in bipolar disorder patients and healthy volunteers. Severity and functionality of bipolar disorder were evaluated using the Clinical Global Impression and Global Assessment of Functioning scales in sixty-six bipolar disorder patients. The GDNF serum levels obtained from bipolar disorder patients and healthy volunteers who had been already reported SNPs information by our group. GAF scales were lower and GDNF serum levels were higher in Bipolar disorder patients with T/A genotype at 5:37812784 and 5:37812782 compared to patients with T/T genotype. There were significant difference in severity and functionality scores, but not in GDNF serum levels, between patients with G/G and G/A genotype of rs62360370 G > A SNP.rs2075680 C > A and rs79669773 T > C SNPs had no effect on bipolar disorder severity and functionality scores and GDNF serum levels. The results suggest that some SNPs of GDNF have potential association with severity and functionality of bipolar disorder. In addition, except two SNPs, none of GDNF SNPs had association with GDNF serum levels.
Perdiguero, Pedro; Venturas, Martin; Cervera, María Teresa; Gil, Luis; Collada, Carmen
2015-01-01
Elms, especially Ulmus minor and U. americana, are carrying out a hard battle against Dutch elm disease (DED). This vascular wilt disease, caused by Ophiostoma ulmi and O. novo-ulmi, appeared in the twentieth century and killed millions of elms across North America and Europe. Elm breeding and conservation programmes have identified a reduced number of DED tolerant genotypes. In this study, three U. minor genotypes with contrasted levels of tolerance to DED were exposed to several biotic and abiotic stresses in order to (i) obtain a de novo assembled transcriptome of U. minor using 454 pyrosequencing, (ii) perform a functional annotation of the assembled transcriptome, (iii) identify genes potentially involved in the molecular response to environmental stress, and (iv) develop gene-based markers to support breeding programmes. A total of 58,429 putative unigenes were identified after assembly and filtering of the transcriptome. 32,152 of these unigenes showed homology with proteins identified in the genome from the most common plant model species. Well-known family proteins and transcription factors involved in abiotic, biotic or both stresses were identified after functional annotation. A total of 30,693 polymorphisms were identified in 7,125 isotigs, a large number of them corresponding to single nucleotide polymorphisms (SNPs; 27,359). In a subset randomly selected for validation, 87% of the SNPs were confirmed. The material generated may be valuable for future Ulmus gene expression, population genomics and association genetics studies, especially taking into account the scarce molecular information available for this genus and the great impact that DED has on elm populations. PMID:26257751
2011-01-01
Background Identifying the functional importance of the millions of single nucleotide polymorphisms (SNPs) in the human genome is a difficult challenge. Therefore, a reverse strategy, which identifies functionally important SNPs by virtue of the bimodal abundance across the human population of the SNP-related mRNAs will be useful. Those mRNA transcripts that are expressed at two distinct abundances in proportion to SNP allele frequency may warrant further study. Matrix metalloproteinase 1 (MMP1) is important in both normal development and in numerous pathologies. Although much research has been conducted to investigate the expression of MMP1 in many different cell types and conditions, the regulation of its expression is still not fully understood. Results In this study, we used a novel but straightforward method based on agglomerative hierarchical clustering to identify bimodally expressed transcripts in human umbilical vein endothelial cell (HUVEC) microarray data from 15 individuals. We found that MMP1 mRNA abundance was bimodally distributed in un-treated HUVECs and showed a bimodal response to inflammatory mediator treatment. RT-PCR and MMP1 activity assays confirmed the bimodal regulation and DNA sequencing of 69 individuals identified an MMP1 gene promoter polymorphism that segregated precisely with the MMP1 bimodal expression. Chromatin immunoprecipation (ChIP) experiments indicated that the transcription factors (TFs) ETS1, ETS2 and GATA3, bind to the MMP1 promoter in the region of this polymorphism and may contribute to the bimodal expression. Conclusions We describe a simple method to identify putative bimodally expressed RNAs from transcriptome data that is effective yet easy for non-statisticans to understand and use. This method identified bimodal endothelial cell expression of MMP1, which appears to be biologically significant with implications for inflammatory disease. (271 Words) PMID:21244711
Role of the DLGAP2 Gene Encoding the SAP90/PSD-95-Associated Protein 2 in Schizophrenia
Li, Jun-Ming; Lu, Chao-Lin; Cheng, Min-Chih; Luu, Sy-Ueng; Hsu, Shih-Hsin; Hu, Tsung-Ming; Tsai, Hsin-Yao; Chen, Chia-Hsiang
2014-01-01
Aberrant synaptic dysfunction is implicated in the pathogenesis of schizophrenia. The DLGAP2 gene encoding the SAP90/PSD-95-associated protein 2 (SAPAP2) located at the post-synaptic density of neuronal cells is involved in the neuronal synaptic function. This study aimed to investigate whether the DLGAP2 gene is associated with schizophrenia. We resequenced the putative promoter region and all the exons of the DLGAP2 gene in 523 patients with schizophrenia and 596 non-psychotic controls from Taiwan and conducted a case-control association analysis. We identified 19 known SNPs in this sample. Association analysis of 9 SNPs with minor allele frequency greater than 5% showed no association with schizophrenia. However, we found a haplotype (CCACCAACT) significantly associated with schizophrenia (odds ratio:2.5, p<0.001). We also detected 16 missense mutations and 1 amino acid-insertion mutation in this sample. Bioinformatic analysis showed some of these mutations were damaging or pathological to the protein function, but we did not find increased burden of these mutations in the patient group. Notably, we identified 5 private rare variants in 5 unrelated patients, respectively, including c.−69+9C>T, c.−69+13C>T, c.−69+47C>T, c.−69+55C>T at intron 1 and c.−32A>G at untranslated exon 2 of the DLGAP2 gene. These rare variants were not detected in 559 control subjects. Further reporter gene assay of these rare variants except c.−69+13C>T showed significantly elevated promoter activity than the wild type, suggesting increased DLGAP2 gene expression may contribute to the pathogenesis of schizophrenia. Our results indicate that DLGAP2 is a susceptible gene of schizophrenia. PMID:24416398
Tran, C.; Gagnon, F.; Wigg, K.G.; Feng, Y.; Gomez, L.; Cate-Carter, T.D.; Kerr, E.N.; Field, L.L.; Kaplan, B.J.; Lovett, M.W.; Barr, C.L.
2017-01-01
Reading disabilities (RD) have a significant genetic basis and have shown linkage to multiple regions including chromosome 15q. Dyslexia susceptibility 1 candidate gene 1 (DYX1C1) on chromosome 15q21 was originally proposed as a candidate gene with two potentially functional polymorphisms at the −3G/A and 1249G/T positions showing association with RD. However, subsequent studies have yielded mixed results. We performed a literature review and meta-analysis of the −3G/A and 1249G/T polymorphisms, including new unpublished data from two family-based samples. Ten markers in DYX1C1 were genotyped in the two independently ascertained samples. Single marker and −3G/A:1249G/T haplotype analyses were performed for RD in both samples, and quantitative trait analyses using standardized reading-related measures was performed in one of the samples. For the meta-analysis, we used a random-effects model to summarize studies that tested for association between −3G/A or 1249G/T and RD. No significant association was found between the DYX1C1 SNPs and RD or any of the reading-related measures tested after correction for the number of tests performed. The previously reported risk haplotype (−3A:1249T) was not biased in transmission. A total of 9 and 10 study samples were included in the meta-analysis of the −3G/A and 1249G/T polymorphisms, respectively. Neither polymorphism reached statistical significance, but the heterogeneity for the 1249G/T polymorphism was high. The results of this study do not provide evidence for association between the putatively functional SNPs −3G/A and 1249G/T and RD. PMID:23341075
Yan, Hongwei; Cui, Xin; Shen, Xufang; Wang, Lianshun; Jiang, Linan; Liu, Haiying; Liu, Ying; Liu, Qi; Jiang, Chen
2018-06-01
The mantis shrimp Oratosquilla oratoria is a widely distributed, commercially important crustacean species. Although its conservation and the development of successful artificial breeding technologies have recently received considerable attention, there are currently no available data regarding the molecular mechanisms in controlling reproduction. In this study, we performed transcriptome sequencing of the testis, ovary, female and male eyestalks and the androgenic gland of O. oratoria, and compared the expression pattern of transcripts from the testis and ovary libraries to identify genes involved in gonadal development. A total of 147,130,937 clean reads were retrieved after removing the adapters in reads and filtering out low-quality data. All the reads were assembled into 94,990 unigenes (23,133 in testis and ovary) with an average length of 783 base pairs (bp) and N50 of 1502 bp. A search of all-unigenes against COG, GO, KEGG, KOG, Pfam, Swiss-Prot and Nr databases resulted in a total of 19,404 annotated unigenes. Comparison of the sequences in the ovary and testis libraries revealed that 1188 unigenes were up-regulated in the ovary and 2732 were up-regulated in the testis. Twenty ovary-up-regulated and 21 testis-up-regulated unigenes were confirmed by quantitative real-time PCR. Additionally, 13,437 simple sequence repeats (SSRs) and 275,799 putative single nucleotide polymorphisms (SNPs) were identified. The important functional genes and pathways identified here provide a valuable dataset for understanding the molecular mechanisms controlling gonad development in O. oratoria, and the numerous (13,437 SSRs and 275,799 SNPs) molecular markers obtained here will provide fundamental basis for functional genomic and population genetic studies of O. oratoria. Copyright © 2018 Elsevier Inc. All rights reserved.
Saowaphak, P; Duangjinda, M; Plaengkaeo, S; Suwannasing, R; Boonkum, W
2017-06-29
In this study, we estimated the genetic parameters and identified the putative quantitative trait loci (QTL) associated with the length of productive life (LPL), days open (DO), and 305-day milk yield for the first lactation (FM305) of crossbred Holstein dairy cattle. Data comprising 4,739 records collected between 1986 and 2004 were used to estimate the variance-covariance components using the multiple-trait animal linear mixed models based on the average information restricted maximum likelihood (AI-REML) algorithm. Thirty-six animals were genotyped using the Illumina BovineSNP50 Bead Chip [>50,000 single nucleotide polymorphisms (SNPs)] to identify the putative QTL in a genome-wide association study. The heritability of the production trait FM305 was 0.25 and that of the functional traits, LPL and DO, was low (0.10 and 0.06, respectively). The genetic correlation estimates demonstrated favorable negative correlations between LPL and DO (-0.02). However, we observed a favorable positive correlation between FM305 and LPL (0.43) and an unfavorable positive correlation between FM305 and DO (0.1). The GWAS results indicated that 23 QTLs on bovine chromosomes 1, 4, 5, 8, 15, 26, and X were associated with the traits of interest, and the putative QTL regions were identified within seven genes (SYT1, DOCK11, KLHL13, IL13RA1, PRKG1, GNA14, and LRRC4C). In conclusion, the heritability estimates of the LPL and DO were low. Therefore, the approach of multiple-trait selection indexes should be applied, and the QTL identified here should be considered for use in marker-assisted selection in the future.
As-sadi, Falah; Carrere, Sébastien; Gascuel, Quentin; Hourlier, Thibaut; Rengel, David; Le Paslier, Marie-Christine; Bordat, Amandine; Boniface, Marie-Claude; Brunel, Dominique; Gouzy, Jérôme; Godiard, Laurence; Vincourt, Patrick
2011-10-11
Downy mildew in sunflowers (Helianthus annuus L.) is caused by the oomycete Plasmopara halstedii (Farl.) Berlese et de Toni. Despite efforts by the international community to breed mildew-resistant varieties, downy mildew remains a major threat to the sunflower crop. Very few genomic, genetic and molecular resources are currently available to study this pathogen. Using a 454 sequencing method, expressed sequence tags (EST) during the interaction between H. annuus and P. halstedii have been generated and a search was performed for sites in putative effectors to show polymorphisms between the different races of P. halstedii. A 454 pyrosequencing run of two infected sunflower samples (inbred lines XRQ and PSC8 infected with race 710 of P. halstedii, which exhibit incompatible and compatible interactions, respectively) generated 113,720 and 172,107 useable reads. From these reads, 44,948 contigs and singletons have been produced. A bioinformatic portal, HP, was specifically created for in-depth analysis of these clusters. Using in silico filtering, 405 clusters were defined as being specific to oomycetes, and 172 were defined as non-specific oomycete clusters. A subset of these two categories was checked using PCR amplification, and 86% of the tested clusters were validated. Twenty putative RXLR and CRN effectors were detected using PSI-BLAST. Using corresponding sequences from four races (100, 304, 703 and 710), 22 SNPs were detected, providing new information on pathogen polymorphisms. This study identified a large number of genes that are expressed during H. annuus/P. halstedii compatible or incompatible interactions. It also reveals, for the first time, that an infection mechanism exists in P. halstedii similar to that in other oomycetes associated with the presence of putative RXLR and CRN effectors. SNPs discovered in CRN effector sequences were used to determine the genetic distances between the four races of P. halstedii. This work therefore provides valuable tools for further discoveries regarding the H. annuus/P. halstedii pathosystem.
2011-01-01
Background Downy mildew in sunflowers (Helianthus annuus L.) is caused by the oomycete Plasmopara halstedii (Farl.) Berlese et de Toni. Despite efforts by the international community to breed mildew-resistant varieties, downy mildew remains a major threat to the sunflower crop. Very few genomic, genetic and molecular resources are currently available to study this pathogen. Using a 454 sequencing method, expressed sequence tags (EST) during the interaction between H. annuus and P. halstedii have been generated and a search was performed for sites in putative effectors to show polymorphisms between the different races of P. halstedii. Results A 454 pyrosequencing run of two infected sunflower samples (inbred lines XRQ and PSC8 infected with race 710 of P. halstedii, which exhibit incompatible and compatible interactions, respectively) generated 113,720 and 172,107 useable reads. From these reads, 44,948 contigs and singletons have been produced. A bioinformatic portal, HP, was specifically created for in-depth analysis of these clusters. Using in silico filtering, 405 clusters were defined as being specific to oomycetes, and 172 were defined as non-specific oomycete clusters. A subset of these two categories was checked using PCR amplification, and 86% of the tested clusters were validated. Twenty putative RXLR and CRN effectors were detected using PSI-BLAST. Using corresponding sequences from four races (100, 304, 703 and 710), 22 SNPs were detected, providing new information on pathogen polymorphisms. Conclusions This study identified a large number of genes that are expressed during H. annuus/P. halstedii compatible or incompatible interactions. It also reveals, for the first time, that an infection mechanism exists in P. halstedii similar to that in other oomycetes associated with the presence of putative RXLR and CRN effectors. SNPs discovered in CRN effector sequences were used to determine the genetic distances between the four races of P. halstedii. This work therefore provides valuable tools for further discoveries regarding the H. annuus/P. halstedii pathosystem. PMID:21988821
2013-01-01
Background Identification of single nucleotide polymorphisms (SNPs) for specific genes involved in reproduction might improve reliability of genomic estimates for these low-heritability traits. Semen from 550 Holstein bulls of high (≥ 1.7; n = 288) or low (≤ −2; n = 262) daughter pregnancy rate (DPR) was genotyped for 434 candidate SNPs using the Sequenom MassARRAY® system. Three types of SNPs were evaluated: SNPs previously reported to be associated with reproductive traits or physically close to genetic markers for reproduction, SNPs in genes that are well known to be involved in reproductive processes, and SNPs in genes that are differentially expressed between physiological conditions in a variety of tissues associated in reproductive function. Eleven reproduction and production traits were analyzed. Results A total of 40 SNPs were associated (P < 0.05) with DPR. Among these were genes involved in the endocrine system, cell signaling, immune function and inhibition of apoptosis. A total of 10 genes were regulated by estradiol. In addition, 22 SNPs were associated with heifer conception rate, 33 with cow conception rate, 36 with productive life, 34 with net merit, 23 with milk yield, 19 with fat yield, 13 with fat percent, 19 with protein yield, 22 with protein percent, and 13 with somatic cell score. The allele substitution effect for SNPs associated with heifer conception rate, cow conception rate, productive life and net merit were in the same direction as for DPR. Allele substitution effects for several SNPs associated with production traits were in the opposite direction as DPR. Nonetheless, there were 29 SNPs associated with DPR that were not negatively associated with production traits. Conclusion SNPs in a total of 40 genes associated with DPR were identified as well as SNPs for other traits. It might be feasible to include these SNPs into genomic tests of reproduction and other traits. The genes associated with DPR are likely to be important for understanding the physiology of reproduction. Given the large number of SNPs associated with DPR that were not negatively associated with production traits, it should be possible to select for DPR without compromising production. PMID:23759029
Tan, Xiang-Lin; Moyer, Ann M.; Fridley, Brooke L.; Schaid, Daniel J.; Niu, Nifang; Batzler, Anthony J.; Jenkins, Gregory D.; Abo, Ryan P.; Li, Liang; Cunningham, Julie M.; Sun, Zhifu; Yang, Ping; Wang, Liewei
2011-01-01
Purpose Inherited variability in the prognosis of lung cancer patients treated with platinum-based chemotherapy has been widely investigated. However, the overall contribution of genetic variation to platinum response is not well established. To identify novel candidate SNPs/genes, we performed a genome-wide association study (GWAS) for cisplatin cytotoxicity using lymphoblastoid cell lines (LCLs), followed by an association study of selected SNPs from the GWAS with overall survival (OS) in lung cancer patients. Experimental Design GWAS for cisplatin were performed with 283 ethnically diverse LCLs. 168 top SNPs were genotyped in 222 small cell and 961 non-small cell lung cancer (SCLC, NSCLC) patients treated with platinum-based therapy. Association of the SNPs with OS was determined using the Cox regression model. Selected candidate genes were functionally validated by siRNA knockdown in human lung cancer cells. Results Among 157 successfully genotyped SNPs, 9 and 10 SNPs were top SNPs associated with OS for patients with NSCLC and SCLC, respectively, although they were not significant after adjusting for multiple testing. Fifteen genes, including 7 located within 200 kb up or downstream of the four top SNPs and 8 genes for which expression was correlated with three SNPs in LCLs were selected for siRNA screening. Knockdown of DAPK3 and METTL6, for which expression levels were correlated with the rs11169748 and rs2440915 SNPs, significantly decreased cisplatin sensitivity in lung cancer cells. Conclusions This series of clinical and complementary laboratory-based functional studies identified several candidate genes/SNPs that might help predict treatment outcomes for platinum-based therapy of lung cancer. PMID:21775533
A novel method for in silico identification of regulatory SNPs in human genome.
Li, Rong; Zhong, Dexing; Liu, Ruiling; Lv, Hongqiang; Zhang, Xinman; Liu, Jun; Han, Jiuqiang
2017-02-21
Regulatory single nucleotide polymorphisms (rSNPs), kind of functional noncoding genetic variants, can affect gene expression in a regulatory way, and they are thought to be associated with increased susceptibilities to complex diseases. Here a novel computational approach to identify potential rSNPs is presented. Different from most other rSNPs finding methods which based on hypothesis that SNPs causing large allele-specific changes in transcription factor binding affinities are more likely to play regulatory functions, we use a set of documented experimentally verified rSNPs and nonfunctional background SNPs to train classifiers, so the discriminating features are found. To characterize variants, an extensive range of characteristics, such as sequence context, DNA structure and evolutionary conservation etc. are analyzed. Support vector machine is adopted to build the classifier model together with an ensemble method to deal with unbalanced data. 10-fold cross-validation result shows that our method can achieve accuracy with sensitivity of ~78% and specificity of ~82%. Furthermore, our method performances better than some other algorithms based on aforementioned hypothesis in handling false positives. The original data and the source matlab codes involved are available at https://sourceforge.net/projects/rsnppredict/. Copyright © 2016 Elsevier Ltd. All rights reserved.
Allelic expression mapping across cellular lineages to establish impact of non-coding SNPs
Adoue, Veronique; Schiavi, Alicia; Light, Nicholas; Almlöf, Jonas Carlsson; Lundmark, Per; Ge, Bing; Kwan, Tony; Caron, Maxime; Rönnblom, Lars; Wang, Chuan; Chen, Shu-Huang; Goodall, Alison H; Cambien, Francois; Deloukas, Panos; Ouwehand, Willem H; Syvänen, Ann-Christine; Pastinen, Tomi
2014-01-01
Most complex disease-associated genetic variants are located in non-coding regions and are therefore thought to be regulatory in nature. Association mapping of differential allelic expression (AE) is a powerful method to identify SNPs with direct cis-regulatory impact (cis-rSNPs). We used AE mapping to identify cis-rSNPs regulating gene expression in 55 and 63 HapMap lymphoblastoid cell lines from a Caucasian and an African population, respectively, 70 fibroblast cell lines, and 188 purified monocyte samples and found 40–60% of these cis-rSNPs to be shared across cell types. We uncover a new class of cis-rSNPs, which disrupt footprint-derived de novo motifs that are predominantly bound by repressive factors and are implicated in disease susceptibility through overlaps with GWAS SNPs. Finally, we provide the proof-of-principle for a new approach for genome-wide functional validation of transcription factor–SNP interactions. By perturbing NFκB action in lymphoblasts, we identified 489 cis-regulated transcripts with altered AE after NFκB perturbation. Altogether, we perform a comprehensive analysis of cis-variation in four cell populations and provide new tools for the identification of functional variants associated to complex diseases. PMID:25326100
Behl, Jyotsna Dhingra; Mishra, Priyanka; Verma, N K; Niranjan, S K; Dangi, P S; Sharma, Rekha; Behl, Rahul
2016-03-15
The present study was undertaken to characterize the genetic variation present in lymphoxin A gene (LTA gene) encoding for the lymphotoxin A protein also known as tumor necrosis factor beta, a cytokine produced by lymphocytes, known to be cytotoxic for a wide range of tumor cells both in vitro and in vivo, and, which is essential for normal immunological development; in 40 animals of 5 diverse Bos indicus Indian zebu cattle breeds. These breeds survive under the harsh and tough tropical climatic conditions of various parts of the Indian subcontinent. The LTA gene in the present study was observed to contain 33 SNPs and 3 small insertion/deletion polymorphisms. Four SNPs occurred in the coding regions of the gene viz. g.1327A>G and g.1400C>T in exon 2 and g.1840C>T and g.1942C>T in exon 3, of which the SNP g.1327A>G in exon 2 resulted in a non-synonymous amino acid change G38D. This amino acid change was however predicted not be affecting the protein function in any manner. The gene contained putative transcription factor binding sites for the c-Re1 and for Pax-4 transcription factors. A putative promoter region was also predicted on the reverse DNA strand from position 894 to 644. Several repeat elements and microsatellite repeats were detected to be occurring across the 3.2kb LTA gene sequence. The study showed the occurrence of 40 genotypes and 48 most probable haplotypes. The genotypes at the observed SNP positions in the LTA gene were in near Hardy-Weinberg equilibrium. A negative Tajima's D value that was not significant statistically at P>0.10 indicated that the neutral mutation hypothesis could not be excluded. The genetic variations observed in the LTA gene in the present study have not been reported earlier and these could possibly be used as molecular markers for further studies involving association of the gene variability with disease resistance/tolerance traits. Copyright © 2015 Elsevier B.V. All rights reserved.
Genetic dissection of host immune response in pneumonia development and progression
Smelaya, Tamara V.; Belopolskaya, Olesya B.; Smirnova, Svetlana V.; Kuzovlev, Artem N.; Moroz, Viktor V.; Golubev, Arkadiy M.; Pabalan, Noel A.; Salnikova, Lyubov E.
2016-01-01
The role of host genetic variation in pneumonia development and outcome is poorly understood. We studied common polymorphisms in the genes of proinflammatory cytokines (IL6 rs1800795, IL8 rs4073, IL1B rs16944), anti-inflammatory cytokines (IL10 rs1800896, IL4 rs2243250, IL13 rs20541) and toll-like receptors (TLR2 rs5743708 and rs4696480, TLR4 rs4986791, TLR9 rs352139, rs5743836 and rs187084) in patients with community-acquired pneumonia (CAP) (390 cases, 203 controls) and nosocomial pneumonia (355 cases, 216 controls). Experimental data were included in a series of 11 meta-analyses and eight subset analyses related to pneumonia susceptibility and outcome. TLR2 rs5743708 minor genotype appeared to be associated with CAP/Legionnaires’ disease/pneumococcal disease. In CAP patients, the IL6 rs1800795-C allele was associated with severe sepsis/septic shock/severe systemic inflammatory response, while the IL10 rs1800896-A allele protected against the development of these critical conditions. To contribute to deciphering of the above results, we performed an in silico analysis and a qualitative synthesis of literature data addressing basal and stimulated genotype-specific expression level. This data together with database information on transcription factors’ affinity changes caused by SNPs in putative promoter regions, the results of linkage disequilibrium analysis along with SNPs functional annotations supported assumptions about the complexity underlying the revealed associations. PMID:27725770
Xu, Yao; Jiang, Yu; Shi, Tao; Cai, Hanfang; Lan, Xianyong; Zhao, Xin; Plath, Martin; Chen, Hong
2017-01-01
Whole-genome sequencing provides a powerful tool to obtain more genetic variability that could produce a range of benefits for cattle breeding industry. Nanyang (Bos indicus) and Qinchuan (Bos taurus) are two important Chinese indigenous cattle breeds with distinct phenotypes. To identify the genetic characteristics responsible for variation in phenotypes between the two breeds, in the present study, we for the first time sequenced the genomes of four Nanyang and four Qinchuan cattle with 10 to 12 fold on average of 97.86% and 98.98% coverage of genomes, respectively. Comparison with the Bos_taurus_UMD_3.1 reference assembly yielded 9,010,096 SNPs for Nanyang, and 6,965,062 for Qinchuan cattle, 51% and 29% of which were novel SNPs, respectively. A total of 154,934 and 115,032 small indels (1 to 3 bp) were found in the Nanyang and Qinchuan genomes, respectively. The SNP and indel distribution revealed that Nanyang showed a genetically high diversity as compared to Qinchuan cattle. Furthermore, a total of 2,907 putative cases of copy number variation (CNV) were identified by aligning Nanyang to Qinchuan genome, 783 of which (27%) encompassed the coding regions of 495 functional genes. The gene ontology (GO) analysis revealed that many CNV genes were enriched in the immune system and environment adaptability. Among several CNV genes related to lipid transport and fat metabolism, Lepin receptor gene (LEPR) overlapping with CNV_1815 showed remarkably higher copy number in Qinchuan than Nanyang (log2 (ratio) = -2.34988; P value = 1.53E-102). Further qPCR and association analysis investigated that the copy number of the LEPR gene presented positive correlations with transcriptional expression and phenotypic traits, suggesting the LEPR CNV may contribute to the higher fat deposition in muscles of Qinchuan cattle. Our findings provide evidence that the distinct phenotypes of Nanyang and Qinchuan breeds may be due to the different genetic variations including SNPs, indels and CNV.
Genome-wide association study for the level of serum electrolytes in Italian Large White pigs.
Bovo, S; Schiavo, G; Mazzoni, G; Dall'Olio, S; Galimberti, G; Calò, D G; Scotti, E; Bertolini, F; Buttazzoni, L; Samorè, A B; Fontanesi, L
2016-10-01
Calcium, magnesium and phosphorus are essential electrolytes involved in a large number of biological processes. Imbalance of these minerals in blood may indicate clinically relevant conditions and are important in inferring acute or chronic pathologies in humans and animals. In this work, we carried out a genome-wide association study (GWAS) for the level of these three electrolytes in the serum of 843 performance-tested Italian Large White pigs. All pigs were genotyped with the Illumina PorcineSNP60 BeadChip, and GWAS was carried out using genome-wide efficient mixed-model association. For the level of Ca(2+) , eight single nucleotide polymorphisms (SNPs) were significant, considering a false discovery rate (FDR) < 0.05, and another eight were above the moderate association threshold (Pnominal value < 5.00E-05). These SNPs are distributed in four porcine chromosomes (SSC): SSC8, SSC11, SSC12 and SSC13. In particular, a few putative different signals of association detected on SSC13 and one on SSC12 were in genes or close to genes involved in calcium metabolism (P2RY1, RAP2B, SLC9A9, C3orf58, TSC22D2, PLCH1 and CACNB1). Only one SNP (on SSC7) and six SNPs (on SSC2 and SSC7) showed moderate association with the level of magnesium and phosphorus respectively. The association signals for these two latter minerals might identify genes not known thus far for playing a role in their biological functions and regulations. In conclusion, our GWAS contributed to increased knowledge on the role that calcium, magnesium and phosphorus may play in the genetically determined physiological mechanisms affecting the natural variability of mineral levels in mammalian blood. © 2016 Stichting International Foundation for Animal Genetics.
Yang, S; Li, C; Xie, Y; Cui, X; Li, X; Wei, J; Zhang, Y; Yu, Y; Wang, Y; Zhang, S; Zhang, Q; Sun, D
2015-12-01
Our previous RNA sequencing experiment showed that the serum amyloid A2 (SAA2) gene was one of the most promising candidates for milk protein and fat traits in dairy cattle. The SAA2 gene encodes an apolipoprotein related to high-density lipoproteins. To further validate its genetic effects, genotype-phenotype associations were performed in this study. Through resequencing of the entire coding region and the 5'-regulatory region of the SAA2 gene using pooled DNA of 12 unrelated sires, one novel 3-bp insertion-deletion and five previously reported SNPs were detected. These identified SNPs were genotyped and tested for association with five milk production-related traits in 717 Chinese Holstein cows. After Bonferroni correction for multiple t-tests, five of them were found to be statistically significant for milk yield, fat yield and protein yield (P < 0.0001~0.0053). Haplotype-based association analysis revealed a similar effect on fat yield and protein yield (P = 0.0005, P = 0.0032 respectively). Then, using luciferase report assay, the regulatory effect of the three SNPs located in the promoter region (c.-22G>A; c.17G>C; c.114G>A) was evaluated on transcriptional activity. In HEK-293 cell lines, we found that constructs GCG and AGG showed higher luciferase activity compared with GCA (P < 0.01, P < 0.01 respectively). Meanwhile, the prediction of the putative differential transcription factor binding site revealed that c.17G>C and c.114G>A caused the alteration in the transcription factor. Overall, the findings presented here provide the first evidence for associations of the SAA2 gene with milk fat and protein traits, which appears to be a key candidate for milk production traits in dairy cattle. © 2015 Stichting International Foundation for Animal Genetics.
Jiang, Yu; Shi, Tao; Cai, Hanfang; Lan, Xianyong; Zhao, Xin; Plath, Martin; Chen, Hong
2017-01-01
Whole-genome sequencing provides a powerful tool to obtain more genetic variability that could produce a range of benefits for cattle breeding industry. Nanyang (Bos indicus) and Qinchuan (Bos taurus) are two important Chinese indigenous cattle breeds with distinct phenotypes. To identify the genetic characteristics responsible for variation in phenotypes between the two breeds, in the present study, we for the first time sequenced the genomes of four Nanyang and four Qinchuan cattle with 10 to 12 fold on average of 97.86% and 98.98% coverage of genomes, respectively. Comparison with the Bos_taurus_UMD_3.1 reference assembly yielded 9,010,096 SNPs for Nanyang, and 6,965,062 for Qinchuan cattle, 51% and 29% of which were novel SNPs, respectively. A total of 154,934 and 115,032 small indels (1 to 3 bp) were found in the Nanyang and Qinchuan genomes, respectively. The SNP and indel distribution revealed that Nanyang showed a genetically high diversity as compared to Qinchuan cattle. Furthermore, a total of 2,907 putative cases of copy number variation (CNV) were identified by aligning Nanyang to Qinchuan genome, 783 of which (27%) encompassed the coding regions of 495 functional genes. The gene ontology (GO) analysis revealed that many CNV genes were enriched in the immune system and environment adaptability. Among several CNV genes related to lipid transport and fat metabolism, Lepin receptor gene (LEPR) overlapping with CNV_1815 showed remarkably higher copy number in Qinchuan than Nanyang (log2 (ratio) = -2.34988; P value = 1.53E-102). Further qPCR and association analysis investigated that the copy number of the LEPR gene presented positive correlations with transcriptional expression and phenotypic traits, suggesting the LEPR CNV may contribute to the higher fat deposition in muscles of Qinchuan cattle. Our findings provide evidence that the distinct phenotypes of Nanyang and Qinchuan breeds may be due to the different genetic variations including SNPs, indels and CNV. PMID:28841720
Oxytocin receptor gene variations predict neural and behavioral response to oxytocin in autism
Watanabe, Takamitsu; Otowa, Takeshi; Abe, Osamu; Kuwabara, Hitoshi; Aoki, Yuta; Natsubori, Tatsunobu; Takao, Hidemasa; Kakiuchi, Chihiro; Kondo, Kenji; Ikeda, Masashi; Iwata, Nakao; Kasai, Kiyoto; Sasaki, Tsukasa
2017-01-01
Abstract Oxytocin appears beneficial for autism spectrum disorder (ASD), and more than 20 single-nucleotide polymorphisms (SNPs) in oxytocin receptor (OXTR) are relevant to ASD. However, neither biological functions of OXTR SNPs in ASD nor critical OXTR SNPs that determine oxytocin’s effects on ASD remains known. Here, using a machine-learning algorithm that was designed to evaluate collective effects of multiple SNPs and automatically identify most informative SNPs, we examined relationships between 27 representative OXTR SNPs and six types of behavioral/neural response to oxytocin in ASD individuals. The oxytocin effects were extracted from our previous placebo-controlled within-participant clinical trial administering single-dose intranasal oxytocin to 38 high-functioning adult Japanese ASD males. Consequently, we identified six different SNP sets that could accurately predict the six different oxytocin efficacies, and confirmed the robustness of these SNP selections against variations of the datasets and analysis parameters. Moreover, major alleles of several prominent OXTR SNPs—including rs53576 and rs2254298—were found to have dissociable effects on the oxytocin efficacies. These findings suggest biological functions of the OXTR SNP variants on autistic oxytocin responses, and implied that clinical oxytocin efficacy may be genetically predicted before its actual administration, which would contribute to establishment of future precision medicines for ASD. PMID:27798253
Zhang, Wenting; Mirlohi, Shirin; Li, Xiaorong; He, Yuke
2018-06-01
Leaf traits affect plant agronomic performance; for example, leaf hair number provides a morphological indicator of drought and insect resistance. Brassica rapa crops have diverse phenotypes, and many B. rapa single-nucleotide polymorphisms (SNPs) have been identified and used as molecular markers for plant breeding. However, which SNPs are functional for leaf hair traits and, therefore, effective for breeding purposes remains unknown. Here, we identify a set of SNPs in the B. rapa ssp. pekinenesis candidate gene BrpHAIRY LEAVES1 ( BrpHL1 ) and a number of SNPs of BrpHL1 in a natural population of 210 B. rapa accessions that have hairy, margin-only hairy, and hairless leaves. BrpHL1 genes and their orthologs and paralogs have many SNPs. By intensive mutagenesis and genetic transformation, we selected the functional SNPs for leaf hairs by the exclusion of nonfunctional SNPs and the orthologous and paralogous genes. The residue tryptophan-92 of BrpHL1a was essential for direct interaction with GLABROUS3 and, thus, necessary for the formation of leaf hairs. The accessions with the functional SNP leading to substitution of the tryptophan-92 residue had hairless leaves. The orthologous BrcHL1b from B. rapa ssp. chinensis regulates hair formation on leaf margins rather than leaf surfaces. The selected SNP for the hairy phenotype could be adopted as a molecular marker for insect resistance in Brassica spp. crops. Moreover, the procedures optimized here can be used to explain the molecular mechanisms of natural variation and to facilitate the molecular breeding of many crops. © 2018 American Society of Plant Biologists. All rights reserved.
Multifaceted Genomic Risk for Brain Function in Schizophrenia
Chen, Jiayu; Calhoun, Vince D.; Pearlson, Godfrey D.; Ehrlich, Stefan; Turner, Jessica A.; Ho, Beng-Choon; Wassink, Thomas H.; Michael, Andrew M; Liu, Jingyu
2012-01-01
Recently, deriving candidate endophenotypes from brain imaging data has become a valuable approach to study genetic influences on schizophrenia (SZ), whose pathophysiology remains unclear. In this work we utilized a multivariate approach, parallel independent component analysis, to identify genomic risk components associated with brain function abnormalities in SZ. 5157 candidate single nucleotide polymorphisms (SNPs) were derived from genome-wide array based on their possible connections with SZ and further investigated for their associations with brain activations captured with functional magnetic resonance imaging (fMRI) during a sensorimotor task. Using data from 92 SZ patients and 116 healthy controls, we detected a significant correlation (r= 0.29; p= 2.41×10−5) between one fMRI component and one SNP component, both of which significantly differentiated patients from controls. The fMRI component mainly consisted of precentral and postcentral gyri, the major activated regions in the motor task. On average, higher activation in these regions was observed in participants with higher loadings of the linked SNP component, predominantly contributed to by 253 SNPs. 138 identified SNPs were from known coding regions of 100 unique genes. 31 identified SNPs did not differ between groups, but moderately correlated with some other group-discriminating SNPs, indicating interactions among alleles contributing towards elevated SZ susceptibility. The genes associated with the identified SNPs participated in four neurotransmitter pathways: GABA receptor signaling, dopamine receptor signaling, neuregulin signaling and glutamate receptor signaling. In summary, our work provides further evidence for the complexity of genomic risk to the functional brain abnormality in SZ and suggests a pathological role of interactions between SNPs, genes and multiple neurotransmitter pathways. PMID:22440650
SNPit: a federated data integration system for the purpose of functional SNP annotation
Shen, Terry H; Carlson, Christopher S; Tarczy-Hornoch, Peter
2009-01-01
Genome wide association studies can potentially identify the genetic causes behind the majority of human diseases. With the advent of more advanced genotyping techniques, there is now an explosion of data gathered on single nucleotide polymorphisms (SNPs). The need exists for an integrated system that can provide up-to-date functional annotation information on SNPs. We have developed the SNP Integration Tool (SNPit) system to address this need. Built upon a federated data integration system, SNPit provides current information on a comprehensive list of SNP data sources. Additional logical inference analysis was included through an inference engine plug in. The SNPit web servlet is available online for use. SNPit allows users to go to one source for up-to-date information on the functional annotation of SNPs. A tool that can help to integrate and analyze the potential functional significance of SNPs is important for understanding the results from genome wide association studies. PMID:19327864
PExFInS: An Integrative Post-GWAS Explorer for Functional Indels and SNPs
Cheng, Zhongshan; Chu, Hin; Fan, Yanhui; Li, Cun; Song, You-Qiang; Zhou, Jie; Yuen, Kwok-Yung
2015-01-01
Expression quantitative trait loci (eQTLs) mapping and linkage disequilibrium (LD) analysis have been widely employed to interpret findings of genome-wide association studies (GWAS). With the availability of deep sequencing data of 423 lymphoblastoid cell lines (LCLs) from six global populations and the microarray expression data, we performed eQTL analysis, identified more than 228 K SNP cis-eQTLs and 21 K indel cis-eQTLs and generated a LCL cis-eQTL database. We demonstrate that the percentages of population-shared and population-specific cis-eQTLs are comparable; while indel cis-eQTLs in the population-specific subsection make more contribution to gene expression variations than those in the population-shared subsection. We found cis-eQTLs, especially the population-shared cis-eQTLs are significantly enriched toward transcription start site. Moreover, the National Human Genome Research Institute cataloged GWAS SNPs are enriched for LCL cis-eQTLs. Specifically, 32.8% GWAS SNPs are LCL cis-eQTLs, among which 12.5% can be tagged by indel cis-eQTLs, suggesting the fundamental contribution of indel cis-eQTLs to GWAS association signals. To search for functional indels and SNPs tagging GWAS SNPs, a pipeline Post-GWAS Explorer for Functional Indels and SNPs (PExFInS) has been developed, integrating LD analysis, functional annotation from public databases, cis-eQTL mapping with our LCL cis-eQTL database and other published cis-eQTL datasets. PMID:26612672
Friedrich, Juliane; Brand, Bodo; Ponsuksili, Siriluck; Graunke, Katharina L; Langbein, Jan; Knaust, Jacqueline; Kühn, Christa; Schwerin, Manfred
2016-02-01
Behaviour traits of cattle have been reported to affect important production traits, such as meat quality and milk performance as well as reproduction and health. Genetic predisposition is, together with environmental stimuli, undoubtedly involved in the development of behaviour phenotypes. Underlying molecular mechanisms affecting behaviour in general and behaviour and productions traits in particular still have to be studied in detail. Therefore, we performed a genome-wide association study in an F2 Charolais × German Holstein cross-breed population to identify genetic variants that affect behaviour-related traits assessed in an open-field and novel-object test and analysed their putative impact on milk performance. Of 37,201 tested single nucleotide polymorphism (SNPs), four showed a genome-wide and 37 a chromosome-wide significant association with behaviour traits assessed in both tests. Nine of the SNPs that were associated with behaviour traits likewise showed a nominal significant association with milk performance traits. On chromosomes 14 and 29, six SNPs were identified to be associated with exploratory behaviour and inactivity during the novel-object test as well as with milk yield traits. Least squares means for behaviour and milk performance traits for these SNPs revealed that genotypes associated with higher inactivity and less exploratory behaviour promote higher milk yields. Whether these results are due to molecular mechanisms simultaneously affecting behaviour and milk performance or due to a behaviour predisposition, which causes indirect effects on milk performance by influencing individual reactivity, needs further investigation. © 2015 Stichting International Foundation for Animal Genetics.
Population genomics of Pacific lamprey: adaptive variation in a highly dispersive species.
Hess, Jon E; Campbell, Nathan R; Close, David A; Docker, Margaret F; Narum, Shawn R
2013-06-01
Unlike most anadromous fishes that have evolved strict homing behaviour, Pacific lamprey (Entosphenus tridentatus) seem to lack philopatry as evidenced by minimal population structure across the species range. Yet unexplained findings of within-region population genetic heterogeneity coupled with the morphological and behavioural diversity described for the species suggest that adaptive genetic variation underlying fitness traits may be responsible. We employed restriction site-associated DNA sequencing to genotype 4439 quality filtered single nucleotide polymorphism (SNP) loci for 518 individuals collected across a broad geographical area including British Columbia, Washington, Oregon and California. A subset of putatively neutral markers (N = 4068) identified a significant amount of variation among three broad populations: northern British Columbia, Columbia River/southern coast and 'dwarf' adults (F(CT) = 0.02, P ≪ 0.001). Additionally, 162 SNPs were identified as adaptive through outlier tests, and inclusion of these markers revealed a signal of adaptive variation related to geography and life history. The majority of the 162 adaptive SNPs were not independent and formed four groups of linked loci. Analyses with matsam software found that 42 of these outlier SNPs were significantly associated with geography, run timing and dwarf life history, and 27 of these 42 SNPs aligned with known genes or highly conserved genomic regions using the genome browser available for sea lamprey. This study provides both neutral and adaptive context for observed genetic divergence among collections and thus reconciles previous findings of population genetic heterogeneity within a species that displays extensive gene flow. © 2012 John Wiley & Sons Ltd.
Huang, Dongsheng; Yang, Lei; Liu, Yehua; Zhou, Yumin; Guo, Yuan; Pan, Mingan; Wang, Yunnan; Tan, Yigang; Zhong, Haibo; Hu, Min; Lu, Wenju; Ji, Weidong; Wang, Jian; Ran, Pixin; Zhong, Nanshan; Zhou, Yifeng; Lu, Jiachun
2013-04-01
Lung inflammation is the major pathogenetic feature for both chronic obstructive pulmonary disease (COPD) and lung cancer. The nuclear factor-kappa B (NFκB) and its inhibitor (IκB) play crucial roles in inflammatory. Here, we tested the hypothesis that single nucleotide polymorphisms (SNPs) in NFκB/IκB confer consistent risks for COPD and lung cancer. Four putative functional SNPs (NFκB1: -94del>insATTG; NFκB2: -2966G>A; IκBα: -826C>T, 2758G>A) were analyzed in southern and validated in eastern Chineses to test their associations with COPD risk in 1,511 COPD patients and 1,677 normal lung function controls, as well as lung cancer risk in 1,559 lung cancer cases and 1,679 cancer-free controls. We found that the -94ins ATTG variants (ins/del + ins/ins) in NFκB1 conferred an increased risk of COPD (OR 1.27, 95% CI 1.06-1.52) and promoted COPD progression by accelerating annual FEV1 decline (P = 0.015). The 2758AA variant in IκBα had an increased risk of lung cancer (OR 1.53, 95% CI 1.30-1.80) by decreasing IκBα expression due to the modulation of microRNA hsa-miR-449a but not hsa-miR-34b. Furthermore, both adverse genotypes exerted effect on increasing lung cancer risk in individuals with pre-existing COPD, while the -94del>insATTG did not in those without pre-existing COPD. However, no significant association with COPD or lung cancer was observed for -2966G>A and -826C>T. Our data suggested a common susceptible mechanism of inflammation in lung induced by genetic variants in NFκB1 (-94del>ins ATTG) or IκBα (2758G>A) to predict risk of COPD or lung cancer.
Windelinckx, An; De Mars, Gunther; Huygens, Wim; Peeters, Maarten W; Vincent, Barbara; Wijmenga, Cisca; Lambrechts, Diether; Delecluse, Christophe; Roth, Stephen M; Metter, E Jeffrey; Ferrucci, Luigi; Aerssens, Jeroen; Vlietinck, Robert; Beunen, Gaston P; Thomis, Martine A
2011-01-01
Muscle strength is important in functional activities of daily living and the prevention of common pathologies. We describe the two-staged fine mapping of a previously identified linkage peak for knee strength on chr12q12-14. First, 209 tagSNPs in/around 74 prioritized genes were genotyped in 500 Caucasian brothers from the Leuven Genes for Muscular Strength study (LGfMS). Combined linkage and family-based association analyses identified activin receptor 1B (ACVR1B) and inhibin β C (INHBC), part of the transforming growth factor β pathway regulating myostatin – a negative regulator of muscle mass – signaling, for follow-up. Second, 33 SNPs, selected in these genes based on their likelihood to functionally affect gene expression/function, were genotyped in an extended sample of 536 LGfMS siblings. Strong associations between ACVR1B genotypes and knee muscle strength (P-values up to 0.00002) were present. Of particular interest was the association with rs2854464, located in a putative miR-24-binding site, as miR-24 was implicated in the inhibition of skeletal muscle differentiation. Rs2854464 AA individuals were ∼2% stronger than G-allele carriers. The strength increasing effect of the A-allele was also observed in an independent replication sample (n=266) selected from the Baltimore Longitudinal Study of Aging and a Flemish Policy Research Centre Sport, Physical Activity and Health study. However, no genotype-related difference in ACVR1B mRNA expression in quadriceps muscle was observed. In conclusion, we applied a two-stage fine mapping approach, and are the first to identify and partially replicate genetic variants in the ACVR1B gene that account for genetic variation in human muscle strength. PMID:21063444
Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus
2015-08-15
Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. © The Author 2015. Published by Oxford University Press.
Thirunavukkarasu, Nepolean; Hossain, Firoz; Arora, Kanika; Sharma, Rinku; Shiriga, Kaliyugam; Mittal, Swati; Mohan, Sweta; Namratha, Pottekatt Mohanlal; Dogga, Sreelatha; Rani, Tikka Shobha; Katragadda, Sumalini; Rathore, Abhishek; Shah, Trushar; Mohapatra, Trilochan; Gupta, Hari Shankar
2014-12-24
Earlier studies were focused on the genetics of temperate and tropical maize under drought. We identified genetic loci and their association with functional mechanisms in 240 accessions of subtropical maize using a high-density marker set under water stress. Out of 61 significant SNPs (11 were false-discovery-rate-corrected associations), identified across agronomic traits, models, and locations by subjecting the accessions to water stress at flowering stage, 48% were associated with drought-tolerant genes. Maize gene models revealed that SNPs mapped for agronomic traits were in fact associated with number of functional traits as follows: stomatal closure, 28; flowering, 15; root development, 5; detoxification, 4; and reduced water potential, 2. Interactions of these SNPS through the functional traits could lead to drought tolerance. The SNPs associated with ABA-dependent signalling pathways played a major role in the plant's response to stress by regulating a series of functions including flowering, root development, auxin metabolism, guard cell functions, and scavenging reactive oxygen species (ROS). ABA signalling genes regulate flowering through epigenetic changes in stress-responsive genes. ROS generated by ABA signalling are reduced by the interplay between ethylene, ABA, and detoxification signalling transductions. Integration of ABA-signalling genes with auxin-inducible genes regulates root development which in turn, maintains the water balance by regulating electrochemical gradient in plant. Several genes are directly or indirectly involved in the functioning of agronomic traits related to water stress. Genes involved in these crucial biological functions interacted significantly in order to maintain the primary as well as exclusive functions related to coping with water stress. SNPs associated with drought-tolerant genes involved in strategic biological functions will be useful to understand the mechanisms of drought tolerance in subtropical maize.
Portnoy, D S; Puritz, J B; Hollenbeck, C M; Gelsleichter, J; Chapman, D; Gold, J R
2015-12-01
Sex-biased dispersal is expected to homogenize nuclear genetic variation relative to variation in genetic material inherited through the philopatric sex. When site fidelity occurs across a heterogeneous environment, local selective regimes may alter this pattern. We assessed spatial patterns of variation in nuclear-encoded, single nucleotide polymorphisms (SNPs) and sequences of the mitochondrial control region in bonnethead sharks (Sphyrna tiburo), a species thought to exhibit female philopatry, collected from summer habitats used for gestation. Geographic patterns of mtDNA haplotypes and putatively neutral SNPs confirmed female philopatry and male-mediated gene flow along the northeastern coast of the Gulf of Mexico. A total of 30 outlier SNP loci were identified; alleles at over half of these loci exhibited signatures of latitude-associated selection. Our results indicate that in species with sex-biased dispersal, philopatry can facilitate sorting of locally adaptive variation, with the dispersing sex facilitating movement of potentially adaptive variation among locations and environments. © 2015 John Wiley & Sons Ltd.
Rand, Kristin A.; Song, Chi; Dean, Eric; Serie, Daniel J.; Curtin, Karen; Sheng, Xin; Hu, Donglei; Huff, Carol Ann; Bernal-Mizrachi, Leon; Tomasson, Michael H.; Ailawadhi, Sikander; Singhal, Seema; Pawlish, Karen; Peters, Edward S.; Bock, Cathryn H.; Stram, Alex; Van Den Berg, David J; Edlund, Christopher K.; V.Conti, David; Zimmerman, Todd; Hwang, Amie E.; Huntsman, Scott; Graff, John; Nooka, Ajay; Kong, Yinfei; Pregja, Silvana L.; Berndt, Sonja I.; Blot, William J.; Carpten, John; Casey, Graham; Chu, Lisa; Diver, W. Ryan; Stevens, Victoria L.; Lieber, Michael R.; Goodman, Phyllis J.; Hennis, Anselm J.M.; Hsing, Ann W.; Mehta, Jayesh; Kittles, Rick A.; Kolb, Suzanne; Klein, Eric A.; Leske, Cristina; Murphy, Adam B.; Nemesure, Barbara; Neslund-Dudas, Christine; Strom, Sara S.; Vij, Ravi; Rybicki, Benjamin A.; Stanford, Janet L.; Signorello, Lisa B.; Witte, John S.; Ambrosone, Christine B.; Bhatti, Parveen; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Olshan, Andrew F.; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah J.; Bandera, Elisa V.; Birmann, Brenda M.; Ingles, Sue A.; Press, Michael F.; Atanackovic, Djordje; Glenn, Martha J.; Cannon-Albright, Lisa A.; Jones, Brandt; Tricot, Guido; Martin, Thomas G.; Kumar, Shaji K.; Wolf, Jeffrey L.; Deming, Sandra L.; Rothman, Nathaniel; Brooks-Wilson, Angela R.; Rajkumar, S. Vincent; Kolonel, Laurence N.; Chanock, Stephen J.; Slager, Susan L.; Severson, Richard K.; Janakiraman, Nalini; Terebelo, Howard R.; Brown, Elizabeth E.; De Roos, Anneclaire J.; Mohrbacher, Ann F.; Colditz, Graham A.; Giles, Graham G.; Spinelli, John J.; Chiu, Brian C.; Munshi, Nikhil C.; Anderson, Kenneth C.; Levy, Joan; Zonder, Jeffrey A.; Orlowski, Robert Z.; Lonial, Sagar; Camp, Nicola J.; Vachon, Celine M.; Ziv, Elad; Stram, Daniel O.; Hazelett, Dennis J.; Haiman, Christopher A.; Cozen, Wendy
2017-01-01
Background Genome-wide association studies (GWAS) in European populations have identified genetic risk variants associated with multiple myeloma (MM). Methods We performed association testing of common variation in eight regions in 1,264 MM patients and 1,479 controls of European ancestry (EA) and 1,305 MM patients and 7,078 controls of African ancestry (AA) and conducted a meta-analysis to localize the signals, with epigenetic annotation used to predict functionality. Results We found that variants in 7p15.3, 17p11.2, 22q13.1 were statistically significantly (p<0.05) associated with MM risk in AAs and EAs and the variant in 3p22.1 was associated in EAs only. In a combined AA-EA meta-analysis, variation in five regions (2p23.3, 3p22.1, 7p15.3, 17p11.2, 22q13.1) was statistically signficantly associated with MM risk. In 3p22.1, the correlated variants clustered within the gene body of ULK4. Correlated variants in 7p15.3 clustered around an enhancer at the 3′ end of the CDCA7L transcription termination site. A missense variant at 17p11.2 (rs34562254, Pro251Leu, OR=1.32, p=2.93×10−7) in TNFRSF13B, encodes a lymphocyte-specific protein in the tumor necrosis factor receptor family that interacts with the NF-κB pathway. SNPs correlated with the index signal in 22q13.1 cluster around the promoter and enhancer regions of CBX7. Conclusions We found that reported MM susceptibility regions contain risk variants important across populations supporting the use of multiple racial/ethnic groups with different underlying genetic architecture to enhance the localization and identification of putatively functional alleles. Impact A subset of reported risk loci for multiple myeloma have consistent affects across populations and are likely to be functional. PMID:27587788
Gala, Manish; Abecasis, Goncalo; Bezieau, Stephane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J.; Carlson, Christopher S.; Casey, Graham; Chang-Claude, Jenny; Conti, David V.; Curtis, Keith R.; Duggan, David; Gallinger, Steven; Haile, Robert W.; Harrison, Tabitha A.; Hayes, Richard B.; Hoffmeister, Michael; Hopper, John L.; Hudson, Thomas J.; Jenkins, Mark A.; Küry, Sébastien; Le Marchand, Loic; Leal, Suzanne M.; Newcomb, Polly A.; Nickerson, Deborah A.; Potter, John D.; Schoen, Robert E.; Schumacher, Fredrick R.; Seminara, Daniela; Slattery, Martha L.; Hsu, Li; Chan, Andrew T.; White, Emily; Berndt, Sonja I.; Peters, Ulrike
2016-01-01
Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) associated with colorectal cancer risk. These SNPs may tag correlated variants with biological importance. Fine-mapping around GWAS loci can facilitate detection of functional candidates and additional independent risk variants. We analyzed 11,900 cases and 14,311 controls in the Genetics and Epidemiology of Colorectal Cancer Consortium and the Colon Cancer Family Registry. To fine-map genomic regions containing all known common risk variants, we imputed high-density genetic data from the 1000 Genomes Project. We tested single-variant associations with colorectal tumor risk for all variants spanning genomic regions 250-kb upstream or downstream of 31 GWAS-identified SNPs (index SNPs). We queried the University of California, Santa Cruz Genome Browser to examine evidence for biological function. Index SNPs did not show the strongest association signals with colorectal tumor risk in their respective genomic regions. Bioinformatics analysis of SNPs showing smaller P-values in each region revealed 21 functional candidates in 12 loci (5q31.1, 8q24, 11q13.4, 11q23, 12p13.32, 12q24.21, 14q22.2, 15q13, 18q21, 19q13.1, 20p12.3, and 20q13.33). We did not observe evidence of additional independent association signals in GWAS-identified regions. Our results support the utility of integrating data from comprehensive fine-mapping with expanding publicly available genomic databases to help clarify GWAS associations and identify functional candidates that warrant more onerous laboratory follow-up. Such efforts may aid the eventual discovery of disease-causing variant(s). PMID:27379672
Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David
2018-04-11
Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.
Genome-Wide Association Mapping of Flowering and Ripening Periods in Apple
Urrestarazu, Jorge; Muranty, Hélène; Denancé, Caroline; Leforestier, Diane; Ravon, Elisa; Guyader, Arnaud; Guisnel, Rémi; Feugey, Laurence; Aubourg, Sébastien; Celton, Jean-Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick; Ordidge, Matthew; Paprstein, Frantisek; Sedlak, Jiri; Nybom, Hilde; Garkava-Gustavsson, Larisa; Troggio, Michela; Bianco, Luca; Velasco, Riccardo; Poncet, Charles; Théron, Anthony; Moriya, Shigeki; Bink, Marco C. A. M.; Laurens, François; Tartarini, Stefano; Durel, Charles-Eric
2017-01-01
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM), which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16) which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe), and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated regions identified by linkage mapping approaches. Our findings can be used for the improvement of apple through marker-assisted breeding strategies that take advantage of the accumulating additive effects of the identified SNPs. PMID:29176988
Identification of novel drought-tolerant-associated SNPs in common bean (Phaseolus vulgaris)
Villordo-Pineda, Emiliano; González-Chavira, Mario M.; Giraldo-Carbajo, Patricia; Acosta-Gallegos, Jorge A.; Caballero-Pérez, Juan
2015-01-01
Common bean (Phaseolus vulgaris L.) is a leguminous in high demand for human nutrition and a very important agricultural product. Production of common bean is constrained by environmental stresses such as drought. Although conventional plant selection has been used to increase production yield and stress tolerance, drought tolerance selection based on phenotype is complicated by associated physiological, anatomical, cellular, biochemical, and molecular changes. These changes are modulated by differential gene expression. A common method to identify genes associated with phenotypes of interest is the characterization of Single Nucleotide Polymorphims (SNPs) to link them to specific functions. In this work, we selected two drought-tolerant parental lines from Mesoamerica, Pinto Villa, and Pinto Saltillo. The parental lines were used to generate a population of 282 families (F3:5) and characterized by 169 SNPs. We associated the segregation of the molecular markers in our population with phenotypes including flowering time, physiological maturity, reproductive period, plant, seed and total biomass, reuse index, seed yield, weight of 100 seeds, and harvest index in three cultivation cycles. We observed 83 SNPs with significant association (p < 0.0003 after Bonferroni correction) with our quantified phenotypes. Phenotypes most associated were days to flowering and seed biomass with 58 and 44 associated SNPs, respectively. Thirty-seven out of the 83 SNPs were annotated to a gene with a potential function related to drought tolerance or relevant molecular/biochemical functions. Some SNPs such as SNP28 and SNP128 are related to starch biosynthesis, a common osmotic protector; and SNP18 is related to proline biosynthesis, another well-known osmotic protector. PMID:26257755
Identification of novel drought-tolerant-associated SNPs in common bean (Phaseolus vulgaris).
Villordo-Pineda, Emiliano; González-Chavira, Mario M; Giraldo-Carbajo, Patricia; Acosta-Gallegos, Jorge A; Caballero-Pérez, Juan
2015-01-01
Common bean (Phaseolus vulgaris L.) is a leguminous in high demand for human nutrition and a very important agricultural product. Production of common bean is constrained by environmental stresses such as drought. Although conventional plant selection has been used to increase production yield and stress tolerance, drought tolerance selection based on phenotype is complicated by associated physiological, anatomical, cellular, biochemical, and molecular changes. These changes are modulated by differential gene expression. A common method to identify genes associated with phenotypes of interest is the characterization of Single Nucleotide Polymorphims (SNPs) to link them to specific functions. In this work, we selected two drought-tolerant parental lines from Mesoamerica, Pinto Villa, and Pinto Saltillo. The parental lines were used to generate a population of 282 families (F3:5) and characterized by 169 SNPs. We associated the segregation of the molecular markers in our population with phenotypes including flowering time, physiological maturity, reproductive period, plant, seed and total biomass, reuse index, seed yield, weight of 100 seeds, and harvest index in three cultivation cycles. We observed 83 SNPs with significant association (p < 0.0003 after Bonferroni correction) with our quantified phenotypes. Phenotypes most associated were days to flowering and seed biomass with 58 and 44 associated SNPs, respectively. Thirty-seven out of the 83 SNPs were annotated to a gene with a potential function related to drought tolerance or relevant molecular/biochemical functions. Some SNPs such as SNP28 and SNP128 are related to starch biosynthesis, a common osmotic protector; and SNP18 is related to proline biosynthesis, another well-known osmotic protector.
Engström, Karin; Ameer, Shegufta; Bernaudat, Ludovic; Drasch, Gustav; Baeuml, Jennifer; Skerfving, Staffan; Bose-O'Reilly, Stephan; Broberg, Karin
2013-01-01
Elemental mercury (Hg0) is widely used in small-scale gold mining. Persons working or living in mining areas have high urinary concentrations of Hg (U-Hg). Differences in genes encoding potential Hg-transporters may affect uptake and elimination of Hg. We aimed to identify single nucleotide polymorphisms (SNPs) in Hg-transporter genes that modify U-Hg. Men and women (1,017) from Indonesia, the Philippines, Tanzania, and Zimbabwe were classified either as controls (no Hg exposure from gold mining) or as having low (living in a gold-mining area) or high exposure (working as gold miners). U-Hg was analyzed by cold-vapor atomic absorption spectrometry. Eighteen SNPs in eight Hg-transporter genes were analyzed. U-Hg concentrations were higher among ABCC2/MRP2 rs1885301 A-allele carriers than among GG homozygotes in all populations, though differences were not statistically significant in most cases. MRP2 SNPs showed particularly strong associations with U-Hg in the subgroup with highest exposure (miners in Zimbabwe), whereas rs1885301 A-allele carriers had higher U-Hg than GG homozygotes [geometric mean (GM): 36.4 µg/g creatinine vs. 21.9; p = 0.027], rs2273697 GG homozygotes had higher U-Hg than A-allele carriers (GM: 37.4 vs. 16.7; p = 0.001), and rs717620 A-allele carriers had higher U-Hg than GG homozygotes (GM: 83 vs. 28; p = 0.084). The SLC7A5/LAT1 rs33916661 GG genotype was associated with higher U-Hg in all populations (statistically significant for all Tanzanians combined). SNPs in SLC22A6/OAT1 (rs4149170) and SLC22A8/OAT3 (rs4149182) were associated with U-Hg mainly in the Tanzanian study groups. SNPs in putative Hg-transporter genes may influence U-Hg concentrations.
Genome-wide DNA polymorphisms in two cultivars of mei (Prunus mume sieb. et zucc.).
Sun, Lidan; Zhang, Qixiang; Xu, Zongda; Yang, Weiru; Guo, Yu; Lu, Jiuxing; Pan, Huitang; Cheng, Tangren; Cai, Ming
2013-10-06
Mei (Prunus mume Sieb. et Zucc.) is a famous ornamental plant and fruit crop grown in East Asian countries. Limited genetic resources, especially molecular markers, have hindered the progress of mei breeding projects. Here, we performed low-depth whole-genome sequencing of Prunus mume 'Fenban' and Prunus mume 'Kouzi Yudie' to identify high-quality polymorphic markers between the two cultivars on a large scale. A total of 1464.1 Mb and 1422.1 Mb of 'Fenban' and 'Kouzi Yudie' sequencing data were uniquely mapped to the mei reference genome with about 6-fold coverage, respectively. We detected a large number of putative polymorphic markers from the 196.9 Mb of sequencing data shared by the two cultivars, which together contained 200,627 SNPs, 4,900 InDels, and 7,063 SSRs. Among these markers, 38,773 SNPs, 174 InDels, and 418 SSRs were distributed in the 22.4 Mb CDS region, and 63.0% of these marker-containing CDS sequences were assigned to GO terms. Subsequently, 670 selected SNPs were validated using an Agilent's SureSelect solution phase hybridization assay. A subset of 599 SNPs was used to assess the genetic similarity of a panel of mei germplasm samples and a plum (P. salicina) cultivar, producing a set of informative diversity data. We also analyzed the frequency and distribution of detected InDels and SSRs in mei genome and validated their usefulness as DNA markers. These markers were successfully amplified in the cultivars and in their segregating progeny. A large set of high-quality polymorphic SNPs, InDels, and SSRs were identified in parallel between 'Fenban' and 'Kouzi Yudie' using low-depth whole-genome sequencing. The study presents extensive data on these polymorphic markers, which can be useful for constructing high-resolution genetic maps, performing genome-wide association studies, and designing genomic selection strategies in mei.
Darabi, Hatef; McCue, Karen; Beesley, Jonathan; Michailidou, Kyriaki; Nord, Silje; Kar, Siddhartha; Humphreys, Keith; Thompson, Deborah; Ghoussaini, Maya; Bolla, Manjeet K.; Dennis, Joe; Wang, Qin; Canisius, Sander; Scott, Christopher G.; Apicella, Carmel; Hopper, John L.; Southey, Melissa C.; Stone, Jennifer; Broeks, Annegien; Schmidt, Marjanka K.; Scott, Rodney J.; Lophatananon, Artitaya; Muir, Kenneth; Beckmann, Matthias W.; Ekici, Arif B.; Fasching, Peter A.; Heusinger, Katharina; dos-Santos-Silva, Isabel; Peto, Julian; Tomlinson, Ian; Sawyer, Elinor J.; Burwinkel, Barbara; Marme, Frederik; Guénel, Pascal; Truong, Thérèse; Bojesen, Stig E.; Flyger, Henrik; Benitez, Javier; González-Neira, Anna; Anton-Culver, Hoda; Neuhausen, Susan L.; Arndt, Volker; Brenner, Hermann; Engel, Christoph; Meindl, Alfons; Schmutzler, Rita K.; Arnold, Norbert; Brauch, Hiltrud; Hamann, Ute; Chang-Claude, Jenny; Khan, Sofia; Nevanlinna, Heli; Ito, Hidemi; Matsuo, Keitaro; Bogdanova, Natalia V.; Dörk, Thilo; Lindblom, Annika; Margolin, Sara; Kosma, Veli-Matti; Mannermaa, Arto; Tseng, Chiu-chen; Wu, Anna H.; Floris, Giuseppe; Lambrechts, Diether; Rudolph, Anja; Peterlongo, Paolo; Radice, Paolo; Couch, Fergus J.; Vachon, Celine; Giles, Graham G.; McLean, Catriona; Milne, Roger L.; Dugué, Pierre-Antoine; Haiman, Christopher A.; Maskarinec, Gertraud; Woolcott, Christy; Henderson, Brian E.; Goldberg, Mark S.; Simard, Jacques; Teo, Soo H.; Mariapun, Shivaani; Helland, Åslaug; Haakensen, Vilde; Zheng, Wei; Beeghly-Fadiel, Alicia; Tamimi, Rulla; Jukkola-Vuorinen, Arja; Winqvist, Robert; Andrulis, Irene L.; Knight, Julia A.; Devilee, Peter; Tollenaar, Robert A.E.M.; Figueroa, Jonine; García-Closas, Montserrat; Czene, Kamila; Hooning, Maartje J.; Tilanus-Linthorst, Madeleine; Li, Jingmei; Gao, Yu-Tang; Shu, Xiao-Ou; Cox, Angela; Cross, Simon S.; Luben, Robert; Khaw, Kay-Tee; Choi, Ji-Yeob; Kang, Daehee; Hartman, Mikael; Lim, Wei Yen; Kabisch, Maria; Torres, Diana; Jakubowska, Anna; Lubinski, Jan; McKay, James; Sangrajrang, Suleeporn; Toland, Amanda E.; Yannoukakos, Drakoulis; Shen, Chen-Yang; Yu, Jyh-Cherng; Ziogas, Argyrios; Schoemaker, Minouk J.; Swerdlow, Anthony; Borresen-Dale, Anne-Lise; Kristensen, Vessela; French, Juliet D.; Edwards, Stacey L.; Dunning, Alison M.; Easton, Douglas F.; Hall, Per; Chenevix-Trench, Georgia
2015-01-01
Genome-wide association studies have identified SNPs near ZNF365 at 10q21.2 that are associated with both breast cancer risk and mammographic density. To identify the most likely causal SNPs, we fine mapped the association signal by genotyping 428 SNPs across the region in 89,050 European and 12,893 Asian case and control subjects from the Breast Cancer Association Consortium. We identified four independent sets of correlated, highly trait-associated variants (iCHAVs), three of which were located within ZNF365. The most strongly risk-associated SNP, rs10995201 in iCHAV1, showed clear evidence of association with both estrogen receptor (ER)-positive (OR = 0.85 [0.82–0.88]) and ER-negative (OR = 0.87 [0.82–0.91]) disease, and was also the SNP most strongly associated with percent mammographic density. iCHAV2 (lead SNP, chr10: 64,258,684:D) and iCHAV3 (lead SNP, rs7922449) were also associated with ER-positive (OR = 0.93 [0.91–0.95] and OR = 1.06 [1.03–1.09]) and ER-negative (OR = 0.95 [0.91–0.98] and OR = 1.08 [1.04–1.13]) disease. There was weaker evidence for iCHAV4, located 5′ of ADO, associated only with ER-positive breast cancer (OR = 0.93 [0.90–0.96]). We found 12, 17, 18, and 2 candidate causal SNPs for breast cancer in iCHAVs 1–4, respectively. Chromosome conformation capture analysis showed that iCHAV2 interacts with the ZNF365 and NRBF2 (more than 600 kb away) promoters in normal and cancerous breast epithelial cells. Luciferase assays did not identify SNPs that affect transactivation of ZNF365, but identified a protective haplotype in iCHAV2, associated with silencing of the NRBF2 promoter, implicating this gene in the etiology of breast cancer. PMID:26073781
Ameer, Shegufta; Bernaudat, Ludovic; Drasch, Gustav; Baeuml, Jennifer; Skerfving, Staffan; Bose-O’Reilly, Stephan; Broberg, Karin
2012-01-01
Background: Elemental mercury (Hg0) is widely used in small-scale gold mining. Persons working or living in mining areas have high urinary concentrations of Hg (U-Hg). Differences in genes encoding potential Hg-transporters may affect uptake and elimination of Hg. Objective: We aimed to identify single nucleotide polymorphisms (SNPs) in Hg-transporter genes that modify U-Hg. Methods: Men and women (1,017) from Indonesia, the Philippines, Tanzania, and Zimbabwe were classified either as controls (no Hg exposure from gold mining) or as having low (living in a gold-mining area) or high exposure (working as gold miners). U-Hg was analyzed by cold-vapor atomic absorption spectrometry. Eighteen SNPs in eight Hg-transporter genes were analyzed. Results: U-Hg concentrations were higher among ABCC2/MRP2 rs1885301 A–allele carriers than among GG homozygotes in all populations, though differences were not statistically significant in most cases. MRP2 SNPs showed particularly strong associations with U-Hg in the subgroup with highest exposure (miners in Zimbabwe), whereas rs1885301 A–allele carriers had higher U-Hg than GG homozygotes [geometric mean (GM): 36.4 µg/g creatinine vs. 21.9; p = 0.027], rs2273697 GG homozygotes had higher U-Hg than A–allele carriers (GM: 37.4 vs. 16.7; p = 0.001), and rs717620 A–allele carriers had higher U-Hg than GG homozygotes (GM: 83 vs. 28; p = 0.084). The SLC7A5/LAT1 rs33916661 GG genotype was associated with higher U-Hg in all populations (statistically significant for all Tanzanians combined). SNPs in SLC22A6/OAT1 (rs4149170) and SLC22A8/OAT3 (rs4149182) were associated with U-Hg mainly in the Tanzanian study groups. Conclusions: SNPs in putative Hg-transporter genes may influence U-Hg concentrations. PMID:23052037
NASA Astrophysics Data System (ADS)
Liu, Hongna; Li, Song; Wang, Zhifei; Li, Zhiyang; Deng, Yan; Wang, Hua; Shi, Zhiyang; He, Nongyue
2008-11-01
Single nucleotide polymorphisms (SNPs) comprise the most abundant source of genetic variation in the human genome wide codominant SNPs identification. Therefore, large-scale codominant SNPs identification, especially for those associated with complex diseases, has induced the need for completely high-throughput and automated SNP genotyping method. Herein, we present an automated detection system of SNPs based on two kinds of functional magnetic nanoparticles (MNPs) and dual-color hybridization. The amido-modified MNPs (NH 2-MNPs) modified with APTES were used for DNA extraction from whole blood directly by electrostatic reaction, and followed by PCR, was successfully performed. Furthermore, biotinylated PCR products were captured on the streptavidin-coated MNPs (SA-MNPs) and interrogated by hybridization with a pair of dual-color probes to determine SNP, then the genotype of each sample can be simultaneously identified by scanning the microarray printed with the denatured fluorescent probes. This system provided a rapid, sensitive and highly versatile automated procedure that will greatly facilitate the analysis of different known SNPs in human genome.
Ferchaud, Anne-Laure; Pedersen, Susanne H; Bekkevold, Dorte; Jian, Jianbo; Niu, Yongchao; Hansen, Michael M
2014-10-06
The threespine stickleback (Gasterosteus aculeatus) has become an important model species for studying both contemporary and parallel evolution. In particular, differential adaptation to freshwater and marine environments has led to high differentiation between freshwater and marine stickleback populations at the phenotypic trait of lateral plate morphology and the underlying candidate gene Ectodysplacin (EDA). Many studies have focused on this trait and candidate gene, although other genes involved in marine-freshwater adaptation may be equally important. In order to develop a resource for rapid and cost efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. RAD (Restriction site Associated DNA) sequencing of sixty individuals representing two freshwater and one marine population led to the identification of 33,993 SNP markers. Ninety-six of these were chosen for the low-density SNP array, among which 70 represented SNPs under putatively directional selection in freshwater vs. marine environments, whereas 26 SNPs were assumed to be neutral. Annotation of these regions revealed several genes that are candidates for affecting stickleback phenotypic variation, some of which have been observed in previous studies whereas others are new. We have developed a cost-efficient low-density SNP array that allows for rapid screening of polymorphisms in threespine stickleback. The array provides a valuable tool for analyzing adaptive divergence between freshwater and marine stickleback populations beyond the well-established candidate gene Ectodysplacin (EDA).
Nucleotide diversity maps reveal variation in diversity among wheat genomes and chromosomes
2010-01-01
Background A genome-wide assessment of nucleotide diversity in a polyploid species must minimize the inclusion of homoeologous sequences into diversity estimates and reliably allocate individual haplotypes into their respective genomes. The same requirements complicate the development and deployment of single nucleotide polymorphism (SNP) markers in polyploid species. We report here a strategy that satisfies these requirements and deploy it in the sequencing of genes in cultivated hexaploid wheat (Triticum aestivum, genomes AABBDD) and wild tetraploid wheat (Triticum turgidum ssp. dicoccoides, genomes AABB) from the putative site of wheat domestication in Turkey. Data are used to assess the distribution of diversity among and within wheat genomes and to develop a panel of SNP markers for polyploid wheat. Results Nucleotide diversity was estimated in 2114 wheat genes and was similar between the A and B genomes and reduced in the D genome. Within a genome, diversity was diminished on some chromosomes. Low diversity was always accompanied by an excess of rare alleles. A total of 5,471 SNPs was discovered in 1791 wheat genes. Totals of 1,271, 1,218, and 2,203 SNPs were discovered in 488, 463, and 641 genes of wheat putative diploid ancestors, T. urartu, Aegilops speltoides, and Ae. tauschii, respectively. A public database containing genome-specific primers, SNPs, and other information was constructed. A total of 987 genes with nucleotide diversity estimated in one or more of the wheat genomes was placed on an Ae. tauschii genetic map, and the map was superimposed on wheat deletion-bin maps. The agreement between the maps was assessed. Conclusions In a young polyploid, exemplified by T. aestivum, ancestral species are the primary source of genetic diversity. Low effective recombination due to self-pollination and a genetic mechanism precluding homoeologous chromosome pairing during polyploid meiosis can lead to the loss of diversity from large chromosomal regions. The net effect of these factors in T. aestivum is large variation in diversity among genomes and chromosomes, which impacts the development of SNP markers and their practical utility. Accumulation of new mutations in older polyploid species, such as wild emmer, results in increased diversity and its more uniform distribution across the genome. PMID:21156062
Genotypic distribution of single nucleotide polymorphisms in oral cancer: global scene.
Multani, Shaleen; Saranath, Dhananjaya
2016-11-01
Globocan 2012 reports the global oral cancer incidence of 300,373 new oral cancer cases annually, contributing to 2.1 % of the world cancer burden. The major well-established risk factors for oral cancer include tobacco, betel/areca nut, alcohol and high-risk oncogenic human papilloma virus (HPV) 16/18. However, only 5-10 % of individuals with high-risk lifestyle develop oral cancer. Thus, genomic variants in individuals represented as single nucleotide polymorphisms (SNPs) influence susceptibility to oral cancer. With a view to understanding the role of genomic variants in oral cancer, we reviewed SNPs in case-control studies with a minimum of 100 cases and 100 controls. PubMed and HuGE navigator search engines were used to obtain data published from 1990 to 2015, which identified 67 articles investigating the role of SNPs in oral cancer. Single publications reported 93 SNPs in 55 genes, with 34 SNPs associated with a risk of oral cancer. Meta-analysis of data in multiple studies defined nine SNPs associated with a risk of oral cancer. The genes were associated with critical functions deregulated in cancers, including cell proliferation, immune function, inflammation, transcription, DNA repair and xenobiotic metabolism.
Functional Characterization of Schizophrenia-Associated Variation in CACNA1C
Eckart, Nicole; Song, Qifeng; Yang, Rebecca; Wang, Ruihua; Zhu, Heng; McCallion, Andrew S.; Avramopoulos, Dimitrios
2016-01-01
Calcium channel subunits, including CACNA1C, have been associated with multiple psychiatric disorders. Specifically, genome wide association studies (GWAS) have repeatedly identified the single nucleotide polymorphism (SNP) rs1006737 in intron 3 of CACNA1C to be strongly associated with schizophrenia and bipolar disorder. Here, we show that rs1006737 marks a quantitative trait locus for CACNA1C transcript levels. We test 16 SNPs in high linkage disequilibrium with rs1007637 and find one, rs4765905, consistently showing allele-dependent regulatory function in reporter assays. We find allele-specific protein binding for 13 SNPs including rs4765905. Using protein microarrays, we identify several proteins binding ≥3 SNPs, but not control sequences, suggesting possible functional interactions and combinatorial haplotype effects. Finally, using circular chromatin conformation capture, we show interaction of the disease-associated region including the 16 SNPs with the CACNA1C promoter and other potential regulatory regions. Our results elucidate the pathogenic relevance of one of the best-supported risk loci for schizophrenia and bipolar disorder. PMID:27276213
Genetic association of ubiquilin with Alzheimer's disease and related quantitative measures.
Kamboh, M I; Minster, R L; Feingold, E; DeKosky, S T
2006-03-01
The gene coding for ubiquilin 1 (UBQLN1) is located near a linkage peak on chromosome 9q22.2 and it also impacts the function of presenilin proteins involved in early-onset Alzheimer's disease (AD). Recently, genetic variation in UBQLN1 has been shown to affect the risk of AD in two independent family-based samples. The purpose of this study was to confirm the reported association in a large case-control sample and to also examine the association of UBQLN1 SNPs with quantitative measures of AD progression, namely age-at-onset (AAO), disease duration and Mini-Mental State Examination (MMSE) score. We examined the associations of three SNPs in the UBQLN1 gene (intron 6/A>C, intron 8/T>C and intron 9/A>G) in up to 978 LOAD cases and 808 controls. All SNPs were in significant linkage disequilibrium (P<0.0001). While modest significant associations were observed in the single-site regression analysis, 3-site haplotype analysis revealed significant associations (P<0.0001 for overall haplotype analysis). One common haplotype (H4) defined by intron 6/A-intron 8/C-intron 9/G alleles was associated with AD risk and one less common haplotype (H5) defined by intron 6/C-intron 8/C-intron 9/A alleles was associated with protection. The adjusted odds ratios with potentially one and two copies of risk haplotype H4 were 1.5 (95% CI: 0.99-2.26; P=0.054) and 3.66 (95% CI: 1.43-9.39; P=0.007), respectively, and odds ratio for haplotype H5 carriers was 0.31 (95% CI: 0.10-0.95; P=0.0398). In addition to disease risk, the homozygosity of the risk haplotype was also associated with older AAO, longer disease duration and lower MMSE score. In summary, our data from a large case-control cohort indicate that genetic variation in the UBQLN1 gene has a modest effect on risk, AAO and disease duration of AD. Our haplotype data suggest the presence of additional putative functional variants either in the UBQLN1 gene or nearby genes and provide strong justification for additional work in this region on chromosome 9.
HLA-DQA1 and PLA2R1 Polymorphisms and Risk of Idiopathic Membranous Nephropathy
Bullich, Gemma; Ballarín, José; Oliver, Artur; Ayasreh, Nadia; Silva, Irene; Santín, Sheila; Díaz-Encarnación, Montserrat M.; Torra, Roser
2014-01-01
Summary Background and objectives Single nucleotide polymorphisms (SNPs) within HLA complex class II HLA-DQ α-chain 1 (HLA-DQA1) and M-type phospholipase A2 receptor (PLA2R1) genes were identified as strong risk factors for idiopathic membranous nephropathy (IMN) development in a recent genome-wide association study. Copy number variants (CNVs) within the Fc gamma receptor III (FCGR3) locus have been associated with several autoimmune diseases, but their role in IMN has not been studied. This study aimed to validate the association of HLA-DQA1 and PLA2R1 risk alleles with IMN in a Spanish cohort, test the putative association of FCGR3A and FCGR3B CNVs with IMN, and assess the use of these genetic factors to predict the clinical outcome of the disease. Design, settings, participants, & measurements A Spanish cohort of 89 IMN patients and 286 matched controls without nephropathy was recruited between October of 2009 and July of 2012. Case-control studies for SNPs within HLA-DQA1 (rs2187668) and PLA2R1 (rs4664308) genes and CNVs for FCGR3A and FCGR3B genes were performed. The contribution of these polymorphisms to predict clinical outcome and renal function decline was analyzed. Results This study validated the association of these HLA-DQA1 and PLA2R1 SNPs with IMN in a Spanish cohort and its increased risk when combining both risk genotypes. No significant association was found between FCGR3 CNVs and IMN. These results revealed that HLA-DQA1 and PLA2R1 genotype combination adjusted for baseline proteinuria strongly predicted response to immunosuppressive therapy. HLA-DQA1 genotype adjusted for proteinuria was also linked with renal function decline. Conclusion This study confirms that HLA-DQA1 and PLA2R1 genotypes are risk factors for IMN, whereas no association was identified for FCGR3 CNVs. This study provides, for the first time, evidence of the contribution of these HLA-DQA1 and PLA2R1 polymorphisms in predicting IMN response to immunosuppressors and disease progression. Future studies are needed to validate and identify prognostic markers. PMID:24262501
2010-01-01
Background Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. Results A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [Punadj. = 6.96 × 10-6]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (Punadj. = 1.61 × 10-9; PBonf. = 6.58 × 10-5). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r2 = 0.86). Conclusions Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 × 10-10; BIEC2-417495, Punadj. = 1.61 × 10-9). Functional investigations will be required to determine whether this polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds. PMID:20932346
Hill, Emmeline W; McGivney, Beatrice A; Gu, Jingjing; Whiston, Ronan; Machugh, David E
2010-10-11
Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [P(unadj.) = 6.96 x 10⁻⁶]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (P(unadj.) = 1.61 x 10⁻⁹; P(Bonf.) = 6.58 x 10⁻⁵). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r² = 0.86). Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 x 10⁻¹⁰; BIEC2-417495, P(unadj.) = 1.61 x 10⁻⁹). Functional investigations will be required to determine whether this polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds.
Paris, Margot; Marcombe, Sebastien; Coissac, Eric; Corbel, Vincent; David, Jean-Philippe; Després, Laurence
2013-01-01
Mosquito control is often the main method used to reduce mosquito-transmitted diseases. In order to investigate the genetic basis of resistance to the bio-insecticide Bacillus thuringiensis subsp. israelensis (Bti), we used information on polymorphism obtained from cDNA tag sequences from pooled larvae of laboratory Bti-resistant and susceptible Aedes aegypti mosquito strains to identify and analyse 1520 single nucleotide polymorphisms (SNPs). Of the 372 SNPs tested, 99.2% were validated using DNA Illumina GoldenGate® array, with a strong correlation between the allelic frequencies inferred from the pooled and individual data (r = 0.85). A total of 11 genomic regions and five candidate genes were detected using a genome scan approach. One of these candidate genes showed significant departures from neutrality in the resistant strain at sequence level. Six natural populations from Martinique Island were sequenced for the 372 tested SNPs with a high transferability (87%), and association mapping analyses detected 14 loci associated with Bti resistance, including one located in a putative receptor for Cry11 toxins. Three of these loci were also significantly differentiated between the laboratory strains, suggesting that most of the genes associated with resistance might differ between the two environments. It also suggests that common selected regions might harbour key genes for Bti resistance. PMID:24187584
Zinck, John W. R.
2016-01-01
Natural plant populations are often adapted to their local climate and environmental conditions, and populations of forest trees offer some of the best examples of this pattern. However, little empirical work has focused on the relative contribution of single-locus versus multilocus effects to the genetic architecture of local adaptation in plants/forest trees. Here, we employ eastern white pine (Pinus strobus) to test the hypothesis that it is the inter-genic effects that primarily drive climate-induced local adaptation. The genetic structure of 29 range-wide natural populations of eastern white pine was determined in relation to local climatic factors using both a reference set of SSR markers, and SNPs located in candidate genes putatively involved in adaptive response to climate. Comparisons were made between marker sets using standard single-locus outlier analysis, single-locus and multilocus environment association analyses and a novel implementation of Population Graphs. Magnitudes of population structure were similar between the two marker sets. Outlier loci consistent with diversifying selection were rare for both SNPs and SSRs. However, genetic distances based on the multilocus among population covariances (cGD) were significantly more correlated to climate, even after correcting for spatial effects, for SNPs as compared to SSRs. Coalescent simulations confirmed that the differences in mutation rates between SSRs and SNPs did not affect the topologies of the Population Graphs, and hence values of cGD and their correlations with associated climate variables. We conclude that the multilocus covariances among populations primarily reflect adaptation to local climate and environment in eastern white pine. This result highlights the complexity of the genetic architecture of adaptive traits, as well as the need to consider multilocus effects in studies of local adaptation. PMID:27387485
LINGO1 rs9652490 and rs11856808 polymorphisms are not associated with risk for multiple sclerosis
2013-01-01
Background Some recent experimental data suggest a possible role of LINGO-1 in the pathogenesis of multiple sclerosis (MS). In an attempt to identify genetic biomarkers related to MS susceptibility, we genotyped two common SNPs in the LINGO1 gene which have been associated to other neurological conditions, in patients with MS and in healthy subjects. These SNPs are linked to several SNPs within the LINGO1 gene, especially in individuals of Oriental or Caucasian descent. Methods We analyzed the allelic and genotype frequency of two LINGO1 variants (rs9652490 and rs11856808) in 293 patients with MS and 318 healthy controls, using KASPar assays. Results LINGO1 rs9652490 and rs11856808 allelic and genotype frequencies did not differ significantly between MS patients and controls. The minor allele frequencies for rs9652490 were 0.171 (95% CI = 0.140-0.201) and 0.167 (95% CI = 0.138-0.196 for cases and controls respectively (p = 0.853). For rs11856808 the minor allele frequencies were 0.317 (95% CI = 0.280-0.355) and 0.310 (95% CI = 0.274-0.346) for cases and controls, respectively (p = 0.773). Allele and genotype frequencies were unrelated with the age of onset of MS, gender, and clinical course of MS. In addition, haplotype analyses did not reveal any putative risk related to haplotypes. Conclusions These results suggest that LINGO1 rs9652490 and rs11856808 polymorphisms are not related with risk for MS. This study adds to other published evidence indicating that, to date, the LINGO1 SNPs studied here could be useful risk biomarkers of developing essential tremor, but not other movement disorders. PMID:23574883
Genome-wide association study with the risk of schizophrenia in a Korean population.
Kim, Lyoung Hyo; Park, Byung Lae; Cheong, Hyun Sub; Namgoong, Suhg; Kim, Ji On; Kim, Jeong-Hyun; Shin, Joong-Gon; Park, Chul Soo; Kim, Bong-Jo; Kim, Jae Won; Choi, Ihn-Geun; Hwang, Jaeuk; Shin, Hyoung Doo; Woo, Sung-Il
2016-03-01
Schizophrenia is regarded as a multifactorial and polygenic brain disorder that is attributed to different combinations of genetic and environmental risk factors. Recently, several genome-wide association studies (GWASs) of schizophrenia have identified numerous risk factors, but the replication results remain controversial and ambiguous. To identify schizophrenia susceptibility loci in the Korean population, we performed a GWAS using the Illumina HumanOmni1-Quad V1.0 Microarray. We genotyped 1,140,419 single nucleotide polymorphisms (SNPs) in 350 Korea schizophrenia patients and 700 control subjects, and approximately 620,001 autosomal SNPs were passed our quality control. In the case-control analysis, the rs9607195 A>G on intergenic area 250 kb away from the ISX gene and the rs12738007 A>G on the intron of the MECR gene were the most strongly associated SNPs with the risk of schizophrenia (P = 6.2 × 10(-8) , OR = 0.50 and P = 3.7 × 10(-7) , OR = 2.39, respectively). In subsequent fine-mapping analysis, 6 SNPs of MECR were genotyped with 310 schizophrenia patients and 604 control subjects. The association of the MECR rs12738007, a top ranked-SNP in GWAS, was replicated (P = 1.5 × 10(-2) , OR = 1.53 in fine mapping analysis, P = 1.5 × 10(-6) , OR = 1.90 in combined analysis). The identification of putative schizophrenia susceptibility loci could provide new insights into genetic factors related with schizophrenia and clues for the development of diagnosis strategies. © 2015 Wiley Periodicals, Inc.
Functional evaluation of genetic variants associated with endometriosis near GREB1.
Fung, Jenny N; Holdsworth-Carson, Sarah J; Sapkota, Yadav; Zhao, Zhen Zhen; Jones, Lincoln; Girling, Jane E; Paiva, Premila; Healey, Martin; Nyholt, Dale R; Rogers, Peter A W; Montgomery, Grant W
2015-05-01
Do DNA variants in the growth regulation by estrogen in breast cancer 1 (GREB1) region regulate endometrial GREB1 expression and increase the risk of developing endometriosis in women? We identified new single nucleotide polymorphisms (SNPs) with strong association with endometriosis at the GREB1 locus although we did not detect altered GREB1 expression in endometriosis patients with defined genotypes. Genome-wide association studies have identified the GREB1 region on chromosome 2p25.1 for increasing endometriosis risk. The differential expression of GREB1 has also been reported by others in association with endometriosis disease phenotype. Fine mapping studies comprehensively evaluated SNPs within the GREB1 region in a large-scale data set (>2500 cases and >4000 controls). Publicly available bioinformatics tools were employed to functionally annotate SNPs showing the strongest association signal with endometriosis risk. Endometrial GREB1 mRNA and protein expression was studied with respect to phases of the menstrual cycle (n = 2-45 per cycle stage) and expression quantitative trait loci (eQTL) analysis for significant SNPs were undertaken for GREB1 [mRNA (n = 94) and protein (n = 44) in endometrium]. Participants in this study are females who provided blood and/or endometrial tissue samples in a hospital setting. The key SNPs were genotyped using Sequenom MassARRAY. The functional roles and regulatory annotations for identified SNPs are predicted by various publicly available bioinformatics tools. Endometrial GREB1 expression work employed qRT-PCR, western blotting and immunohistochemistry studies. Fine mapping results identified a number of SNPs showing stronger association (0.004 < P < 0.032) with endometriosis risk than the original GWAS SNP (rs13394619) (P = 0.034). Some of these SNPs were predicted to have functional roles, for example, interaction with transcription factor motifs. The haplotype (a combination of alleles) formed by the risk alleles from two common SNPs showed significant association (P = 0.026) with endometriosis and epistasis analysis showed no evidence for interaction between the two SNPs, suggesting an additive effect of SNPs on endometriosis risk. In normal human endometrium, GREB1 protein expression was altered depending on the cycle stage (significantly different in late proliferative versus late secretory, P < 0.05) and cell type (glandular epithelium, not stromal cells). However, GREB1 expression in endometriosis cases versus controls and eQTL analyses did not reveal any significant changes. In silico prediction tools are generally based on cell lines different to our tissue and disease of interest. Functional annotations drawn from these analyses should be considered with this limitation in mind. We identified cell-specific and hormone-specific changes in GREB1 protein expression. The lack of a significant difference observed following our GREB1 expression studies may be the result of moderate power on mixed cell populations in the endometrial tissue samples. This study further implicates the GREB1 region on chromosome 2p25.1 and the GREB1 gene with involvement in endometriosis risk. More detailed functional studies are required to determine the role of the novel GREB1 transcripts in endometriosis pathophysiology. Funding for this work was provided by NHMRC Project Grants APP1012245, APP1026033, APP1049472 and APP1046880. There are no competing interests. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Heterospecific SNP diversity in humans and rhesus macaque (Macaca mulatta)
Ng, Jillian; Trask, Jessica Satkoski; Smith, David Glenn; Kanthaswamy, Sree
2018-01-01
Background Conservation of single nucleotide polymorphisms (SNPs) between human and other primates (i.e., heterospecific SNPs) in candidate genes can be used to assess the utility of those organisms as models for human biomedical research. Methods 59,691 heterospecific SNPs in 22 rhesus macaques and 20 humans were analyzed for human trait associations and 4,207 heterospecific SNPs biallelic in both taxa were compared for genetic variation. Results Variation comparisons at the 4,207 SNPs showed that humans were more genetically diverse than rhesus macaques with observed and expected heterozygosities of 0.337 and 0.323 versus 0.119 and 0.102, and minor allele frequencies of 0.239 and 0.063, respectively. 431 of the 59,691 heterospecific SNPs are reportedly associated with human-specific traits. Conclusion While comparisons between human and rhesus macaque genomes are plausible, functional studies of heterospecific SNPs are necessary to determine whether rhesus macaque alleles are associated with the same phenotypes as their corresponding human alleles. PMID:25963897
Freedman, Jennifer A; Wang, Yanru; Li, Xuechan; Liu, Hongliang; Moorman, Patricia G; George, Daniel J; Lee, Norman H; Hyslop, Terry; Wei, Qingyi; Patierno, Steven R
2018-05-03
Prostate cancer is a clinically and molecularly heterogeneous disease, with variation in outcomes only partially predicted by grade and stage. Additional tools to distinguish indolent from aggressive disease are needed. Phenotypic characteristics of stemness correlate with poor cancer prognosis. Given this correlation, we identified single nucleotide polymorphisms (SNPs) of stemness-related genes and examined their associations with prostate cancer survival. SNPs within stemness-related genes were analyzed for association with overall survival of prostate cancer in the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial. Significant SNPs predicted to be functional were selected for linkage disequilibrium analysis and combined and stratified analyses. Identified SNPs were evaluated for association with gene expression. SNPs of CD44 (rs9666607), ABCC1 (rs35605 and rs212091) and GDF15 (rs1058587) were associated with prostate cancer survival and predicted to be functional. A role for rs9666607 of CD44 and rs35605 of ABCC1 in RNA splicing regulation, rs212091 of ABCC1 in miRNA binding site activity and rs1058587 of GDF15 in causing an amino acid change was predicted. These SNPs represent potential novel prognostic markers for overall survival of prostate cancer and support a contribution of the stemness pathway to prostate cancer patient outcome.
Coassin, Stefan; Friedel, Salome; Köttgen, Anna; Lamina, Claudia; Kronenberg, Florian
2016-11-01
A recent observational study with almost 2 million men reported an association between low high-density lipoprotein (HDL) cholesterol and worse kidney function. The causality of this association would be strongly supported if genetic variants associated with HDL cholesterol were also associated with kidney function. We used 68 genetic variants (single-nucleotide polymorphisms [SNPs]) associated with HDL cholesterol in genome-wide association studies including >188 000 subjects and tested their association with estimated glomerular filtration rate (eGFR) using summary statistics from another genome-wide association studies meta-analysis of kidney function including ≤133 413 subjects. Fourteen of the 68 SNPs (21%) had a P value <0.05 compared with the 5% expected by chance (Binomial test P=5.8×10 - 6 ). After Bonferroni correction, 6 SNPs were still significantly associated with eGFR. The genetic variants with the strongest associations with HDL cholesterol concentrations were not the same as those with the strongest association with kidney function and vice versa. An evaluation of pleiotropy indicated that the effects of the HDL-associated SNPs on eGFR were not mediated by HDL cholesterol. In addition, we performed a Mendelian randomization analysis. This analysis revealed a positive but nonsignificant causal effect of HDL cholesterol-increasing variants on eGFR. In summary, our findings indicate that HDL cholesterol does not causally influence eGFR and propose pleiotropic effects on eGFR for some HDL cholesterol-associated SNPs. This may cause the observed association by mechanisms other than the mere HDL cholesterol concentration. © 2016 The Authors.
Huebinger, Ryan M.; Shewale, Shantanu J.; Koenig, Jessica L.; Mitchel, Jeffrey S.; O’Bryant, Sid E.; Waring, Stephen C.; Diaz-Arrastia, Ramon; Chasse, Scott
2015-01-01
Although 24 Alzheimer’s disease (AD) risk loci have been reliably identified, a large portion of the predicted heritability for AD remains unexplained. It is expected that additional loci of small effect will be identified with an increased sample size. However, the cost of a significant increase in Case-Control sample size is prohibitive. The current study tests whether exploring the genetic basis of endophenotypes, in this case based on putative blood biomarkers for AD, can accelerate the identification of susceptibility loci using modest sample sizes. Each endophenotype was used as the outcome variable in an independent GWAS. Endophenotypes were based on circulating concentrations of proteins that contributed significantly to a published blood-based predictive algorithm for AD. Endophenotypes included Monocyte Chemoattractant Protein 1 (MCP1), Vascular Cell Adhesion Molecule 1 (VCAM1), Pancreatic Polypeptide (PP), Beta2 Microglobulin (B2M), Factor VII (F7), Adiponectin (ADN) and Tenascin C (TN-C). Across the seven endophenotypes, 47 SNPs were associated with outcome with a p-value ≤1x10-7. Each signal was further characterized with respect to known genetic loci associated with AD. Signals for several endophenotypes were observed in the vicinity of CR1, MS4A6A/MS4A4E, PICALM, CLU, and PTK2B. The strongest signal was observed in association with Factor VII levels and was located within the F7 gene. Additional signals were observed in MAP3K13, ZNF320, ATP9B and TREM1. Conditional regression analyses suggested that the SNPs contributed to variation in protein concentration independent of AD status. The identification of two putatively novel AD loci (in the Factor VII and ATP9B genes), which have not been located in previous studies despite massive sample sizes, highlights the benefits of an endophenotypic approach for resolving the genetic basis for complex diseases. The coincidence of several of the endophenotypic signals with known AD loci may point to novel genetic interactions and should be further investigated. PMID:26625115
Barber, Robert C; Phillips, Nicole R; Tilson, Jeffrey L; Huebinger, Ryan M; Shewale, Shantanu J; Koenig, Jessica L; Mitchel, Jeffrey S; O'Bryant, Sid E; Waring, Stephen C; Diaz-Arrastia, Ramon; Chasse, Scott; Wilhelmsen, Kirk C
2015-01-01
Although 24 Alzheimer's disease (AD) risk loci have been reliably identified, a large portion of the predicted heritability for AD remains unexplained. It is expected that additional loci of small effect will be identified with an increased sample size. However, the cost of a significant increase in Case-Control sample size is prohibitive. The current study tests whether exploring the genetic basis of endophenotypes, in this case based on putative blood biomarkers for AD, can accelerate the identification of susceptibility loci using modest sample sizes. Each endophenotype was used as the outcome variable in an independent GWAS. Endophenotypes were based on circulating concentrations of proteins that contributed significantly to a published blood-based predictive algorithm for AD. Endophenotypes included Monocyte Chemoattractant Protein 1 (MCP1), Vascular Cell Adhesion Molecule 1 (VCAM1), Pancreatic Polypeptide (PP), Beta2 Microglobulin (B2M), Factor VII (F7), Adiponectin (ADN) and Tenascin C (TN-C). Across the seven endophenotypes, 47 SNPs were associated with outcome with a p-value ≤1x10(-7). Each signal was further characterized with respect to known genetic loci associated with AD. Signals for several endophenotypes were observed in the vicinity of CR1, MS4A6A/MS4A4E, PICALM, CLU, and PTK2B. The strongest signal was observed in association with Factor VII levels and was located within the F7 gene. Additional signals were observed in MAP3K13, ZNF320, ATP9B and TREM1. Conditional regression analyses suggested that the SNPs contributed to variation in protein concentration independent of AD status. The identification of two putatively novel AD loci (in the Factor VII and ATP9B genes), which have not been located in previous studies despite massive sample sizes, highlights the benefits of an endophenotypic approach for resolving the genetic basis for complex diseases. The coincidence of several of the endophenotypic signals with known AD loci may point to novel genetic interactions and should be further investigated.
Genome-Wide SNP Genotyping to Infer the Effects on Gene Functions in Tomato
Hirakawa, Hideki; Shirasawa, Kenta; Ohyama, Akio; Fukuoka, Hiroyuki; Aoki, Koh; Rothan, Christophe; Sato, Shusei; Isobe, Sachiko; Tabata, Satoshi
2013-01-01
The genotype data of 7054 single nucleotide polymorphism (SNP) loci in 40 tomato lines, including inbred lines, F1 hybrids, and wild relatives, were collected using Illumina's Infinium and GoldenGate assay platforms, the latter of which was utilized in our previous study. The dendrogram based on the genotype data corresponded well to the breeding types of tomato and wild relatives. The SNPs were classified into six categories according to their positions in the genes predicted on the tomato genome sequence. The genes with SNPs were annotated by homology searches against the nucleotide and protein databases, as well as by domain searches, and they were classified into the functional categories defined by the NCBI's eukaryotic orthologous groups (KOG). To infer the SNPs' effects on the gene functions, the three-dimensional structures of the 843 proteins that were encoded by the genes with SNPs causing missense mutations were constructed by homology modelling, and 200 of these proteins were considered to carry non-synonymous amino acid substitutions in the predicted functional sites. The SNP information obtained in this study is available at the Kazusa Tomato Genomics Database (http://plant1.kazusa.or.jp/tomato/). PMID:23482505
Yue, Ming; Zhou, Dianshuang; Zhi, Hui; Wang, Peng; Zhang, Yan; Gao, Yue; Guo, Maoni; Li, Xin; Wang, Yanxia
2018-01-01
Abstract The MiRNA SNP Disease Database (MSDD, http://www.bio-bigdata.com/msdd/) is a manually curated database that provides comprehensive experimentally supported associations among microRNAs (miRNAs), single nucleotide polymorphisms (SNPs) and human diseases. SNPs in miRNA-related functional regions such as mature miRNAs, promoter regions, pri-miRNAs, pre-miRNAs and target gene 3′-UTRs, collectively called ‘miRSNPs’, represent a novel category of functional molecules. miRSNPs can lead to miRNA and its target gene dysregulation, and resulting in susceptibility to or onset of human diseases. A curated collection and summary of miRSNP-associated diseases is essential for a thorough understanding of the mechanisms and functions of miRSNPs. Here, we describe MSDD, which currently documents 525 associations among 182 human miRNAs, 197 SNPs, 153 genes and 164 human diseases through a review of more than 2000 published papers. Each association incorporates information on the miRNAs, SNPs, miRNA target genes and disease names, SNP locations and alleles, the miRNA dysfunctional pattern, experimental techniques, a brief functional description, the original reference and additional annotation. MSDD provides a user-friendly interface to conveniently browse, retrieve, download and submit novel data. MSDD will significantly improve our understanding of miRNA dysfunction in disease, and thus, MSDD has the potential to serve as a timely and valuable resource. PMID:29106642
Yue, Ming; Zhou, Dianshuang; Zhi, Hui; Wang, Peng; Zhang, Yan; Gao, Yue; Guo, Maoni; Li, Xin; Wang, Yanxia; Zhang, Yunpeng; Ning, Shangwei; Li, Xia
2018-01-04
The MiRNA SNP Disease Database (MSDD, http://www.bio-bigdata.com/msdd/) is a manually curated database that provides comprehensive experimentally supported associations among microRNAs (miRNAs), single nucleotide polymorphisms (SNPs) and human diseases. SNPs in miRNA-related functional regions such as mature miRNAs, promoter regions, pri-miRNAs, pre-miRNAs and target gene 3'-UTRs, collectively called 'miRSNPs', represent a novel category of functional molecules. miRSNPs can lead to miRNA and its target gene dysregulation, and resulting in susceptibility to or onset of human diseases. A curated collection and summary of miRSNP-associated diseases is essential for a thorough understanding of the mechanisms and functions of miRSNPs. Here, we describe MSDD, which currently documents 525 associations among 182 human miRNAs, 197 SNPs, 153 genes and 164 human diseases through a review of more than 2000 published papers. Each association incorporates information on the miRNAs, SNPs, miRNA target genes and disease names, SNP locations and alleles, the miRNA dysfunctional pattern, experimental techniques, a brief functional description, the original reference and additional annotation. MSDD provides a user-friendly interface to conveniently browse, retrieve, download and submit novel data. MSDD will significantly improve our understanding of miRNA dysfunction in disease, and thus, MSDD has the potential to serve as a timely and valuable resource. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Costas, Javier; Paramo, Mario; Arrojo, Manuel
2018-01-01
Abstract Background Genomic research has revealed that schizophrenia is a highly polygenic disease. Recent estimates indicate that at least 71% of genomic segments of 1 Mb include one or more risk loci for schizophrenia (Loh et al., Nature Genet 2015). This extremely high polygenicity represents a challenge to decipher the biological basis of schizophrenia, as it is expected that any set of SNPs with enough size will be associated with the disorder. Among the different gene sets available for study (such as those from Gene Ontology, KEGG pathway, Reactome pathways or protein protein interaction datasets), those based on brain co-expression networks represent putative functional relationships in the relevant tissue. The aim of this work was to identify brain co-expression networks that contribute disproportionately to the common polygenic risk for schizophrenia to get more insight on schizophrenia etiopathology. Methods We analyzed a case -control dataset consisting of 582 schizophrenia patients from Galicia, NW Spain, and 591 ancestrally matched controls, genotyped with the Illumina PsychArray. Using as discovery sample the summary results from the largest GWAS of schizophrenia to date (Psychiatric Genomics Consortium, SCZ2), we generated polygenic risk scores (PRS) in our sample based on SNPs located at genes belonging to brain co-expression modules determined by the CommonMind Consortium (Fromer et al., Nature Neurosci 2016). PRS were generated using the clumping procedure of PLINK, considering several different thresholds to select SNPs from the discovery sample. In order to test if any specific module increased risk to schizophrenia more than expected by their size, we generated up to 10,000 random permutations of the same number of SNPs, matched by frequency, distance to nearest gene, number of SNPs in LD and gene density, using SNPsnap. Results As expected, most modules with enough number of independent SNPs belonging to them showed a significant increase in Nagelkerke’s R2 in our case-control sample after the addition of the module-specific PRS in a logistic regression model. Our permutation strategy revealed that most modules did not show an excess of risk, measured by increase in Nagelkerke’s R2, in comparison to equal number of SNPs with similar characteristics. But one module, M2c from Fromer et al., remained highly significant after multiple tests’ correction. Reactome pathways analysis revealed an over-representation of genes involved in “Neuronal System” and “Axon guidance” among genes from this module. Using the same protocol, we detected that the 84 genes from the neuronal system pathway at this module, representing less than 6% of the genes from the module, explained a higher level of risk than expected. “Voltage-gated Potassium channels” and “Neurexins and neuroligins” are overrepresented among the Neuronal System genes from module M2c. Discussion Here, we show that, in spite of the high polygenicity of schizophrenia, it is possible to identify gene sets contributing disproportionately to total risk, as it was the case for the M2c module from Fromer et al. These authors have previously reported that the M2c module was enriched in GWAS signals, as well as CNVs and rare variants associated with schizophrenia. Therefore, this module shows a disproportionately contribution to schizophrenia risk. Study supported by Grant PI14/01020 from Instituto de Salud Carlos III, Ministry of Health, Spanish Government.
Genome-Wide Association Study of a Varroa-Specific Defense Behavior in Honeybees (Apis mellifera)
Spötter, Andreas; Gupta, Pooja; Mayer, Manfred; Reinsch, Norbert
2016-01-01
Honey bees are exposed to many damaging pathogens and parasites. The most devastating is Varroa destructor, which mainly affects the brood. A promising approach for preventing its spread is to breed Varroa-resistant honey bees. One trait that has been shown to provide significant resistance against the Varroa mite is hygienic behavior, which is a behavioral response of honeybee workers to brood diseases in general. Here, we report the use of an Affymetrix 44K SNP array to analyze SNPs associated with detection and uncapping of Varroa-parasitized brood by individual worker bees (Apis mellifera). For this study, 22 000 individually labeled bees were video-monitored and a sample of 122 cases and 122 controls was collected and analyzed to determine the dependence/independence of SNP genotypes from hygienic and nonhygienic behavior on a genome-wide scale. After false-discovery rate correction of the P values, 6 SNP markers had highly significant associations with the trait investigated (α < 0.01). Inspection of the genomic regions around these SNPs led to the discovery of putative candidate genes. PMID:26774061
Huang, Lin; Li, Guiyang; Mo, Zhaolan; Xiao, Peng; Li, Jie; Huang, Jie
2015-01-01
Background Japanese flounder (Paralichthys olivaceus) is an economically important marine fish in Asia and has suffered from disease outbreaks caused by various pathogens, which requires more information for immune relevant genes on genome background. However, genomic and transcriptomic data for Japanese flounder remain scarce, which limits studies on the immune system of this species. In this study, we characterized the Japanese flounder spleen transcriptome using an Illumina paired-end sequencing platform to identify putative genes involved in immunity. Methodology/Principal Findings A cDNA library from the spleen of P. olivaceus was constructed and randomly sequenced using an Illumina technique. The removal of low quality reads generated 12,196,968 trimmed reads, which assembled into 96,627 unigenes. A total of 21,391 unigenes (22.14%) were annotated in the NCBI Nr database, and only 1.1% of the BLASTx top-hits matched P. olivaceus protein sequences. Approximately 12,503 (58.45%) unigenes were categorized into three Gene Ontology groups, 19,547 (91.38%) were classified into 26 Cluster of Orthologous Groups, and 10,649 (49.78%) were assigned to six Kyoto Encyclopedia of Genes and Genomes pathways. Furthermore, 40,928 putative simple sequence repeats and 47, 362 putative single nucleotide polymorphisms were identified. Importantly, we identified 1,563 putative immune-associated unigenes that mapped to 15 immune signaling pathways. Conclusions/Significance The P. olivaceus transciptome data provides a rich source to discover and identify new genes, and the immune-relevant sequences identified here will facilitate our understanding of the mechanisms involved in the immune response. Furthermore, the plentiful potential SSRs and SNPs found in this study are important resources with respect to future development of a linkage map or marker assisted breeding programs for the flounder. PMID:25723398
Impact of SNPs on Protein Phosphorylation Status in Rice (Oryza sativa L.).
Lin, Shoukai; Chen, Lijuan; Tao, Huan; Huang, Jian; Xu, Chaoqun; Li, Lin; Ma, Shiwei; Tian, Tian; Liu, Wei; Xue, Lichun; Ai, Yufang; He, Huaqin
2016-11-11
Single nucleotide polymorphisms (SNPs) are widely used in functional genomics and genetics research work. The high-quality sequence of rice genome has provided a genome-wide SNP and proteome resource. However, the impact of SNPs on protein phosphorylation status in rice is not fully understood. In this paper, we firstly updated rice SNP resource based on the new rice genome Ver. 7.0, then systematically analyzed the potential impact of Non-synonymous SNPs (nsSNPs) on the protein phosphorylation status. There were 3,897,312 SNPs in Ver. 7.0 rice genome, among which 9.9% was nsSNPs. Whilst, a total 2,508,261 phosphorylated sites were predicted in rice proteome. Interestingly, we observed that 150,197 (39.1%) nsSNPs could influence protein phosphorylation status, among which 52.2% might induce changes of protein kinase (PK) types for adjacent phosphorylation sites. We constructed a database, SNP_rice, to deposit the updated rice SNP resource and phosSNPs information. It was freely available to academic researchers at http://bioinformatics.fafu.edu.cn. As a case study, we detected five nsSNPs that potentially influenced heterotrimeric G proteins phosphorylation status in rice, indicating that genetic polymorphisms showed impact on the signal transduction by influencing the phosphorylation status of heterotrimeric G proteins. The results in this work could be a useful resource for future experimental identification and provide interesting information for better rice breeding.
Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.
2015-01-01
High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569
Evolutionary evidence of the effect of rare variants on disease etiology.
Gorlov, I P; Gorlova, O Y; Frazier, M L; Spitz, M R; Amos, C I
2011-03-01
The common disease/common variant hypothesis has been popular for describing the genetic architecture of common human diseases for several years. According to the originally stated hypothesis, one or a few common genetic variants with a large effect size control the risk of common diseases. A growing body of evidence, however, suggests that rare single-nucleotide polymorphisms (SNPs), i.e. those with a minor allele frequency of less than 5%, are also an important component of the genetic architecture of common human diseases. In this study, we analyzed the relevance of rare SNPs to the risk of common diseases from an evolutionary perspective and found that rare SNPs are more likely than common SNPs to be functional and tend to have a stronger effect size than do common SNPs. This observation, and the fact that most of the SNPs in the human genome are rare, suggests that rare SNPs are a crucial element of the genetic architecture of common human diseases. We propose that the next generation of genomic studies should focus on analyzing rare SNPs. Further, targeting patients with a family history of the disease, an extreme phenotype, or early disease onset may facilitate the detection of risk-associated rare SNPs. © 2010 John Wiley & Sons A/S.
Genome-wide association study for feed efficiency and growth traits in U.S. beef cattle.
Seabury, Christopher M; Oldeschulte, David L; Saatchi, Mahdi; Beever, Jonathan E; Decker, Jared E; Halley, Yvette A; Bhattarai, Eric K; Molaei, Maral; Freetly, Harvey C; Hansen, Stephanie L; Yampara-Iquise, Helen; Johnson, Kristen A; Kerley, Monty S; Kim, JaeWoo; Loy, Daniel D; Marques, Elisa; Neibergs, Holly L; Schnabel, Robert D; Shike, Daniel W; Spangler, Matthew L; Weaber, Robert L; Garrick, Dorian J; Taylor, Jeremy F
2017-05-18
Single nucleotide polymorphism (SNP) arrays for domestic cattle have catalyzed the identification of genetic markers associated with complex traits for inclusion in modern breeding and selection programs. Using actual and imputed Illumina 778K genotypes for 3887 U.S. beef cattle from 3 populations (Angus, Hereford, SimAngus), we performed genome-wide association analyses for feed efficiency and growth traits including average daily gain (ADG), dry matter intake (DMI), mid-test metabolic weight (MMWT), and residual feed intake (RFI), with marker-based heritability estimates produced for all traits and populations. Moderate and/or large-effect QTL were detected for all traits in all populations, as jointly defined by the estimated proportion of variance explained (PVE) by marker effects (PVE ≥ 1.0%) and a nominal P-value threshold (P ≤ 5e-05). Lead SNPs with PVE ≥ 2.0% were considered putative evidence of large-effect QTL (n = 52), whereas those with PVE ≥ 1.0% but < 2.0% were considered putative evidence for moderate-effect QTL (n = 35). Identical or proximal lead SNPs associated with ADG, DMI, MMWT, and RFI collectively supported the potential for either pleiotropic QTL, or independent but proximal causal mutations for multiple traits within and between the analyzed populations. Marker-based heritability estimates for all investigated traits ranged from 0.18 to 0.60 using 778K genotypes, or from 0.17 to 0.57 using 50K genotypes (reduced from Illumina 778K HD to Illumina Bovine SNP50). An investigation to determine if QTL detected by 778K analysis could also be detected using 50K genotypes produced variable results, suggesting that 50K analyses were generally insufficient for QTL detection in these populations, and that relevant breeding or selection programs should be based on higher density analyses (imputed or directly ascertained). Fourteen moderate to large-effect QTL regions which ranged from being physically proximal (lead SNPs ≤ 3Mb) to fully overlapping for RFI, DMI, ADG, and MMWT were detected within and between populations, and included evidence for pleiotropy, proximal but independent causal mutations, and multi-breed QTL. Bovine positional candidate genes for these traits were functionally conserved across vertebrate species.
Fine-mapping identifies two additional breast cancer susceptibility loci at 9q31.2
Orr, Nick; Dudbridge, Frank; Dryden, Nicola; Maguire, Sarah; Novo, Daniela; Perrakis, Eleni; Johnson, Nichola; Ghoussaini, Maya; Hopper, John L.; Southey, Melissa C.; Apicella, Carmel; Stone, Jennifer; Schmidt, Marjanka K.; Broeks, Annegien; Van't Veer, Laura J.; Hogervorst, Frans B.; Fasching, Peter A.; Haeberle, Lothar; Ekici, Arif B.; Beckmann, Matthias W.; Gibson, Lorna; Aitken, Zoe; Warren, Helen; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Burwinkel, Barbara; Marme, Frederik; Schneeweiss, Andreas; Sohn, Chistof; Guénel, Pascal; Truong, Thérèse; Cordina-Duverger, Emilie; Sanchez, Marie; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; Benitez, Javier; Zamora, Maria Pilar; Arias Perez, Jose Ignacio; Menéndez, Primitiva; Anton-Culver, Hoda; Neuhausen, Susan L.; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Hamann, Ute; Brauch, Hiltrud; Justenhoven, Christina; Brüning, Thomas; Ko, Yon-Dschun; Nevanlinna, Heli; Aittomäki, Kristiina; Blomqvist, Carl; Khan, Sofia; Bogdanova, Natalia; Dörk, Thilo; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Chenevix-Trench, Georgia; Beesley, Jonathan; Lambrechts, Diether; Moisse, Matthieu; Floris, Guiseppe; Beuselinck, Benoit; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Radice, Paolo; Peterlongo, Paolo; Peissel, Bernard; Pensotti, Valeria; Couch, Fergus J.; Olson, Janet E.; Slettedahl, Seth; Vachon, Celine; Giles, Graham G.; Milne, Roger L.; McLean, Catriona; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Goldberg, Mark S.; Labrèche, France; Dumont, Martine; Kristensen, Vessela; Alnæs, Grethe Grenaker; Nord, Silje; Borresen-Dale, Anne-Lise; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Tchatchou, Sandrine; Devilee, Peter; Tollenaar, Robertus A. E. M.; Seynaeve, Caroline M.; Van Asperen, Christi J.; Garcia-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J.; Lissowska, Jolanta; Czene, Kamila; Darabi, Hatef; Eriksson, Mikael; Klevebring, Daniel; Hooning, Maartje J.; Hollestelle, Antoinette; van Deurzen, Carolien H. M.; Kriege, Mieke; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Cox, Angela; Cross, Simon S.; Reed, Malcolm W. R.; Pharoah, Paul D. P.; Dunning, Alison M.; Shah, Mitul; Perkins, Barbara J.; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Ashworth, Alan; Swerdlow, Anthony; Jones, Michael; Schoemaker, Minouk J.; Meindl, Alfons; Schmutzler, Rita K.; Olswold, Curtis; Slager, Susan; Toland, Amanda E.; Yannoukakos, Drakoulis; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Matsuo, Keitaro; Ito, Hidema; Iwata, Hiroji; Ishiguro, Junko; Wu, Anna H.; Tseng, Chiu-chen; Van Den Berg, David; Stram, Daniel O.; Teo, Soo Hwang; Yip, Cheng Har; Kang, Peter; Ikram, Mohammad Kamran; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K.; Noh, Dong-Young; Hartman, Mikael; Miao, Hui; Lim, Wei Yen; Lee, Soo Chin; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; Mckay, James; Wu, Pei-Ei; Hou, Ming-Feng; Yu, Jyh-Cherng; Shen, Chen-Yang; Blot, William; Cai, Qiuyin; Signorello, Lisa B.; Luccarini, Craig; Bayes, Caroline; Ahmed, Shahana; Maranian, Mel; Healey, Catherine S.; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Álvarez, Nuria; Herrero, Daniel; Tessier, Daniel C.; Vincent, Daniel; Bacot, Francois; Hunter, David J.; Lindstrom, Sara; Dennis, Joe; Michailidou, Kyriaki; Bolla, Manjeet K.; Easton, Douglas F.; dos Santos Silva, Isabel; Fletcher, Olivia; Peto, Julian
2015-01-01
We recently identified a novel susceptibility variant, rs865686, for estrogen-receptor positive breast cancer at 9q31.2. Here, we report a fine-mapping analysis of the 9q31.2 susceptibility locus using 43 160 cases and 42 600 controls of European ancestry ascertained from 52 studies and a further 5795 cases and 6624 controls of Asian ancestry from nine studies. Single nucleotide polymorphism (SNP) rs676256 was most strongly associated with risk in Europeans (odds ratios [OR] = 0.90 [0.88–0.92]; P-value = 1.58 × 10−25). This SNP is one of a cluster of highly correlated variants, including rs865686, that spans ∼14.5 kb. We identified two additional independent association signals demarcated by SNPs rs10816625 (OR = 1.12 [1.08–1.17]; P-value = 7.89 × 10−09) and rs13294895 (OR = 1.09 [1.06–1.12]; P-value = 2.97 × 10−11). SNP rs10816625, but not rs13294895, was also associated with risk of breast cancer in Asian individuals (OR = 1.12 [1.06–1.18]; P-value = 2.77 × 10−05). Functional genomic annotation using data derived from breast cancer cell-line models indicates that these SNPs localise to putative enhancer elements that bind known drivers of hormone-dependent breast cancer, including ER-α, FOXA1 and GATA-3. In vitro analyses indicate that rs10816625 and rs13294895 have allele-specific effects on enhancer activity and suggest chromatin interactions with the KLF4 gene locus. These results demonstrate the power of dense genotyping in large studies to identify independent susceptibility variants. Analysis of associations using subjects with different ancestry, combined with bioinformatic and genomic characterisation, can provide strong evidence for the likely causative alleles and their functional basis. PMID:25652398
Garnaud, Cécile; Botterel, Françoise; Sertour, Natacha; Bougnoux, Marie-Elisabeth; Dannaoui, Eric; Larrat, Sylvie; Hennequin, Christophe; Guinea, Jesus; Cornet, Muriel; Maubon, Danièle
2015-09-01
MDR Candida strains are emerging. Next-generation sequencing (NGS), which enables extensive and deep genome analysis, was used to investigate echinocandin and azole resistance in clinical Candida isolates. Six genes commonly involved in antifungal resistance (ERG11, ERG3, TAC1, CgPDR1, FKS1 and FKS2) were analysed using NGS in 40 Candida isolates (18 Candida albicans, 15 Candida glabrata and 7 Candida parapsilosis). The strategy was validated using strains with known sequences. Then, 8 clinical strains displaying antifungal resistance and 23 sequential isolates collected from 10 patients receiving antifungal therapy were analysed. A total of 391 SNPs were detected, among which 6 coding SNPs were reported for the first time. Novel genetic alterations were detected in both azole and echinocandin resistance genes. A C. glabrata strain, which was resistant to echinocandins but highly susceptible to azoles, harboured an FKS2 S663P mutation plus a novel presumed loss-of-function CgPDR1 mutation. This isolate was from a patient with deep-seated and urinary candidiasis. Another C. glabrata isolate, with an MDR phenotype, carried a new FKS2 S663A mutation and a new putative gain-of-function CgPDR1 mutation (T370I); this isolate showed mutated (80%) and WT (20%) populations and was collected after 75 days of exposure to caspofungin from a patient who underwent complicated abdominal surgery. This study shows that NGS can be used for extensive assessment of genetic mutations involved in antifungal resistance. This type of wide genome approach will become very valuable for detecting mechanisms of resistance in clinical strains subjected to multidrug pressure. © The Author 2015. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Fine-mapping identifies two additional breast cancer susceptibility loci at 9q31.2.
Orr, Nick; Dudbridge, Frank; Dryden, Nicola; Maguire, Sarah; Novo, Daniela; Perrakis, Eleni; Johnson, Nichola; Ghoussaini, Maya; Hopper, John L; Southey, Melissa C; Apicella, Carmel; Stone, Jennifer; Schmidt, Marjanka K; Broeks, Annegien; Van't Veer, Laura J; Hogervorst, Frans B; Fasching, Peter A; Haeberle, Lothar; Ekici, Arif B; Beckmann, Matthias W; Gibson, Lorna; Aitken, Zoe; Warren, Helen; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Burwinkel, Barbara; Marme, Frederik; Schneeweiss, Andreas; Sohn, Chistof; Guénel, Pascal; Truong, Thérèse; Cordina-Duverger, Emilie; Sanchez, Marie; Bojesen, Stig E; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Benitez, Javier; Zamora, Maria Pilar; Arias Perez, Jose Ignacio; Menéndez, Primitiva; Anton-Culver, Hoda; Neuhausen, Susan L; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Hamann, Ute; Brauch, Hiltrud; Justenhoven, Christina; Brüning, Thomas; Ko, Yon-Dschun; Nevanlinna, Heli; Aittomäki, Kristiina; Blomqvist, Carl; Khan, Sofia; Bogdanova, Natalia; Dörk, Thilo; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Chenevix-Trench, Georgia; Beesley, Jonathan; Lambrechts, Diether; Moisse, Matthieu; Floris, Guiseppe; Beuselinck, Benoit; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Radice, Paolo; Peterlongo, Paolo; Peissel, Bernard; Pensotti, Valeria; Couch, Fergus J; Olson, Janet E; Slettedahl, Seth; Vachon, Celine; Giles, Graham G; Milne, Roger L; McLean, Catriona; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Kristensen, Vessela; Alnæs, Grethe Grenaker; Nord, Silje; Borresen-Dale, Anne-Lise; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha; Long, Jirong; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Devilee, Peter; Tollenaar, Robertus A E M; Seynaeve, Caroline M; Van Asperen, Christi J; Garcia-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J; Lissowska, Jolanta; Czene, Kamila; Darabi, Hatef; Eriksson, Mikael; Klevebring, Daniel; Hooning, Maartje J; Hollestelle, Antoinette; van Deurzen, Carolien H M; Kriege, Mieke; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Pharoah, Paul D P; Dunning, Alison M; Shah, Mitul; Perkins, Barbara J; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Ashworth, Alan; Swerdlow, Anthony; Jones, Michael; Schoemaker, Minouk J; Meindl, Alfons; Schmutzler, Rita K; Olswold, Curtis; Slager, Susan; Toland, Amanda E; Yannoukakos, Drakoulis; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Matsuo, Keitaro; Ito, Hidema; Iwata, Hiroji; Ishiguro, Junko; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Teo, Soo Hwang; Yip, Cheng Har; Kang, Peter; Ikram, Mohammad Kamran; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Noh, Dong-Young; Hartman, Mikael; Miao, Hui; Lim, Wei Yen; Lee, Soo Chin; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; Mckay, James; Wu, Pei-Ei; Hou, Ming-Feng; Yu, Jyh-Cherng; Shen, Chen-Yang; Blot, William; Cai, Qiuyin; Signorello, Lisa B; Luccarini, Craig; Bayes, Caroline; Ahmed, Shahana; Maranian, Mel; Healey, Catherine S; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Álvarez, Nuria; Herrero, Daniel; Tessier, Daniel C; Vincent, Daniel; Bacot, Francois; Hunter, David J; Lindstrom, Sara; Dennis, Joe; Michailidou, Kyriaki; Bolla, Manjeet K; Easton, Douglas F; dos Santos Silva, Isabel; Fletcher, Olivia; Peto, Julian
2015-05-15
We recently identified a novel susceptibility variant, rs865686, for estrogen-receptor positive breast cancer at 9q31.2. Here, we report a fine-mapping analysis of the 9q31.2 susceptibility locus using 43 160 cases and 42 600 controls of European ancestry ascertained from 52 studies and a further 5795 cases and 6624 controls of Asian ancestry from nine studies. Single nucleotide polymorphism (SNP) rs676256 was most strongly associated with risk in Europeans (odds ratios [OR] = 0.90 [0.88-0.92]; P-value = 1.58 × 10(-25)). This SNP is one of a cluster of highly correlated variants, including rs865686, that spans ∼14.5 kb. We identified two additional independent association signals demarcated by SNPs rs10816625 (OR = 1.12 [1.08-1.17]; P-value = 7.89 × 10(-09)) and rs13294895 (OR = 1.09 [1.06-1.12]; P-value = 2.97 × 10(-11)). SNP rs10816625, but not rs13294895, was also associated with risk of breast cancer in Asian individuals (OR = 1.12 [1.06-1.18]; P-value = 2.77 × 10(-05)). Functional genomic annotation using data derived from breast cancer cell-line models indicates that these SNPs localise to putative enhancer elements that bind known drivers of hormone-dependent breast cancer, including ER-α, FOXA1 and GATA-3. In vitro analyses indicate that rs10816625 and rs13294895 have allele-specific effects on enhancer activity and suggest chromatin interactions with the KLF4 gene locus. These results demonstrate the power of dense genotyping in large studies to identify independent susceptibility variants. Analysis of associations using subjects with different ancestry, combined with bioinformatic and genomic characterisation, can provide strong evidence for the likely causative alleles and their functional basis. © The Author 2015. Published by Oxford University Press.
Eveno, Emmanuelle; Collada, Carmen; Guevara, M Angeles; Léger, Valérie; Soto, Alvaro; Díaz, Luis; Léger, Patrick; González-Martínez, Santiago C; Cervera, M Teresa; Plomion, Christophe; Garnier-Géré, Pauline H
2008-02-01
The importance of natural selection for shaping adaptive trait differentiation among natural populations of allogamous tree species has long been recognized. Determining the molecular basis of local adaptation remains largely unresolved, and the respective roles of selection and demography in shaping population structure are actively debated. Using a multilocus scan that aims to detect outliers from simulated neutral expectations, we analyzed patterns of nucleotide diversity and genetic differentiation at 11 polymorphic candidate genes for drought stress tolerance in phenotypically contrasted Pinus pinaster Ait. populations across its geographical range. We compared 3 coalescent-based methods: 2 frequentist-like, including 1 approach specifically developed for biallelic single nucleotide polymorphisms (SNPs) here and 1 Bayesian. Five genes showed outlier patterns that were robust across methods at the haplotype level for 2 of them. Two genes presented higher F(ST) values than expected (PR-AGP4 and erd3), suggesting that they could have been affected by the action of diversifying selection among populations. In contrast, 3 genes presented lower F(ST) values than expected (dhn-1, dhn2, and lp3-1), which could represent signatures of homogenizing selection among populations. A smaller proportion of outliers were detected at the SNP level suggesting the potential functional significance of particular combinations of sites in drought-response candidate genes. The Bayesian method appeared robust to low sample sizes, flexible to assumptions regarding migration rates, and powerful for detecting selection at the haplotype level, but the frequentist-like method adapted to SNPs was more efficient for the identification of outlier SNPs showing low differentiation. Population-specific effects estimated in the Bayesian method also revealed populations with lower immigration rates, which could have led to favorable situations for local adaptation. Outlier patterns are discussed in relation to the different genes' putative involvement in drought tolerance responses, from published results in transcriptomics and association mapping in P. pinaster and other related species. These genes clearly constitute relevant candidates for future association studies in P. pinaster.
Almlöf, Jonas Carlsson; Lundmark, Per; Lundmark, Anders; Ge, Bing; Maouche, Seraya; Göring, Harald H. H.; Liljedahl, Ulrika; Enström, Camilla; Brocheton, Jessy; Proust, Carole; Godefroy, Tiphaine; Sambrook, Jennifer G.; Jolley, Jennifer; Crisp-Hihn, Abigail; Foad, Nicola; Lloyd-Jones, Heather; Stephens, Jonathan; Gwilliam, Rhian; Rice, Catherine M.; Hengstenberg, Christian; Samani, Nilesh J.; Erdmann, Jeanette; Schunkert, Heribert; Pastinen, Tomi; Deloukas, Panos; Goodall, Alison H.; Ouwehand, Willem H.; Cambien, François; Syvänen, Ann-Christine
2012-01-01
A large number of genome-wide association studies have been performed during the past five years to identify associations between SNPs and human complex diseases and traits. The assignment of a functional role for the identified disease-associated SNP is not straight-forward. Genome-wide expression quantitative trait locus (eQTL) analysis is frequently used as the initial step to define a function while allele-specific gene expression (ASE) analysis has not yet gained a wide-spread use in disease mapping studies. We compared the power to identify cis-acting regulatory SNPs (cis-rSNPs) by genome-wide allele-specific gene expression (ASE) analysis with that of traditional expression quantitative trait locus (eQTL) mapping. Our study included 395 healthy blood donors for whom global gene expression profiles in circulating monocytes were determined by Illumina BeadArrays. ASE was assessed in a subset of these monocytes from 188 donors by quantitative genotyping of mRNA using a genome-wide panel of SNP markers. The performance of the two methods for detecting cis-rSNPs was evaluated by comparing associations between SNP genotypes and gene expression levels in sample sets of varying size. We found that up to 8-fold more samples are required for eQTL mapping to reach the same statistical power as that obtained by ASE analysis for the same rSNPs. The performance of ASE is insensitive to SNPs with low minor allele frequencies and detects a larger number of significantly associated rSNPs using the same sample size as eQTL mapping. An unequivocal conclusion from our comparison is that ASE analysis is more sensitive for detecting cis-rSNPs than standard eQTL mapping. Our study shows the potential of ASE mapping in tissue samples and primary cells which are difficult to obtain in large numbers. PMID:23300628
Supramolecular Nanoparticles for Molecular Diagnostics and Therapeutics
NASA Astrophysics Data System (ADS)
Chen, Kuan-Ju
Over the past decades, significant efforts have been devoted to explore the use of various nanoparticle-based systems in the field of nanomedicine, including molecular imaging and therapy. Supramolecular synthetic approaches have attracted lots of attention due to their flexibility, convenience, and modularity for producing nanoparticles. In this dissertation, the developmental story of our size-controllable supramolecular nanoparticles (SNPs) will be discussed, as well as their use in specific biomedical applications. To achieve the self-assembly of SNPs, the well-characterized molecular recognition system (i.e., cyclodextrin/adamantane recognition) was employed. The resulting SNPs, which were assembled from three molecular building blocks, possess incredible stability in various physiological conditions, reversible size-controllability and dynamic disassembly that were exploited for various in vitro and in vivo applications. An advantage of using the supramolecular approach is that it enables the convenient incorporation of functional ligands onto SNP surface that confers functionality ( e.g., targeting, cell penetration) to SNPs. We utilized SNPs for molecular imaging such as magnetic resonance imaging (MRI) and positron emission tomography (PET) by introducing reporter systems (i.e., radio-isotopes, MR contrast agents, and fluorophores) into SNPs. On the other hand, the incorporation of various payloads, including drugs, genes and proteins, into SNPs showed improved delivery performance and enhanced therapeutic efficacy for these therapeutic agents. Leveraging the powers of (i) a combinatorial synthetic approach based on supramolecular assembly and (ii) a digital microreactor, a rapid developmental pathway was developed that is capable of screening SNP candidates for the ideal structural and functional properties that deliver optimal performance. Moreover, SNP-based theranostic delivery systems that combine reporter systems and therapeutic payloads into a single SNP for both diagnosis and therapy were generated. The results show that this type of theranostic SNPs may have a great contribution in the optimization of therapeutic efficacy for individual patients in clinical translation in the near future. It is anticipated that our supramolecular synthetic approach could be adopted to assemble various SNP-based delivery agents for molecular diagnostics and therapeutics that pave the way toward personalized medicine.
Burfeind, Kevin G; Murchison, Charles F; Westaway, Shawn K; Simon, Matthew J; Erten-Lyons, Deniz; Kaye, Jeffrey A; Quinn, Joseph F; Iliff, Jeffrey J
2017-09-01
The glymphatic system is a brain-wide perivascular network that facilitates clearance of proteins, including amyloid β, from the brain interstitium through the perivascular exchange of cerebrospinal fluid and interstitial fluid. The astrocytic water channel aquaporin-4 (AQP4) is required for glymphatic system function, and impairment of glymphatic function in the aging brain is associated with altered AQP4 expression and localization. In human cortical tissue, alterations in AQP4 expression and localization are associated with Alzheimer's disease (AD) status and pathology. Although this suggests a potential role for AQP4 in the development or progression of AD, the relationship between of naturally occurring variants in the human AQP4 gene and cognitive function has not yet been evaluated. Using data from several longitudinal aging cohorts, we investigated the association between five AQP4 single-nucleotide polymorphisms (SNPs) and the rate of cognitive decline in participants with a diagnosis of AD. None of the five SNPs were associated with different rates of AD diagnosis, age of dementia onset in trial subjects. No association between AQP4 SNPs with histological measures of AD pathology, including Braak stage or neuritic plaque density was observed. However, AQP4 SNPs were associated with altered rates of cognitive decline after AD diagnosis, with two SNPS (rs9951307 and rs3875089) associated with slower cognitive decline and two (rs3763040 and rs3763043) associated with more rapid cognitive decline after AD diagnosis. These results provide the first evidence that variations in the AQP4 gene, whose gene product AQP4 is vital for glymphatic pathway function, may modulate the progression of cognitive decline in AD.
High-Throughput Gene Mapping in Caenorhabditis elegans
Swan, Kathryn A.; Curtis, Damian E.; McKusick, Kathleen B.; Voinov, Alexander V.; Mapa, Felipa A.; Cancilla, Michael R.
2002-01-01
Positional cloning of mutations in model genetic systems is a powerful method for the identification of targets of medical and agricultural importance. To facilitate the high-throughput mapping of mutations in Caenorhabditis elegans, we have identified a further 9602 putative new single nucleotide polymorphisms (SNPs) between two C. elegans strains, Bristol N2 and the Hawaiian mapping strain CB4856, by sequencing inserts from a CB4856 genomic DNA library and using an informatics pipeline to compare sequences with the canonical N2 genomic sequence. When combined with data from other laboratories, our marker set of 17,189 SNPs provides even coverage of the complete worm genome. To date, we have confirmed >1099 evenly spaced SNPs (one every 91 ± 56 kb) across the six chromosomes and validated the utility of our SNP marker set and new fluorescence polarization-based genotyping methods for systematic and high-throughput identification of genes in C. elegans by cloning several proprietary genes. We illustrate our approach by recombination mapping and confirmation of the mutation in the cloned gene, dpy-18. [The sequence data described in this paper have been submitted to the NCBI dbSNP data library under accession nos. 4388625–4389689 and GenBank dbSTS under accession nos. 973810–974874. The following individuals and institutions kindly provided reagents, samples, or unpublished information as indicated in the paper: The C. elegans Sequencing Consortium and The Caenorhabditis Genetics Center.] PMID:12097347
Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
The genome-wide discovery and high-throughput genotyping of SNPs in chickpea natural germplasm lines is indispensable to extrapolate their natural allelic diversity, domestication, and linkage disequilibrium (LD) patterns leading to the genetic enhancement of this vital legume crop. We discovered 44,844 high-quality SNPs by sequencing of 93 diverse cultivated desi, kabuli, and wild chickpea accessions using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays that were physically mapped across eight chromosomes of desi and kabuli. Of these, 22,542 SNPs were structurally annotated in different coding and non-coding sequence components of genes. Genes with 3296 non-synonymous and 269 regulatory SNPs could functionally differentiate accessions based on their contrasting agronomic traits. A high experimental validation success rate (92%) and reproducibility (100%) along with strong sensitivity (93–96%) and specificity (99%) of GBS-based SNPs was observed. This infers the robustness of GBS as a high-throughput assay for rapid large-scale mining and genotyping of genome-wide SNPs in chickpea with sub-optimal use of resources. With 23,798 genome-wide SNPs, a relatively high intra-specific polymorphic potential (49.5%) and broader molecular diversity (13–89%)/functional allelic diversity (18–77%) was apparent among 93 chickpea accessions, suggesting their tremendous applicability in rapid selection of desirable diverse accessions/inter-specific hybrids in chickpea crossbred varietal improvement program. The genome-wide SNPs revealed complex admixed domestication pattern, extensive LD estimates (0.54–0.68) and extended LD decay (400–500 kb) in a structured population inclusive of 93 accessions. These findings reflect the utility of our identified SNPs for subsequent genome-wide association study (GWAS) and selective sweep-based domestication trait dissection analysis to identify potential genomic loci (gene-associated targets) specifically regulating important complex quantitative agronomic traits in chickpea. The numerous informative genome-wide SNPs, natural allelic diversity-led domestication pattern, and LD-based information generated in our study have got multidimensional applicability with respect to chickpea genomics-assisted breeding. PMID:25873920
A New Genomewide Association Meta-Analysis of Alcohol Dependence.
Zuo, Lingjun; Tan, Yunlong; Zhang, Xiangyang; Wang, Xiaoping; Krystal, John; Tabakoff, Boris; Zhong, Chunlong; Luo, Xingguang
2015-08-01
Conventional meta-analysis based on genetic markers may be less powerful for heterogeneous samples. In this study, we introduced a new meta-analysis for 4 genomewide association studies on alcohol dependence that integrated the information of putative causal variants. A total of 12,481 subjects in 4 independent cohorts were analyzed, including 1 European American cohort (1,409 cases with alcohol dependence and 1,518 controls), 1 European Australian cohort (a total of 6,438 family subjects with 1,645 probands), 1 African American cohort from SAGE + COGA (681 cases and 508 controls), and 1 African American cohort from Yale (1,429 cases and 498 controls). The genomewide association analysis was conducted for each cohort, and then, a new meta-analysis was performed to derive the combined p-values. cis-Acting expression of quantitative locus (cis-eQTL) analysis of each risk variant in human tissues and RNA expression analysis of each risk gene in rat brain served as functional validation. In meta-analysis of European American and European Australian cohorts, we found 10 top-ranked single nucleotide polymorphisms (SNPs) (p < 10(-6) ) that were associated with alcohol dependence. They included 6 at SERINC2 (3.1 × 10(-8) ≤ p ≤ 9.6 × 10(-8) ), 1 at STK40 (p = 1.3 × 10(-7) ), 2 at KIAA0040 (3.3 × 10(-7) ≤ p ≤ 5.2 × 10(-7) ), and 1 at IPO11 (p = 6.9 × 10(-7) ). In meta-analysis of 2 African American cohorts, we found 2 top-ranked SNPs including 1 at SLC6A11 (p = 2.7 × 10(-7) ) and 1 at CBLN2 (p = 7.4 × 10(-7) ). In meta-analysis of all 4 cohorts, we found 2 top-ranked SNPs in PTP4A1-PHF3 locus (6.0 × 10(-7) ≤ p ≤ 7.2 × 10(-7) ). In an African American cohort only, we found 1 top-ranked SNP at PLD1 (p = 8.3 × 10(-7) ; OR = 1.56). Many risk SNPs had positive cis-eQTL signals, and all these risk genes except KIAA0040 were found to express in both rat and mouse brains. We found multiple genes that were significantly or suggestively associated with alcohol dependence. They are among the most appropriate for follow-up as contributors to risk for alcohol dependence. Copyright © 2015 by the Research Society on Alcoholism.
Rašić, Gordana; Filipović, Igor; Weeks, Andrew R; Hoffmann, Ary A
2014-04-11
Genetic markers are widely used to understand the biology and population dynamics of disease vectors, but often markers are limited in the resolution they provide. In particular, the delineation of population structure, fine scale movement and patterns of relatedness are often obscured unless numerous markers are available. To address this issue in the major arbovirus vector, the yellow fever mosquito (Aedes aegypti), we used double digest Restriction-site Associated DNA (ddRAD) sequencing for the discovery of genome-wide single nucleotide polymorphisms (SNPs). We aimed to characterize the new SNP set and to test the resolution against previously described microsatellite markers in detecting broad and fine-scale genetic patterns in Ae. aegypti. We developed bioinformatics tools that support the customization of restriction enzyme-based protocols for SNP discovery. We showed that our approach for RAD library construction achieves unbiased genome representation that reflects true evolutionary processes. In Ae. aegypti samples from three continents we identified more than 18,000 putative SNPs. They were widely distributed across the three Ae. aegypti chromosomes, with 47.9% found in intergenic regions and 17.8% in exons of over 2,300 genes. Pattern of their imputed effects in ORFs and UTRs were consistent with those found in a recent transcriptome study. We demonstrated that individual mosquitoes from Indonesia, Australia, Vietnam and Brazil can be assigned with a very high degree of confidence to their region of origin using a large SNP panel. We also showed that familial relatedness of samples from a 0.4 km2 area could be confidently established with a subset of SNPs. Using a cost-effective customized RAD sequencing approach supported by our bioinformatics tools, we characterized over 18,000 SNPs in field samples of the dengue fever mosquito Ae. aegypti. The variants were annotated and positioned onto the three Ae. aegypti chromosomes. The new SNP set provided much greater resolution in detecting population structure and estimating fine-scale relatedness than a set of polymorphic microsatellites. RAD-based markers demonstrate great potential to advance our understanding of mosquito population processes, critical for implementing new control measures against this major disease vector.
Darabi, Hatef; McCue, Karen; Beesley, Jonathan; Michailidou, Kyriaki; Nord, Silje; Kar, Siddhartha; Humphreys, Keith; Thompson, Deborah; Ghoussaini, Maya; Bolla, Manjeet K; Dennis, Joe; Wang, Qin; Canisius, Sander; Scott, Christopher G; Apicella, Carmel; Hopper, John L; Southey, Melissa C; Stone, Jennifer; Broeks, Annegien; Schmidt, Marjanka K; Scott, Rodney J; Lophatananon, Artitaya; Muir, Kenneth; Beckmann, Matthias W; Ekici, Arif B; Fasching, Peter A; Heusinger, Katharina; Dos-Santos-Silva, Isabel; Peto, Julian; Tomlinson, Ian; Sawyer, Elinor J; Burwinkel, Barbara; Marme, Frederik; Guénel, Pascal; Truong, Thérèse; Bojesen, Stig E; Flyger, Henrik; Benitez, Javier; González-Neira, Anna; Anton-Culver, Hoda; Neuhausen, Susan L; Arndt, Volker; Brenner, Hermann; Engel, Christoph; Meindl, Alfons; Schmutzler, Rita K; Arnold, Norbert; Brauch, Hiltrud; Hamann, Ute; Chang-Claude, Jenny; Khan, Sofia; Nevanlinna, Heli; Ito, Hidemi; Matsuo, Keitaro; Bogdanova, Natalia V; Dörk, Thilo; Lindblom, Annika; Margolin, Sara; Kosma, Veli-Matti; Mannermaa, Arto; Tseng, Chiu-Chen; Wu, Anna H; Floris, Giuseppe; Lambrechts, Diether; Rudolph, Anja; Peterlongo, Paolo; Radice, Paolo; Couch, Fergus J; Vachon, Celine; Giles, Graham G; McLean, Catriona; Milne, Roger L; Dugué, Pierre-Antoine; Haiman, Christopher A; Maskarinec, Gertraud; Woolcott, Christy; Henderson, Brian E; Goldberg, Mark S; Simard, Jacques; Teo, Soo H; Mariapun, Shivaani; Helland, Åslaug; Haakensen, Vilde; Zheng, Wei; Beeghly-Fadiel, Alicia; Tamimi, Rulla; Jukkola-Vuorinen, Arja; Winqvist, Robert; Andrulis, Irene L; Knight, Julia A; Devilee, Peter; Tollenaar, Robert A E M; Figueroa, Jonine; García-Closas, Montserrat; Czene, Kamila; Hooning, Maartje J; Tilanus-Linthorst, Madeleine; Li, Jingmei; Gao, Yu-Tang; Shu, Xiao-Ou; Cox, Angela; Cross, Simon S; Luben, Robert; Khaw, Kay-Tee; Choi, Ji-Yeob; Kang, Daehee; Hartman, Mikael; Lim, Wei Yen; Kabisch, Maria; Torres, Diana; Jakubowska, Anna; Lubinski, Jan; McKay, James; Sangrajrang, Suleeporn; Toland, Amanda E; Yannoukakos, Drakoulis; Shen, Chen-Yang; Yu, Jyh-Cherng; Ziogas, Argyrios; Schoemaker, Minouk J; Swerdlow, Anthony; Borresen-Dale, Anne-Lise; Kristensen, Vessela; French, Juliet D; Edwards, Stacey L; Dunning, Alison M; Easton, Douglas F; Hall, Per; Chenevix-Trench, Georgia
2015-07-02
Genome-wide association studies have identified SNPs near ZNF365 at 10q21.2 that are associated with both breast cancer risk and mammographic density. To identify the most likely causal SNPs, we fine mapped the association signal by genotyping 428 SNPs across the region in 89,050 European and 12,893 Asian case and control subjects from the Breast Cancer Association Consortium. We identified four independent sets of correlated, highly trait-associated variants (iCHAVs), three of which were located within ZNF365. The most strongly risk-associated SNP, rs10995201 in iCHAV1, showed clear evidence of association with both estrogen receptor (ER)-positive (OR = 0.85 [0.82-0.88]) and ER-negative (OR = 0.87 [0.82-0.91]) disease, and was also the SNP most strongly associated with percent mammographic density. iCHAV2 (lead SNP, chr10: 64,258,684:D) and iCHAV3 (lead SNP, rs7922449) were also associated with ER-positive (OR = 0.93 [0.91-0.95] and OR = 1.06 [1.03-1.09]) and ER-negative (OR = 0.95 [0.91-0.98] and OR = 1.08 [1.04-1.13]) disease. There was weaker evidence for iCHAV4, located 5' of ADO, associated only with ER-positive breast cancer (OR = 0.93 [0.90-0.96]). We found 12, 17, 18, and 2 candidate causal SNPs for breast cancer in iCHAVs 1-4, respectively. Chromosome conformation capture analysis showed that iCHAV2 interacts with the ZNF365 and NRBF2 (more than 600 kb away) promoters in normal and cancerous breast epithelial cells. Luciferase assays did not identify SNPs that affect transactivation of ZNF365, but identified a protective haplotype in iCHAV2, associated with silencing of the NRBF2 promoter, implicating this gene in the etiology of breast cancer. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Pelov, Ilana; Teltsh, Omri; Greenbaum, Lior; Rigbi, Amihai; Kanyas-Sarner, Kyra; Lerer, Bernard; Lombroso, Paul; Kohn, Yoav
2013-01-01
Objective STriatal-Enriched protein tyrosine Phosphatase (STEP) is a brain-specific member of the PTP family that has been implicated in learning and memory. In this study, we examined the association of the PTPN5 (protein-tyrosine-phosphatase non-receptor 5) gene, which encodes for STEP, with both schizophrenia and cognitive functioning in the Israeli Jewish population. Methods A 868 subjects schizophrenia (SZ) case-control study was performed (286 cases and 582 controls). Eleven STEP tagging SNPs were selected, and single markers and haplotypes association analyses were performed. A cognitive variability study included 437 healthy females who completed a computerized cognitive battery. We performed univariate associations between the SNPs and cognitive performance. The possible functional role of these variants was examined by studying their association with gene expression levels in the brain. Results In the SZ study, we found nominal association in the whole sample between rs4075664 and SZ. SZ males showed a more significant association for 3 SNPs (rs4075664, rs2278732, rs4757710). Haplotypes of the studied SNPs were associated with SZ both in the overall sample and within the male sub-sample. Expression analysis provided some support for the effects of the associated SNPs on PTPN5 expression level. The cognitive variability study showed positive associations between PTPN5 SNPs and different cognitive subtests. Principal component analysis demonstrated an “Attention Index” neurocognitive component that was associated with two SNP pairs (rs10832983*rs10766504 and rs7932938*rs4757718). Conclusion The results imply a model in which PTPN5 may play a role in normal cognitive functioning and contributes to aspects of the neuropathology of schizophrenia. PMID:22555153
Valle, Anne M.; Radić, Zoran; Rana, Brinda K.; Mahboubi, Vafa; Wessel, Jennifer; Shih, Pei-an Betty; Rao, Fangwen; O'Connor, Daniel T.
2011-01-01
Cholinergic neurotransmission in the central and autonomic nervous systems regulates immediate variations in and longer-term maintenance of cardiovascular function with acetylcholinesterase (AChE) activity that is critical to temporal responsiveness. Butyrylcholinesterase (BChE), largely confined to the liver and plasma, subserves metabolic functions. AChE and BChE are found in hematopoietic cells and plasma, enabling one to correlate enzyme levels in whole blood with hereditary traits in twins. Using both twin and unrelated subjects, we found certain single nucleotide polymorphisms (SNPs) in the ACHE gene correlated with catalytic properties and general cardiovascular functions. SNP discovery from ACHE resequencing identified 19 SNPs: 7 coding SNPs (cSNPs), of which 4 are nonsynonymous, and 12 SNPs in untranslated regions, of which 3 are in a conserved sequence of an upstream intron. Both AChE and BChE activity traits in blood were heritable: AChE at 48.8 ± 6.1% and BChE at 81.4 ± 2.8%. Allelic and haplotype variations in the ACHE and BCHE genes were associated with changes in blood AChE and BChE activities. AChE activity was associated with BP status and SBP, whereas BChE activity was associated with features of the metabolic syndrome (especially body weight and BMI). Gene products from cDNAs with nonsynonymous cSNPs were expressed and purified. Protein expression of ACHE nonsynonymous variant D134H (SNP6) is impaired: this variant shows compromised stability and altered rates of organophosphate inhibition and oxime-assisted reactivation. A substantial fraction of the D134H instability could be reversed in the D134H/R136Q mutant. Hence, common genetic variations at ACHE and BCHE loci were associated with changes in corresponding enzymatic activities in blood. PMID:21493754
Hill, W D; Davies, G; Harris, S E; Hagenaars, S P; Liewald, D C; Penke, L; Gale, C R; Deary, I J
2016-12-13
Differences in general cognitive function have been shown to be partly heritable and to show genetic correlations with several psychiatric and physical disease states. However, to date, few single-nucleotide polymorphisms (SNPs) have demonstrated genome-wide significance, hampering efforts aimed at determining which genetic variants are most important for cognitive function and which regions drive the genetic associations between cognitive function and disease states. Here, we combine multiple large genome-wide association study (GWAS) data sets, from the CHARGE cognitive consortium (n=53 949) and UK Biobank (n=36 035), to partition the genome into 52 functional annotations and an additional 10 annotations describing tissue-specific histone marks. Using stratified linkage disequilibrium score regression we show that, in two measures of cognitive function, SNPs associated with cognitive function cluster in regions of the genome that are under evolutionary negative selective pressure. These conserved regions contained ~2.6% of the SNPs from each GWAS but accounted for ~40% of the SNP-based heritability. The results suggest that the search for causal variants associated with cognitive function, and those variants that exert a pleiotropic effect between cognitive function and health, will be facilitated by examining these enriched regions.
Hill, W D; Davies, G; Harris, S E; Hagenaars, S P; Davies, Gail; Deary, Ian J; Debette, Stephanie; Verbaas, Carla I; Bressler, Jan; Schuur, Maaike; Smith, Albert V; Bis, Joshua C; Bennett, David A; Ikram, M Arfan; Launer, Lenore J; Fitzpatrick, Annette L; Seshadri, Sudha; van Duijn, Cornelia M; Mosley Jr, Thomas H; Liewald, D C; Penke, L; Gale, C R; Deary, I J
2016-01-01
Differences in general cognitive function have been shown to be partly heritable and to show genetic correlations with several psychiatric and physical disease states. However, to date, few single-nucleotide polymorphisms (SNPs) have demonstrated genome-wide significance, hampering efforts aimed at determining which genetic variants are most important for cognitive function and which regions drive the genetic associations between cognitive function and disease states. Here, we combine multiple large genome-wide association study (GWAS) data sets, from the CHARGE cognitive consortium (n=53 949) and UK Biobank (n=36 035), to partition the genome into 52 functional annotations and an additional 10 annotations describing tissue-specific histone marks. Using stratified linkage disequilibrium score regression we show that, in two measures of cognitive function, SNPs associated with cognitive function cluster in regions of the genome that are under evolutionary negative selective pressure. These conserved regions contained ~2.6% of the SNPs from each GWAS but accounted for ~40% of the SNP-based heritability. The results suggest that the search for causal variants associated with cognitive function, and those variants that exert a pleiotropic effect between cognitive function and health, will be facilitated by examining these enriched regions. PMID:27959336
SNPdbe: constructing an nsSNP functional impacts database.
Schaefer, Christian; Meier, Alice; Rost, Burkhard; Bromberg, Yana
2012-02-15
Many existing databases annotate experimentally characterized single nucleotide polymorphisms (SNPs). Each non-synonymous SNP (nsSNP) changes one amino acid in the gene product (single amino acid substitution;SAAS). This change can either affect protein function or be neutral in that respect. Most polymorphisms lack experimental annotation of their functional impact. Here, we introduce SNPdbe-SNP database of effects, with predictions of computationally annotated functional impacts of SNPs. Database entries represent nsSNPs in dbSNP and 1000 Genomes collection, as well as variants from UniProt and PMD. SAASs come from >2600 organisms; 'human' being the most prevalent. The impact of each SAAS on protein function is predicted using the SNAP and SIFT algorithms and augmented with experimentally derived function/structure information and disease associations from PMD, OMIM and UniProt. SNPdbe is consistently updated and easily augmented with new sources of information. The database is available as an MySQL dump and via a web front end that allows searches with any combination of organism names, sequences and mutation IDs. http://www.rostlab.org/services/snpdbe.
Rand, Kristin A; Song, Chi; Dean, Eric; Serie, Daniel J; Curtin, Karen; Sheng, Xin; Hu, Donglei; Huff, Carol Ann; Bernal-Mizrachi, Leon; Tomasson, Michael H; Ailawadhi, Sikander; Singhal, Seema; Pawlish, Karen; Peters, Edward S; Bock, Cathryn H; Stram, Alex; Van Den Berg, David J; Edlund, Christopher K; Conti, David V; Zimmerman, Todd; Hwang, Amie E; Huntsman, Scott; Graff, John; Nooka, Ajay; Kong, Yinfei; Pregja, Silvana L; Berndt, Sonja I; Blot, William J; Carpten, John; Casey, Graham; Chu, Lisa; Diver, W Ryan; Stevens, Victoria L; Lieber, Michael R; Goodman, Phyllis J; Hennis, Anselm J M; Hsing, Ann W; Mehta, Jayesh; Kittles, Rick A; Kolb, Suzanne; Klein, Eric A; Leske, Cristina; Murphy, Adam B; Nemesure, Barbara; Neslund-Dudas, Christine; Strom, Sara S; Vij, Ravi; Rybicki, Benjamin A; Stanford, Janet L; Signorello, Lisa B; Witte, John S; Ambrosone, Christine B; Bhatti, Parveen; John, Esther M; Bernstein, Leslie; Zheng, Wei; Olshan, Andrew F; Hu, Jennifer J; Ziegler, Regina G; Nyante, Sarah J; Bandera, Elisa V; Birmann, Brenda M; Ingles, Sue A; Press, Michael F; Atanackovic, Djordje; Glenn, Martha J; Cannon-Albright, Lisa A; Jones, Brandt; Tricot, Guido; Martin, Thomas G; Kumar, Shaji K; Wolf, Jeffrey L; Deming Halverson, Sandra L; Rothman, Nathaniel; Brooks-Wilson, Angela R; Rajkumar, S Vincent; Kolonel, Laurence N; Chanock, Stephen J; Slager, Susan L; Severson, Richard K; Janakiraman, Nalini; Terebelo, Howard R; Brown, Elizabeth E; De Roos, Anneclaire J; Mohrbacher, Ann F; Colditz, Graham A; Giles, Graham G; Spinelli, John J; Chiu, Brian C; Munshi, Nikhil C; Anderson, Kenneth C; Levy, Joan; Zonder, Jeffrey A; Orlowski, Robert Z; Lonial, Sagar; Camp, Nicola J; Vachon, Celine M; Ziv, Elad; Stram, Daniel O; Hazelett, Dennis J; Haiman, Christopher A; Cozen, Wendy
2016-12-01
Genome-wide association studies (GWAS) in European populations have identified genetic risk variants associated with multiple myeloma. We performed association testing of common variation in eight regions in 1,318 patients with multiple myeloma and 1,480 controls of European ancestry and 1,305 patients with multiple myeloma and 7,078 controls of African ancestry and conducted a meta-analysis to localize the signals, with epigenetic annotation used to predict functionality. We found that variants in 7p15.3, 17p11.2, 22q13.1 were statistically significantly (P < 0.05) associated with multiple myeloma risk in persons of African ancestry and persons of European ancestry, and the variant in 3p22.1 was associated in European ancestry only. In a combined African ancestry-European ancestry meta-analysis, variation in five regions (2p23.3, 3p22.1, 7p15.3, 17p11.2, 22q13.1) was statistically significantly associated with multiple myeloma risk. In 3p22.1, the correlated variants clustered within the gene body of ULK4 Correlated variants in 7p15.3 clustered around an enhancer at the 3' end of the CDCA7L transcription termination site. A missense variant at 17p11.2 (rs34562254, Pro251Leu, OR, 1.32; P = 2.93 × 10 -7 ) in TNFRSF13B encodes a lymphocyte-specific protein in the TNF receptor family that interacts with the NF-κB pathway. SNPs correlated with the index signal in 22q13.1 cluster around the promoter and enhancer regions of CBX7 CONCLUSIONS: We found that reported multiple myeloma susceptibility regions contain risk variants important across populations, supporting the use of multiple racial/ethnic groups with different underlying genetic architecture to enhance the localization and identification of putatively functional alleles. A subset of reported risk loci for multiple myeloma has consistent effects across populations and is likely to be functional. Cancer Epidemiol Biomarkers Prev; 25(12); 1609-18. ©2016 AACR. ©2016 American Association for Cancer Research.
Bruce, H A; Kochunov, P; Paciga, S A; Hyde, C L; Chen, X; Xie, Z; Zhang, B; Xi, H S; O'Donnell, P; Whelan, C; Schubert, C R; Bellon, A; Ament, S A; Shukla, D K; Du, X; Rowland, L M; O'Neill, H; Hong, L E
2017-06-01
Patients with schizophrenia show decreased processing speed on neuropsychological testing and decreased white matter integrity as measured by diffusion tensor imaging, two traits shown to be both heritable and genetically associated indicating that there may be genes that influence both traits as well as schizophrenia disease risk. The potassium channel gene family is a reasonable candidate to harbor such a gene given the prominent role potassium channels play in the central nervous system in signal transduction, particularly in myelinated axons. We genotyped members of the large potassium channel gene family focusing on putatively functional single nucleotide polymorphisms (SNPs) in a population of 363 controls, 194 patients with schizophrenia spectrum disorder (SSD) and 28 patients with affective disorders with psychotic features who completed imaging and neuropsychological testing. We then performed three association analyses using three phenotypes - processing speed, whole-brain white matter fractional anisotropy (FA) and schizophrenia spectrum diagnosis. We extracted SNPs showing an association at a nominal P value of <0.05 with all three phenotypes in the expected direction: decreased processing speed, decreased FA and increased risk of SSD. A single SNP, rs8234, in the 3' untranslated region of voltage-gated potassium channel subfamily Q member 1 (KCNQ1) was identified. Rs8234 has been shown to affect KCNQ1 expression levels, and KCNQ1 levels have been shown to affect neuronal action potentials. This exploratory analysis provides preliminary data suggesting that KCNQ1 may contribute to the shared risk for diminished processing speed, diminished white mater integrity and increased risk of schizophrenia. © 2017 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Iskow, Rebecca C.; Austermann, Christian; Scharer, Christopher D.; Raj, Towfique; Boss, Jeremy M.; Sunyaev, Shamil; Price, Alkes; Stranger, Barbara; Simon, Viviana; Lee, Charles
2013-01-01
Ancient population structure shaping contemporary genetic variation has been recently appreciated and has important implications regarding our understanding of the structure of modern human genomes. We identified a ∼36-kb DNA segment in the human genome that displays an ancient substructure. The variation at this locus exists primarily as two highly divergent haplogroups. One of these haplogroups (the NE1 haplogroup) aligns with the Neandertal haplotype and contains a 4.6-kb deletion polymorphism in perfect linkage disequilibrium with 12 single nucleotide polymorphisms (SNPs) across diverse populations. The other haplogroup, which does not contain the 4.6-kb deletion, aligns with the chimpanzee haplotype and is likely ancestral. Africans have higher overall pairwise differences with the Neandertal haplotype than Eurasians do for this NE1 locus (p<10−15). Moreover, the nucleotide diversity at this locus is higher in Eurasians than in Africans. These results mimic signatures of recent Neandertal admixture contributing to this locus. However, an in-depth assessment of the variation in this region across multiple populations reveals that African NE1 haplotypes, albeit rare, harbor more sequence variation than NE1 haplotypes found in Europeans, indicating an ancient African origin of this haplogroup and refuting recent Neandertal admixture. Population genetic analyses of the SNPs within each of these haplogroups, along with genome-wide comparisons revealed significant FST (p = 0.00003) and positive Tajima's D (p = 0.00285) statistics, pointing to non-neutral evolution of this locus. The NE1 locus harbors no protein-coding genes, but contains transcribed sequences as well as sequences with putative regulatory function based on bioinformatic predictions and in vitro experiments. We postulate that the variation observed at this locus predates Human–Neandertal divergence and is evolving under balancing selection, especially among European populations. PMID:23593015
Polymorphisms in CTNNBL1 in relation to colorectal cancer with evolutionary implications
Huhn, Stefanie; Ingelfinger, Dierk; Bermejo, Justo Lorenzo; Bevier, Melanie; Pardini, Barbara; Naccarati, Alessio; Steinke, Verena; Rahner, Nils; Holinski-Feder, Elke; Morak, Monika; Schackert, Hans K; Görgens, Heike; Pox, Christian P; Goecke, Timm; Kloor, Matthias; Loeffler, Markus; Büttner, Reinhard; Vodickova, Ludmila; Novotny, Jan; Demir, Kubilay; Cruciat, Cristina-Maria; Renneberg, Rebecca; Huber, Wolfgang; Niehrs, Christof; Boutros, Michael; Propping, Peter; Vodička, Pavel; Hemminki, Kari; Försti, Asta
2011-01-01
Colorectal cancer (CRC) is a complex disease related to environmental and genetic risk factors. Several studies have shown that susceptibility to complex diseases can be mediated by ancestral alleles. Using RNAi screening, CTNNBL1 was identified as a putative regulator of the Wnt signaling pathway, which plays a key role in colorectal carcinogenesis. Recently, single nucleotide polymorphisms (SNPs) in CTNNBL1 have been associated with obesity, a known risk factor for CRC. We investigated whether genetic variation in CTNNBL1 affects susceptibility to CRC and tested for signals of recent selection. We applied a tagging SNP approach that cover all known common variation in CTNNBL1 (allele frequency >5%; r2>0.8). A case-control study was carried out using two well-characterized study populations: a hospital-based Czech population composed of 751 sporadic cases and 755 controls and a family/early onset-based German population (697 cases and 644 controls). Genotyping was performed using allele specific PCR based TaqMan® assays (Applied Biosystems, Weiterstadt, Germany). In the Czech cohort, containing sporadic cases, the ancestral alleles of three SNPs showed evidence of association with CRC: rs2344481 (OR 1.44, 95%CI 1.06-1.95, dominant model), rs2281148 (OR 0.59, 95%CI 0.36-0.96, dominant model) and rs2235460 (OR 1.38, 95%CI 1.01-1.89, AA vs. GG). The associations were less prominent in the family/early onset-based German cohort. Data derived from several databases and statistical tests consistently pointed to a likely shaping of CTNNBL1 by positive selection. Further studies are needed to identify the actual function of CTNNBL1 and to validate the association results in other populations. PMID:21537400
PHENOTYPIC AND GENETIC HETEROGENEITY AMONG SUBJECTS WITH MILD AIRFLOW OBSTRUCTION IN COPDGENE
Lee, Jin Hwa; Cho, Michael H.; McDonald, Merry-Lynn N.; Hersh, Craig P.; Castaldi, Peter J.; Crapo, James D.; Wan, Emily S.; Dy, Jennifer G.; Chang, Yale; Regan, Elizabeth A.; Hardin, Megan; DeMeo, Dawn L.; Silverman, Edwin K.
2014-01-01
Background Chronic obstructive pulmonary disease (COPD) is characterized by marked phenotypic heterogeneity. Most previous studies have focused on COPD subjects with FEV1 < 80% predicted. We investigated the clinical and genetic heterogeneity in subjects with mild airflow limitation in spirometry grade 1 defined by the Global Initiative for chronic Obstructive Lung Disease (GOLD 1). Methods Data from current and former smokers participating in the COPDGene Study (NCT00608764) were analyzed. K-means clustering was performed to explore subtypes within 794 GOLD 1 subjects. For all subjects with GOLD 1 and with each cluster, a genome-wide association study and candidate gene testing were performed using smokers with normal lung function as a control group. Combinations of COPD genome-wide significant single nucleotide polymorphisms (SNPs) were tested for association with FEV1 (% predicted) in GOLD 1 and in a combined group of GOLD1 and smoking control subjects. Results K-means clustering of GOLD 1 subjects identified putative “near-normal”, “airway-predominant”, “emphysema-predominant” and “lowest FEV1 % predicted” subtypes. In non-Hispanic whites, the only SNP nominally associated with GOLD 1 status relative to smoking controls was rs7671167 (FAM13A) in logistic regression models with adjustment for age, sex, pack-years of smoking, and genetic ancestry. The emphysema-predominant GOLD 1 cluster was nominally associated with rs7671167 (FAM13A) and rs161976 (BICD1). The lowest FEV1 % predicted cluster was nominally associated with rs1980057 (HHIP) and rs1051730 (CHRNA3). Combinations of COPD genome-wide significant SNPs were associated with FEV1 (% predicted) in a combined group of GOLD 1 and smoking control subjects. Conclusions Our results indicate that GOLD 1 subjects show substantial clinical heterogeneity, which is at least partially related to genetic heterogeneity. PMID:25154699
Phenotypic and genetic heterogeneity among subjects with mild airflow obstruction in COPDGene.
Lee, Jin Hwa; Cho, Michael H; McDonald, Merry-Lynn N; Hersh, Craig P; Castaldi, Peter J; Crapo, James D; Wan, Emily S; Dy, Jennifer G; Chang, Yale; Regan, Elizabeth A; Hardin, Megan; DeMeo, Dawn L; Silverman, Edwin K
2014-10-01
Chronic obstructive pulmonary disease (COPD) is characterized by marked phenotypic heterogeneity. Most previous studies have focused on COPD subjects with FEV1 < 80% predicted. We investigated the clinical and genetic heterogeneity in subjects with mild airflow limitation in spirometry grade 1 defined by the Global Initiative for chronic Obstructive Lung Disease (GOLD 1). Data from current and former smokers participating in the COPDGene Study (NCT00608764) were analyzed. K-means clustering was performed to explore subtypes within 794 GOLD 1 subjects. For all subjects with GOLD 1 and with each cluster, a genome-wide association study and candidate gene testing were performed using smokers with normal lung function as a control group. Combinations of COPD genome-wide significant single nucleotide polymorphisms (SNPs) were tested for association with FEV1 (% predicted) in GOLD 1 and in a combined group of GOLD 1 and smoking control subjects. K-means clustering of GOLD 1 subjects identified putative "near-normal", "airway-predominant", "emphysema-predominant" and "lowest FEV1% predicted" subtypes. In non-Hispanic whites, the only SNP nominally associated with GOLD 1 status relative to smoking controls was rs7671167 (FAM13A) in logistic regression models with adjustment for age, sex, pack-years of smoking, and genetic ancestry. The emphysema-predominant GOLD 1 cluster was nominally associated with rs7671167 (FAM13A) and rs161976 (BICD1). The lowest FEV1% predicted cluster was nominally associated with rs1980057 (HHIP) and rs1051730 (CHRNA3). Combinations of COPD genome-wide significant SNPs were associated with FEV1 (% predicted) in a combined group of GOLD 1 and smoking control subjects. Our results indicate that GOLD 1 subjects show substantial clinical heterogeneity, which is at least partially related to genetic heterogeneity. Copyright © 2014 Elsevier Ltd. All rights reserved.
Schousboe, Mette L.; Ranjitkar, Samir; Rajakaruna, Rupika S.; Amerasinghe, Priyanie H.; Morales, Francisco; Pearce, Richard; Ord, Rosalyn; Leslie, Toby; Rowland, Mark; Gadalla, Nahla B.; Konradsen, Flemming; Bygbjerg, Ib C.; Roper, Cally; Alifrangis, Michael
2015-01-01
Background Chloroquine combined with primaquine has been the recommended antimalarial treatment of Plasmodium vivax malaria infections for six decades but the efficacy of this treatment regimen is threatened by chloroquine resistance (CQR). Single nucleotide polymorphisms (SNPs) in the multidrug resistance gene, Pvmdr1 are putative determinants of CQR but the extent of their emergence at population level remains to be explored. Objective In this study we describe the prevalence of SNPs in the Pvmdr1 among samples collected in seven P. vivax endemic countries and we looked for molecular evidence of drug selection by characterising polymorphism at microsatellite (MS) loci flanking the Pvmdr1 gene. Methods We examined the prevalence of SNPs in the Pvmdr1 gene among 267 samples collected from Pakistan, Afghanistan, Sri Lanka, Nepal, Sudan, São Tomé and Ecuador. We measured and diversity in four microsatellite (MS) markers flanking the Pvmdr1 gene to look evidence of selection on mutant alleles. Results SNP polymorphism in the Pvmdr1 gene was largely confined to codons T958M, Y976F and F1076L. Only 2.4% of samples were wildtype at all three codons (TYF, n = 5), 13.3% (n = 28) of the samples were single mutant MYF, 63.0% of samples (n = 133) were double mutant MYL, and 21.3% (n = 45) were triple mutant MFL. Clear geographic differences in the prevalence of these Pvmdr mutation combinations were observed. Significant linkage disequilibrium (LD) between Pvmdr1 and MS alleles was found in populations sampled in Ecuador, Nepal and Sri Lanka, while significant LD between Pvmdr1 and the combined 4 MS locus haplotype was only seen in Ecuador and Sri Lanka. When combining the 5 loci, high level diversity, measured as expected heterozygosity (He), was seen in the complete sample set (He = 0.99), while He estimates for individual loci ranged from 0.00–0.93. Although Pvmdr1 haplotypes were not consistently associated with specific flanking MS alleles, there was significant differentiation between geographic sites which could indicate directional selection through local drug pressure. Conclusions Our observations suggest that Pvmdr1 mutations emerged independently on multiple occasions even within the same population. In Sri Lanka population analysis at multiple sites showed evidence of local selection and geographical dispersal of Pvmdr1 mutations between sites. PMID:26539821
Delgado-Lista, Javier; Perez-Martinez, Pablo; Solivera, Juan; Garcia-Rios, Antonio; Perez-Caballero, A I; Lovegrove, Julie A; Drevon, Christian A; Defoort, Catherine; Blaak, Ellen E; Dembinska-Kieć, Aldona; Risérus, Ulf; Herruzo-Gomez, Ezequiel; Camargo, Antonio; Ordovas, Jose M; Roche, Helen; Lopez-Miranda, José
2014-02-01
Metabolic syndrome (MetS) is a high-prevalence condition characterized by altered energy metabolism, insulin resistance, and elevated cardiovascular risk. Although many individual single nucleotide polymorphisms (SNPs) have been linked to certain MetS features, there are few studies analyzing the influence of SNPs on carbohydrate metabolism in MetS. A total of 904 SNPs (tag SNPs and functional SNPs) were tested for influence on 8 fasting and dynamic markers of carbohydrate metabolism, by performance of an intravenous glucose tolerance test in 450 participants in the LIPGENE study. From 382 initial gene-phenotype associations between SNPs and any phenotypic variables, 61 (16% of the preselected variables) remained significant after bootstrapping. Top SNPs affecting glucose metabolism variables were as follows: fasting glucose, rs26125 (PPARGC1B); fasting insulin, rs4759277 (LRP1); C-peptide, rs4759277 (LRP1); homeostasis assessment of insulin resistance, rs4759277 (LRP1); quantitative insulin sensitivity check index, rs184003 (AGER); sensitivity index, rs7301876 (ABCC9), acute insulin response to glucose, rs290481 (TCF7L2); and disposition index, rs12691 (CEBPA). We describe here the top SNPs linked to phenotypic features in carbohydrate metabolism among approximately 1000 candidate gene variations in fasting and postprandial samples of 450 patients with MetS from the LIPGENE study.
regSNPs: a strategy for prioritizing regulatory single nucleotide substitutions
Teng, Mingxiang; Ichikawa, Shoji; Padgett, Leah R.; Wang, Yadong; Mort, Matthew; Cooper, David N.; Koller, Daniel L.; Foroud, Tatiana; Edenberg, Howard J.; Econs, Michael J.; Liu, Yunlong
2012-01-01
Motivation: One of the fundamental questions in genetics study is to identify functional DNA variants that are responsible to a disease or phenotype of interest. Results from large-scale genetics studies, such as genome-wide association studies (GWAS), and the availability of high-throughput sequencing technologies provide opportunities in identifying causal variants. Despite the technical advances, informatics methodologies need to be developed to prioritize thousands of variants for potential causative effects. Results: We present regSNPs, an informatics strategy that integrates several established bioinformatics tools, for prioritizing regulatory SNPs, i.e. the SNPs in the promoter regions that potentially affect phenotype through changing transcription of downstream genes. Comparing to existing tools, regSNPs has two distinct features. It considers degenerative features of binding motifs by calculating the differences on the binding affinity caused by the candidate variants and integrates potential phenotypic effects of various transcription factors. When tested by using the disease-causing variants documented in the Human Gene Mutation Database, regSNPs showed mixed performance on various diseases. regSNPs predicted three SNPs that can potentially affect bone density in a region detected in an earlier linkage study. Potential effects of one of the variants were validated using luciferase reporter assay. Contact: yunliu@iupui.edu Supplementary information: Supplementary data are available at Bioinformatics online PMID:22611130
A genome-wide association meta-analysis on apolipoprotein A-IV concentrations
Lamina, Claudia; Friedel, Salome; Coassin, Stefan; Rueedi, Rico; Yousri, Noha A.; Seppälä, Ilkka; Gieger, Christian; Schönherr, Sebastian; Forer, Lukas; Erhart, Gertraud; Kollerits, Barbara; Marques-Vidal, Pedro; Ried, Janina; Waeber, Gerard; Bergmann, Sven; Dähnhardt, Doreen; Stöckl, Andrea; Kiechl, Stefan; Raitakari, Olli T.; Kähönen, Mika; Willeit, Johann; Kedenko, Ludmilla; Paulweber, Bernhard; Peters, Annette; Meitinger, Thomas; Strauch, Konstantin; Study Group, KORA; Lehtimäki, Terho; Hunt, Steven C.; Vollenweider, Peter; Kronenberg, Florian
2016-01-01
Apolipoprotein A-IV (apoA-IV) is a major component of HDL and chylomicron particles and is involved in reverse cholesterol transport. It is an early marker of impaired renal function. We aimed to identify genetic loci associated with apoA-IV concentrations and to investigate relationships with known susceptibility loci for kidney function and lipids. A genome-wide association meta-analysis on apoA-IV concentrations was conducted in five population-based cohorts (n = 13,813) followed by two additional replication studies (n = 2,267) including approximately 10 M SNPs. Three independent SNPs from two genomic regions were significantly associated with apoA-IV concentrations: rs1729407 near APOA4 (P = 6.77 × 10 − 44), rs5104 in APOA4 (P = 1.79 × 10−24) and rs4241819 in KLKB1 (P = 5.6 × 10−14). Additionally, a look-up of the replicated SNPs in downloadable GWAS meta-analysis results was performed on kidney function (defined by eGFR), HDL-cholesterol and triglycerides. From these three SNPs mentioned above, only rs1729407 showed an association with HDL-cholesterol (P = 7.1 × 10 − 07). Moreover, weighted SNP-scores were built involving known susceptibility loci for the aforementioned traits (53, 70 and 38 SNPs, respectively) and were associated with apoA-IV concentrations. This analysis revealed a significant and an inverse association for kidney function with apoA-IV concentrations (P = 5.5 × 10−05). Furthermore, an increase of triglyceride-increasing alleles was found to decrease apoA-IV concentrations (P = 0.0078). In summary, we identified two independent SNPs located in or next the APOA4 gene and one SNP in KLKB1. The association of KLKB1 with apoA-IV suggests an involvement of apoA-IV in renal metabolism and/or an interaction within HDL particles. Analyses of SNP-scores indicate potential causal effects of kidney function and by lesser extent triglycerides on apoA-IV concentrations. PMID:27412012
Shi, Wan; Quan, Mingyang; Du, Qingzhang; Zhang, Deqiang
2017-01-01
Long non-coding RNAs (lncRNAs) are important regulatory factors for plant growth and development, but little is known about the allelic interactions of lncRNAs with mRNA in perennial plants. Here, we analyzed the interaction of the NERD (Needed for RDR2-independent DNA methylation) Populus tomentosa gene PtoNERD with its putative regulator, the lncRNA NERDL (NERD-related lncRNA), which partially overlaps with the promoter region of this gene. Expression analysis in eight tissues showed a positive correlation between NERDL and PtoNERD (r = 0.62), suggesting that the interaction of NERDL with its putative target might be involved in wood formation. We conducted association mapping in a natural population of P. tomentosa (435 unrelated individuals) to evaluate genetic variation and the interaction of the lncRNA NERDL with PtoNERD. Using additive and dominant models, we identified 30 SNPs (P < 0.01) associated with five tree growth and wood property traits. Each SNP explained 3.90–8.57% of phenotypic variance, suggesting that NERDL and its putative target play a common role in wood formation. Epistasis analysis uncovered nine SNP-SNP association pairs between NERDL and PtoNERD, with an information gain of -7.55 to 2.16%, reflecting the strong interactions between NERDL and its putative target. This analysis provides a powerful method for deciphering the genetic interactions of lncRNAs with mRNA and dissecting the complex genetic network of quantitative traits in trees. PMID:28674544
Bajaj, Deepak; Das, Shouvik; Badoni, Saurabh; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We identified 82489 high-quality genome-wide SNPs from 93 wild and cultivated Cicer accessions through integrated reference genome- and de novo-based GBS assays. High intra- and inter-specific polymorphic potential (66–85%) and broader natural allelic diversity (6–64%) detected by genome-wide SNPs among accessions signify their efficacy for monitoring introgression and transferring target trait-regulating genomic (gene) regions/allelic variants from wild to cultivated Cicer gene pools for genetic improvement. The population-specific assignment of wild Cicer accessions pertaining to the primary gene pool are more influenced by geographical origin/phenotypic characteristics than species/gene-pools of origination. The functional significance of allelic variants (non-synonymous and regulatory SNPs) scanned from transcription factors and stress-responsive genes in differentiating wild accessions (with potential known sources of yield-contributing and stress tolerance traits) from cultivated desi and kabuli accessions, fine-mapping/map-based cloning of QTLs and determination of LD patterns across wild and cultivated gene-pools are suitably elucidated. The correlation between phenotypic (agromorphological traits) and molecular diversity-based admixed domestication patterns within six structured populations of wild and cultivated accessions via genome-wide SNPs was apparent. This suggests utility of whole genome SNPs as a potential resource for identifying naturally selected trait-regulating genomic targets/functional allelic variants adaptive to diverse agroclimatic regions for genetic enhancement of cultivated gene-pools. PMID:26208313
Vicchio, Teresa Manuela; Giovinazzo, Salvatore; Certo, Rosaria; Cucinotta, Mariapaola; Micali, Carmelo; Baldari, Sergio; Benvenga, Salvatore; Trimarchi, Francesco; Campennì, Alfredo; Ruggeri, Rosaria Maddalena
2014-07-01
Mutations of the thyrotropin receptor (TSHR) and/or Gαs gene have been found in a number of, but not all, autonomously functioning thyroid nodules (AFTNs). Recently, in a 15-year-old girl with a hyperfunctioning papillary thyroid carcinoma, we found two somatic and germline single nucleotide polymorphisms (SNPs): a SNP of the TSHR gene (exon 7, codon 187) and a SNP of Gαs gene (exon 8, codon 185). The same silent SNP of the TSHR gene had been reported in patients with AFTN or familial non-autoimmune hyperthyroidism. No further data about the prevalence of the two SNPs in AFTNs as well as in the general population are available in the literature. To clarify the possible role of these SNPs in predisposing to AFTN. Germline DNA was extracted from blood leukocytes of 115 patients with AFTNs (43 males and 72 females, aged 31-85 years, mean ± SD = 64 ± 13) and 100 sex-matched healthy individuals from the same geographic area, which is marginally iodine deficient. The genotype distribution of the two SNPs was investigated by restriction fragment length polymorphism-polymerase chain reaction. The prevalence of the two SNPs in our study population was low and not different to that found in healthy individuals: 8 % of patients vs. 9 % of controls were heterozygous for the TSHR SNP and 4 % patients vs. 6 % controls were heterozygous for the Gαs SNP. One patient harbored both SNPs. These results suggest that these two SNPs do not confer susceptibility for the development of AFTN.
Single nucleotide polymorphisms for feed efficiency and performance in crossbred beef cattle
2014-01-01
Background This study was conducted to: (1) identify new SNPs for residual feed intake (RFI) and performance traits within candidate genes identified in a genome wide association study (GWAS); (2) estimate the proportion of variation in RFI explained by the detected SNPs; (3) estimate the effects of detected SNPs on carcass traits to avoid undesirable correlated effects on these economically important traits when selecting for feed efficiency; and (4) map the genes to biological mechanisms and pathways. A total number of 339 SNPs corresponding to 180 genes were tested for association with phenotypes using a single locus regression (SLRM) and genotypic model on 726 and 990 crossbred animals for feed efficiency and carcass traits, respectively. Results Strong evidence of associations for RFI were located on chromosomes 8, 15, 16, 18, 19, 21, and 28. The strongest association with RFI (P = 0.0017) was found with a newly discovered SNP located on BTA 8 within the ELP3 gene. SNPs rs41820824 and rs41821600 on BTA 16 within the gene HMCN1 were strongly associated with RFI (P = 0.0064 and P = 0.0033, respectively). A SNP located on BTA 18 within the ZNF423 gene provided strong evidence for association with RFI (P = 0.0028). Genomic estimated breeding values (GEBV) from 98 significant SNPs were moderately correlated (0.47) to the estimated breeding values (EBVs) from a mixed animal model. The significant (P < 0.05) SNPs (98) explained 26% of the genetic variance for RFI. In silico functional analysis for the genes suggested 35 and 39 biological processes and pathways, respectively for feed efficiency traits. Conclusions This study identified several positional and functional candidate genes involved in important biological mechanisms associated with feed efficiency and performance. Significant SNPs should be validated in other populations to establish their potential utilization in genetic improvement programs. PMID:24476087
Chen, Jinyun; Wu, Xifeng; Huang, Yujing; Chen, Wei; Brand, Randall E.; Killary, Ann M.; Sen, Subrata; Frazier, Marsha L.
2016-01-01
Biomarkers are critically needed for the early detection of pancreatic cancer (PC) are urgently needed. Our purpose was to identify a panel of genetic variants that, combined, can predict increased risk for early-onset PC and thereby identify individuals who should begin screening at an early age. Previously, we identified genes using a functional genomic approach that were aberrantly expressed in early pathways to PC tumorigenesis. We now report the discovery of single nucleotide polymorphisms (SNPs) in these genes associated with early age at diagnosis of PC using a two-phase study design. In silico and bioinformatics tools were used to examine functional relevance of the identified SNPs. Eight SNPs were consistently associated with age at diagnosis in the discovery phase, validation phase and pooled analysis. Further analysis of the joint effects of these 8 SNPs showed that, compared to participants carrying none of these unfavorable genotypes (median age at PC diagnosis 70 years), those carrying 1–2, 3–4, or 5 or more unfavorable genotypes had median ages at diagnosis of 64, 63, and 62 years, respectively (P = 3.0E–04). A gene-dosage effect was observed, with age at diagnosis inversely related to number of unfavorable genotypes (Ptrend = 1.0E–04). Using bioinformatics tools, we found that all of the 8 SNPs were predicted to play functional roles in the disruption of transcription factor and/or enhancer binding sites and most of them were expression quantitative trait loci (eQTL) of the target genes. The panel of genetic markers identified may serve as susceptibility markers for earlier PC diagnosis. PMID:27486767
Associations of IL6 polymorphisms with lung function decline and COPD
He, Jian-Qing; Foreman, Marilyn G.; Shumansky, Karey; Zhang, Xuekui; Akhabir, Loubna; Sin, Don D; Man, S F Paul; DeMeo, Dawn L.; Litonjua, Augusto A.; Silverman, Edwin K.; Connett, John E; Anthonisen, Nicholas R; Wise, Robert A; Paré, Peter D; Sandford, Andrew J
2010-01-01
Background Interleukin-6 (IL6) is a pleiotropic pro-inflammatory and immunomodulatory cytokine which likely plays an important role in the pathogenesis of COPD. There is a functional single nucleotide polymorphism (SNP), −174G/C, in the promoter region of IL6. We hypothesized that IL6 SNPs influence susceptibility for impaired lung function and COPD in smokers. Methods Seven and 5 SNPs in IL6 were genotyped in two nested case-control samples derived from the Lung Health Study (LHS) based on phenotypes of rate of decline of forced expiratory volume in one second (FEV1) over 5 years and baseline FEV1 at the beginning of the LHS. Serum IL6 concentrations were measured for all subjects. A partially overlapping panel of 9 IL6 SNPs was genotyped in 389 COPD cases from the National Emphysema Treatment Trial (NETT) and 420 controls from the Normative Aging Study (NAS). Results In the LHS, three IL6 SNPs were associated with FEV1 decline (0.023 ≤ P ≤ 0.041 in additive models). Among them the IL6_−174C allele was associated with rapid decline of lung function. The association was more significant in a genotype-based analysis (P = 0.006). In the NETT-NAS study, IL6_−174G/C and four other IL6 SNPs, all of which are in linkage disequilibrium with IL6_−174G/C, were associated with susceptibility to COPD (0.01 ≤ P ≤ 0.04 in additive genetic models). Conclusion Our results suggest that the IL6_−174G/C SNP is associated with rapid decline of FEV1 and susceptibility to COPD in smokers. PMID:19359268
Sattler, Ursula; Khosravi, Mojtaba; Avila, Mislay; Pilo, Paola; Langedijk, Johannes P; Ader-Ebert, Nadine; Alves, Lisa A; Plattet, Philippe; Origgi, Francesco C
2014-07-01
The hemagglutinin (H) gene of canine distemper virus (CDV) encodes the receptor-binding protein. This protein, together with the fusion (F) protein, is pivotal for infectivity since it contributes to the fusion of the viral envelope with the host cell membrane. Of the two receptors currently known for CDV (nectin-4 and the signaling lymphocyte activation molecule [SLAM]), SLAM is considered the most relevant for host susceptibility. To investigate how evolution might have impacted the host-CDV interaction, we examined the functional properties of a series of missense single nucleotide polymorphisms (SNPs) naturally accumulating within the H-gene sequences during the transition between two distinct but related strains. The two strains, a wild-type strain and a consensus strain, were part of a single continental outbreak in European wildlife and occurred in distinct geographical areas 2 years apart. The deduced amino acid sequence of the two H genes differed at 5 residues. A panel of mutants carrying all the combinations of the SNPs was obtained by site-directed mutagenesis. The selected mutant, wild type, and consensus H proteins were functionally evaluated according to their surface expression, SLAM binding, fusion protein interaction, and cell fusion efficiencies. The results highlight that the most detrimental functional effects are associated with specific sets of SNPs. Strikingly, an efficient compensational system driven by additional SNPs appears to come into play, virtually neutralizing the negative functional effects. This system seems to contribute to the maintenance of the tightly regulated function of the H-gene-encoded attachment protein. Importance: To investigate how evolution might have impacted the host-canine distemper virus (CDV) interaction, we examined the functional properties of naturally occurring single nucleotide polymorphisms (SNPs) in the hemagglutinin gene of two related but distinct strains of CDV. The hemagglutinin gene encodes the attachment protein, which is pivotal for infection. Our results show that few SNPs have a relevant detrimental impact and they generally appear in specific combinations (molecular signatures). These drastic negative changes are neutralized by compensatory mutations, which contribute to maintenance of an overall constant bioactivity of the attachment protein. This compensational mechanism might reflect the reaction of the CDV machinery to the changes occurring in the virus following antigenic variations critical for virulence. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Pavy, Nathalie; Parsons, Lee S; Paule, Charles; MacKay, John; Bousquet, Jean
2006-01-01
Background High-throughput genotyping technologies represent a highly efficient way to accelerate genetic mapping and enable association studies. As a first step toward this goal, we aimed to develop a resource of candidate Single Nucleotide Polymorphisms (SNP) in white spruce (Picea glauca [Moench] Voss), a softwood tree of major economic importance. Results A white spruce SNP resource encompassing 12,264 SNPs was constructed from a set of 6,459 contigs derived from Expressed Sequence Tags (EST) and by using the bayesian-based statistical software PolyBayes. Several parameters influencing the SNP prediction were analysed including the a priori expected polymorphism, the probability score (PSNP), and the contig depth and length. SNP detection in 3' and 5' reads from the same clones revealed a level of inconsistency between overlapping sequences as low as 1%. A subset of 245 predicted SNPs were verified through the independent resequencing of genomic DNA of a genotype also used to prepare cDNA libraries. The validation rate reached a maximum of 85% for SNPs predicted with either PSNP ≥ 0.95 or ≥ 0.99. A total of 9,310 SNPs were detected by using PSNP ≥ 0.95 as a criterion. The SNPs were distributed among 3,590 contigs encompassing an array of broad functional categories, with an overall frequency of 1 SNP per 700 nucleotide sites. Experimental and statistical approaches were used to evaluate the proportion of paralogous SNPs, with estimates in the range of 8 to 12%. The 3,789 coding SNPs identified through coding region annotation and ORF prediction, were distributed into 39% nonsynonymous and 61% synonymous substitutions. Overall, there were 0.9 SNP per 1,000 nonsynonymous sites and 5.2 SNPs per 1,000 synonymous sites, for a genome-wide nonsynonymous to synonymous substitution rate ratio (Ka/Ks) of 0.17. Conclusion We integrated the SNP data in the ForestTreeDB database along with functional annotations to provide a tool facilitating the choice of candidate genes for mapping purposes or association studies. PMID:16824208
Rankinen, Tuomo; Argyropoulos, George; Rice, Treva; Rao, D C; Bouchard, Claude
2010-06-01
A genome-wide linkage scan identified a quantitative trait locus for exercise training-induced changes in submaximal exercise (50 W) heart rate (DeltaHR50) on chromosome 2q33.3-q34 in the HERITAGE Family Study (n=472). To fine-map the region, 1450 tag SNPs were genotyped between 205 and 215 Mb on chromosome 2. The strongest evidence of association with DeltaHR50 was observed with 2 single-nucleotide polymorphisms (SNPs) located in the 5' region of the cAMP-responsive element-binding protein 1 (CREB1) gene (rs2253206: P=1.6x10(-5) and rs2360969: P=4.3x10(-5)). The associations remained significant (P=0.01 and P=0.023, respectively) after accounting for multiple testing. Regression modeling of the 39 most significant SNPs in the single-SNP analysis identified 9 SNPs that collectively explained 20% of the DeltaHR50 variance. CREB1 SNP rs2253206 had the strongest effect (5.45% of variance), followed by SNPs in the FASTKD2 (3.1%), MAP2 (2.6%), SPAG16 (2.1%), ERBB4 (3 SNPs approximately 1.4% each), IKZF2 (1.4%), and PARD3B (1.0%) loci. In conditional linkage analysis, 6 SNPs from the final regression model (CREB1, FASTKD2, MAP2, ERBB4, IKZF2, and PARD3B) accounted for the original linkage signal: The log of the odds score dropped from 2.10 to 0.41 after adjusting for all 6 SNPs. Functional studies revealed that the common allele of rs2253206 exhibits significantly (P<0.05) lower promoter activity than the minor allele. Our data suggest that functional DNA sequence variation in the CREB1 locus is strongly associated with DeltaHR50 and explains a considerable proportion of the quantitative trait locus variance. However, at least 5 additional SNPs seem to be required to fully account for the original linkage signal.
Rankinen, Tuomo; Argyropoulos, George; Rice, Treva; Rao, D.C.; Bouchard, Claude
2011-01-01
Background A genome-wide linkage scan identified a quantitative trait locus (QTL) for exercise training-induced changes in submaximal exercise (50W) heart rate (ΔHR50) on chromosome 2q33.3-q34 in the HERITAGE Family Study (N=472). Methods and Results To fine map the region, 1,450 tagSNPs were genotyped between 205 and 215 Mb on chromosome 2. The strongest evidence of association with ΔHR50 was observed with two SNPs located in the 5′ region of the cAMP responsive element binding protein 1 (CREB1) gene (rs2253206: p=1.6×10−5 and rs2360969: p=4.3×10−5). The associations remained significant (p=0.01 and p=0.023, respectively) after accounting for multiple testing. Regression modeling of the 39 most significant SNPs in the single-SNP analyses identified nine SNPs that collectively explained 20% of the ΔHR50 variance. CREB1 SNP rs2253206 had the strongest effect (5.45% of variance), followed by SNPs in the FASTKD2 (3.1%), MAP2 (2.6%), SPAG16 (2.1%), ERBB4 (3 SNPs ~1.4% each), IKZF2 (1.4%), and PARD3B (1.0%) loci. In conditional linkage analysis, six SNPs from the final regression model (CREB1, FASTKD2, MAP2, ERBB4, IKZF2, and PARD3B) accounted for the original linkage signal: the LOD score dropped from 2.10 to 0.41 after adjusting for all six SNPs. Functional studies revealed that the common allele of rs2253206 exhibits significantly (p<0.05) lower promoter activity than the minor allele. Conclusions Our data suggest that functional DNA sequence variation in the CREB1 locus is strongly associated with ΔHR50 and explains considerable proportion of the QTL variance. However, at least five additional SNPs seem to be required to fully account for the original linkage signal. PMID:20407090
Sun, Xiaomei; Li, Mingxun; Hao, Dan; Hua, Liushuai; Lan, Xianyong; Lei, Chuzhao; Hu, Shenrong; Qi, Xinglei; Chen, Hong
2015-03-01
Identification of polymorphisms associated with economic traits is important for successful marker-assisted selection in cattle breeding. The family of mammalian sirtuin regulates many biological functions, such as life span extension and energy metabolism. SIRT2, a most abundant sirtuin in adipocytes, acts as a crucial regulator of adipogenic differentiation and plays a key role in controlling adipose tissue function and mass. Here we investigated single nucleotide polymorphisms (SNPs) of bovine SIRT2 in 1226 cattle from five breeds and further evaluated the effects of identified SNPs on economically important traits of Nanyang cattle. Our results revealed four novel SNPs in bovine SIRT2, one was located in intronic region and the other three were synonymous mutations. Linkage disequilibrium and haplotype analyses based on the identified SNPs showed obvious difference between crossbred breed and the other four beef breeds. Association analyses demonstrated that SNPs g.17333C > T and g.17578A > G have a significantly effect on 18-months-old body weight of Nanyang population. Animals with combined genotype TTGG at the above two loci exhibited especially higher body weight. Our data for the first time demonstrated that polymorphisms in bovine SIRT2 are associated with economic traits of Nanyang cattle, which will be helpful for future cattle selection practices.
Janse, Marcel; Lamberts, Laetitia E; Franke, Lude; Raychaudhuri, Soumya; Ellinghaus, Eva; Muri Boberg, Kirsten; Melum, Espen; Folseraas, Trine; Schrumpf, Erik; Bergquist, Annika; Björnsson, Einar; Fu, Jingyuan; Jan Westra, Harm; Groen, Harry J M; Fehrmann, Rudolf S N; Smolonska, Joanna; van den Berg, Leonard H; Ophoff, Roel A; Porte, Robert J; Weismüller, Tobias J; Wedemeyer, Jochen; Schramm, Christoph; Sterneck, Martina; Günther, Rainer; Braun, Felix; Vermeire, Severine; Henckaerts, Liesbet; Wijmenga, Cisca; Ponsioen, Cyriel Y; Schreiber, Stefan; Karlsen, Tom H; Franke, Andre; Weersma, Rinse K
2011-06-01
Primary sclerosing cholangitis (PSC) is a chronic cholestatic liver disease characterized by inflammation and fibrosis of the bile ducts. Both environmental and genetic factors contribute to its pathogenesis. To further clarify its genetic background, we investigated susceptibility loci recently identified for ulcerative colitis (UC) in a large cohort of 1,186 PSC patients and 1,748 controls. Single nucleotide polymorphisms (SNPs) tagging 13 UC susceptibility loci were initially genotyped in 854 PSC patients and 1,491 controls from Benelux (331 cases, 735 controls), Germany (265 cases, 368 controls), and Scandinavia (258 cases, 388 controls). Subsequently, a joint analysis was performed with an independent second Scandinavian cohort (332 cases, 257 controls). SNPs at chromosomes 2p16 (P-value 4.12 × 10(-4) ), 4q27 (P-value 4.10 × 10(-5) ), and 9q34 (P-value 8.41 × 10(-4) ) were associated with PSC in the joint analysis after correcting for multiple testing. In PSC patients without inflammatory bowel disease (IBD), SNPs at 4q27 and 9q34 were nominally associated (P < 0.05). We applied additional in silico analyses to identify likely candidate genes at PSC susceptibility loci. To identify nonrandom, evidence-based links we used GRAIL (Gene Relationships Across Implicated Loci) analysis showing interconnectivity between genes in six out of in total nine PSC-associated regions. Expression quantitative trait analysis from 1,469 Dutch and UK individuals demonstrated that five out of nine SNPs had an effect on cis-gene expression. These analyses prioritized IL2, CARD9, and REL as novel candidates. We have identified three UC susceptibility loci to be associated with PSC, harboring the putative candidate genes REL, IL2, and CARD9. These results add to the scarce knowledge on the genetic background of PSC and imply an important role for both innate and adaptive immunological factors. Copyright © 2011 American Association for the Study of Liver Diseases.
2013-01-01
Background Rapid and reliable identification of quarantine pests is essential for plant inspection services to prevent introduction of invasive species. For insects, this may be a serious problem when dealing with morphologically similar cryptic species complexes and early developmental stages that lack distinctive characters useful for taxonomic identification. DNA based barcoding could solve many of these problems. The standard barcode fragment, an approx. 650 base pairs long sequence of the 5′end of the mitochondrial cytochrome oxidase I (COI), enables differentiation of a very wide range of arthropods. However, problems remain in some taxa, such as Tephritidae, where recent genetic differentiation among some of the described species hinders accurate molecular discrimination. Results In order to explore the full species discrimination potential of COI, we sequenced the barcoding region of the COI gene of a range of economically important Tephritid species and complemented these data with all GenBank and BOLD entries for the systematic group available as of January 2012. We explored the limits of species delimitation of this barcode fragment among 193 putative Tephritid species and established operational taxonomic units (OTUs), between which discrimination is reliably possible. Furthermore, to enable future development of rapid diagnostic assays based on this sequence information, we characterized all single nucleotide polymorphisms (SNPs) and established “near-minimal” sets of SNPs that differentiate among all included OTUs with at least three and four SNPs, respectively. Conclusions We found that although several species cannot be differentiated based on the genetic diversity observed in COI and hence form composite OTUs, 85% of all OTUs correspond to described species. Because our SNP panels are developed based on all currently available sequence information and rely on a minimal pairwise difference of three SNPs, they are highly reliable and hence represent an important resource for developing taxon-specific diagnostic assays. For selected cases, possible explanations that may cause composite OTUs are discussed. PMID:23718854
Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick
2013-01-01
Background and Aims Despite differences in morphology, the genera representing ‘true citrus fruit trees’ are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial ‘species’ of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between ‘true citrus fruit trees’ were clarified. Methods Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. Key Results A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Conclusions Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis. PMID:23104641
Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick
2013-01-01
Despite differences in morphology, the genera representing 'true citrus fruit trees' are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial 'species' of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between 'true citrus fruit trees' were clarified. Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis.
Genetic factors controlling wool shedding in a composite Easycare sheep flock.
Matika, O; Bishop, S C; Pong-Wong, R; Riggio, V; Headon, D J
2013-12-01
Historically, sheep have been selectively bred for desirable traits including wool characteristics. However, recent moves towards extensive farming and reduced farm labour have seen a renewed interest in Easycare breeds. The aim of this study was to quantify the underlying genetic architecture of wool shedding in an Easycare flock. Wool shedding scores were collected from 565 pedigreed commercial Easycare sheep from 2002 to 2010. The wool scoring system was based on a 10-point (0-9) scale, with score 0 for animals retaining full fleece and 9 for those completely shedding. DNA was sampled from 200 animals of which 48 with extreme phenotypes were genotyped using a 50-k SNP chip. Three genetic analyses were performed: heritability analysis, complex segregation analysis to test for a major gene hypothesis and a genome-wide association study to map regions in the genome affecting the trait. Phenotypes were treated as a continuous or binary variable and categories. High estimates of heritability (0.80 when treated as a continuous, 0.65-0.75 as binary and 0.75 as categories) for shedding were obtained from linear mixed model analyses. Complex segregation analysis gave similar estimates (0.80 ± 0.06) to those above with additional evidence for a major gene with dominance effects. Mixed model association analyses identified four significant (P < 0.05) SNPs. Further analyses of these four SNPs in all 200 animals revealed that one of the SNPs displayed dominance effects similar to those obtained from the complex segregation analyses. In summary, we found strong genetic control for wool shedding, demonstrated the possibility of a single putative dominant gene controlling this trait and identified four SNPs that may be in partial linkage disequilibrium with gene(s) controlling shedding. © 2013 University of Edinburgh, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.
Baranski, Matthew; Gopikrishna, Gopalapillay; Robinson, Nicholas A.; Katneni, Vinaya Kumar; Shekhar, Mudagandur S.; Shanmugakarthik, Jayakani; Jothivel, Sarangapani; Gopal, Chavali; Ravichandran, Pitchaiyappan; Kent, Matthew; Arnyasi, Mariann; Ponniah, Alphis G.
2014-01-01
Transcriptome sequencing using Illumina RNA-seq was performed on populations of black tiger shrimp from India. Samples were collected from (i) four landing centres around the east coastline (EC) of India, (ii) survivors of a severe WSSV infection during pond culture (SUR) and (iii) the Andaman Islands (AI) in the Bay of Bengal. Equal quantities of purified total RNA from homogenates of hepatopancreas, muscle, nervous tissue, intestinal tract, heart, gonad, gills, pleopod and lymphoid organs were combined to create AI, EC and SUR pools for RNA sequencing. De novo transcriptome assembly resulted in 136,223 contigs (minimum size 100 base pairs, bp) with a total length 61 Mb, an average length of 446 bp and an average coverage of 163× across all pools. Approximately 16% of contigs were annotated with BLAST hit information and gene ontology annotations. A total of 473,620 putative SNPs/indels were identified. An Illumina iSelect genotyping array containing 6,000 SNPs was developed and used to genotype 1024 offspring belonging to seven full-sibling families. A total of 3959 SNPs were mapped to 44 linkage groups. The linkage groups consisted of between 16–129 and 13–130 markers, of length between 139–10.8 and 109.1–10.5 cM and with intervals averaging between 1.2 and 0.9 cM for the female and male maps respectively. The female map was 28% longer than the male map (4060 and 2917 cM respectively) with a 1.6 higher recombination rate observed for female compared to male meioses. This approach has substantially increased expressed sequence and DNA marker resources for tiger shrimp and is a useful resource for QTL mapping and association studies for evolutionarily and commercially important traits. PMID:24465553
Siqueira, Erika Rabelo Forte de; Pereira, Luciano Beltrao; Stefano, Jose Tadeu; Patente, Thiago; Cavaleiro, Ana Mercedes; Silva Vasconcelos, Luydson Richardson; Carmo, Rodrigo Feliciano; Moreira Beltrao Pereira, Leila Maria; Carrilho, Flair Jose; Corrêa-Giannella, Maria Lucia; Oliveira, Claudia P
2015-03-28
Given the important contribution of the nicotinamide adenine dinucleotide phosphate (NADPH) oxidase system to the generation of reactive oxygen species induced by hepatitis C virus (HCV), we investigated two single nucleotide polymorphisms (SNPs) in the putative regulatory region of the genes encoding NADPH oxidase 4 catalytic subunit (NOX4) and its regulatory subunit p22phox (CYBA) and their relation with metabolic and histological variables in patients with HCV. One hundred seventy eight naïve HCV patients (49.3% male; 65% HCV genotype 1) with positive HCV RNA were genotyped using specific primers and fluorescent-labeled probes for SNPs rs3017887 in NOX4 and -675 T → A in CYBA. No association was found between the genotype frequencies of NOX4 and CYBA SNPs and inflammation scores or fibrosis stages in the overall population. The presence of the CA + AA genotypes of the NOX4 SNP was nominally associated with a lower alanine aminotransferase (ALT) concentration in the male population (CA + AA = 72.23 ± 6.34 U/L versus CC = 100.22 ± 9.85; mean ± SEM; P = 0.05). The TT genotype of the CYBA SNP was also nominally associated with a lower ALT concentration in the male population (TT = 84.01 ± 6.77 U/L versus TA + AA = 109.67 ± 18.37 U/L; mean ± SEM; P = 0.047). The minor A-allele of the NOX4 SNP was inversely associated with the frequency of metabolic syndrome (MS) in the male population (odds ratio (OR): 0.15; 95% confidence interval (CI): 0.03 to 0.79; P = 0.025). The results suggest that the evaluated NOX4 and CYBA SNPs are not direct genetic determinants of fibrosis in HCV patients, but nevertheless NOX4 rs3017887 SNP could indirectly influence fibrosis susceptibility due to its inverse association with MS in male patients.
Fine-Scale Variation and Genetic Determinants of Alternative Splicing across Individuals
Coulombe-Huntington, Jasmin; Lam, Kevin C. L.; Dias, Christel; Majewski, Jacek
2009-01-01
Recently, thanks to the increasing throughput of new technologies, we have begun to explore the full extent of alternative pre–mRNA splicing (AS) in the human transcriptome. This is unveiling a vast layer of complexity in isoform-level expression differences between individuals. We used previously published splicing sensitive microarray data from lymphoblastoid cell lines to conduct an in-depth analysis on splicing efficiency of known and predicted exons. By combining publicly available AS annotation with a novel algorithm designed to search for AS, we show that many real AS events can be detected within the usually unexploited, speculative majority of the array and at significance levels much below standard multiple-testing thresholds, demonstrating that the extent of cis-regulated differential splicing between individuals is potentially far greater than previously reported. Specifically, many genes show subtle but significant genetically controlled differences in splice-site usage. PCR validation shows that 42 out of 58 (72%) candidate gene regions undergo detectable AS, amounting to the largest scale validation of isoform eQTLs to date. Targeted sequencing revealed a likely causative SNP in most validated cases. In all 17 incidences where a SNP affected a splice-site region, in silico splice-site strength modeling correctly predicted the direction of the micro-array and PCR results. In 13 other cases, we identified likely causative SNPs disrupting predicted splicing enhancers. Using Fst and REHH analysis, we uncovered significant evidence that 2 putative causative SNPs have undergone recent positive selection. We verified the effect of five SNPs using in vivo minigene assays. This study shows that splicing differences between individuals, including quantitative differences in isoform ratios, are frequent in human populations and that causative SNPs can be identified using in silico predictions. Several cases affected disease-relevant genes and it is likely some of these differences are involved in phenotypic diversity and susceptibility to complex diseases. PMID:20011102
Molecular mechanisms underlying variations in lung function: a systems genetics analysis
Obeidat, Ma’en; Hao, Ke; Bossé, Yohan; Nickle, David C; Nie, Yunlong; Postma, Dirkje S; Laviolette, Michel; Sandford, Andrew J; Daley, Denise D; Hogg, James C; Elliott, W Mark; Fishbane, Nick; Timens, Wim; Hysi, Pirro G; Kaprio, Jaakko; Wilson, James F; Hui, Jennie; Rawal, Rajesh; Schulz, Holger; Stubbe, Beate; Hayward, Caroline; Polasek, Ozren; Järvelin, Marjo-Riitta; Zhao, Jing Hua; Jarvis, Deborah; Kähönen, Mika; Franceschini, Nora; North, Kari E; Loth, Daan W; Brusselle, Guy G; Smith, Albert Vernon; Gudnason, Vilmundur; Bartz, Traci M; Wilk, Jemma B; O’Connor, George T; Cassano, Patricia A; Tang, Wenbo; Wain, Louise V; Artigas, María Soler; Gharib, Sina A; Strachan, David P; Sin, Don D; Tobin, Martin D; London, Stephanie J; Hall, Ian P; Paré, Peter D
2016-01-01
Summary Background Lung function measures reflect the physiological state of the lung, and are essential to the diagnosis of chronic obstructive pulmonary disease (COPD). The SpiroMeta-CHARGE consortium undertook the largest genome-wide association study (GWAS) so far (n=48 201) for forced expiratory volume in 1 s (FEV1) and the ratio of FEV1 to forced vital capacity (FEV1/FVC) in the general population. The lung expression quantitative trait loci (eQTLs) study mapped the genetic architecture of gene expression in lung tissue from 1111 individuals. We used a systems genetics approach to identify single nucleotide polymorphisms (SNPs) associated with lung function that act as eQTLs and change the level of expression of their target genes in lung tissue; termed eSNPs. Methods The SpiroMeta-CHARGE GWAS results were integrated with lung eQTLs to map eSNPs and the genes and pathways underlying the associations in lung tissue. For comparison, a similar analysis was done in peripheral blood. The lung mRNA expression levels of the eSNP-regulated genes were tested for associations with lung function measures in 727 individuals. Additional analyses identified the pleiotropic effects of eSNPs from the published GWAS catalogue, and mapped enrichment in regulatory regions from the ENCODE project. Finally, the Connectivity Map database was used to identify potential therapeutics in silico that could reverse the COPD lung tissue gene signature. Findings SNPs associated with lung function measures were more likely to be eQTLs and vice versa. The integration mapped the specific genes underlying the GWAS signals in lung tissue. The eSNP-regulated genes were enriched for developmental and inflammatory pathways; by comparison, SNPs associated with lung function that were eQTLs in blood, but not in lung, were only involved in inflammatory pathways. Lung function eSNPs were enriched for regulatory elements and were over-represented among genes showing differential expression during fetal lung development. An mRNA gene expression signature for COPD was identified in lung tissue and compared with the Connectivity Map. This in-silico drug repurposing approach suggested several compounds that reverse the COPD gene expression signature, including a nicotine receptor antagonist. These findings represent novel therapeutic pathways for COPD. Interpretation The system genetics approach identified lung tissue genes driving the variation in lung function and susceptibility to COPD. The identification of these genes and the pathways in which they are enriched is essential to understand the pathophysiology of airway obstruction and to identify novel therapeutic targets and biomarkers for COPD, including drugs that reverse the COPD gene signature in silico. Funding The research reported in this article was not specifically funded by any agency. See Acknowledgments for a full list of funders of the lung eQTL study and the Spiro-Meta CHARGE GWAS. PMID:26404118
Cowper-Sal·lari, Richard; Zhang, Xiaoyang; Wright, Jason B.; Bailey, Swneke D.; Cole, Michael D.; Eeckhoute, Jerome; Moore, Jason H.; Lupien, Mathieu
2012-01-01
Genome-wide association studies (GWASs) have identified thousands of single nucleotide polymorphisms (SNPs) associated with human traits and diseases. But because the vast majority of these SNPs are located in the noncoding regions of the genome their risk promoting mechanisms are elusive. Employing a new methodology combining cistromics, epigenomics and genotype imputation we annotate the noncoding regions of the genome in breast cancer cells and systematically identify the functional nature of SNPs associated with breast cancer risk. Our results demonstrate that breast cancer risk-associated SNPs are enriched in the cistromes of FOXA1 and ESR1 and the epigenome of H3K4me1 in a cancer and cell-type-specific manner. Furthermore, the majority of these risk-associated SNPs modulate the affinity of chromatin for FOXA1 at distal regulatory elements, which results in allele-specific gene expression, exemplified by the effect of the rs4784227 SNP on the TOX3 gene found within the 16q12.1 risk locus. PMID:23001124
Lourenco-Jaramillo, Diana Lelidett; Sifuentes-Rincón, Ana María; Parra-Bracamonte, Gaspar Manuel; de la Rosa-Reyna, Xochitl Fabiola; Segura-Cabrera, Aldo; Arellano-Vera, Williams
2012-01-01
DNA from four cattle breeds was used to re-sequence all of the exons and 56% of the introns of the bovine tyrosine hydroxylase (TH) gene and 97% and 13% of the bovine dopamine β-hydroxylase (DBH) coding and non-coding sequences, respectively. Two novel single nucleotide polymorphisms (SNPs) and a microsatellite motif were found in the TH sequences. The DBH sequences contained 62 nucleotide changes, including eight non-synonymous SNPs (nsSNPs) that are of particular interest because they may alter protein function and therefore affect the phenotype. These DBH nsSNPs resulted in amino acid substitutions that were predicted to destabilize the protein structure. Six SNPs (one from TH and five from DBH non-synonymous SNPs) were genotyped in 140 animals; all of them were polymorphic and had a minor allele frequency of > 9%. There were significant differences in the intra- and inter-population haplotype distributions. The haplotype differences between Brahman cattle and the three B. t. taurus breeds (Charolais, Holstein and Lidia) were interesting from a behavioural point of view because of the differences in temperament between these breeds. PMID:22888292
2011-01-01
Background Stenospermocarpy is a mechanism through which certain genotypes of Vitis vinifera L. such as Sultanina produce berries with seeds reduced in size. Stenospermocarpy has not yet been characterized at the molecular level. Results Genetic and physical maps were integrated with the public genomic sequence of Vitis vinifera L. to improve QTL analysis for seedlessness and berry size in experimental progeny derived from a cross of two seedless genotypes. Major QTLs co-positioning for both traits on chromosome 18 defined a 92-kb confidence interval. Functional information from model species including Vitis suggested that VvAGL11, included in this confidence interval, might be the main positional candidate gene responsible for seed and berry development. Characterization of VvAGL11 at the sequence level in the experimental progeny identified several SNPs and INDELs in both regulatory and coding regions. In association analyses performed over three seasons, these SNPs and INDELs explained up to 78% and 44% of the phenotypic variation in seed and berry weight, respectively. Moreover, genetic experiments indicated that the regulatory region has a larger effect on the phenotype than the coding region. Transcriptional analysis lent additional support to the putative role of VvAGL11's regulatory region, as its expression is abolished in seedless genotypes at key stages of seed development. These results transform VvAGL11 into a functional candidate gene for further analyses based on genetic transformation. For breeding purposes, intragenic markers were tested individually for marker assisted selection, and the best markers were those closest to the transcription start site. Conclusion We propose that VvAGL11 is the major functional candidate gene for seedlessness, and we provide experimental evidence suggesting that the seedless phenotype might be caused by variations in its promoter region. Current knowledge of the function of its orthologous genes, its expression profile in Vitis varieties and the strong association between its sequence variation and the degree of seedlessness together indicate that the D-lineage MADS-box gene VvAGL11 corresponds to the Seed Development Inhibitor locus described earlier as a major locus for seedlessness. These results provide new hypotheses for further investigations of the molecular mechanisms involved in seed and berry development. PMID:21447172
Generation and characterization of the sea bass Dicentrarchus labrax brain and liver transcriptomes.
Magnanou, Elodie; Klopp, Christophe; Noirot, Celine; Besseau, Laurence; Falcón, Jack
2014-07-01
The sea bass Dicentrarchus labrax is the center of interest of an increasing number of basic or applied research investigations, even though few genomic or transcriptomic data is available. Current public data only represent a very partial view of its transcriptome. To fill this need, we characterized brain and liver transcriptomes in a generalist manner that would benefit the entire scientific community. We also tackled some bioinformatics questions, related to the effect of RNA fragment size on the assembly quality. Using Illumina RNA-seq, we sequenced organ pools from both wild and farmed Atlantic and Mediterranean fishes. We built two distinct cDNA libraries per organ that only differed by the length of the selected mRNA fragments. Efficiency of assemblies performed on either or both fragments size differed depending on the organ, but remained very close reflecting the quality of the technical replication. We generated more than 19,538Mbp of data. Over 193million reads were assembled into 35,073 contigs (average length=2374bp; N50=3257). 59% contigs were annotated with SwissProt, which corresponded to 12,517 unique genes. We compared the Gene Ontology (GO) contig distribution between the sea bass and the tilapia. We also looked for brain and liver GO specific signatures as well as KEGG pathway coverage. 23,050 putative micro-satellites and 134,890 putative SNPs were identified. Our sampling strategy and assembly pipeline provided a reliable and broad reference transcriptome for the sea bass. It constitutes an indisputable quantitative and qualitative improvement of the public data, as it provides 5 times more base pairs with fewer and longer contigs. Both organs present unique signatures consistent with their specific physiological functions. The discrepancy in fragment size effect on assembly quality between organs lies in their difference in complexity and thus does not allow prescribing any general strategy. This information on two key organs will facilitate further functional approaches. Copyright © 2014 Elsevier B.V. All rights reserved.
Osato, Naoki
2018-01-19
Transcriptional target genes show functional enrichment of genes. However, how many and how significantly transcriptional target genes include functional enrichments are still unclear. To address these issues, I predicted human transcriptional target genes using open chromatin regions, ChIP-seq data and DNA binding sequences of transcription factors in databases, and examined functional enrichment and gene expression level of putative transcriptional target genes. Gene Ontology annotations showed four times larger numbers of functional enrichments in putative transcriptional target genes than gene expression information alone, independent of transcriptional target genes. To compare the number of functional enrichments of putative transcriptional target genes between cells or search conditions, I normalized the number of functional enrichment by calculating its ratios in the total number of transcriptional target genes. With this analysis, native putative transcriptional target genes showed the largest normalized number of functional enrichments, compared with target genes including 5-60% of randomly selected genes. The normalized number of functional enrichments was changed according to the criteria of enhancer-promoter interactions such as distance from transcriptional start sites and orientation of CTCF-binding sites. Forward-reverse orientation of CTCF-binding sites showed significantly higher normalized number of functional enrichments than the other orientations. Journal papers showed that the top five frequent functional enrichments were related to the cellular functions in the three cell types. The median expression level of transcriptional target genes changed according to the criteria of enhancer-promoter assignments (i.e. interactions) and was correlated with the changes of the normalized number of functional enrichments of transcriptional target genes. Human putative transcriptional target genes showed significant functional enrichments. Functional enrichments were related to the cellular functions. The normalized number of functional enrichments of human putative transcriptional target genes changed according to the criteria of enhancer-promoter assignments and correlated with the median expression level of the target genes. These analyses and characters of human putative transcriptional target genes would be useful to examine the criteria of enhancer-promoter assignments and to predict the novel mechanisms and factors such as DNA binding proteins and DNA sequences of enhancer-promoter interactions.
Keel, B N; Nonneman, D J; Rohrer, G A
2017-08-01
Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a more significant effect on phenotypic variation than do other types of genetic variants. Hence, a comprehensive list of these functional variants would be of considerable interest in swine genomic studies, particularly those targeting fertility and production traits. Whole-genome sequence was obtained from 72 of the founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC). These animals included all 24 of the founding boars (12 Duroc and 12 Landrace) and 48 Yorkshire-Landrace composite sows. Sequence reads were mapped to the Sscrofa10.2 genome build, resulting in a mean of 6.1 fold (×) coverage per genome. A total of 22 342 915 high confidence SNPs were identified from the sequenced genomes. These included 21 million previously reported SNPs and 79% of the 62 163 SNPs on the PorcineSNP60 BeadChip assay. Variation was detected in the coding sequence or untranslated regions (UTRs) of 87.8% of the genes in the porcine genome: loss-of-function variants were predicted in 504 genes, 10 202 genes contained nonsynonymous variants, 10 773 had variation in UTRs and 13 010 genes contained synonymous variants. Approximately 139 000 SNPs were classified as loss-of-function, nonsynonymous or regulatory, which suggests that over 99% of the variation detected in our pigs could potentially be ignored, allowing us to focus on a much smaller number of functional SNPs during future analyses. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
Powell, John H; Amish, Stephen J; Haynes, Gwilym D; Luikart, Gordon; Latch, Emily K
2016-09-01
Mule deer (Odocoileus hemionus) are an excellent nonmodel species for empirically testing hypotheses in landscape and population genomics due to their large population sizes (low genetic drift), relatively continuous distribution, diversity of occupied habitats and phenotypic variation. Because few genomic resources are currently available for this species, we used exon data from a cattle (Bos taurus) reference genome to direct targeted resequencing of 5935 genes in mule deer. We sequenced approximately 3.75 Mbp at minimum 20X coverage in each of the seven mule deer, identifying 23 204 single nucleotide polymorphisms (SNPs) within, or adjacent to, 6886 exons in 3559 genes. We found 91 SNP loci (from 69 genes) with putatively fixed allele frequency differences between the two major lineages of mule deer (mule deer and black-tailed deer), and our estimate of mean genetic divergence (genome-wide FST = 0.123) between these lineages was consistent with previous findings using microsatellite loci. We detected an over-representation of gamete generation and amino acid transport genes among the genes with SNPs exhibiting potentially fixed allele frequency differences between lineages. This targeted resequencing approach using exon capture techniques has identified a suite of loci that can be used in future research to investigate the genomic basis of adaptation and differentiation between black-tailed deer and mule deer. This study also highlights techniques (and an exon capture array) that will facilitate population genomic research in other cervids and nonmodel organisms. © 2016 John Wiley & Sons Ltd.
Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira
2014-01-01
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection.
Jordan, Rebecca; Hoffmann, Ary A; Dillon, Shannon K; Prober, Suzanne M
2017-11-01
Understanding whether populations can adapt in situ or whether interventions are required is of key importance for biodiversity management under climate change. Landscape genomics is becoming an increasingly important and powerful tool for rapid assessments of climate adaptation, especially in long-lived species such as trees. We investigated climate adaptation in Eucalyptus microcarpa using the DArTseq genomic approach. A combination of F ST outlier and environmental association analyses were performed using >4200 genomewide single nucleotide polymorphisms (SNPs) from 26 populations spanning climate gradients in southeastern Australia. Eighty-one SNPs were identified as putatively adaptive, based on significance in F ST outlier tests and significant associations with one or more climate variables related to temperature (70/81), aridity (37/81) or precipitation (35/81). Adaptive SNPs were located on all 11 chromosomes, with no particular region associated with individual climate variables. Climate adaptation appeared to be characterized by subtle shifts in allele frequencies, with no consistent fixed differences identified. Based on these associations, we predict adaptation under projected changes in climate will include a suite of shifts in allele frequencies. Whether this can occur sufficiently rapidly through natural selection within populations, or would benefit from assisted gene migration, requires further evaluation. In some populations, the absence or predicted increases to near fixation of particular adaptive alleles hint at potential limits to adaptive capacity. Together, these results reinforce the importance of standing genetic variation at the geographic level for maintaining species' evolutionary potential. © 2017 John Wiley & Sons Ltd.
Wild, Philipp S.; Felix, Janine F.; Schillert, Arne; Chen, Ming-Huei; Leening, Maarten J.G.; Völker, Uwe; Großmann, Vera; Brody, Jennifer A.; Irvin, Marguerite R.; Shah, Sanjiv J.; Pramana, Setia; Lieb, Wolfgang; Schmidt, Reinhold; Stanton, Alice V.; Malzahn, Dörthe; Lyytikäinen, Leo-Pekka; Tiller, Daniel; Smith, J. Gustav; Di Tullio, Marco R.; Musani, Solomon K.; Morrison, Alanna C.; Pers, Tune H.; Morley, Michael; Kleber, Marcus E.; Aragam, Jayashri; Bis, Joshua C.; Bisping, Egbert; Broeckel, Ulrich; Cheng, Susan; Deckers, Jaap W.; Del Greco M, Fabiola; Edelmann, Frank; Fornage, Myriam; Franke, Lude; Friedrich, Nele; Harris, Tamara B.; Hofer, Edith; Hofman, Albert; Huang, Jie; Hughes, Alun D.; Kähönen, Mika; investigators, KNHI; Kruppa, Jochen; Lackner, Karl J.; Lannfelt, Lars; Laskowski, Rafael; Launer, Lenore J.; Lindgren, Cecilia M.; Loley, Christina; Mayet, Jamil; Medenwald, Daniel; Morris, Andrew P.; Müller, Christian; Müller-Nurasyid, Martina; Nappo, Stefania; Nilsson, Peter M.; Nuding, Sebastian; Nutile, Teresa; Peters, Annette; Pfeufer, Arne; Pietzner, Diana; Pramstaller, Peter P.; Raitakari, Olli T.; Rice, Kenneth M.; Rotter, Jerome I.; Ruohonen, Saku T.; Sacco, Ralph L.; Samdarshi, Tandaw E.; Sharp, Andrew S.P.; Shields, Denis C.; Sorice, Rossella; Sotoodehnia, Nona; Stricker, Bruno H.; Surendran, Praveen; Töglhofer, Anna M.; Uitterlinden, André G.; Völzke, Henry; Ziegler, Andreas; Münzel, Thomas; März, Winfried; Cappola, Thomas P.; Hirschhorn, Joel N.; Mitchell, Gary F.; Smith, Nicholas L.; Fox, Ervin R.; Dueker, Nicole D.; Jaddoe, Vincent W.V.; Melander, Olle; Lehtimäki, Terho; Ciullo, Marina; Hicks, Andrew A.; Lind, Lars; Gudnason, Vilmundur; Pieske, Burkert; Barron, Anthony J.; Zweiker, Robert; Schunkert, Heribert; Ingelsson, Erik; Liu, Kiang; Arnett, Donna K.; Psaty, Bruce M.; Blankenberg, Stefan; Larson, Martin G.; Felix, Stephan B.; Franco, Oscar H.; Zeller, Tanja; Vasan, Ramachandran S.; Dörr, Marcus
2017-01-01
BACKGROUND. Understanding the genetic architecture of cardiac structure and function may help to prevent and treat heart disease. This investigation sought to identify common genetic variations associated with inter-individual variability in cardiac structure and function. METHODS. A GWAS meta-analysis of echocardiographic traits was performed, including 46,533 individuals from 30 studies (EchoGen consortium). The analysis included 16 traits of left ventricular (LV) structure, and systolic and diastolic function. RESULTS. The discovery analysis included 21 cohorts for structural and systolic function traits (n = 32,212) and 17 cohorts for diastolic function traits (n = 21,852). Replication was performed in 5 cohorts (n = 14,321) and 6 cohorts (n = 16,308), respectively. Besides 5 previously reported loci, the combined meta-analysis identified 10 additional genome-wide significant SNPs: rs12541595 near MTSS1 and rs10774625 in ATXN2 for LV end-diastolic internal dimension; rs806322 near KCNRG, rs4765663 in CACNA1C, rs6702619 near PALMD, rs7127129 in TMEM16A, rs11207426 near FGGY, rs17608766 in GOSR2, and rs17696696 in CFDP1 for aortic root diameter; and rs12440869 in IQCH for Doppler transmitral A-wave peak velocity. Findings were in part validated in other cohorts and in GWAS of related disease traits. The genetic loci showed associations with putative signaling pathways, and with gene expression in whole blood, monocytes, and myocardial tissue. CONCLUSION. The additional genetic loci identified in this large meta-analysis of cardiac structure and function provide insights into the underlying genetic architecture of cardiac structure and warrant follow-up in future functional studies. FUNDING. For detailed information per study, see Acknowledgments. PMID:28394258
Sheth, Harsh; Northwood, Emma; Ulrich, Cornelia M; Scherer, Dominique; Elliott, Faye; Barrett, Jennifer H; Forman, David; Wolf, C Roland; Smith, Gillian; Jackson, Michael S; Santibanez-Koref, Mauro; Haile, Robert; Casey, Graham; Jenkins, Mark; Win, Aung Ko; Hopper, John L; Marchand, Loic Le; Lindor, Noralane M; Thibodeau, Stephen N; Potter, John D; Burn, John; Bishop, D Timothy
2018-01-01
Regular aspirin use is associated with reduced risk of colorectal cancer (CRC). Variation in aspirin's chemoprevention efficacy has been attributed to the presence of single nucleotide polymorphisms (SNPs). We conducted a meta-analysis using two large population-based case-control datasets, the UK-Leeds Colorectal Cancer Study Group and the NIH-Colon Cancer Family Registry, having a combined total of 3325 cases and 2262 controls. The aim was to assess 42 candidate SNPs in 15 genes whose association with colorectal cancer risk was putatively modified by aspirin use, in the literature. Log odds ratios (ORs) and standard errors were estimated for each dataset separately using logistic regression adjusting for age, sex and study site, and dataset-specific results were combined using random effects meta-analysis. Meta-analysis showed association between SNPs rs6983267, rs11694911 and rs2302615 with CRC risk reduction (All P<0.05). Association for SNP rs6983267 in the CCAT2 gene only was noteworthy after multiple test correction (P = 0.001). Site-specific analysis showed association between SNPs rs1799853 and rs2302615 with reduced colon cancer risk only (P = 0.01 and P = 0.004, respectively), however neither reached significance threshold following multiple test correction. Meta-analysis of SNPs rs2070959 and rs1105879 in UGT1A6 gene showed interaction between aspirin use and CRC risk (Pinteraction = 0.01 and 0.02, respectively); stratification by aspirin use showed an association for decreased CRC risk for aspirin users having a wild-type genotype (rs2070959 OR = 0.77, 95% CI = 0.68-0.86; rs1105879 OR = 0.77 95% CI = 0.69-0.86) compared to variant allele cariers. The direction of the interaction however is in contrast to that published in studies on colorectal adenomas. Both SNPs showed potential site-specific interaction with aspirin use and colon cancer risk only (Pinteraction = 0.006 and 0.008, respectively), with the direction of association similar to that observed for CRC. Additionally, they showed interaction between any non-steroidal anti-inflammatory drugs (including aspirin) use and CRC risk (Pinteraction = 0.01 for both). All gene x environment (GxE) interactions however were not significant after multiple test correction. Candidate gene investigation indicated no evidence of GxE interaction between genetic variants in genes involved in aspirin pathways, regular aspirin use and colorectal cancer risk.
Ulrich, Cornelia M.; Scherer, Dominique; Elliott, Faye; Barrett, Jennifer H.; Forman, David; Wolf, C. Roland; Smith, Gillian; Jackson, Michael S.; Santibanez-Koref, Mauro; Haile, Robert; Casey, Graham; Jenkins, Mark; Win, Aung Ko; Hopper, John L.; Marchand, Loic Le; Lindor, Noralane M.; Thibodeau, Stephen N.; Potter, John D.; Burn, John; Bishop, D. Timothy
2018-01-01
Regular aspirin use is associated with reduced risk of colorectal cancer (CRC). Variation in aspirin’s chemoprevention efficacy has been attributed to the presence of single nucleotide polymorphisms (SNPs). We conducted a meta-analysis using two large population-based case-control datasets, the UK-Leeds Colorectal Cancer Study Group and the NIH-Colon Cancer Family Registry, having a combined total of 3325 cases and 2262 controls. The aim was to assess 42 candidate SNPs in 15 genes whose association with colorectal cancer risk was putatively modified by aspirin use, in the literature. Log odds ratios (ORs) and standard errors were estimated for each dataset separately using logistic regression adjusting for age, sex and study site, and dataset-specific results were combined using random effects meta-analysis. Meta-analysis showed association between SNPs rs6983267, rs11694911 and rs2302615 with CRC risk reduction (All P<0.05). Association for SNP rs6983267 in the CCAT2 gene only was noteworthy after multiple test correction (P = 0.001). Site-specific analysis showed association between SNPs rs1799853 and rs2302615 with reduced colon cancer risk only (P = 0.01 and P = 0.004, respectively), however neither reached significance threshold following multiple test correction. Meta-analysis of SNPs rs2070959 and rs1105879 in UGT1A6 gene showed interaction between aspirin use and CRC risk (Pinteraction = 0.01 and 0.02, respectively); stratification by aspirin use showed an association for decreased CRC risk for aspirin users having a wild-type genotype (rs2070959 OR = 0.77, 95% CI = 0.68–0.86; rs1105879 OR = 0.77 95% CI = 0.69–0.86) compared to variant allele cariers. The direction of the interaction however is in contrast to that published in studies on colorectal adenomas. Both SNPs showed potential site-specific interaction with aspirin use and colon cancer risk only (Pinteraction = 0.006 and 0.008, respectively), with the direction of association similar to that observed for CRC. Additionally, they showed interaction between any non-steroidal anti-inflammatory drugs (including aspirin) use and CRC risk (Pinteraction = 0.01 for both). All gene x environment (GxE) interactions however were not significant after multiple test correction. Candidate gene investigation indicated no evidence of GxE interaction between genetic variants in genes involved in aspirin pathways, regular aspirin use and colorectal cancer risk. PMID:29425227
Cole, Shelley A; Voruganti, V Saroja; Cai, Guowen; Haack, Karin; Kent, Jack W; Blangero, John; Comuzzie, Anthony G; McPherson, John D; Gibbs, Richard A
2010-01-01
Background: Melanocortin-4-receptor (MC4R) haploinsufficiency is the most common form of monogenic obesity; however, the frequency of MC4R variants and their functional effects in general populations remain uncertain. Objective: The aim was to identify and characterize the effects of MC4R variants in Hispanic children. Design: MC4R was resequenced in 376 parents, and the identified single nucleotide polymorphisms (SNPs) were genotyped in 613 parents and 1016 children from the Viva la Familia cohort. Measured genotype analysis (MGA) tested associations between SNPs and phenotypes. Bayesian quantitative trait nucleotide (BQTN) analysis was used to infer the most likely functional polymorphisms influencing obesity-related traits. Results: Seven rare SNPs in coding and 18 SNPs in flanking regions of MC4R were identified. MGA showed suggestive associations between MC4R variants and body size, adiposity, glucose, insulin, leptin, ghrelin, energy expenditure, physical activity, and food intake. BQTN analysis identified SNP 1704 in a predicted micro-RNA target sequence in the downstream flanking region of MC4R as a strong, probable functional variant influencing total, sedentary, and moderate activities with posterior probabilities of 1.0. SNP 2132 was identified as a variant with a high probability (1.0) of exerting a functional effect on total energy expenditure and sleeping metabolic rate. SNP rs34114122 was selected as having likely functional effects on the appetite hormone ghrelin, with a posterior probability of 0.81. Conclusion: This comprehensive investigation provides strong evidence that MC4R genetic variants are likely to play a functional role in the regulation of weight, not only through energy intake but through energy expenditure. PMID:19889825
Abnet, Christian C.; Wang, Zhaoming; Song, Xin; Hu, Nan; Zhou, Fu-You; Freedman, Neal D.; Li, Xue-Min; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Dawsey, Sanford M.; Liao, Linda M.; Lee, Maxwell P.; Ding, Ti; Qiao, You-Lin; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Chung, Charles C.; Wang, Chaoyu; Wheeler, William; Yeager, Meredith; Yuenger, Jeff; Hutchinson, Amy; Jacobs, Kevin B.; Giffen, Carol A.; Burdett, Laurie; Fraumeni, Joseph F.; Tucker, Margaret A.; Chow, Wong-Ho; Zhao, Xue-Ke; Li, Jiang-Man; Li, Ai-Li; Sun, Liang-Dan; Wei, Wu; Li, Ji-Lin; Zhang, Peng; Li, Hong-Lei; Cui, Wen-Yan; Wang, Wei-Peng; Liu, Zhi-Cai; Yang, Xia; Fu, Wen-Jing; Cui, Ji-Li; Lin, Hong-Li; Zhu, Wen-Liang; Liu, Min; Chen, Xi; Chen, Jie; Guo, Li; Han, Jing-Jing; Zhou, Sheng-Li; Huang, Jia; Wu, Yue; Yuan, Chao; Huang, Jing; Ji, Ai-Fang; Kul, Jian-Wei; Fan, Zhong-Min; Wang, Jian-Po; Zhang, Dong-Yun; Zhang, Lian-Qun; Zhang, Wei; Chen, Yuan-Fang; Ren, Jing-Li; Li, Xiu-Min; Dong, Jin-Cheng; Xing, Guo-Lan; Guo, Zhi-Gang; Yang, Jian-Xue; Mao, Yi-Ming; Yuan, Yuan; Guo, Er-Tao; Zhang, Wei; Hou, Zhi-Chao; Liu, Jing; Li, Yan; Tang, Sa; Chang, Jia; Peng, Xiu-Qin; Han, Min; Yin, Wan-Li; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Yang, Liu-Qin; Zhu, Fu-Guo; Yang, Xiu-Feng; Feng, Xiao-Shan; Wang, Zhou; Li, Yin; Gao, She-Gan; Liu, Hai-Lin; Yuan, Ling; Jin, Yan; Zhang, Yan-Rui; Sheyhidin, Ilyar; Li, Feng; Chen, Bao-Ping; Ren, Shu-Wei; Liu, Bin; Li, Dan; Zhang, Gao-Fu; Yue, Wen-Bin; Feng, Chang-Wei; Qige, Qirenwang; Zhao, Jian-Ting; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Xu, Li-Yan; Wu, Zhi-Yong; Bao, Zhi-Qin; Chen, Ji-Li; Li, Xian-Chang; Zhuang, Xiang; Zhou, Ying-Fa; Zuo, Xian-Bo; Dong, Zi-Ming; Wang, Lu-Wen; Fan, Xue-Pin; Wang, Jin; Zhou, Qi; Ma, Guo-Shun; Zhang, Qin-Xian; Liu, Hai; Jian, Xin-Ying; Lian, Sin-Yong; Wang, Jin-Sheng; Chang, Fu-Bao; Lu, Chang-Dong; Miao, Jian-Jun; Chen, Zhi-Guo; Wang, Ran; Guo, Ming; Fan, Zeng-Lin; Tao, Ping; Liu, Tai-Jing; Wei, Jin-Chang; Kong, Qing-Peng; Fan, Lei; Wang, Xian-Zeng; Gao, Fu-Sheng; Wang, Tian-Yun; Xie, Dong; Wang, Li; Chen, Shu-Qing; Yang, Wan-Cai; Hong, Jun-Yan; Wang, Liang; Qiu, Song-Liang; Goldstein, Alisa M.; Yuan, Zhi-Qing; Chanock, Stephen J.; Zhang, Xue-Jun; Taylor, Philip R.; Wang, Li-Dong
2012-01-01
Genome-wide association studies have identified susceptibility loci for esophageal squamous cell carcinoma (ESCC). We conducted a meta-analysis of all single-nucleotide polymorphisms (SNPs) that showed nominally significant P-values in two previously published genome-wide scans that included a total of 2961 ESCC cases and 3400 controls. The meta-analysis revealed five SNPs at 2q33 with P< 5 × 10−8, and the strongest signal was rs13016963, with a combined odds ratio (95% confidence interval) of 1.29 (1.19–1.40) and P= 7.63 × 10−10. An imputation analysis of 4304 SNPs at 2q33 suggested a single association signal, and the strongest imputed SNP associations were similar to those from the genotyped SNPs. We conducted an ancestral recombination graph analysis with 53 SNPs to identify one or more haplotypes that harbor the variants directly responsible for the detected association signal. This showed that the five SNPs exist in a single haplotype along with 45 imputed SNPs in strong linkage disequilibrium, and the strongest candidate was rs10201587, one of the genotyped SNPs. Our meta-analysis found genome-wide significant SNPs at 2q33 that map to the CASP8/ALS2CR12/TRAK2 gene region. Variants in CASP8 have been extensively studied across a spectrum of cancers with mixed results. The locus we identified appears to be distinct from the widely studied rs3834129 and rs1045485 SNPs in CASP8. Future studies of esophageal and other cancers should focus on comprehensive sequencing of this 2q33 locus and functional analysis of rs13016963 and rs10201587 and other strongly correlated variants. PMID:22323360
Abnet, Christian C; Wang, Zhaoming; Song, Xin; Hu, Nan; Zhou, Fu-You; Freedman, Neal D; Li, Xue-Min; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Dawsey, Sanford M; Liao, Linda M; Lee, Maxwell P; Ding, Ti; Qiao, You-Lin; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Chung, Charles C; Wang, Chaoyu; Wheeler, William; Yeager, Meredith; Yuenger, Jeff; Hutchinson, Amy; Jacobs, Kevin B; Giffen, Carol A; Burdett, Laurie; Fraumeni, Joseph F; Tucker, Margaret A; Chow, Wong-Ho; Zhao, Xue-Ke; Li, Jiang-Man; Li, Ai-Li; Sun, Liang-Dan; Wei, Wu; Li, Ji-Lin; Zhang, Peng; Li, Hong-Lei; Cui, Wen-Yan; Wang, Wei-Peng; Liu, Zhi-Cai; Yang, Xia; Fu, Wen-Jing; Cui, Ji-Li; Lin, Hong-Li; Zhu, Wen-Liang; Liu, Min; Chen, Xi; Chen, Jie; Guo, Li; Han, Jing-Jing; Zhou, Sheng-Li; Huang, Jia; Wu, Yue; Yuan, Chao; Huang, Jing; Ji, Ai-Fang; Kul, Jian-Wei; Fan, Zhong-Min; Wang, Jian-Po; Zhang, Dong-Yun; Zhang, Lian-Qun; Zhang, Wei; Chen, Yuan-Fang; Ren, Jing-Li; Li, Xiu-Min; Dong, Jin-Cheng; Xing, Guo-Lan; Guo, Zhi-Gang; Yang, Jian-Xue; Mao, Yi-Ming; Yuan, Yuan; Guo, Er-Tao; Zhang, Wei; Hou, Zhi-Chao; Liu, Jing; Li, Yan; Tang, Sa; Chang, Jia; Peng, Xiu-Qin; Han, Min; Yin, Wan-Li; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Yang, Liu-Qin; Zhu, Fu-Guo; Yang, Xiu-Feng; Feng, Xiao-Shan; Wang, Zhou; Li, Yin; Gao, She-Gan; Liu, Hai-Lin; Yuan, Ling; Jin, Yan; Zhang, Yan-Rui; Sheyhidin, Ilyar; Li, Feng; Chen, Bao-Ping; Ren, Shu-Wei; Liu, Bin; Li, Dan; Zhang, Gao-Fu; Yue, Wen-Bin; Feng, Chang-Wei; Qige, Qirenwang; Zhao, Jian-Ting; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Xu, Li-Yan; Wu, Zhi-Yong; Bao, Zhi-Qin; Chen, Ji-Li; Li, Xian-Chang; Zhuang, Xiang; Zhou, Ying-Fa; Zuo, Xian-Bo; Dong, Zi-Ming; Wang, Lu-Wen; Fan, Xue-Pin; Wang, Jin; Zhou, Qi; Ma, Guo-Shun; Zhang, Qin-Xian; Liu, Hai; Jian, Xin-Ying; Lian, Sin-Yong; Wang, Jin-Sheng; Chang, Fu-Bao; Lu, Chang-Dong; Miao, Jian-Jun; Chen, Zhi-Guo; Wang, Ran; Guo, Ming; Fan, Zeng-Lin; Tao, Ping; Liu, Tai-Jing; Wei, Jin-Chang; Kong, Qing-Peng; Fan, Lei; Wang, Xian-Zeng; Gao, Fu-Sheng; Wang, Tian-Yun; Xie, Dong; Wang, Li; Chen, Shu-Qing; Yang, Wan-Cai; Hong, Jun-Yan; Wang, Liang; Qiu, Song-Liang; Goldstein, Alisa M; Yuan, Zhi-Qing; Chanock, Stephen J; Zhang, Xue-Jun; Taylor, Philip R; Wang, Li-Dong
2012-05-01
Genome-wide association studies have identified susceptibility loci for esophageal squamous cell carcinoma (ESCC). We conducted a meta-analysis of all single-nucleotide polymorphisms (SNPs) that showed nominally significant P-values in two previously published genome-wide scans that included a total of 2961 ESCC cases and 3400 controls. The meta-analysis revealed five SNPs at 2q33 with P< 5 × 10(-8), and the strongest signal was rs13016963, with a combined odds ratio (95% confidence interval) of 1.29 (1.19-1.40) and P= 7.63 × 10(-10). An imputation analysis of 4304 SNPs at 2q33 suggested a single association signal, and the strongest imputed SNP associations were similar to those from the genotyped SNPs. We conducted an ancestral recombination graph analysis with 53 SNPs to identify one or more haplotypes that harbor the variants directly responsible for the detected association signal. This showed that the five SNPs exist in a single haplotype along with 45 imputed SNPs in strong linkage disequilibrium, and the strongest candidate was rs10201587, one of the genotyped SNPs. Our meta-analysis found genome-wide significant SNPs at 2q33 that map to the CASP8/ALS2CR12/TRAK2 gene region. Variants in CASP8 have been extensively studied across a spectrum of cancers with mixed results. The locus we identified appears to be distinct from the widely studied rs3834129 and rs1045485 SNPs in CASP8. Future studies of esophageal and other cancers should focus on comprehensive sequencing of this 2q33 locus and functional analysis of rs13016963 and rs10201587 and other strongly correlated variants.
Common variants in Mendelian kidney disease genes and their association with renal function.
Parsa, Afshin; Fuchsberger, Christian; Köttgen, Anna; O'Seaghdha, Conall M; Pattaro, Cristian; de Andrade, Mariza; Chasman, Daniel I; Teumer, Alexander; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Kim, Young J; Taliun, Daniel; Li, Man; Feitosa, Mary; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; Glazer, Nicole; Isaacs, Aaron; Rao, Madhumathi; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Couraki, Vincent; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Hofer, Edith; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H-Erich; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; van Duijn, Cornelia M; Borecki, Ingrid; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Bochud, Murielle; Heid, Iris M; Siscovick, David S; Fox, Caroline S; Kao, W Linda; Böger, Carsten A
2013-12-01
Many common genetic variants identified by genome-wide association studies for complex traits map to genes previously linked to rare inherited Mendelian disorders. A systematic analysis of common single-nucleotide polymorphisms (SNPs) in genes responsible for Mendelian diseases with kidney phenotypes has not been performed. We thus developed a comprehensive database of genes for Mendelian kidney conditions and evaluated the association between common genetic variants within these genes and kidney function in the general population. Using the Online Mendelian Inheritance in Man database, we identified 731 unique disease entries related to specific renal search terms and confirmed a kidney phenotype in 218 of these entries, corresponding to mutations in 258 genes. We interrogated common SNPs (minor allele frequency >5%) within these genes for association with the estimated GFR in 74,354 European-ancestry participants from the CKDGen Consortium. However, the top four candidate SNPs (rs6433115 at LRP2, rs1050700 at TSC1, rs249942 at PALB2, and rs9827843 at ROBO2) did not achieve significance in a stage 2 meta-analysis performed in 56,246 additional independent individuals, indicating that these common SNPs are not associated with estimated GFR. The effect of less common or rare variants in these genes on kidney function in the general population and disease-specific cohorts requires further research.
Raju, Nikku L; Gnanesh, Belaghihalli N; Lekha, Pazhamala; Jayashree, Balaji; Pande, Suresh; Hiremath, Pavana J; Byregowda, Munishamappa; Singh, Nagendra K; Varshney, Rajeev K
2010-03-11
Pigeonpea (Cajanus cajan (L.) Millsp) is one of the major grain legume crops of the tropics and subtropics, but biotic stresses [Fusarium wilt (FW), sterility mosaic disease (SMD), etc.] are serious challenges for sustainable crop production. Modern genomic tools such as molecular markers and candidate genes associated with resistance to these stresses offer the possibility of facilitating pigeonpea breeding for improving biotic stress resistance. Availability of limited genomic resources, however, is a serious bottleneck to undertake molecular breeding in pigeonpea to develop superior genotypes with enhanced resistance to above mentioned biotic stresses. With an objective of enhancing genomic resources in pigeonpea, this study reports generation and analysis of comprehensive resource of FW- and SMD- responsive expressed sequence tags (ESTs). A total of 16 cDNA libraries were constructed from four pigeonpea genotypes that are resistant and susceptible to FW ('ICPL 20102' and 'ICP 2376') and SMD ('ICP 7035' and 'TTB 7') and a total of 9,888 (9,468 high quality) ESTs were generated and deposited in dbEST of GenBank under accession numbers GR463974 to GR473857 and GR958228 to GR958231. Clustering and assembly analyses of these ESTs resulted into 4,557 unique sequences (unigenes) including 697 contigs and 3,860 singletons. BLASTN analysis of 4,557 unigenes showed a significant identity with ESTs of different legumes (23.2-60.3%), rice (28.3%), Arabidopsis (33.7%) and poplar (35.4%). As expected, pigeonpea ESTs are more closely related to soybean (60.3%) and cowpea ESTs (43.6%) than other plant ESTs. Similarly, BLASTX similarity results showed that only 1,603 (35.1%) out of 4,557 total unigenes correspond to known proteins in the UniProt database (
2010-01-01
Background Pigeonpea (Cajanus cajan (L.) Millsp) is one of the major grain legume crops of the tropics and subtropics, but biotic stresses [Fusarium wilt (FW), sterility mosaic disease (SMD), etc.] are serious challenges for sustainable crop production. Modern genomic tools such as molecular markers and candidate genes associated with resistance to these stresses offer the possibility of facilitating pigeonpea breeding for improving biotic stress resistance. Availability of limited genomic resources, however, is a serious bottleneck to undertake molecular breeding in pigeonpea to develop superior genotypes with enhanced resistance to above mentioned biotic stresses. With an objective of enhancing genomic resources in pigeonpea, this study reports generation and analysis of comprehensive resource of FW- and SMD- responsive expressed sequence tags (ESTs). Results A total of 16 cDNA libraries were constructed from four pigeonpea genotypes that are resistant and susceptible to FW ('ICPL 20102' and 'ICP 2376') and SMD ('ICP 7035' and 'TTB 7') and a total of 9,888 (9,468 high quality) ESTs were generated and deposited in dbEST of GenBank under accession numbers GR463974 to GR473857 and GR958228 to GR958231. Clustering and assembly analyses of these ESTs resulted into 4,557 unique sequences (unigenes) including 697 contigs and 3,860 singletons. BLASTN analysis of 4,557 unigenes showed a significant identity with ESTs of different legumes (23.2-60.3%), rice (28.3%), Arabidopsis (33.7%) and poplar (35.4%). As expected, pigeonpea ESTs are more closely related to soybean (60.3%) and cowpea ESTs (43.6%) than other plant ESTs. Similarly, BLASTX similarity results showed that only 1,603 (35.1%) out of 4,557 total unigenes correspond to known proteins in the UniProt database (≤ 1E-08). Functional categorization of the annotated unigenes sequences showed that 153 (3.3%) genes were assigned to cellular component category, 132 (2.8%) to biological process, and 132 (2.8%) in molecular function. Further, 19 genes were identified differentially expressed between FW- responsive genotypes and 20 between SMD- responsive genotypes. Generated ESTs were compiled together with 908 ESTs available in public domain, at the time of analysis, and a set of 5,085 unigenes were defined that were used for identification of molecular markers in pigeonpea. For instance, 3,583 simple sequence repeat (SSR) motifs were identified in 1,365 unigenes and 383 primer pairs were designed. Assessment of a set of 84 primer pairs on 40 elite pigeonpea lines showed polymorphism with 15 (28.8%) markers with an average of four alleles per marker and an average polymorphic information content (PIC) value of 0.40. Similarly, in silico mining of 133 contigs with ≥ 5 sequences detected 102 single nucleotide polymorphisms (SNPs) in 37 contigs. As an example, a set of 10 contigs were used for confirming in silico predicted SNPs in a set of four genotypes using wet lab experiments. Occurrence of SNPs were confirmed for all the 6 contigs for which scorable and sequenceable amplicons were generated. PCR amplicons were not obtained in case of 4 contigs. Recognition sites for restriction enzymes were identified for 102 SNPs in 37 contigs that indicates possibility of assaying SNPs in 37 genes using cleaved amplified polymorphic sequences (CAPS) assay. Conclusion The pigeonpea EST dataset generated here provides a transcriptomic resource for gene discovery and development of functional markers associated with biotic stress resistance. Sequence analyses of this dataset have showed conservation of a considerable number of pigeonpea transcripts across legume and model plant species analysed as well as some putative pigeonpea specific genes. Validation of identified biotic stress responsive genes should provide candidate genes for allele mining as well as candidate markers for molecular breeding. PMID:20222972
Zhang, Zhaohui; Ma, Fei; Zhou, Feng; Chen, Yibing; Wang, Xiaoyan; Zhang, Hongxin; Zhu, Yong; Bi, Jianwei; Zhang, Yiguan
2014-12-01
Previous studies have demonstrated that circadian negative feedback loop genes play an important role in the development and progression of many cancers. However, the associations between single-nucleotide polymorphisms (SNPs) in these genes and the clinical outcomes of hepatocellular carcinoma (HCC) after surgical resection have not been studied so far. Thirteen functional SNPs in circadian genes were genotyped using the Sequenom iPLEX genotyping system in a cohort of 489 Chinese HCC patients who received radical resection. Multivariate Cox proportional hazards model and Kaplan-Meier curve were used for the prognosis analysis. Cumulative effect analysis and survival tree analysis were used for the multiple SNPs analysis. Four individual SNPs, including rs3027178 in PER1, rs228669 and rs2640908 in PER3 and rs3809236 in CRY1, were significantly associated with overall survival (OS) of HCC patients, and three SNPs, including rs3027178 in PER1, rs228729 in PER3 and rs3809236 in CRY1, were significantly associated with recurrence-free survival (RFS). Moreover, we observed a cumulative effect of significant SNPs on OS and RFS (P for trend < 0.001 for both). Survival tree analysis indicated that wild genotype of rs228729 in PER3 was the primary risk factor contributing to HCC patients' RFS. Our study suggests that the polymorphisms in circadian negative feedback loop genes may serve as independent prognostic biomarkers in predicting clinical outcomes for HCC patients who received radical resection. Further studies with different ethnicities are needed to validate our findings and generalize its clinical utility.
Wu, Ann C.; Himes, Blanca E.; Lasky-Su, Jessica; Litonjua, Augusto; Peters, Stephen P.; Lima, John; Kubo, Michiaki; Tamari, Mayumi; Nakamura, Yusuke; Qiu, Weiliang; Weiss, Scott T.; Tantisira, Kelan
2013-01-01
Background Single nucleotide polymorphisms (SNPs) influence a patient's response to inhaled corticosteroids and β2-agonists, and the effect of treatment with inhaled corticosteroids is synergistic with the effect of β2-agonists. We hypothesized that use of inhaled corticosteroids could influence the effect of SNPs associated with bronchodilator response. Objective To assess whether, among asthma subjects, the association of SNPs with bronchodilator response is different between those treated with inhaled corticosteroids vs. those on placebo. Methods A genome-wide association analysis was conducted using 581 white subjects from the Childhood Asthma Management Program (CAMP). Using data for 449,540 SNPs, we conducted a gene by environment analysis in PLINK with inhaled corticosteroid treatment as the environmental exposure and bronchodilator response as the outcome measure. We attempted to replicate the top 12 SNPs in the Leukotriene Modifier Or Corticosteroid or Corticosteroid-Salmeterol (LOCCS) Trial. Results The combined P-value for the CAMP and LOCCS populations was 4.81E-08 for rs3752120, which is located in the zinc finger protein gene ZNF432, and has unknown function. Conclusions Inhaled corticosteroids appear to modulate the association of bronchodilator response with variant(s) in the ZNF432 gene among adults and children with asthma. Clinical Implications Clinicians who treat asthma patients with inhaled corticosteroids should be aware that the patient's genetic makeup likely influences response as measured in lung function. Capsule Summary Our study suggests that inhaled corticosteroids could influence the effect of multiple SNPs associated with bronchodilator response across the genome. PMID:24280104
Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe
2010-01-01
Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome. PMID:20543950
Genome supranucleosomal organization and genetic susceptibility to diseases
NASA Astrophysics Data System (ADS)
Jablonski, K. P.; Fretter, C.; Carron, L.; Forné, T.; Hütt, M.-T.; Lesne, A.
2017-09-01
The notion of disease-associated single-nucleotide polymorphisms (da-SNP), as determined in genome-wide association studies (GWAS), is relevant for many complex pathologies, including cancers. It appeared that da-SNPs are not only markers of causal genetic variation but may contribute to the disease development through an influence on gene expression levels. We argue that understanding this possible functional role of da-SNPs requires to consider their embedding in the tridimensional (3D) multi-scale organization of the human genome. We then focus on the potential impact of da-SNPs on chromatin loops and recently observed topologically associating domains (TADs). We show that for some diseases and cancer types, da-SNPs are over-represented in the borders of these topological domains, in a way that cannot be explained by an increased exon density. This analysis of the distribution of da-SNPs within the 3D genome organization suggests candidate loci for further experimental investigation of the mechanisms underlying genetic susceptibility to diseases, in particular cancer.
Chu, Shuyuan; Zhong, Xiaoning; Zhang, Jianquan; Lai, Xiaoying; Xie, Jiajun; Li, Yu
2016-12-01
Forkhead box P3 (FOXP3) is the essential transcription factor for the function of regulatory T-cell (Treg). However, the gene mutation of FOXP3 in patients with chronic obstructive pulmonary disease (COPD) at different stages has not been reported. We aim to investigate four single nucleotide polymorphisms (SNPs) and the mRNA expression of FOXP3 in smokers with normal lung function and smokers with COPD at different stages. FOXP3 mRNA expression and SNPs in FOXP3 were assessed in nonsmokers with normal lung function (N), smokers with normal lung function (S), smokers with COPD in the Global Initiative for Chronic Obstructive Lung Disease (GOLD) 1 or 2 grade (COPD 1-2), and smokers with COPD in GOLD 3 or 4 grade (COPD 3-4). In peripheral blood sample, FOXP3 mRNA was assessed using real-time quantitative PCR and SNPs were analyzed by TaqMan PCR. FOXP3 mRNA level in peripheral blood sample was decreased when COPD was aggravated. The frequency of FOXP3 rs5902434 genotype del/del and allele del are lower in COPD 1-2 and COPD 3-4 than that in N or S. The rs5902434 genotype del/del and allele del were, respectively, associated with decreased risk of COPD and lung function decline. The rs5902434 genotypic distribution was correlated with FOXP3 mRNA level. In conclusion, both FOXP3 rs5902434 genotypes and alleles were differently distributed in COPD patients and smokers with normal lung function. The distribution of del/del genotype was associated with systemic expression of FOXP3 mRNA. More research is needed to explore the role of FOXP3 gene polymorphism in immunoinflammation of COPD.
Hack, Laura M.; Kalsi, Gursharan; Aliev, Fazil; Kuo, Po-Hsiu; Prescott, Carol A.; Patterson, Diana G.; Walsh, Dermot; Dick, Danielle M.; Riley, Brien P.; Kendler, Kenneth S.
2012-01-01
Background Over 50 years of evidence from research has established that the central dopaminergic reward pathway is likely involved in alcohol dependence (AD). Additional evidence supports a role for dopamine (DA) in other disinhibitory psychopathology, which is often comorbid with AD. Family and twin studies demonstrate that a common genetic component accounts for most of the genetic variance in these traits. Thus, DA-related genes represent putative candidates for the genetic risk that underlies not only AD but also behavioral disinhibition. Many linkage and association studies have examined these relationships with inconsistent results, possibly because of low power, poor marker coverage, and/or an inappropriate correction for multiple testing. Methods We conducted an association study on the products encoded by 10 DA-related genes (DRD1-D5, SLC18A2, SLC6A3, DDC, TH, COMT) using a large, ethnically homogeneous sample with severe AD (n = 545) and screened controls (n = 509). We collected genotypes from linkage disequilibrium (LD)-tagging single nucleotide polymorphisms (SNPs) and employed a gene-based method of correction. We tested for association with AD diagnosis in cases and controls and with a variety of alcohol-related traits (including age-at-onset, initial sensitivity, tolerance, maximum daily drinks, and a withdrawal factor score), disinhibitory symptoms, and a disinhibitory factor score in cases only. A total of 135 SNPs were genotyped using the Illumina GoldenGate and Taqman Assays-on-Demand protocols. Results Of the 101 SNPs entered into standard analysis, 6 independent SNPs from 5 DA genes were associated with AD or a quantitative alcohol-related trait. Two SNPs across 2 genes were associated with a disinhibitory symptom count, while 1 SNP in DRD5 was positive for association with the general disinhibitory factor score. Conclusions Our study provides evidence of modest associations between a small number of DA-related genes and AD as well as a range of alcohol-related traits and measures of behavioral disinhibition. While we did conduct gene-based correction for multiple testing, we did not correct for multiple traits because the traits are correlated. However, false-positive findings remain possible, so our results must be interpreted with caution. PMID:21083670
Hiremath, Pavana J; Farmer, Andrew; Cannon, Steven B; Woodward, Jimmy; Kudapa, Himabindu; Tuteja, Reetu; Kumar, Ashish; Bhanuprakash, Amindala; Mulaosmanovic, Benjamin; Gujaria, Neha; Krishnamurthy, Laxmanan; Gaur, Pooran M; Kavikishor, Polavarapu B; Shah, Trushar; Srinivasan, Ramamurthy; Lohse, Marc; Xiao, Yongli; Town, Christopher D; Cook, Douglas R; May, Gregory D; Varshney, Rajeev K
2011-10-01
Chickpea (Cicer arietinum L.) is an important legume crop in the semi-arid regions of Asia and Africa. Gains in crop productivity have been low however, particularly because of biotic and abiotic stresses. To help enhance crop productivity using molecular breeding techniques, next generation sequencing technologies such as Roche/454 and Illumina/Solexa were used to determine the sequence of most gene transcripts and to identify drought-responsive genes and gene-based molecular markers. A total of 103,215 tentative unique sequences (TUSs) have been produced from 435,018 Roche/454 reads and 21,491 Sanger expressed sequence tags (ESTs). Putative functions were determined for 49,437 (47.8%) of the TUSs, and gene ontology assignments were determined for 20,634 (41.7%) of the TUSs. Comparison of the chickpea TUSs with the Medicago truncatula genome assembly (Mt 3.5.1 build) resulted in 42,141 aligned TUSs with putative gene structures (including 39,281 predicted intron/splice junctions). Alignment of ∼37 million Illumina/Solexa tags generated from drought-challenged root tissues of two chickpea genotypes against the TUSs identified 44,639 differentially expressed TUSs. The TUSs were also used to identify a diverse set of markers, including 728 simple sequence repeats (SSRs), 495 single nucleotide polymorphisms (SNPs), 387 conserved orthologous sequence (COS) markers, and 2088 intron-spanning region (ISR) markers. This resource will be useful for basic and applied research for genome analysis and crop improvement in chickpea. Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd. No claim to original US government works.
Finno, Carrie J; Gianino, Giuliana; Perumbakkam, Sudeep; Williams, Zoë J; Bordbari, Matthew H; Gardner, Keri L; Burns, Erin; Peng, Sichong; Durward-Akhurst, Sian A; Valberg, Stephanie J
2018-03-06
The cause of immune-mediated myositis (IMM), characterized by recurrent, rapid-onset muscle atrophy in Quarter Horses (QH), is unknown. The histopathologic hallmark of IMM is lymphocytic infiltration of myofibers. The purpose of this study was to identify putative functional variants associated with equine IMM. A genome-wide association (GWA) study was performed on 36 IMM QHs and 54 breed matched unaffected QHs from the same environment using the Equine SNP50 and SNP70 genotyping arrays. A mixed model analysis identified nine SNPs within a ~ 2.87 Mb region on chr11 that were significantly (P unadjusted < 1.4 × 10 - 6 ) associated with the IMM phenotype. Associated haplotypes within this region encompassed 38 annotated genes, including four myosin genes (MYH1, MYH2, MYH3, and MYH13). Whole genome sequencing of four IMM and four unaffected QHs identified a single segregating nonsynonymous E321G mutation in MYH1 encoding myosin heavy chain 2X. Genotyping of additional 35 IMM and 22 unaffected QHs confirmed an association (P = 2.9 × 10 - 5 ), and the putative mutation was absent in 175 horses from 21 non-QH breeds. Lymphocytic infiltrates occurred in type 2X myofibers and the proportion of 2X fibers was decreased in the presence of inflammation. Protein modeling and contact/stability analysis identified 14 residues affected by the mutation which significantly decreased stability. We conclude that a mutation in MYH1 is highly associated with susceptibility to the IMM phenotype in QH-related breeds. This is the first report of a mutation in MYH1 and the first link between a skeletal muscle myosin mutation and autoimmune disease.
Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.
Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R; Taylor, Jeremy F; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart
2016-01-01
Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNP selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal.
Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications
Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R.; Taylor, Jeremy F.; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart
2016-01-01
Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNP selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal. PMID:27583971
Bellampalli, Ravishankara; Phani, Nagaraja M; Bhat, Kamalakshi G; Prasad, Krishna; Bhaskaranand, Nalini; Guruprasad, Kanive P; Rai, Padmalatha S; Satyamoorthy, Kapaettu
2015-05-01
Acute lymphoblastic leukemia (ALL) arises due to several genetic alterations in progenitor cells, and methotrexate is frequently used as part of the treatment regimen. Although there is evidence for an effect of 5,10-methylenetetrahydrofolate reductase gene (MTHFR) C677T and A1298C variations on drug response in ALL, its risk association for ALL is still unresolved. In a case-control study of 203 patients with ALL and 246 controls and meta-analysis in the Indian population, we showed an insignificant association of MTHFR C677T and A1298C genotypes with childhood and adult ALL. Comprehensive in silico characterization of non-synonymous single nucleotide polymorphisms (nsSNPs) and SNPs of the 3' untranslated region (UTR) revealed nine nsSNPs as deleterious, and three SNPs in the 3'UTR could possibly alter the binding of miRNAs. The study revealed that several overlooked SNPs may contribute to the risk of ALL susceptibility and further studies of these SNPs with functional characterization in a large sample size are required to understand the significant role of MTHFR in ALL development.
Zhang, Kunlin; Chang, Suhua; Cui, Sijia; Guo, Liyuan; Zhang, Liuyan; Wang, Jing
2011-07-01
Genome-wide association study (GWAS) is widely utilized to identify genes involved in human complex disease or some other trait. One key challenge for GWAS data interpretation is to identify causal SNPs and provide profound evidence on how they affect the trait. Currently, researches are focusing on identification of candidate causal variants from the most significant SNPs of GWAS, while there is lack of support on biological mechanisms as represented by pathways. Although pathway-based analysis (PBA) has been designed to identify disease-related pathways by analyzing the full list of SNPs from GWAS, it does not emphasize on interpreting causal SNPs. To our knowledge, so far there is no web server available to solve the challenge for GWAS data interpretation within one analytical framework. ICSNPathway is developed to identify candidate causal SNPs and their corresponding candidate causal pathways from GWAS by integrating linkage disequilibrium (LD) analysis, functional SNP annotation and PBA. ICSNPathway provides a feasible solution to bridge the gap between GWAS and disease mechanism study by generating hypothesis of SNP → gene → pathway(s). The ICSNPathway server is freely available at http://icsnpathway.psych.ac.cn/.
Xu, Meixiang; Nekhayeva, Ilona; Cross, Courtney E; Rondelli, Catherine M; Wickliffe, Jeffrey K; Abdel-Rahman, Sherif Z
2014-03-01
The O6-methylguanine-DNA methyltransferase gene (MGMT) encodes the direct reversal DNA repair protein that removes alkyl adducts from the O6 position of guanine. Several single-nucleotide polymorphisms (SNPs) exist in the MGMT promoter/enhancer (P/E) region. However, the haplotype structure encompassing these SNPs and their functional/biological significance are currently unknown. We hypothesized that MGMT P/E haplotypes, rather than individual SNPs, alter MGMT transcription and can thus alter human sensitivity to alkylating agents. To identify the haplotype structure encompassing the MGMT P/E region SNPs, we sequenced 104 DNA samples from healthy individuals and inferred the haplotypes using the data generated. We identified eight SNPs in this region, namely T7C (rs180989103), T135G (rs1711646), G290A (rs61859810), C485A (rs1625649), C575A (rs113813075), G666A (rs34180180), C777A (rs34138162) and C1099T (rs16906252). Phylogenetics and Sequence Evolution analysis predicted 21 potential haplotypes that encompass these SNPs ranging in frequencies from 0.000048 to 0.39. Of these, 10 were identified in our study population as 20 paired haplotype combinations. To determine the functional significance of these haplotypes, luciferase reporter constructs representing these haplotypes were transfected into glioblastoma cells and their effect on MGMT promoter activity was determined. Compared with the most common (reference) haplotype 1, seven haplotypes significantly upregulated MGMT promoter activity (18-119% increase; P < 0.05), six significantly downregulated MGMT promoter activity (29-97% decrease; P < 0.05) and one haplotype had no effect. Mechanistic studies conducted support the conclusion that MGMT P/E haplotypes, rather than individual SNPs, differentially regulate MGMT transcription and could thus play a significant role in human sensitivity to environmental and therapeutic alkylating agents.
Chen, Yi; Luo, Yan; Curry, Phillip; Timme, Ruth; Melka, David; Doyle, Matthew; Parish, Mickey; Hammack, Thomas S; Allard, Marc W; Brown, Eric W; Strain, Errol A
2017-01-01
A listeriosis outbreak in the United States implicated contaminated ice cream produced by one company, which operated 3 facilities. We performed single nucleotide polymorphism (SNP)-based whole genome sequencing (WGS) analysis on Listeria monocytogenes from food, environmental and clinical sources, identifying two clusters and a single branch, belonging to PCR serogroup IIb and genetic lineage I. WGS Cluster I, representing one outbreak strain, contained 82 food and environmental isolates from Facility I and 4 clinical isolates. These isolates differed by up to 29 SNPs, exhibited 9 pulsed-field gel electrophoresis (PFGE) profiles and multilocus sequence typing (MLST) sequence type (ST) 5 of clonal complex 5 (CC5). WGS Cluster II contained 51 food and environmental isolates from Facility II, 4 food isolates from Facility I and 5 clinical isolates. Among them the isolates from Facility II and clinical isolates formed a clade and represented another outbreak strain. Isolates in this clade differed by up to 29 SNPs, exhibited 3 PFGE profiles and ST5. The only isolate collected from Facility III belonged to singleton ST489, which was in a single branch separate from Clusters I and II, and was not associated with the outbreak. WGS analyses clustered together outbreak-associated isolates exhibiting multiple PFGE profiles, while differentiating them from epidemiologically unrelated isolates that exhibited outbreak PFGE profiles. The complete genome of a Cluster I isolate allowed the identification and analyses of putative prophages, revealing that Cluster I isolates differed by the gain or loss of three putative prophages, causing the banding pattern differences among all 3 AscI-PFGE profiles observed in Cluster I isolates. WGS data suggested that certain ice cream varieties and/or production lines might have contamination sources unique to them. The SNP-based analysis was able to distinguish CC5 as a group from non-CC5 isolates and differentiate among CC5 isolates from different outbreaks/incidents.
Chen, Yi; Luo, Yan; Curry, Phillip; Timme, Ruth; Melka, David; Doyle, Matthew; Parish, Mickey; Hammack, Thomas S.; Allard, Marc W.; Brown, Eric W.; Strain, Errol A.
2017-01-01
A listeriosis outbreak in the United States implicated contaminated ice cream produced by one company, which operated 3 facilities. We performed single nucleotide polymorphism (SNP)-based whole genome sequencing (WGS) analysis on Listeria monocytogenes from food, environmental and clinical sources, identifying two clusters and a single branch, belonging to PCR serogroup IIb and genetic lineage I. WGS Cluster I, representing one outbreak strain, contained 82 food and environmental isolates from Facility I and 4 clinical isolates. These isolates differed by up to 29 SNPs, exhibited 9 pulsed-field gel electrophoresis (PFGE) profiles and multilocus sequence typing (MLST) sequence type (ST) 5 of clonal complex 5 (CC5). WGS Cluster II contained 51 food and environmental isolates from Facility II, 4 food isolates from Facility I and 5 clinical isolates. Among them the isolates from Facility II and clinical isolates formed a clade and represented another outbreak strain. Isolates in this clade differed by up to 29 SNPs, exhibited 3 PFGE profiles and ST5. The only isolate collected from Facility III belonged to singleton ST489, which was in a single branch separate from Clusters I and II, and was not associated with the outbreak. WGS analyses clustered together outbreak-associated isolates exhibiting multiple PFGE profiles, while differentiating them from epidemiologically unrelated isolates that exhibited outbreak PFGE profiles. The complete genome of a Cluster I isolate allowed the identification and analyses of putative prophages, revealing that Cluster I isolates differed by the gain or loss of three putative prophages, causing the banding pattern differences among all 3 AscI-PFGE profiles observed in Cluster I isolates. WGS data suggested that certain ice cream varieties and/or production lines might have contamination sources unique to them. The SNP-based analysis was able to distinguish CC5 as a group from non-CC5 isolates and differentiate among CC5 isolates from different outbreaks/incidents. PMID:28166293
Mruczek, Ryan E. B.
2012-01-01
The cerebral cortex is composed of many distinct classes of neurons. Numerous studies have demonstrated corresponding differences in neuronal properties across cell types, but these comparisons have largely been limited to conditions outside of awake, behaving animals. Thus the functional role of the various cell types is not well understood. Here, we investigate differences in the functional properties of two widespread and broad classes of cells in inferior temporal cortex of macaque monkeys: inhibitory interneurons and excitatory projection cells. Cells were classified as putative inhibitory or putative excitatory neurons on the basis of their extracellular waveform characteristics (e.g., spike duration). Consistent with previous intracellular recordings in cortical slices, putative inhibitory neurons had higher spontaneous firing rates and higher stimulus-evoked firing rates than putative excitatory neurons. Additionally, putative excitatory neurons were more susceptible to spike waveform adaptation following very short interspike intervals. Finally, we compared two functional properties of each neuron's stimulus-evoked response: stimulus selectivity and response latency. First, putative excitatory neurons showed stronger stimulus selectivity compared with putative inhibitory neurons. Second, putative inhibitory neurons had shorter response latencies compared with putative excitatory neurons. Selectivity differences were maintained and latency differences were enhanced during a visual search task emulating more natural viewing conditions. Our results suggest that short-latency inhibitory responses are likely to sculpt visual processing in excitatory neurons, yielding a sparser visual representation. PMID:22933717
Vidal, Adriana C; Tucker, Cocoa; Schildkraut, Joellen M; Richardson, Ricardo M; McPhail, Megan; Freedland, Stephen J; Hoyo, Cathrine; Grant, Delores J
2013-11-22
We have previously shown that a functional polymorphism of the UGT2B15 gene (rs1902023) was associated with increased risk of prostate cancer (PC). Novel functional polymorphisms of the UGT2B17 and UGT2B15 genes have been recently characterized by in vitro assays but have not been evaluated in epidemiologic studies. Fifteen functional SNPs of the UGT2B17 and UGT2B15 genes, including cis-acting UGT2B gene SNPs, were genotyped in African American and Caucasian men (233 PC cases and 342 controls). Regression models were used to analyze the association between SNPs and PC risk. After adjusting for race, age and BMI, we found that six UGT2B15 SNPs (rs4148269, rs3100, rs9994887, rs13112099, rs7686914 and rs7696472) were associated with an increased risk of PC in log-additive models (p < 0.05). A SNP cis-acting on UGT2B17 and UGT2B15 expression (rs17147338) was also associated with increased risk of prostate cancer (OR = 1.65, 95% CI = 1.00-2.70); while a stronger association among men with high Gleason sum was observed for SNPs rs4148269 and rs3100. Although small sample size limits inference, we report novel associations between UGT2B15 and UGT2B17 variants and PC risk. These associations with PC risk in men with high Gleason sum, more frequently found in African American men, support the relevance of genetic differences in the androgen metabolism pathway, which could explain, in part, the high incidence of PC among African American men. Larger studies are required.
2013-01-01
Background We have previously shown that a functional polymorphism of the UGT2B15 gene (rs1902023) was associated with increased risk of prostate cancer (PC). Novel functional polymorphisms of the UGT2B17 and UGT2B15 genes have been recently characterized by in vitro assays but have not been evaluated in epidemiologic studies. Methods Fifteen functional SNPs of the UGT2B17 and UGT2B15 genes, including cis-acting UGT2B gene SNPs, were genotyped in African American and Caucasian men (233 PC cases and 342 controls). Regression models were used to analyze the association between SNPs and PC risk. Results After adjusting for race, age and BMI, we found that six UGT2B15 SNPs (rs4148269, rs3100, rs9994887, rs13112099, rs7686914 and rs7696472) were associated with an increased risk of PC in log-additive models (p < 0.05). A SNP cis-acting on UGT2B17 and UGT2B15 expression (rs17147338) was also associated with increased risk of prostate cancer (OR = 1.65, 95% CI = 1.00-2.70); while a stronger association among men with high Gleason sum was observed for SNPs rs4148269 and rs3100. Conclusions Although small sample size limits inference, we report novel associations between UGT2B15 and UGT2B17 variants and PC risk. These associations with PC risk in men with high Gleason sum, more frequently found in African American men, support the relevance of genetic differences in the androgen metabolism pathway, which could explain, in part, the high incidence of PC among African American men. Larger studies are required. PMID:24267955
Lu, Lingeng; Katsaros, Dionyssios; Mayne, Susan T; Risch, Harvey A; Benedetto, Chiara; Canuto, Emilie Marion; Yu, Herbert
2012-11-01
Several single-nucleotide polymorphisms (SNPs) of the stem cell-associated gene lin-28B have been identified in association with ovarian cancer and ovarian cancer-related risk factors. However, whether these SNPs are functional or might be potential biomarkers for ovarian cancer prognosis remains unknown. The purposes of this study were to investigate the functional relevance of the identified lin-28B SNPs, as well as the associations of genotype and phenotype with epithelial ovarian cancer (EOC) survival. We analyzed five SNPs and mRNA levels of lin-28B in 211 primary EOC tissues using Taqman(®) SNP genotyping assays and SYBR green-based real-time PCR, respectively. The RNA secondary structures at the region of a genome-wide association-identified intronic rs314276 were analyzed theoretically with mfold and experimentally with circular dichroism spectroscopy. We found that rs314276 was a cis-acting expression quantitative trait locus (eQTL) in both additive and dominant models, while rs7759938 and rs314277 were significant or of borderline significance in dominant models only. The rs314276 variant significantly affects RNA secondary structure. No SNPs alone were associated with patient survival. However, we found that among patients initially responding to chemotherapy, those with higher lin-28B expression had higher mortality risk (hazard ratio =3.27, 95% confidence interval: 1.63-6.56) and relapse risk (hazard ratio = 2.53, 95% confidence interval: 1.41-4.54) than those with lower expression, and these associations remained in multivariate analyses. These results suggest that rs314276 alters RNA secondary structure and thereby influences gene expression, and that lin-28B is a cancer stem cell-associated marker, which may be a pharmaceutical target in the management of EOC.
Culture adaptation of malaria parasites selects for convergent loss-of-function mutants.
Claessens, Antoine; Affara, Muna; Assefa, Samuel A; Kwiatkowski, Dominic P; Conway, David J
2017-01-24
Cultured human pathogens may differ significantly from source populations. To investigate the genetic basis of laboratory adaptation in malaria parasites, clinical Plasmodium falciparum isolates were sampled from patients and cultured in vitro for up to three months. Genome sequence analysis was performed on multiple culture time point samples from six monoclonal isolates, and single nucleotide polymorphism (SNP) variants emerging over time were detected. Out of a total of five positively selected SNPs, four represented nonsense mutations resulting in stop codons, three of these in a single ApiAP2 transcription factor gene, and one in SRPK1. To survey further for nonsense mutants associated with culture, genome sequences of eleven long-term laboratory-adapted parasite strains were examined, revealing four independently acquired nonsense mutations in two other ApiAP2 genes, and five in Epac. No mutants of these genes exist in a large database of parasite sequences from uncultured clinical samples. This implicates putative master regulator genes in which multiple independent stop codon mutations have convergently led to culture adaptation, affecting most laboratory lines of P. falciparum. Understanding the adaptive processes should guide development of experimental models, which could include targeted gene disruption to adapt fastidious malaria parasite species to culture.
Farlora, Rodolfo; Araya-Garay, José; Gallardo-Escárate, Cristian
2014-06-01
Understanding the molecular underpinnings involved in the reproduction of the salmon louse is critical for designing novel strategies of pest management for this ectoparasite. However, genomic information on sex-related genes is still limited. In the present work, sex-specific gene transcription was revealed in the salmon louse Caligus rogercresseyi using high-throughput Illumina sequencing. A total of 30,191,914 and 32,292,250 high quality reads were generated for females and males, and these were de novo assembled into 32,173 and 38,177 contigs, respectively. Gene ontology analysis showed a pattern of higher expression in the female as compared to the male transcriptome. Based on our sequence analysis and known sex-related proteins, several genes putatively involved in sex differentiation, including Dmrt3, FOXL2, VASA, and FEM1, and other potentially significant candidate genes in C. rogercresseyi, were identified for the first time. In addition, the occurrence of SNPs in several differentially expressed contigs annotating for sex-related genes was found. This transcriptome dataset provides a useful resource for future functional analyses, opening new opportunities for sea lice pest control. Copyright © 2014 Elsevier B.V. All rights reserved.
Testing of diabetes-associated WFS1 polymorphisms in the Diabetes Prevention Program
Florez, J. C.; Jablonski, K. A.; McAteer, J.; Sandhu, M. S.; Wareham, N. J.; Barroso, I.; Franks, P. W.; Altshuler, D.; Knowler, W. C.
2008-01-01
Aims/hypothesis Wolfram syndrome (diabetes insipidus, diabetes mellitus, optic atrophy and deafness) is caused by mutations in the WFS1 gene. Recently, single nucleotide polymorphisms (SNPs) in WFS1 have been reproducibly associated with type 2 diabetes. We therefore examined the effects of these variants on diabetes incidence and response to interventions in the Diabetes Prevention Program (DPP), in which a lifestyle intervention or metformin treatment was compared with placebo. Methods We genotyped the WFS1 SNPs rs10010131, rs752 854 and rs734312 (H611R) in 3,548 DPP participants and performed Cox regression analysis using genotype, intervention and their interactions as predictors of diabetes incidence. We also evaluated the effect of these SNPs on insulin resistance and beta cell function at 1 year. Results Although none of the three SNPs was associated with diabetes incidence in the overall cohort, white homozygotes for the previously reported protective alleles appeared less likely to develop diabetes in the lifestyle arm. Examination of the publicly available Diabetes Genetics Initiative genome-wide association dataset revealed that rs10012946, which is in strong linkage disequilibrium with the three WFS1 SNPs (r2=0.88–1.0), was associated with type 2 diabetes (allelic odds ratio 0.85, 95% CI 0.75–0.97, p=0.026). In the DPP, we noted a trend towards increased insulin secretion in carriers of the protective variants, although for most SNPs this was seen as compensatory for the diminished insulin sensitivity. Conclusions/interpretation The previously reported protective effect of select WFS1 alleles may be magnified by a lifestyle intervention. These variants appear to confer an improvement in beta cell function. PMID:18060660
Karaesmen, Ezgi; Rizvi, Abbas A.; Preus, Leah M.; McCarthy, Philip L.; Pasquini, Marcelo C.; Onel, Kenan; Zhu, Xiaochun; Spellman, Stephen; Haiman, Christopher A.; Stram, Daniel O.; Pooler, Loreall; Sheng, Xin; Zhu, Qianqian; Yan, Li; Liu, Qian; Hu, Qiang; Webb, Amy; Brock, Guy; Clay-Gilmour, Alyssa I.; Battaglia, Sebastiano; Tritchler, David; Liu, Song; Hahn, Theresa
2017-01-01
Multiple candidate gene-association studies of non-HLA single-nucleotide polymorphisms (SNPs) and outcomes after blood or marrow transplant (BMT) have been conducted. We identified 70 publications reporting 45 SNPs in 36 genes significantly associated with disease-related mortality, progression-free survival, transplant-related mortality, and/or overall survival after BMT. Replication and validation of these SNP associations were performed using DISCOVeRY-BMT (Determining the Influence of Susceptibility COnveying Variants Related to one-Year mortality after BMT), a well-powered genome-wide association study consisting of 2 cohorts, totaling 2888 BMT recipients with acute myeloid leukemia, acute lymphoblastic leukemia, or myelodysplastic syndrome, and their HLA-matched unrelated donors, reported to the Center for International Blood and Marrow Transplant Research. Gene-based tests were used to assess the aggregate effect of SNPs on outcome. None of the previously reported significant SNPs replicated at P < .05 in DISCOVeRY-BMT. Validation analyses showed association with one previously reported donor SNP at P < .05 and survival; more associations would be anticipated by chance alone. No gene-based tests were significant at P < .05. Functional annotation with publicly available data shows these candidate SNPs most likely do not have biochemical function; only 13% of candidate SNPs correlate with gene expression or are predicted to impact transcription factor binding. Of these, half do not impact the candidate gene of interest; the other half correlate with expression of multiple genes. These findings emphasize the peril of pursing candidate approaches and the importance of adequately powered tests of unbiased genome-wide associations with BMT clinical outcomes given the ultimate goal of improving patient outcomes. PMID:28811306
Pang, Erli; Wu, Xiaomei; Lin, Kui
2016-06-01
Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
Lamontagne, Maxime; Timens, Wim; Hao, Ke; Bossé, Yohan; Laviolette, Michel; Steiling, Katrina; Campbell, Joshua D; Couture, Christian; Conti, Massimo; Sherwood, Karen; Hogg, James C; Brandsma, Corry-Anke; van den Berge, Maarten; Sandford, Andrew; Lam, Stephen; Lenburg, Marc E; Spira, Avrum; Paré, Peter D; Nickle, David; Sin, Don D; Postma, Dirkje S
2014-11-01
COPD is a complex chronic disease with poorly understood pathogenesis. Integrative genomic approaches have the potential to elucidate the biological networks underlying COPD and lung function. We recently combined genome-wide genotyping and gene expression in 1111 human lung specimens to map expression quantitative trait loci (eQTL). To determine causal associations between COPD and lung function-associated single nucleotide polymorphisms (SNPs) and lung tissue gene expression changes in our lung eQTL dataset. We evaluated causality between SNPs and gene expression for three COPD phenotypes: FEV(1)% predicted, FEV(1)/FVC and COPD as a categorical variable. Different models were assessed in the three cohorts independently and in a meta-analysis. SNPs associated with a COPD phenotype and gene expression were subjected to causal pathway modelling and manual curation. In silico analyses evaluated functional enrichment of biological pathways among newly identified causal genes. Biologically relevant causal genes were validated in two separate gene expression datasets of lung tissues and bronchial airway brushings. High reliability causal relations were found in SNP-mRNA-phenotype triplets for FEV(1)% predicted (n=169) and FEV(1)/FVC (n=80). Several genes of potential biological relevance for COPD were revealed. eQTL-SNPs upregulating cystatin C (CST3) and CD22 were associated with worse lung function. Signalling pathways enriched with causal genes included xenobiotic metabolism, apoptosis, protease-antiprotease and oxidant-antioxidant balance. By using integrative genomics and analysing the relationships of COPD phenotypes with SNPs and gene expression in lung tissue, we identified CST3 and CD22 as potential causal genes for airflow obstruction. This study also augmented the understanding of previously described COPD pathways. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Common Variants in Mendelian Kidney Disease Genes and Their Association with Renal Function
Fuchsberger, Christian; Köttgen, Anna; O’Seaghdha, Conall M.; Pattaro, Cristian; de Andrade, Mariza; Chasman, Daniel I.; Teumer, Alexander; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Kim, Young J.; Taliun, Daniel; Li, Man; Feitosa, Mary; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; Glazer, Nicole; Isaacs, Aaron; Rao, Madhumathi; Smith, Albert V.; O’Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Couraki, Vincent; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Hofer, Edith; Hu, Frank; Demirkan, Ayse; Oostra, Ben A.; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H.-Erich; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; van Duijn, Cornelia M.; Borecki, Ingrid; Kardia, Sharon L.R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M.; Bochud, Murielle; Heid, Iris M.; Siscovick, David S.; Fox, Caroline S.; Kao, W. Linda; Böger, Carsten A.
2013-01-01
Many common genetic variants identified by genome-wide association studies for complex traits map to genes previously linked to rare inherited Mendelian disorders. A systematic analysis of common single-nucleotide polymorphisms (SNPs) in genes responsible for Mendelian diseases with kidney phenotypes has not been performed. We thus developed a comprehensive database of genes for Mendelian kidney conditions and evaluated the association between common genetic variants within these genes and kidney function in the general population. Using the Online Mendelian Inheritance in Man database, we identified 731 unique disease entries related to specific renal search terms and confirmed a kidney phenotype in 218 of these entries, corresponding to mutations in 258 genes. We interrogated common SNPs (minor allele frequency >5%) within these genes for association with the estimated GFR in 74,354 European-ancestry participants from the CKDGen Consortium. However, the top four candidate SNPs (rs6433115 at LRP2, rs1050700 at TSC1, rs249942 at PALB2, and rs9827843 at ROBO2) did not achieve significance in a stage 2 meta-analysis performed in 56,246 additional independent individuals, indicating that these common SNPs are not associated with estimated GFR. The effect of less common or rare variants in these genes on kidney function in the general population and disease-specific cohorts requires further research. PMID:24029420
Variant discovery in the sheep milk transcriptome using RNA sequencing.
Suárez-Vega, Aroa; Gutiérrez-Gil, Beatriz; Klopp, Christophe; Tosser-Klopp, Gwenola; Arranz, Juan José
2017-02-15
The identification of genetic variation underlying desired phenotypes is one of the main challenges of current livestock genetic research. High-throughput transcriptome sequencing (RNA-Seq) offers new opportunities for the detection of transcriptome variants (SNPs and short indels) in different tissues and species. In this study, we used RNA-Seq on Milk Sheep Somatic Cells (MSCs) with the goal of characterizing the genetic variation within the coding regions of the milk transcriptome in Churra and Assaf sheep, two common dairy sheep breeds farmed in Spain. A total of 216,637 variants were detected in the MSCs transcriptome of the eight ewes analyzed. Among them, a total of 57,795 variants were detected in the regions harboring Quantitative Trait Loci (QTL) for milk yield, protein percentage and fat percentage, of which 21.44% were novel variants. Among the total variants detected, 561 (2.52%) and 1,649 (7.42%) were predicted to produce high or moderate impact changes in the corresponding transcriptional unit, respectively. In the functional enrichment analysis of the genes positioned within selected QTL regions harboring novel relevant functional variants (high and moderate impact), the KEGG pathway with the highest enrichment was "protein processing in endoplasmic reticulum". Additionally, a total of 504 and 1,063 variants were identified in the genes encoding principal milk proteins and molecules involved in the lipid metabolism, respectively. Of these variants, 20 mutations were found to have putative relevant effects on the encoded proteins. We present herein the first transcriptomic approach aimed at identifying genetic variants of the genes expressed in the lactating mammary gland of sheep. Through the transcriptome analysis of variability within regions harboring QTL for milk yield, protein percentage and fat percentage, we have found several pathways and genes that harbor mutations that could affect dairy production traits. Moreover, remarkable variants were also found in candidate genes coding for major milk proteins and proteins related to milk fat metabolism. Several of the SNPs found in this study could be included as suitable markers in genotyping platforms or custom SNP arrays to perform association analyses in commercial populations and apply genomic selection protocols in the dairy production industry.
Fontanesi, Luca; Bertolini, Francesca; Scotti, Emilio; Schiavo, Giuseppina; Colombo, Michela; Trevisi, Paolo; Ribani, Anisa; Buttazzoni, Luca; Russo, Vincenzo; Dall'Olio, Stefania
2015-01-01
The GPR120 gene (also known as FFAR4 or O3FAR1) encodes for a functional omega-3 fatty acid receptor/sensor that mediates potent insulin sensitizing effects by repressing macrophage-induced tissue inflammation. For its functional role, GPR120 could be considered a potential target gene in animal nutrigenetics. In this work we resequenced the porcine GPR120 gene by high throughput Ion Torrent semiconductor sequencing of amplified fragments obtained from 8 DNA pools derived, on the whole, from 153 pigs of different breeds/populations (two Italian Large White pools, Italian Duroc, Italian Landrace, Casertana, Pietrain, Meishan, and wild boars). Three single nucleotide polymorphisms (SNPs), two synonymous substitutions and one in the putative 3'-untranslated region (g.114765469C > T), were identified and their allele frequencies were estimated by sequencing reads count. The g.114765469C > T SNP was also genotyped by PCR-RFLP confirming estimated frequency in Italian Large White pools. Then, this SNP was analyzed in two Italian Large White cohorts using a selective genotyping approach based on extreme and divergent pigs for back fat thickness (BFT) estimated breeding value (EBV) and average daily gain (ADG) EBV. Significant differences of allele and genotype frequencies distribution was observed between the extreme ADG-EBV groups (P < 0.001) whereas this marker was not associated with BFT-EBV.
Medaka: a promising model animal for comparative population genomics
Matsumoto, Yoshifumi; Oota, Hiroki; Asaoka, Yoichi; Nishina, Hiroshi; Watanabe, Koji; Bujnicki, Janusz M; Oda, Shoji; Kawamura, Shoji; Mitani, Hiroshi
2009-01-01
Background Within-species genome diversity has been best studied in humans. The international HapMap project has revealed a tremendous amount of single-nucleotide polymorphisms (SNPs) among humans, many of which show signals of positive selection during human evolution. In most of the cases, however, functional differences between the alleles remain experimentally unverified due to the inherent difficulty of human genetic studies. It would therefore be highly useful to have a vertebrate model with the following characteristics: (1) high within-species genetic diversity, (2) a variety of gene-manipulation protocols already developed, and (3) a completely sequenced genome. Medaka (Oryzias latipes) and its congeneric species, tiny fresh-water teleosts distributed broadly in East and Southeast Asia, meet these criteria. Findings Using Oryzias species from 27 local populations, we conducted a simple screening of nonsynonymous SNPs for 11 genes with apparent orthology between medaka and humans. We found medaka SNPs for which the same sites in human orthologs are known to be highly differentiated among the HapMap populations. Importantly, some of these SNPs show signals of positive selection. Conclusion These results indicate that medaka is a promising model system for comparative population genomics exploring the functional and adaptive significance of allelic differentiations. PMID:19426554
The impact of low-frequency and rare variants on lipid levels
Surakka, Ida; Horikoshi, Momoko; Mägi, Reedik; Sarin, Antti-Pekka; Mahajan, Anubha; Lagou, Vasiliki; Marullo, Letizia; Ferreira, Teresa; Miraglio, Benjamin; Timonen, Sanna; Kettunen, Johannes; Pirinen, Matti; Karjalainen, Juha; Thorleifsson, Gudmar; Hägg, Sara; Hottenga, Jouke-Jan; Isaacs, Aaron; Ladenvall, Claes; Beekman, Marian; Esko, Tõnu; Ried, Janina S; Nelson, Christopher P; Willenborg, Christina; Gustafsson, Stefan; Westra, Harm-Jan; Blades, Matthew; de Craen, Anton JM; de Geus, Eco J; Deelen, Joris; Grallert, Harald; Hamsten, Anders; Havulinna, Aki S.; Hengstenberg, Christian; Houwing-Duistermaat, Jeanine J; Hyppönen, Elina; Karssen, Lennart C; Lehtimäki, Terho; Lyssenko, Valeriya; Magnusson, Patrik KE; Mihailov, Evelin; Müller-Nurasyid, Martina; Mpindi, John-Patrick; Pedersen, Nancy L; Penninx, Brenda WJH; Perola, Markus; Pers, Tune H; Peters, Annette; Rung, Johan; Smit, Johannes H; Steinthorsdottir, Valgerdur; Tobin, Martin D; Tsernikova, Natalia; van Leeuwen, Elisabeth M; Viikari, Jorma S; Willems, Sara M; Willemsen, Gonneke; Schunkert, Heribert; Erdmann, Jeanette; Samani, Nilesh J; Kaprio, Jaakko; Lind, Lars; Gieger, Christian; Metspalu, Andres; Slagboom, P Eline; Groop, Leif; van Duijn, Cornelia M; Eriksson, Johan G; Jula, Antti; Salomaa, Veikko; Boomsma, Dorret I; Power, Christine; Raitakari, Olli T; Ingelsson, Erik; Järvelin, Marjo-Riitta; Stefansson, Kari; Franke, Lude; Ikonen, Elina; Kallioniemi, Olli; Pietiäinen, Vilja; Lindgren, Cecilia M; Thorsteinsdottir, Unnur; Palotie, Aarno; McCarthy, Mark I; Morris, Andrew P; Prokopenko, Inga; Ripatti, Samuli
2016-01-01
Using a genome-wide screen of 9.6 million genetic variants achieved through 1000 Genomes imputation in 62,166 samples, we identify association to lipids in 93 loci including 79 previously identified loci with new lead-SNPs, 10 new loci, 15 loci with a low-frequency and 10 loci with missense lead-SNPs, and, 2 loci with an accumulation of rare variants. In six loci, SNPs with established function in lipid genetics (CELSR2, GCKR, LIPC, and APOE), or candidate missense mutations with predicted damaging function (CD300LG and TM6SF2), explained the locus associations. The low-frequency variants increased the proportion of variance explained, particularly for LDL-C and TC. Altogether, our results highlight the impact of low-frequency variants in complex traits and show that imputation offers a cost-effective alternative to re-sequencing. PMID:25961943
Valenzuela-Miranda, Diego; Nuñez-Acuña, Gustavo; Valenzuela-Muñoz, Valentina; Asgari, Sassan; Gallardo-Escárate, Cristian
2015-01-25
Despite the increasing evidence of the importance of microRNAs (miRNAs) in the regulation of multiple biological processes, the molecular bases supporting this regulation are still barely understood in crustaceans. Therefore, the molecular characterization and transcriptome modulation of the miRNA biogenesis pathway were evaluated in the salmon louse Caligus rogercresseyi, an ectoparasite that constitutes one of the biggest concerns for salmonid aquaculture industry. Hence, RNA-Seq analysis was conducted from six different developmental stages, and also after bioassays with delousing drugs Deltamethrin and Azamethiphos using adult individuals. In silico analysis evidenced 24 putative genes involved in the miRNA pathway such as biogenesis, transport, maturation and miRNA-target interaction. Moreover, 243 putative single nucleotide polymorphisms (SNPs) were identified, 15 of which showed non-synonym mutations. RNA-Seq analysis revealed that CCR4-Not complex subunit 3 (CNOT3) was upregulated at earlier developmental stages (nauplius I-II and copepodid), and also after the exposure to Azamethiphos, but not to Deltamethrin. In contrast, the subunit 7 (CNOT7) showed an inverse expression pattern. Different Argonaute transcripts were associated to chalimus and adult stages, revealing specific expression patterns in response to antiparasitic drugs. Our results suggest novel insights into the regulatory network of the post-transcriptional gene regulation in C. rogercresseyi mediated by miRNAs, evidencing a putative role during the ontogeny and drug response. Copyright © 2014 Elsevier B.V. All rights reserved.
O'Donnell, Peter H.; Gamazon, Eric; Zhang, Wei; Stark, Amy L.; Kistner-Griffin, Emily O.; Huang, R. Stephanie; Dolan, M. Eileen
2010-01-01
Objectives Clinical studies show that Asians (ASN) are more susceptible to toxicities associated with platinum-containing regimens. We hypothesized that studying ASN as an `enriched phenotype' population could enable the discovery of novel genetic determinants of platinum susceptibility. Methods Using well-genotyped lymphoblastoid cell lines from the HapMap, we determined cisplatin and carboplatin cytotoxicity phenotypes (IC50s) for ASN, Caucasians (CEU), and Africans (YRI). IC50s were used in genome-wide association studies. Results ASN were most sensitive to platinums, corroborating clinical findings. ASN genome-wide association studies produced 479 single-nucleotide polymorphisms (SNPs) associating with cisplatin susceptibility and 199 with carboplatin susceptibility (P<10−4). Considering only the most significant variants (P< 9.99 × 10−6), backwards elimination was then used to identify reduced-model SNPs, which robustly described the drug phenotypes within ASN. These SNPs comprised highly descriptive genetic signatures of susceptibility, with 12 SNPs explaining more than 95% of the susceptibility phenotype variation for cisplatin, and eight SNPs approximately 75% for carboplatin. To determine the possible function of these variants in ASN, the SNPs were tested for association with differential expression of target genes. SNPs were highly associated with the expression of multiple target genes, and notably, the histone H3 family was implicated for both drugs, suggesting a platinum-class mechanism. Histone H3 has repeatedly been described as regulating the formation of platinum-DNA adducts, but this is the first evidence that specific genetic variants might mediate these interactions in a pharmacogenetic manner. Finally, to determine whether any ASN-identified SNPs might also be important in other human populations, we interrogated all 479/199 SNPs for association with platinum susceptibility in an independent combined CEU/YRI population. Three unique SNPs for cisplatin and 10 for carboplatin replicated in CEU/YRI. Conclusion Enriched `platinum susceptible' populations can be used to discover novel genetic determinants governing interindividual platinum chemotherapy susceptibility. PMID:20393316
Du, Zhong-Jun; Cui, Guan-Qun; Zhang, Juan; Liu, Xiao-Mei; Zhang, Zhi-Hu; Jia, Qiang; Ng, Jack C; Peng, Cheng; Bo, Cun-Xiang; Shao, Hua
2017-01-01
Gap junction intercellular communication (GJIC) between cardiomyocytes is essential for synchronous heart contraction and relies on connexin-containing channels. Connexin 43 (Cx43) is a major component involved in GJIC in heart tissue, and its abnormal expression is closely associated with various cardiac diseases. Silica nanoparticles (SNPs) are known to induce cardiovascular toxicity. However, the mechanisms through which GJIC plays a role in cardiomyocytes apoptosis induced by SNPs remain unknown. The aim of the present study is to determine whether SNPs-decreased GJIC promotes apoptosis in rat cardiomyocytes cell line (H9c2 cells) via the mitochondrial pathway using CCK-8 Kit, scrape-loading dye transfer technique, Annexin V/PI double-staining assays, and Western blot analysis. The results showed that SNPs elicited cytotoxicity in H9c2 cells in a time- and concentration-dependent manner. SNPs also reduced GJIC in H9c2 cells in a concentration-dependent manner through downregulation of Cx43 and upregulation of P-Cx43. Inhibition of gap junctions by gap junction blocker carbenoxolone disodium resulted in decreased survival and increased apoptosis, whereas enhancement of the gap junctions by retinoic acid led to enhanced survival but decreased apoptosis. Furthermore, SNPs-induced apoptosis through the disrupted functional gap junction was correlated with abnormal expressions of the proteins involved in the mitochondrial pathway-related apoptosis such as Bcl-2/Bax, cytochrome C, Caspase-9, and Caspase-3. Taken together, our results provide the first evidence that SNPs-decreased GJIC promotes apoptosis in cardiomyocytes via the mitochondrial pathway. In addition, downregulation of GJIC by SNPs in cardiomyocytes is mediated through downregulation of Cx43 and upregulation of P-Cx43. These results suggest that in rat cardiomyocytes cell line, GJIC plays a protective role in SNPs-induced apoptosis and that GJIC may be one of the targets for SNPs-induced biological effects.
Malenfant, René M; Coltman, David W; Davis, Corey S
2015-05-01
Single-nucleotide polymorphisms (SNPs) offer numerous advantages over anonymous markers such as microsatellites, including improved estimation of population parameters, finer-scale resolution of population structure and more precise genomic dissection of quantitative traits. However, many SNPs are needed to equal the resolution of a single microsatellite, and reliable large-scale genotyping of SNPs remains a challenge in nonmodel species. Here, we document the creation of a 9K Illumina Infinium BeadChip for polar bears (Ursus maritimus), which will be used to investigate: (i) the fine-scale population structure among Canadian polar bears and (ii) the genomic architecture of phenotypic traits in the Western Hudson Bay subpopulation. To this end, we used restriction-site associated DNA (RAD) sequencing from 38 bears across their circumpolar range, as well as blood/fat transcriptome sequencing of 10 individuals from Western Hudson Bay. Six-thousand RAD SNPs and 3000 transcriptomic SNPs were selected for the chip, based primarily on genomic spacing and gene function respectively. Of the 9000 SNPs ordered from Illumina, 8042 were successfully printed, and - after genotyping 1450 polar bears - 5441 of these SNPs were found to be well clustered and polymorphic. Using this array, we show rapid linkage disequilibrium decay among polar bears, we demonstrate that in a subsample of 78 individuals, our SNPs detect known genetic structure more clearly than 24 microsatellites genotyped for the same individuals and that these results are not driven by the SNP ascertainment scheme. Here, we present one of the first large-scale genotyping resources designed for a threatened species. © 2014 John Wiley & Sons Ltd.
Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira
2014-01-01
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection. PMID:25048025
Bertram, Lars; Lange, Christoph; Mullin, Kristina; Parkinson, Michele; Hsiao, Monica; Hogan, Meghan F; Schjeide, Brit M M; Hooli, Basavaraj; Divito, Jason; Ionita, Iuliana; Jiang, Hongyu; Laird, Nan; Moscarillo, Thomas; Ohlsen, Kari L; Elliott, Kathryn; Wang, Xin; Hu-Lince, Diane; Ryder, Marie; Murphy, Amy; Wagner, Steven L; Blacker, Deborah; Becker, K David; Tanzi, Rudolph E
2008-11-01
Alzheimer's disease (AD) is a genetically complex and heterogeneous disorder. To date four genes have been established to either cause early-onset autosomal-dominant AD (APP, PSEN1, and PSEN2(1-4)) or to increase susceptibility for late-onset AD (APOE5). However, the heritability of late-onset AD is as high as 80%, (6) and much of the phenotypic variance remains unexplained to date. We performed a genome-wide association (GWA) analysis using 484,522 single-nucleotide polymorphisms (SNPs) on a large (1,376 samples from 410 families) sample of AD families of self-reported European descent. We identified five SNPs showing either significant or marginally significant genome-wide association with a multivariate phenotype combining affection status and onset age. One of these signals (p = 5.7 x 10(-14)) was elicited by SNP rs4420638 and probably reflects APOE-epsilon4, which maps 11 kb proximal (r2 = 0.78). The other four signals were tested in three additional independent AD family samples composed of nearly 2700 individuals from almost 900 families. Two of these SNPs showed significant association in the replication samples (combined p values 0.007 and 0.00002). The SNP (rs11159647, on chromosome 14q31) with the strongest association signal also showed evidence of association with the same allele in GWA data generated in an independent sample of approximately 1,400 AD cases and controls (p = 0.04). Although the precise identity of the underlying locus(i) remains elusive, our study provides compelling evidence for the existence of at least one previously undescribed AD gene that, like APOE-epsilon4, primarily acts as a modifier of onset age.
Bertram, Lars; Lange, Christoph; Mullin, Kristina; Parkinson, Michele; Hsiao, Monica; Hogan, Meghan F.; Schjeide, Brit M.M.; Hooli, Basavaraj; DiVito, Jason; Ionita, Iuliana; Jiang, Hongyu; Laird, Nan; Moscarillo, Thomas; Ohlsen, Kari L.; Elliott, Kathryn; Wang, Xin; Hu-Lince, Diane; Ryder, Marie; Murphy, Amy; Wagner, Steven L.; Blacker, Deborah; Becker, K. David; Tanzi, Rudolph E.
2008-01-01
Alzheimer's disease (AD) is a genetically complex and heterogeneous disorder. To date four genes have been established to either cause early-onset autosomal-dominant AD (APP, PSEN1, and PSEN21–4) or to increase susceptibility for late-onset AD (APOE5). However, the heritability of late-onset AD is as high as 80%,6 and much of the phenotypic variance remains unexplained to date. We performed a genome-wide association (GWA) analysis using 484,522 single-nucleotide polymorphisms (SNPs) on a large (1,376 samples from 410 families) sample of AD families of self-reported European descent. We identified five SNPs showing either significant or marginally significant genome-wide association with a multivariate phenotype combining affection status and onset age. One of these signals (p = 5.7 × 10−14) was elicited by SNP rs4420638 and probably reflects APOE-ɛ4, which maps 11 kb proximal (r2 = 0.78). The other four signals were tested in three additional independent AD family samples composed of nearly 2700 individuals from almost 900 families. Two of these SNPs showed significant association in the replication samples (combined p values 0.007 and 0.00002). The SNP (rs11159647, on chromosome 14q31) with the strongest association signal also showed evidence of association with the same allele in GWA data generated in an independent sample of ∼1,400 AD cases and controls (p = 0.04). Although the precise identity of the underlying locus(i) remains elusive, our study provides compelling evidence for the existence of at least one previously undescribed AD gene that, like APOE-ɛ4, primarily acts as a modifier of onset age. PMID:18976728
Rapid Whole-Genome Sequencing for Investigation of a Neonatal MRSA Outbreak
Köser, Claudio U.; Holden, Matthew T.G.; Ellington, Matthew J.; Cartwright, Edward J.P.; Brown, Nicholas M.; Ogilvy-Stuart, Amanda L.; Hsu, Li Yang; Chewapreecha, Claire; Croucher, Nicholas J.; Harris, Simon R.; Sanders, Mandy; Enright, Mark C.; Dougan, Gordon; Bentley, Stephen D.; Parkhill, Julian; Fraser, Louise J.; Betley, Jason R.; Schulz-Trieglaff, Ole B.; Smith, Geoffrey P.; Peacock, Sharon J.
2013-01-01
Background Isolates of methicillin-resistant Staphylococcus aureus (MRSA) belonging to a single lineage are often indistinguishable by means of current typing techniques. Whole-genome sequencing may provide improved resolution to define transmission pathways and characterize outbreaks. Methods We investigated a putative MRSA outbreak in a neonatal intensive care unit. By using rapid high-throughput sequencing technology with a clinically relevant turnaround time, we retrospectively sequenced the DNA from seven isolates associated with the outbreak and another seven MRSA isolates associated with carriage of MRSA or bacteremia in the same hospital. Results We constructed a phylogenetic tree by comparing single-nucleotide polymorphisms (SNPs) in the core genome to a reference genome (an epidemic MRSA clone, EMRSA-15 [sequence type 22]). This revealed a distinct cluster of outbreak isolates and clear separation between these and the nonoutbreak isolates. A previously missed transmission event was detected between two patients with bacteremia who were not part of the outbreak. We created an artificial “resistome” of antibiotic-resistance genes and demonstrated concordance between it and the results of phenotypic susceptibility testing; we also created a “toxome” consisting of toxin genes. One outbreak isolate had a hypermutator phenotype with a higher number of SNPs than the other outbreak isolates, highlighting the difficulty of imposing a simple threshold for the number of SNPs between isolates to decide whether they are part of a recent transmission chain. Conclusions Whole-genome sequencing can provide clinically relevant data within a time frame that can influence patient care. The need for automated data interpretation and the provision of clinically meaningful reports represent hurdles to clinical implementation. (Funded by the U.K. Clinical Research Collaboration Translational Infection Research Initiative and others.) PMID:22693998
2014-01-01
Background Apparent Amylose Content (AAC), regulated by the Waxy gene, represents the key determinant of rice cooking properties. In occidental countries high AAC rice represents the most requested market class but the availability of molecular markers allowing specific selection of high AAC varieties is limited. Results In this study, the effectiveness of available molecular markers in predicting AAC was evaluated in a collection of 127 rice accessions (125 japonica ssp. and 2 indica ssp.) characterized by AAC values from glutinous to 26%. The analyses highlighted the presence of several different allelic patterns identifiable by a few molecular markers, and two of them, i.e., the SNPs at intron1 and exon 6, were able to explain a maximum of 79.5% of AAC variation. However, the available molecular markers haplotypes did not provide tools for predicting accessions with AAC higher than 24.5%. To identify additional polymorphisms, the re-sequencing of the Waxy gene and 1kbp of the putative upstream regulatory region was performed in 21 genotypes representing all the AAC classes identified. Several previously un-characterized SNPs were identified and four of them were used to develop dCAPS markers. Conclusions The addition of the SNPs newly identified slightly increased the AAC explained variation and allowed the identification of a haplotype almost unequivocally associated to AAC higher than 24.5%. Haplotypes at the waxy locus were also associated to grain length and length/width (L/W) ratio. In particular, the SNP at the first intron, which identifies the Wx a and Wx b alleles, was associated with differences in the width of the grain, the L/W ratio and the length of the kernel, most likely as a result of human selection. PMID:24383761
Inherited genetic variants associated with occurrence of multiple primary melanoma.
Gibbs, David C; Orlow, Irene; Kanetsky, Peter A; Luo, Li; Kricker, Anne; Armstrong, Bruce K; Anton-Culver, Hoda; Gruber, Stephen B; Marrett, Loraine D; Gallagher, Richard P; Zanetti, Roberto; Rosso, Stefano; Dwyer, Terence; Sharma, Ajay; La Pilla, Emily; From, Lynn; Busam, Klaus J; Cust, Anne E; Ollila, David W; Begg, Colin B; Berwick, Marianne; Thomas, Nancy E
2015-06-01
Recent studies, including genome-wide association studies, have identified several putative low-penetrance susceptibility loci for melanoma. We sought to determine their generalizability to genetic predisposition for multiple primary melanoma in the international population-based Genes, Environment, and Melanoma (GEM) Study. GEM is a case-control study of 1,206 incident cases of multiple primary melanoma and 2,469 incident first primary melanoma participants as the control group. We investigated the odds of developing multiple primary melanoma for 47 SNPs from 21 distinct genetic regions previously reported to be associated with melanoma. ORs and 95% confidence intervals were determined using logistic regression models adjusted for baseline features (age, sex, age by sex interaction, and study center). We investigated univariable models and built multivariable models to assess independent effects of SNPs. Eleven SNPs in 6 gene neighborhoods (TERT/CLPTM1L, TYRP1, MTAP, TYR, NCOA6, and MX2) and a PARP1 haplotype were associated with multiple primary melanoma. In a multivariable model that included only the most statistically significant findings from univariable modeling and adjusted for pigmentary phenotype, back nevi, and baseline features, we found TERT/CLPTM1L rs401681 (P = 0.004), TYRP1 rs2733832 (P = 0.006), MTAP rs1335510 (P = 0.0005), TYR rs10830253 (P = 0.003), and MX2 rs45430 (P = 0.008) to be significantly associated with multiple primary melanoma, while NCOA6 rs4911442 approached significance (P = 0.06). The GEM Study provides additional evidence for the relevance of these genetic regions to melanoma risk and estimates the magnitude of the observed genetic effect on development of subsequent primary melanoma. ©2015 American Association for Cancer Research.
Jasinska, A.J.; Service, S.; Jawaheer, D.; DeYoung, J.; Levinson, M.; Zhang, Z.; Kremeyer, B.; Muller, H.; Aldana, I.; Garcia, J.; Restrepo, G.; Lopez, C.; Palacio, C.; Duque, C.; Parra, M.; Vega, J.; Ortiz, D.; Bedoya, G.; Mathews, C.; Davanzo, P.; Fournier, E.; Bejarano, J.; Ramirez, M.; Ortiz, C. Araya; Araya, X.; Molina, J.; Sabatti, C.; Reus, V.; Ospina, J.; Macaya, G.; Ruiz-Linares, A.; Freimer, N.B.
2016-01-01
We previously reported linkage of bipolar disorder to 5q33-q34 in families from two closely related population isolates, the Central Valley of Costa Rica (CVCR) and Antioquia, Colombia (CO). Here we present follow up results from fine-scale mapping in large CVCR and CO families segregating severe bipolar disorder, BP-I, and in 343 population trios/duos from CVCR and CO. Employing densely spaced SNPs to fine map the prior linkage peak region increases linkage evidence and clarifies the position of the putative BP-I locus. We performed two-point linkage analysis with 1134 SNPs in an approximately 9 Mb region between markers D5S410 and D5S422. Combining pedigrees from CVCR and CO yields a LOD score of 4.9 at SNP rs10035961. Two other SNPs (rs7721142 and rs1422795) within the same 94 kb region also displayed LOD scores greater than 4. This linkage peak coincides with our prior microsatellite results and suggests a narrowed BP-I susceptibility regions in these families. To investigate if the locus implicated in the familial form of BP-I also contributes to disease risk in the population, we followed up the family results with association analysis in duo and trio samples, obtaining signals within 2 Mb of the peak linkage signal in the pedigrees; rs12523547 and rs267015 (P = 0.00004 and 0.00016, respectively) in the CO sample and rs244960 in the CVCR sample and the combined sample, with P = 0.00032 and 0.00016, respectively. It remains unclear whether these association results reflect the same locus contributing to BP susceptibility within the extended pedigrees. PMID:19319892
Dhanapal, Arun Prabhu; Ray, Jeffery D; Singh, Shardendu K; Hoyos-Villegas, Valerio; Smith, James R; Purcell, Larry C; Fritschi, Felix B
2016-08-04
Chlorophyll is a major component of chloroplasts and a better understanding of the genetic basis of chlorophyll in soybean [Glycine max (L.) Merr.] might contribute to improving photosynthetic capacity and yield in regions with adverse environmental conditions. A collection of 332 diverse soybean genotypes were grown in 2 years (2009 and 2010) and chlorophyll a (eChl_A), chlorophyll b (eChl_B), and total chlorophyll (eChl_T) content as well as chlorophyll a/b ratio (eChl_R) in leaf tissues were determined by extraction and spectrometric determination. Total chlorophyll was also derived from canopy spectral reflectance measurements using a model of wavelet transformed spectra (tChl_T) as well as with a spectral reflectance index (iChl_T). A genome-wide associating mapping approach was employed using 31,253 single nucleotide polymorphisms (SNPs) to identify loci associated with the extract based eChl_A, eChl_B, eChl_R and eChl_T measurements and the two canopy spectral reflectance-based methods (tChl_T and iChl_T). A total of 23 (14 loci), 15 (7 loci) and 14 SNPs (10 loci) showed significant association with eChl_A, eChl_B and eChl_R respectively. A total of 52 unique SNPs were significantly associated with total chlorophyll content based on at least one of the three approaches (eChl_T, tChl_T and iChl_T) and likely tagged 27 putative loci for total chlorophyll content, four of which were indicated by all three approaches. Results presented here show that markers for chlorophyll traits can be identified in soybean using both extract-based and canopy spectral reflectance-based phenotypes, and confirm that high-throughput phenotyping-amenable canopy spectral reflectance measurements can be used for association mapping.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Catfish Genome Consortium; Wang, Shaolin; Peatman, Eric
2010-03-23
Background-Through the Community Sequencing Program, a catfish EST sequencing project was carried out through a collaboration between the catfish research community and the Department of Energy's Joint Genome Institute. Prior to this project, only a limited EST resource from catfish was available for the purpose of SNP identification. Results-A total of 438,321 quality ESTs were generated from 8 channel catfish (Ictalurus punctatus) and 4 blue catfish (Ictalurus furcatus) libraries, bringing the number of catfish ESTs to nearly 500,000. Assembly of all catfish ESTs resulted in 45,306 contigs and 66,272 singletons. Over 35percent of the unique sequences had significant similarities tomore » known genes, allowing the identification of 14,776 unique genes in catfish. Over 300,000 putative SNPs have been identified, of which approximately 48,000 are high-quality SNPs identified from contigs with at least four sequences and the minor allele presence of at least two sequences in the contig. The EST resource should be valuable for identification of microsatellites, genome annotation, large-scale expression analysis, and comparative genome analysis. Conclusions-This project generated a large EST resource for catfish that captured the majority of the catfish transcriptome. The parallel analysis of ESTs from two closely related Ictalurid catfishes should also provide powerful means for the evaluation of ancient and recent gene duplications, and for the development of high-density microarrays in catfish. The inter- and intra-specific SNPs identified from all catfish EST dataset assembly will greatly benefit the catfish introgression breeding program and whole genome association studies.« less
Kitpipit, Thitika; Tobe, Shanan S; Kitchener, Andrew C; Gill, Peter; Linacre, Adrian
2012-03-01
The tiger (Panthera tigris) is currently listed on Appendix I of the Convention on the International Trade in Endangered Species of Wild Fauna and Flora; this affords it the highest level of international protection. To aid in the investigation of alleged illegal trade in tiger body parts and derivatives, molecular approaches have been developed to identify biological material as being of tiger in origin. Some countries also require knowledge of the exact tiger subspecies present in order to prosecute anyone alleged to be trading in tiger products. In this study we aimed to develop and validate a reliable single assay to identify tiger species and subspecies simultaneously; this test is based on identification of single nucleotide polymorphisms (SNPs) within the tiger mitochondrial genome. The mitochondrial DNA sequence from four of the five extant putative tiger subspecies that currently exist in the wild were obtained and combined with DNA sequence data from 492 tiger and 349 other mammalian species available on GenBank. From the sequence data a total of 11 SNP loci were identified as suitable for further analyses. Five SNPs were species-specific for tiger and six amplify one of the tiger subspecies-specific SNPs, three of which were specific to P. t. sumatrae and the other three were specific to P. t. tigris. The multiplex assay was able to reliably identify 15 voucher tiger samples. The sensitivity of the test was 15,000 mitochondrial DNA copies (approximately 0.26 pg), indicating that it will work on trace amounts of tissue, bone or hair samples. This simple test will add to the DNA-based methods currently being used to identify the presence of tiger within mixed samples. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Chiari Malformation Type I: A Case-Control Association Study of 58 Developmental Genes
Urbizu, Aintzane; Toma, Claudio; Poca, Maria A.; Sahuquillo, Juan; Cuenca-León, Ester; Cormand, Bru; Macaya, Alfons
2013-01-01
Chiari malformation type I (CMI) is a disorder characterized by hindbrain overcrowding into an underdeveloped posterior cranial fossa (PCF), often causing progressive neurological symptoms. The etiology of CMI remains unclear and is most likely multifactorial. A putative genetic contribution to CMI is suggested by familial aggregation and twin studies. Experimental models and human morphometric studies have suggested an underlying paraxial mesoderm insufficiency. We performed a case-control association study of 303 tag single nucleotide polymorphisms (SNP) across 58 candidate genes involved in early paraxial mesoderm development in a sample of 415 CMI patients and 524 sex-matched controls. A subgroup of patients diagnosed with classical, small-PCF CMI by means of MRI-based PCF morphometry (n = 186), underwent additional analysis. The genes selected are involved in signalling gradients occurring during segmental patterning of the occipital somites (FGF8, Wnt, and retinoic acid pathways and from bone morphogenetic proteins or BMP, Notch, Cdx and Hox pathways) or in placental angiogenesis, sclerotome development or CMI-associated syndromes. Single-marker analysis identified nominal associations with 18 SNPs in 14 genes (CDX1, FLT1, RARG, NKD2, MSGN1, RBPJ1, FGFR1, RDH10, NOG, RARA, LFNG, KDR, ALDH1A2, BMPR1A) considering the whole CMI sample. None of these overcame corrections for multiple comparisons, in contrast with four SNPs in CDX1, FLT1 and ALDH1A2 in the classical CMI group. Multiple marker analysis identified a risk haplotype for classical CMI in ALDH1A2 and CDX1. Furthermore, we analyzed the possible contributions of the most significantly associated SNPs to different PCF morphometric traits. These findings suggest that common variants in genes involved in somitogenesis and fetal vascular development may confer susceptibility to CMI. PMID:23437350
Mutations that Cause Human Disease: A Computational/Experimental Approach
DOE Office of Scientific and Technical Information (OSTI.GOV)
Beernink, P; Barsky, D; Pesavento, B
International genome sequencing projects have produced billions of nucleotides (letters) of DNA sequence data, including the complete genome sequences of 74 organisms. These genome sequences have created many new scientific opportunities, including the ability to identify sequence variations among individuals within a species. These genetic differences, which are known as single nucleotide polymorphisms (SNPs), are particularly important in understanding the genetic basis for disease susceptibility. Since the report of the complete human genome sequence, over two million human SNPs have been identified, including a large-scale comparison of an entire chromosome from twenty individuals. Of the protein coding SNPs (cSNPs), approximatelymore » half leads to a single amino acid change in the encoded protein (non-synonymous coding SNPs). Most of these changes are functionally silent, while the remainder negatively impact the protein and sometimes cause human disease. To date, over 550 SNPs have been found to cause single locus (monogenic) diseases and many others have been associated with polygenic diseases. SNPs have been linked to specific human diseases, including late-onset Parkinson disease, autism, rheumatoid arthritis and cancer. The ability to predict accurately the effects of these SNPs on protein function would represent a major advance toward understanding these diseases. To date several attempts have been made toward predicting the effects of such mutations. The most successful of these is a computational approach called ''Sorting Intolerant From Tolerant'' (SIFT). This method uses sequence conservation among many similar proteins to predict which residues in a protein are functionally important. However, this method suffers from several limitations. First, a query sequence must have a sufficient number of relatives to infer sequence conservation. Second, this method does not make use of or provide any information on protein structure, which can be used to understand how an amino acid change affects the protein. The experimental methods that provide the most detailed structural information on proteins are X-ray crystallography and NMR spectroscopy. However, these methods are labor intensive and currently cannot be carried out on a genomic scale. Nonetheless, Structural Genomics projects are being pursued by more than a dozen groups and consortia worldwide and as a result the number of experimentally determined structures is rising exponentially. Based on the expectation that protein structures will continue to be determined at an ever-increasing rate, reliable structure prediction schemes will become increasingly valuable, leading to information on protein function and disease for many different proteins. Given known genetic variability and experimentally determined protein structures, can we accurately predict the effects of single amino acid substitutions? An objective assessment of this question would involve comparing predicted and experimentally determined structures, which thus far has not been rigorously performed. The completed research leveraged existing expertise at LLNL in computational and structural biology, as well as significant computing resources, to address this question.« less
Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma.
Law, Matthew H; Bishop, D Timothy; Lee, Jeffrey E; Brossard, Myriam; Martin, Nicholas G; Moses, Eric K; Song, Fengju; Barrett, Jennifer H; Kumar, Rajiv; Easton, Douglas F; Pharoah, Paul D P; Swerdlow, Anthony J; Kypreou, Katerina P; Taylor, John C; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A; Andresen, Per A; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M; Dębniak, Tadeusz; Duffy, David L; Elder, David E; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M; Goldstein, Alisa M; Gruis, Nelleke A; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A; Chen, Wei V; Landi, Maria Teresa; Lang, Julie; Lathrop, G Mark; Lubiński, Jan; Mackie, Rona M; Mann, Graham J; Molven, Anders; Montgomery, Grant W; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A; Radford-Smith, Graham L; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C; Craig, Jamie E; Schadendorf, Dirk; Simms, Lisa A; Burdon, Kathryn P; Nyholt, Dale R; Pooley, Karen A; Orr, Nick; Stratigos, Alexander J; Cust, Anne E; Ward, Sarah V; Hayward, Nicholas K; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M; Bishop, Julia A Newton; Demenais, Florence; Amos, Christopher I; MacGregor, Stuart; Iles, Mark M
2015-09-01
Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10(-8)), as did 2 previously reported but unreplicated loci and all 13 established loci. Newly associated SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes in the associated regions, including one involved in telomere biology.
Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma
Law, Matthew H.; Bishop, D. Timothy; Martin, Nicholas G.; Moses, Eric K.; Song, Fengju; Barrett, Jennifer H.; Kumar, Rajiv; Easton, Douglas F.; Pharoah, Paul D. P.; Swerdlow, Anthony J.; Kypreou, Katerina P.; Taylor, John C.; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A.; Andresen, Per A.; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M.; Dębniak, Tadeusz; Duffy, David L.; Elder, David E.; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M.; Goldstein, Alisa M.; Gruis, Nelleke A.; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A.; Chen, Wei V.; Landi, Maria Teresa; Lang, Julie; Lathrop, G. Mark; Lubiński, Jan; Mackie, Rona M.; Mann, Graham J.; Molven, Anders; Montgomery, Grant W.; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A.; Radford-Smith, Graham L.; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C.; Craig, Jamie E.; Schadendorf, Dirk; Simms, Lisa A.; Burdon, Kathryn P.; Nyholt, Dale R.; Pooley, Karen A.; Orr, Nick; Stratigos, Alexander J.; Cust, Anne E.; Ward, Sarah V.; Hayward, Nicholas K.; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M.; Bishop, Julia A. Newton; MacGregor, Stuart; Iles, Mark M.
2015-01-01
Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5×10–8), as did two previously-reported but un-replicated loci and all thirteen established loci. Novel SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes including one involved in telomere biology. PMID:26237428
Qiu, Ying-Hua; Deng, Fei-Yan; Tang, Zai-Xiang; Jiang, Zhen-Huan; Lei, Shu-Feng
2015-10-01
Type 1 diabetes mellitus (type 1 DM) is an autoimmune disease. Although genome-wide association studies (GWAS) and meta-analyses have successfully identified numerous type 1 DM-associated susceptibility loci, the underlying mechanisms for these susceptibility loci are currently largely unclear. Based on publicly available datasets, we performed integrative analyses (i.e., integrated gene relationships among implicated loci, differential gene expression analysis, functional prediction and functional annotation clustering analysis) and combined with expression quantitative trait loci (eQTL) results to further explore function mechanisms underlying the associations between genetic variants and type 1 DM. Among a total of 183 type 1 DM-associated SNPs, eQTL analysis showed that 17 SNPs with cis-regulated eQTL effects on 9 genes. All the 9 eQTL genes enrich in immune-related pathways or Gene Ontology (GO) terms. Functional prediction analysis identified 5 SNPs located in transcription factor (TF) binding sites. Of the 9 eQTL genes, 6 (TAP2, HLA-DOB, HLA-DQB1, HLA-DQA1, HLA-DRB5 and CTSH) were differentially expressed in type 1 DM-associated related cells. Especially, rs3825932 in CTSH has integrative functional evidence supporting the association with type 1 DM. These findings indicated that integrative analyses can yield important functional information to link genetic variants and type 1 DM. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Camilleri, Michael; Carlson, Paula; Zinsmeister, Alan R.; McKinzie, Sanna; Busciglio, Irene; Burton, Duane; Zucchelli, Marco; D’Amato, Mauro
2009-01-01
Background & Aims NPSR1, the receptor for neuropeptide S (NPS), is expressed by gastrointestinal (GI) enteroendocrine (EE) cells, and is involved in inflammation, anxiety and nociception. NPSR1 polymorphisms are associated with asthma and inflammatory bowel disease. We aimed to determine whether NPS induces expression of GI neuropeptides; and to associate NPSR1 single nucleotide polymorphisms (SNPs) with symptom phenotype and GI functions in health and functional GI disorders (FGID). Methods The effect of NPS on mRNA expression of neuropeptides was assessed using real-time PCR in NPSR1-tranfected HEK293 cells. Seventeen NPSR1 SNPs were successfully genotyped in 699 subjects from a regional cohort of 466 FGID patients and 233 healthy controls. Associations were sought using sex-adjusted regression analysis and false discovery rate (FDR) correction. Results NPS-NPSR1 signaling induced increased expression of CCK, VIP, PYY, and somatostatin. There were no significant associations with phenotypes of FGID symptoms. There were several NPSR1 SNPs associated with individual motor or sensory functions; the associations of SNPs rs2609234, rs6972158 and rs1379928 with colonic transit rate remained significant after FDR correction. The rs1379928 polymorphism was also associated with pain, gas and urgency sensory ratings at 36 mm Hg distension, the level pre-specified for formal testing. Associations with rectal sensory ratings were not significant after FDR correction. Conclusions Expression of several neuropeptides is induced upon NPS-NPSR1 signaling; NPSR1 variants are associated with colonic transit in FGID. The role of the NPS system in FGID deserves further study. PMID:19732772
Marcolino, Antonio C S; Porto, William F; Pires, Állan S; Franco, Octavio L; Alencar, Sérgio A
2016-12-07
The guanylate cyclase activator 2B, also known as uroguanylin, is part of the guanylin peptide family, which includes peptides such as guanylin and lymphoguanylin. The guanylin peptides could be related to sodium absorption inhibition and water secretion induction and their dysfunction may be related to various pathologies such as chronic renal failure, congestive heart failure and nephrotic syndrome. Besides, uroguanylin point mutations have been associated with essential hypertension. However, currently there are no studies on the impact of missense SNPs on uroguanylin structure. This study applied in silico SNP impact prediction tools to evaluate the impact of uroguanylin missense SNPs and to filter those considered as convergent deleterious, which were then further analyzed through long-term molecular dynamics simulations of 1μs of duration. The simulations suggested that all missense SNPs considered as convergent deleterious caused some kind of structural change to the uroguanylin peptide. Additionally, four of these SNPs were also shown to cause modifications in peptide flexibility, possibly resulting in functional changes. Copyright © 2016 Elsevier Ltd. All rights reserved.
Deciphering the distance to antibiotic resistance for the pneumococcus using genome sequencing data
Mobegi, Fredrick M.; Cremers, Amelieke J. H.; de Jonge, Marien I.; Bentley, Stephen D.; van Hijum, Sacha A. F. T.; Zomer, Aldert
2017-01-01
Advances in genome sequencing technologies and genome-wide association studies (GWAS) have provided unprecedented insights into the molecular basis of microbial phenotypes and enabled the identification of the underlying genetic variants in real populations. However, utilization of genome sequencing in clinical phenotyping of bacteria is challenging due to the lack of reliable and accurate approaches. Here, we report a method for predicting microbial resistance patterns using genome sequencing data. We analyzed whole genome sequences of 1,680 Streptococcus pneumoniae isolates from four independent populations using GWAS and identified probable hotspots of genetic variation which correlate with phenotypes of resistance to essential classes of antibiotics. With the premise that accumulation of putative resistance-conferring SNPs, potentially in combination with specific resistance genes, precedes full resistance, we retrogressively surveyed the hotspot loci and quantified the number of SNPs and/or genes, which if accumulated would confer full resistance to an otherwise susceptible strain. We name this approach the ‘distance to resistance’. It can be used to identify the creep towards complete antibiotics resistance in bacteria using genome sequencing. This approach serves as a basis for the development of future sequencing-based methods for predicting resistance profiles of bacterial strains in hospital microbiology and public health settings. PMID:28205635
Garcia-Etxebarria, Koldo; Jauregi-Miguel, Amaia; Romero-Garmendia, Irati; Plaza-Izurieta, Leticia; Legarda, Maria; Irastorza, Iñaki; Bilbao, Jose Ramon
2016-12-01
To identify candidate genes in celiac disease (CD), we reanalyzed the whole Immunochip CD cohort using a different approach that clusters individuals based on immunoancestry prior to disease association analysis, rather than by geographical origin. We detected 636 new associated SNPs (P<7.02 × 10 -07 ) and identified 5 novel genomic regions, extended 8 others previously identified and also detected 18 isolated signals defined by one or very few significant SNPs. To test whether we could identify putative candidate genes, we performed expression analyses of several genes from the top novel region (chr2:134533564-136169524), from a previously identified locus that is now extended, and a gene marked by an isolated SNP, in duodenum biopsies of active and treated CD patients, and non-celiac controls. In the largest novel region, CCNT2 and R3HDM1 were constitutively underexpressed in disease, even after gluten removal. Moreover, several genes within this region were coexpressed in patients, but not in controls. Other novel genes like KIF21B, REL and SORD also showed altered expression in active disease. Apart from the identification of novel CD loci, these results suggest that ancestry-based stratified analysis is an efficient strategy for association studies in complex diseases.
NASA Astrophysics Data System (ADS)
Li, Jiqin; Bao, Zhenmin; Li, Ling; Wang, Xiaojian; Wang, Shi; Hu, Xiaoli
2013-09-01
Zhikong scallop ( Chlamys farreri) is an important maricultured species in China. Many researches on this species, such as population genetics and QTL fine-mapping, need a large number of molecular markers. In this study, based on the expressed sequence tags (EST), a total of 300 putative single nucleotide polymorphisms (SNPs) were selected and validated using high resolution melting (HRM) technology with unlabeled probe. Of them, 101 (33.7%) were found to be polymorphic in 48 individuals from 4 populations. Further evaluation with 48 individuals from Qingdao population showed that all the polymorphic loci had two alleles with the minor allele frequency ranged from 0.046 to 0.500. The observed and expected heterozygosities ranged from 0.000 to 0.925 and from 0.089 to 0.505, respectively. Fifteen loci deviated significantly from Hardy-Weinberg equilibrium and significant linkage disequilibrate was detected in one pair of markers. BLASTx gave significant hits for 72 of the 101 polymorphic SNP-containing ESTs. Thirty four polymorphic SNP loci were predicted to be non-synonymous substitutions as they caused either the change of codons (33 SNPs) or pretermination of translation (1 SNP). The markers developed can be used for the population studies and genetic improvement on Zhikong scallop.
CYP19A1 fine-mapping and Mendelian randomization: estradiol is causal for endometrial cancer
Thompson, Deborah J; O'Mara, Tracy A; Glubb, Dylan M; Painter, Jodie N; Cheng, Timothy; Folkerd, Elizabeth; Doody, Deborah; Dennis, Joe; Webb, Penelope M; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley; Michailidou, Kyriaki; Tyrer, Jonathan P; Maranian, Mel J; Hall, Per; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Fasching, Peter A; Hein, Alexander; Beckmann, Matthias W; Ekici, Arif B; Dörk, Thilo; Hillemanns, Peter; Dürst, Matthias; Runnebaum, Ingo; Zhao, Hui; Depreeuw, Jeroen; Schrauwen, Stefanie; Amant, Frederic; Goode, Ellen L; Fridley, Brooke L; Dowdy, Sean C; Winham, Stacey J; Salvesen, Helga B; Trovik, Jone; Njolstad, Tormund S; Werner, Henrica M J; Ashton, Katie; Proietto, Tony; Otton, Geoffrey; Carvajal-Carmona, Luis; Tham, Emma; Liu, Tao; Mints, Miriam; Scott, Rodney J; McEvoy, Mark; Attia, John; Holliday, Elizabeth G; Montgomery, Grant W; Martin, Nicholas G; Nyholt, Dale R; Henders, Anjali K; Hopper, John L; Traficante, Nadia; Ruebner, Matthias; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Lambrechts, Diether; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Bolla, Manjeet K; Wang, Qin; Bojesen, Stig E; Shah, Mitul; Luben, Robert; Khaw, Kay-Tee; Pharoah, Paul D P; Dunning, Alison M; Tomlinson, Ian; Dowsett, Mitch; Easton, Douglas F; Spurdle, Amanda B
2016-01-01
Candidate gene studies have reported CYP19A1 variants to be associated with endometrial cancer and with estradiol (E2) concentrations. We analyzed 2937 single nucleotide polymorphisms (SNPs) in 6608 endometrial cancer cases and 37 925 controls and report the first genome wide-significant association between endometrial cancer and a CYP19A1 SNP (rs727479 in intron 2, P=4.8×10−11). SNP rs727479 was also among those most strongly associated with circulating E2 concentrations in 2767 post-menopausal controls (P=7.4×10−8). The observed endometrial cancer odds ratio per rs727479 A-allele (1.15, CI=1.11–1.21) is compatible with that predicted by the observed effect on E2 concentrations (1.09, CI=1.03–1.21), consistent with the hypothesis that endometrial cancer risk is driven by E2. From 28 candidate-causal SNPs, 12 co-located with three putative gene-regulatory elements and their risk alleles associated with higher CYP19A1 expression in bioinformatical analyses. For both phenotypes, the associations with rs727479 were stronger among women with a higher BMI (Pinteraction=0.034 and 0.066 respectively), suggesting a biologically plausible gene-environment interaction. PMID:26574572
Genetic Variants of BMP2 and Their Association with the Risk of Non-Syndromic Tooth Agenesis.
Lu, Yun; Qian, Yajing; Zhang, Jinglu; Gong, Miao; Wang, Yuting; Gu, Ning; Ma, Lan; Xu, Min; Ma, Junqing; Zhang, Weibing; Pan, Yongchu; Wang, Lin
2016-01-01
Non-syndromic tooth agenesis (or non-syndromic congenitally missing tooth) is one of the most common congenital defects in humans affecting the craniofacial function and appearance. Single nucleotide polymorphisms (SNPs) have been associated with an individual's susceptibility to these anomalies. The aim of the present study was therefore to investigate the roles of the potentially functional SNPs of BMP2 in the occurrence of tooth agenesis. Overall, four potentially functional SNPs of BMP2 (rs15705, rs235768, rs235769 and rs3178250) were selected, and their associations with the susceptibility of tooth agenesis were evaluated in a case-control study of 335 non-syndromic tooth agenesis cases and 444 healthy controls. The SNPs rs15705 and rs3178250 were found to be associated with an individual's risk of tooth agenesis (P = 0.046 and P = 0.039, respectively). Both SNPs showed an increased risk of mandibular incisor agenesis (rs15705, AA/AC vs. CC = 1.58, 95% CI = [1.06-2.34], P = 0.024; rs3178250, TT/TC vs. CC = 1.60, 95% CI = [1.08-2.37], P = 0.020). Bioinformatics analysis indicated that these two SNPs located at the 3'-untranslated region (3'-UTR) of BMP2 might alter the binding ability of miR-1273d and miR-4639-5p, respectively, which was confirmed by luciferase activity assays in the 293A and COS7 cell lines (P < 0.001 in 293A and P < 0.01 in COS7 for miR-1273d; and P < 0.001 in both cells for miR-4639-5p). Furthermore, BMP2 mRNA expression decreased after transfecting either miR-1273d or miR-4639-5p into these two cell lines (P < 0.01 in 293A and P < 0.001 in COS7 for miR-1273d, and P < 0.01 in both cell lines for miR-4639-5p). Taken together, our findings indicate that rs15705 and rs317250 are associated with the susceptibility of non-syndromic tooth agenesis by possibly affecting miRNAs and mRNA interaction.
Zhong, Xiaoxiao; Li, Qi; Yu, Hong; Kong, Lingfeng
2014-01-01
Oysters, with high levels of phenotypic plasticity and wide geographic distribution, are a challenging group for taxonomists and phylogenetics. Our study is intended to generate new EST-SNP markers and to evaluate their potential for cross-species utilization in phylogenetic study of the genus Crassostrea. In the study, 57 novel SNPs were developed from an EST database of C. gigas by the HRM (high-resolution melting) method. Transferability of 377 SNPs developed for C. gigas was examined on four other Crassostrea species: C. sikamea, C. angulata, C. hongkongensis and C. ariakensis. Among the 377 primer pairs tested, 311 (82.5%) primers showed amplification in C. sikamea, 353 (93.6%) in C. angulata, 254 (67.4%) in C. hongkongensis and 253 (67.1%) in C. ariakensis. A total of 214 SNPs were found to be transferable to all four species. Phylogenetic analyses showed that C. hongkongensis was a sister species of C. ariakensis and that this clade was sister to the clade containing C. sikamea, C. angulata and C. gigas. Within this clade, C. gigas and C. angulata had the closest relationship, with C. sikamea being the sister group. In addition, we detected eight SNPs as potentially being under selection by two outlier tests (fdist and hierarchical methods). The SNPs studied here should be useful for genetic diversity, comparative mapping and phylogenetic studies across species in Crassostrea and the candidate outlier SNPs are worth exploring in more detail regarding association genetics and functional studies.
Genomic variation at the tips of the adaptive radiation of Darwin's finches.
Chaves, Jaime A; Cooper, Elizabeth A; Hendry, Andrew P; Podos, Jeffrey; De León, Luis F; Raeymaekers, Joost A M; MacMillan, W Owen; Uy, J Albert C
2016-11-01
Adaptive radiation unfolds as selection acts on the genetic variation underlying functional traits. The nature of this variation can be revealed by studying the tips of an ongoing adaptive radiation. We studied genomic variation at the tips of the Darwin's finch radiation; specifically focusing on polymorphism within, and variation among, three sympatric species of the genus Geospiza. Using restriction site-associated DNA (RAD-seq), we characterized 32 569 single-nucleotide polymorphisms (SNPs), from which 11 outlier SNPs for beak and body size were uncovered by a genomewide association study (GWAS). Principal component analysis revealed that these 11 SNPs formed four statistically linked groups. Stepwise regression then revealed that the first PC score, which included 6 of the 11 top SNPs, explained over 80% of the variation in beak size, suggesting that selection on these traits influences multiple correlated loci. The two SNPs most strongly associated with beak size were near genes associated with beak morphology across deeper branches of the radiation: delta-like 1 homologue (DLK1) and high-mobility group AT-hook 2 (HMGA2). Our results suggest that (i) key adaptive traits are associated with a small fraction of the genome (11 of 32 569 SNPs), (ii) SNPs linked to the candidate genes are dispersed throughout the genome (on several chromosomes), and (iii) micro- and macro-evolutionary variation (roots and tips of the radiation) involve some shared and some unique genomic regions. © 2016 John Wiley & Sons Ltd.
Do, Duy N.; Strathe, Anders B.; Ostersen, Tage; Pant, Sameer D.; Kadarmideen, Haja N.
2014-01-01
Residual feed intake (RFI) is a complex trait that is economically important for livestock production; however, the genetic and biological mechanisms regulating RFI are largely unknown in pigs. Therefore, the study aimed to identify single nucleotide polymorphisms (SNPs), candidate genes and biological pathways involved in regulating RFI using Genome-wide association (GWA) and pathway analyses. A total of 596 Yorkshire boars with phenotypes for two different measures of RFI (RFI1 and 2) and 60k genotypic data was used. GWA analysis was performed using a univariate mixed model and 12 and 7 SNPs were found to be significantly associated with RFI1 and RFI2, respectively. Several genes such as xin actin-binding repeat-containing protein 2 (XIRP2),tetratricopeptide repeat domain 29 (TTC29),suppressor of glucose, autophagy associated 1 (SOGA1),MAS1,G-protein-coupled receptor (GPCR) kinase 5 (GRK5),prospero-homeobox protein 1 (PROX1),GPCR 155 (GPR155), and FYVE domain containing the 26 (ZFYVE26) were identified as putative candidates for RFI based on their genomic location in the vicinity of these SNPs. Genes located within 50 kbp of SNPs significantly associated with RFI and RFI2 (q-value ≤ 0.2) were subsequently used for pathway analyses. These analyses were performed by assigning genes to biological pathways and then testing the association of individual pathways with RFI using a Fisher’s exact test. Metabolic pathway was significantly associated with both RFIs. Other biological pathways regulating phagosome, tight junctions, olfactory transduction, and insulin secretion were significantly associated with both RFI traits when relaxed threshold for cut-off p-value was used (p ≤ 0.05). These results implied porcine RFI is regulated by multiple biological mechanisms, although the metabolic processes might be the most important. Olfactory transduction pathway controlling the perception of feed via smell, insulin pathway controlling food intake might be important pathways for RFI. Furthermore, our study revealed key genes and genetic variants that control feed efficiency that could potentially be useful for genetic selection of more feed efficient pigs. PMID:25250046
High-density genetic map of Miscanthus sinensis reveals inheritance of zebra stripe
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Siyao; Clark, Lindsay V.; Swaminathan, Kankshita
Miscanthus is a perennial C4 grass that has recently become an important bioenergy crop. The efficiency of breeding improved Miscanthus biomass cultivars could be greatly increased by marker-assisted selection. Thus, a high-density genetic map is critical to Miscanthus improvement. In this study, a mapping population of 261 F1 progeny was developed from a cross between two diploid M. sinensis cultivars, ‘Strictus’ and ‘Kaskade’. High-density genetic maps for the two parents were produced with 3044 newly developed single nucleotide polymorphisms (SNPs) obtained from restriction site-associated DNA sequencing, and 138 previously mapped GoldenGate SNPs. The female parent (‘Strictus’) map spanned 1599 cM,more » with 1989 SNPs on 19 linkage groups, and an average intermarker spacing of 0.8 cM. The length of the male parent (‘Kaskade’) map was 1612 cM, with 1821 SNPs, and an average intermarker spacing of 0.9 cM. The utility of the map was confirmed by locating quantitative trait loci (QTL) for the zebra-striped trait, which was segregating in this population. Three QTL for zebra-striped presence/absence (zb1, zb2 on LG 7, and zb3 on LG 10) and three for zebra-striped intensity (zbi1, zbi2, zbi3 on LGs 7, 10, 3) were identified. Each allele that caused striping was recessive. Incomplete penetrance was observed for each zb QTL, but penetrance was greatest when two or more zb QTL were homozygous for the causative alleles. Similarly, the intensity of striping was greatest when two or more zbi QTL were homozygous for alleles that conferred the trait. Comparative mapping indicated putative correspondence between zb3 and/or zbi2 on LG 10 to previously sequenced genes conferring zebra stripe in maize and rice. These results demonstrate that the new map is useful for identifying marker–trait associations. The mapped markers will become a valuable community resource, facilitating comparisons among studies and the breeding of Miscanthus.« less
High-density genetic map of Miscanthus sinensis reveals inheritance of zebra stripe
Liu, Siyao; Clark, Lindsay V.; Swaminathan, Kankshita; ...
2015-05-06
Miscanthus is a perennial C4 grass that has recently become an important bioenergy crop. The efficiency of breeding improved Miscanthus biomass cultivars could be greatly increased by marker-assisted selection. Thus, a high-density genetic map is critical to Miscanthus improvement. In this study, a mapping population of 261 F1 progeny was developed from a cross between two diploid M. sinensis cultivars, ‘Strictus’ and ‘Kaskade’. High-density genetic maps for the two parents were produced with 3044 newly developed single nucleotide polymorphisms (SNPs) obtained from restriction site-associated DNA sequencing, and 138 previously mapped GoldenGate SNPs. The female parent (‘Strictus’) map spanned 1599 cM,more » with 1989 SNPs on 19 linkage groups, and an average intermarker spacing of 0.8 cM. The length of the male parent (‘Kaskade’) map was 1612 cM, with 1821 SNPs, and an average intermarker spacing of 0.9 cM. The utility of the map was confirmed by locating quantitative trait loci (QTL) for the zebra-striped trait, which was segregating in this population. Three QTL for zebra-striped presence/absence (zb1, zb2 on LG 7, and zb3 on LG 10) and three for zebra-striped intensity (zbi1, zbi2, zbi3 on LGs 7, 10, 3) were identified. Each allele that caused striping was recessive. Incomplete penetrance was observed for each zb QTL, but penetrance was greatest when two or more zb QTL were homozygous for the causative alleles. Similarly, the intensity of striping was greatest when two or more zbi QTL were homozygous for alleles that conferred the trait. Comparative mapping indicated putative correspondence between zb3 and/or zbi2 on LG 10 to previously sequenced genes conferring zebra stripe in maize and rice. These results demonstrate that the new map is useful for identifying marker–trait associations. The mapped markers will become a valuable community resource, facilitating comparisons among studies and the breeding of Miscanthus.« less
Evaluation of Parkinson Disease Risk Variants as Expression-QTLs
Latourelle, Jeanne C.; Dumitriu, Alexandra; Hadzi, Tiffany C.; Beach, Thomas G.; Myers, Richard H.
2012-01-01
The recent Parkinson Disease GWAS Consortium meta-analysis and replication study reports association at several previously confirmed risk loci SNCA, MAPT, GAK/DGKQ, and HLA and identified a novel risk locus at RIT2. To further explore functional consequences of these associations, we investigated modification of gene expression in prefrontal cortex brain samples of pathologically confirmed PD cases (N = 26) and controls (N = 24) by 67 associated SNPs in these 5 loci. Association between the eSNPs and expression was evaluated using a 2-degrees of freedom test of both association and difference in association between cases and controls, adjusted for relevant covariates. SNPs at each of the 5 loci were tested for cis-acting effects on all probes within 250 kb of each locus. Trans-effects of the SNPs on the 39,122 probes passing all QC on the microarray were also examined. From the analysis of cis-acting SNP effects, several SNPs in the MAPT region show significant association to multiple nearby probes, including two strongly correlated probes targeting the gene LOC644246 and the duplicated genes LRRC37A and LRRC37A2, and a third uncorrelated probe targeting the gene DCAKD. Significant cis-associations were also observed between SNPs and two probes targeting genes in the HLA region on chromosome 6. Expanding the association study to examine trans effects revealed an additional 23 SNP-probe associations reaching statistical significance (p<2.8×10−8) including SNPs from the SNCA, MAPT and RIT2 regions. These findings provide additional context for the interpretation of PD associated SNPs identified in recent GWAS as well as potential insight into the mechanisms underlying the observed SNP associations. PMID:23071545
Fontanesi, L; Bertolini, F; Scotti, E; Trevisi, P; Buttazzoni, L; Dall'olio, S; Davoli, R; Bosi, P; Russo, V
2012-12-01
The proprotein convertase subtilisin/kexin type 1 (PCSK1) gene encodes the prohormone convertase 1/3 enzyme that processes prohormones into functional hormones that, in turn, regulate central and peripheral energy metabolism. Mutations in the human PCSK1 gene cause severe monogenic obesity or confer risk of obesity. We herein investigated the porcine PCSK1 gene with the aim of identifying polymorphisms associated with fat deposition and production traits in Italian heavy pigs. By re-sequencing about 5.1 kb of this gene in 21 pigs of different breeds, we discovered 14 polymorphisms that were organized in nine haplotypes, clearly distributed in two clades of putative European and Asian origin. Then we re-mapped this gene on porcine chromosome 2 and analysed its expression in several tissues including gastric oxyntic mucosa of weanling pigs in which PCSK1 processes the pre-pro-ghrelin into ghrelin, which in turn is involved in the control of feed intake and energy metabolism. Association analyses between PCSK1 single-nucleotide polymorphisms (SNPs) and production, carcass and several other traits were conducted on five groups of pigs from three different experimental designs, for a total of 1221 animals. Results indicated that the analysed SNPs were associated (P < 0.01 or P < 0.05) with several traits including backfat thickness and visible intermuscular fat in Italian Duroc (ID) and growth performances in Italian Large White (ILW) and in ILW × Italian Landrace pigs. However, the effects estimated in the ILW were opposite to the effects reported in the ID pigs. Suggestive association (P < 0.10) was observed with muscle cathepsin B activity, opening, if confirmed, potential applications to reduce the excessive softness defect of the green hams that is of particular concern for the processing industry. The results obtained supported the need to further investigate the PCSK1 gene to fully exploit the value of its variability and apply this information in pig breeding programmes.
Valenzuela-Muñoz, Valentina; Araya-Garay, José Miguel; Gallardo-Escárate, Cristian
2013-06-01
The California red abalone, Haliotis rufescens that belongs to the Haliotidae family, is the largest species of abalone in the world that has sustained the major fishery and aquaculture production in the USA and Mexico. This native mollusk has not been evaluated or assigned a conservation category even though in the last few decades it was heavily exploited until it disappeared in some areas along the California coast. In Chile, the red abalone was introduced in the 1970s from California wild abalone stocks for the purposes of aquaculture. Considering the number of years that the red abalone has been cultivated in Chile crucial genetic information is scarce and critical issues remain unresolved. This study reports and validates novel single nucleotide polymorphisms (SNP) markers for the red abalone H. rufescens using cDNA pyrosequencing. A total of 622 high quality SNPs were identified in 146 sequences with an estimated frequency of 1 SNP each 1000bp. Forty-five SNPs markers with functional information for gene ontology were selected. Of these, 8 were polymorphic among the individuals screened: Heat shock protein 70 (HSP70), vitellogenin (VTG), lysin, alginate lyase enzyme (AL), Glucose-regulated protein 94 (GRP94), fructose-bisphosphate aldolase (FBA), sulfatase 1A precursor (S1AP) and ornithine decarboxylase antizyme (ODC). Two additional sequences were also identified with polymorphisms but no similarities with known proteins were achieved. To validate the putative SNP markers, High Resolution Melting Analysis (HRMA) was conducted in a wild and hatchery-bred population. Additionally, SNP cross-amplifications were tested in two further native abalone species, Haliotis fulgens and Haliotis corrugata. This study provides novel candidate genes that could be used to evaluate loss of genetic diversity due to hatchery selection or inbreeding effects. Copyright © 2013 Elsevier B.V. All rights reserved.
2011-01-01
Background Several susceptibility genetic variants for autoimmune diseases have been identified. A subset of these polymorphisms displays an opposite risk profile in different autoimmune conditions. This observation open interesting questions on the evolutionary forces shaping the frequency of these alleles in human populations. We aimed at testing the hypothesis whereby balancing selection has shaped the frequency of opposite risk alleles. Results Since balancing selection signatures are expected to extend over short genomic portions, we focused our analyses on 11 regions carrying putative functional polymorphisms that may represent the disease variants (and the selection targets). No exceptional nucleotide diversity was observed for ZSCAN23, HLA-DMB, VARS2, PTPN22, BAT3, C6orf47, and IL10; summary statistics were consistent with evolutionary neutrality for these gene regions. Conversely, CDSN/PSORS1C1, TRIM10/TRIM40, BTNL2, and TAP2 showed extremely high nucleotide diversity and most tests rejected neutrality, suggesting the action of balancing selection. For TAP2 and BTNL2 these signatures are not secondary to linkage disequilibrium with HLA class II genes. Nonetheless, with the exception of variants in TRIM40 and CDSN, our data suggest that opposite risk SNPs are not selection targets but rather have accumulated as neutral variants. Conclusion Data herein indicate that balancing selection is common within the extended MHC region and involves several non-HLA loci. Yet, the evolutionary history of most SNPs with an opposite effect for autoimmune diseases is consistent with evolutionary neutrality. We suggest that variants with an opposite effect on autoimmune diseases should not be considered a distinct class of disease alleles from the evolutionary perspective and, in a few cases, the opposite effect on distinct diseases may derive from complex haplotype structures in regions with high genetic diversity. PMID:21682861
Zhao, Nan; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry
2014-01-01
Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of large-scale protein-protein interaction networks, and can be useful for functional annotation of disease-associated SNPs. SNIP-IN tool is freely accessible as a web-server at http://korkinlab.org/snpintool/. PMID:24784581
Wang, Hao; Liu, Kan; Chen, Kuan-Ju; Lu, Yujie; Wang, Shutao; Lin, Wei-Yu; Guo, Feng; Kamei, Ken-ichiro; Chen, Yi-Chun; Ohashi, Minori; Wang, Mingwei; Garcia, Mitch André; Zhao, Xing-Zhong; Shen, Clifton K.-F.; Tseng, Hsian-Rong
2010-01-01
Nanoparticles are regarded as promising transfection reagents for effective and safe delivery of nucleic acids into specific type of cells or tissues providing an alternative manipulation/therapy strategy to viral gene delivery. However, the current process of searching novel delivery materials is limited due to conventional low-throughput and time-consuming multistep synthetic approaches. Additionally, conventional approaches are frequently accompanied with unpredictability and continual optimization refinements, impeding flexible generation of material diversity creating a major obstacle to achieving high transfection performance. Here we have demonstrated a rapid developmental pathway toward highly efficient gene delivery systems by leveraging the powers of a supramolecular synthetic approach and a custom-designed digital microreactor. Using the digital microreactor, broad structural/functional diversity can be programmed into a library of DNA-encapsulated supramolecular nanoparticles (DNA⊂SNPs) by systematically altering the mixing ratios of molecular building blocks and a DNA plasmid. In vitro transfection studies with DNA⊂SNPs library identified the DNA⊂SNPs with the highest gene transfection efficiency, which can be attributed to cooperative effects of structures and surface chemistry of DNA⊂SNPs. We envision such a rapid developmental pathway can be adopted for generating nanoparticle-based vectors for delivery of a variety of loads. PMID:20925389
Huang, Shunmou; Yang, Hongli; Zhan, Gaomiao; Wang, Xinfa; Liu, Guihua; Wang, Hanzhong
2012-01-01
Background Single nucleotide polymorphisms (SNPs) are an important class of genetic marker for target gene mapping. As of yet, there is no rapid and effective method to identify SNPs linked with agronomic traits in rapeseed and other crop species. Methodology/Principal Findings We demonstrate a novel method for identifying SNP markers in rapeseed by deep sequencing a representative library and performing bulk segregant analysis. With this method, SNPs associated with rapeseed pod shatter-resistance were discovered. Firstly, a reduced representation of the rapeseed genome was used. Genomic fragments ranging from 450–550 bp were prepared from the susceptible bulk (ten F2 plants with the silique shattering resistance index, SSRI <0.10) and the resistance bulk (ten F2 plants with SSRI >0.90), and also Solexa sequencing-produced 90 bp reads. Approximately 50 million of these sequence reads were assembled into contigs to a depth of 20-fold coverage. Secondly, 60,396 ‘simple SNPs’ were identified, and the statistical significance was evaluated using Fisher's exact test. There were 70 associated SNPs whose –log10 p value over 16 were selected to be further analyzed. The distribution of these SNPs appeared a tight cluster, which consisted of 14 associated SNPs within a 396 kb region on chromosome A09. Our evidence indicates that this region contains a major quantitative trait locus (QTL). Finally, two associated SNPs from this region were mapped on a major QTL region. Conclusions/Significance 70 associated SNPs were discovered and a major QTL for rapeseed pod shatter-resistance was found on chromosome A09 using our novel method. The associated SNP markers were used for mapping of the QTL, and may be useful for improving pod shatter-resistance in rapeseed through marker-assisted selection and map-based cloning. This approach will accelerate the discovery of major QTLs and the cloning of functional genes for important agronomic traits in rapeseed and other crop species. PMID:22529909
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tucker, Susan L., E-mail: sltucker@mdanderson.org; Li Minghuan; Xu Ting
2013-01-01
Purpose: To determine whether single-nucleotide polymorphisms (SNPs) in genes associated with DNA repair, cell cycle, transforming growth factor-{beta}, tumor necrosis factor and receptor, folic acid metabolism, and angiogenesis can significantly improve the fit of the Lyman-Kutcher-Burman (LKB) normal-tissue complication probability (NTCP) model of radiation pneumonitis (RP) risk among patients with non-small cell lung cancer (NSCLC). Methods and Materials: Sixteen SNPs from 10 different genes (XRCC1, XRCC3, APEX1, MDM2, TGF{beta}, TNF{alpha}, TNFR, MTHFR, MTRR, and VEGF) were genotyped in 141 NSCLC patients treated with definitive radiation therapy, with or without chemotherapy. The LKB model was used to estimate the risk ofmore » severe (grade {>=}3) RP as a function of mean lung dose (MLD), with SNPs and patient smoking status incorporated into the model as dose-modifying factors. Multivariate analyses were performed by adding significant factors to the MLD model in a forward stepwise procedure, with significance assessed using the likelihood-ratio test. Bootstrap analyses were used to assess the reproducibility of results under variations in the data. Results: Five SNPs were selected for inclusion in the multivariate NTCP model based on MLD alone. SNPs associated with an increased risk of severe RP were in genes for TGF{beta}, VEGF, TNF{alpha}, XRCC1 and APEX1. With smoking status included in the multivariate model, the SNPs significantly associated with increased risk of RP were in genes for TGF{beta}, VEGF, and XRCC3. Bootstrap analyses selected a median of 4 SNPs per model fit, with the 6 genes listed above selected most often. Conclusions: This study provides evidence that SNPs can significantly improve the predictive ability of the Lyman MLD model. With a small number of SNPs, it was possible to distinguish cohorts with >50% risk vs <10% risk of RP when they were exposed to high MLDs.« less
Multiple lupus-associated ITGAM variants alter Mac-1 functions on neutrophils.
Zhou, Yebin; Wu, Jianming; Kucik, Dennis F; White, Nathan B; Redden, David T; Szalai, Alexander J; Bullard, Daniel C; Edberg, Jeffrey C
2013-11-01
Multiple studies have demonstrated that single-nucleotide polymorphisms (SNPs) in the ITGAM locus (including the nonsynonymous SNPs rs1143679, rs1143678, and rs1143683) are associated with systemic lupus erythematosus (SLE). ITGAM encodes the protein CD11b, a subunit of the β2 integrin Mac-1. The purpose of this study was to determine the effects of ITGAM genetic variation on the biologic functions of neutrophil Mac-1. Neutrophils from ITGAM-genotyped and -sequenced healthy donors were isolated for functional studies. The phagocytic capacity of neutrophil ITGAM variants was probed with complement-coated erythrocytes, serum-treated zymosan, heat-treated zymosan, and IgG-coated erythrocytes. The adhesion capacity of ITGAM variants, in adhering to either purified intercellular adhesion molecule 1 or tumor necrosis factor α-stimulated endothelial cells, was assessed in a flow chamber. Expression levels of total CD11b and activation of CD11b were assessed by flow cytometry. Mac-1-mediated neutrophil phagocytosis, determined in cultures with 2 different complement-coated particles, was significantly reduced in individuals with nonsynonymous variant alleles of ITGAM. This reduction in phagocytosis was related to variation at either rs1143679 (in the β-propeller region) or rs1143678/rs1143683 (highly linked SNPs in the cytoplasmic/calf-1 regions). Phagocytosis mediated by Fcγ receptors was also significantly reduced in donors with variant ITGAM alleles. Similarly, firm adhesion of neutrophils was significantly reduced in individuals with variant ITGAM alleles. These functional alterations were not attributable to differences in total receptor expression or activation. The nonsynonymous ITGAM variants rs1143679 and rs1143678/rs113683 contribute to altered Mac-1 function on neutrophils. These results underscore the need to consider multiple nonsynonymous SNPs when assessing the functional consequences of ITGAM variation on immune cell processes and the risk of SLE. Copyright © 2013 by the American College of Rheumatology.
2011-01-01
Background Sequence variants in genes functioning in folate-mediated one-carbon metabolism are hypothesized to lead to changes in levels of homocysteine and DNA methylation, which, in turn, are associated with risk of cardiovascular disease. Methods 330 SNPs in 52 genes were studied in relation to plasma homocysteine and global genomic DNA methylation. SNPs were selected based on functional effects and gene coverage, and assays were completed on the Illumina Goldengate platform. Age-, smoking-, and nutrient-adjusted genotype--phenotype associations were estimated in regression models. Results Using a nominal P ≤ 0.005 threshold for statistical significance, 20 SNPs were associated with plasma homocysteine, 8 with Alu methylation, and 1 with LINE-1 methylation. Using a more stringent false discovery rate threshold, SNPs in FTCD, SLC19A1, and SLC19A3 genes remained associated with plasma homocysteine. Gene by vitamin B-6 interactions were identified for both Alu and LINE-1 methylation, and epistatic interactions with the MTHFR rs1801133 SNP were identified for the plasma homocysteine phenotype. Pleiotropy involving the MTHFD1L and SARDH genes for both plasma homocysteine and Alu methylation phenotypes was identified. Conclusions No single gene was associated with all three phenotypes, and the set of the most statistically significant SNPs predictive of homocysteine or Alu or LINE-1 methylation was unique to each phenotype. Genetic variation in folate-mediated one-carbon metabolism, other than the well-known effects of the MTHFR c.665C>T (known as c.677 C>T, rs1801133, p.Ala222Val), is predictive of cardiovascular disease biomarkers. PMID:22103680
Laumen, Helmut; Saningong, Akuma D.; Heid, Iris M.; Hess, Jochen; Herder, Christian; Claussnitzer, Melina; Baumert, Jens; Lamina, Claudia; Rathmann, Wolfgang; Sedlmeier, Eva-Maria; Klopp, Norman; Thorand, Barbara; Wichmann, H.-Erich; Illig, Thomas; Hauner, Hans
2009-01-01
OBJECTIVE Adiponectin (APM1, ACDC) is an adipocyte-derived protein with downregulated expression in obesity and insulin-resistant states. Several potentially regulatory single nucleotide polymorphisms (SNPs) within the APM1 gene promoter region have been associated with circulating adiponectin levels. None of them have been functionally characterized in adiponectin-expressing cells. Hence, we investigated three SNPs (rs16861194, rs17300539, and rs266729) for their influence on adiponectin promoter activity and their association with circulating adiponectin levels. RESEARCH DESIGN AND METHODS Basal and rosiglitazone-induced promoter activity of different SNP combinations (haplotypes) was analyzed in 3T3-L1 adipocytes using luciferase reporter gene assays and DNA binding studies comparing all possible APM1 haplotypes. This functional approach was complemented with analysis of epidemiological population-based data of 1,692 participants of the MONICA/KORA S123 cohort and 696 participants from the KORA S4 cohort for SNP and haplotype association with circulating adiponectin levels. RESULTS Major to minor allele replacements of the three SNPs revealed significant effects on promoter activity in luciferase assays. Particularly, a minor variant in rs16861194 resulted in reduced basal and rosiglitazone-induced promoter activity and hypoadiponectinemia in the epidemiological datasets. The haplotype with the minor allele in all three SNPs showed a complete loss of promoter activity, and no subject carried this haplotype in either of the epidemiological samples (combined P value for statistically significant difference from a random sample was 0.006). CONCLUSIONS Our results clearly demonstrate that promoter variants associated with hypoadiponectinemia in humans substantially affect adiponectin promoter activity in adipocytes. Our combination of functional experiments with epidemiological data overcomes the drawback of each approach alone. PMID:19074982
Busch, Robert; Hobbs, Brian D; Zhou, Jin; Castaldi, Peter J; McGeachie, Michael J; Hardin, Megan E; Hawrylkiewicz, Iwona; Sliwinski, Pawel; Yim, Jae-Joon; Kim, Woo Jin; Kim, Deog K; Agusti, Alvar; Make, Barry J; Crapo, James D; Calverley, Peter M; Donner, Claudio F; Lomas, David A; Wouters, Emiel F; Vestbo, Jørgen; Tal-Singer, Ruth; Bakke, Per; Gulsvik, Amund; Litonjua, Augusto A; Sparrow, David; Paré, Peter D; Levy, Robert D; Rennard, Stephen I; Beaty, Terri H; Hokanson, John; Silverman, Edwin K; Cho, Michael H
2017-07-01
The heritability of chronic obstructive pulmonary disease (COPD) cannot be fully explained by recognized genetic risk factors identified as achieving genome-wide significance. In addition, the combined contribution of genetic variation to COPD risk has not been fully explored. We sought to determine: (1) whether studies of variants from previous studies of COPD or lung function in a larger sample could identify additional associated variants, particularly for severe COPD; and (2) the impact of genetic risk scores on COPD. We genotyped 3,346 single-nucleotide polymorphisms (SNPs) in 2,588 cases (1,803 severe COPD) and 1,782 control subjects from four cohorts, and performed association testing with COPD, combining these results with existing genotyping data from 6,633 cases (3,497 severe COPD) and 5,704 control subjects. In addition, we developed genetic risk scores from SNPs associated with lung function and COPD and tested their discriminatory power for COPD-related measures. We identified significant associations between SNPs near PPIC (P = 1.28 × 10 -8 ) and PPP4R4/SERPINA1 (P = 1.01 × 10 -8 ) and severe COPD; the latter association may be driven by recognized variants in SERPINA1. Genetic risk scores based on SNPs previously associated with COPD and lung function had a modest ability to discriminate COPD (area under the curve, ∼0.6), and accounted for a mean 0.9-1.9% lower forced expiratory volume in 1 second percent predicted for each additional risk allele. In a large genetic association analysis, we identified associations with severe COPD near PPIC and SERPINA1. A risk score based on combining genetic variants had modest, but significant, effects on risk of COPD and lung function.
Gunst, Annika; Jern, Patrick; Westberg, Lars; Johansson, Ada; Salo, Benny; Burri, Andrea; Spector, Tim; Eriksson, Elias; Sandnabba, N Kenneth; Santtila, Pekka
2015-03-01
Female sexual desire and arousal problems have been shown to have a heritable component of moderate size. Previous molecular genetic studies on sexual desire have mainly focused on genes associated with neurotransmitters such as dopamine and serotonin. Nevertheless, there is reason to believe that hormones with more specific functions concerning sexuality could have an impact on sexual desire and arousal. The aim of the present study was to investigate the possible effects of 17 single nucleotide polymorphisms (SNPs) located in estrogen receptor genes on female sexual desire and subjective and genital arousal (lubrication). Based on previous research, we hypothesized that ESR1 and ESR2 are relevant genes that contribute to female sexual desire and arousal. The desire, arousal, and lubrication subdomains of the Female Sexual Function Index self-report questionnaire were used. The present study involved 2,448 female twins and their sisters aged 18-49 who had submitted saliva samples for genotyping. The participants were a subset from a large-scale, population-based sample. We found nominally significant main effects on sexual desire for three ESR2 -linked SNPs when controlled for anxiety, suggesting that individuals homozygous for the G allele of the rs1271572 SNP, and the A allele of the rs4986938 and rs928554 SNPs had lower levels of sexual desire. The rs4986938 SNP also had a nominally significant effect on lubrication. No effects for any of the SNPs on subjective arousal could be detected. The number of nominally significant results for SNPs in the ESR2 gene before correcting for multiple testing suggests that further studies on the possible influence of this gene on interindividual variation in female sexual functioning are warranted. In contrast, no support for an involvement of ESR1 was obtained. Our results should be interpreted with caution until replicated in independent, large samples. © 2014 International Society for Sexual Medicine.
Saba, Reuben; Medina, Sarah J; Booth, Stephanie A
2014-10-01
The involvement of SNPs in miRNA target sites remains poorly investigated in neurodegenerative disease. In addition to associations with disease risk, such genetic variations can also provide novel insight into mechanistic pathways that may be responsible for disease etiology and/or pathobiology. To identify SNPs associated specifically with degenerating neurons, we restricted our analysis to genes that are dysregulated in CA1 hippocampal neurons of mice during early, preclinical phase of Prion disease. The 125 genes chosen are also implicated in other numerous degenerative and neurological diseases and disorders and are therefore likely to be of fundamental importance. We predicted those SNPs that could increase, decrease, or have neutral effects on miRNA binding. This group of genes was more likely to possess DNA variants than were genes chosen at random. Furthermore, many of the SNPs are common within the human population, and could contribute to the growing awareness that miRNAs and associated SNPs could account for detrimental neurological states. Interestingly, SNPs that overlapped miRNA-binding sites in the 3'-UTR of GABA-receptor subunit coding genes were particularly enriched. Moreover, we demonstrated that SNP rs9291296 would strengthen miR-26a-5p binding to a highly conserved site in the 3'-UTR of gamma-aminobutyric acid receptor subunit alpha-4. © 2014 WILEY PERIODICALS, INC.
Single nucleotide polymorphisms and haplotype frequencies of CYP3A5 in a Japanese population.
Saeki, Mayumi; Saito, Yoshiro; Nakamura, Takahiro; Murayama, Norie; Kim, Su-Ryang; Ozawa, Shogo; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Nakajima, Toshiharu; Saito, Hirohisa; Kitamura, Yutaka; Kamatani, Naoyuki; Sawada, Jun-ichi
2003-06-01
In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A5 in a Japanese population, we sequenced the proximal promoter region, all exons, and the surrounding intronic regions using genomic DNA from 187 Japanese subjects. Thirteen SNPs, including seven novel ones: 13108T>C, 16025A>G, 16903A>G, 16993C>G, 27448C>A, 29782A>G, and 31551T>C (A of the translational start codon of GenBank Accession # NG_000004.2 is numbered 1 according to the CYP Allele Nomenclature), were identified. The most common SNP was 6986A>G (key SNP for CYP3A5*3), with a 0.759 frequency. Two novel SNPs, 29782A>G (I456V) and 31551T>C (I488T), as well as 12952T>C (*5 marker) were found, but these alterations were always associated with the *3A marker SNPs, 6986A>G and 31611C>T. Using these 13 SNPs, haplotype analysis was performed and five novel *1 haplotypes (subtypes) (*1e to *1i) and six novel *3 haplotypes (subtypes) (*3d to *3i) were identified. Our findings suggest that CYP3A5*3 is the major defective allele and that other functional exonic SNPs are rare in the Japanese. Copyright 2003 Wiley-Liss, Inc.
Ingle, James N.; Schaid, Daniel J.; Goss, Paul E.; Liu, Mohan; Mushiroda, Taisei; Chapman, Judy-Anne W.; Kubo, Michiaki; Jenkins, Gregory D.; Batzler, Anthony; Shepherd, Lois; Pater, Joseph; Wang, Liewei; Ellis, Matthew J.; Stearns, Vered; Rohrer, Daniel C.; Goetz, Matthew P.; Pritchard, Kathleen I.; Flockhart, David A.; Nakamura, Yusuke; Weinshilboum, Richard M.
2010-01-01
Purpose We performed a case-control genome-wide association study (GWAS) to identify single nucleotide polymorphisms (SNPs) associated with musculoskeletal adverse events (MS-AEs) in women treated with aromatase inhibitors (AIs) for early breast cancer. Patients and Methods A nested case-control design was used to select patients enrolled onto the MA.27 phase III trial comparing anastrozole with exemestane. Cases were matched to two controls and were defined as patients with grade 3 or 4 MS-AEs (according to the National Cancer Institute's Common Terminology Criteria for Adverse Events v3.0) or those who discontinued treatment for any grade of MS-AE within the first 2 years. Genotyping was performed with the Illumina Human610-Quad BeadChip. Results The GWAS included 293 cases and 585 controls. A total of 551,358 SNPs were analyzed, followed by imputation and fine mapping of a region of interest on chromosome 14. Four SNPs on chromosome 14 had the lowest P values (2.23E-06 to 6.67E-07). T-cell leukemia 1A (TCL1A) was the gene closest (926-7000 bp) to the four SNPs. Functional genomic studies revealed that one of these SNPs (rs11849538) created an estrogen response element and that TCL1A expression was estrogen dependent, was associated with the variant SNP genotypes in estradiol-treated lymphoblastoid cells transfected with estrogen receptor alpha and was directly related to interleukin 17 receptor A (IL17RA) expression. Conclusion This GWAS identified SNPs associated with MS-AEs in women treated with AIs and with a gene (TCL1A) which, in turn, was related to a cytokine (IL17). These findings provide a focus for further research to identify patients at risk for MS-AEs and to explore the mechanisms for these adverse events. PMID:20876420
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kerns, Sarah L.; Departments of Pathology and Genetics, Albert Einstein College of Medicine, Bronx, New York; Stock, Richard
2013-01-01
Purpose: To identify single nucleotide polymorphisms (SNPs) associated with development of erectile dysfunction (ED) among prostate cancer patients treated with radiation therapy. Methods and Materials: A 2-stage genome-wide association study was performed. Patients were split randomly into a stage I discovery cohort (132 cases, 103 controls) and a stage II replication cohort (128 cases, 102 controls). The discovery cohort was genotyped using Affymetrix 6.0 genome-wide arrays. The 940 top ranking SNPs selected from the discovery cohort were genotyped in the replication cohort using Illumina iSelect custom SNP arrays. Results: Twelve SNPs identified in the discovery cohort and validated in themore » replication cohort were associated with development of ED following radiation therapy (Fisher combined P values 2.1 Multiplication-Sign 10{sup -5} to 6.2 Multiplication-Sign 10{sup -4}). Notably, these 12 SNPs lie in or near genes involved in erectile function or other normal cellular functions (adhesion and signaling) rather than DNA damage repair. In a multivariable model including nongenetic risk factors, the odds ratios for these SNPs ranged from 1.6 to 5.6 in the pooled cohort. There was a striking relationship between the cumulative number of SNP risk alleles an individual possessed and ED status (Sommers' D P value = 1.7 Multiplication-Sign 10{sup -29}). A 1-allele increase in cumulative SNP score increased the odds for developing ED by a factor of 2.2 (P value = 2.1 Multiplication-Sign 10{sup -19}). The cumulative SNP score model had a sensitivity of 84% and specificity of 75% for prediction of developing ED at the radiation therapy planning stage. Conclusions: This genome-wide association study identified a set of SNPs that are associated with development of ED following radiation therapy. These candidate genetic predictors warrant more definitive validation in an independent cohort.« less
GESPA: classifying nsSNPs to predict disease association.
Khurana, Jay K; Reeder, Jay E; Shrimpton, Antony E; Thakar, Juilee
2015-07-25
Non-synonymous single nucleotide polymorphisms (nsSNPs) are the most common DNA sequence variation associated with disease in humans. Thus determining the clinical significance of each nsSNP is of great importance. Potential detrimental nsSNPs may be identified by genetic association studies or by functional analysis in the laboratory, both of which are expensive and time consuming. Existing computational methods lack accuracy and features to facilitate nsSNP classification for clinical use. We developed the GESPA (GEnomic Single nucleotide Polymorphism Analyzer) program to predict the pathogenicity and disease phenotype of nsSNPs. GESPA is a user-friendly software package for classifying disease association of nsSNPs. It allows flexibility in acceptable input formats and predicts the pathogenicity of a given nsSNP by assessing the conservation of amino acids in orthologs and paralogs and supplementing this information with data from medical literature. The development and testing of GESPA was performed using the humsavar, ClinVar and humvar datasets. Additionally, GESPA also predicts the disease phenotype associated with a nsSNP with high accuracy, a feature unavailable in existing software. GESPA's overall accuracy exceeds existing computational methods for predicting nsSNP pathogenicity. The usability of GESPA is enhanced by fast SQL-based cloud storage and retrieval of data. GESPA is a novel bioinformatics tool to determine the pathogenicity and phenotypes of nsSNPs. We anticipate that GESPA will become a useful clinical framework for predicting the disease association of nsSNPs. The program, executable jar file, source code, GPL 3.0 license, user guide, and test data with instructions are available at http://sourceforge.net/projects/gespa.
Nagasundaram, N; Priya Doss, C George
2011-01-01
Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this approach, we have used the existing in silico methods to explore the mutation-structure-function relationship in the XPAgene. We used the Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), I-Mutant 2.0, and the Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function and evaluated the impact of mutation on protein stability by Molecular Dynamics simulations. By comparing the scores of all the four in silico methods, nsSNP with an ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our Molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPAgene. Based on the in silico methods score, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that deleterious nsSNP at position C108F would play a significant role in causing disease by the XPA gene. Our approach would present the application of in silicotools in understanding the functional variation from the perspective of structure, evolution, and phenotype.
Sallman, David A.; Basiorka, Ashley A.; Irvine, Brittany A.; Zhang, Ling; Epling-Burnette, P.K.; Rollison, Dana E.; Mallo, Mar; Sokol, Lubomir; Solé, Francesc; Maciejewski, Jaroslaw; List, Alan F.
2015-01-01
P53 is a key regulator of many cellular processes and is negatively regulated by the human homolog of murine double minute-2 (MDM2) E3 ubiquitin ligase. Single nucleotide polymorphisms (SNPs) of either gene alone, and in combination, are linked to cancer susceptibility, disease progression, and therapy response. We analyzed the interaction of TP53 R72P and MDM2 SNP309 SNPs in relationship to outcome in patients with myelodysplastic syndromes (MDS). Sanger sequencing was performed on DNA isolated from 208 MDS cases. Utilizing a novel functional SNP scoring system ranging from +2 to −2 based on predicted p53 activity, we found statistically significant differences in overall survival (OS) (p = 0.02) and progression-free survival (PFS) (p = 0.02) in non-del(5q) MDS patients with low functional scores. In univariate analysis, only IPSS and the functional SNP score predicted OS and PFS in non-del(5q) patients. In multivariate analysis, the functional SNP score was independent of IPSS for OS and PFS. These data underscore the importance of TP53 R72P and MDM2 SNP309 SNPs in MDS, and provide a novel scoring system independent of IPSS that is predictive for disease outcome. PMID:26416416
Zeng, Chenjie; Guo, Xingyi; Long, Jirong; Kuchenbaecker, Karoline B; Droit, Arnaud; Michailidou, Kyriaki; Ghoussaini, Maya; Kar, Siddhartha; Freeman, Adam; Hopper, John L; Milne, Roger L; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Agata, Simona; Ahmed, Shahana; Aittomäki, Kristiina; Andrulis, Irene L; Anton-Culver, Hoda; Antonenkova, Natalia N; Arason, Adalgeir; Arndt, Volker; Arun, Banu K; Arver, Brita; Bacot, Francois; Barrowdale, Daniel; Baynes, Caroline; Beeghly-Fadiel, Alicia; Benitez, Javier; Bermisheva, Marina; Blomqvist, Carl; Blot, William J; Bogdanova, Natalia V; Bojesen, Stig E; Bonanni, Bernardo; Borresen-Dale, Anne-Lise; Brand, Judith S; Brauch, Hiltrud; Brennan, Paul; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Buys, Saundra S; Cai, Qiuyin; Caldes, Trinidad; Campbell, Ian; Carpenter, Jane; Chang-Claude, Jenny; Choi, Ji-Yeob; Claes, Kathleen B M; Clarke, Christine; Cox, Angela; Cross, Simon S; Czene, Kamila; Daly, Mary B; de la Hoya, Miguel; De Leeneer, Kim; Devilee, Peter; Diez, Orland; Domchek, Susan M; Doody, Michele; Dorfling, Cecilia M; Dörk, Thilo; Dos-Santos-Silva, Isabel; Dumont, Martine; Dwek, Miriam; Dworniczak, Bernd; Egan, Kathleen; Eilber, Ursula; Einbeigi, Zakaria; Ejlertsen, Bent; Ellis, Steve; Frost, Debra; Lalloo, Fiona; Fasching, Peter A; Figueroa, Jonine; Flyger, Henrik; Friedlander, Michael; Friedman, Eitan; Gambino, Gaetana; Gao, Yu-Tang; Garber, Judy; García-Closas, Montserrat; Gehrig, Andrea; Damiola, Francesca; Lesueur, Fabienne; Mazoyer, Sylvie; Stoppa-Lyonnet, Dominique; Giles, Graham G; Godwin, Andrew K; Goldgar, David E; González-Neira, Anna; Greene, Mark H; Guénel, Pascal; Haeberle, Lothar; Haiman, Christopher A; Hallberg, Emily; Hamann, Ute; Hansen, Thomas V O; Hart, Steven; Hartikainen, Jaana M; Hartman, Mikael; Hassan, Norhashimah; Healey, Sue; Hogervorst, Frans B L; Verhoef, Senno; Hendricks, Carolyn B; Hillemanns, Peter; Hollestelle, Antoinette; Hulick, Peter J; Hunter, David J; Imyanitov, Evgeny N; Isaacs, Claudine; Ito, Hidemi; Jakubowska, Anna; Janavicius, Ramunas; Jaworska-Bieniek, Katarzyna; Jensen, Uffe Birk; John, Esther M; Joly Beauparlant, Charles; Jones, Michael; Kabisch, Maria; Kang, Daehee; Karlan, Beth Y; Kauppila, Saila; Kerin, Michael J; Khan, Sofia; Khusnutdinova, Elza; Knight, Julia A; Konstantopoulou, Irene; Kraft, Peter; Kwong, Ava; Laitman, Yael; Lambrechts, Diether; Lazaro, Conxi; Le Marchand, Loic; Lee, Chuen Neng; Lee, Min Hyuk; Lester, Jenny; Li, Jingmei; Liljegren, Annelie; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mai, Phuong L; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Marme, Frederik; Matsuo, Keitaro; McGuffog, Lesley; Meindl, Alfons; Menegaux, Florence; Montagna, Marco; Muir, Kenneth; Mulligan, Anna Marie; Nathanson, Katherine L; Neuhausen, Susan L; Nevanlinna, Heli; Newcomb, Polly A; Nord, Silje; Nussbaum, Robert L; Offit, Kenneth; Olah, Edith; Olopade, Olufunmilayo I; Olswold, Curtis; Osorio, Ana; Papi, Laura; Park-Simon, Tjoung-Won; Paulsson-Karlsson, Ylva; Peeters, Stephanie; Peissel, Bernard; Peterlongo, Paolo; Peto, Julian; Pfeiler, Georg; Phelan, Catherine M; Presneau, Nadege; Radice, Paolo; Rahman, Nazneen; Ramus, Susan J; Rashid, Muhammad Usman; Rennert, Gad; Rhiem, Kerstin; Rudolph, Anja; Salani, Ritu; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Schoemaker, Minouk J; Schürmann, Peter; Seynaeve, Caroline; Shen, Chen-Yang; Shrubsole, Martha J; Shu, Xiao-Ou; Sigurdson, Alice; Singer, Christian F; Slager, Susan; Soucy, Penny; Southey, Melissa; Steinemann, Doris; Swerdlow, Anthony; Szabo, Csilla I; Tchatchou, Sandrine; Teixeira, Manuel R; Teo, Soo H; Terry, Mary Beth; Tessier, Daniel C; Teulé, Alex; Thomassen, Mads; Tihomirova, Laima; Tischkowitz, Marc; Toland, Amanda E; Tung, Nadine; Turnbull, Clare; van den Ouweland, Ans M W; van Rensburg, Elizabeth J; Ven den Berg, David; Vijai, Joseph; Wang-Gohrke, Shan; Weitzel, Jeffrey N; Whittemore, Alice S; Winqvist, Robert; Wong, Tien Y; Wu, Anna H; Yannoukakos, Drakoulis; Yu, Jyh-Cherng; Pharoah, Paul D P; Hall, Per; Chenevix-Trench, Georgia; Dunning, Alison M; Simard, Jacques; Couch, Fergus J; Antoniou, Antonis C; Easton, Douglas F; Zheng, Wei
2016-06-21
Multiple recent genome-wide association studies (GWAS) have identified a single nucleotide polymorphism (SNP), rs10771399, at 12p11 that is associated with breast cancer risk. We performed a fine-scale mapping study of a 700 kb region including 441 genotyped and more than 1300 imputed genetic variants in 48,155 cases and 43,612 controls of European descent, 6269 cases and 6624 controls of East Asian descent and 1116 cases and 932 controls of African descent in the Breast Cancer Association Consortium (BCAC; http://bcac.ccge.medschl.cam.ac.uk/ ), and in 15,252 BRCA1 mutation carriers in the Consortium of Investigators of Modifiers of BRCA1/2 (CIMBA). Stepwise regression analyses were performed to identify independent association signals. Data from the Encyclopedia of DNA Elements project (ENCODE) and the Cancer Genome Atlas (TCGA) were used for functional annotation. Analysis of data from European descendants found evidence for four independent association signals at 12p11, represented by rs7297051 (odds ratio (OR) = 1.09, 95 % confidence interval (CI) = 1.06-1.12; P = 3 × 10(-9)), rs805510 (OR = 1.08, 95 % CI = 1.04-1.12, P = 2 × 10(-5)), and rs1871152 (OR = 1.04, 95 % CI = 1.02-1.06; P = 2 × 10(-4)) identified in the general populations, and rs113824616 (P = 7 × 10(-5)) identified in the meta-analysis of BCAC ER-negative cases and BRCA1 mutation carriers. SNPs rs7297051, rs805510 and rs113824616 were also associated with breast cancer risk at P < 0.05 in East Asians, but none of the associations were statistically significant in African descendants. Multiple candidate functional variants are located in putative enhancer sequences. Chromatin interaction data suggested that PTHLH was the likely target gene of these enhancers. Of the six variants with the strongest evidence of potential functionality, rs11049453 was statistically significantly associated with the expression of PTHLH and its nearby gene CCDC91 at P < 0.05. This study identified four independent association signals at 12p11 and revealed potentially functional variants, providing additional insights into the underlying biological mechanism(s) for the association observed between variants at 12p11 and breast cancer risk.
de Manuel, Marc; Shiina, Takashi; Suzuki, Shingo; Dereuddre-Bosquet, Nathalie; Garchon, Henri-Jean; Tanaka, Masayuki; Congy-Jolivet, Nicolas; Aarnink, Alice; Le Grand, Roger; Marques-Bonet, Tomas; Blancher, Antoine
2018-05-08
In the Mauritian macaque experimentally inoculated with SIV, gene polymorphisms potentially associated with the plasma virus load at a set point, approximately 100 days post inoculation, were investigated. Among the 42 animals inoculated with 50 AID 50 of the same strain of SIV, none of which received any preventive or curative treatment, nine individuals were selected: three with a plasma virus load (PVL) among the lowest, three with intermediate PVL values and three among the highest PVL values. The complete genomes of these nine animals were then analyzed. Initially, attention was focused on variants with a potential functional impact on protein encoding genes (non-synonymous SNPs (NS-SNPs) and splicing variants). Thus, 424 NS-SNPs possibly associated with PVL were detected. The 424 candidates SNPs were genotyped in these 42 SIV experimentally infected animals (including the nine animals subjected to whole genome sequencing). The genes containing variants most probably associated with PVL at a set time point are analyzed herein.
Purfield, Deirdre C.; McParland, Sinead; Wall, Eamon; Berry, Donagh P.
2017-01-01
Domestication and the subsequent selection of animals for either economic or morphological features can leave a variety of imprints on the genome of a population. Genomic regions subjected to high selective pressures often show reduced genetic diversity and frequent runs of homozygosity (ROH). Therefore, the objective of the present study was to use 42,182 autosomal SNPs to identify genomic regions in 3,191 sheep from six commercial breeds subjected to selection pressure and to quantify the genetic diversity within each breed using ROH. In addition, the historical effective population size of each breed was also estimated and, in conjunction with ROH, was used to elucidate the demographic history of the six breeds. ROH were common in the autosomes of animals in the present study, but the observed breed differences in patterns of ROH length and burden suggested differences in breed effective population size and recent management. ROH provided a sufficient predictor of the pedigree inbreeding coefficient, with an estimated correlation between both measures of 0.62. Genomic regions under putative selection were identified using two complementary algorithms; the fixation index and hapFLK. The identified regions under putative selection included candidate genes associated with skin pigmentation, body size and muscle formation; such characteristics are often sought after in modern-day breeding programs. These regions of selection frequently overlapped with high ROH regions both within and across breeds. Multiple yet uncharacterised genes also resided within putative regions of selection. This further substantiates the need for a more comprehensive annotation of the sheep genome as these uncharacterised genes may contribute to traits of interest in the animal sciences. Despite this, the regions identified as under putative selection in the current study provide an insight into the mechanisms leading to breed differentiation and genetic variation in meat production. PMID:28463982
A large scale analysis of genetic variants within putative miRNA binding sites in prostate cancer
Stegeman, Shane; Amankwah, Ernest; Klein, Kerenaftali; O’Mara, Tracy A.; Kim, Donghwa; Lin, Hui-Yi; Permuth-Wey, Jennifer; Sellers, Thomas A.; Srinivasan, Srilakshmi; Eeles, Rosalind; Easton, Doug; Kote-Jarai, Zsofia; Olama, Ali Amin Al; Benlloch, Sara; Muir, Kenneth; Giles, Graham G.; Wiklund, Fredrik; Gronberg, Henrik; Haiman, Christopher A.; Schleutker, Johanna; Nordestgaard, Børge G.; Travis, Ruth C.; Neal, David; Pharoah, Paul; Khaw, Kay-Tee; Stanford, Janet L.; Blot, William J.; Thibodeau, Stephen; Maier, Christiane; Kibel, Adam S.; Cybulski, Cezary; Cannon-Albright, Lisa; Brenner, Hermann; Kaneva, Radka; Teixeira, Manuel R.; Consortium, PRACTICAL; Spurdle, Amanda B.; Clements, Judith A.; Park, Jong Y.; Batra, Jyotsna
2015-01-01
Prostate cancer is the second most common malignancy among men worldwide. Genome-wide association studies (GWAS) have identified 100 risk variants for prostate cancer, which can explain ~33% of the familial risk of the disease. We hypothesized that a comprehensive analysis of genetic variations found within the 3′ UTR of genes predicted to affect miRNA binding (miRSNPs) can identify additional prostate cancer risk variants. We investigated the association between 2,169 miRSNPs and prostate cancer risk in a large-scale analysis of 22,301 cases and 22,320 controls of European ancestry from 23 participating studies. Twenty-two miRSNPs were associated (p<2.3×10−5) with risk of prostate cancer, 10 of which were within the 7 genes previously not mapped by GWASs. Further, using miRNA mimics and reporter gene assays, we showed that miR-3162-5p has specific affinity for the KLK3 rs1058205 miRSNP T-allele whilst miR-370 has greater affinity for the VAMP8 rs1010 miRSNP A-allele, validating their functional role. Significance Findings from this large association study suggest that a focus on miRSNPs, including functional evaluation, can identify candidate risk loci below currently accepted statistical levels of genome-wide significance. Studies of miRNAs and their interactions with SNPs could provide further insights into the mechanisms of prostate cancer risk. PMID:25691096
Kapoor, Abhijeet; Shandilya, Manish; Kundu, Suman
2011-01-01
Human dopamine β-hydroxylase (DBH) is an important therapeutic target for complex traits. Several single nucleotide polymorphisms (SNPs) have also been identified in DBH with potential adverse physiological effect. However, difficulty in obtaining diffractable crystals and lack of a suitable template for modeling the protein has ensured that neither crystallographic three-dimensional structure nor computational model for the enzyme is available to aid rational drug design, prediction of functional significance of SNPs or analytical protein engineering. Adequate biochemical information regarding human DBH, structural coordinates for peptidylglycine alpha-hydroxylating monooxygenase and computational data from a partial model of rat DBH were used along with logical manual intervention in a novel way to build an in silico model of human DBH. The model provides structural insight into the active site, metal coordination, subunit interface, substrate recognition and inhibitor binding. It reveals that DOMON domain potentially promotes tetramerization, while substrate dopamine and a potential therapeutic inhibitor nepicastat are stabilized in the active site through multiple hydrogen bonding. Functional significance of several exonic SNPs could be described from a structural analysis of the model. The model confirms that SNP resulting in Ala318Ser or Leu317Pro mutation may not influence enzyme activity, while Gly482Arg might actually do so being in the proximity of the active site. Arg549Cys may cause abnormal oligomerization through non-native disulfide bond formation. Other SNPs like Glu181, Glu250, Lys239 and Asp290 could potentially inhibit tetramerization thus affecting function. The first three-dimensional model of full-length human DBH protein was obtained in a novel manner with a set of experimental data as guideline for consistency of in silico prediction. Preliminary physicochemical tests validated the model. The model confirms, rationalizes and provides structural basis for several biochemical data and claims testable hypotheses regarding function. It provides a reasonable template for drug design as well.
Genetic Variants of BMP2 and Their Association with the Risk of Non-Syndromic Tooth Agenesis
Wang, Yuting; Gu, Ning; Ma, Lan; Xu, Min; Ma, Junqing; Zhang, Weibing; Pan, Yongchu; Wang, Lin
2016-01-01
Non-syndromic tooth agenesis (or non-syndromic congenitally missing tooth) is one of the most common congenital defects in humans affecting the craniofacial function and appearance. Single nucleotide polymorphisms (SNPs) have been associated with an individual’s susceptibility to these anomalies. The aim of the present study was therefore to investigate the roles of the potentially functional SNPs of BMP2 in the occurrence of tooth agenesis. Overall, four potentially functional SNPs of BMP2 (rs15705, rs235768, rs235769 and rs3178250) were selected, and their associations with the susceptibility of tooth agenesis were evaluated in a case-control study of 335 non-syndromic tooth agenesis cases and 444 healthy controls. The SNPs rs15705 and rs3178250 were found to be associated with an individual’s risk of tooth agenesis (P = 0.046 and P = 0.039, respectively). Both SNPs showed an increased risk of mandibular incisor agenesis (rs15705, AA/AC vs. CC = 1.58, 95% CI = [1.06–2.34], P = 0.024; rs3178250, TT/TC vs. CC = 1.60, 95% CI = [1.08–2.37], P = 0.020). Bioinformatics analysis indicated that these two SNPs located at the 3’-untranslated region (3’-UTR) of BMP2 might alter the binding ability of miR-1273d and miR-4639-5p, respectively, which was confirmed by luciferase activity assays in the 293A and COS7 cell lines (P < 0.001 in 293A and P < 0.01 in COS7 for miR-1273d; and P < 0.001 in both cells for miR-4639-5p). Furthermore, BMP2 mRNA expression decreased after transfecting either miR-1273d or miR-4639-5p into these two cell lines (P < 0.01 in 293A and P < 0.001 in COS7 for miR-1273d, and P < 0.01 in both cell lines for miR-4639-5p). Taken together, our findings indicate that rs15705 and rs317250 are associated with the susceptibility of non-syndromic tooth agenesis by possibly affecting miRNAs and mRNA interaction. PMID:27362534
Genome-wide association analysis of seedling root development in maize (Zea mays L.).
Pace, Jordon; Gardner, Candice; Romay, Cinta; Ganapathysubramanian, Baskar; Lübberstedt, Thomas
2015-02-05
Plants rely on the root system for anchorage to the ground and the acquisition and absorption of nutrients critical to sustaining productivity. A genome wide association analysis enables one to analyze allelic diversity of complex traits and identify superior alleles. 384 inbred lines from the Ames panel were genotyped with 681,257 single nucleotide polymorphism markers using Genotyping-by-Sequencing technology and 22 seedling root architecture traits were phenotyped. Utilizing both a general linear model and mixed linear model, a GWAS study was conducted identifying 268 marker trait associations (p ≤ 5.3×10(-7)). Analysis of significant SNP markers for multiple traits showed that several were located within gene models with some SNP markers localized within regions of previously identified root quantitative trait loci. Gene model GRMZM2G153722 located on chromosome 4 contained nine significant markers. This predicted gene is expressed in roots and shoots. This study identifies putatively associated SNP markers associated with root traits at the seedling stage. Some SNPs were located within or near (<1 kb) gene models. These gene models identify possible candidate genes involved in root development at the seedling stage. These and respective linked or functional markers could be targets for breeders for marker assisted selection of seedling root traits.
Zhang, Huiqing; Guo, Xu; Dai, Jingyao; Wu, Yousheng; Ge, Naijian; Yang, Yefa; Ji, Jiansong; Zhang, Hongxin
2014-11-01
Metabolic reprogramming is an important hallmark of cancer cells, including the alterations of activity and expression in tricarboxylic acid (TCA) cycle key enzymes. Previous studies have reported the associations between tumor formation and three core enzymes involved in the TCA cycle. However, the association between functional single nucleotide polymorphisms (SNPs) in one of TCA cycle key gene isocitrate dehydrogenase (IDH) and the overall survival of hepatocellular carcinoma (HCC) patients treated with transcatheter arterial chemoembolization (TACE) has never been investigated. Five functional SNPs in IDH1 and IDH2 genes were genotyped using the Sequenom iPLEX genotyping system in a cohort of 419 unresectable Chinese HCC patients treated with TACE. Multivariate Cox proportional hazards model and Kaplan-Meier curve were used for the prognosis analysis. We found that SNPs rs12478635 in IDH1 and rs11632348 in IDH2 gene exhibited significant associations with death risk in HCC patients in the dominant model (HR 1.33; 95 % CI 1.02-1.73; P = 0.037) and in recessive model (HR 1.87; 95 % CI 1.27-2.75; P = 0.001), respectively. Moreover, we observed a cumulative effect of these two SNPs on HCC overall survival, indicating a significant trend of death risk increase with increasing number of unfavorable genotypes (P for trend = 0.001). Additionally, our data suggest that unfavorable genotypes of two SNPs may be used as an independent prognostic marker in those with advanced stage and patients with serum AFP <200 μg/L. Our results for the first time suggest that IDH gene polymorphisms may serve as an independent prognostic marker for HCC patients treated with TACE.
Khan, Imran; Ansari, Irfan A; Singh, Pratichi; Dass J, Febin Prabhu
2017-09-01
The phosphatase and tensin homolog (PTEN) gene plays a crucial role in signal transduction by negatively regulating the PI3K signaling pathway. It is the most frequent mutated gene in many human-related cancers. Considering its critical role, a functional analysis of missense mutations of PTEN gene was undertaken in this study. Thirty five nonsynonymous single nucleotide polymorphisms (nsSNPs) within the coding region of the PTEN gene were selected for our in silico investigation, and five nsSNPs (G129E, C124R, D252G, H61D, and R130G) were found to be deleterious based on combinatorial predictions of different computational tools. Moreover, molecular dynamics (MD) simulation was performed to investigate the conformational variation between native and all the five mutant PTEN proteins having predicted deleterious nsSNPs. The results of MD simulation of all mutant models illustrated variation in structural attributes such as root-mean-square deviation, root-mean-square fluctuation, radius of gyration, and total energy; which depicts the structural stability of PTEN protein. Furthermore, mutant PTEN protein structures also showed a significant variation in the solvent accessible surface area and hydrogen bond frequencies from the native PTEN structure. In conclusion, results of this study have established the deleterious effect of the all the five predicted nsSNPs on the PTEN protein structure. Thus, results of the current study can pave a new platform to sort out nsSNPs that can be undertaken for the confirmation of their phenotype and their correlation with diseased status in case of control studies. © 2016 International Union of Biochemistry and Molecular Biology, Inc.
Genetics of Sputum Gene Expression in Chronic Obstructive Pulmonary Disease
Qiu, Weiliang; Cho, Michael H.; Riley, John H.; Anderson, Wayne H.; Singh, Dave; Bakke, Per; Gulsvik, Amund; Litonjua, Augusto A.; Lomas, David A.; Crapo, James D.; Beaty, Terri H.; Celli, Bartolome R.; Rennard, Stephen; Tal-Singer, Ruth; Fox, Steven M.; Silverman, Edwin K.; Hersh, Craig P.
2011-01-01
Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus. PMID:21949713
Lei, Huimeng; Yan, Zhangming; Sun, Xiaohong; Zhang, Yue; Wang, Jianhong; Ma, Caihong; Xu, Qunyuan; Wang, Rui; Jarvis, Erich D; Sun, Zhirong
2017-11-01
Human and several nonhuman species share the rare ability of modifying acoustic and/or syntactic features of sounds produced, i.e. vocal learning, which is the important neurobiological and behavioral substrate of human speech/language. This convergent trait was suggested to be associated with significant genomic convergence and best manifested at the ROBO-SLIT axon guidance pathway. Here we verified the significance of such genomic convergence and assessed its functional relevance to human speech/language using human genetic variation data. In normal human populations, we found the affected amino acid sites were well fixed and accompanied with significantly more associated protein-coding SNPs in the same genes than the rest genes. Diseased individuals with speech/language disorders have significant more low frequency protein coding SNPs but they preferentially occurred outside the affected genes. Such patients' SNPs were enriched in several functional categories including two axon guidance pathways (mediated by netrin and semaphorin) that interact with ROBO-SLITs. Four of the six patients have homozygous missense SNPs on PRAME gene family, one youngest gene family in human lineage, which possibly acts upon retinoic acid receptor signaling, similarly as FOXP2, to modulate axon guidance. Taken together, we suggest the axon guidance pathways (e.g. ROBO-SLIT, PRAME gene family) served as common targets for human speech/language evolution and related disorders. Copyright © 2017 Elsevier Inc. All rights reserved.
Zhao, Na; Xiao, Jianqiu; Zheng, Zhiyong; Fei, Guoqiang; Zhang, Feng; Jin, Lirong; Zhong, Chunjiu
2015-04-01
Our previous studies have demonstrated that ceruloplasmin (CP) dysmetabolism is correlated with Parkinson's disease (PD). However, the causes of decreased serum CP levels in PD patients remain to be clarified. This study aimed to explore the potential association between genetic variants of the CP gene and PD. Clinical features, serum CP levels, and the CP gene (both promoter and coding regions) were analyzed in 60 PD patients and 50 controls. A luciferase reporter system was used to investigate the function of promoter single-nucleotide polymorphisms (SNPs). High-density comparative genomic hybridization microarrays were also used to detect large-scale copy-number variations in CP and an additional 47 genes involved in PD and/or copper/iron metabolism. The frequencies of eight SNPs (one intronic SNP and seven promoter SNPs of the CP gene) and their haplotypes were significantly different between PD patients, especially those with lowered serum CP levels, and controls. However, the luciferase reporter system revealed no significant effect of the risk haplotype on promoter activity of the CP gene. Neither these SNPs nor their haplotypes were correlated with the Hoehn and Yahr staging of PD. The results of this study suggest that common genetic variants of CP are associated with PD and further investigation is needed to explore their functions in PD.
The association of telomere length and genetic variation in telomere biology genes.
Mirabello, Lisa; Yu, Kai; Kraft, Peter; De Vivo, Immaculata; Hunter, David J; Prescott, Jennifer; Wong, Jason Y Y; Chatterjee, Nilanjan; Hayes, Richard B; Savage, Sharon A
2010-09-01
Telomeres cap chromosome ends and are critical for genomic stability. Many telomere-associated proteins are important for telomere length maintenance. Recent genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) in genes encoding telomere-associated proteins (RTEL1 and TERT-CLPTM1) as markers of cancer risk. We conducted an association study of telomere length and 743 SNPs in 43 telomere biology genes. Telomere length in peripheral blood DNA was determined by Q-PCR in 3,646 participants from the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial and Nurses' Health Study. We investigated associations by SNP, gene, and pathway (functional group). We found no associations between telomere length and SNPs in TERT-CLPTM1L or RTEL1. Telomere length was not significantly associated with specific functional groups. Thirteen SNPs from four genes (MEN1, MRE11A, RECQL5, and TNKS) were significantly associated with telomere length. The strongest findings were in MEN1 (gene-based P=0.006), menin, which associates with the telomerase promoter and may negatively regulate telomerase. This large association study did not find strong associations with telomere length. The combination of limited diversity and evolutionary conservation suggest that these genes may be under selective pressure. More work is needed to explore the role of genetic variants in telomere length regulation. Published 2010 Wiley-Liss, Inc.
Iquebal, M A; Jaiswal, Sarika; Mahato, Ajay Kumar; Jayaswal, Pawan K; Angadi, U B; Kumar, Neeraj; Sharma, Nimisha; Singh, Anand K; Srivastav, Manish; Prakash, Jai; Singh, S K; Khan, Kasim; Mishra, Rupesh K; Rajan, Shailendra; Bajpai, Anju; Sandhya, B S; Nischita, Puttaraju; Ravishankar, K V; Dinesh, M R; Rai, Anil; Kumar, Dinesh; Sharma, Tilak R; Singh, Nagendra K
2017-11-02
Mango is one of the most important fruits of tropical ecological region of the world, well known for its nutritive value, aroma and taste. Its world production is >45MT worth >200 billion US dollars. Genomic resources are required for improvement in productivity and management of mango germplasm. There is no web-based genomic resources available for mango. Hence rapid and cost-effective high throughput putative marker discovery is required to develop such resources. RAD-based marker discovery can cater this urgent need till whole genome sequence of mango becomes available. Using a panel of 84 mango varieties, a total of 28.6 Gb data was generated by ddRAD-Seq approach on Illumina HiSeq 2000 platform. A total of 1.25 million SNPs were discovered. Phylogenetic tree using 749 common SNPs across these varieties revealed three major lineages which was compared with geographical locations. A web genomic resources MiSNPDb, available at http://webtom.cabgrid.res.in/mangosnps/ is based on 3-tier architecture, developed using PHP, MySQL and Javascript. This web genomic resources can be of immense use in the development of high density linkage map, QTL discovery, varietal differentiation, traceability, genome finishing and SNP chip development for future GWAS in genomic selection program. We report here world's first web-based genomic resources for genetic improvement and germplasm management of mango.
Whole-genome analyses of Korean native and Holstein cattle breeds by massively parallel sequencing.
Choi, Jung-Woo; Liao, Xiaoping; Stothard, Paul; Chung, Won-Hyong; Jeon, Heoyn-Jeong; Miller, Stephen P; Choi, So-Young; Lee, Jeong-Koo; Yang, Bokyoung; Lee, Kyung-Tai; Han, Kwang-Jin; Kim, Hyeong-Cheol; Jeong, Dongkee; Oh, Jae-Don; Kim, Namshin; Kim, Tae-Hun; Lee, Hak-Kyo; Lee, Sung-Jin
2014-01-01
A main goal of cattle genomics is to identify DNA differences that account for variations in economically important traits. In this study, we performed whole-genome analyses of three important cattle breeds in Korea--Hanwoo, Jeju Heugu, and Korean Holstein--using the Illumina HiSeq 2000 sequencing platform. We achieved 25.5-, 29.6-, and 29.5-fold coverage of the Hanwoo, Jeju Heugu, and Korean Holstein genomes, respectively, and identified a total of 10.4 million single nucleotide polymorphisms (SNPs), of which 54.12% were found to be novel. We also detected 1,063,267 insertions-deletions (InDels) across the genomes (78.92% novel). Annotations of the datasets identified a total of 31,503 nonsynonymous SNPs and 859 frameshift InDels that could affect phenotypic variations in traits of interest. Furthermore, genome-wide copy number variation regions (CNVRs) were detected by comparing the Hanwoo, Jeju Heugu, and previously published Chikso genomes against that of Korean Holstein. A total of 992, 284, and 1881 CNVRs, respectively, were detected throughout the genome. Moreover, 53, 65, 45, and 82 putative regions of homozygosity (ROH) were identified in Hanwoo, Jeju Heugu, Chikso, and Korean Holstein respectively. The results of this study provide a valuable foundation for further investigations to dissect the molecular mechanisms underlying variation in economically important traits in cattle and to develop genetic markers for use in cattle breeding.
Liu, Jun-Jun; Shamoun, Simon Francis; Leal, Isabel; Kowbel, Robert; Sumampong, Grace; Zamany, Arezoo
2018-05-01
Characterization of genes involved in differentiation of pathogen species and isolates with variations of virulence traits provides valuable information to control tree diseases for meeting the challenges of sustainable forest health and phytosanitary trade issues. Lack of genetic knowledge and genomic resources hinders novel gene discovery, molecular mechanism studies and development of diagnostic tools in the management of forest pathogens. Here, we report on transcriptome profiling of Heterobasidion occidentale isolates with contrasting virulence levels. Comparative transcriptomic analysis identified orthologous groups exclusive to H. occidentale and its isolates, revealing biological processes involved in the differentiation of isolates. Further bioinformatics analyses identified an H. occidentale secretome, CYPome and other candidate effectors, from which genes with species- and isolate-specific expression were characterized. A large proportion of differentially expressed genes were revealed to have putative activities as cell wall modification enzymes and transcription factors, suggesting their potential roles in virulence and fungal pathogenesis. Next, large numbers of simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were detected, including more than 14 000 interisolate non-synonymous SNPs. These polymorphic loci and species/isolate-specific genes may contribute to virulence variations and provide ideal DNA markers for development of diagnostic tools and investigation of genetic diversity. © 2018 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Post-traumatic stress disorder is associated with PACAP and the PAC1 receptor.
Ressler, Kerry J; Mercer, Kristina B; Bradley, Bekh; Jovanovic, Tanja; Mahan, Amy; Kerley, Kimberly; Norrholm, Seth D; Kilaru, Varun; Smith, Alicia K; Myers, Amanda J; Ramirez, Manuel; Engel, Anzhelika; Hammack, Sayamwong E; Toufexis, Donna; Braas, Karen M; Binder, Elisabeth B; May, Victor
2011-02-24
Pituitary adenylate cyclase-activating polypeptide (PACAP) is known to broadly regulate the cellular stress response. In contrast, it is unclear if the PACAP-PAC1 receptor pathway has a role in human psychological stress responses, such as post-traumatic stress disorder (PTSD). Here we find, in heavily traumatized subjects, a sex-specific association of PACAP blood levels with fear physiology, PTSD diagnosis and symptoms in females. We examined 44 single nucleotide polymorphisms (SNPs) spanning the PACAP (encoded by ADCYAP1) and PAC1 (encoded by ADCYAP1R1) genes, demonstrating a sex-specific association with PTSD. A single SNP in a putative oestrogen response element within ADCYAP1R1, rs2267735, predicts PTSD diagnosis and symptoms in females only. This SNP also associates with fear discrimination and with ADCYAP1R1 messenger RNA expression in human brain. Methylation of ADCYAP1R1 in peripheral blood is also associated with PTSD. Complementing these human data, ADCYAP1R1 mRNA is induced with fear conditioning or oestrogen replacement in rodent models. These data suggest that perturbations in the PACAP-PAC1 pathway are involved in abnormal stress responses underlying PTSD. These sex-specific effects may occur via oestrogen regulation of ADCYAP1R1. PACAP levels and ADCYAP1R1 SNPs may serve as useful biomarkers to further our mechanistic understanding of PTSD.
Whole-Genome Analyses of Korean Native and Holstein Cattle Breeds by Massively Parallel Sequencing
Stothard, Paul; Chung, Won-Hyong; Jeon, Heoyn-Jeong; Miller, Stephen P.; Choi, So-Young; Lee, Jeong-Koo; Yang, Bokyoung; Lee, Kyung-Tai; Han, Kwang-Jin; Kim, Hyeong-Cheol; Jeong, Dongkee; Oh, Jae-Don; Kim, Namshin; Kim, Tae-Hun; Lee, Hak-Kyo; Lee, Sung-Jin
2014-01-01
A main goal of cattle genomics is to identify DNA differences that account for variations in economically important traits. In this study, we performed whole-genome analyses of three important cattle breeds in Korea—Hanwoo, Jeju Heugu, and Korean Holstein—using the Illumina HiSeq 2000 sequencing platform. We achieved 25.5-, 29.6-, and 29.5-fold coverage of the Hanwoo, Jeju Heugu, and Korean Holstein genomes, respectively, and identified a total of 10.4 million single nucleotide polymorphisms (SNPs), of which 54.12% were found to be novel. We also detected 1,063,267 insertions–deletions (InDels) across the genomes (78.92% novel). Annotations of the datasets identified a total of 31,503 nonsynonymous SNPs and 859 frameshift InDels that could affect phenotypic variations in traits of interest. Furthermore, genome-wide copy number variation regions (CNVRs) were detected by comparing the Hanwoo, Jeju Heugu, and previously published Chikso genomes against that of Korean Holstein. A total of 992, 284, and 1881 CNVRs, respectively, were detected throughout the genome. Moreover, 53, 65, 45, and 82 putative regions of homozygosity (ROH) were identified in Hanwoo, Jeju Heugu, Chikso, and Korean Holstein respectively. The results of this study provide a valuable foundation for further investigations to dissect the molecular mechanisms underlying variation in economically important traits in cattle and to develop genetic markers for use in cattle breeding. PMID:24992012
Kulkarni, Krishnanand P; Patil, Gunvant; Valliyodan, Babu; Vuong, Tri D; Shannon, J Grover; Nguyen, Henry T; Lee, Jeong-Dong
2018-03-01
The objective of this study was to determine the genetic relationship between the oleic acid and protein content. The genotypes having high oleic acid and elevated protein (HOEP) content were crossed with five elite lines having normal oleic acid and average protein (NOAP) content. The selected accessions were grown at six environments in three different locations and phenotyped for protein, oil, and fatty acid components. The mean protein content of parents, HOEP, and NOAP lines was 34.6%, 38%, and 34.9%, respectively. The oleic acid concentration of parents, HOEP, and NOAP lines was 21.7%, 80.5%, and 20.8%, respectively. The HOEP plants carried both FAD2-1A (S117N) and FAD2-1B (P137R) mutant alleles contributing to the high oleic acid phenotype. Comparative genome analysis using whole-genome resequencing data identified six genes having single nucleotide polymorphism (SNP) significantly associated with the traits analyzed. A single SNP in the putative gene Glyma.10G275800 was associated with the elevated protein content, and palmitic, oleic, and linoleic acids. The genes from the marker intervals of previously identified QTL did not carry SNPs associated with protein content and fatty acid composition in the lines used in this study, indicating that all the genes except Glyma.10G278000 may be the new genes associated with the respective traits.
Carroll, Matthew B; Smith, Derek M; Shaak, Thomas L
2017-03-01
It remains unclear why the dose of xanthine oxidase inhibitors (XOI) allopurinol or febuxostat varies among patients though they reach similar serum uric acid (SUA) goal. We pursued genomic sequencing of XOI metabolism and clearance genes to identify single-nucleotide polymorphisms (SNPs) relate to differences in XOI dose. Subjects with a diagnosis of Gout based on the 1977 American College of Rheumatology Classification Criteria for the disorder, who were on stable doses of a XOI, and who were at their goal SUA level, were enrolled. The primary outcome was relationship between SNPs in any of these genes to XOI dose. The secondary outcome was relationship between SNPs and change in pre- and post-treatment SUA. We enrolled 100 subjects. The average patient age was 68.6 ± 10.6 years old. Over 80% were men and 77% were Caucasian. One SNP was associated with a higher XOI dose: rs75995567 (p = 0.031). Two SNPs were associated with 300 mg daily of allopurinol: rs11678615 (p = 0.022) and rs3731722 on Aldehyde Oxidase (AO) (His1297Arg) (p = 0.001). Two SNPs were associated with a lower dose of allopurinol: rs1884725 (p = 0.033) and rs34650714 (p = 0.006). For the secondary outcome, rs13415401 was the only SNP related to a smaller mean SUA change. Ten SNPs were identified with a larger change in SUA. Though multiple SNPs were identified in the primary and secondary outcomes of this study, rs3731722 is known to alter catalytic function for some aldehyde oxidase substrates.
Krishnan, Michelle L.; Wang, Zi; Aljabar, Paul; Ball, Gareth; Mirza, Ghazala; Saxena, Alka; Counsell, Serena J.; Hajnal, Joseph V.; Montana, Giovanni
2017-01-01
Preterm infants show abnormal structural and functional brain development, and have a high risk of long-term neurocognitive problems. The molecular and cellular mechanisms involved are poorly understood, but novel methods now make it possible to address them by examining the relationship between common genetic variability and brain endophenotype. We addressed the hypothesis that variability in the Peroxisome Proliferator Activated Receptor (PPAR) pathway would be related to brain development. We employed machine learning in an unsupervised, unbiased, combined analysis of whole-brain diffusion tractography together with genomewide, single-nucleotide polymorphism (SNP)-based genotypes from a cohort of 272 preterm infants, using Sparse Reduced Rank Regression (sRRR) and correcting for ethnicity and age at birth and imaging. Empirical selection frequencies for SNPs associated with cerebral connectivity ranged from 0.663 to zero, with multiple highly selected SNPs mapping to genes for PPARG (six SNPs), ITGA6 (four SNPs), and FXR1 (two SNPs). SNPs in PPARG were significantly overrepresented (ranked 7–11 and 67 of 556,000 SNPs; P < 2.2 × 10−7), and were mostly in introns or regulatory regions with predicted effects including protein coding and nonsense-mediated decay. Edge-centric graph-theoretic analysis showed that highly selected white-matter tracts were consistent across the group and important for information transfer (P < 2.2 × 10−17); they most often connected to the insula (P < 6 × 10−17). These results suggest that the inhibited brain development seen in humans exposed to the stress of a premature extrauterine environment is modulated by genetic factors, and that PPARG signaling has a previously unrecognized role in cerebral development. PMID:29229843
Butte, Nancy F; Voruganti, V Saroja; Cole, Shelley A; Haack, Karin; Comuzzie, Anthony G; Muzny, Donna M; Wheeler, David A; Chang, Kyle; Hawes, Alicia; Gibbs, Richard A
2011-09-22
Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5' and 3' flanking regions of IRS2 (∼14.5 kb), were bidirectionally sequenced for single nucleotide polymorphism (SNP) discovery in 934 Hispanic children using 3730XL DNA Sequencers. Additionally, 15 SNPs derived from Illumina HumanOmni1-Quad BeadChips were analyzed. Measured genotype analysis tested associations between SNPs and obesity and diabetes-related traits. Bayesian quantitative trait nucleotide analysis was used to statistically infer the most likely functional polymorphisms. A total of 140 SNPs were identified with minor allele frequencies (MAF) ranging from 0.001 to 0.47. Forty-two of the 70 coding SNPs result in nonsynonymous amino acid substitutions relative to the consensus sequence; 28 SNPs were detected in the promoter, 12 in introns, 28 in the 3'-UTR, and 2 in the 5'-UTR. Two insertion/deletions (indels) were detected. Ten independent rare SNPs (MAF = 0.001-0.009) were associated with obesity-related traits (P = 0.01-0.00002). SNP 10510452_139 in the promoter region was shown to have a high posterior probability (P = 0.77-0.86) of influencing BMI, fat mass, and waist circumference in Hispanic children. SNP 10510452_139 contributed between 2 and 4% of the population variance in body weight and composition. None of the SNPs or indels were associated with diabetes-related traits or accounted for a previously identified quantitative trait locus on chromosome 13 for fasting serum glucose. Rare but not common IRS2 variants may play a role in the regulation of body weight but not an essential role in fasting glucose homeostasis in Hispanic children.
El-Dakdouki, Mohammad H; Puré, Ellen; Huang, Xuefei
2013-05-07
Nanoparticles (NPs) are being extensively studied as carriers for drug delivery, but they often have limited penetration inside tumors. We envision that by targeting an endocytic receptor on the cell surface, the uptake of NPs can be significantly enhanced through receptor mediated endocytosis. In addition, if the receptor is recycled to the cell surface, the NP cargo can be transported out of the cells, which is then taken up by neighboring cells thus enhancing solid tumor penetration. To validate our hypothesis, in the first of two articles, we report the synthesis of doxorubicin (DOX)-loaded, hyaluronan (HA) coated silica nanoparticles (SNPs) containing a highly fluorescent core to target CD44, a receptor expressed on the cancer cell surface. HA was conjugated onto amine-functionalized SNPs prepared through an oil-water microemulsion method. The immobilization of the cytotoxic drug DOX was achieved through an acid sensitive hydrazone linkage. The NPs were fully characterized by transmission electron microscopy (TEM), dynamic light scattering (DLS), zeta potential measurements, thermogravimetric analysis (TGA), UV-vis absorbance, and nuclear magnetic resonance (NMR). Initial biological evaluation experiments demonstrated that compared to ligand-free SNPs, the uptake of HA-SNPs by the CD44-expressing SKOV-3 ovarian cancer cells was significantly enhanced when evaluated in the 2D monolayer cell culture. Mechanistic studies suggested that cellular uptake of HA-SNPs was mainly through CD44 mediated endocytosis. HA-SNPs with immobilized DOX were endocytosed efficiently by the SKOV-3 cells as well. The enhanced tumor penetration and drug delivery properties of HA-SNPs will be evaluated in 3D tumor models in the subsequent paper.
Rankinen, Tuomo; Sung, Yun Ju; Sarzynski, Mark A; Rice, Treva K; Rao, D C; Bouchard, Claude
2012-03-01
Endurance training-induced changes in hemodynamic traits are heritable. However, few genes associated with heart rate training responses have been identified. The purpose of our study was to perform a genome-wide association study to uncover DNA sequence variants associated with submaximal exercise heart rate training responses in the HERITAGE Family Study. Heart rate was measured during steady-state exercise at 50 W (HR50) on 2 separate days before and after a 20-wk endurance training program in 483 white subjects from 99 families. Illumina HumanCNV370-Quad v3.0 BeadChips were genotyped using the Illumina BeadStation 500GX platform. After quality control procedures, 320,000 single-nucleotide polymorphisms (SNPs) were available for the genome-wide association study analyses, which were performed using the MERLIN software package (single-SNP analyses and conditional heritability tests) and standard regression models (multivariate analyses). The strongest associations for HR50 training response adjusted for age, sex, body mass index, and baseline HR50 were detected with SNPs at the YWHAQ locus on chromosome 2p25 (P = 8.1 × 10(-7)), the RBPMS locus on chromosome 8p12 (P = 3.8 × 10(-6)), and the CREB1 locus on chromosome 2q34 (P = 1.6 × 10(-5)). In addition, 37 other SNPs showed P values <9.9 × 10(-5). After removal of redundant SNPs, the 10 most significant SNPs explained 35.9% of the ΔHR50 variance in a multivariate regression model. Conditional heritability tests showed that nine of these SNPs (all intragenic) accounted for 100% of the ΔHR50 heritability. Our results indicate that SNPs in nine genes related to cardiomyocyte and neuronal functions, as well as cardiac memory formation, fully account for the heritability of the submaximal heart rate training response.
Sung, Yun Ju; Sarzynski, Mark A.; Rice, Treva K.; Rao, D. C.; Bouchard, Claude
2012-01-01
Endurance training-induced changes in hemodynamic traits are heritable. However, few genes associated with heart rate training responses have been identified. The purpose of our study was to perform a genome-wide association study to uncover DNA sequence variants associated with submaximal exercise heart rate training responses in the HERITAGE Family Study. Heart rate was measured during steady-state exercise at 50 W (HR50) on 2 separate days before and after a 20-wk endurance training program in 483 white subjects from 99 families. Illumina HumanCNV370-Quad v3.0 BeadChips were genotyped using the Illumina BeadStation 500GX platform. After quality control procedures, 320,000 single-nucleotide polymorphisms (SNPs) were available for the genome-wide association study analyses, which were performed using the MERLIN software package (single-SNP analyses and conditional heritability tests) and standard regression models (multivariate analyses). The strongest associations for HR50 training response adjusted for age, sex, body mass index, and baseline HR50 were detected with SNPs at the YWHAQ locus on chromosome 2p25 (P = 8.1 × 10−7), the RBPMS locus on chromosome 8p12 (P = 3.8 × 10−6), and the CREB1 locus on chromosome 2q34 (P = 1.6 × 10−5). In addition, 37 other SNPs showed P values <9.9 × 10−5. After removal of redundant SNPs, the 10 most significant SNPs explained 35.9% of the ΔHR50 variance in a multivariate regression model. Conditional heritability tests showed that nine of these SNPs (all intragenic) accounted for 100% of the ΔHR50 heritability. Our results indicate that SNPs in nine genes related to cardiomyocyte and neuronal functions, as well as cardiac memory formation, fully account for the heritability of the submaximal heart rate training response. PMID:22174390
The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies
Barnett, Ian; Mukherjee, Rajarshi; Lin, Xihong
2017-01-01
It is of substantial interest to study the effects of genes, genetic pathways, and networks on the risk of complex diseases. These genetic constructs each contain multiple SNPs, which are often correlated and function jointly, and might be large in number. However, only a sparse subset of SNPs in a genetic construct is generally associated with the disease of interest. In this article, we propose the generalized higher criticism (GHC) to test for the association between an SNP set and a disease outcome. The higher criticism is a test traditionally used in high-dimensional signal detection settings when marginal test statistics are independent and the number of parameters is very large. However, these assumptions do not always hold in genetic association studies, due to linkage disequilibrium among SNPs and the finite number of SNPs in an SNP set in each genetic construct. The proposed GHC overcomes the limitations of the higher criticism by allowing for arbitrary correlation structures among the SNPs in an SNP-set, while performing accurate analytic p-value calculations for any finite number of SNPs in the SNP-set. We obtain the detection boundary of the GHC test. We compared empirically using simulations the power of the GHC method with existing SNP-set tests over a range of genetic regions with varied correlation structures and signal sparsity. We apply the proposed methods to analyze the CGEM breast cancer genome-wide association study. Supplementary materials for this article are available online. PMID:28736464
Association of Genetic Variants of Small Non-Coding RNAs with Survival in Colorectal Cancer
Pao, Jiunn-Bey; Lu, Te-Ling; Ting, Wen-Chien; Chen, Lu-Min; Bao, Bo-Ying
2018-01-01
Background: Single nucleotide polymorphisms (SNPs) of small non-coding RNAs (sncRNAs) can influence sncRNA function and target gene expression to mediate the risk of certain diseases. The aim of the present study was to evaluate the prognostic relevance of sncRNA SNPs for colorectal cancer, which has not been well characterized to date. Methods: We comprehensively examined 31 common SNPs of sncRNAs, and assessed the impact of these variants on survival in a cohort of 188 patients with colorectal cancer. Results: Three SNPs were significantly associated with survival of patients with colorectal cancer after correction for multiple testing, and two of the SNPs (hsa-mir-196a-2 rs11614913 and U85 rs714775) remained significant in multivariate analyses. Additional in silico analysis provided further evidence of this association, since the expression levels of the target genes of the hsa-miR-196a (HOXA7, HOXB8, and AKT1) were significantly correlated with colorectal cancer progression. Furthermore, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses indicated that hsa-miR-196a is associated with well-known oncogenic pathways, including cellular protein modification process, mitotic cell cycle, adherens junction, and extracellular matrix receptor interaction pathways. Conclusion: Our results suggest that SNPs of sncRNAs could play a critical role in cancer progression, and that hsa-miR-196a might be a valuable biomarker or therapeutic target for colorectal cancer patients. PMID:29483812
Prioritizing individual genetic variants after kernel machine testing using variable selection.
He, Qianchuan; Cai, Tianxi; Liu, Yang; Zhao, Ni; Harmon, Quaker E; Almli, Lynn M; Binder, Elisabeth B; Engel, Stephanie M; Ressler, Kerry J; Conneely, Karen N; Lin, Xihong; Wu, Michael C
2016-12-01
Kernel machine learning methods, such as the SNP-set kernel association test (SKAT), have been widely used to test associations between traits and genetic polymorphisms. In contrast to traditional single-SNP analysis methods, these methods are designed to examine the joint effect of a set of related SNPs (such as a group of SNPs within a gene or a pathway) and are able to identify sets of SNPs that are associated with the trait of interest. However, as with many multi-SNP testing approaches, kernel machine testing can draw conclusion only at the SNP-set level, and does not directly inform on which one(s) of the identified SNP set is actually driving the associations. A recently proposed procedure, KerNel Iterative Feature Extraction (KNIFE), provides a general framework for incorporating variable selection into kernel machine methods. In this article, we focus on quantitative traits and relatively common SNPs, and adapt the KNIFE procedure to genetic association studies and propose an approach to identify driver SNPs after the application of SKAT to gene set analysis. Our approach accommodates several kernels that are widely used in SNP analysis, such as the linear kernel and the Identity by State (IBS) kernel. The proposed approach provides practically useful utilities to prioritize SNPs, and fills the gap between SNP set analysis and biological functional studies. Both simulation studies and real data application are used to demonstrate the proposed approach. © 2016 WILEY PERIODICALS, INC.
Kim, Kyung-Seon; Kim, Ghi-Su; Hwang, Joo-Yeon; Lee, Hye-Ja; Park, Mi-Hyun; Kim, Kwang-joong; Jung, Jongsun; Cha, Hyo-Soung; Shin, Hyoung Doo; Kang, Jong-Ho; Park, Eui Kyun; Kim, Tae-Ho; Hong, Jung-Min; Koh, Jung-Min; Oh, Bermseok; Kimm, Kuchan; Kim, Shin-Yoon; Lee, Jong-Young
2007-01-01
Background Osteoporosis is defined as the loss of bone mineral density that leads to bone fragility with aging. Population-based case-control studies have identified polymorphisms in many candidate genes that have been associated with bone mass maintenance or osteoporotic fracture. To investigate single nucleotide polymorphisms (SNPs) that are associated with osteoporosis, we examined the genetic variation among Koreans by analyzing 81 genes according to their function in bone formation and resorption during bone remodeling. Methods We resequenced all the exons, splice junctions and promoter regions of candidate osteoporosis genes using 24 unrelated Korean individuals. Using the common SNPs from our study and the HapMap database, a statistical analysis of deviation in heterozygosity depicted. Results We identified 942 variants, including 888 SNPs, 43 insertion/deletion polymorphisms, and 11 microsatellite markers. Of the SNPs, 557 (63%) had been previously identified and 331 (37%) were newly discovered in the Korean population. When compared SNPs in the Korean population with those in HapMap database, 1% (or less) of SNPs in the Japanese and Chinese subpopulations and 20% of those in Caucasian and African subpopulations were significantly differentiated from the Hardy-Weinberg expectations. In addition, an analysis of the genetic diversity showed that there were no significant differences among Korean, Han Chinese and Japanese populations, but African and Caucasian populations were significantly differentiated in selected genes. Nevertheless, in the detailed analysis of genetic properties, the LD and Haplotype block patterns among the five sub-populations were substantially different from one another. Conclusion Through the resequencing of 81 osteoporosis candidate genes, 118 unknown SNPs with a minor allele frequency (MAF) > 0.05 were discovered in the Korean population. In addition, using the common SNPs between our study and HapMap, an analysis of genetic diversity and deviation in heterozygosity was performed and the polymorphisms of the above genes among the five populations were substantially differentiated from one another. Further studies of osteoporosis could utilize the polymorphisms identified in our data since they may have important implications for the selection of highly informative SNPs for future association studies. PMID:18036257
Influences of APOA5 Variants on Plasma Triglyceride Levels in Uyghur Population
Wang, Yi; Wu, Di; Jin, Li; Wang, Xiaofeng
2014-01-01
Objective Single nucleotide polymorphisms (SNPs) in apolipoprotein A5 (APOA5) gene are associated with triglyceride (TG) levels. However, the minor allele frequencies and linkage disequilibriums (LDs) of the SNPs in addition to their effects on TG levels vary greatly between Caucasians and East Asians. The distributions of the SNPs/haplotypes and their associations with TG levels in Uyghur population, an admixture population of Caucasians and East Asians, have not been reported to date. Here, we performed a cross-sectional study to address these. Methods Genotyping of four SNPs in APOA5 (rs662799, rs3135506, rs2075291, and rs2266788) was performed in 1174 unrelated Uyghur subjects. SNP/haplotype and TG association analyses were conducted. Results The frequencies of the SNPs in Uyghurs were in between those in Caucasians and East Asians. The LD between rs662799 and rs2266788 in Uyghurs was stronger than that in East Asians but weaker than that in Caucasians, and the four SNPs resulted in four haplotypes (TGGT, CGGC, TCGT, and CGTT arranged in the order of rs662799, rs3135506, rs2075291, and rs2266788) representing 99.2% of the population. All the four SNPs were significantly associated with TG levels. Compared with non-carriers, carriers of rs662799-C, rs3135506-C, rs2075291-T, and rs2266788-C alleles had 16.0%, 15.1%, 17.1%, and 12.4% higher TG levels, respectively. When haplotype TGGT was defined as the reference, the haplotypes CGGC, TCGT, and CGTT resulted in 16.1%, 19.0%, and 19.8% higher TG levels, respectively. The proportions of variance in TG explained by APOA5 locus were 2.5%, 0.3%, 0.4%, and 1.9% for single SNP rs662799, rs3135506, rs2075291, and rs2266788, respectively, and 3.0% for the haplotypes constructed by them. Conclusions The association profiles between the SNPs and haplotypes at APOA5 locus and TG levels in this admixture population differed from those in Caucasians and East Asians. The functions of these SNPs and haplotypes need to be elucidated comprehensively. PMID:25313938
Du, Qing-qing; Wang, Zhi-jun; He, Lin; Jiang, Xue-hua; Wang, Ling
2013-11-01
CYP3A4 is the main isoform of cytochrome P450 oxidases involved in the metabolism of approximately 60 % drugs, and its expression level is highly variable in human subjects. CYP3A4 is regulated by many transcription factors, among which the pregnane X receptor/steroid and xenobiotic receptor (PXR/SXR, NR1I2) have been identified as the most critical. Genetic polymorphisms (such as SNPs) in PXR may affect the expression level of CYP3A4. Although numerous SNPs have been identified in PXR and have appeared to affect PXR function, their impact on the expression of CYP3A4 in human subjects has not been well studied. Thus, a clinical study in healthy Chinese subjects was conducted to investigate the impact of PXR polymorphisms on repaglinide (an endogenous marker for CYP3A4 activity) pharmacokinetics used alone or in combination with a PXR inducer, flucloxacillin. Two SNPs, -298A>G and 11193T>C, were identified as the tag SNPs to represent the overall genetic polymorphic profile of PXR. To evaluate the potential functional change of these two SNPs, 24 healthy subjects were recruited in a pharmacokinetics/pharmacodynamics study of repaglinide with or without flucloxacillin. The pharmacokinetic parameters including AUC and T1/2 were significantly different among the PXR genotype groups. The SNPs of -298G/G and 11193C/C were found to be associated with a lower PXR activity resulting in reduction of CYP3A4 activity in vivo. After administration of flucloxacillin, a significant drug-drug interaction was observed. The clearance of repagnilide was significantly increased by concomitant flucloxacillin in a genotype dependent manner. The subjects with SNPs of -298G/G and 11193C/C appeared to be less sensitive to flucloxacillin. Our study results demonstrated for the first time the impact of genetic polymorphisms of PXR on the PK and PD of repaglinide, and showed that subjects with genotype of -298G/G and 11193C/C in PXR has a decreased elimination rate of 3A4/2C8. Furthermore, flucloxacillin was able to induce 3A4/2C8 expression mediated by PXR in a genotype dependent manner.
Comparative analysis of myostatin gene and promoter sequences of Qinchuan and Red Angus cattle.
He, Y L; Wu, Y H; Quan, F S; Liu, Y G; Zhang, Y
2013-09-04
To better understand the function of the myostatin gene and its promoter region in bovine, we amplified and sequenced the myostatin gene and promoter from the blood of Qinchuan and Red Angus cattle by using polymerase chain reaction. The sequences of Qinchuan and Red Angus cattle were compared with those of other cattle breeds available in GenBank. Exon splice sites were confirmed by mRNA sequencing. Compared to the published sequence (GenBank accession No. AF320998), 69 single nucleotide polymorphisms (SNPs) were identified in the Qinchuan myostatin gene, only one of which was an insertion mutation in Qinchuan cattle. There was a 16-bp insertion in the first 705-bp intron in 3 Qinchuan cattle. A total of 7 SNPs were identified in exon 3, in which the mutation occurred in the third base of the codon and was synonymous. On comparing the Qinchuan myostatin gene sequence to that of Red Angus cattle, a total of 50 SNPs were identified in the first and third exons. In addition, there were 18 SNPs identified in the Qinchuan cattle promoter region compared with those of other cattle compared to the Red Angus cattle myostatin promoter region. breeds (GenBank accession No. AF348479), but only 14 SNPs when compared to the Red Angus cattle myostatin promoter region.
Suciu, Maria C.; Telenius, Jelena
2017-01-01
In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k-mer-based analysis of DNase footprints to determine any k-mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. PMID:28904015
NagaSundaram, N; Priya Doss, C George
2011-01-01
Background: Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this approach, we have used the existing in silico methods to explore the mutation-structure-function relationship in the XPAgene. Materials and Methods: We used the Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), I-Mutant 2.0, and the Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function and evaluated the impact of mutation on protein stability by Molecular Dynamics simulations. Results: By comparing the scores of all the four in silico methods, nsSNP with an ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our Molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPAgene. Conclusion: Based on the in silico methods score, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that deleterious nsSNP at position C108F would play a significant role in causing disease by the XPA gene. Our approach would present the application of in silicotools in understanding the functional variation from the perspective of structure, evolution, and phenotype. PMID:22190868
snpGeneSets: An R Package for Genome-Wide Study Annotation
Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian
2016-01-01
Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048
SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.
Merelli, Ivan; Calabria, Andrea; Cozzi, Paolo; Viti, Federica; Mosca, Ettore; Milanesi, Luciano
2013-01-01
The capability of correlating specific genotypes with human diseases is a complex issue in spite of all advantages arisen from high-throughput technologies, such as Genome Wide Association Studies (GWAS). New tools for genetic variants interpretation and for Single Nucleotide Polymorphisms (SNPs) prioritization are actually needed. Given a list of the most relevant SNPs statistically associated to a specific pathology as result of a genotype study, a critical issue is the identification of genes that are effectively related to the disease by re-scoring the importance of the identified genetic variations. Vice versa, given a list of genes, it can be of great importance to predict which SNPs can be involved in the onset of a particular disease, in order to focus the research on their effects. We propose a new bioinformatics approach to support biological data mining in the analysis and interpretation of SNPs associated to pathologies. This system can be employed to design custom genotyping chips for disease-oriented studies and to re-score GWAS results. The proposed method relies (1) on the data integration of public resources using a gene-centric database design, (2) on the evaluation of a set of static biomolecular annotations, defined as features, and (3) on the SNP scoring function, which computes SNP scores using parameters and weights set by users. We employed a machine learning classifier to set default feature weights and an ontological annotation layer to enable the enrichment of the input gene set. We implemented our method as a web tool called SNPranker 2.0 (http://www.itb.cnr.it/snpranker), improving our first published release of this system. A user-friendly interface allows the input of a list of genes, SNPs or a biological process, and to customize the features set with relative weights. As result, SNPranker 2.0 returns a list of SNPs, localized within input and ontologically enriched genes, combined with their prioritization scores. Different databases and resources are already available for SNPs annotation, but they do not prioritize or re-score SNPs relying on a-priori biomolecular knowledge. SNPranker 2.0 attempts to fill this gap through a user-friendly integrated web resource. End users, such as researchers in medical genetics and epidemiology, may find in SNPranker 2.0 a new tool for data mining and interpretation able to support SNPs analysis. Possible scenarios are GWAS data re-scoring, SNPs selection for custom genotyping arrays and SNPs/diseases association studies.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yin Ming; Liao Zhongxing; Liu Zhensheng
2011-11-01
Purpose: To explore whether functional single nucleotide polymorphisms (SNPs) of base-excision repair genes are predictors of radiation treatment-related pneumonitis (RP), we investigated associations between functional SNPs of ADPRT, APEX1, and XRCC1 and RP development. Methods and Materials: We genotyped SNPs of ADPRT (rs1136410 [V762A]), XRCC1 (rs1799782 [R194W], rs25489 [R280H], and rs25487 [Q399R]), and APEX1 (rs1130409 [D148E]) in 165 patients with non-small cell lung cancer (NSCLC) who received definitive chemoradiation therapy. Results were assessed by both Logistic and Cox regression models for RP risk. Kaplan-Meier curves were generated for the cumulative RP probability by the genotypes. Results: We found that SNPsmore » of XRCC1 Q399R and APEX1 D148E each had a significant effect on the development of Grade {>=}2 RP (XRCC1: AA vs. GG, adjusted hazard ratio [HR] = 0.48, 95% confidence interval [CI], 0.24-0.97; APEX1: GG vs. TT, adjusted HR = 3.61, 95% CI, 1.64-7.93) in an allele-dose response manner (Trend tests: p = 0.040 and 0.001, respectively). The number of the combined protective XRCC1 A and APEX1 T alleles (from 0 to 4) also showed a significant trend of predicting RP risk (p = 0.001). Conclusions: SNPs of the base-excision repair genes may be biomarkers for susceptibility to RP. Larger prospective studies are needed to validate our findings.« less
D'Avolio, Antonio; De Nicolò, Amedeo; Cusato, Jessica; Ciancio, Alessia; Boglione, Lucio; Strona, Silvia; Cariti, Giuseppe; Troshina, Giulia; Caviglia, Gian Paolo; Smedile, Antonina; Rizzetto, Mario; Di Perri, Giovanni
2013-10-01
Functional variants rs7270101 and rs1127354 of inosine triphosphatase (ITPA) were recently found to protect against ribavirin (RBV)-induced hemolytic anemia. However, no definitive data are yet available on the role of no functional rs6051702 polymorphism. Since a simultaneous evaluation of the three ITPA SNPs for hemolytic anemia has not yet been investigated, we aimed to understand the contribution of each SNPs and its potential clinical use to predict anemia in HCV treated patients. A retrospective analysis included 379 HCV treated patients. The ITPA variants rs6051702, rs7270101 and rs1127354 were genotyped and tested for association with achieving anemia at week 4. We also investigated, using multivariate logistic regression, the impact of each single and paired associated polymorphism on anemia onset. All SNPs were associated with Hb decrease. The carrier of at least one variant allele in the functional ITPA SNPs was associated with a lower decrement of Hb, as compared to patients without a variant allele. In multivariate logistic regression analyses the carrier of a variant allele in the rs6051702/rs1127354 association (OR=0.11, p=1.75×10(-5)) and Hb at baseline (OR=1.51, p=1.21×10(-4)) were independently associated with protection against clinically significant anemia at week 4. All ITPA polymorphisms considered were shown to be significantly associated with anemia onset. A multivariate regression model based on ITPA genetic polymorphisms was developed for predicting the risk of anemia. Considering the characterization of pre-therapy anemia predictors, rs6051702 SNP in association to rs1127354 is more informative in order to avoid this relevant adverse event. Copyright © 2013 Elsevier B.V. All rights reserved.
Lehman, D M; Leach, R J; Johnson-Pais, T; Hamlington, J; Fowler, S; Almasy, L; Duggirala, R; Stern, M P; Abboud, H E
2006-09-01
Albuminuria, a hallmark of diabetic nephropathy, has been shown to be significantly heritable in multiple studies. Therefore, the identification of genes that affect susceptibility to albuminuria may lead to novel avenues of intervention. Current evidence suggests that the podocyte and slit diaphragm play a key role in controlling the selective sieve of the glomerular filtration barrier, and podocyte-specific genes have been identified that are necessary for maintaining its integrity. We therefore investigated the role of gene variants of tight junction protein (TJP1) which encodes another slit diaphragm-associated protein zona occludens 1 as risk factors for albuminuria in the San Antonio Family Diabetes/Gallbladder Study (SAFDGS), which consists of extended Mexican-American families with a high prevalence of type 2 diabetes. Albuminuria, defined as an albumin (mg/dl) to creatinine (mg/dl) ratio (ACR) of 0.03, which is approximately equivalent to a urinary albumin excretion (UAE) >30 mg/day, was present in a total of 14.9% of participants, and 31% had type 2 diabetes. The TJP1 exons, flanking intronic sequence, and putative proximal promoter regions were investigated in this population. Twentynine polymorphisms, including 7 nonsynonymous SNPs, were identified and genotyped in all subjects of this study for association analysis. Three sets of correlated SNPs, which include 3 exonic SNPs, were nominally associated with ACR (p value range 0.007-0.049); however, the association with the discrete trait albuminuria was not significant (p value range 0.094-0.338). We conclude that these variants in TJP1 do not appear to be major determinants for albuminuria in the SAFDGS; however, they may play a minor role in its severity in this Mexican-American population. Further examination of the TJP1 gene region in this and other cohorts will be useful to determine whether ZO-1 plays a significant role in glomerular permselectivity.
Gonzalez-Pena, Dianelys; Gao, Guangtu; Baranski, Matthew; Moen, Thomas; Cleveland, Beth M; Kenney, P Brett; Vallejo, Roger L; Palti, Yniv; Leeds, Timothy D
2016-01-01
Fillet yield (FY, %) is an economically-important trait in rainbow trout aquaculture that affects production efficiency. Despite that, FY has received little attention in breeding programs because it is difficult to measure on a large number of fish and cannot be directly measured on breeding candidates. The recent development of a high-density SNP array for rainbow trout has provided the needed tool for studying the underlying genetic architecture of this trait. A genome-wide association study (GWAS) was conducted for FY, body weight at 10 (BW10) and 13 (BW13) months post-hatching, head-off carcass weight (CAR), and fillet weight (FW) in a pedigreed rainbow trout population selectively bred for improved growth performance. The GWAS analysis was performed using the weighted single-step GBLUP method (wssGWAS). Phenotypic records of 1447 fish (1.5 kg at harvest) from 299 full-sib families in three successive generations, of which 875 fish from 196 full-sib families were genotyped, were used in the GWAS analysis. A total of 38,107 polymorphic SNPs were analyzed in a univariate model with hatch year and harvest group as fixed effects, harvest weight as a continuous covariate, and animal and common environment as random effects. A new linkage map was developed to create windows of 20 adjacent SNPs for use in the GWAS. The two windows with largest effect for FY and FW were located on chromosome Omy9 and explained only 1.0-1.5% of genetic variance, thus suggesting a polygenic architecture affected by multiple loci with small effects in this population. One window on Omy5 explained 1.4 and 1.0% of the genetic variance for BW10 and BW13, respectively. Three windows located on Omy27, Omy17, and Omy9 (same window detected for FY) explained 1.7, 1.7, and 1.0%, respectively, of genetic variance for CAR. Among the detected 100 SNPs, 55% were located directly in genes (intron and exons). Nucleotide sequences of intragenic SNPs were blasted to the Mus musculus genome to create a putative gene network. The network suggests that differences in the ability to maintain a proliferative and renewable population of myogenic precursor cells may affect variation in growth and fillet yield in rainbow trout.
Gonzalez-Pena, Dianelys; Gao, Guangtu; Baranski, Matthew; Moen, Thomas; Cleveland, Beth M.; Kenney, P. Brett; Vallejo, Roger L.; Palti, Yniv; Leeds, Timothy D.
2016-01-01
Fillet yield (FY, %) is an economically-important trait in rainbow trout aquaculture that affects production efficiency. Despite that, FY has received little attention in breeding programs because it is difficult to measure on a large number of fish and cannot be directly measured on breeding candidates. The recent development of a high-density SNP array for rainbow trout has provided the needed tool for studying the underlying genetic architecture of this trait. A genome-wide association study (GWAS) was conducted for FY, body weight at 10 (BW10) and 13 (BW13) months post-hatching, head-off carcass weight (CAR), and fillet weight (FW) in a pedigreed rainbow trout population selectively bred for improved growth performance. The GWAS analysis was performed using the weighted single-step GBLUP method (wssGWAS). Phenotypic records of 1447 fish (1.5 kg at harvest) from 299 full-sib families in three successive generations, of which 875 fish from 196 full-sib families were genotyped, were used in the GWAS analysis. A total of 38,107 polymorphic SNPs were analyzed in a univariate model with hatch year and harvest group as fixed effects, harvest weight as a continuous covariate, and animal and common environment as random effects. A new linkage map was developed to create windows of 20 adjacent SNPs for use in the GWAS. The two windows with largest effect for FY and FW were located on chromosome Omy9 and explained only 1.0–1.5% of genetic variance, thus suggesting a polygenic architecture affected by multiple loci with small effects in this population. One window on Omy5 explained 1.4 and 1.0% of the genetic variance for BW10 and BW13, respectively. Three windows located on Omy27, Omy17, and Omy9 (same window detected for FY) explained 1.7, 1.7, and 1.0%, respectively, of genetic variance for CAR. Among the detected 100 SNPs, 55% were located directly in genes (intron and exons). Nucleotide sequences of intragenic SNPs were blasted to the Mus musculus genome to create a putative gene network. The network suggests that differences in the ability to maintain a proliferative and renewable population of myogenic precursor cells may affect variation in growth and fillet yield in rainbow trout. PMID:27920797
Prediction of gene expression with cis-SNPs using mixed models and regularization methods.
Zeng, Ping; Zhou, Xiang; Huang, Shuiping
2017-05-11
It has been shown that gene expression in human tissues is heritable, thus predicting gene expression using only SNPs becomes possible. The prediction of gene expression can offer important implications on the genetic architecture of individual functional associated SNPs and further interpretations of the molecular basis underlying human diseases. We compared three types of methods for predicting gene expression using only cis-SNPs, including the polygenic model, i.e. linear mixed model (LMM), two sparse models, i.e. Lasso and elastic net (ENET), and the hybrid of LMM and sparse model, i.e. Bayesian sparse linear mixed model (BSLMM). The three kinds of prediction methods have very different assumptions of underlying genetic architectures. These methods were evaluated using simulations under various scenarios, and were applied to the Geuvadis gene expression data. The simulations showed that these four prediction methods (i.e. Lasso, ENET, LMM and BSLMM) behaved best when their respective modeling assumptions were satisfied, but BSLMM had a robust performance across a range of scenarios. According to R 2 of these models in the Geuvadis data, the four methods performed quite similarly. We did not observe any clustering or enrichment of predictive genes (defined as genes with R 2 ≥ 0.05) across the chromosomes, and also did not see there was any clear relationship between the proportion of the predictive genes and the proportion of genes in each chromosome. However, an interesting finding in the Geuvadis data was that highly predictive genes (e.g. R 2 ≥ 0.30) may have sparse genetic architectures since Lasso, ENET and BSLMM outperformed LMM for these genes; and this observation was validated in another gene expression data. We further showed that the predictive genes were enriched in approximately independent LD blocks. Gene expression can be predicted with only cis-SNPs using well-developed prediction models and these predictive genes were enriched in some approximately independent LD blocks. The prediction of gene expression can shed some light on the functional interpretation for identified SNPs in GWASs.
Das, Shouvik; Singh, Mohar; Srivastava, Rishi; Bajaj, Deepak; Saxena, Maneesha S.; Rana, Jai C.; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.
2016-01-01
The present study used a whole-genome, NGS resequencing-based mQTL-seq (multiple QTL-seq) strategy in two inter-specific mapping populations (Pusa 1103 × ILWC 46 and Pusa 256 × ILWC 46) to scan the major genomic region(s) underlying QTL(s) governing pod number trait in chickpea. Essentially, the whole-genome resequencing of low and high pod number-containing parental accessions and homozygous individuals (constituting bulks) from each of these two mapping populations discovered >8 million high-quality homozygous SNPs with respect to the reference kabuli chickpea. The functional significance of the physically mapped SNPs was apparent from the identified 2,264 non-synonymous and 23,550 regulatory SNPs, with 8–10% of these SNPs-carrying genes corresponding to transcription factors and disease resistance-related proteins. The utilization of these mined SNPs in Δ (SNP index)-led QTL-seq analysis and their correlation between two mapping populations based on mQTL-seq, narrowed down two (CaqaPN4.1: 867.8 kb and CaqaPN4.2: 1.8 Mb) major genomic regions harbouring robust pod number QTLs into the high-resolution short QTL intervals (CaqbPN4.1: 637.5 kb and CaqbPN4.2: 1.28 Mb) on chickpea chromosome 4. The integration of mQTL-seq-derived one novel robust QTL with QTL region-specific association analysis delineated the regulatory (C/T) and coding (C/A) SNPs-containing one pentatricopeptide repeat (PPR) gene at a major QTL region regulating pod number in chickpea. This target gene exhibited anther, mature pollen and pod-specific expression, including pronounced higher up-regulated (∼3.5-folds) transcript expression in high pod number-containing parental accessions and homozygous individuals of two mapping populations especially during pollen and pod development. The proposed mQTL-seq-driven combinatorial strategy has profound efficacy in rapid genome-wide scanning of potential candidate gene(s) underlying trait-associated high-resolution robust QTL(s), thereby expediting genomics-assisted breeding and genetic enhancement of crop plants, including chickpea. PMID:26685680
Genetic association between ghrelin polymorphisms and Alzheimer's disease in a Japanese population.
Shibata, Nobuto; Ohnuma, Tohru; Kuerban, Bolati; Komatsu, Miwa; Arai, Heii
2011-01-01
Ghrelin has been reported to enter the hippocampus and to bind to the neurons of the hippocampal formation. This peptide also affects neuronal glucose uptake and decreases tau hyperphosphorylation. There is increasing evidence suggesting an association between ghrelin and Alzheimer's disease (AD) pathology. The aim of this study was to investigate whether single nucleotide polymorphisms (SNPs) of the ghrelin gene are associated with AD. The SNPs were genotyped using TaqMan technology and were analyzed using a case-control study design. Our case-control dataset consisted of 182 AD patients and 143 age-matched controls. Hardy-Weinberg equilibrium and linkage disequilibrium analyses suggest that the region in and around the gene is highly polymorphic. One SNP, rs4684677 (Leu90Gln), showed a marginal association with age of AD onset. We did not detect any association between the other SNPs of the ghrelin gene and AD. There have been few genetic studies on the relationship between circulating ghrelin and functional SNPs. Further multifactorial studies are needed to clarify the relationship between ghrelin and AD. Copyright © 2011 S. Karger AG, Basel.
Convergent evidence from systematic analysis of GWAS revealed genetic basis of esophageal cancer.
Gao, Xue-Xin; Gao, Lei; Wang, Jiu-Qiang; Qu, Su-Su; Qu, Yue; Sun, Hong-Lei; Liu, Si-Dang; Shang, Ying-Li
2016-07-12
Recent genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with risk of esophageal cancer (EC). However, investigation of genetic basis from the perspective of systematic biology and integrative genomics remains scarce.In this study, we explored genetic basis of EC based on GWAS data and implemented a series of bioinformatics methods including functional annotation, expression quantitative trait loci (eQTL) analysis, pathway enrichment analysis and pathway grouped network analysis.Two hundred and thirteen risk SNPs were identified, in which 44 SNPs were found to have significantly differential gene expression in esophageal tissues by eQTL analysis. By pathway enrichment analysis, 170 risk genes mapped by risk SNPs were enriched into 38 significant GO terms and 17 significant KEGG pathways, which were significantly grouped into 9 sub-networks by pathway grouped network analysis. The 9 groups of interconnected pathways were mainly involved with muscle cell proliferation, cellular response to interleukin-6, cell adhesion molecules, and ethanol oxidation, which might participate in the development of EC.Our findings provide genetic evidence and new insight for exploring the molecular mechanisms of EC.
Fine-mapping and initial characterization of QT interval loci in African Americans.
Avery, Christy L; Sethupathy, Praveen; Buyske, Steven; He, Qianchuan; Lin, Dan-Yu; Arking, Dan E; Carty, Cara L; Duggan, David; Fesinmeyer, Megan D; Hindorff, Lucia A; Jeff, Janina M; Klein, Liviu; Patton, Kristen K; Peters, Ulrike; Shohet, Ralph V; Sotoodehnia, Nona; Young, Alicia M; Kooperberg, Charles; Haiman, Christopher A; Mohlke, Karen L; Whitsel, Eric A; North, Kari E
2012-01-01
The QT interval (QT) is heritable and its prolongation is a risk factor for ventricular tachyarrhythmias and sudden death. Most genetic studies of QT have examined European ancestral populations; however, the increased genetic diversity in African Americans provides opportunities to narrow association signals and identify population-specific variants. We therefore evaluated 6,670 SNPs spanning eleven previously identified QT loci in 8,644 African American participants from two Population Architecture using Genomics and Epidemiology (PAGE) studies: the Atherosclerosis Risk in Communities study and Women's Health Initiative Clinical Trial. Of the fifteen known independent QT variants at the eleven previously identified loci, six were significantly associated with QT in African American populations (P≤1.20×10(-4)): ATP1B1, PLN1, KCNQ1, NDRG4, and two NOS1AP independent signals. We also identified three population-specific signals significantly associated with QT in African Americans (P≤1.37×10(-5)): one at NOS1AP and two at ATP1B1. Linkage disequilibrium (LD) patterns in African Americans assisted in narrowing the region likely to contain the functional variants for several loci. For example, African American LD patterns showed that 0 SNPs were in LD with NOS1AP signal rs12143842, compared with European LD patterns that indicated 87 SNPs, which spanned 114.2 Kb, were in LD with rs12143842. Finally, bioinformatic-based characterization of the nine African American signals pointed to functional candidates located exclusively within non-coding regions, including predicted binding sites for transcription factors such as TBX5, which has been implicated in cardiac structure and conductance. In this detailed evaluation of QT loci, we identified several African Americans SNPs that better define the association with QT and successfully narrowed intervals surrounding established loci. These results demonstrate that the same loci influence variation in QT across multiple populations, that novel signals exist in African Americans, and that the SNPs identified as strong candidates for functional evaluation implicate gene regulatory dysfunction in QT prolongation.
Fine-Mapping and Initial Characterization of QT Interval Loci in African Americans
Avery, Christy L.; Sethupathy, Praveen; Buyske, Steven; He, Qianchuan; Lin, Dan-Yu; Arking, Dan E.; Carty, Cara L.; Duggan, David; Fesinmeyer, Megan D.; Hindorff, Lucia A.; Jeff, Janina M.; Klein, Liviu; Patton, Kristen K.; Peters, Ulrike; Shohet, Ralph V.; Sotoodehnia, Nona; Young, Alicia M.; Kooperberg, Charles; Haiman, Christopher A.; Mohlke, Karen L.; Whitsel, Eric A.; North, Kari E.
2012-01-01
The QT interval (QT) is heritable and its prolongation is a risk factor for ventricular tachyarrhythmias and sudden death. Most genetic studies of QT have examined European ancestral populations; however, the increased genetic diversity in African Americans provides opportunities to narrow association signals and identify population-specific variants. We therefore evaluated 6,670 SNPs spanning eleven previously identified QT loci in 8,644 African American participants from two Population Architecture using Genomics and Epidemiology (PAGE) studies: the Atherosclerosis Risk in Communities study and Women's Health Initiative Clinical Trial. Of the fifteen known independent QT variants at the eleven previously identified loci, six were significantly associated with QT in African American populations (P≤1.20×10−4): ATP1B1, PLN1, KCNQ1, NDRG4, and two NOS1AP independent signals. We also identified three population-specific signals significantly associated with QT in African Americans (P≤1.37×10−5): one at NOS1AP and two at ATP1B1. Linkage disequilibrium (LD) patterns in African Americans assisted in narrowing the region likely to contain the functional variants for several loci. For example, African American LD patterns showed that 0 SNPs were in LD with NOS1AP signal rs12143842, compared with European LD patterns that indicated 87 SNPs, which spanned 114.2 Kb, were in LD with rs12143842. Finally, bioinformatic-based characterization of the nine African American signals pointed to functional candidates located exclusively within non-coding regions, including predicted binding sites for transcription factors such as TBX5, which has been implicated in cardiac structure and conductance. In this detailed evaluation of QT loci, we identified several African Americans SNPs that better define the association with QT and successfully narrowed intervals surrounding established loci. These results demonstrate that the same loci influence variation in QT across multiple populations, that novel signals exist in African Americans, and that the SNPs identified as strong candidates for functional evaluation implicate gene regulatory dysfunction in QT prolongation. PMID:22912591
Love-Gregory, Latisha; Kraja, Aldi T; Allum, Fiona; Aslibekyan, Stella; Hedman, Åsa K; Duan, Yanan; Borecki, Ingrid B; Arnett, Donna K; McCarthy, Mark I; Deloukas, Panos; Ordovas, Jose M; Hopkins, Paul N; Grundberg, Elin; Abumrad, Nada A
2016-12-01
Cluster of differentiation 36 (CD36) variants influence fasting lipids and risk of metabolic syndrome, but their impact on postprandial lipids, an independent risk factor for cardiovascular disease, is unclear. We determined the effects of SNPs within a ∼410 kb region encompassing CD36 and its proximal and distal promoters on chylomicron (CM) remnants and LDL particles at fasting and at 3.5 and 6 h following a high-fat meal (Genetics of Lipid Lowering Drugs and Diet Network study, n = 1,117). Five promoter variants associated with CMs, four with delayed TG clearance and five with LDL particle number. To assess mechanisms underlying the associations, we queried expression quantitative trait loci, DNA methylation, and ChIP-seq datasets for adipose and heart tissues that function in postprandial lipid clearance. Several SNPs that associated with higher serum lipids correlated with lower adipose and heart CD36 mRNA and aligned to active motifs for PPARγ, a major CD36 regulator. The SNPs also associated with DNA methylation sites that related to reduced CD36 mRNA and higher serum lipids, but mixed-model analyses indicated that the SNPs and methylation independently influence CD36 mRNA. The findings support contributions of CD36 SNPs that reduce adipose and heart CD36 RNA expression to inter-individual variability of postprandial lipid metabolism and document changes in CD36 DNA methylation that influence both CD36 expression and lipids. Copyright © 2016 by the American Society for Biochemistry and Molecular Biology, Inc.
Yamashita, Shinpei; Takigahira, Tomohiro; Takahashi, Kazuo H
2018-06-01
Accumulating evidence suggests that genotype of host insects influences the development of koinobiont endoparasitoids. Although there are many potential genetic variations that lead to the internal body environmental variations of host insects, association between the host genotype and the parasitoid development has not been examined in a genome-wide manner. In the present study, we used highly inbred whole genome sequenced strains of Drosophila melanogaster to associate single nucleotide polymorphisms (SNPs) of host flies with morphological traits of Asobara japonica, a larval-pupal parasitoid wasp that infected those hosts. We quantified the outline shape of the forewings of A. japonica with two major principal components (PC1 and PC2) calculated from Fourier coefficients obtained from elliptic Fourier analysis. We also quantified wing size and estimated wasp survival. We then examined the association between the PC scores, wing size and 1,798,561 SNPs and the association between the estimated wasp survival and 1,790,544 SNPs. As a result, we obtained 22, 24 and 14 SNPs for PC1, PC2 and wing size and four SNPs for the estimated survival with P values smaller than 10 -5 . Based on the location of the SNPs, 12, 17, 11 and five protein coding genes were identified as potential candidates for PC1, PC2, wing size and the estimated survival, respectively. Based on the function of the candidate genes, it is suggested that the host genetic variation associated with the cell growth and morphogenesis may influence the wasp's morphogenetic variation.
Soerensen, Mette; Nygaard, Marianne; Debrabant, Birgit; Mengel-From, Jonas; Dato, Serena; Thinggaard, Mikael; Christensen, Kaare; Christiansen, Lene
2016-06-01
In this study we explored the association between aging-related phenotypes previously reported to predict survival in old age and variation in 77 genes from the DNA repair pathway, 32 genes from the growth hormone 1/ insulin-like growth factor 1/insulin (GH/IGF-1/INS) signalling pathway and 16 additional genes repeatedly considered as candidates for human longevity: APOE, APOA4, APOC3, ACE, CETP, HFE, IL6, IL6R, MTHFR, TGFB1, SIRTs 1, 3, 6; and HSPAs 1A, 1L, 14. Altogether, 1,049 single nucleotide polymorphisms (SNPs) were genotyped in 1,088 oldest-old (age 92-93 years) Danes and analysed with phenotype data on physical functioning (hand grip strength), cognitive functioning (mini mental state examination and a cognitive composite score), activity of daily living and self-rated health. Five SNPs showed association to one of the phenotypes; however, none of these SNPs were associated with a change in the relevant phenotype over time (7 years of follow-up) and none of the SNPs could be confirmed in a replication sample of 1,281 oldest-old Danes (age 94-100). Hence, our study does not support association between common variation in the investigated longevity candidate genes and aging-related phenotypes consistently shown to predict survival. It is possible that larger sample sizes are needed to robustly reveal associations with small effect sizes. Copyright © 2016 Elsevier Inc. All rights reserved.
Wang, Juxiang; Zhuo, Zhenjian; Chen, Min; Zhu, Jinhong; Zhao, Jie; Zhang, Jiao; Chen, Shanshan; He, Jing; Zhou, Haixia
2018-04-28
The genetic etiology of sporadic neuroblastoma remains largely obscure. RAN and RANBP2 genes encode Ras-related nuclear protein and Ran-binding protein 2, respectively. These two proteins form Ran-RanBP2 complex that regulate various cellular activities including nuclear transport. Aberrant functions of the two proteins are implicated in carcinogenesis. Given the unknown role of RAN/RANBP2 single nucleotide polymorphisms (SNPs) in neuroblastoma risk, we performed a multi-center case-control study in Chinese children to assess the association of the RAN/RANBP2 SNPs with neuroblastoma risk. We analyzed three potentially functional SNPs in RAN gene (rs56109543 C>T, rs7132224 A>G, rs14035 C>T) and one in RANBP2 (rs2462788 C>T) in 429 cases and 884 controls. Odds ratios (ORs) and 95% confidence intervals (CIs) were used to access the association between these four polymorphisms and neuroblastoma risk. No single variant was found to statistically significantly associate with neuroblastoma risk. However, individuals with 3 protective genotypes were less likely to develop neuroblastoma, in comparison to non-carriers (adjusted OR=0.33; 95% CI=0.12-0.96; P =0.042), as well as those with 0-2 protective genotypes (adjusted OR=0.33; 95% CI=0.11-0.94; P =0.038). Stratified analysis revealed no significant association for any of the four polymorphisms. Further studies are warranted to validate the weak impact of RAN/RANBP2 SNPs on neuroblastoma risk.
Sun, Jing; Xie, Wenjie; Zhu, Xufeng; Xu, Mengmeng; Liu, Jie
2018-04-18
Functionalized nanomaterials, which have been applied widely to inhibit amyloid-β protein (Aβ) aggregation, show enormous potential in the field of prevention and treatment of Alzheimer's disease (AD). A significant body of data has demonstrated that the morphology and size of nanomaterials have remarkable effects on their biological behaviors. In this work, we proposed and designed three kinds of brain-targeting sulfur nanoparticles (RVG@Met@SNPs) with novel morphologies (volute-like, tadpole-like, and sphere-like) and investigated the effect of different RVG@Met@SNPs on Aβ-Cu 2+ complex aggregation and their corresponding neurotoxicity. Among them, the sphere-like nanoparticles (RVG@Met@SS) exhibited the most effective inhibitory activity, due to their unique mini size effect, and they reduced 61.6% the Aβ-Cu 2+ complex aggregation and increased 92.4% SH-SY5Y cell viability in a dose of 10 μg/mL. In vitro and in vivo, the abilities of different morphologies of RVG@Met@SNPs to cross the blood-brain barrier (BBB) and target brain parenchymal cells were significantly different. Moreover, improvements in learning disability and cognitive loss were shown in the transgenic AD mice model using the Morris water maze test after multiple doses of RVG@Met@SNPs treatment. In general, the purpose of this research is to develop a biological application of sulfur nanoparticles and to provide a novel functionalized nanomaterial to treat AD.
Zhang, Yanxia; Fan, Mei; Wang, Qingzhong; He, Guang; Fu, Yingmei; Li, Huafang; Yu, Shunying
2015-08-10
Disturbances in glutamate signaling caused by disruption of N-methyl-D-aspartate-type glutamate receptor (NMDAR) have been implicated in schizophrenia. Findings suggested that miR-219, miR-132 and miR-107 could involve in NMDAR signaling by influencing the expression of pathway genes or the signaling transmission and single nucleotide polymorphisms (SNPs) within miRNA genes or miRNA target sites could result in their functional changes. Therefore, we hypothesized that SNPs in miRNAs and/or their target sites were associated with schizophrenia. 3 SNPs in hsa-pri-miR-219/132/107 and 6 SNPs in 3'UTRs of GRIN2A/2B/3A and CAMK2G were selected and genotyped in a case-control study of 1041 schizophrenia cases and 953 healthy controls in Chinese Han population. In the present study, GRIN2B rs890 showed significant associations with schizophrenia. Further functional analyses showed that the rs890 variant C allele led to significantly lower luciferase activity, compared with the A allele. MDR analysis showed that a 4-locus model including rs107822, rs2306327, rs890 and rs12342026 was the best model. These findings suggest that GRIN2B may be associated with schizophrenia and interaction effects of the polymorphisms in hsa-miR-219, CAKM2G, GRIN2B and GRIN3A may confer susceptibility to schizophrenia in the Chinese Han population.
Uemura, Satoshi; Mochizuki, Takahiro; Kurosaka, Goyu; Hashimoto, Takanori; Masukawa, Yuki; Abe, Fumiyoshi
2017-10-01
Tryptophan is an essential amino acid in humans and an important serotonin and melatonin precursor. Monocarboxylate transporter MCT10 is a member of the SLC16A family proteins that mediates low-affinity tryptophan transport across basolateral membranes of kidney, small intestine, and liver epithelial cells, although the precise transport mechanism remains unclear. Here we developed a simple functional assay to analyze tryptophan transport by human MCT10 using a deletion mutant for the high-affinity tryptophan permease Tat2 in Saccharomyces cerevisiae. tat2Δtrp1 cells are defective in growth in YPD medium because tyrosine present in the medium competes for the low-affinity tryptophan permease Tat1 with tryptophan. MCT10 appeared to allow growth of tat2Δtrp1 cells in YPD medium, and accumulate in cells deficient for Rsp5 ubiquitin ligase. These results suggest that MCT10 is functional in yeast, and is subject to ubiquitin-dependent quality control. Whereas growth of Tat2-expressing cells was significantly impaired by neutral pH, that of MCT10-expressing cells was nearly unaffected. This property is consistent with the transport mechanism of MCT10 via facilitated diffusion without a need for pH gradient across the plasma membrane. Single-nucleotide polymorphisms (SNPs) are known to occur in the human MCT10 coding region. Among eight SNP amino acid changes in MCT10, the N81K mutation completely abrogated tryptophan import without any abnormalities in the expression or localization. In the MCT10 modeled structure, N81 appeared to protrude into the putative trajectory of tryptophan. Plasma membrane localization of MCT10 and the variant proteins was also verified in human embryonic kidney 293T cells. Copyright © 2017 Elsevier B.V. All rights reserved.
Association of CLU and PICALM variants with Alzheimer's disease
Kamboh, M.I.; Minster, R. L.; Demirci, F.Y.; Ganguli, M.; DeKosky, S.T.; Lopez, O.L.; Barmada, M.M.
2010-01-01
Two recent large genome-wide association studies have reported significant associations in the CLU (APOJ), CR1 and PICALM genes. In order to replicate these findings, we examined 7 single nucleotide polymorphisms (SNPs) most significantly implicated by these studies in a large case-control sample comprising of 2,707 individuals. Principle components analysis revealed no population substructure in our sample. While no association was observed with CR1 SNPs (P=0.30–0.457), a trend of association was seen with the PICALM (P=0.071–0.086) and CLU (P=0.148–0.258) SNPs. A meta-analysis of three studies revealed significant associations with all three genes. Our data from an independent and large case-control sample suggest that these gene regions should be followed up by comprehensive resequencing to find functional variants. PMID:20570404
GWAS of 126,559 Individuals Identifies Genetic Variants Associated with Educational Attainment
Rietveld, Cornelius A.; Medland, Sarah E.; Derringer, Jaime; Yang, Jian; Esko, Tõnu; Martin, Nicolas W.; Westra, Harm-Jan; Shakhbazov, Konstantin; Abdellaoui, Abdel; Agrawal, Arpana; Albrecht, Eva; Alizadeh, Behrooz Z.; Amin, Najaf; Barnard, John; Baumeister, Sebastian E.; Benke, Kelly S.; Bielak, Lawrence F.; Boatman, Jeffrey A.; Boyle, Patricia A.; Davies, Gail; de Leeuw, Christiaan; Eklund, Niina; Evans, Daniel S.; Ferhmann, Rudolf; Fischer, Krista; Gieger, Christian; Gjessing, Håkon K.; Hägg, Sara; Harris, Jennifer R.; Hayward, Caroline; Holzapfel, Christina; Ibrahim-Verbaas, Carla A.; Ingelsson, Erik; Jacobsson, Bo; Joshi, Peter K.; Jugessur, Astanand; Kaakinen, Marika; Kanoni, Stavroula; Karjalainen, Juha; Kolcic, Ivana; Kristiansson, Kati; Kutalik, Zoltán; Lahti, Jari; Lee, Sang H.; Lin, Peng; Lind, Penelope A.; Liu, Yongmei; Lohman, Kurt; Loitfelder, Marisa; McMahon, George; Vidal, Pedro Marques; Meirelles, Osorio; Milani, Lili; Myhre, Ronny; Nuotio, Marja-Liisa; Oldmeadow, Christopher J.; Petrovic, Katja E.; Peyrot, Wouter J.; Polašek, Ozren; Quaye, Lydia; Reinmaa, Eva; Rice, John P.; Rizzi, Thais S.; Schmidt, Helena; Schmidt, Reinhold; Smith, Albert V.; Smith, Jennifer A.; Tanaka, Toshiko; Terracciano, Antonio; van der Loos, Matthijs J.H.M.; Vitart, Veronique; Völzke, Henry; Wellmann, Jürgen; Yu, Lei; Zhao, Wei; Allik, Jüri; Attia, John R.; Bandinelli, Stefania; Bastardot, François; Beauchamp, Jonathan; Bennett, David A.; Berger, Klaus; Bierut, Laura J.; Boomsma, Dorret I.; Bültmann, Ute; Campbell, Harry; Chabris, Christopher F.; Cherkas, Lynn; Chung, Mina K.; Cucca, Francesco; de Andrade, Mariza; De Jager, Philip L.; De Neve, Jan-Emmanuel; Deary, Ian J.; Dedoussis, George V.; Deloukas, Panos; Dimitriou, Maria; Eiriksdottir, Gudny; Elderson, Martin F.; Eriksson, Johan G.; Evans, David M.; Faul, Jessica D.; Ferrucci, Luigi; Garcia, Melissa E.; Grönberg, Henrik; Gudnason, Vilmundur; Hall, Per; Harris, Juliette M.; Harris, Tamara B.; Hastie, Nicholas D.; Heath, Andrew C.; Hernandez, Dena G.; Hoffmann, Wolfgang; Hofman, Adriaan; Holle, Rolf; Holliday, Elizabeth G.; Hottenga, Jouke-Jan; Iacono, William G.; Illig, Thomas; Järvelin, Marjo-Riitta; Kähönen, Mika; Kaprio, Jaakko; Kirkpatrick, Robert M.; Kowgier, Matthew; Latvala, Antti; Launer, Lenore J.; Lawlor, Debbie A.; Lehtimäki, Terho; Li, Jingmei; Lichtenstein, Paul; Lichtner, Peter; Liewald, David C.; Madden, Pamela A.; Magnusson, Patrik K. E.; Mäkinen, Tomi E.; Masala, Marco; McGue, Matt; Metspalu, Andres; Mielck, Andreas; Miller, Michael B.; Montgomery, Grant W.; Mukherjee, Sutapa; Nyholt, Dale R.; Oostra, Ben A.; Palmer, Lyle J.; Palotie, Aarno; Penninx, Brenda; Perola, Markus; Peyser, Patricia A.; Preisig, Martin; Räikkönen, Katri; Raitakari, Olli T.; Realo, Anu; Ring, Susan M.; Ripatti, Samuli; Rivadeneira, Fernando; Rudan, Igor; Rustichini, Aldo; Salomaa, Veikko; Sarin, Antti-Pekka; Schlessinger, David; Scott, Rodney J.; Snieder, Harold; Pourcain, Beate St; Starr, John M.; Sul, Jae Hoon; Surakka, Ida; Svento, Rauli; Teumer, Alexander; Tiemeier, Henning; Rooij, Frank JAan; Van Wagoner, David R.; Vartiainen, Erkki; Viikari, Jorma; Vollenweider, Peter; Vonk, Judith M.; Waeber, Gérard; Weir, David R.; Wichmann, H.-Erich; Widen, Elisabeth; Willemsen, Gonneke; Wilson, James F.; Wright, Alan F.; Conley, Dalton; Davey-Smith, George; Franke, Lude; Groenen, Patrick J. F.; Hofman, Albert; Johannesson, Magnus; Kardia, Sharon L.R.; Krueger, Robert F.; Laibson, David; Martin, Nicholas G.; Meyer, Michelle N.; Posthuma, Danielle; Thurik, A. Roy; Timpson, Nicholas J.; Uitterlinden, André G.; van Duijn, Cornelia M.; Visscher, Peter M.; Benjamin, Daniel J.; Cesarini, David; Koellinger, Philipp D.
2013-01-01
A genome-wide association study of educational attainment was conducted in a discovery sample of 101,069 individuals and a replication sample of 25,490. Three independent SNPs are genome-wide significant (rs9320913, rs11584700, rs4851266), and all three replicate. Estimated effects sizes are small (R2 ≈ 0.02%), approximately 1 month of schooling per allele. A linear polygenic score from all measured SNPs accounts for ≈ 2% of the variance in both educational attainment and cognitive function. Genes in the region of the loci have previously been associated with health, cognitive, and central nervous system phenotypes, and bioinformatics analyses suggest the involvement of the anterior caudate nucleus. These findings provide promising candidate SNPs for follow-up work, and our effect size estimates can anchor power analyses in social-science genetics. PMID:23722424
Liao, R; Zhang, X; Chen, Q; Wang, Z; Wang, Q; Yang, C; Pan, Y
2016-10-01
This study was designed to investigate the genetic basis of growth and egg traits in Dongxiang blue-shelled chickens and White Leghorn chickens. In this study, we employed a reduced representation sequencing approach called genotyping by genome reducing and sequencing to detect genome-wide SNPs in 252 Dongxiang blue-shelled chickens and 252 White Leghorn chickens. The Dongxiang blue-shelled chicken breed has many specific traits and is characterized by blue-shelled eggs, black plumage, black skin, black bone and black organs. The White Leghorn chicken is an egg-type breed with high productivity. As multibreed genome-wide association studies (GWASs) can improve precision due to less linkage disequilibrium across breeds, a multibreed GWAS was performed with 156 575 SNPs to identify the associated variants underlying growth and egg traits within the two chicken breeds. The analysis revealed 32 SNPs exhibiting a significant genome-wide association with growth and egg traits. Some of the significant SNPs are located in genes that are known to impact growth and egg traits, but nearly half of the significant SNPs are located in genes with unclear functions in chickens. To our knowledge, this is the first multibreed genome-wide report for the genetics of growth and egg traits in the Dongxiang blue-shelled and White Leghorn chickens. © 2016 Stichting International Foundation for Animal Genetics.
Ning, Shangwei; Yue, Ming; Wang, Peng; Liu, Yue; Zhi, Hui; Zhang, Yan; Zhang, Jizhou; Gao, Yue; Guo, Maoni; Zhou, Dianshuang; Li, Xin; Li, Xia
2017-01-04
We describe LincSNP 2.0 (http://bioinfo.hrbmu.edu.cn/LincSNP), an updated database that is used specifically to store and annotate disease-associated single nucleotide polymorphisms (SNPs) in human long non-coding RNAs (lncRNAs) and their transcription factor binding sites (TFBSs). In LincSNP 2.0, we have updated the database with more data and several new features, including (i) expanding disease-associated SNPs in human lncRNAs; (ii) identifying disease-associated SNPs in lncRNA TFBSs; (iii) updating LD-SNPs from the 1000 Genomes Project; and (iv) collecting more experimentally supported SNP-lncRNA-disease associations. Furthermore, we developed three flexible online tools to retrieve and analyze the data. Linc-Mart is a convenient way for users to customize their own data. Linc-Browse is a tool for all data visualization. Linc-Score predicts the associations between lncRNA and disease. In addition, we provided users a newly designed, user-friendly interface to search and download all the data in LincSNP 2.0 and we also provided an interface to submit novel data into the database. LincSNP 2.0 is a continually updated database and will serve as an important resource for investigating the functions and mechanisms of lncRNAs in human diseases. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Yue, S J; Zhao, Y Q; Gu, X R; Yin, B; Jiang, Y L; Wang, Z H; Shi, K R
2017-12-01
A genome-wide association study (GWAS) was conducted on 15 milk production traits in Chinese Holstein. The experimental population consisted of 445 cattle, each genotyped by the GGP (GeneSeek genomic profiling)-BovineLD V3 SNP chip, which had 26 151 public SNPs in its manifest file. After data cleaning, 20 326 SNPs were retained for the GWAS. The phenotypes were estimated breeding values of traits, provided by a public dairy herd improvement program center that had been collected once a month for 3 years. Two statistical models, a fixed-effect linear regression model and a mixed-effect linear model, were used to estimate the association effects of SNPs on each of the phenotypes. Genome-wide significant and suggestive thresholds were set at 2.46E-06 and 4.95E-05 respectively. The two statistical models concurrently identified two genome-wide significant (P < 0.05) SNPs on milk production traits in this Chinese Holstein population. The positional candidate genes, which were the ones closest to these two identified SNPs, were EEF2K (eukaryotic elongation factor 2 kinase) and KLHL1 (kelch like family member 1). These two genes could serve as new candidate genes for milk yield and lactation persistence, yet their roles need to be verified in further function studies. © 2017 Stichting International Foundation for Animal Genetics.
Schwessinger, Ron; Suciu, Maria C; McGowan, Simon J; Telenius, Jelena; Taylor, Stephen; Higgs, Doug R; Hughes, Jim R
2017-10-01
In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k -mer-based analysis of DNase footprints to determine any k -mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. © 2017 Schwessinger et al.; Published by Cold Spring Harbor Laboratory Press.
Cloete, Ruben; Akurugu, Wisdom A; Werely, Cedric J; van Helden, Paul D; Christoffels, Alan
2017-08-01
The human arylamine N-acetyltransferase 1 (NAT1) enzyme plays a vital role in determining the duration of action of amine-containing drugs such as para-aminobenzoic acid (PABA) by influencing the balance between detoxification and metabolic activation of these drugs. Recently, four novel single nucleotide polymorphisms (SNPs) were identified within a South African mixed ancestry population. Modeling the effects of these SNPs within the structural protein was done to assess possible structure and function changes in the enzyme. The use of molecular dynamics simulations and stability predictions indicated less thermodynamically stable protein structures containing E264K and V231G, while the N245I change showed a stabilizing effect. Coincidently the N245I change displayed a similar free energy landscape profile to the known R64W amino acid substitution (slow acetylator), while the R242M displayed a similar profile to the published variant, I263V (proposed fast acetylator), and the wild type protein structure. Similarly, principal component analysis indicated that two amino acid substitutions (E264K and V231G) occupied less conformational clusters of folded states as compared to the WT and were found to be destabilizing (may affect protein function). However, two of the four novel SNPs that result in amino acid changes: (V231G and N245I) were predicted by both SIFT and POLYPHEN-2 algorithms to affect NAT1 protein function, while two other SNPs that result in R242M and E264K substitutions showed contradictory results based on SIFT and POLYPHEN-2 analysis. In conclusion, the structural methods were able to verify that two non-synonymous substitutions (E264K and V231G) can destabilize the protein structure, and are in agreement with mCSM predictions, and should therefore be experimentally tested for NAT1 activity. These findings could inform a strategy of incorporating genotypic data (i.e., functional SNP alleles) with phenotypic information (slow or fast acetylator) to better prescribe effective treatment using drugs metabolized by NAT1. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Association of the variants in the BUD13-ZNF259 genes and the risk of hyperlipidaemia.
Aung, Lynn Htet Htet; Yin, Rui-Xing; Wu, Dong-Feng; Wang, Wei; Liu, Cheng-Wu; Pan, Shang-Ling
2014-07-01
The single nucleotide polymorphisms (SNPs) in the BUD13 homolog (BUD13) and zinc finger protein 259 (ZNF259) genes have been associated with one or more serum lipid traits in the European populations. However, little is known about such association in the Chinese populations. Our objectives were to determine the association of the BUD13/ZNF259 SNPs and their haplotypes with hypercholesterolaemia (HCH)/hypertriglyceridaemia (HTG) and to identify the possible gene-gene interactions among these SNPs. Genotyping of 6 SNPs was performed in 634 hyperlipidaemic and 547 normolipidaemic participants. The ZNF259 rs2075290, ZNF259 rs964184 and BUD13 rs10790162 SNPs were significantly associated with serum lipid levels in both HCH and non-HCH populations (P < 0.008-0.001). On single locus analysis, only BUD13 rs10790162 was associated with HCH (OR: 2.23, 95% CI: 1.05, 4.75, P = 0.015). The G-G-A-A-C-C haplotype, carrying rs964184-G-allele, was associated with increased risk of HCH (OR: 1.35, 95% CI: 1.10, 1.66, P = 0.005) and HTG (OR: 1.75, 95% CI: 1.39, 2.21, P = 0.000). The A-C-G-G-C-C and A-C-A-G-T-C haplotypes, carrying rs964184-C-allele, were associated with reduced risk of HCH (OR: 0.77, 95% CI: 0.61, 0.99, P = 0.039 and OR: 0.66, 95% CI: 0.47, 0.94, P = 0.021 respectively). On multifactor dimensionality reduction analyses, the two- to three-locus models showed a significant association with HCH and HTG (P < 0.01-0.001). The BUD13/ZNF259 SNPs, which were significant in the European populations, are also replicable in the Southern Chinese population. Moreover, inter-locus interactions may exist among these SNPs. However, further functional studies are required to clarify how these SNPs and genes actually affect the serum lipid levels. © 2014 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.
Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin
2010-01-01
Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease. PMID:21203467
Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin
2010-12-23
Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease.
2013-01-01
Background The genomic architecture of adaptive traits remains poorly understood in non-model plants. Various approaches can be used to bridge this gap, including the mapping of quantitative trait loci (QTL) in pedigrees, and genetic association studies in non-structured populations. Here we present results on the genomic architecture of adaptive traits in black spruce, which is a widely distributed conifer of the North American boreal forest. As an alternative to the usual candidate gene approach, a candidate SNP approach was developed for association testing. Results A genetic map containing 231 gene loci was used to identify QTL that were related to budset timing and to tree height assessed over multiple years and sites. Twenty-two unique genomic regions were identified, including 20 that were related to budset timing and 6 that were related to tree height. From results of outlier detection and bulk segregant analysis for adaptive traits using DNA pool sequencing of 434 genes, 52 candidate SNPs were identified and subsequently tested in genetic association studies for budset timing and tree height assessed over multiple years and sites. A total of 34 (65%) SNPs were significantly associated with budset timing, or tree height, or both. Although the percentages of explained variance (PVE) by individual SNPs were small, several significant SNPs were shared between sites and among years. Conclusions The sharing of genomic regions and significant SNPs between budset timing and tree height indicates pleiotropic effects. Significant QTLs and SNPs differed quite greatly among years, suggesting that different sets of genes for the same characters are involved at different stages in the tree’s life history. The functional diversity of genes carrying significant SNPs and low observed PVE further indicated that a large number of polymorphisms are involved in adaptive genetic variation. Accordingly, for undomesticated species such as black spruce with natural populations of large effective size and low linkage disequilibrium, efficient marker systems that are predictive of adaptation should require the survey of large numbers of SNPs. Candidate SNP approaches like the one developed in the present study could contribute to reducing these numbers. PMID:23724860
Complexin2 modulates working memory-related neural activity in patients with schizophrenia
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hass, Johanna; Walton, Esther; Kirsten, Holger
The specific contribution of risk or candidate gene variants to the complex phenotype of schizophrenia is largely unknown. Studying the effects of such variants on brain function can provide insight into disease-associated mechanisms on a neural systems level. Previous studies found common variants in the complexin2 ( CPLX2) gene to be highly associated with cognitive dysfunction in schizophrenia patients. Similarly, cognitive functioning was found to be impaired in Cplx2 gene-deficient mice if they were subjected to maternal deprivation or mild brain trauma during puberty. Here, we aimed to study seven common CPLX2 single-nucleotide polymorphisms (SNPs) and their neurogenetic risk mechanismsmore » by investigating their relationship to a schizophrenia-related functional neuroimaging intermediate phenotype. In this paper, we examined functional MRI and genotype data collected from 104 patients with DSM-IV-diagnosed schizophrenia and 122 healthy controls who participated in the Mind Clinical Imaging Consortium study of schizophrenia. Seven SNPs distributed over the whole CPLX2 gene were tested for association with working memory-elicited neural activity in a frontoparietal neural network. Three CPLX2 SNPs were significantly associated with increased neural activity in the dorsolateral prefrontal cortex and intraparietal sulcus in the schizophrenia sample, but showed no association in healthy controls. Finally, since increased working memory-related neural activity in individuals with or at risk for schizophrenia has been interpreted as ‘neural inefficiency,’ these findings suggest that certain variants of CPLX2 may contribute to impaired brain function in schizophrenia, possibly combined with other deleterious genetic variants, adverse environmental events, or developmental insults.« less
Complexin2 modulates working memory-related neural activity in patients with schizophrenia
Hass, Johanna; Walton, Esther; Kirsten, Holger; ...
2014-10-09
The specific contribution of risk or candidate gene variants to the complex phenotype of schizophrenia is largely unknown. Studying the effects of such variants on brain function can provide insight into disease-associated mechanisms on a neural systems level. Previous studies found common variants in the complexin2 ( CPLX2) gene to be highly associated with cognitive dysfunction in schizophrenia patients. Similarly, cognitive functioning was found to be impaired in Cplx2 gene-deficient mice if they were subjected to maternal deprivation or mild brain trauma during puberty. Here, we aimed to study seven common CPLX2 single-nucleotide polymorphisms (SNPs) and their neurogenetic risk mechanismsmore » by investigating their relationship to a schizophrenia-related functional neuroimaging intermediate phenotype. In this paper, we examined functional MRI and genotype data collected from 104 patients with DSM-IV-diagnosed schizophrenia and 122 healthy controls who participated in the Mind Clinical Imaging Consortium study of schizophrenia. Seven SNPs distributed over the whole CPLX2 gene were tested for association with working memory-elicited neural activity in a frontoparietal neural network. Three CPLX2 SNPs were significantly associated with increased neural activity in the dorsolateral prefrontal cortex and intraparietal sulcus in the schizophrenia sample, but showed no association in healthy controls. Finally, since increased working memory-related neural activity in individuals with or at risk for schizophrenia has been interpreted as ‘neural inefficiency,’ these findings suggest that certain variants of CPLX2 may contribute to impaired brain function in schizophrenia, possibly combined with other deleterious genetic variants, adverse environmental events, or developmental insults.« less
In search of causal variants: refining disease association signals using cross-population contrasts.
Saccone, Nancy L; Saccone, Scott F; Goate, Alison M; Grucza, Richard A; Hinrichs, Anthony L; Rice, John P; Bierut, Laura J
2008-08-29
Genome-wide association (GWA) using large numbers of single nucleotide polymorphisms (SNPs) is now a powerful, state-of-the-art approach to mapping human disease genes. When a GWA study detects association between a SNP and the disease, this signal usually represents association with a set of several highly correlated SNPs in strong linkage disequilibrium. The challenge we address is to distinguish among these correlated loci to highlight potential functional variants and prioritize them for follow-up. We implemented a systematic method for testing association across diverse population samples having differing histories and LD patterns, using a logistic regression framework. The hypothesis is that important underlying biological mechanisms are shared across human populations, and we can filter correlated variants by testing for heterogeneity of genetic effects in different population samples. This approach formalizes the descriptive comparison of p-values that has typified similar cross-population fine-mapping studies to date. We applied this method to correlated SNPs in the cholinergic nicotinic receptor gene cluster CHRNA5-CHRNA3-CHRNB4, in a case-control study of cocaine dependence composed of 504 European-American and 583 African-American samples. Of the 10 SNPs genotyped in the r2 > or = 0.8 bin for rs16969968, three demonstrated significant cross-population heterogeneity and are filtered from priority follow-up; the remaining SNPs include rs16969968 (heterogeneity p = 0.75). Though the power to filter out rs16969968 is reduced due to the difference in allele frequency in the two groups, the results nevertheless focus attention on a smaller group of SNPs that includes the non-synonymous SNP rs16969968, which retains a similar effect size (odds ratio) across both population samples. Filtering out SNPs that demonstrate cross-population heterogeneity enriches for variants more likely to be important and causative. Our approach provides an important and effective tool to help interpret results from the many GWA studies now underway.
Voruganti, V. Saroja; Cole, Shelley A.; Haack, Karin; Comuzzie, Anthony G.; Muzny, Donna M.; Wheeler, David A.; Chang, Kyle; Hawes, Alicia; Gibbs, Richard A.
2011-01-01
Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5′ and 3′ flanking regions of IRS2 (∼14.5 kb), were bidirectionally sequenced for single nucleotide polymorphism (SNP) discovery in 934 Hispanic children using 3730XL DNA Sequencers. Additionally, 15 SNPs derived from Illumina HumanOmni1-Quad BeadChips were analyzed. Measured genotype analysis tested associations between SNPs and obesity and diabetes-related traits. Bayesian quantitative trait nucleotide analysis was used to statistically infer the most likely functional polymorphisms. A total of 140 SNPs were identified with minor allele frequencies (MAF) ranging from 0.001 to 0.47. Forty-two of the 70 coding SNPs result in nonsynonymous amino acid substitutions relative to the consensus sequence; 28 SNPs were detected in the promoter, 12 in introns, 28 in the 3′-UTR, and 2 in the 5′-UTR. Two insertion/deletions (indels) were detected. Ten independent rare SNPs (MAF = 0.001–0.009) were associated with obesity-related traits (P = 0.01–0.00002). SNP 10510452_139 in the promoter region was shown to have a high posterior probability (P = 0.77–0.86) of influencing BMI, fat mass, and waist circumference in Hispanic children. SNP 10510452_139 contributed between 2 and 4% of the population variance in body weight and composition. None of the SNPs or indels were associated with diabetes-related traits or accounted for a previously identified quantitative trait locus on chromosome 13 for fasting serum glucose. Rare but not common IRS2 variants may play a role in the regulation of body weight but not an essential role in fasting glucose homeostasis in Hispanic children. PMID:21771880
Association Analysis of WNT10B With Bone Mass and Structure Among Individuals of African Ancestry
Zmuda, Joseph M; Yerges, Laura M; Kammerer, Candace M; Cauley, Jane A; Wang, Xiaojing; Nestlerode, Cara S; Wheeler, Victor W; Patrick, Alan L; Bunker, ClareAnn H; Moffett, Susan P; Ferrell, Robert E
2009-01-01
Wnts comprise a family of secreted growth factors that regulate the development and maintenance of many organs. Recently, Wnt10b was shown to stimulate osteoblastogenesis and bone formation in mice. To evaluate further the role of Wnt10b in bone health in humans, we performed bidirectional sequencing of ∼8 kb of the WNT10B gene region in 192 individuals (96 African, 96 white) to identify single nucleotide polymorphisms (SNPs). We identified 19 SNPs with minor allele frequency (MAF) ≥0.01. Ten of these SNPs were not present in the NCBI dbSNP database (build 127), whereas 10 of the 20 SNPs (50%) reported in dbSNP were not verified. We initially genotyped seven tagging SNPs that captured common (MAF ≥ 0.05) variation in the region with r 2 > 0.80 and a potentially functional SNP in exon 5 in 1035 Afro-Caribbean men ≥40 yr of age. Association analysis showed three SNPs in a 3′ region of linkage disequilibrium that were associated with DXA measures of hip BMD. Associations between two of these three SNPs (rs1051886, rs3741627) with hip BMD were replicated in an additional 980 Afro-Caribbean men (p < 0.05), in the combined sample of 2015 men (p ≤ 0.006), and in 416 individuals ≥18 yr of age (mean, 44 yr) belonging to eight extended, multigenerational Afro-Caribbean families with mean family size >50 (3535 relative pairs; p < 0.05). Further analysis showed that rs1051886 and rs3741627 were associated with cortical cross-sectional area, periosteal circumference, and BMC in the radius, such that individuals with the minor alleles had lower biomechanical indices of long-bone bending strength. This analysis implicates the WNT10B locus as a genetic element in the regulation of bone mass and structural geometry. PMID:19016593
Association analysis of WNT10B with bone mass and structure among individuals of African ancestry.
Zmuda, Joseph M; Yerges, Laura M; Kammerer, Candace M; Cauley, Jane A; Wang, Xiaojing; Nestlerode, Cara S; Wheeler, Victor W; Patrick, Alan L; Bunker, ClareAnn H; Moffett, Susan P; Ferrell, Robert E
2009-03-01
Wnts comprise a family of secreted growth factors that regulate the development and maintenance of many organs. Recently, Wnt10b was shown to stimulate osteoblastogenesis and bone formation in mice. To evaluate further the role of Wnt10b in bone health in humans, we performed bidirectional sequencing of approximately 8 kb of the WNT10B gene region in 192 individuals (96 African, 96 white) to identify single nucleotide polymorphisms (SNPs). We identified 19 SNPs with minor allele frequency (MAF) > or =0.01. Ten of these SNPs were not present in the NCBI dbSNP database (build 127), whereas 10 of the 20 SNPs (50%) reported in dbSNP were not verified. We initially genotyped seven tagging SNPs that captured common (MAF > or = 0.05) variation in the region with r (2) > 0.80 and a potentially functional SNP in exon 5 in 1035 Afro-Caribbean men > or =40 yr of age. Association analysis showed three SNPs in a 3' region of linkage disequilibrium that were associated with DXA measures of hip BMD. Associations between two of these three SNPs (rs1051886, rs3741627) with hip BMD were replicated in an additional 980 Afro-Caribbean men (p < 0.05), in the combined sample of 2015 men (p < or = 0.006), and in 416 individuals > or =18 yr of age (mean, 44 yr) belonging to eight extended, multigenerational Afro-Caribbean families with mean family size >50 (3535 relative pairs; p < 0.05). Further analysis showed that rs1051886 and rs3741627 were associated with cortical cross-sectional area, periosteal circumference, and BMC in the radius, such that individuals with the minor alleles had lower biomechanical indices of long-bone bending strength. This analysis implicates the WNT10B locus as a genetic element in the regulation of bone mass and structural geometry.
Wang, Le; Yu, Cuiping; Guo, Liang; Lin, Haoran; Meng, Zining
2015-01-01
The common coral trout is one species of major importance in commercial fisheries and aquaculture. Recently, two different color morphs of Plectropomus leopardus were discovered and the biological importance of the color difference is unknown. Since coral trout species are poorly characterized at the molecular level, we undertook the transcriptomic characterization of the two color morphs, one black and one red coral trout, using Illumina next generation sequencing technologies. The study produced 55162966 and 54588952 paired-end reads, for black and red trout, respectively. De novo transcriptome assembly generated 95367 and 99424 unique sequences in black and red trout, respectively, with 88813 sequences shared between them. Approximately 50% of both trancriptomes were functionally annotated by BLAST searches against protein databases. The two trancriptomes were enriched into 25 functional categories and showed similar profiles of Gene Ontology category compositions. 34110 unigenes were grouped into 259 KEGG pathways. Moreover, we identified 14649 simple sequence repeats (SSRs) and designed primers for potential application. We also discovered 130524 putative single nucleotide polymorphisms (SNPs) in the two transcriptomes, supplying potential genomic resources for the coral trout species. In addition, we identified 936 fast-evolving genes and 165 candidate genes under positive selection between the two color morphs. Finally, 38 candidate genes underlying the mechanism of color and pigmentation were also isolated. This study presents the first transcriptome resources for the common coral trout and provides basic information for the development of genomic tools for the identification, conservation, and understanding of the speciation and local adaptation of coral reef fish species. PMID:26713756
USDA-ARS?s Scientific Manuscript database
The concept of utilizing putative and unique gene sequences for the design of species specific probes was tested. The abundance profile of assigned functions within the Lactobacillus plantarum genome was used for the identification of the putative and unique gene sequence, csh. The targeted gene (cs...
Enhancer scanning to locate regulatory regions in genomic loci
Buckley, Melissa; Gjyshi, Anxhela; Mendoza-Fandiño, Gustavo; Baskin, Rebekah; Carvalho, Renato S.; Carvalho, Marcelo A.; Woods, Nicholas T.; Monteiro, Alvaro N.A.
2016-01-01
The present protocol provides a rapid, streamlined and scalable strategy to systematically scan genomic regions for the presence of transcriptional regulatory regions active in a specific cell type. It creates genomic tiles spanning a region of interest that are subsequently cloned by recombination into a luciferase reporter vector containing the Simian Virus 40 promoter. Tiling clones are transfected into specific cell types to test for the presence of transcriptional regulatory regions. The protocol includes testing of different SNP (single nucleotide polymorphism) alleles to determine their effect on regulatory activity. This procedure provides a systematic framework to identify candidate functional SNPs within a locus during functional analysis of genome-wide association studies. This protocol adapts and combines previous well-established molecular biology methods to provide a streamlined strategy, based on automated primer design and recombinational cloning to rapidly go from a genomic locus to a set of candidate functional SNPs in eight weeks. PMID:26658467
2017-01-01
Arsenic methylation, the primary biotransformation in the human body, is catalyzed by the enzyme As(III) S-adenosylmethionine (SAM) methyltransferases (hAS3MT). This process is thought to be protective from acute high-level arsenic exposure. However, with long-term low-level exposure, hAS3MT produces intracellular methylarsenite (MAs(III)) and dimethylarsenite (DMAs(III)), which are considerably more toxic than inorganic As(III) and may contribute to arsenic-related diseases. Several single nucleotide polymorphisms (SNPs) in putative regulatory elements of the hAS3MT gene have been shown to be protective. In contrast, three previously identified exonic SNPs (R173W, M287T, and T306I) may be deleterious. The goal of this study was to examine the effect of single amino acid substitutions in hAS3MT on the activity of the enzyme that might explain their contributions to adverse health effects of environmental arsenic. We identified five additional intragenic variants in hAS3MT (H51R, C61W, I136T, W203C, and R251H). We purified the eight polymorphic hAS3MT proteins and characterized their enzymatic properties. Each enzyme had low methylation activity through decreased affinity for substrate, lower overall rates of catalysis, or lower stability. We propose that amino acid substitutions in hAS3MT with decreased catalytic activity lead to detrimental responses to environmental arsenic and may increase the risk of arsenic-related diseases. PMID:28537708
Sturgeon conservation genomics: SNP discovery and validation using RAD sequencing.
Ogden, R; Gharbi, K; Mugue, N; Martinsohn, J; Senn, H; Davey, J W; Pourkazemi, M; McEwing, R; Eland, C; Vidotto, M; Sergeev, A; Congiu, L
2013-06-01
Caviar-producing sturgeons belonging to the genus Acipenser are considered to be one of the most endangered species groups in the world. Continued overfishing in spite of increasing legislation, zero catch quotas and extensive aquaculture production have led to the collapse of wild stocks across Europe and Asia. The evolutionary relationships among Adriatic, Russian, Persian and Siberian sturgeons are complex because of past introgression events and remain poorly understood. Conservation management, traceability and enforcement suffer a lack of appropriate DNA markers for the genetic identification of sturgeon at the species, population and individual level. This study employed RAD sequencing to discover and characterize single nucleotide polymorphism (SNP) DNA markers for use in sturgeon conservation in these four tetraploid species over three biological levels, using a single sequencing lane. Four population meta-samples and eight individual samples from one family were barcoded separately before sequencing. Analysis of 14.4 Gb of paired-end RAD data focused on the identification of SNPs in the paired-end contig, with subsequent in silico and empirical validation of candidate markers. Thousands of putatively informative markers were identified including, for the first time, SNPs that show population-wide differentiation between Russian and Persian sturgeons, representing an important advance in our ability to manage these cryptic species. The results highlight the challenges of genotyping-by-sequencing in polyploid taxa, while establishing the potential genetic resources for developing a new range of caviar traceability and enforcement tools. © 2013 John Wiley & Sons Ltd.
Wragg, Peter D; Johnson, Steven D
2011-09-01
Transitions from wind pollination to insect pollination were pivotal to the radiation of land plants, yet only a handful are known and the trait shifts required are poorly understood. We tested the hypothesis that a transition to insect pollination took place in the ancestrally wind-pollinated sedges (Cyperaceae) and that floral traits modified during this transition have functional significance. We paired putatively insect-pollinated Cyperus obtusiflorus and Cyperus sphaerocephalus with related, co-flowering, co-occurring wind-pollinated species, and compared pairs in terms of pollination mode and functional roles of floral traits. Experimentally excluding insects reduced seed set by 56-89% in putatively insect-pollinated species but not in intermingled wind-pollinated species. The pollen of putatively insect-pollinated species was less motile in a wind tunnel than that of wind-pollinated species. Bees, beetles and flies preferred inflorescences, and color-matched white or yellow models, of putatively insect-pollinated species over inflorescences, or color-matched brown models, of wind-pollinated species. Floral scents of putatively insect-pollinated species were chemically consistent with those of other insect-pollinated plants, and attracted pollinators; wind-pollinated species were unscented. These results show that a transition from wind pollination to insect pollination occurred in sedges and shed new light on the function of traits involved in this important transition. © 2011 The Authors. New Phytologist © 2011 New Phytologist Trust.
Flot, Jean-François; Tillier, Simon
2007-10-15
The complete mitochondrial genomes of two individuals attributed to different morphospecies of the scleractinian coral genus Pocillopora have been sequenced. Both genomes, respectively 17,415 and 17,422 nt long, share the presence of a previously undescribed ORF encoding a putative protein made up of 302 amino acids and of unknown function. Surprisingly, this ORF turns out to be the second most variable region of the mitochondrial genome (1% nucleotide sequence difference between the two individuals) after the putative control region (1.5% sequence difference). Except for the presence of this ORF and for the location of the putative control region, the mitochondrial genome of Pocillopora is organized in a fashion similar to the other scleractinian coral genomes published to date. For the first time in a cnidarian, a putative second origin of replication is described based on its secondary structure similar to the stem-loop structure of O(L), the origin of L-strand replication in vertebrates.
Zhuo, Rui Qun; Zhou, Ting Ting; Yang, Shi Ping; Chan, Siuming Francis
2017-07-01
Myostatin is an important member of the transforming growth factor (TGF) family that functions to regulate muscle growth in animals. In this study, the myostatin gene (FmMstn) and two slightly different (short and long forms) cDNAs of the banana shrimp Fenneropenaeus merguiensis were cloned and characterized. Similar to Mstn gene of the scallop, fish and mammal, FmMstn gene consists of 3 exons and 2 introns. The 2kb upstream promoter region of the FmMstn gene consists of putative response elements for myocyte enhancing factor (MEF2) and E-box factors. The longest open reading frame of the short Mstn consists of 1260bp encoding for a protein with 420 amino acid residues. The long FmMstn is almost identical to the short FmMstn with the exception of 8 amino acid insertions. FmMstn is most similar to the Mstn of Litopenaeus vannamei and Penaeus monodon sharing >92-98% amino acid sequence identity. Multiple sequence alignment results revealed high degree of amino acid conservation of the cysteine residues and mature peptide of the FmMstn with Mstn from other animals. FmMstn transcript was detected in the heart, muscle, optic nerve and thoracic ganglion. FmMstn transcript level in muscle is higher in early postmolt, decreases in intermolt and increases again towards ecdysis. Higher expression level of FmMstn is also observed in smaller shrimp of the same age. Knock-down of FmMstn gene by RNAi can cause a significant increase in molt cycle duration and failure of some shrimp to undergo ecdysis. Direct DNA sequencing results revealed that FmMstn gene is highly polymorphic and several potential SNPs have been identified. Some SNPs are associated with the size difference of the shrimp. In summary, the result of this study indicates that shrimp FmMstn gene is molt/growth-related and the presence of SNP suggests that it could be a candidate gene for shrimp genetic improvement research. Copyright © 2017. Published by Elsevier Inc.
RAD51B in Familial Breast Cancer
Pelttari, Liisa M.; Khan, Sofia; Vuorela, Mikko; Kiiski, Johanna I.; Vilske, Sara; Nevanlinna, Viivi; Ranta, Salla; Schleutker, Johanna; Winqvist, Robert; Kallioniemi, Anne; Dörk, Thilo; Bogdanova, Natalia V.; Figueroa, Jonine; Pharoah, Paul D. P.; Schmidt, Marjanka K.; Dunning, Alison M.; García-Closas, Montserrat; Bolla, Manjeet K.; Dennis, Joe; Michailidou, Kyriaki; Wang, Qin; Hopper, John L.; Southey, Melissa C.; Rosenberg, Efraim H.; Fasching, Peter A.; Beckmann, Matthias W.; Peto, Julian; dos-Santos-Silva, Isabel; Sawyer, Elinor J.; Tomlinson, Ian; Burwinkel, Barbara; Surowy, Harald; Guénel, Pascal; Truong, Thérèse; Bojesen, Stig E.; Nordestgaard, Børge G.; Benitez, Javier; González-Neira, Anna; Neuhausen, Susan L.; Anton-Culver, Hoda; Brenner, Hermann; Arndt, Volker; Meindl, Alfons; Schmutzler, Rita K.; Brauch, Hiltrud; Brüning, Thomas; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Hartikainen, Jaana M.; Chenevix-Trench, Georgia; Van Dyck, Laurien; Janssen, Hilde; Chang-Claude, Jenny; Rudolph, Anja; Radice, Paolo; Peterlongo, Paolo; Hallberg, Emily; Olson, Janet E.; Giles, Graham G.; Milne, Roger L.; Haiman, Christopher A.; Schumacher, Fredrick; Simard, Jacques; Dumont, Martine; Kristensen, Vessela; Borresen-Dale, Anne-Lise; Zheng, Wei; Beeghly-Fadiel, Alicia; Grip, Mervi; Andrulis, Irene L.; Glendon, Gord; Devilee, Peter; Seynaeve, Caroline; Hooning, Maartje J.; Collée, Margriet; Cox, Angela; Cross, Simon S.; Shah, Mitul; Luben, Robert N.; Hamann, Ute; Torres, Diana; Jakubowska, Anna; Lubinski, Jan; Couch, Fergus J.; Yannoukakos, Drakoulis; Orr, Nick; Swerdlow, Anthony; Darabi, Hatef; Li, Jingmei; Czene, Kamila; Hall, Per; Easton, Douglas F.; Mattson, Johanna; Blomqvist, Carl; Aittomäki, Kristiina; Nevanlinna, Heli
2016-01-01
Common variation on 14q24.1, close to RAD51B, has been associated with breast cancer: rs999737 and rs2588809 with the risk of female breast cancer and rs1314913 with the risk of male breast cancer. The aim of this study was to investigate the role of RAD51B variants in breast cancer predisposition, particularly in the context of familial breast cancer in Finland. We sequenced the coding region of RAD51B in 168 Finnish breast cancer patients from the Helsinki region for identification of possible recurrent founder mutations. In addition, we studied the known rs999737, rs2588809, and rs1314913 SNPs and RAD51B haplotypes in 44,791 breast cancer cases and 43,583 controls from 40 studies participating in the Breast Cancer Association Consortium (BCAC) that were genotyped on a custom chip (iCOGS). We identified one putatively pathogenic missense mutation c.541C>T among the Finnish cancer patients and subsequently genotyped the mutation in additional breast cancer cases (n = 5259) and population controls (n = 3586) from Finland and Belarus. No significant association with breast cancer risk was seen in the meta-analysis of the Finnish datasets or in the large BCAC dataset. The association with previously identified risk variants rs999737, rs2588809, and rs1314913 was replicated among all breast cancer cases and also among familial cases in the BCAC dataset. The most significant association was observed for the haplotype carrying the risk-alleles of all the three SNPs both among all cases (odds ratio (OR): 1.15, 95% confidence interval (CI): 1.11–1.19, P = 8.88 x 10−16) and among familial cases (OR: 1.24, 95% CI: 1.16–1.32, P = 6.19 x 10−11), compared to the haplotype with the respective protective alleles. Our results suggest that loss-of-function mutations in RAD51B are rare, but common variation at the RAD51B region is significantly associated with familial breast cancer risk. PMID:27149063
SNPs in PTGS2 and LTA Predict Pain and Quality of Life in Long Term Lung Cancer Survivors
Rausch, Sarah M.; Gonzalez, Brian D.; Clark, Matthew M.; Patten, Christi; Felten, Sara; Liu, Heshan; Li, Yafei; Sloan, Jeff; Yang, Ping
2015-01-01
PURPOSE Lung cancer survivors report the lowest quality of life relative to other cancer survivors. Pain is one of the most devastating, persistent, and incapacitating symptoms for lung cancer survivors. Prevalence rates vary with 80–100% of survivors experiencing cancer pain and healthcare costs are five times higher in cancer survivors with uncontrolled pain. Cancer pain often has a considerable impact on quality of life among cancer patients and cancer survivors. Therefore, early identification, and treatment is important. Although recent studies have suggested a relationship between single nucleotide polymorphisms (SNPs) in several cytokine and inflammation genes with cancer prognosis, associations with cancer pain are not clear. Therefore, the primary aim of this study was to identify SNPs related to pain in long term lung cancer survivors. PATIENTS AND METHODS Participants were enrolled in the Mayo Clinic Lung Cancer Cohort upon diagnosis of their lung cancer. 1149 Caucasian lung cancer survivors, (440 surviving < 3 years; 354 surviving 3–5 years; and 355 surviving> 5 years) completed study questionnaires and had genetic samples available. Ten SNPS from PTGS2 and LTA genes were selected based on the serum literature. Outcomes included pain, and quality of life as measured by the SF-8. RESULTS Of the 10 SNPs evaluated in LTA and PTGS2 genes, 3 were associated with pain severity (rs5277; rs1799964), social function (rs5277) and mental health (rs5275). These results suggested both specificity and consistency of these inflammatory gene SNPs in predicting pain severity in long term lung cancer survivors. CONCLUSION These results provide support for genetic predisposition to pain severity and may aid in identification of lung cancer survivors at high risk for morbidity and poor QOL. PMID:22464751
da Silva, Bruna S; Rovaris, Diego L; Schuch, Jaqueline B; Mota, Nina R; Cupertino, Renata B; Aroche, Angelita P; Bertuzzi, Guilherme P; Karam, Rafael G; Vitola, Eduardo S; Tovo-Rodrigues, Luciana; Grevet, Eugenio H; Bau, Claiton H D
2016-11-15
The corticotropin-releasing hormone receptor 1 (CRHR1) gene has been repeatedly implicated in Major Depressive Disorder (MDD) in humans and animal models; however, the findings are not absolutely convergent. Since recent evidence from genome-wide association studies suggests that narrowing the phenotypic heterogeneity may be crucial in genetic studies of MDD, the aim of this study was to evaluate the effects of CRHR1 polymorphisms on MDD while addressing the influence of sex and smoking status. The association of the CRHR1 SNPs rs12944712, rs110402, and rs878886 with MDD was evaluated in 629 Brazilian adults of European descent recruited from the general population [180 (28.6%) with lifetime MDD]. The sample was subdivided according to sex and smoking status RESULTS: Among nonsmokers, there were nominal associations between MDD and all tested SNPs (rs12944712, P=0.042; rs110402, P=0.031, and rs878886, P=0.040), regardless of sex. In addition, there were significant effects of rs110402 in women (P corr =0.034) and rs878886 in men (P corr =0.013). Among lifetime smokers, there were no significant associations between CRHR1 SNPs and MDD LIMITATIONS: The lack of a depression rating scale; scarcity of information on the functionality of the CRHR1 SNPs; and relatively small sample sizes in some subgroups. Our results strengthen the evidence for the role of CRHR1 SNPs in MDD susceptibility and suggest that their effects may be modulated by sex and smoking status. These findings suggest the perspective that reducing phenotypic heterogeneity is warranted in genetic studies of MDD. Copyright © 2016 Elsevier B.V. All rights reserved.
Borowska, Alicja; Szwaczkowski, Tomasz; Kamiński, Stanisław; Hering, Dorota M; Kordan, Władysław; Lecewicz, Marek
2018-05-01
Use of information theory can be an alternative statistical approach to detect genome regions and candidate genes that are associated with livestock traits. The aim of this study was to verify the validity of the SNPs effects on some semen quality variables of bulls using entropy analysis. Records from 288 Holstein-Friesian bulls from one AI station were included. The following semen quality variables were analyzed: CASA kinematic variables of sperm (total motility, average path velocity, straight line velocity, curvilinear velocity, amplitude of lateral head displacement, beat cross frequency, straightness, linearity), sperm membrane integrity (plazmolema, mitochondrial function), sperm ATP content. Molecular data included 48,192 SNPs. After filtering (call rate = 0.95 and MAF = 0.05), 34,794 SNPs were included in the entropy analysis. The entropy and conditional entropy were estimated for each SNP. Conditional entropy quantifies the remaining uncertainty about values of the variable with the knowledge of SNP. The most informative SNPs for each variable were determined. The computations were performed using the R statistical package. A majority of the loci had relatively small contributions. The most informative SNPs for all variables were mainly located on chromosomes: 3, 4, 5 and 16. The results from the study indicate that important genome regions and candidate genes that determine semen quality variables in bulls are located on a number of chromosomes. Some detected clusters of SNPs were located in RNA (U6 and 5S_rRNA) for all the variables for which analysis occurred. Associations between PARK2 as well GALNT13 genes and some semen characteristics were also detected. Copyright © 2018 Elsevier B.V. All rights reserved.
Ostrovsky, Olga; Korostishevsky, Michael; Shafat, Itay; Mayorov, Margarita; Ilan, Neta; Vlodavsky, Israel; Nagler, Arnon
2009-01-01
Heparanase is an endo-β-glucuronidase that specifically cleaves the saccharide chains of heparan sulfate proteoglycans. Heparanase plays important roles in processes such as angiogenesis, tumor metastasis, tissue repair and remodeling, inflammation and autoimmunity. Genetic variations of the heparanase gene (HPSE) have been associated with heparanase transcription level. The present study was undertaken to identify haplotype or single nucleotide polymorphisms (SNPs) genotype combinations that correlate with heparanase expression both at the mRNA and protein levels. For this purpose, 11 HPSE gene SNPs were genotyped among 108 healthy individuals. Five out of the eleven polymorphisms revealed an association between the SNPs and heparanase expression. SNP rs4693608 exhibited a strong evidence of association. Analysis of haplotypes distribution revealed that the combination of two SNPs (rs4693608 and rs4364254) disclosed the most significant result. This approach allowed segregation of possible genotype combinations to three groups that correlate with low (LR: GG-CC, GG-CT, GG-TT, GA-CC), intermediate (MR: GA-CT, GA-TT) and high (HR: AA-TT, AA-CT) heparanase expression. Unexpectedly, LR genotype combinations were associated with low mRNA expressions level and high heparanase concentration in plasma, while HR genotype combinations were associated with high expression of mRNA and low plasma protein level. Because the main site of activity of secreted active heparanase is the extracellular matrix and cell surface, the origin and functional significance of plasma heparanase remain to be investigated. The current study indicates that rs4693608 and rs4364254 SNPs are involved in the regulation of heparanase expression and provides the basis for further studies on the association between HPSE gene SNPs and disease outcome. PMID:19406828
Keene, Keith L.; Mychaleckyj, Josyf C.; Smith, Shelly G.; Leak, Tennille S.; Perlegas, Peter S.; Langefeld, Carl D.; Herrington, David M.; Freedman, Barry I.; Rich, Stephen S.; Bowden, Donald W.; Sale, Michèle M.
2009-01-01
We previously investigated the estrogen receptor α gene (ESR1) as a positional candidate for type 2 diabetes (T2DM), and found evidence for association between the intron 1-intron 2 region of this gene and type 2 diabetes and/or nephropathy in an African American (AA) population. Our objective was to comprehensively evaluate variants across the entire ESR1 gene for association in AA with T2DM and End Stage Renal Disease (T2DM-ESRD). One hundred fifty SNPs in ESR1, spanning 476 kb, were genotyped in 577 AA individuals with T2DM-ESRD and 596 AA controls. Genotypic association tests for dominant, additive, and recessive models, and haplotypic association, were calculated using a χ2 statistic and corresponding P value. Thirty-one SNPs showed nominal evidence for association (P< 0.05) with T2DM-ESRD in one or more genotypic model. After correcting for multiple tests, promoter SNP rs11964281 (nominal P=0.000291, adjusted P=0.0289), and intron 4 SNPs rs1569788 (nominal P=0.000754, adjusted P=0.0278) and rs9340969 (nominal P=0.00109, adjusted P=0.0467) remained significant at experimentwise error rate (EER) P<0.05 for the dominant class of tests. Twenty-three of the thirty-one associated SNPs cluster within the intron 4-intron 6 region. Gender stratification revealed nominal evidence for association with 35 SNPs in females (352 cases; 306 controls) and seven SNPs in males (225 cases; 290 controls). We have identified a novel region of the ESR1 gene that may contain important functional polymorphisms in relation to susceptibility to T2DM and/or diabetic nephropathy. PMID:18305958
Keene, Keith L; Mychaleckyj, Josyf C; Smith, Shelly G; Leak, Tennille S; Perlegas, Peter S; Langefeld, Carl D; Herrington, David M; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M
2008-05-01
We previously investigated the estrogen receptor alpha gene (ESR1) as a positional candidate for type 2 diabetes (T2DM), and found evidence for association between the intron 1-intron 2 region of this gene and T2DM and/or nephropathy in an African American (AA) population. Our objective was to comprehensively evaluate variants across the entire ESR1 gene for association in AA with T2DM and end stage renal disease (T2DM-ESRD). One hundred fifty SNPs in ESR1, spanning 476 kb, were genotyped in 577 AA individuals with T2DM-ESRD and 596 AA controls. Genotypic association tests for dominant, additive, and recessive models, and haplotypic association, were calculated using a chi(2) statistic and corresponding P value. Thirty-one SNPs showed nominal evidence for association (P < 0.05) with T2DM-ESRD in one or more genotypic model. After correcting for multiple tests, promoter SNP rs11964281 (nominal P = 0.000291, adjusted P = 0.0289), and intron 4 SNPs rs1569788 (nominal P = 0.000754, adjusted P = 0.0278) and rs9340969 (nominal P = 0.00109, adjusted P = 0.0467) remained significant at experimentwise error rate (EER) P = 0.05 for the dominant class of tests. Twenty-three of the thirty-one associated SNPs cluster within the intron 4-intron 6 regions. Gender stratification revealed nominal evidence for association with 35 SNPs in females (352 cases; 306 controls) and seven SNPs in males (225 cases; 290 controls). We have identified a novel region of the ESR1 gene that may contain important functional polymorphisms in relation to susceptibility to T2DM and/or diabetic nephropathy.
Genome-Wide Association of CKD Progression: The Chronic Renal Insufficiency Cohort Study.
Parsa, Afshin; Kanetsky, Peter A; Xiao, Rui; Gupta, Jayanta; Mitra, Nandita; Limou, Sophie; Xie, Dawei; Xu, Huichun; Anderson, Amanda Hyre; Ojo, Akinlolu; Kusek, John W; Lora, Claudia M; Hamm, L Lee; He, Jiang; Sandholm, Niina; Jeff, Janina; Raj, Dominic E; Böger, Carsten A; Bottinger, Erwin; Salimi, Shabnam; Parekh, Rulan S; Adler, Sharon G; Langefeld, Carl D; Bowden, Donald W; Groop, Per-Henrik; Forsblom, Carol; Freedman, Barry I; Lipkowitz, Michael; Fox, Caroline S; Winkler, Cheryl A; Feldman, Harold I
2017-03-01
The rate of decline of renal function varies significantly among individuals with CKD. To understand better the contribution of genetics to CKD progression, we performed a genome-wide association study among participants in the Chronic Renal Insufficiency Cohort Study. Our outcome of interest was CKD progression measured as change in eGFR over time among 1331 blacks and 1476 whites with CKD. We stratified all analyses by race and subsequently, diabetes status. Single-nucleotide polymorphisms (SNPs) that surpassed a significance threshold of P <1×10 -6 for association with eGFR slope were selected as candidates for follow-up and secondarily tested for association with proteinuria and time to ESRD. We identified 12 such SNPs among black patients and six such SNPs among white patients. We were able to conduct follow-up analyses of three candidate SNPs in similar (replication) cohorts and eight candidate SNPs in phenotype-related (validation) cohorts. Among blacks without diabetes, rs653747 in LINC00923 replicated in the African American Study of Kidney Disease and Hypertension cohort (discovery P =5.42×10 -7 ; replication P =0.039; combined P =7.42×10 -9 ). This SNP also associated with ESRD (hazard ratio, 2.0 (95% confidence interval, 1.5 to 2.7); P =4.90×10 -6 ). Similarly, rs931891 in LINC00923 associated with eGFR decline ( P =1.44×10 -4 ) in white patients without diabetes. In summary, SNPs in LINC00923 , an RNA gene expressed in the kidney, significantly associated with CKD progression in individuals with nondiabetic CKD. However, the lack of equivalent cohorts hampered replication for most discovery loci. Further replication of our findings in comparable study populations is warranted. Copyright © 2017 by the American Society of Nephrology.
MicroRNA Related Polymorphisms and Breast Cancer Risk
Khan, Sofia; Greco, Dario; Michailidou, Kyriaki; Milne, Roger L.; Muranen, Taru A.; Heikkinen, Tuomas; Aaltonen, Kirsimari; Dennis, Joe; Bolla, Manjeet K.; Liu, Jianjun; Hall, Per; Irwanto, Astrid; Humphreys, Keith; Li, Jingmei; Czene, Kamila; Chang-Claude, Jenny; Hein, Rebecca; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Fletcher, Olivia; Peto, Julian; dos Santos Silva, Isabel; Johnson, Nichola; Gibson, Lorna; Aitken, Zoe; Hopper, John L.; Tsimiklis, Helen; Bui, Minh; Makalic, Enes; Schmidt, Daniel F.; Southey, Melissa C.; Apicella, Carmel; Stone, Jennifer; Waisfisz, Quinten; Meijers-Heijboer, Hanne; Adank, Muriel A.; van der Luijt, Rob B.; Meindl, Alfons; Schmutzler, Rita K.; Müller-Myhsok, Bertram; Lichtner, Peter; Turnbull, Clare; Rahman, Nazneen; Chanock, Stephen J.; Hunter, David J.; Cox, Angela; Cross, Simon S.; Reed, Malcolm W. R.; Schmidt, Marjanka K.; Broeks, Annegien; Veer, Laura J. V. a. n't.; Hogervorst, Frans B.; Fasching, Peter A.; Schrauder, Michael G.; Ekici, Arif B.; Beckmann, Matthias W.; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; Benitez, Javier; Zamora, Pilar M.; Perez, Jose I. A.; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Pharoah, Paul D. P.; Dunning, Alison M.; Shah, Mitul; Luben, Robert; Brown, Judith; Couch, Fergus J.; Wang, Xianshu; Vachon, Celine; Olson, Janet E.; Lambrechts, Diether; Moisse, Matthieu; Paridaens, Robert; Christiaens, Marie-Rose; Guénel, Pascal; Truong, Thérèse; Laurent-Puig, Pierre; Mulot, Claire; Marme, Frederick; Burwinkel, Barbara; Schneeweiss, Andreas; Sohn, Christof; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Andrulis, Irene L.; Knight, Julia A.; Tchatchou, Sandrine; Mulligan, Anna Marie; Dörk, Thilo; Bogdanova, Natalia V.; Antonenkova, Natalia N.; Anton-Culver, Hoda; Darabi, Hatef; Eriksson, Mikael; Garcia-Closas, Montserrat; Figueroa, Jonine; Lissowska, Jolanta; Brinton, Louise; Devilee, Peter; Tollenaar, Robert A. E. M.; Seynaeve, Caroline; van Asperen, Christi J.; Kristensen, Vessela N.; Slager, Susan; Toland, Amanda E.; Ambrosone, Christine B.; Yannoukakos, Drakoulis; Lindblom, Annika; Margolin, Sara; Radice, Paolo; Peterlongo, Paolo; Barile, Monica; Mariani, Paolo; Hooning, Maartje J.; Martens, John W. M.; Collée, J. Margriet; Jager, Agnes; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Giles, Graham G.; McLean, Catriona; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Jones, Michael; Simard, Jacques; Goldberg, Mark S.; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Mannermaa, Arto; Hamann, Ute; Chenevix-Trench, Georgia; Blomqvist, Carl; Aittomäki, Kristiina; Easton, Douglas F.; Nevanlinna, Heli
2014-01-01
Genetic variations, such as single nucleotide polymorphisms (SNPs) in microRNAs (miRNA) or in the miRNA binding sites may affect the miRNA dependent gene expression regulation, which has been implicated in various cancers, including breast cancer, and may alter individual susceptibility to cancer. We investigated associations between miRNA related SNPs and breast cancer risk. First we evaluated 2,196 SNPs in a case-control study combining nine genome wide association studies (GWAS). Second, we further investigated 42 SNPs with suggestive evidence for association using 41,785 cases and 41,880 controls from 41 studies included in the Breast Cancer Association Consortium (BCAC). Combining the GWAS and BCAC data within a meta-analysis, we estimated main effects on breast cancer risk as well as risks for estrogen receptor (ER) and age defined subgroups. Five miRNA binding site SNPs associated significantly with breast cancer risk: rs1045494 (odds ratio (OR) 0.92; 95% confidence interval (CI): 0.88–0.96), rs1052532 (OR 0.97; 95% CI: 0.95–0.99), rs10719 (OR 0.97; 95% CI: 0.94–0.99), rs4687554 (OR 0.97; 95% CI: 0.95–0.99, and rs3134615 (OR 1.03; 95% CI: 1.01–1.05) located in the 3′ UTR of CASP8, HDDC3, DROSHA, MUSTN1, and MYCL1, respectively. DROSHA belongs to miRNA machinery genes and has a central role in initial miRNA processing. The remaining genes are involved in different molecular functions, including apoptosis and gene expression regulation. Further studies are warranted to elucidate whether the miRNA binding site SNPs are the causative variants for the observed risk effects. PMID:25390939
Identification of type 2 diabetes-associated combination of SNPs using support vector machine.
Ban, Hyo-Jeong; Heo, Jee Yeon; Oh, Kyung-Soo; Park, Keun-Joon
2010-04-23
Type 2 diabetes mellitus (T2D), a metabolic disorder characterized by insulin resistance and relative insulin deficiency, is a complex disease of major public health importance. Its incidence is rapidly increasing in the developed countries. Complex diseases are caused by interactions between multiple genes and environmental factors. Most association studies aim to identify individual susceptibility single markers using a simple disease model. Recent studies are trying to estimate the effects of multiple genes and multi-locus in genome-wide association. However, estimating the effects of association is very difficult. We aim to assess the rules for classifying diseased and normal subjects by evaluating potential gene-gene interactions in the same or distinct biological pathways. We analyzed the importance of gene-gene interactions in T2D susceptibility by investigating 408 single nucleotide polymorphisms (SNPs) in 87 genes involved in major T2D-related pathways in 462 T2D patients and 456 healthy controls from the Korean cohort studies. We evaluated the support vector machine (SVM) method to differentiate between cases and controls using SNP information in a 10-fold cross-validation test. We achieved a 65.3% prediction rate with a combination of 14 SNPs in 12 genes by using the radial basis function (RBF)-kernel SVM. Similarly, we investigated subpopulation data sets of men and women and identified different SNP combinations with the prediction rates of 70.9% and 70.6%, respectively. As the high-throughput technology for genome-wide SNPs improves, it is likely that a much higher prediction rate with biologically more interesting combination of SNPs can be acquired by using this method. Support Vector Machine based feature selection method in this research found novel association between combinations of SNPs and T2D in a Korean population.
2014-01-01
Background To better understand the genetic determination of udder health, we performed a genome-wide association study (GWAS) on a population of 2354 German Holstein bulls for which daughter yield deviations (DYD) for somatic cell score (SCS) were available. For this study, we used genetic information of 44 576 informative single nucleotide polymorphisms (SNPs) and 11 725 inferred haplotype blocks. Results When accounting for the sub-structure of the analyzed population, 16 SNPs and 10 haplotypes in six genomic regions were significant at the Bonferroni threshold of P ≤ 1.14 × 10-6. The size of the identified regions ranged from 0.05 to 5.62 Mb. Genomic regions on chromosomes 5, 6, 18 and 19 coincided with known QTL affecting SCS, while additional genomic regions were found on chromosomes 13 and X. Of particular interest is the region on chromosome 6 between 85 and 88 Mb, where QTL for mastitis traits and significant SNPs for SCS in different Holstein populations coincide with our results. In all identified regions, except for the region on chromosome X, significant SNPs were present in significant haplotypes. The minor alleles of identified SNPs on chromosomes 18 and 19, and the major alleles of SNPs on chromosomes 6 and X were favorable for a lower SCS. Differences in somatic cell count (SCC) between alternative SNP alleles reached 14 000 cells/mL. Conclusions The results support the polygenic nature of the genetic determination of SCS, confirm the importance of previously reported QTL, and provide evidence for the segregation of additional QTL for SCS in Holstein cattle. The small size of the regions identified here will facilitate the search for causal genetic variations that affect gene functions. PMID:24898131
MicroRNA related polymorphisms and breast cancer risk.
Khan, Sofia; Greco, Dario; Michailidou, Kyriaki; Milne, Roger L; Muranen, Taru A; Heikkinen, Tuomas; Aaltonen, Kirsimari; Dennis, Joe; Bolla, Manjeet K; Liu, Jianjun; Hall, Per; Irwanto, Astrid; Humphreys, Keith; Li, Jingmei; Czene, Kamila; Chang-Claude, Jenny; Hein, Rebecca; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Fletcher, Olivia; Peto, Julian; dos Santos Silva, Isabel; Johnson, Nichola; Gibson, Lorna; Aitken, Zoe; Hopper, John L; Tsimiklis, Helen; Bui, Minh; Makalic, Enes; Schmidt, Daniel F; Southey, Melissa C; Apicella, Carmel; Stone, Jennifer; Waisfisz, Quinten; Meijers-Heijboer, Hanne; Adank, Muriel A; van der Luijt, Rob B; Meindl, Alfons; Schmutzler, Rita K; Müller-Myhsok, Bertram; Lichtner, Peter; Turnbull, Clare; Rahman, Nazneen; Chanock, Stephen J; Hunter, David J; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Schmidt, Marjanka K; Broeks, Annegien; Van't Veer, Laura J; Hogervorst, Frans B; Fasching, Peter A; Schrauder, Michael G; Ekici, Arif B; Beckmann, Matthias W; Bojesen, Stig E; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Benitez, Javier; Zamora, Pilar M; Perez, Jose I A; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Pharoah, Paul D P; Dunning, Alison M; Shah, Mitul; Luben, Robert; Brown, Judith; Couch, Fergus J; Wang, Xianshu; Vachon, Celine; Olson, Janet E; Lambrechts, Diether; Moisse, Matthieu; Paridaens, Robert; Christiaens, Marie-Rose; Guénel, Pascal; Truong, Thérèse; Laurent-Puig, Pierre; Mulot, Claire; Marme, Frederick; Burwinkel, Barbara; Schneeweiss, Andreas; Sohn, Christof; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Andrulis, Irene L; Knight, Julia A; Tchatchou, Sandrine; Mulligan, Anna Marie; Dörk, Thilo; Bogdanova, Natalia V; Antonenkova, Natalia N; Anton-Culver, Hoda; Darabi, Hatef; Eriksson, Mikael; Garcia-Closas, Montserrat; Figueroa, Jonine; Lissowska, Jolanta; Brinton, Louise; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; van Asperen, Christi J; Kristensen, Vessela N; Slager, Susan; Toland, Amanda E; Ambrosone, Christine B; Yannoukakos, Drakoulis; Lindblom, Annika; Margolin, Sara; Radice, Paolo; Peterlongo, Paolo; Barile, Monica; Mariani, Paolo; Hooning, Maartje J; Martens, John W M; Collée, J Margriet; Jager, Agnes; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Giles, Graham G; McLean, Catriona; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Jones, Michael; Simard, Jacques; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Mannermaa, Arto; Hamann, Ute; Chenevix-Trench, Georgia; Blomqvist, Carl; Aittomäki, Kristiina; Easton, Douglas F; Nevanlinna, Heli
2014-01-01
Genetic variations, such as single nucleotide polymorphisms (SNPs) in microRNAs (miRNA) or in the miRNA binding sites may affect the miRNA dependent gene expression regulation, which has been implicated in various cancers, including breast cancer, and may alter individual susceptibility to cancer. We investigated associations between miRNA related SNPs and breast cancer risk. First we evaluated 2,196 SNPs in a case-control study combining nine genome wide association studies (GWAS). Second, we further investigated 42 SNPs with suggestive evidence for association using 41,785 cases and 41,880 controls from 41 studies included in the Breast Cancer Association Consortium (BCAC). Combining the GWAS and BCAC data within a meta-analysis, we estimated main effects on breast cancer risk as well as risks for estrogen receptor (ER) and age defined subgroups. Five miRNA binding site SNPs associated significantly with breast cancer risk: rs1045494 (odds ratio (OR) 0.92; 95% confidence interval (CI): 0.88-0.96), rs1052532 (OR 0.97; 95% CI: 0.95-0.99), rs10719 (OR 0.97; 95% CI: 0.94-0.99), rs4687554 (OR 0.97; 95% CI: 0.95-0.99, and rs3134615 (OR 1.03; 95% CI: 1.01-1.05) located in the 3' UTR of CASP8, HDDC3, DROSHA, MUSTN1, and MYCL1, respectively. DROSHA belongs to miRNA machinery genes and has a central role in initial miRNA processing. The remaining genes are involved in different molecular functions, including apoptosis and gene expression regulation. Further studies are warranted to elucidate whether the miRNA binding site SNPs are the causative variants for the observed risk effects.
Won, Hong-Hee; Natarajan, Pradeep; Dobbyn, Amanda; Jordan, Daniel M.; Roussos, Panos; Lage, Kasper; Raychaudhuri, Soumya
2015-01-01
Large genome-wide association studies (GWAS) have identified many genetic loci associated with risk for myocardial infarction (MI) and coronary artery disease (CAD). Concurrently, efforts such as the National Institutes of Health (NIH) Roadmap Epigenomics Project and the Encyclopedia of DNA Elements (ENCODE) Consortium have provided unprecedented data on functional elements of the human genome. In the present study, we systematically investigate the biological link between genetic variants associated with this complex disease and their impacts on gene function. First, we examined the heritability of MI/CAD according to genomic compartments. We observed that single nucleotide polymorphisms (SNPs) residing within nearby regulatory regions show significant polygenicity and contribute between 59–71% of the heritability for MI/CAD. Second, we showed that the polygenicity and heritability explained by these SNPs are enriched in histone modification marks in specific cell types. Third, we found that a statistically higher number of 45 MI/CAD-associated SNPs that have been identified from large-scale GWAS studies reside within certain functional elements of the genome, particularly in active enhancer and promoter regions. Finally, we observed significant heterogeneity of this signal across cell types, with strong signals observed within adipose nuclei, as well as brain and spleen cell types. These results suggest that the genetic etiology of MI/CAD is largely explained by tissue-specific regulatory perturbation within the human genome. PMID:26509271
Shen, Kui; Ramirez, Benjamin; Mapes, Brandon; Shen, Grace R.; Gokhale, Vijay; Brown, Mary E.; Santarsiero, Bernard; Ishii, Yoshitaka; Dudek, Steven M.; Wang, Ting; Garcia, Joe G. N.
2015-01-01
The MYLK gene encodes the multifunctional enzyme, myosin light chain kinase (MLCK), involved in isoform-specific non-muscle and smooth muscle contraction and regulation of vascular permeability during inflammation. Three MYLK SNPs (P21H, S147P, V261A) alter the N-terminal amino acid sequence of the non-muscle isoform of MLCK (nmMLCK) and are highly associated with susceptibility to acute lung injury (ALI) and asthma, especially in individuals of African descent. To understand the functional effects of SNP associations, we examined the N-terminal segments of nmMLCK by 1H-15N heteronuclear single quantum correlation (HSQC) spectroscopy, a 2-D NMR technique, and by in silico molecular modeling. Both NMR analysis and molecular modeling indicated SNP localization to loops that connect the immunoglobulin-like domains of nmMLCK, consistent with minimal structural changes evoked by these SNPs. Molecular modeling analysis identified protein-protein interaction motifs adversely affected by these MYLK SNPs including binding by the scaffold protein 14-3-3, results confirmed by immunoprecipitation and western blot studies. These structure-function studies suggest novel mechanisms for nmMLCK regulation, which may confirm MYLK as a candidate gene in inflammatory lung disease and advance knowledge of the genetic underpinning of lung-related health disparities. PMID:26111161