Sample records for non-synonymous coding snps

  1. Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences.

    PubMed

    Pang, Erli; Wu, Xiaomei; Lin, Kui

    2016-06-01

    Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.

  2. Genetic diversity of tyrosine hydroxylase (TH) and dopamine β-hydroxylase (DBH) genes in cattle breeds

    PubMed Central

    Lourenco-Jaramillo, Diana Lelidett; Sifuentes-Rincón, Ana María; Parra-Bracamonte, Gaspar Manuel; de la Rosa-Reyna, Xochitl Fabiola; Segura-Cabrera, Aldo; Arellano-Vera, Williams

    2012-01-01

    DNA from four cattle breeds was used to re-sequence all of the exons and 56% of the introns of the bovine tyrosine hydroxylase (TH) gene and 97% and 13% of the bovine dopamine β-hydroxylase (DBH) coding and non-coding sequences, respectively. Two novel single nucleotide polymorphisms (SNPs) and a microsatellite motif were found in the TH sequences. The DBH sequences contained 62 nucleotide changes, including eight non-synonymous SNPs (nsSNPs) that are of particular interest because they may alter protein function and therefore affect the phenotype. These DBH nsSNPs resulted in amino acid substitutions that were predicted to destabilize the protein structure. Six SNPs (one from TH and five from DBH non-synonymous SNPs) were genotyped in 140 animals; all of them were polymorphic and had a minor allele frequency of > 9%. There were significant differences in the intra- and inter-population haplotype distributions. The haplotype differences between Brahman cattle and the three B. t. taurus breeds (Charolais, Holstein and Lidia) were interesting from a behavioural point of view because of the differences in temperament between these breeds. PMID:22888292

  3. Genome Analysis of the Domestic Dog (Korean Jindo) by Massively Parallel Sequencing

    PubMed Central

    Kim, Ryong Nam; Kim, Dae-Soo; Choi, Sang-Haeng; Yoon, Byoung-Ha; Kang, Aram; Nam, Seong-Hyeuk; Kim, Dong-Wook; Kim, Jong-Joo; Ha, Ji-Hong; Toyoda, Atsushi; Fujiyama, Asao; Kim, Aeri; Kim, Min-Young; Park, Kun-Hyang; Lee, Kang Seon; Park, Hong-Seog

    2012-01-01

    Although pioneering sequencing projects have shed light on the boxer and poodle genomes, a number of challenges need to be met before the sequencing and annotation of the dog genome can be considered complete. Here, we present the DNA sequence of the Jindo dog genome, sequenced to 45-fold average coverage using Illumina massively parallel sequencing technology. A comparison of the sequence to the reference boxer genome led to the identification of 4 675 437 single nucleotide polymorphisms (SNPs, including 3 346 058 novel SNPs), 71 642 indels and 8131 structural variations. Of these, 339 non-synonymous SNPs and 3 indels are located within coding sequences (CDS). In particular, 3 non-synonymous SNPs and a 26-bp deletion occur in the TCOF1 locus, implying that the difference observed in cranial facial morphology between Jindo and boxer dogs might be influenced by those variations. Through the annotation of the Jindo olfactory receptor gene family, we found 2 unique olfactory receptor genes and 236 olfactory receptor genes harbouring non-synonymous homozygous SNPs that are likely to affect smelling capability. In addition, we determined the DNA sequence of the Jindo dog mitochondrial genome and identified Jindo dog-specific mtDNA genotypes. This Jindo genome data upgrade our understanding of dog genomic architecture and will be a very valuable resource for investigating not only dog genetics and genomics but also human and dog disease genetics and comparative genomics. PMID:22474061

  4. Whole-Genome Sequencing of Theileria parva Strains Provides Insight into Parasite Migration and Diversification in the African Continent

    PubMed Central

    Hayashida, Kyoko; Abe, Takashi; Weir, William; Nakao, Ryo; Ito, Kimihito; Kajino, Kiichi; Suzuki, Yutaka; Jongejan, Frans; Geysen, Dirk; Sugimoto, Chihiro

    2013-01-01

    The disease caused by the apicomplexan protozoan parasite Theileria parva, known as East Coast fever or Corridor disease, is one of the most serious cattle diseases in Eastern, Central, and Southern Africa. We performed whole-genome sequencing of nine T. parva strains, including one of the vaccine strains (Kiambu 5), field isolates from Zambia, Uganda, Tanzania, or Rwanda, and two buffalo-derived strains. Comparison with the reference Muguga genome sequence revealed 34 814–121 545 single nucleotide polymorphisms (SNPs) that were more abundant in buffalo-derived strains. High-resolution phylogenetic trees were constructed with selected informative SNPs that allowed the investigation of possible complex recombination events among ancestors of the extant strains. We further analysed the dN/dS ratio (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) for 4011 coding genes to estimate potential selective pressure. Genes under possible positive selection were identified that may, in turn, assist in the identification of immunogenic proteins or vaccine candidates. This study elucidated the phylogeny of T. parva strains based on genome-wide SNPs analysis with prediction of possible past recombination events, providing insight into the migration, diversification, and evolution of this parasite species in the African continent. PMID:23404454

  5. Whole-genome sequencing of Theileria parva strains provides insight into parasite migration and diversification in the African continent.

    PubMed

    Hayashida, Kyoko; Abe, Takashi; Weir, William; Nakao, Ryo; Ito, Kimihito; Kajino, Kiichi; Suzuki, Yutaka; Jongejan, Frans; Geysen, Dirk; Sugimoto, Chihiro

    2013-06-01

    The disease caused by the apicomplexan protozoan parasite Theileria parva, known as East Coast fever or Corridor disease, is one of the most serious cattle diseases in Eastern, Central, and Southern Africa. We performed whole-genome sequencing of nine T. parva strains, including one of the vaccine strains (Kiambu 5), field isolates from Zambia, Uganda, Tanzania, or Rwanda, and two buffalo-derived strains. Comparison with the reference Muguga genome sequence revealed 34 814-121 545 single nucleotide polymorphisms (SNPs) that were more abundant in buffalo-derived strains. High-resolution phylogenetic trees were constructed with selected informative SNPs that allowed the investigation of possible complex recombination events among ancestors of the extant strains. We further analysed the dN/dS ratio (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) for 4011 coding genes to estimate potential selective pressure. Genes under possible positive selection were identified that may, in turn, assist in the identification of immunogenic proteins or vaccine candidates. This study elucidated the phylogeny of T. parva strains based on genome-wide SNPs analysis with prediction of possible past recombination events, providing insight into the migration, diversification, and evolution of this parasite species in the African continent.

  6. Polymorphisms of clip domain serine proteinase and serine proteinase homolog in the swimming crab Portunus trituberculatus and their association with Vibrio alginolyticus

    NASA Astrophysics Data System (ADS)

    Liu, Meng; Liu, Yuan; Hui, Min; Song, Chengwen; Cui, Zhaoxia

    2017-03-01

    Clip domain serine proteases (cSPs) and their homologs (SPHs) play an important role in various biological processes that are essential components of extracellular signaling cascades, especially in the innate immune responses of invertebrates. Here, polymorphisms of PtcSP and PtSPH from the swimming crab Portunus trituberculatus were investigated to explore their association with resistance/susceptibility to Vibrio alginolyticus. Polymorphic loci were identified using Clustal X, and characterized with SPSS 16.0 software, and then the significance of genotype and allele frequencies between resistant and susceptible stocks was determined by a χ 2 test. A total of 109 and 77 single nucleotide polymorphisms (SNPs) were identified in the genomic fragments of PtcSP and PtSPH, respectively. Notably, nearly half of PtSPH polymorphisms were found in the non-coding exon 1. Fourteen SNPs investigated were significantly associated with susceptibility/resistance to V. alginolyticus ( P <0.05). Among them, eight SNPs were observed in introns, and one synonymous, four non-synonymous SNPs and one ins-del were found in coding exons. In addition, five simple sequence repeats (SSRs) were detected in intron 3 of PtcSP. Although there was no statistically significant difference of allele frequencies, the SSRs showed different polymorphic alleles on the basis of the repeat number between resistant and susceptible stocks. After further validation, polymorphisms investigated here might be applied to select potential molecular markers of P. trituberculatus with resistance to V. alginolyticus.

  7. Regions of extreme synonymous codon selection in mammalian genes

    PubMed Central

    Schattner, Peter; Diekhans, Mark

    2006-01-01

    Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911

  8. Transcriptome-wide single nucleotide polymorphisms (SNPs) for abalone (Haliotis midae): validation and application using GoldenGate medium-throughput genotyping assays.

    PubMed

    Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay

    2013-09-23

    Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.

  9. Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout.

    PubMed

    Al-Tobasei, Rafet; Ali, Ali; Leeds, Timothy D; Liu, Sixin; Palti, Yniv; Kenney, Brett; Salem, Mohamed

    2017-08-07

    Coding/functional SNPs change the biological function of a gene and, therefore, could serve as "large-effect" genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait). GATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7-93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways. These results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout.

  10. Profiling deleterious non-synonymous SNPs of smoker's gene CYP1A1.

    PubMed

    Ramesh, A Sai; Khan, Imran; Farhan, Md; Thiagarajan, Padma

    2013-01-01

    CYP1A1 gene belongs to the cytochrome P450 family and is known better as smokers' gene due to its hyperactivation as a consequence of long term smoking. The expression of CYP1A1 induces polycyclic aromatic hydrocarbon production in the lungs, which when over expressed, is known to cause smoking related diseases, such as cardiovascular pathologies, cancer, and diabetes. Single nucleotide polymorphisms (SNPs) are the simplest form of genetic variations that occur at a higher frequency, and are denoted as synonymous and non-synonymous SNPs on the basis of their effects on the amino acids. This study adopts a systematic in silico approach to predict the deleterious SNPs that are associated with disease conditions. It is inferred that four SNPs are highly deleterious, among which the SNP with rs17861094 is commonly predicted to be harmful by all tools. Hydrophobic (isoleucine) to hydrophilic (serine) amino acid variation was observed in the candidate gene. Hence, this investigation aims to characterize a candidate gene from 159 SNPs of CYP1A1.

  11. Associations between variants of FADS genes and omega-3 and omega-6 milk fatty acids of Canadian Holstein cows

    PubMed Central

    2014-01-01

    Background Fatty acid desaturase 1 (FADS1) and 2 (FADS2) genes code respectively for the enzymes delta-5 and delta-6 desaturases which are rate limiting enzymes in the synthesis of polyunsaturated omega-3 and omega-6 fatty acids (FAs). Omega-3 and-6 FAs as well as conjugated linoleic acid (CLA) are present in bovine milk and have demonstrated positive health effects in humans. Studies in humans have shown significant relationships between genetic variants in FADS1 and 2 genes with plasma and tissue concentrations of omega-3 and-6 FAs. The aim of this study was to evaluate the extent of sequence variations within these two genes in Canadian Holstein cows as well as the association between sequence variants and health promoting FAs in milk. Results Thirty three SNPs were detected within the studied regions of genes including a synonymous mutation (FADS1-07, rs42187261, 306Tyr > Tyr) in exon 8 of FADS1, a non-synonymous mutation (FADS2-14, rs211580559, 294Ala > Val) within FADS2 exon 7, a splice site SNP (FADS2-05, rs211263660), a 3′UTR SNP (FADS2-23, rs109772589), and another 3′UTR SNP with an effect on a microRNA binding site within FADS2 gene (FADS2-19, rs210169303). Association analyses showed significant relations between three out of seven tested SNPs and several FAs. Significant associations (FDR P < 0.05) were recorded between FADS2-23 (rs109772589) and two omega-6 FAs (dihomogamma linolenic acid [C20:3n6] and arachidonic acid [C20:4n6]), FADS1-07 (rs42187261) and one omega-3 FA (eicosapentaenoic acid, C20:5n3) and tricosanoic acid (C23:0), and one intronic SNP, FADS1-01 (rs136261927) and C20:3n6. Conclusion Our study has demonstrated positive associations between three SNPs within FADS1 and FADS2 genes (a SNP within the 3’UTR, a synonymous SNP and an intronic SNP), with three milk PUFAs of Canadian Holstein cows thus suggesting possible involvement of synonymous and non-coding region variants in FA synthesis. These SNPs may serve as potential genetic markers in breeding programs to increase milk FAs that are of benefit to human health. PMID:24533445

  12. Associations between variants of FADS genes and omega-3 and omega-6 milk fatty acids of Canadian Holstein cows.

    PubMed

    Ibeagha-Awemu, Eveline M; Akwanji, Kingsley A; Beaudoin, Frédéric; Zhao, Xin

    2014-02-17

    Fatty acid desaturase 1 (FADS1) and 2 (FADS2) genes code respectively for the enzymes delta-5 and delta-6 desaturases which are rate limiting enzymes in the synthesis of polyunsaturated omega-3 and omega-6 fatty acids (FAs). Omega-3 and-6 FAs as well as conjugated linoleic acid (CLA) are present in bovine milk and have demonstrated positive health effects in humans. Studies in humans have shown significant relationships between genetic variants in FADS1 and 2 genes with plasma and tissue concentrations of omega-3 and-6 FAs. The aim of this study was to evaluate the extent of sequence variations within these two genes in Canadian Holstein cows as well as the association between sequence variants and health promoting FAs in milk. Thirty three SNPs were detected within the studied regions of genes including a synonymous mutation (FADS1-07, rs42187261, 306Tyr > Tyr) in exon 8 of FADS1, a non-synonymous mutation (FADS2-14, rs211580559, 294Ala > Val) within FADS2 exon 7, a splice site SNP (FADS2-05, rs211263660), a 3'UTR SNP (FADS2-23, rs109772589), and another 3'UTR SNP with an effect on a microRNA binding site within FADS2 gene (FADS2-19, rs210169303). Association analyses showed significant relations between three out of seven tested SNPs and several FAs. Significant associations (FDR P < 0.05) were recorded between FADS2-23 (rs109772589) and two omega-6 FAs (dihomogamma linolenic acid [C20:3n6] and arachidonic acid [C20:4n6]), FADS1-07 (rs42187261) and one omega-3 FA (eicosapentaenoic acid, C20:5n3) and tricosanoic acid (C23:0), and one intronic SNP, FADS1-01 (rs136261927) and C20:3n6. Our study has demonstrated positive associations between three SNPs within FADS1 and FADS2 genes (a SNP within the 3'UTR, a synonymous SNP and an intronic SNP), with three milk PUFAs of Canadian Holstein cows thus suggesting possible involvement of synonymous and non-coding region variants in FA synthesis. These SNPs may serve as potential genetic markers in breeding programs to increase milk FAs that are of benefit to human health.

  13. Association of two synonymous splicing-associated CpG single nucleotide polymorphisms in calpain 10 and solute carrier family 2 member 2 with type 2 diabetes

    PubMed Central

    Karambataki, Maria; Malousi, Andigoni; Tzimagiorgis, Georgios; Haitoglou, Constantinos; Fragou, Aikaterini; Georgiou, Elisavet; Papadopoulou, Foteini; Krassas, Gerasimos E.; Kouidou, Sofia

    2017-01-01

    Coding synonymous single nucleotide polymorphisms (SNPs) have attracted little attention until recently. However, such SNPs located in epigenetic, CpG sites modifying exonic splicing enhancers (ESEs) can be informative with regards to the recently verified association of intragenic methylation and splicing. The present study describes the association of type 2 diabetes (T2D) with the exonic, synonymous, epigenetic SNPs, rs3749166 in calpain 10 (CAPN10) glucose transporter (GLUT4) translocator and rs5404 in solute carrier family 2, member 2 (SLC2A2), also termed GLUT2, which, according to prior bioinformatic analysis, strongly modify the splicing potential of glucose transport-associated genes. Previous association studies reveal that only rs5404 exhibits a strong negative T2D association, while data on the CAPN10 polymorphism are contradictory. In the present study DNA from blood samples of 99 Greek non-diabetic control subjects and 71 T2D patients was analyzed. In addition, relevant publicly available cases (40) resulting from examination of 110 Personal Genome Project data files were analyzed. The frequency of the rs3749166 A allele, was similar in the patients and non-diabetic control subjects. However, AG heterozygotes were more frequent among patients (73.24% for Greek patients and 54.55% for corresponding non-diabetic control subjects; P=0.0262; total cases, 52.99 and 75.00%, respectively; P=0.0039). The rs5404 T allele was only observed in CT heterozygotes (Greek non-diabetic control subjects, 39.39% and Greek patients, 22.54%; P=0.0205; total cases, 34.69 and 21.28%, respectively; P=0.0258). Notably, only one genotype, heterozygous AG/CC, was T2D-associated (Greek non-diabetic control subjects, 29.29% and Greek patients, 56.33%; P=0.004; total cases, 32.84 and 56.58%, respectively; P=0.0008). Furthermore, AG/CC was strongly associated with very high (≥8.5%) glycosylated plasma hemoglobin levels among patients (P=0.0002 for all cases). These results reveal the complex heterozygotic SNP association with T2D, and indicate possible synergies of these epigenetic, splicing-regulatory, synonymous SNPs, which modify the splicing potential of two alternative glucose transport-associated genes. PMID:28357066

  14. The effects of non-synonymous single nucleotide polymorphisms (nsSNPs) on protein-protein interactions.

    PubMed

    Yates, Christopher M; Sternberg, Michael J E

    2013-11-01

    Non-synonymous single nucleotide polymorphisms (nsSNPs) are single base changes leading to a change to the amino acid sequence of the encoded protein. Many of these variants are associated with disease, so nsSNPs have been well studied, with studies looking at the effects of nsSNPs on individual proteins, for example, on stability and enzyme active sites. In recent years, the impact of nsSNPs upon protein-protein interactions has also been investigated, giving a greater insight into the mechanisms by which nsSNPs can lead to disease. In this review, we summarize these studies, looking at the various mechanisms by which nsSNPs can affect protein-protein interactions. We focus on structural changes that can impair interaction, changes to disorder, gain of interaction, and post-translational modifications before looking at some examples of nsSNPs at human-pathogen protein-protein interfaces and the analysis of nsSNPs from a network perspective. © 2013.

  15. LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.

    PubMed

    Karchin, Rachel; Diekhans, Mark; Kelly, Libusha; Thomas, Daryl J; Pieper, Ursula; Eswar, Narayanan; Haussler, David; Sali, Andrej

    2005-06-15

    The NCBI dbSNP database lists over 9 million single nucleotide polymorphisms (SNPs) in the human genome, but currently contains limited annotation information. SNPs that result in amino acid residue changes (nsSNPs) are of critical importance in variation between individuals, including disease and drug sensitivity. We have developed LS-SNP, a genomic scale software pipeline to annotate nsSNPs. LS-SNP comprehensively maps nsSNPs onto protein sequences, functional pathways and comparative protein structure models, and predicts positions where nsSNPs destabilize proteins, interfere with the formation of domain-domain interfaces, have an effect on protein-ligand binding or severely impact human health. It currently annotates 28,043 validated SNPs that produce amino acid residue substitutions in human proteins from the SwissProt/TrEMBL database. Annotations can be viewed via a web interface either in the context of a genomic region or by selecting sets of SNPs, genes, proteins or pathways. These results are useful for identifying candidate functional SNPs within a gene, haplotype or pathway and in probing molecular mechanisms responsible for functional impacts of nsSNPs. http://www.salilab.org/LS-SNP CONTACT: rachelk@salilab.org http://salilab.org/LS-SNP/supp-info.pdf.

  16. EST-derived SNP discovery and selective pressure analysis in Pacific white shrimp ( Litopenaeus vannamei)

    NASA Astrophysics Data System (ADS)

    Liu, Chengzhang; Wang, Xia; Xiang, Jianhai; Li, Fuhua

    2012-09-01

    Pacific white shrimp has become a major aquaculture and fishery species worldwide. Although a large scale EST resource has been publicly available since 2008, the data have not yet been widely used for SNP discovery or transcriptome-wide assessment of selective pressure. In this study, a set of 155 411 expressed sequence tags (ESTs) from the NCBI database were computationally analyzed and 17 225 single nucleotide polymorphisms (SNPs) were predicted, including 9 546 transitions, 5 124 transversions and 2 481 indels. Among the 7 298 SNP substitutions located in functionally annotated contigs, 58.4% (4 262) are non-synonymous SNPs capable of introducing amino acid mutations. Two hundred and fifty nonsynonymous SNPs in genes associated with economic traits have been identified as candidates for markers in selective breeding. Diversity estimates among the synonymous nucleotides were on average 3.49 times greater than those in non-synonymous, suggesting negative selection. Distribution of non-synonymous to synonymous substitutions (Ka/Ks) ratio ranges from 0 to 4.01, (average 0.42, median 0.26), suggesting that the majority of the affected genes are under purifying selection. Enrichment analysis identified multiple gene ontology categories under positive or negative selection. Categories involved in innate immune response and male gamete generation are rich in positively selected genes, which is similar to reports in Drosophila and primates. This work is the first transcriptome-wide assessment of selective pressure in a Penaeid shrimp species. The functionally annotated SNPs provide a valuable resource of potential molecular markers for selective breeding.

  17. LS-SNP/PDB: annotated non-synonymous SNPs mapped to Protein Data Bank structures.

    PubMed

    Ryan, Michael; Diekhans, Mark; Lien, Stephanie; Liu, Yun; Karchin, Rachel

    2009-06-01

    LS-SNP/PDB is a new WWW resource for genome-wide annotation of human non-synonymous (amino acid changing) SNPs. It serves high-quality protein graphics rendered with UCSF Chimera molecular visualization software. The system is kept up-to-date by an automated, high-throughput build pipeline that systematically maps human nsSNPs onto Protein Data Bank structures and annotates several biologically relevant features. LS-SNP/PDB is available at (http://ls-snp.icm.jhu.edu/ls-snp-pdb) and via links from protein data bank (PDB) biology and chemistry tabs, UCSC Genome Browser Gene Details and SNP Details pages and PharmGKB Gene Variants Downloads/Cross-References pages.

  18. Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus).

    PubMed

    Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

    2015-07-27

    Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1-8) were identified and genotyped via direct sequencing covering most of the coding region and 3'UTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3'UTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs.

  19. Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus)

    PubMed Central

    Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

    2015-01-01

    Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1–8) were identified and genotyped via direct sequencing covering most of the coding region and 3ʹUTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3ʹUTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs. PMID:26225956

  20. Pharmacogenetics of human 3'-phosphoadenosine 5'-phosphosulfate synthetase 1 (PAPSS1): gene resequencing, sequence variation, and functional genomics.

    PubMed

    Xu, Zhen-Hua; Thomae, Bianca A; Eckloff, Bruce W; Wieben, Eric D; Weinshilboum, Richard M

    2003-06-01

    3'-Phosphoadenosine 5'-phosphosulfate (PAPS) is the high-energy "sulfate donor" for reactions catalyzed by sulfotransferase (SULT) enzymes. The strict requirement of SULTs for PAPS suggests that PAPS synthesis might influence the rate of sulfate conjugation. In humans, PAPS is synthesized from ATP and SO(4)(2-) by two isoforms of PAPS synthetase (PAPSS): PAPSS1 and PAPSS2. As a step toward pharmacogenetic studies, we have resequenced the entire coding sequence of the human PAPSS1 gene, including exon-intron splice junctions, using DNA samples from 60 Caucasian-American and 58 African-American subjects. Twenty-one genetic polymorphisms were observed-1 insertion-deletion event and 20 single nucleotide polymorphisms (SNPs)-including two non-synonymous coding SNPs (cSNPs) that altered the following amino acids: Arg333Cys and Glu531Gln. Twelve pairs of these polymorphisms were tightly linked, and a total of twelve unequivocal haplotypes could be identified-two that were common to both ethnic groups and ten that were ethnic-specific. The Arg333Cys polymorphism, with an allele frequency of 2.5%, was observed only in DNA samples from Caucasian subjects. The Glu531Gln polymorphism was rare, with only a single copy of that allele in a DNA sample from an African-American subject. Transient expression in mammalian cells showed that neither of the non-synonymous cSNPs resulted in a change in the basal level of enzyme activity measured under optimal assay conditions. However, the Glu531Gln polymorphism altered the substrate kinetic properties of the enzyme. The Gln531 variant allozyme had a 5-fold higher K(m) value for SO(4)(2-) than did the wild-type allozyme and displayed monophasic kinetics for Na(2)SO(4). The wild-type allozyme (Glu531) showed biphasic kinetics for that substrate. These observations represent a step toward testing the hypothesis that genetic variation in PAPS synthesis catalyzed by PAPSS1 might alter in vivo sulfate conjugation.

  1. Automated SNP detection from a large collection of white spruce expressed sequences: contributing factors and approaches for the categorization of SNPs

    PubMed Central

    Pavy, Nathalie; Parsons, Lee S; Paule, Charles; MacKay, John; Bousquet, Jean

    2006-01-01

    Background High-throughput genotyping technologies represent a highly efficient way to accelerate genetic mapping and enable association studies. As a first step toward this goal, we aimed to develop a resource of candidate Single Nucleotide Polymorphisms (SNP) in white spruce (Picea glauca [Moench] Voss), a softwood tree of major economic importance. Results A white spruce SNP resource encompassing 12,264 SNPs was constructed from a set of 6,459 contigs derived from Expressed Sequence Tags (EST) and by using the bayesian-based statistical software PolyBayes. Several parameters influencing the SNP prediction were analysed including the a priori expected polymorphism, the probability score (PSNP), and the contig depth and length. SNP detection in 3' and 5' reads from the same clones revealed a level of inconsistency between overlapping sequences as low as 1%. A subset of 245 predicted SNPs were verified through the independent resequencing of genomic DNA of a genotype also used to prepare cDNA libraries. The validation rate reached a maximum of 85% for SNPs predicted with either PSNP ≥ 0.95 or ≥ 0.99. A total of 9,310 SNPs were detected by using PSNP ≥ 0.95 as a criterion. The SNPs were distributed among 3,590 contigs encompassing an array of broad functional categories, with an overall frequency of 1 SNP per 700 nucleotide sites. Experimental and statistical approaches were used to evaluate the proportion of paralogous SNPs, with estimates in the range of 8 to 12%. The 3,789 coding SNPs identified through coding region annotation and ORF prediction, were distributed into 39% nonsynonymous and 61% synonymous substitutions. Overall, there were 0.9 SNP per 1,000 nonsynonymous sites and 5.2 SNPs per 1,000 synonymous sites, for a genome-wide nonsynonymous to synonymous substitution rate ratio (Ka/Ks) of 0.17. Conclusion We integrated the SNP data in the ForestTreeDB database along with functional annotations to provide a tool facilitating the choice of candidate genes for mapping purposes or association studies. PMID:16824208

  2. Bacterial Genetic Architecture of Ecological Interactions in Co-culture by GWAS-Taking Escherichia coli and Staphylococcus aureus as an Example.

    PubMed

    He, Xiaoqing; Jin, Yi; Ye, Meixia; Chen, Nan; Zhu, Jing; Wang, Jingqi; Jiang, Libo; Wu, Rongling

    2017-01-01

    How a species responds to such a biotic environment in the community, ultimately leading to its evolution, has been a topic of intense interest to ecological evolutionary biologists. Until recently, limited knowledge was available regarding the genotypic changes that underlie phenotypic changes. Our study implemented GWAS (Genome-Wide Association Studies) to illustrate the genetic architecture of ecological interactions that take place in microbial populations. By choosing 45 such interspecific pairs of Escherichia coli and Staphylococcus aureus strains that were all genotyped throughout the entire genome, we employed Q-ROADTRIPS to analyze the association between single SNPs and microbial abundance measured at each time point for bacterial populations reared in monoculture and co-culture, respectively. We identified a large number of SNPs and indels across the genomes (35.69 G clean data of E. coli and 50.41 G of S. aureus ). We reported 66 and 111 SNPs that were associated with interaction in E. coli and S. aureus , respectively. 23 out of 66 polymorphic changes resulted in amino acid alterations.12 significant genes, such as murE, treA, argS , and relA , which were also identified in previous evolutionary studies. In S. aureus , 111 SNPs detected in coding sequences could be divided into 35 non-synonymous and 76 synonymous SNPs. Our study illustrated the potential of genome-wide association methods for studying rapidly evolving traits in bacteria. Genetic association study methods will facilitate the identification of genetic elements likely to cause phenotypes of interest and provide targets for further laboratory investigation.

  3. How immunogenetically different are domestic pigs from wild boars: a perspective from single-nucleotide polymorphisms of 19 immunity-related candidate genes.

    PubMed

    Chen, Shanyuan; Gomes, Rui; Costa, Vânia; Santos, Pedro; Charneca, Rui; Zhang, Ya-ping; Liu, Xue-hong; Wang, Shao-qing; Bento, Pedro; Nunes, Jose-Luis; Buzgó, József; Varga, Gyula; Anton, István; Zsolnai, Attila; Beja-Pereira, Albano

    2013-10-01

    The coexistence of wild boars and domestic pigs across Eurasia makes it feasible to conduct comparative genetic or genomic analyses for addressing how genetically different a domestic species is from its wild ancestor. To test whether there are differences in patterns of genetic variability between wild and domestic pigs at immunity-related genes and to detect outlier loci putatively under selection that may underlie differences in immune responses, here we analyzed 54 single-nucleotide polymorphisms (SNPs) of 19 immunity-related candidate genes on 11 autosomes in three pairs of wild boar and domestic pig populations from China, Iberian Peninsula, and Hungary. Our results showed no statistically significant differences in allele frequency and heterozygosity across SNPs between three pairs of wild and domestic populations. This observation was more likely due to the widespread and long-lasting gene flow between wild boars and domestic pigs across Eurasia. In addition, we detected eight coding SNPs from six genes as outliers being under selection consistently by three outlier tests (BayeScan2.1, FDIST2, and Arlequin3.5). Among four non-synonymous outlier SNPs, one from TLR4 gene was identified as being subject to positive (diversifying) selection and three each from CD36, IFNW1, and IL1B genes were suggested as under balancing selection. All of these four non-synonymous variants were predicted as being benign by PolyPhen-2. Our results were supported by other independent lines of evidence for positive selection or balancing selection acting on these four immune genes (CD36, IFNW1, IL1B, and TLR4). Our study showed an example applying a candidate gene approach to identify functionally important mutations (i.e., outlier loci) in wild and domestic pigs for subsequent functional experiments.

  4. Employing genome-wide SNP discovery and genotyping strategy to extrapolate the natural allelic diversity and domestication patterns in chickpea

    PubMed Central

    Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

    2015-01-01

    The genome-wide discovery and high-throughput genotyping of SNPs in chickpea natural germplasm lines is indispensable to extrapolate their natural allelic diversity, domestication, and linkage disequilibrium (LD) patterns leading to the genetic enhancement of this vital legume crop. We discovered 44,844 high-quality SNPs by sequencing of 93 diverse cultivated desi, kabuli, and wild chickpea accessions using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays that were physically mapped across eight chromosomes of desi and kabuli. Of these, 22,542 SNPs were structurally annotated in different coding and non-coding sequence components of genes. Genes with 3296 non-synonymous and 269 regulatory SNPs could functionally differentiate accessions based on their contrasting agronomic traits. A high experimental validation success rate (92%) and reproducibility (100%) along with strong sensitivity (93–96%) and specificity (99%) of GBS-based SNPs was observed. This infers the robustness of GBS as a high-throughput assay for rapid large-scale mining and genotyping of genome-wide SNPs in chickpea with sub-optimal use of resources. With 23,798 genome-wide SNPs, a relatively high intra-specific polymorphic potential (49.5%) and broader molecular diversity (13–89%)/functional allelic diversity (18–77%) was apparent among 93 chickpea accessions, suggesting their tremendous applicability in rapid selection of desirable diverse accessions/inter-specific hybrids in chickpea crossbred varietal improvement program. The genome-wide SNPs revealed complex admixed domestication pattern, extensive LD estimates (0.54–0.68) and extended LD decay (400–500 kb) in a structured population inclusive of 93 accessions. These findings reflect the utility of our identified SNPs for subsequent genome-wide association study (GWAS) and selective sweep-based domestication trait dissection analysis to identify potential genomic loci (gene-associated targets) specifically regulating important complex quantitative agronomic traits in chickpea. The numerous informative genome-wide SNPs, natural allelic diversity-led domestication pattern, and LD-based information generated in our study have got multidimensional applicability with respect to chickpea genomics-assisted breeding. PMID:25873920

  5. Differential contribution of genomic regions to marked genetic variation and prediction of quantitative traits in broiler chickens.

    PubMed

    Abdollahi-Arpanahi, Rostam; Morota, Gota; Valente, Bruno D; Kranis, Andreas; Rosa, Guilherme J M; Gianola, Daniel

    2016-02-03

    Genome-wide association studies in humans have found enrichment of trait-associated single nucleotide polymorphisms (SNPs) in coding regions of the genome and depletion of these in intergenic regions. However, a recent release of the ENCyclopedia of DNA elements showed that ~80 % of the human genome has a biochemical function. Similar studies on the chicken genome are lacking, thus assessing the relative contribution of its genic and non-genic regions to variation is relevant for biological studies and genetic improvement of chicken populations. A dataset including 1351 birds that were genotyped with the 600K Affymetrix platform was used. We partitioned SNPs according to genome annotation data into six classes to characterize the relative contribution of genic and non-genic regions to genetic variation as well as their predictive power using all available quality-filtered SNPs. Target traits were body weight, ultrasound measurement of breast muscle and hen house egg production in broiler chickens. Six genomic regions were considered: intergenic regions, introns, missense, synonymous, 5' and 3' untranslated regions, and regions that are located 5 kb upstream and downstream of coding genes. Genomic relationship matrices were constructed for each genomic region and fitted in the models, separately or simultaneously. Kernel-based ridge regression was used to estimate variance components and assess predictive ability. Contribution of each class of genomic regions to dominance variance was also considered. Variance component estimates indicated that all genomic regions contributed to marked additive genetic variation and that the class of synonymous regions tended to have the greatest contribution. The marked dominance genetic variation explained by each class of genomic regions was similar and negligible (~0.05). In terms of prediction mean-square error, the whole-genome approach showed the best predictive ability. All genic and non-genic regions contributed to phenotypic variation for the three traits studied. Overall, the contribution of additive genetic variance to the total genetic variance was much greater than that of dominance variance. Our results show that all genomic regions are important for the prediction of the targeted traits, and the whole-genome approach was reaffirmed as the best tool for genome-enabled prediction of quantitative traits.

  6. Allelic variation of the Waxy gene in foxtail millet [Setaria italica (L.) P. Beauv.] by single nucleotide polymorphisms.

    PubMed

    Van, K; Onoda, S; Kim, M Y; Kim, K D; Lee, S-H

    2008-03-01

    The Waxy (Wx) gene product controls the formation of a straight chain polymer of amylose in the starch pathway. Dominance/recessiveness of the Wx allele is associated with amylose content, leading to non-waxy/waxy phenotypes. For a total of 113 foxtail millet accessions, agronomic traits and the molecular differences of the Wx gene were surveyed to evaluate genetic diversities. Molecular types were associated with phenotypes determined by four specific primer sets (non-waxy, Type I; low amylose, Type VI; waxy, Type IV or V). Additionally, the insertion of transposable element in waxy was confirmed by ex1/TSI2R, TSI2F/ex2, ex2int2/TSI7R and TSI7F/ex4r. Seventeen single nucleotide polymorphims (SNPs) were observed from non-coding regions, while three SNPs from coding regions were non-synonymous. Interestingly, the phenotype of No. 88 was still non-waxy, although seven nucleotides (AATTGGT) insertion at 2,993 bp led to 78 amino acids shorter. The rapid decline of r (2) in the sequenced region (exon 1-intron 1-exon 2) suggested a low level of linkage disequilibrium and limited haplotype structure. K (s) values and estimation of evolutionary events indicate early divergence of S. italica among cereal crops. This study suggested the Wx gene was one of the targets in the selection process during domestication.

  7. Polymorphisms in the Tlr4 and Tlr5 Gene Are Significantly Associated with Inflammatory Bowel Disease in German Shepherd Dogs

    PubMed Central

    Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin

    2010-01-01

    Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease. PMID:21203467

  8. Polymorphisms in the TLR4 and TLR5 gene are significantly associated with inflammatory bowel disease in German shepherd dogs.

    PubMed

    Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin

    2010-12-23

    Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease.

  9. Evaluation and identification of damaged single nucleotide polymorphisms in COL1A1 gene involved in osteoporosis

    PubMed Central

    Alsaif, Mohammed A.; Al Shammari, Sulaiman A.; Alhamdan, Adel A.

    2012-01-01

    Introduction Single-nucleotide polymorphisms (SNPs) are biomarkers for exploring the genetic basis of many complex human diseases. The prediction of SNPs is promising in modern genetic analysis but it is still a great challenge to identify the functional SNPs in a disease-related gene. The computational approach has overcome this challenge and an increase in the successful rate of genetic association studies and reduced cost of genotyping have been achieved. The objective of this study is to identify deleterious non-synonymous SNPs (nsSNPs) associated with the COL1A1 gene. Material and methods The SNPs were retrieved from the Single Nucleotide Polymorphism Database (dbSNP). Using I-Mutant, protein stability change was calculated. The potentially functional nsSNPs and their effect on proteins were predicted by PolyPhen and SIFT respectively. FASTSNP was used for estimation of risk score. Results Our analysis revealed 247 SNPs as non-synonymous, out of which 5 nsSNPs were found to be least stable by I-Mutant 2.0 with a DDG value of > –1.0. Four nsSNPs, namely rs17853657, rs17857117, rs57377812 and rs1059454, showed a highly deleterious tolerance index score of 0.00 with a change in their physicochemical properties by the SIFT server. Seven nsSNPs, namely rs1059454, rs8179178, rs17853657, rs17857117, rs72656340, rs72656344 and rs72656351, were found to be probably damaging with a PSIC score difference between 2.0 and 3.5 by the PolyPhen server. Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, were found to be highly polymorphic with a risk score of 3-4 with a possible effect of non-conservative change and splicing regulation by FASTSNP. Conclusions Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, are potential functional polymorphisms that are likely to have a functional impact on the COL1A1 gene. PMID:24273577

  10. Genome-wide association studies to identify rice salt-tolerance markers.

    PubMed

    Patishtan, Juan; Hartley, Tom N; Fonseca de Carvalho, Raquel; Maathuis, Frans J M

    2018-05-01

    Salinity is an ever increasing menace that affects agriculture worldwide. Crops such as rice are salt sensitive, but its degree of susceptibility varies widely between cultivars pointing to extensive genetic diversity that can be exploited to identify genes and proteins that are relevant in the response of rice to salt stress. We used a diversity panel of 306 rice accessions and collected phenotypic data after short (6 h), medium (7 d) and long (30 d) salinity treatment (50 mm NaCl). A genome-wide association study (GWAS) was subsequently performed, which identified around 1200 candidate genes from many functional categories, but this was treatment period dependent. Further analysis showed the presence of cation transporters and transcription factors with a known role in salinity tolerance and those that hitherto were not known to be involved in salt stress. Localization analysis of single nucleotide polymorphisms (SNPs) showed the presence of several hundred non-synonymous SNPs (nsSNPs) in coding regions and earmarked specific genomic regions with increased numbers of nsSNPs. It points to components of the ubiquitination pathway as important sources of genetic diversity that could underpin phenotypic variation in stress tolerance. © 2017 John Wiley & Sons Ltd.

  11. Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning

    PubMed Central

    Zhao, Nan; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry

    2014-01-01

    Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of large-scale protein-protein interaction networks, and can be useful for functional annotation of disease-associated SNPs. SNIP-IN tool is freely accessible as a web-server at http://korkinlab.org/snpintool/. PMID:24784581

  12. Partition dataset according to amino acid type improves the prediction of deleterious non-synonymous SNPs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Jing; Li, Yuan-Yuan; Shanghai Center for Bioinformation Technology, Shanghai 200235

    2012-03-02

    Highlights: Black-Right-Pointing-Pointer Proper dataset partition can improve the prediction of deleterious nsSNPs. Black-Right-Pointing-Pointer Partition according to original residue type at nsSNP is a good criterion. Black-Right-Pointing-Pointer Similar strategy is supposed promising in other machine learning problems. -- Abstract: Many non-synonymous SNPs (nsSNPs) are associated with diseases, and numerous machine learning methods have been applied to train classifiers for sorting disease-associated nsSNPs from neutral ones. The continuously accumulated nsSNP data allows us to further explore better prediction approaches. In this work, we partitioned the training data into 20 subsets according to either original or substituted amino acid type at the nsSNPmore » site. Using support vector machine (SVM), training classification models on each subset resulted in an overall accuracy of 76.3% or 74.9% depending on the two different partition criteria, while training on the whole dataset obtained an accuracy of only 72.6%. Moreover, the dataset was also randomly divided into 20 subsets, but the corresponding accuracy was only 73.2%. Our results demonstrated that partitioning the whole training dataset into subsets properly, i.e., according to the residue type at the nsSNP site, will improve the performance of the trained classifiers significantly, which should be valuable in developing better tools for predicting the disease-association of nsSNPs.« less

  13. Study of five novel non-synonymous polymorphisms in human brain-expressed genes in a Colombian sample.

    PubMed

    Ojeda, Diego A; Forero, Diego A

    2014-10-01

    Non-synonymous single nucleotide polymorphisms (nsSNPs) in brain-expressed genes represent interesting candidates for genetic research in neuropsychiatric disorders. To study novel nsSNPs in brain-expressed genes in a sample of Colombian subjects. We applied an approach based on in silico mining of available genomic data to identify and select novel nsSNPs in brain-expressed genes. We developed novel genotyping assays, based in allele-specific PCR methods, for these nsSNPs and genotyped them in 171 Colombian subjects. Five common nsSNPs (rs6855837; p.Leu395Ile, rs2305160; p.Thr394Ala, rs10503929; p.Met289Thr, rs2270641; p.Thr4Pro and rs3822659; p.Ser735Ala) were studied, located in the CLOCK, NPAS2, NRG1, SLC18A1 and WWC1 genes. We reported allele and genotype frequencies in a sample of South American healthy subjects. There is previous experimental evidence, arising from genome-wide expression and association studies, for the involvement of these genes in several neuropsychiatric disorders and endophenotypes, such as schizophrenia, mood disorders or memory performance. Frequencies for these nsSNPSs in the Colombian samples varied in comparison to different HapMap populations. Future study of these nsSNPs in brain-expressed genes, a synaptogenomics approach, will be important for a better understanding of neuropsychiatric diseases and endophenotypes in different populations.

  14. How to calculate the non-synonymous to synonymous rate ratio of protein-coding genes under the Fisher-Wright mutation-selection framework.

    PubMed

    Dos Reis, Mario

    2015-04-01

    First principles of population genetics are used to obtain formulae relating the non-synonymous to synonymous substitution rate ratio to the selection coefficients acting at codon sites in protein-coding genes. Two theoretical cases are discussed and two examples from real data (a chloroplast gene and a virus polymerase) are given. The formulae give much insight into the dynamics of non-synonymous substitutions and may inform the development of methods to detect adaptive evolution. © 2015 The Author(s) Published by the Royal Society. All rights reserved.

  15. Age-related macular degeneration-associated silent polymorphisms in HtrA1 impair its ability to antagonize insulin-like growth factor 1.

    PubMed

    Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius

    2013-05-01

    Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.

  16. Common non-synonymous SNPs associated with breast cancer susceptibility: findings from the Breast Cancer Association Consortium.

    PubMed

    Milne, Roger L; Burwinkel, Barbara; Michailidou, Kyriaki; Arias-Perez, Jose-Ignacio; Zamora, M Pilar; Menéndez-Rodríguez, Primitiva; Hardisson, David; Mendiola, Marta; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Dennis, Joe; Wang, Qin; Bolla, Manjeet K; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk; Ko, Yon-Dschun; Brauch, Hiltrud; Hamann, Ute; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Tajima, Kazuo; Li, Jingmei; Brand, Judith S; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lambrechts, Diether; Peuteman, Gilian; Christiaens, Marie-Rose; Smeets, Ann; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katazyna; Hartman, Mikael; Hui, Miao; Yen Lim, Wei; Wan Chan, Ching; Marme, Federick; Yang, Rongxi; Bugert, Peter; Lindblom, Annika; Margolin, Sara; García-Closas, Montserrat; Chanock, Stephen J; Lissowska, Jolanta; Figueroa, Jonine D; Bojesen, Stig E; Nordestgaard, Børge G; Flyger, Henrik; Hooning, Maartje J; Kriege, Mieke; van den Ouweland, Ans M W; Koppert, Linetta B; Fletcher, Olivia; Johnson, Nichola; dos-Santos-Silva, Isabel; Peto, Julian; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha J; Long, Jirong; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Schmidt, Marjanka K; Broeks, Annegien; Cornelissen, Sten; Braaf, Linde; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Noh, Dong-Young; Simard, Jacques; Dumont, Martine; Goldberg, Mark S; Labrèche, France; Fasching, Peter A; Hein, Alexander; Ekici, Arif B; Beckmann, Matthias W; Radice, Paolo; Peterlongo, Paolo; Azzollini, Jacopo; Barile, Monica; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Hopper, John L; Schmidt, Daniel F; Makalic, Enes; Southey, Melissa C; Hwang Teo, Soo; Har Yip, Cheng; Sivanandan, Kavitta; Tay, Wan-Ting; Shen, Chen-Yang; Hsiung, Chia-Ni; Yu, Jyh-Cherng; Hou, Ming-Feng; Guénel, Pascal; Truong, Therese; Sanchez, Marie; Mulot, Claire; Blot, William; Cai, Qiuyin; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Bogdanova, Natalia; Dörk, Thilo; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Zhang, Ben; Couch, Fergus J; Toland, Amanda E; Yannoukakos, Drakoulis; Sangrajrang, Suleeporn; McKay, James; Wang, Xianshu; Olson, Janet E; Vachon, Celine; Purrington, Kristen; Severi, Gianluca; Baglietto, Laura; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Ahmed, Shahana; Shah, Mitul; Pharoah, Paul D P; Hall, Per; Giles, Graham G; Benítez, Javier; Dunning, Alison M; Chenevix-Trench, Georgia; Easton, Douglas F

    2014-11-15

    Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility variants, although most studies have been underpowered to detect associations of a realistic magnitude. We assessed 41 common non-synonymous single-nucleotide polymorphisms (nsSNPs) for which evidence of association with breast cancer risk had been previously reported. Case-control data were combined from 38 studies of white European women (46 450 cases and 42 600 controls) and analyzed using unconditional logistic regression. Strong evidence of association was observed for three nsSNPs: ATXN7-K264R at 3p21 [rs1053338, per allele OR = 1.07, 95% confidence interval (CI) = 1.04-1.10, P = 2.9 × 10(-6)], AKAP9-M463I at 7q21 (rs6964587, OR = 1.05, 95% CI = 1.03-1.07, P = 1.7 × 10(-6)) and NEK10-L513S at 3p24 (rs10510592, OR = 1.10, 95% CI = 1.07-1.12, P = 5.1 × 10(-17)). The first two associations reached genome-wide statistical significance in a combined analysis of available data, including independent data from nine genome-wide association studies (GWASs): for ATXN7-K264R, OR = 1.07 (95% CI = 1.05-1.10, P = 1.0 × 10(-8)); for AKAP9-M463I, OR = 1.05 (95% CI = 1.04-1.07, P = 2.0 × 10(-10)). Further analysis of other common variants in these two regions suggested that intronic SNPs nearby are more strongly associated with disease risk. We have thus identified a novel susceptibility locus at 3p21, and confirmed previous suggestive evidence that rs6964587 at 7q21 is associated with risk. The third locus, rs10510592, is located in an established breast cancer susceptibility region; the association was substantially attenuated after adjustment for the known GWAS hit. Thus, each of the associated nsSNPs is likely to be a marker for another, non-coding, variant causally related to breast cancer risk. Further fine-mapping and functional studies are required to identify the underlying risk-modifying variants and the genes through which they act. © The Author 2014. Published by Oxford University Press.

  17. Common non-synonymous SNPs associated with breast cancer susceptibility: findings from the Breast Cancer Association Consortium

    PubMed Central

    Milne, Roger L.; Burwinkel, Barbara; Michailidou, Kyriaki; Arias-Perez, Jose-Ignacio; Zamora, M. Pilar; Menéndez-Rodríguez, Primitiva; Hardisson, David; Mendiola, Marta; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Dennis, Joe; Wang, Qin; Bolla, Manjeet K.; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk; Ko, Yon-Dschun; Brauch, Hiltrud; Hamann, Ute; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Tchatchou, Sandrine; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Tajima, Kazuo; Li, Jingmei; Brand, Judith S.; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lambrechts, Diether; Peuteman, Gilian; Christiaens, Marie-Rose; Smeets, Ann; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katazyna; Hartman, Mikael; Hui, Miao; Yen Lim, Wei; Wan Chan, Ching; Marme, Federick; Yang, Rongxi; Bugert, Peter; Lindblom, Annika; Margolin, Sara; García-Closas, Montserrat; Chanock, Stephen J.; Lissowska, Jolanta; Figueroa, Jonine D.; Bojesen, Stig E.; Nordestgaard, Børge G.; Flyger, Henrik; Hooning, Maartje J.; Kriege, Mieke; van den Ouweland, Ans M.W.; Koppert, Linetta B.; Fletcher, Olivia; Johnson, Nichola; dos-Santos-Silva, Isabel; Peto, Julian; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha J.; Long, Jirong; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Cox, Angela; Cross, Simon S.; Reed, Malcolm W.R.; Schmidt, Marjanka K.; Broeks, Annegien; Cornelissen, Sten; Braaf, Linde; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K.; Noh, Dong-Young; Simard, Jacques; Dumont, Martine; Goldberg, Mark S.; Labrèche, France; Fasching, Peter A.; Hein, Alexander; Ekici, Arif B.; Beckmann, Matthias W.; Radice, Paolo; Peterlongo, Paolo; Azzollini, Jacopo; Barile, Monica; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Hopper, John L.; Schmidt, Daniel F.; Makalic, Enes; Southey, Melissa C.; Hwang Teo, Soo; Har Yip, Cheng; Sivanandan, Kavitta; Tay, Wan-Ting; Shen, Chen-Yang; Hsiung, Chia-Ni; Yu, Jyh-Cherng; Hou, Ming-Feng; Guénel, Pascal; Truong, Therese; Sanchez, Marie; Mulot, Claire; Blot, William; Cai, Qiuyin; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Wu, Anna H.; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O.; Bogdanova, Natalia; Dörk, Thilo; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Zhang, Ben; Couch, Fergus J.; Toland, Amanda E.; Yannoukakos, Drakoulis; Sangrajrang, Suleeporn; McKay, James; Wang, Xianshu; Olson, Janet E.; Vachon, Celine; Purrington, Kristen; Severi, Gianluca; Baglietto, Laura; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Ahmed, Shahana; Shah, Mitul; Pharoah, Paul D.P.; Hall, Per; Giles, Graham G.; Benítez, Javier; Dunning, Alison M.; Chenevix-Trench, Georgia; Easton, Douglas F.; Berchuck, Andrew; Eeles, Rosalind A.; Olama, Ali Amin Al; Kote-Jarai, Zsofia; Benlloch, Sara; Antoniou, Antonis; McGuffog, Lesley; Offit, Ken; Lee, Andrew; Dicks, Ed; Luccarini, Craig; Tessier, Daniel C.; Bacot, Francois; Vincent, Daniel; LaBoissière, Sylvie; Robidoux, Frederic; Nielsen, Sune F.; Cunningham, Julie M.; Windebank, Sharon A.; Hilker, Christopher A.; Meyer, Jeffrey; Angelakos, Maggie; Maskiell, Judi; van der Schoot, Ellen; Rutgers, Emiel; Verhoef, Senno; Hogervorst, Frans; Boonyawongviroj, Prat; Siriwanarungsan, Pornthep; Schrauder, Michael; Rübner, Matthias; Oeser, Sonja; Landrith, Silke; Williams, Eileen; Ryder-Mills, Elaine; Sargus, Kara; McInerney, Niall; Colleran, Gabrielle; Rowan, Andrew; Jones, Angela; Sohn, Christof; Schneeweiß, Andeas; Bugert, Peter; Álvarez, Núria; Lacey, James; Wang, Sophia; Ma, Huiyan; Lu, Yani; Deapen, Dennis; Pinder, Rich; Lee, Eunjung; Schumacher, Fred; Horn-Ross, Pam; Reynolds, Peggy; Nelson, David; Ziegler, Hartwig; Wolf, Sonja; Hermann, Volker; Lo, Wing-Yee; Justenhoven, Christina; Baisch, Christian; Fischer, Hans-Peter; Brüning, Thomas; Pesch, Beate; Rabstein, Sylvia; Lotz, Anne; Harth, Volker; Heikkinen, Tuomas; Erkkilä, Irja; Aaltonen, Kirsimari; von Smitten, Karl; Antonenkova, Natalia; Hillemanns, Peter; Christiansen, Hans; Myöhänen, Eija; Kemiläinen, Helena; Thorne, Heather; Niedermayr, Eveline; Bowtell, D; Chenevix-Trench, G; deFazio, A; Gertig, D; Green, A; Webb, P; Green, A.; Parsons, P.; Hayward, N.; Webb, P.; Whiteman, D.; Fung, Annie; Yashiki, June; Peuteman, Gilian; Smeets, Dominiek; Brussel, Thomas Van; Corthouts, Kathleen; Obi, Nadia; Heinz, Judith; Behrens, Sabine; Eilber, Ursula; Celik, Muhabbet; Olchers, Til; Manoukian, Siranoush; Peissel, Bernard; Scuvera, Giulietta; Zaffaroni, Daniela; Bonanni, Bernardo; Feroce, Irene; Maniscalco, Angela; Rossi, Alessandra; Bernard, Loris; Tranchant, Martine; Valois, Marie-France; Turgeon, Annie; Heguy, Lea; Sze Yee, Phuah; Kang, Peter; Nee, Kang In; Mariapun, Shivaani; Sook-Yee, Yoon; Lee, Daphne; Ching, Teh Yew; Taib, Nur Aishah Mohd; Otsukka, Meeri; Mononen, Kari; Selander, Teresa; Weerasooriya, Nayana; staff, OFBCR; Krol-Warmerdam, E.; Molenaar, J.; Blom, J.; Brinton, Louise; Szeszenia-Dabrowska, Neonila; Peplonska, Beata; Zatonski, Witold; Chao, Pei; Stagner, Michael; Bos, Petra; Blom, Jannet; Crepin, Ellen; Nieuwlaat, Anja; Heemskerk, Annette; Higham, Sue; Cross, Simon; Cramp, Helen; Connley, Dan; Balasubramanian, Sabapathy; Brock, Ian; Luccarini, Craig; Conroy, Don; Baynes, Caroline; Chua, Kimberley

    2014-01-01

    Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility variants, although most studies have been underpowered to detect associations of a realistic magnitude. We assessed 41 common non-synonymous single-nucleotide polymorphisms (nsSNPs) for which evidence of association with breast cancer risk had been previously reported. Case-control data were combined from 38 studies of white European women (46 450 cases and 42 600 controls) and analyzed using unconditional logistic regression. Strong evidence of association was observed for three nsSNPs: ATXN7-K264R at 3p21 [rs1053338, per allele OR = 1.07, 95% confidence interval (CI) = 1.04–1.10, P = 2.9 × 10−6], AKAP9-M463I at 7q21 (rs6964587, OR = 1.05, 95% CI = 1.03–1.07, P = 1.7 × 10−6) and NEK10-L513S at 3p24 (rs10510592, OR = 1.10, 95% CI = 1.07–1.12, P = 5.1 × 10−17). The first two associations reached genome-wide statistical significance in a combined analysis of available data, including independent data from nine genome-wide association studies (GWASs): for ATXN7-K264R, OR = 1.07 (95% CI = 1.05–1.10, P = 1.0 × 10−8); for AKAP9-M463I, OR = 1.05 (95% CI = 1.04–1.07, P = 2.0 × 10−10). Further analysis of other common variants in these two regions suggested that intronic SNPs nearby are more strongly associated with disease risk. We have thus identified a novel susceptibility locus at 3p21, and confirmed previous suggestive evidence that rs6964587 at 7q21 is associated with risk. The third locus, rs10510592, is located in an established breast cancer susceptibility region; the association was substantially attenuated after adjustment for the known GWAS hit. Thus, each of the associated nsSNPs is likely to be a marker for another, non-coding, variant causally related to breast cancer risk. Further fine-mapping and functional studies are required to identify the underlying risk-modifying variants and the genes through which they act. PMID:24943594

  18. Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms.

    PubMed

    Zhang, Wei; Qi, Weihong; Albert, Thomas J; Motiwala, Alifiya S; Alland, David; Hyytia-Trees, Eija K; Ribot, Efrain M; Fields, Patricia I; Whittam, Thomas S; Swaminathan, Bala

    2006-06-01

    Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous protein-coding genes, limiting the use of nucleotide sequences to study the evolution and epidemiology of this bacterial pathogen. To systematically examine single nucleotide polymorphisms (SNPs) at a genome scale, we designed comparative genome sequencing microarrays and analyzed 1199 chromosomal genes (a total of 1,167,948 bp) and 92,721 bp of the large virulence plasmid (pO157) of eleven outbreak-associated STEC O157 strains. We discovered 906 SNPs in 523 chromosomal genes and observed a high level of DNA polymorphisms among the pO157 plasmids. Based on a uniform rate of synonymous substitution for Escherichia coli and Salmonella enterica (4.7x10(-9) per site per year), we estimate that the most recent common ancestor of the contemporary beta-glucuronidase-negative, non-sorbitolfermenting STEC O157 strains existed ca. 40 thousand years ago. The phylogeny of the STEC O157 strains based on the informative synonymous SNPs was compared to the maximum parsimony trees inferred from pulsed-field gel electrophoresis and multilocus variable numbers of tandem repeats analysis. The topological discrepancies indicate that, in contrast to the synonymous mutations, parts of STEC O157 genomes have evolved through different mechanisms with highly variable divergence rates. The SNP loci reported here will provide useful genetic markers for developing high-throughput methods for fine-resolution genotyping of STEC O157. Functional characterization of nucleotide polymorphisms should shed new insights on the evolution, epidemiology, and pathogenesis of STEC O157 and related pathogens.

  19. Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms

    PubMed Central

    Zhang, Wei; Qi, Weihong; Albert, Thomas J.; Motiwala, Alifiya S.; Alland, David; Hyytia-Trees, Eija K.; Ribot, Efrain M.; Fields, Patricia I.; Whittam, Thomas S.; Swaminathan, Bala

    2006-01-01

    Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous protein-coding genes, limiting the use of nucleotide sequences to study the evolution and epidemiology of this bacterial pathogen. To systematically examine single nucleotide polymorphisms (SNPs) at a genome scale, we designed comparative genome sequencing microarrays and analyzed 1199 chromosomal genes (a total of 1,167,948 bp) and 92,721 bp of the large virulence plasmid (pO157) of eleven outbreak-associated STEC O157 strains. We discovered 906 SNPs in 523 chromosomal genes and observed a high level of DNA polymorphisms among the pO157 plasmids. Based on a uniform rate of synonymous substitution for Escherichia coli and Salmonella enterica (4.7 × 10−9 per site per year), we estimate that the most recent common ancestor of the contemporary β-glucuronidase-negative, non-sorbitolfermenting STEC O157 strains existed ca. 40 thousand years ago. The phylogeny of the STEC O157 strains based on the informative synonymous SNPs was compared to the maximum parsimony trees inferred from pulsed-field gel electrophoresis and multilocus variable numbers of tandem repeats analysis. The topological discrepancies indicate that, in contrast to the synonymous mutations, parts of STEC O157 genomes have evolved through different mechanisms with highly variable divergence rates. The SNP loci reported here will provide useful genetic markers for developing high-throughput methods for fine-resolution genotyping of STEC O157. Functional characterization of nucleotide polymorphisms should shed new insights on the evolution, epidemiology, and pathogenesis of STEC O157 and related pathogens. PMID:16606700

  20. A survey of genome-wide single nucleotide polymorphisms through genome resequencing in the Périgord black truffle (Tuber melanosporum Vittad.).

    PubMed

    Payen, Thibaut; Murat, Claude; Gigant, Anaïs; Morin, Emmanuelle; De Mita, Stéphane; Martin, Francis

    2015-09-01

    The Périgord black truffle (Tuber melanosporum Vittad.), considered a gastronomic delicacy worldwide, is an ectomycorrhizal filamentous fungus that is ecologically important in Mediterranean French, Italian and Spanish woodlands. In this study, we developed a novel resource of single nucleotide polymorphisms (SNPs) for T. melanosporum using Illumina high-throughput resequencing. The genome from six T. melanosporum geographical accessions was sequenced to a depth of approximately 20×. These geographical accessions were selected from different populations within the northern and southern regions of the geographical species distribution. Approximately 80% of the reads for each of the six resequenced geographical accessions mapped against the reference T. melanosporum genome assembly, estimating the core genome size of this organism to be approximately 110 Mbp. A total of 442 326 SNPs corresponding to 3540 SNPs/Mbps were identified as being included in all seven genomes. The SNPs occurred more frequently in repeated sequences (85%), although 4501 SNPs were also identified in the coding regions of 2587 genes. Using the ratio of nonsynonymous mutations per nonsynonymous site (pN) to synonymous mutations per synonymous site (pS) and Tajima's D index scanning the whole genome, we were able to identify genomic regions and genes potentially subjected to positive or purifying selection. The SNPs identified represent a valuable resource for future population genetics and genomics studies. © 2015 John Wiley & Sons Ltd.

  1. The impact of single nucleotide polymorphism in monomeric alpha-amylase inhibitor genes from wild emmer wheat, primarily from Israel and Golan

    PubMed Central

    2010-01-01

    Background Various enzyme inhibitors act on key insect gut digestive hydrolases, including alpha-amylases and proteinases. Alpha-amylase inhibitors have been widely investigated for their possible use in strengthening a plant's defense against insects that are highly dependent on starch as an energy source. We attempted to unravel the diversity of monomeric alpha-amylase inhibitor genes of Israeli and Golan Heights' wild emmer wheat with different ecological factors (e.g., geography, water, and temperature). Population methods that analyze the nature and frequency of allele diversity within a species and the codon analysis method (comparing patterns of synonymous and non-synonymous changes in protein coding sequences) were used to detect natural selection. Results Three hundred and forty-eight sequences encoding monomeric alpha-amylase inhibitors (WMAI) were obtained from 14 populations of wild emmer wheat. The frequency of SNPs in WMAI genes was 1 out of 16.3 bases, where 28 SNPs were detected in the coding sequence. The results of purifying and the positive selection hypothesis (p < 0.05) showed that the sequences of WMAI were contributed by both natural selection and co-evolution, which ensured conservation of protein function and inhibition against diverse insect amylases. The majority of amino acid substitutions occurred at the C-terminal (positive selection domain), which ensured the stability of WMAI. SNPs in this gene could be classified into several categories associated with water, temperature, and geographic factors, respectively. Conclusions Great diversity at the WMAI locus, both between and within populations, was detected in the populations of wild emmer wheat. It was revealed that WMAI were naturally selected for across populations by a ratio of dN/dS as expected. Ecological factors, singly or in combination, explained a significant proportion of the variations in the SNPs. A sharp genetic divergence over very short geographic distances compared to a small genetic divergence between large geographic distances also suggested that the SNPs were subjected to natural selection, and ecological factors had an important evolutionary role in polymorphisms at this locus. According to population and codon analysis, these results suggested that monomeric alpha-amylase inhibitors are adaptively selected under different environmental conditions. PMID:20534122

  2. Analysis of the genome-wide variations among multiple strains of the plant pathogenic bacterium Xylella fastidiosa

    PubMed Central

    Doddapaneni, Harshavardhan; Yao, Jiqiang; Lin, Hong; Walker, M Andrew; Civerolo, Edwin L

    2006-01-01

    Background The Gram-negative, xylem-limited phytopathogenic bacterium Xylella fastidiosa is responsible for causing economically important diseases in grapevine, citrus and many other plant species. Despite its economic impact, relatively little is known about the genomic variations among strains isolated from different hosts and their influence on the population genetics of this pathogen. With the availability of genome sequence information for four strains, it is now possible to perform genome-wide analyses to identify and categorize such DNA variations and to understand their influence on strain functional divergence. Results There are 1,579 genes and 194 non-coding homologous sequences present in the genomes of all four strains, representing a 76. 2% conservation of the sequenced genome. About 60% of the X. fastidiosa unique sequences exist as tandem gene clusters of 6 or more genes. Multiple alignments identified 12,754 SNPs and 14,449 INDELs in the 1528 common genes and 20,779 SNPs and 10,075 INDELs in the 194 non-coding sequences. The average SNP frequency was 1.08 × 10-2 per base pair of DNA and the average INDEL frequency was 2.06 × 10-2 per base pair of DNA. On an average, 60.33% of the SNPs were synonymous type while 39.67% were non-synonymous type. The mutation frequency, primarily in the form of external INDELs was the main type of sequence variation. The relative similarity between the strains was discussed according to the INDEL and SNP differences. The number of genes unique to each strain were 60 (9a5c), 54 (Dixon), 83 (Ann1) and 9 (Temecula-1). A sub-set of the strain specific genes showed significant differences in terms of their codon usage and GC composition from the native genes suggesting their xenologous origin. Tandem repeat analysis of the genomic sequences of the four strains identified associations of repeat sequences with hypothetical and phage related functions. Conclusion INDELs and strain specific genes have been identified as the main source of variations among strains, with individual strains showing different rates of genome evolution. Based on these genome comparisons, it appears that the Pierce's disease strain Temecula-1 genome represents the ancestral genome of the X. fastidiosa. Results of this analysis are publicly available in the form of a web database. PMID:16948851

  3. Single nucleotide polymorphism analysis of Korean native chickens using next generation sequencing data.

    PubMed

    Seo, Dong-Won; Oh, Jae-Don; Jin, Shil; Song, Ki-Duk; Park, Hee-Bok; Heo, Kang-Nyeong; Shin, Younhee; Jung, Myunghee; Park, Junhyung; Jo, Cheorun; Lee, Hak-Kyo; Lee, Jun-Heon

    2015-02-01

    There are five native chicken lines in Korea, which are mainly classified by plumage colors (black, white, red, yellow, gray). These five lines are very important genetic resources in the Korean poultry industry. Based on a next generation sequencing technology, whole genome sequence and reference assemblies were performed using Gallus_gallus_4.0 (NCBI) with whole genome sequences from these lines to identify common and novel single nucleotide polymorphisms (SNPs). We obtained 36,660,731,136 ± 1,257,159,120 bp of raw sequence and average 26.6-fold of 25-29 billion reference assembly sequences representing 97.288 % coverage. Also, 4,006,068 ± 97,534 SNPs were observed from 29 autosomes and the Z chromosome and, of these, 752,309 SNPs are the common SNPs across lines. Among the identified SNPs, the number of novel- and known-location assigned SNPs was 1,047,951 ± 14,956 and 2,948,648 ± 81,414, respectively. The number of unassigned known SNPs was 1,181 ± 150 and unassigned novel SNPs was 8,238 ± 1,019. Synonymous SNPs, non-synonymous SNPs, and SNPs having character changes were 26,266 ± 1,456, 11,467 ± 604, 8,180 ± 458, respectively. Overall, 443,048 ± 26,389 SNPs in each bird were identified by comparing with dbSNP in NCBI. The presently obtained genome sequence and SNP information in Korean native chickens have wide applications for further genome studies such as genetic diversity studies to detect causative mutations for economic and disease related traits.

  4. A second generation human haplotype map of over 3.1 million SNPs.

    PubMed

    Frazer, Kelly A; Ballinger, Dennis G; Cox, David R; Hinds, David A; Stuve, Laura L; Gibbs, Richard A; Belmont, John W; Boudreau, Andrew; Hardenbol, Paul; Leal, Suzanne M; Pasternak, Shiran; Wheeler, David A; Willis, Thomas D; Yu, Fuli; Yang, Huanming; Zeng, Changqing; Gao, Yang; Hu, Haoran; Hu, Weitao; Li, Chaohua; Lin, Wei; Liu, Siqi; Pan, Hao; Tang, Xiaoli; Wang, Jian; Wang, Wei; Yu, Jun; Zhang, Bo; Zhang, Qingrun; Zhao, Hongbin; Zhao, Hui; Zhou, Jun; Gabriel, Stacey B; Barry, Rachel; Blumenstiel, Brendan; Camargo, Amy; Defelice, Matthew; Faggart, Maura; Goyette, Mary; Gupta, Supriya; Moore, Jamie; Nguyen, Huy; Onofrio, Robert C; Parkin, Melissa; Roy, Jessica; Stahl, Erich; Winchester, Ellen; Ziaugra, Liuda; Altshuler, David; Shen, Yan; Yao, Zhijian; Huang, Wei; Chu, Xun; He, Yungang; Jin, Li; Liu, Yangfan; Shen, Yayun; Sun, Weiwei; Wang, Haifeng; Wang, Yi; Wang, Ying; Xiong, Xiaoyan; Xu, Liang; Waye, Mary M Y; Tsui, Stephen K W; Xue, Hong; Wong, J Tze-Fei; Galver, Luana M; Fan, Jian-Bing; Gunderson, Kevin; Murray, Sarah S; Oliphant, Arnold R; Chee, Mark S; Montpetit, Alexandre; Chagnon, Fanny; Ferretti, Vincent; Leboeuf, Martin; Olivier, Jean-François; Phillips, Michael S; Roumy, Stéphanie; Sallée, Clémentine; Verner, Andrei; Hudson, Thomas J; Kwok, Pui-Yan; Cai, Dongmei; Koboldt, Daniel C; Miller, Raymond D; Pawlikowska, Ludmila; Taillon-Miller, Patricia; Xiao, Ming; Tsui, Lap-Chee; Mak, William; Song, You Qiang; Tam, Paul K H; Nakamura, Yusuke; Kawaguchi, Takahisa; Kitamoto, Takuya; Morizono, Takashi; Nagashima, Atsushi; Ohnishi, Yozo; Sekine, Akihiro; Tanaka, Toshihiro; Tsunoda, Tatsuhiko; Deloukas, Panos; Bird, Christine P; Delgado, Marcos; Dermitzakis, Emmanouil T; Gwilliam, Rhian; Hunt, Sarah; Morrison, Jonathan; Powell, Don; Stranger, Barbara E; Whittaker, Pamela; Bentley, David R; Daly, Mark J; de Bakker, Paul I W; Barrett, Jeff; Chretien, Yves R; Maller, Julian; McCarroll, Steve; Patterson, Nick; Pe'er, Itsik; Price, Alkes; Purcell, Shaun; Richter, Daniel J; Sabeti, Pardis; Saxena, Richa; Schaffner, Stephen F; Sham, Pak C; Varilly, Patrick; Altshuler, David; Stein, Lincoln D; Krishnan, Lalitha; Smith, Albert Vernon; Tello-Ruiz, Marcela K; Thorisson, Gudmundur A; Chakravarti, Aravinda; Chen, Peter E; Cutler, David J; Kashuk, Carl S; Lin, Shin; Abecasis, Gonçalo R; Guan, Weihua; Li, Yun; Munro, Heather M; Qin, Zhaohui Steve; Thomas, Daryl J; McVean, Gilean; Auton, Adam; Bottolo, Leonardo; Cardin, Niall; Eyheramendy, Susana; Freeman, Colin; Marchini, Jonathan; Myers, Simon; Spencer, Chris; Stephens, Matthew; Donnelly, Peter; Cardon, Lon R; Clarke, Geraldine; Evans, David M; Morris, Andrew P; Weir, Bruce S; Tsunoda, Tatsuhiko; Mullikin, James C; Sherry, Stephen T; Feolo, Michael; Skol, Andrew; Zhang, Houcan; Zeng, Changqing; Zhao, Hui; Matsuda, Ichiro; Fukushima, Yoshimitsu; Macer, Darryl R; Suda, Eiko; Rotimi, Charles N; Adebamowo, Clement A; Ajayi, Ike; Aniagwu, Toyin; Marshall, Patricia A; Nkwodimmah, Chibuzor; Royal, Charmaine D M; Leppert, Mark F; Dixon, Missy; Peiffer, Andy; Qiu, Renzong; Kent, Alastair; Kato, Kazuto; Niikawa, Norio; Adewole, Isaac F; Knoppers, Bartha M; Foster, Morris W; Clayton, Ellen Wright; Watkin, Jessica; Gibbs, Richard A; Belmont, John W; Muzny, Donna; Nazareth, Lynne; Sodergren, Erica; Weinstock, George M; Wheeler, David A; Yakub, Imtaz; Gabriel, Stacey B; Onofrio, Robert C; Richter, Daniel J; Ziaugra, Liuda; Birren, Bruce W; Daly, Mark J; Altshuler, David; Wilson, Richard K; Fulton, Lucinda L; Rogers, Jane; Burton, John; Carter, Nigel P; Clee, Christopher M; Griffiths, Mark; Jones, Matthew C; McLay, Kirsten; Plumb, Robert W; Ross, Mark T; Sims, Sarah K; Willey, David L; Chen, Zhu; Han, Hua; Kang, Le; Godbout, Martin; Wallenburg, John C; L'Archevêque, Paul; Bellemare, Guy; Saeki, Koji; Wang, Hongguang; An, Daochang; Fu, Hongbo; Li, Qing; Wang, Zhen; Wang, Renwu; Holden, Arthur L; Brooks, Lisa D; McEwen, Jean E; Guyer, Mark S; Wang, Vivian Ota; Peterson, Jane L; Shi, Michael; Spiegel, Jack; Sung, Lawrence M; Zacharia, Lynn F; Collins, Francis S; Kennedy, Karen; Jamieson, Ruth; Stewart, John

    2007-10-18

    We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.

  5. GESPA: classifying nsSNPs to predict disease association.

    PubMed

    Khurana, Jay K; Reeder, Jay E; Shrimpton, Antony E; Thakar, Juilee

    2015-07-25

    Non-synonymous single nucleotide polymorphisms (nsSNPs) are the most common DNA sequence variation associated with disease in humans. Thus determining the clinical significance of each nsSNP is of great importance. Potential detrimental nsSNPs may be identified by genetic association studies or by functional analysis in the laboratory, both of which are expensive and time consuming. Existing computational methods lack accuracy and features to facilitate nsSNP classification for clinical use. We developed the GESPA (GEnomic Single nucleotide Polymorphism Analyzer) program to predict the pathogenicity and disease phenotype of nsSNPs. GESPA is a user-friendly software package for classifying disease association of nsSNPs. It allows flexibility in acceptable input formats and predicts the pathogenicity of a given nsSNP by assessing the conservation of amino acids in orthologs and paralogs and supplementing this information with data from medical literature. The development and testing of GESPA was performed using the humsavar, ClinVar and humvar datasets. Additionally, GESPA also predicts the disease phenotype associated with a nsSNP with high accuracy, a feature unavailable in existing software. GESPA's overall accuracy exceeds existing computational methods for predicting nsSNP pathogenicity. The usability of GESPA is enhanced by fast SQL-based cloud storage and retrieval of data. GESPA is a novel bioinformatics tool to determine the pathogenicity and phenotypes of nsSNPs. We anticipate that GESPA will become a useful clinical framework for predicting the disease association of nsSNPs. The program, executable jar file, source code, GPL 3.0 license, user guide, and test data with instructions are available at http://sourceforge.net/projects/gespa.

  6. Candidate chromosome 1 disease susceptibility genes for Sjogren’s syndrome xerostomia are narrowed by novel NOD.B10 congenic mice

    PubMed Central

    Mongini, Patricia K. A.; Kramer, Jill M.; Ishikawa, Tomo-o; Herschman, Harvey; Esposito, Donna

    2014-01-01

    Sjogren’s syndrome (SS) is characterized by salivary gland leukocytic infiltrates and impaired salivation (xerostomia). Cox-2 (Ptgs2) is located on chromosome 1 within the span of the Aec2 region. In an attempt to demonstrate that COX-2 drives antibody-dependent hyposalivation, NOD.B10 congenic mice bearing a Cox-2flox gene were generated. A congenic line with non-NOD alleles in Cox-2-flanking genes failed manifest xerostomia. Further backcrossing yielded disease-susceptible NOD.B10 Cox-2flox lines; fine genetic mapping determined that critical Aec2 genes lie within a 1.56 to 2.17 Mb span of DNA downstream of Cox-2. Bioinformatics analysis revealed that susceptible and non-susceptible lines exhibit non-synonymous coding SNPs in 8 protein-encoding genes of this region, thereby better delineating candidate Aec2 alleles needed for SS xerostomia. PMID:24685748

  7. SNP Discovery in the Transcriptome of White Pacific Shrimp Litopenaeus vannamei by Next Generation Sequencing

    PubMed Central

    Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

    2014-01-01

    The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047

  8. Fine Mapping for Weaver Syndrome in Brown Swiss Cattle and the Identification of 41 Concordant Mutations across NRCAM, PNPLA8 and CTTNBP2

    PubMed Central

    McClure, Matthew; Kim, Euisoo; Bickhart, Derek; Null, Daniel; Cooper, Tabatha; Cole, John; Wiggans, George; Ajmone-Marsan, Paolo; Colli, Licia; Santus, Enrico; Liu, George E.; Schroeder, Steve; Matukumalli, Lakshmi; Van Tassell, Curt; Sonstegard, Tad

    2013-01-01

    Bovine Progressive Degenerative Myeloencephalopathy (Weaver Syndrome) is a recessive neurological disease that has been observed in the Brown Swiss cattle breed since the 1970’s in North America and Europe. Bilateral hind leg weakness and ataxia appear in afflicted animals at 6 to 18 months of age, and slowly progresses to total loss of hind limb control by 3 to 4 years of age. While Weaver has previously been mapped to Bos taurus autosome (BTA) 4∶46–56 Mb and a diagnostic test based on the 6 microsatellite (MS) markers is commercially available, neither the causative gene nor mutation has been identified; therefore misdiagnosis can occur due to recombination between the diagnostic MS markers and the causative mutation. Analysis of 34,980 BTA 4 SNPs genotypes derived from the Illumina BovineHD assay for 20 Brown Swiss Weaver carriers and 49 homozygous normal bulls refined the Weaver locus to 48–53 Mb. Genotyping of 153 SNPs, identified from whole genome sequencing of 10 normal and 10 carrier animals, across a validation set of 841 animals resulted in the identification of 41 diagnostic SNPs that were concordant with the disease. Except for one intergenic SNP all are associated with genes expressed in nervous tissues: 37 distal to NRCAM, one non-synonymous (serine to asparagine) in PNPLA8, one synonymous and one non-synonymous (lysine to glutamic acid) in CTTNBP2. Haplotype and imputation analyses of 7,458 Brown Swiss animals with Illumina BovineSNP50 data and the 41 diagnostic SNPs resulted in the identification of only one haplotype concordant with the Weaver phenotype. Use of this haplotype and the diagnostic SNPs more accurately identifies Weaver carriers in both Brown Swiss purebred and influenced herds. PMID:23527149

  9. Patterns of population differentiation of candidate genes for cardiovascular disease.

    PubMed

    Kullo, Iftikhar J; Ding, Keyue

    2007-07-12

    The basis for ethnic differences in cardiovascular disease (CVD) susceptibility is not fully understood. We investigated patterns of population differentiation (FST) of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI), Utah residents with European ancestry (CEU), and Han Chinese (CHB) + Japanese (JPT). We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism). Genotype data were obtained from the HapMap database. We calculated FST for 15,559 common SNPs (minor allele frequency > or = 0.10 in at least one population) in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions) or non-functional (intronic and synonymous sites). Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. These results suggest a possible basis for varying susceptibility to CVD among ethnic groups.

  10. In Silico Analysis of Single Nucleotide Polymorphism (SNPs) in Human β-Globin Gene

    PubMed Central

    Alanazi, Mohammed; Abduljaleel, Zainularifeen; Khan, Wajahatullah; Warsy, Arjumand S.; Elrobh, Mohamed; Khan, Zahid; Amri, Abdullah Al; Bazzi, Mohammad D.

    2011-01-01

    Single amino acid substitutions in the globin chain are the most common forms of genetic variations that produce hemoglobinopathies- the most widespread inherited disorders worldwide. Several hemoglobinopathies result from homozygosity or compound heterozygosity to beta-globin (HBB) gene mutations, such as that producing sickle cell hemoglobin (HbS), HbC, HbD and HbE. Several of these mutations are deleterious and result in moderate to severe hemolytic anemia, with associated complications, requiring lifelong care and management. Even though many hemoglobinopathies result from single amino acid changes producing similar structural abnormalities, there are functional differences in the generated variants. Using in silico methods, we examined the genetic variations that can alter the expression and function of the HBB gene. Using a sequence homology-based Sorting Intolerant from Tolerant (SIFT) server we have searched for the SNPs, which showed that 200 (80%) non-synonymous polymorphism were found to be deleterious. The structure-based method via PolyPhen server indicated that 135 (40%) non-synonymous polymorphism may modify protein function and structure. The Pupa Suite software showed that the SNPs will have a phenotypic consequence on the structure and function of the altered protein. Structure analysis was performed on the key mutations that occur in the native protein coded by the HBB gene that causes hemoglobinopathies such as: HbC (E→K), HbD (E→Q), HbE (E→K) and HbS (E→V). Atomic Non-Local Environment Assessment (ANOLEA), Yet Another Scientific Artificial Reality Application (YASARA), CHARMM-GUI webserver for macromolecular dynamics and mechanics, and Normal Mode Analysis, Deformation and Refinement (NOMAD-Ref) of Gromacs server were used to perform molecular dynamics simulations and energy minimization calculations on β-Chain residue of the HBB gene before and after mutation. Furthermore, in the native and altered protein models, amino acid residues were determined and secondary structures were observed for solvent accessibility to confirm the protein stability. The functional study in this investigation may be a good model for additional future studies. PMID:22028795

  11. Functional effects of single nucleotide polymorphisms in the coding region of human N-acetyltransferase 1

    PubMed Central

    Zhu, Yuanqi; Hein, David W.

    2007-01-01

    Genetic variants of human N-acetyltransferase 1 (NAT1) are associated with cancer and birth defects. N- and O-acetyltransferase catalytic activities, Michaelis-Menten kinetic constants (Km & Vmax), and steady state expression levels of NAT1-specific mRNA and protein were determined for the reference NAT1*4 and variant human NAT1 haplotypes possessing single nucleotide polymorphisms (SNPs) in the open reading frame. Although none of the SNPs caused a significant effect on steady state levels of NAT1-specific mRNA, C97T(R33stop), C190T(R64W), C559T (R187stop) and A752T(D251V) each reduced NAT1 protein level and/or N- and O-acetyltransferase catalytic activities to levels below detection. G560A(R187Q) substantially reduced NAT1 protein level and catalytic activities and increased substrate Km. The G445A(V149I), G459A(synonymous) and T640G(S214A) haplotype present in NAT1*11 significantly (p<0.05) increased NAT1 protein level and catalytic activity. Neither T21G(synonymous), T402C(synonymous), A613G(M205V), T777C(synonymous), G781A(E261K), or A787G(I263V) significantly affected Km, catalytic activity, mRNA or protein level. These results suggest heterogeneity among slow NAT1 acetylator phenotypes. PMID:17909564

  12. Analysis of consequences of non-synonymous SNP in feed conversion ratio associated TGF-β receptor type 3 gene in chicken.

    PubMed

    Rasal, Kiran D; Shah, Tejas M; Vaidya, Megha; Jakhesara, Subhash J; Joshi, Chaitanya G

    2015-06-01

    The recent advances in high throughput sequencing technology accelerate possible ways for the study of genome wide variation in several organisms and associated consequences. In the present study, mutations in TGFBR3 showing significant association with FCR trait in chicken during exome sequencing were further analyzed. Out of four SNPs, one nsSNP p.Val451Leu was found in the coding region of TGFBR3. In silico tools such as SnpSift and PANTHER predicted it as deleterious (0.04) and to be tolerated, respectively, while I-Mutant revealed that protein stability decreased. The TGFBR3 I-TASSER model has a C-score of 0.85, which was validated using PROCHECK. Based on MD simulation, mutant protein structure deviated from native with RMSD 0.08 Å due to change in the H-bonding distances of mutant residue. The docking of TGFBR3 with interacting TGFBR2 inferred that mutant required more global energy. Therefore, the present study will provide useful information about functional SNPs that have an impact on FCR traits.

  13. Patterns of population differentiation of candidate genes for cardiovascular disease

    PubMed Central

    Kullo, Iftikhar J; Ding, Keyue

    2007-01-01

    Background The basis for ethnic differences in cardiovascular disease (CVD) susceptibility is not fully understood. We investigated patterns of population differentiation (FST) of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI), Utah residents with European ancestry (CEU), and Han Chinese (CHB) + Japanese (JPT). We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism). Genotype data were obtained from the HapMap database. Results We calculated FST for 15,559 common SNPs (minor allele frequency ≥ 0.10 in at least one population) in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions) or non-functional (intronic and synonymous sites). Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. Conclusion These results suggest a possible basis for varying susceptibility to CVD among ethnic groups. PMID:17626638

  14. Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity

    PubMed Central

    Shabalina, Svetlana A.; Spiridonov, Nikolay A.; Kashina, Anna

    2013-01-01

    Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions. PMID:23293005

  15. Effects of GWAS-Associated Genetic Variants on lncRNAs within IBD and T1D Candidate Loci

    PubMed Central

    Brorsson, Caroline A.; Pociot, Flemming

    2014-01-01

    Long non-coding RNAs are a new class of non-coding RNAs that are at the crosshairs in many human diseases such as cancers, cardiovascular disorders, inflammatory and autoimmune disease like Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/−5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable information that could be potentially useful for future structure-function studies on lncRNAs. PMID:25144376

  16. DoGSD: the dog and wolf genome SNP database.

    PubMed

    Bai, Bing; Zhao, Wen-Ming; Tang, Bi-Xia; Wang, Yan-Qing; Wang, Lu; Zhang, Zhang; Yang, He-Chuan; Liu, Yan-Hu; Zhu, Jun-Wei; Irwin, David M; Wang, Guo-Dong; Zhang, Ya-Ping

    2015-01-01

    The rapid advancement of next-generation sequencing technology has generated a deluge of genomic data from domesticated dogs and their wild ancestor, grey wolves, which have simultaneously broadened our understanding of domestication and diseases that are shared by humans and dogs. To address the scarcity of single nucleotide polymorphism (SNP) data provided by authorized databases and to make SNP data more easily/friendly usable and available, we propose DoGSD (http://dogsd.big.ac.cn), the first canidae-specific database which focuses on whole genome SNP data from domesticated dogs and grey wolves. The DoGSD is a web-based, open-access resource comprising ∼ 19 million high-quality whole-genome SNPs. In addition to the dbSNP data set (build 139), DoGSD incorporates a comprehensive collection of SNPs from two newly sequenced samples (1 wolf and 1 dog) and collected SNPs from three latest dog/wolf genetic studies (7 wolves and 68 dogs), which were taken together for analysis with the population genetic statistics, Fst. In addition, DoGSD integrates some closely related information including SNP annotation, summary lists of SNPs located in genes, synonymous and non-synonymous SNPs, sampling location and breed information. All these features make DoGSD a useful resource for in-depth analysis in dog-/wolf-related studies. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Effect of two non-synonymous ecto-5'-nucleotidase variants on the genetic architecture of inosine 5'-monophosphate (IMP) and its degradation products in Japanese Black beef.

    PubMed

    Uemoto, Yoshinobu; Ohtake, Tsuyoshi; Sasago, Nanae; Takeda, Masayuki; Abe, Tsuyoshi; Sakuma, Hironori; Kojima, Takatoshi; Sasaki, Shinji

    2017-11-13

    Umami is a Japanese term for the fifth basic taste and is an important sensory property of beef palatability. Inosine 5'-monophosphate (IMP) contributes to umami taste in beef. Thus, the overall change in concentration of IMP and its degradation products can potentially affect the beef palatability. In this study, we investigated the genetic architecture of IMP and its degradation products in Japanese Black beef. First, we performed genome-wide association study (GWAS), candidate gene analysis, and functional analysis to detect the causal variants that affect IMP, inosine, and hypoxanthine. Second, we evaluated the allele frequencies in the different breeds, the contribution of genetic variance, and the effect on other economical traits using the detected variants. A total of 574 Japanese Black cattle were genotyped using the Illumina BovineSNP50 BeadChip and were then used for GWAS. The results of GWAS showed that the genome-wide significant single nucleotide polymorphisms (SNPs) on BTA9 were detected for IMP, inosine, and hypoxanthine. The ecto-5'-nucleotidase (NT5E) gene, which encodes the enzyme NT5E for the extracellular degradation of IMP to inosine, was located near the significant region on BTA9. The results of candidate gene analysis and functional analysis showed that two non-synonymous SNPs (c.1318C > T and c.1475 T > A) in NT5E affected the amount of IMP and its degradation products in beef by regulating the enzymatic activity of NT5E. The Q haplotype showed a positive effect on IMP and a negative effect on the enzymatic activity of NT5E in IMP degradation. The two SNPs were under perfect linkage disequilibrium in five different breeds, and different haplotype frequencies were seen among breeds. The two SNPs contribute to about half of the total genetic variance in IMP, and the results of genetic relationship between IMP and its degradation products showed that NT5E affected the overall concentration balance of IMP and its degradation products. In addition, the SNPs in NT5E did not have an unfavorable effect on the other economical traits. Based on all the above findings taken together, two non-synonymous SNPs in NT5E would be useful for improving IMP and its degradation products by marker-assisted selection in Japanese Black cattle.

  18. A complete mitochondrial genome of wheat (Triticum aestivum cv. Chinese Yumai), and fast evolving mitochondrial genes in higher plants.

    PubMed

    Cui, Peng; Liu, Huitao; Lin, Qiang; Ding, Feng; Zhuo, Guoyin; Hu, Songnian; Liu, Dongcheng; Yang, Wenlong; Zhan, Kehui; Zhang, Aimin; Yu, Jun

    2009-12-01

    Plant mitochondrial genomes, encoding necessary proteins involved in the system of energy production, play an important role in the development and reproduction of the plant. They occupy a specific evolutionary pattern relative to their nuclear counterparts. Here, we determined the winter wheat (Triticum aestivum cv. Chinese Yumai) mitochondrial genome in a length of 452 and 526 bp by shotgun sequencing its BAC library. It contains 202 genes, including 35 known protein-coding genes, three rRNA and 17 tRNA genes, as well as 149 open reading frames (ORFs; greater than 300 bp in length). The sequence is almost identical to the previously reported sequence of the spring wheat (T. aestivum cv. Chinese Spring); we only identified seven SNPs (three transitions and four transversions) and 10 indels (insertions and deletions) between the two independently acquired sequences, and all variations were found in non-coding regions. This result confirmed the accuracy of the previously reported mitochondrial sequence of the Chinese Spring wheat. The nucleotide frequency and codon usage of wheat are common among the lineage of higher plant with a high AT-content of 58%. Molecular evolutionary analysis demonstrated that plant mitochondrial genomes evolved at different rates, which may correlate with substantial variations in metabolic rate and generation time among plant lineages. In addition, through the estimation of the ratio of non-synonymous to synonymous substitution rates between orthologous mitochondrion-encoded genes of higher plants, we found an accelerated evolutionary rate that seems to be the result of relaxed selection.

  19. Multiple origins of resistance-conferring mutations in Plasmodium vivax dihydrofolate reductase

    PubMed Central

    Hawkins, Vivian N; Auliff, Alyson; Prajapati, Surendra Kumar; Rungsihirunrat, Kanchana; Hapuarachchi, Hapuarachchige C; Maestre, Amanda; O'Neil, Michael T; Cheng, Qin; Joshi, Hema; Na-Bangchang, Kesara; Sibley, Carol Hopkins

    2008-01-01

    Background In order to maximize the useful therapeutic life of antimalarial drugs, it is crucial to understand the mechanisms by which parasites resistant to antimalarial drugs are selected and spread in natural populations. Recent work has demonstrated that pyrimethamine-resistance conferring mutations in Plasmodium falciparum dihydrofolate reductase (dhfr) have arisen rarely de novo, but spread widely in Asia and Africa. The origin and spread of mutations in Plasmodium vivax dhfr were assessed by constructing haplotypes based on sequencing dhfr and its flanking regions. Methods The P. vivax dhfr coding region, 792 bp upstream and 683 bp downstream were amplified and sequenced from 137 contemporary patient isolates from Colombia, India, Indonesia, Papua New Guinea, Sri Lanka, Thailand, and Vanuatu. A repeat motif located 2.6 kb upstream of dhfr was also sequenced from 75 of 137 patient isolates, and mutational relationships among the haplotypes were visualized using the programme Network. Results Synonymous and non-synonymous single nucleotide polymorphisms (SNPs) within the dhfr coding region were identified, as was the well-documented in-frame insertion/deletion (indel). SNPs were also identified upstream and downstream of dhfr, with an indel and a highly polymorphic repeat region identified upstream of dhfr. The regions flanking dhfr were highly variable. The double mutant (58R/117N) dhfr allele has evolved from several origins, because the 58R is encoded by at least 3 different codons. The triple (58R/61M/117T) and quadruple (57L/61M/117T/173F, 57I/58R/61M/117T and 57L/58R/61M/117T) mutant alleles had at least three independent origins in Thailand, Indonesia, and Papua New Guinea/Vanuatu. Conclusion It was found that the P. vivax dhfr coding region and its flanking intergenic regions are highly polymorphic and that mutations in P. vivax dhfr that confer antifolate resistance have arisen several times in the Asian region. This contrasts sharply with the selective sweep of rare antifolate resistant alleles observed in the P. falciparum populations in Asia and Africa. The finding of multiple origins of resistance-conferring mutations has important implications for drug policy. PMID:18442404

  20. Multiple origins of resistance-conferring mutations in Plasmodium vivax dihydrofolate reductase.

    PubMed

    Hawkins, Vivian N; Auliff, Alyson; Prajapati, Surendra Kumar; Rungsihirunrat, Kanchana; Hapuarachchi, Hapuarachchige C; Maestre, Amanda; O'Neil, Michael T; Cheng, Qin; Joshi, Hema; Na-Bangchang, Kesara; Sibley, Carol Hopkins

    2008-04-28

    In order to maximize the useful therapeutic life of antimalarial drugs, it is crucial to understand the mechanisms by which parasites resistant to antimalarial drugs are selected and spread in natural populations. Recent work has demonstrated that pyrimethamine-resistance conferring mutations in Plasmodium falciparum dihydrofolate reductase (dhfr) have arisen rarely de novo, but spread widely in Asia and Africa. The origin and spread of mutations in Plasmodium vivax dhfr were assessed by constructing haplotypes based on sequencing dhfr and its flanking regions. The P. vivax dhfr coding region, 792 bp upstream and 683 bp downstream were amplified and sequenced from 137 contemporary patient isolates from Colombia, India, Indonesia, Papua New Guinea, Sri Lanka, Thailand, and Vanuatu. A repeat motif located 2.6 kb upstream of dhfr was also sequenced from 75 of 137 patient isolates, and mutational relationships among the haplotypes were visualized using the programme Network. Synonymous and non-synonymous single nucleotide polymorphisms (SNPs) within the dhfr coding region were identified, as was the well-documented in-frame insertion/deletion (indel). SNPs were also identified upstream and downstream of dhfr, with an indel and a highly polymorphic repeat region identified upstream of dhfr. The regions flanking dhfr were highly variable. The double mutant (58R/117N) dhfr allele has evolved from several origins, because the 58R is encoded by at least 3 different codons. The triple (58R/61M/117T) and quadruple (57L/61M/117T/173F, 57I/58R/61M/117T and 57L/58R/61M/117T) mutant alleles had at least three independent origins in Thailand, Indonesia, and Papua New Guinea/Vanuatu. It was found that the P. vivax dhfr coding region and its flanking intergenic regions are highly polymorphic and that mutations in P. vivax dhfr that confer antifolate resistance have arisen several times in the Asian region. This contrasts sharply with the selective sweep of rare antifolate resistant alleles observed in the P. falciparum populations in Asia and Africa. The finding of multiple origins of resistance-conferring mutations has important implications for drug policy.

  1. Coding variants in NOD-like receptors: An association study on risk and survival of colorectal cancer.

    PubMed

    Huhn, Stefanie; da Silva Filho, Miguel I; Sanmuganantham, Tharmila; Pichulik, Tica; Catalano, Calogerina; Pardini, Barbara; Naccarati, Alessio; Polakova-Vymetálkova, Veronika; Jiraskova, Katerina; Vodickova, Ludmila; Vodicka, Pavel; Löffler, Markus W; Courth, Lioba; Wehkamp, Jan; Din, Farhat V N; Timofeeva, Maria; Farrington, Susan M; Jansen, Lina; Hemminki, Kari; Chang-Claude, Jenny; Brenner, Hermann; Hoffmeister, Michael; Dunlop, Malcolm G; Weber, Alexander N R; Försti, Asta

    2018-01-01

    Nod-like receptors (NLRs) are important innate pattern recognition receptors and regulators of inflammation or play a role during development. We systematically analysed 41 non-synonymous single nucleotide polymorphisms (SNPs) in 21 NLR genes in a Czech discovery cohort of sporadic colorectal cancer (CRC) (1237 cases, 787 controls) for their association with CRC risk and survival. Five SNPs were found to be associated with CRC risk and eight with survival at 5% significance level. In a replication analysis using data of two large genome-wide association studies (GWASs) from Germany (DACHS: 1798 cases and 1810 controls) and Scotland (2210 cases and 9350 controls) the associations found in the Czech discovery set were not confirmed. However, expression analysis in human gut-related tissues and immune cells revealed that the NLRs associated with CRC risk or survival in the discovery set were expressed in primary human colon or rectum cells, CRC tissue and/or cell lines, providing preliminary evidence for a potential involvement of NLRs in general in CRC development and/or progression. Most interesting was the finding that the enigmatic development-related NLRP5 (also known as MATER) was not expressed in normal colon tissue but in colon cancer tissue and cell lines. Future studies may show whether regulatory variants instead of coding variants might affect the expression of NLRs and contribute to CRC risk and survival.

  2. Choline dehydrogenase polymorphism rs12676 is a functional variation and is associated with changes in human sperm cell function.

    PubMed

    Johnson, Amy R; Lao, Sai; Wang, Tongwen; Galanko, Joseph A; Zeisel, Steven H

    2012-01-01

    Approximately 15% of couples are affected by infertility and up to half of these cases arise from male factor infertility. Unidentified genetic aberrations such as chromosomal deletions, translocations and single nucleotide polymorphisms (SNPs) may be the underlying cause of many cases of idiopathic male infertility. Deletion of the choline dehydrogenase (Chdh) gene in mice results in decreased male fertility due to diminished sperm motility; sperm from Chdh(-/-) males have decreased ATP concentrations likely stemming from abnormal sperm mitochondrial morphology and function in these cells. Several SNPs have been identified in the human CHDH gene that may result in altered CHDH enzymatic activity. rs12676 (G233T), a non-synonymous SNP located in the CHDH coding region, is associated with increased susceptibility to dietary choline deficiency and risk of breast cancer. We now report evidence that this SNP is also associated with altered sperm motility patterns and dysmorphic mitochondrial structure in sperm. Sperm produced by men who are GT or TT for rs12676 have 40% and 73% lower ATP concentrations, respectively, in their sperm. rs12676 is associated with decreased CHDH protein in sperm and hepatocytes. A second SNP located in the coding region of IL17BR, rs1025689, is linked to altered sperm motility characteristics and changes in choline metabolite concentrations in sperm.

  3. Choline Dehydrogenase Polymorphism rs12676 Is a Functional Variation and Is Associated with Changes in Human Sperm Cell Function

    PubMed Central

    Johnson, Amy R.; Lao, Sai; Wang, Tongwen; Galanko, Joseph A.; Zeisel, Steven H.

    2012-01-01

    Approximately 15% of couples are affected by infertility and up to half of these cases arise from male factor infertility. Unidentified genetic aberrations such as chromosomal deletions, translocations and single nucleotide polymorphisms (SNPs) may be the underlying cause of many cases of idiopathic male infertility. Deletion of the choline dehydrogenase (Chdh) gene in mice results in decreased male fertility due to diminished sperm motility; sperm from Chdh−/− males have decreased ATP concentrations likely stemming from abnormal sperm mitochondrial morphology and function in these cells. Several SNPs have been identified in the human CHDH gene that may result in altered CHDH enzymatic activity. rs12676 (G233T), a non-synonymous SNP located in the CHDH coding region, is associated with increased susceptibility to dietary choline deficiency and risk of breast cancer. We now report evidence that this SNP is also associated with altered sperm motility patterns and dysmorphic mitochondrial structure in sperm. Sperm produced by men who are GT or TT for rs12676 have 40% and 73% lower ATP concentrations, respectively, in their sperm. rs12676 is associated with decreased CHDH protein in sperm and hepatocytes. A second SNP located in the coding region of IL17BR, rs1025689, is linked to altered sperm motility characteristics and changes in choline metabolite concentrations in sperm. PMID:22558321

  4. SNP rs403212791 in exon 2 of the MTNR1A gene is associated with reproductive seasonality in the Rasa aragonesa sheep breed.

    PubMed

    Calvo, J H; Serrano, M; Martinez-Royo, A; Lahoz, B; Sarto, P; Ibañez-Deler, A; Folch, J; Alabart, J L

    2018-06-01

    The aim of this study was to characterize and identify causative SNPs in the MTNR1A gene responsible for the reproductive seasonality traits in the Rasa aragonesa sheep breed. A total of 290 ewes (155, 84 and 51 mature, young and ewe lambs, respectively) from one flock were controlled from January to August. The following three reproductive seasonality traits were considered: the total days of anoestrus (TDA) and the progesterone cycling months (P4CM); both ovarian function seasonality traits based on blood progesterone levels; and the oestrus cycling months (OCM) based on oestrous detection, which indicate behavioural signs of oestrous. We have sequenced the total coding region plus 733 and 251 bp from the promoter and 3'-UTR regions, respectively, from the gene in 268 ewes. We found 9 and 4 SNPs associated with seasonality traits in the promoter (for TDA and P4CM) and exon 2 (for the three traits), respectively. The SNPs located in the gene promoter modify the putative binding sites for various trans-acting factors. In exon 2, two synonymous SNPs affect RFLP sites, rs406779174/RsaI (for the three traits) and rs430181568/MnlI (for OCM), and they have been related with seasonal reproductive activity in previous association studies with other breeds. SNP rs400830807, which is located in the 3'-UTR, was associated with the three traits, but this did not modify the putative target sites for ovine miRNAs according to in silico predictions. Finally, the SNP rs403212791 (NW_014639035.1: g.15099004G > A), which is also associated with the three seasonality phenotypes, was the most significant SNP detected in this study and was a non-synonymous polymorphism, leading a change from an Arginine to a Cysteine (R336C). Haplotype analyses confirmed the association results and showed that the effects found for the seasonality traits were caused by the SNPs located in exon 2. We have demonstrated that the T allele in the SNP rs403212791 in the MNTR1A gene is associated with a lower TDA and higher P4CM and OCM values in the Rasa Aragonesa breed. Copyright © 2018 Elsevier Inc. All rights reserved.

  5. Mitochondrial pathogenic mutations are population-specific.

    PubMed

    Breen, Michael S; Kondrashov, Fyodor A

    2010-12-31

    Surveying deleterious variation in human populations is crucial for our understanding, diagnosis and potential treatment of human genetic pathologies. A number of recent genome-wide analyses focused on the prevalence of segregating deleterious alleles in the nuclear genome. However, such studies have not been conducted for the mitochondrial genome. We present a systematic survey of polymorphisms in the human mitochondrial genome, including those predicted to be deleterious and those that correspond to known pathogenic mutations. Analyzing 4458 completely sequenced mitochondrial genomes we characterize the genetic diversity of different types of single nucleotide polymorphisms (SNPs) in African (L haplotypes) and non-African (M and N haplotypes) populations. We find that the overall level of polymorphism is higher in the mitochondrial compared to the nuclear genome, although the mitochondrial genome appears to be under stronger selection as indicated by proportionally fewer nonsynonymous than synonymous substitutions. The African mitochondrial genomes show higher heterozygosity, a greater number of polymorphic sites and higher frequencies of polymorphisms for synonymous, benign and damaging polymorphism than non-African genomes. However, African genomes carry significantly fewer SNPs that have been previously characterized as pathogenic compared to non-African genomes. Finding SNPs classified as pathogenic to be the only category of polymorphisms that are more abundant in non-African genomes is best explained by a systematic ascertainment bias that favours the discovery of pathogenic polymorphisms segregating in non-African populations. This further suggests that, contrary to the common disease-common variant hypothesis, pathogenic mutations are largely population-specific and different SNPs may be associated with the same disease in different populations. Therefore, to obtain a comprehensive picture of the deleterious variability in the human population, as well as to improve the diagnostics of individuals carrying African mitochondrial haplotypes, it is necessary to survey different populations independently. This article was reviewed by Dr Mikhail Gelfand, Dr Vasily Ramensky (nominated by Dr Eugene Koonin) and Dr David Rand (nominated by Dr Laurence Hurst).

  6. Genetic Diversity and Population Structure of Whitebark Pine (Pinus albicaulis Engelm.) in Western North America

    PubMed Central

    Liu, Jun-Jun; Sniezko, Richard; Murray, Michael; Wang, Ning; Chen, Hao; Zamany, Arezoo; Sturrock, Rona N.; Savin, Douglas; Kegley, Angelia

    2016-01-01

    Whitebark pine (WBP, Pinus albicaulis Engelm.) is an endangered conifer species due to heavy mortality from white pine blister rust (WPBR, caused by Cronartium ribicola) and mountain pine beetle (Dendroctonus ponderosae). Information about genetic diversity and population structure is of fundamental importance for its conservation and restoration. However, current knowledge on the genetic constitution and genomic variation is still limited for WBP. In this study, an integrated genomics approach was applied to characterize seed collections from WBP breeding programs in western North America. RNA-seq analysis was used for de novo assembly of the WBP needle transcriptome, which contains 97,447 protein-coding transcripts. Within the transcriptome, single nucleotide polymorphisms (SNPs) were discovered, and more than 22,000 of them were non-synonymous SNPs (ns-SNPs). Following the annotation of genes with ns-SNPs, 216 ns-SNPs within candidate genes with putative functions in disease resistance and plant defense were selected to design SNP arrays for high-throughput genotyping. Among these SNP loci, 71 were highly polymorphic, with sufficient variation to identify a unique genotype for each of the 371 individuals originating from British Columbia (Canada), Oregon and Washington (USA). A clear genetic differentiation was evident among seed families. Analyses of genetic spatial patterns revealed varying degrees of diversity and the existence of several genetic subgroups in the WBP breeding populations. Genetic components were associated with geographic variables and phenotypic rating of WPBR disease severity across landscapes, which may facilitate further identification of WBP genotypes and gene alleles contributing to local adaptation and quantitative resistance to WPBR. The WBP genomic resources developed here provide an invaluable tool for further studies and for exploitation and utilization of the genetic diversity preserved within this endangered conifer and other five-needle pines. PMID:27992468

  7. Variation in conserved non-coding sequences on chromosome 5q andsusceptibility to asthma and atopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Donfack, Joseph; Schneider, Daniel H.; Tan, Zheng

    2005-09-10

    Background: Evolutionarily conserved sequences likely havebiological function. Methods: To determine whether variation in conservedsequences in non-coding DNA contributes to risk for human disease, westudied six conserved non-coding elements in the Th2 cytokine cluster onhuman chromosome 5q31 in a large Hutterite pedigree and in samples ofoutbred European American and African American asthma cases and controls.Results: Among six conserved non-coding elements (>100 bp,>70percent identity; human-mouse comparison), we identified one singlenucleotide polymorphism (SNP) in each of two conserved elements and sixSNPs in the flanking regions of three conserved elements. We genotypedour samples for four of these SNPs and an additional three SNPs eachmore » inthe IL13 and IL4 genes. While there was only modest evidence forassociation with single SNPs in the Hutterite and European Americansamples (P<0.05), there were highly significant associations inEuropean Americans between asthma and haplotypes comprised of SNPs in theIL4 gene (P<0.001), including a SNP in a conserved non-codingelement. Furthermore, variation in the IL13 gene was strongly associatedwith total IgE (P = 0.00022) and allergic sensitization to mold allergens(P = 0.00076) in the Hutterites, and more modestly associated withsensitization to molds in the European Americans and African Americans (P<0.01). Conclusion: These results indicate that there is overalllittle variation in the conserved non-coding elements on 5q31, butvariation in IL4 and IL13, including possibly one SNP in a conservedelement, influence asthma and atopic phenotypes in diversepopulations.« less

  8. Allelic clustering and ancestry-dependent frequencies of rs6232, rs6234, and rs6235 PCSK1 SNPs in a Northern Ontario population sample.

    PubMed

    Sirois, Francine; Kaefer, Nadine; Currie, Krista A; Chrétien, Michel; Nkongolo, Kabwe K; Mbikay, Majambu

    2012-10-01

    The PCSK1 (proprotein convertase subtilisin/kexin type 1) locus encodes proprotein convertase 1/3, an endoprotease that converts prohormones and proneuropeptides to their active forms. Spontaneous loss-of-function mutations in the coding sequence of its gene have been linked to obesity in humans. Minor alleles of two common non-synonymous single-nucleotide polymorphisms (SNPs), rs6232 (T > C, N221D) and rs6235 (C > G, S690T), have been associated with increased risk of obesity in European populations. In this study, we compared the frequencies of the rs6232 and rs6234 (G > C, Q665E) SNPs in Aboriginal and Caucasian populations of Northern Ontario. The two SNPs were all relatively less frequent in Aboriginals: The minor allele frequency of the rs6232 SNP was 0.01 in Aboriginals and 0.08 in Caucasians (P < 4.10(-6)); for the rs6234 SNP, it was 0.20 and 0.32, respectively (P < 0.001). Resequencing revealed that the rs6234 SNP variation was tightly linked to that of the rs6235 SNP, as previously reported. Most interestingly, all carriers of the rs6232 SNP variation also carried the rs6234/rs6235 SNP clustered variations, but not the reverse, suggesting the former occurred later on an allele already carrying the latter. These data indicate that, in Northern Ontario Aboriginals, the triple-variant PCSK1 allele is relatively rare and might be of lesser significance for obesity risk in this population.

  9. The human pregnane X receptor: genomic structure and identification and functional characterization of natural allelic variants.

    PubMed

    Zhang, J; Kuehl, P; Green, E D; Touchman, J W; Watkins, P B; Daly, A; Hall, S D; Maurel, P; Relling, M; Brimer, C; Yasuda, K; Wrighton, S A; Hancock, M; Kim, R B; Strom, S; Thummel, K; Russell, C G; Hudson, J R; Schuetz, E G; Boguski, M S

    2001-10-01

    The pregnane X receptor (PXR)/steroid and xenobiotic receptor (SXR) transcriptionally activates cytochrome P4503A4 (CYP3A4) when ligand activated by endobiotics and xenobiotics. We cloned the human PXR gene and analysed the sequence in DNAs of individuals whose CYP3A phenotype was known. The PXR gene spans 35 kb, contains nine exons, and mapped to chromosome 13q11-13. Thirty-eight single nucleotide polymorphisms (SNPs) were identified including six SNPs in the coding region. Three of the coding SNPs are non-synonymous creating new PXR alleles [PXR*2, P27S (79C to T); PXR*3, G36R (106G to A); and PXR*4, R122Q (4321G to A)]. The frequency of PXR*2 was 0.20 in African Americans and was never found in Caucasians. Hepatic expression of CYP3A4 protein was not significantly different between African Americans homozygous for PXR*1 compared to those with one PXR*2 allele. PXR*4 was a rare variant found in only one Caucasian person. Homology modelling suggested that R122Q, (PXR*4) is a direct DNA contact site variation in the third alpha-helix in the DNA binding domain. Compared with PXR*1, and variants PXR*2 and PXR*3, only the variant PXR*4 protein had significantly decreased affinity for the PXR binding sequence in electromobility shift assays and attenuated ligand activation of the CYP3A4 reporter plasmids in transient transfection assays. However, the person heterozygous for PXR*4 is normal for CYP3A4 metabolism phenotype. The relevance of each of the 38 PXR SNPs identified in DNA of individuals whose CYP3A basal and rifampin-inducible CYP3A4 expression was determined in vivo and/or in vitro was demonstrated by univariate statistical analysis. Because ligand activation of PXR and upregulation of a system of drug detoxification genes are major determinants of drug interactions, it will now be useful to extend this work to determine the association of these common PXR SNPs to human variation in induction of other drug detoxification gene targets.

  10. Resequencing and association analysis of OXTR with autism spectrum disorder in a Japanese population.

    PubMed

    Egawa, Jun; Watanabe, Yuichiro; Shibuya, Masako; Endo, Taro; Sugimoto, Atsunori; Igeta, Hirofumi; Nunokawa, Ayako; Inoue, Emiko; Someya, Toshiyuki

    2015-03-01

    The oxytocin receptor (OXTR) is implicated in the pathophysiology of autism spectrum disorder (ASD). A recent study found a rare non-synonymous OXTR gene variation, rs35062132 (R376G), associated with ASD in a Japanese population. In order to investigate the association between rare non-synonymous OXTR variations and ASD, we resequenced OXTR and performed association analysis with ASD in a Japanese population. We resequenced the OXTR coding region in 213 ASD patients. Rare non-synonymous OXTR variations detected by resequencing were genotyped in 213 patients and 667 controls. We detected three rare non-synonymous variations: rs35062132 (R376G/C), rs151257822 (G334D), and g.8809426G>T (R150S). However, there was no significant association between these rare non-synonymous variations and ASD. Our present study does not support the contribution of rare non-synonymous OXTR variations to ASD susceptibility in the Japanese population. © 2014 The Authors. Psychiatry and Clinical Neurosciences © 2014 Japanese Society of Psychiatry and Neurology.

  11. Identification of IDUA and WNT16 Phosphorylation-Related Non-Synonymous Polymorphisms for Bone Mineral Density in Meta-Analyses of Genome-Wide Association Studies

    PubMed Central

    Niu, Tianhua; Liu, Ning; Yu, Xun; Zhao, Ming; Choi, Hyung Jin; Leo, Paul J.; Brown, Matthew A.; Zhang, Lei; Pei, Yu-Fang; Shen, Hui; He, Hao; Fu, Xiaoying; Lu, Shan; Chen, Xiang-Ding; Tan, Li-Jun; Yang, Tie-Lin; Guo, Yan; Cho, Nam H.; Shen, Jie; Guo, Yan-Fang; Nicholson, Geoffrey C.; Prince, Richard L.; Eisman, John A.; Jones, Graeme; Sambrook, Philip N.; Tian, Qing; Zhu, Xue-Zhen; Papasian, Christopher J.; Duncan, Emma L.; Uitterlinden, André G.; Shin, Chan Soo; Xiang, Shuanglin; Deng, Hong-Wen

    2016-01-01

    Protein phosphorylation regulates a wide variety of cellular processes. Thus, we hypothesize that single nucleotide polymorphisms (SNPs) that may modulate protein phosphorylation could affect osteoporosis risk. Based on a previous conventional genome-wide association (GWA) study, we conducted a three-stage meta-analysis targeting phosphorylation-related SNPs (phosSNPs) for femoral neck (FN)-, total hip (HIP)-, and Lumbar Spine (LS)-BMD phenotypes. In stage 1, 9,593 phosSNPs were meta-analyzed in 11,140 individuals of various ancestries. Genome-wide significance (GWS) and suggestive significance were defined by α = 5.21×10−6 (0.05/9,593) and 1.00×10−4, respectively. In stage 2, 9 stage 1-discovered phosSNPs (based on α = 1.00×10−4) were in silico meta-analyzed in Dutch, Korean, and Australian cohorts. In stage 3, four phosSNPs that replicated in stage 2 (based on α = 5.56×10−3, 0.05/9) were de novo genotyped in two independent cohorts. IDUA rs3755955 and rs6831280, and WNT16 rs2707466 were associated with BMD phenotypes in each respective stage, and in 3 stages combined, achieving GWS for both FN-BMD (P-value = 8.36×10−10, 5.26×10−10, and 3.01×10−10, respectively) and HIP-BMD (P-value = 3.26×10−6, 1.97×10−6, and 1.63×10−12, respectively). Although in vitro studies demonstrated no differences in expressions of wild-type and mutant forms of IDUA and WNT16B proteins, in silico analysis predicts that WNT16 rs2707466 directly abolishes a phosphorylation site, which could cause a deleterious effect on WNT16 protein, and that IDUA phosSNPs rs3755955 and rs6831280 could exert indirect effects on nearby phosphorylation sites. Further studies will be required to determine the detailed and specific molecular effects of these BMD-associated non-synonymous variants. PMID:26256109

  12. Paraoxonase promoter and intronic variants modify risk of sporadic amyotrophic lateral sclerosis

    PubMed Central

    Cronin, Simon; Greenway, Matthew J; Prehn, Jochen H M; Hardiman, Orla

    2007-01-01

    Background The paraoxonases, PON1–3, play a major protective role both against environmental toxins and as part of the antioxidant defence system. Recently, non‐synonymous coding single nucleotide polymorphisms (SNPs), known to lower serum PON activity, have been associated with sporadic ALS (SALS) in a Polish population. A separate trio based study described a detrimental allele at the PON3 intronic variant INS2+3651 (rs10487132). Association between PON gene cluster variants and SALS requires external validation in an independent dataset. Aims To examine the association of the promoter SNPs PON1−162G>A and PON1−108T>C; the non‐synonymous functional SNPs PON1Q192R and L55M and PON2C311S and A148G; and the intronic marker PON3INS2+3651A>G, with SALS in a genetically homogenous population. Methods 221 Irish patients with SALS and 202 unrelated control subjects were genotyped using KASPar chemistries. Statistical analyses and haplotype estimations were conducted using Haploview and Unphased software. Multiple permutation testing, as implemented in Unphased, was applied to haplotype p values to correct for multiple hypotheses. Results Two of the seven SNPs were associated with SALS in the Irish population: PON155M (OR 1.52, p = 0.006) and PON3INS2+3651 G (OR 1.36, p = 0.03). Two locus haplotype analysis showed association only when both of these risk alleles were present (OR 1.7, p = 0.005), suggesting a potential effect modification. Low functioning promoter variants were observed to influence this effect when compared with wild‐type. Conclusions These data provide additional evidence that genetic variation across the paroxanase loci may be common susceptibility factors for SALS. PMID:17702780

  13. Is really endogenous ghrelin a hunger signal in chickens? Association of GHSR SNPs with increase appetite, growth traits, expression and serum level of GHRL, and GH.

    PubMed

    El-Magd, Mohammed Abu; Saleh, Ayman A; Abdel-Hamid, Tamer M; Saleh, Rasha M; Afifi, Mohammed A

    2016-10-01

    Chicken growth hormone secretagogue receptor (GHSR) is a receptor for ghrelin (GHRL), a peptide hormone produced by chicken proventriculus, which stimulates growth hormone (GH) release and food intake. The purpose of this study was to search for single nucleotide polymorphisms (SNPs) in exon 2 of GHSR gene and to analyze their effect on the appetite, growth traits and expression levels of GHSR, GHRL, and GH genes as well as serum levels of GH and GHRL in Mandara chicken. Two adjacent SNPs, A239G and G244A, were detected in exon 2 of GHSR gene. G244A SNP was non-synonymous mutation and led to replacement of lysine amino acid (aa) by arginine aa, while A239G SNP was synonymous mutation. The combined genotypes of A239G and G244A SNPs produced three haplotypes; GG/GG, GG/AG, AG/AG, which associated significantly (P<0.05) with growth traits (body weight, average daily gain, shank length, keel length, chest circumference) at age from >4 to 16w. Chickens with the homozygous GG/GG haplotype showed higher growth performance than other chickens. The two SNPs were also correlated with mRNA levels of GHSR and GH (in pituitary gland), and GHRL (in proventriculus and hypothalamus) as well as with serum level of GH and GHRL. Also, chickens with GG/GG haplotype showed higher mRNA and serum levels. This is the first study to demonstrate that SNPs in GHSR can increase appetite, growth traits, expression and level of GHRL, suggesting a hunger signal role for endogenous GHRL. Copyright © 2016 Elsevier Inc. All rights reserved.

  14. Complete mitochondrial genome of Concholepas concholepas inferred by 454 pyrosequencing and mtDNA expression in two mollusc populations.

    PubMed

    Núñez-Acuña, Gustavo; Aguilar-Espinoza, Andrea; Gallardo-Escárate, Cristian

    2013-03-01

    Despite the great relevance of mitochondrial genome analysis in evolutionary studies, there is scarce information on how the transcripts associated with the mitogenome are expressed and their role in the genetic structuring of populations. This work reports the complete mitochondrial genome of the marine gastropod Concholepas concholepas, obtained by 454 pryosequencing, and an analysis of mitochondrial transcripts of two populations 1000 km apart along the Chilean coast. The mitochondrion of C. concholepas is 15,495 base pairs (bp) in size and contains the 37 subunits characteristic of metazoans, as well as a non-coding region of 330 bp. In silico analysis of mitochondrial gene variability showed significant differences among populations. In terms of levels of relative abundance of transcripts associated with mitochondrion in the two populations (assessed by qPCR), the genes associated with complexes III and IV of the mitochondrial genome had the highest levels of expression in the northern population while transcripts associated with the ATP synthase complex had the highest levels of expression in the southern population. Moreover, fifteen polymorphic SNPs were identified in silico between the mitogenomes of the two populations. Four of these markers implied different amino acid substitutions (non-synonymous SNPs). This work contributes novel information regarding the mitochondrial genome structure and mRNA expression levels of C. concholepas. Copyright © 2012 Elsevier Inc. All rights reserved.

  15. T cells are influenced by a long non-coding RNA in the autoimmune associated PTPN2 locus.

    PubMed

    Houtman, Miranda; Shchetynsky, Klementy; Chemin, Karine; Hensvold, Aase Haj; Ramsköld, Daniel; Tandre, Karolina; Eloranta, Maija-Leena; Rönnblom, Lars; Uebe, Steffen; Catrina, Anca Irinel; Malmström, Vivianne; Padyukov, Leonid

    2018-06-01

    Non-coding SNPs in the protein tyrosine phosphatase non-receptor type 2 (PTPN2) locus have been linked with several autoimmune diseases, including rheumatoid arthritis, type I diabetes, and inflammatory bowel disease. However, the functional consequences of these SNPs are poorly characterized. Herein, we show in blood cells that SNPs in the PTPN2 locus are highly correlated with DNA methylation levels at four CpG sites downstream of PTPN2 and expression levels of the long non-coding RNA (lncRNA) LINC01882 downstream of these CpG sites. We observed that LINC01882 is mainly expressed in T cells and that anti-CD3/CD28 activated naïve CD4 + T cells downregulate the expression of LINC01882. RNA sequencing analysis of LINC01882 knockdown in Jurkat T cells, using a combination of antisense oligonucleotides and RNA interference, revealed the upregulation of the transcription factor ZEB1 and kinase MAP2K4, both involved in IL-2 regulation. Overall, our data suggests the involvement of LINC01882 in T cell activation and hints towards an auxiliary role of these non-coding SNPs in autoimmunity associated with the PTPN2 locus. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.

  16. Polymorphisms of EpCAM gene and prognosis for non-small-cell lung cancer in Han Chinese

    PubMed Central

    Yang, Yuefan; Fei, Fei; Song, Yang; Li, Xiaofei; Zhang, Zhipei; Fei, Zhou; Su, Haichuan; Wan, Shaogui

    2014-01-01

    The epithelial cell adhesion molecule (EpCAM) is overexpressed in a wide variety of human cancers and is associated with patient prognosis, including those with lung cancer. However, the association of single nucleotide polymorphisms (SNPs) in the EpCAM gene with the prognosis for non-small-cell lung cancer (NSCLC) patients has never been investigated. We evaluated the association between two SNPs, rs1126497 and rs1421, in the EpCAM gene and clinical outcomes in a Chinese cohort of 506 NSCLC patients. The SNPs were genotyped using the Sequenom iPLEX genotyping system. Multivariate Cox proportional hazards model and Kaplan–Meier curves were used to assess the association of EpCAM gene genotypes with the prognosis of NSCLC. We found that the non-synonymous SNP rs1126497 was significantly associated with survival. Compared with the CC genotype, the CT+TT genotype was a risk factor for both death (hazard ratio, 1.40; 95% confidence interval [CI], 1.02–1.94; P = 0.040) and recurrence (hazard ratio, 1.34; 95% CI, 1.02–1.77; P = 0.039). However, the SNP rs1421 did not show any significant effect on patient prognosis. Instead, the AG+GG genotype in rs1421 was significantly associated with early T stages (T1/T2) when compared with the AA genotype (odds ratio for late stage = 0.65; 95% CI, 0.44–0.96, P = 0.029). Further stratified analysis showed notable modulating effects of clinical characteristics on the associations between variant genotypes of rs1126497 and NSCLC outcomes. In conclusion, our study indicated that the non-synonymous SNP rs1126497 may be a potential prognostic marker for NSCLC patients. PMID:24304228

  17. A non-synonymous SNP in the NOS2 associated with septic shock in patients with sepsis in Chinese populations.

    PubMed

    Wang, Zhifu; Feng, Kai; Yue, Maoxing; Lu, Xiaoguang; Zheng, Qihan; Zhang, Hongxing; Zhai, Yun; Li, Peiyao; Yu, Lixia; Cai, Mi; Zhang, Xiumei; Kang, Xin; Shi, Weihai; Xia, Xia; Chen, Xi; Cao, Pengbo; Li, Yuanfeng; Chen, Huipeng; Ling, Yan; Li, Yuxia; He, Fuchu; Zhou, Gangqiao

    2013-03-01

    Sepsis represents a systemic inflammatory response to infection and its sequelae include severe sepsis, septic shock, multiple organ dysfunction syndrome (MODS) and death. Studies in mice and humans indicate that the inducible nitric oxide synthase (iNOS, NOS2) plays an important role in the development of sepsis and its sequelae. It was reported that several single nucleotide polymorphisms (SNPs) within NOS2 could influence the production or activity of NOS2. In this study, we assessed whether SNPs within NOS2 gene were associated with severity of sepsis in Chinese populations. A case-control study was conducted, which included 299 and 280 unrelated patients with sepsis recruited from Liaoning and Jiangsu provinces in China, respectively. Six SNPs within NOS2 were genotyped using Sequenom MassARRAY system. The associations between the SNPs and risk of sepsis complications were estimated by a binary logistic regression model adjusted for confounding factors. Functional assay was performed to assess the biological significance. The GA + AA genotype of a non-synonymous SNP in the exon 16 of NOS2 (rs2297518: G>A) was significantly associated with increased susceptibility to septic shock compared with GG genotype in Liaoning population (OR = 3.29, 95% CI = 1.40-7.72, P = 0.0047). This association was confirmed in the Jiangsu population (OR = 3.49, 95% CI = 1.57-7.79, P = 0.0019). Furthermore, the functional assay performed in the immortalized lymphocyte cell lines indicated that the at-risk GA genotype had a tendency of higher NOS2 activity than the GG genotype (P = 0.32). Our findings suggest that the NOS2 rs2297518 may play a role in mediating the susceptibility to septic shock in patients with sepsis in Chinese populations.

  18. Tissue expression and predicted protein structures of the bovine ANGPTL3 and association of novel SNPs with growth and meat quality traits.

    PubMed

    Chen, N B; Ma, Y; Yang, T; Lin, F; Fu, W W; Xu, Y J; Li, F; Li, J Y; Gao, S X

    2015-08-01

    Angiopoietin-like protein 3 (ANGPTL3) is a secreted protein that regulates lipid, glucose and energy metabolism. This study was conducted to better understand the effect of ANGPTL3 on important economic traits in cattle. First, transcript profiles for ANGPTL3 were measured in nine different Jiaxian cattle tissues. Second, polymorphisms were identified in the complete coding region and promoter region of the bovine ANGPTL3 gene in 707 cattle samples. Finally, an association study was carried out utilizing these single nucleotide polymorphisms (SNPs) to determine the effect of these SNPs on the growth and meat quality traits. Quantitative real-time PCR analysis showed that ANGPTL3 was mainly expressed in the liver. The promoter of the bovine ANGPTL3 contained several putative transcription factor binding sites (SF1, HNF-1, LXRα, NFκβ, HNF-3 and C/EBP). In total, four SNPs of the bovine ANGPTL3 gene were identified by direct sequencing. SNP1 (rs469906272: g.-38T>C) was identified in the promoter, SNP2 (rs451104723:g.104A>T) and SNP3 (rs482516226: g.509A>G) were identified in exon 1, and SNP4 (rs477165942: g.8661T>C) was identified in exon 6. Changes in predicted protein structures due to non-synonymous SNPs were analyzed. Haplotype frequencies and linkage disequilibrium were also investigated. Analysis of four SNPs in cattle from different native Chinese breeds (Nanyang (NY) and Jiaxian (JX)) and commercial breeds (Angus (AG), Hereford (HF), Limousin (LM), Luxi (LX), Simmental (ST) and Jinnan (JN)) revealed a significant association with growth traits (including: BW and hipbone width) and meat quality traits (including: Warner-Bratzler shear force and ribeye area). Therefore, implementation of these four mutations in selection indices in the beef industry may be beneficial in selecting individuals with superior growth and meat quality traits.

  19. Genetic Polymorphisms in Host Antiviral Genes: Associations with Humoral and Cellular Immunity to Measles Vaccine

    PubMed Central

    Haralambieva, Iana H.; Ovsyannikova, Inna G.; Umlauf, Benjamin J.; Vierkant, Robert A.; Pankratz, V. Shane; Jacobson, Robert M.; Poland, Gregory A.

    2014-01-01

    Host antiviral genes are important regulators of antiviral immunity and plausible genetic determinants of immune response heterogeneity after vaccination. We genotyped and analyzed 307 common candidate tagSNPs from 12 antiviral genes in a cohort of 745 schoolchildren immunized with two doses of measles-mumps-rubella vaccine. Associations between SNPs/haplotypes and measles virus-specific immune outcomes were assessed using linear regression methodologies in Caucasians and African-Americans. Genetic variants within the DDX58/RIG-I gene, including a coding polymorphism (rs3205166/Val800Val), were associated as single-SNPs (p≤0.017; although these SNPs did not remain significant after correction for false discovery rate/FDR) and in haplotype-level analysis, with measles-specific antibody variations in Caucasians (haplotype allele p-value=0.021; haplotype global p-value=0.076). Four DDX58 polymorphisms, in high LD, demonstrated also associations (after correction for FDR) with variations in both measles-specific IFN-γ and IL-2 secretion in Caucasians (p≤0.001, q=0.193). Two intronic OAS1 polymorphisms, including the functional OAS1 SNP rs10774671 (p=0.003), demonstrated evidence of association with a significant allele-dose-related increase in neutralizing antibody levels in African-Americans. Genotype and haplotype-level associations demonstrated the role of ADAR genetic variants, including a non-synonymous SNP (rs2229857/Arg384Lys; p=0.01), in regulating measles virus-specific IFN-γ Elispot responses in Caucasians (haplotype global p-value=0.017). After correction FDR, 15 single-SNP associations (11 SNPs in Caucasians and 4 SNPs in African-Americans) still remained significant at the q-value<0.20. In conclusion, our findings strongly point to genetic variants/genes, involved in antiviral sensing and antiviral control, as critical determinants, differentially modulating the adaptive immune responses to live attenuated measles vaccine in Caucasians and African-Americans. PMID:21939710

  20. Identification of bovine NPC1 gene cSNPs and their effects on body size traits of Qinchuan cattle.

    PubMed

    Dang, Yonglong; Li, Mingxun; Yang, Mingjuan; Cao, Xiukai; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Lin, Qing; Chen, Hong

    2014-05-01

    NPC1 gene is an important gene closely related to the Niemann-Pick type C (NPC). Mutations in the NPC1 gene tend to cause Niemann-Pick type C, a lysosomal storage disorder. Previous studies have shown that NPC1 protein plays an important role in subcellular lipid transport, homeostasis, platelet function and formation, which are basic metabolic activities in the process of development. In this study, to explore the association between the NPC1 gene variation and body size traits in Qinchuan cattle, we detected four novel coding single nucleotide polymorphisms (cSNPs) in the bovine NPC1 gene, including one missense mutation (SNP1) and three synonymous mutations (SNP2, SNP3 and SNP4). Population genetic analyses of 518 individuals and association correlations between cSNPs and bovine body size traits were conducted in this research. A missense mutation at SNP1 locus was found to be significantly related to the heart girth, hip width and body weight (P<0.01 or P<0.05, 3.5-year-old). Two synonymous mutations at SNP2 and SNP3 loci also showed significant effects on hip width (P<0.05, 3.5-year-old). One synonymous mutation at SNP4 locus showed significant effect on body weight (P<0.05, 2.0-year-old). Combined haplotypes H2H6 and H6H6 showed significant effects on body size traits such as heart girth, hip width, and body weight (3.5-year-old, P<0.01 or P<0.05). This study provides evidence that the NPC1 gene might be involved in the regulation of bovine growth and body development, and may be considered as a candidate gene for marker assisted selection (MAS) in beef cattle breeding industry. Copyright © 2014. Published by Elsevier B.V.

  1. Variability of the caprine whey protein genes and their association with milk yield, composition and renneting properties in the Sarda breed. 1. The LALBA gene.

    PubMed

    Dettori, Maria Luisa; Pazzola, Michele; Paschino, Pietro; Pira, Maria Giovanna; Vacca, Giuseppe Massimo

    2015-11-01

    The 5' flanking region and 3' UTR of the caprine LALBA gene were analysed by SSCP and sequencing. A total of nine SNPs were detected: three in the promoter region, two were synonymous coding SNPs at exon-1, and four SNPs were in exon-4, within the 3'UTR. The nucleotide changes located in the promoter region (c.-358T>C, c.-163G>A, c.-121T>G) were genotyped by SSCP in 263 Sarda goats to evaluate their possible effect on milk yield, composition and renneting properties. We observed an effect of the three SNPs on milk yield and lactose content. Genotypes TT and CT at c.-358T>C (P A (P C and c.-121T>G were part of transcription factors binding sites, potentially involved in modulating the LALBA gene expression. The LALBA genotype affected renneting properties (P < 0.001), as heterozygotes c.-358CT and c.-163GA were characterised by delayed rennet coagulation time and curd firming time and the lowest value of curd firmness. The present investigation increases the panel of SNPs and adds new information about the effects of the caprine LALBA gene polymorphism.

  2. Refined mapping of autoimmune disease associated genetic variants with gene expression suggests an important role for non-coding RNAs.

    PubMed

    Ricaño-Ponce, Isis; Zhernakova, Daria V; Deelen, Patrick; Luo, Oscar; Li, Xingwang; Isaacs, Aaron; Karjalainen, Juha; Di Tommaso, Jennifer; Borek, Zuzanna Agnieszka; Zorro, Maria M; Gutierrez-Achury, Javier; Uitterlinden, Andre G; Hofman, Albert; van Meurs, Joyce; Netea, Mihai G; Jonkers, Iris H; Withoff, Sebo; van Duijn, Cornelia M; Li, Yang; Ruan, Yijun; Franke, Lude; Wijmenga, Cisca; Kumar, Vinod

    2016-04-01

    Genome-wide association and fine-mapping studies in 14 autoimmune diseases (AID) have implicated more than 250 loci in one or more of these diseases. As more than 90% of AID-associated SNPs are intergenic or intronic, pinpointing the causal genes is challenging. We performed a systematic analysis to link 460 SNPs that are associated with 14 AID to causal genes using transcriptomic data from 629 blood samples. We were able to link 71 (39%) of the AID-SNPs to two or more nearby genes, providing evidence that for part of the AID loci multiple causal genes exist. While 54 of the AID loci are shared by one or more AID, 17% of them do not share candidate causal genes. In addition to finding novel genes such as ULK3, we also implicate novel disease mechanisms and pathways like autophagy in celiac disease pathogenesis. Furthermore, 42 of the AID SNPs specifically affected the expression of 53 non-coding RNA genes. To further understand how the non-coding genome contributes to AID, the SNPs were linked to functional regulatory elements, which suggest a model where AID genes are regulated by network of chromatin looping/non-coding RNAs interactions. The looping model also explains how a causal candidate gene is not necessarily the gene closest to the AID SNP, which was the case in nearly 50% of cases. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  3. Studying the genetic basis of speciation in high gene flow marine invertebrates

    PubMed Central

    2016-01-01

    A growing number of genes responsible for reproductive incompatibilities between species (barrier loci) exhibit the signals of positive selection. However, the possibility that genes experiencing positive selection diverge early in speciation and commonly cause reproductive incompatibilities has not been systematically investigated on a genome-wide scale. Here, I outline a research program for studying the genetic basis of speciation in broadcast spawning marine invertebrates that uses a priori genome-wide information on a large, unbiased sample of genes tested for positive selection. A targeted sequence capture approach is proposed that scores single-nucleotide polymorphisms (SNPs) in widely separated species populations at an early stage of allopatric divergence. The targeted capture of both coding and non-coding sequences enables SNPs to be characterized at known locations across the genome and at genes with known selective or neutral histories. The neutral coding and non-coding SNPs provide robust background distributions for identifying FST-outliers within genes that can, in principle, identify specific mutations experiencing diversifying selection. If natural hybridization occurs between species, the neutral coding and non-coding SNPs can provide a neutral admixture model for genomic clines analyses aimed at finding genes exhibiting strong blocks to introgression. Strongylocentrotid sea urchins are used as a model system to outline the approach but it can be used for any group that has a complete reference genome available. PMID:29491951

  4. Non-Synonymous Single-Nucleotide Polymorphisms and Physical Activity Interactions on Adiposity Parameters in Malaysian Adolescents.

    PubMed

    Zaharan, Nur Lisa; Muhamad, Nor Hanisah; Jalaludin, Muhammad Yazid; Su, Tin Tin; Mohamed, Zahurin; Mohamed, M N A; A Majid, Hazreen

    2018-01-01

    Several non-synonymous single-nucleotide polymorphisms (nsSNPs) have been shown to be associated with obesity. Little is known about their associations and interactions with physical activity (PA) in relation to adiposity parameters among adolescents in Malaysia. We examined whether (a) PA and (b) selected nsSNPs are associated with adiposity parameters and whether PA interacts with these nsSNPs on these outcomes in adolescents from the Malaysian Health and Adolescents Longitudinal Research Team study ( n  = 1,151). Body mass indices, waist-hip ratio, and percentage body fat (% BF) were obtained. PA was assessed using Physical Activity Questionnaire for Older Children (PAQ-C). Five nsSNPs were included: beta-3 adrenergic receptor (ADRB3) rs4994, FABP2 rs1799883, GHRL rs696217, MC3R rs3827103, and vitamin D receptor rs2228570, individually and as combined genetic risk score (GRS). Associations and interactions between nsSNPs and PAQ-C scores were examined using generalized linear model. PAQ-C scores were associated with % BF (β = -0.44 [95% confidence interval -0.72, -0.16], p  = 0.002). The CC genotype of ADRB3 rs4994 (β = -0.16 [-0.28, -0.05], corrected p  = 0.01) and AA genotype of MC3R rs3827103 (β = -0.06 [-0.12, -0.00], p  = 0.02) were significantly associated with % BF compared to TT and GG genotypes, respectively. Significant interactions with PA were found between ADRB3 rs4994 (β = -0.05 [-0.10, -0.01], p  = 0.02) and combined GRS (β = -0.03 [-0.04, -0.01], p  = 0.01) for % BF. Higher PA score was associated with reduced % BF in Malaysian adolescents. Of the nsSNPs, ADRB3 rs4994 and MC3R rs3827103 were associated with % BF. Significant interactions with PA were found for ADRB3 rs4994 and combined GRS on % BF but not on measurements of weight or circumferences. Targeting body fat represent prospects for molecular studies and lifestyle intervention in this population.

  5. Simulation Based Investigation of Deleterious nsSNPs in ATXN2 Gene and Its Structural Consequence Toward Spinocerebellar Ataxia.

    PubMed

    Sinha, Siddharth; Verma, Sharad; Singh, Aditi; Somvanshi, Pallavi; Grover, Abhinav

    2018-01-01

    Spinocerebellar degeneration, termed as ataxia is a neurological disorder of central nervous system, characterized by limb in-coordination and a progressive gait. The patient also demonstrates specific symptoms of muscle weakness, slurring of speech, and decreased vibration senses. Expansion of polyglutamine trinucleotide (CAG) within ATXN2 gene with 35 or more repeats, results in spinocerebellar ataxia type-2. Protein ataxin-2 coded by ATXN2 gene has been reported to have a crucial role in translation of the genetic information through sequestering the histone acetyl transferases (HAT) resulting in a state of hypo-acetylation. In the present study, we have evaluated the outcome for 122 non synonymous single nucleotide polymorphisms (nsSNPs) reported within ATXN2 gene through computational tools such as SIFT, PolyPhen 2.0, PANTHER, I-mutant 2.0, Phd-SNP, Pmut, MutPred. The apo and mutant (L305V and Q339L) form of structures for the ataxin-2 protein were modeled for gaining insights toward 3D spatial arrangement. Further, molecular dynamics simulations and structural analysis were performed to observe the brunt of disease associated nsSNPs toward the strength and secondary properties of ataxin-2 protein structure. Our results showed that, L305V is a highly deleterious and disease causing point substitution. Analysis based on RMSD, RMSF, Rg, SASA, number of hydrogen bonds (NH bonds), covariance matrix trace, projection analysis for eigen vector demonstrated a significant instability and conformation along with rise in mutant flexibility values in comparison to the apo form of ataxin-2 protein. The study provides a blue print of computational methodologies to examine the ataxin-blend SNPs. J. Cell. Biochem. 119: 499-510, 2018. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  6. Combined sequence and sequence-structure-based methods for analyzing RAAS gene SNPs: a computational approach.

    PubMed

    Singh, Kh Dhanachandra; Karthikeyan, Muthusamy

    2014-12-01

    The renin-angiotensin-aldosterone system (RAAS) plays a key role in the regulation of blood pressure (BP). Mutations on the genes that encode components of the RAAS have played a significant role in genetic susceptibility to hypertension and have been intensively scrutinized. The identification of such probably causal mutations not only provides insight into the RAAS but may also serve as antihypertensive therapeutic targets and diagnostic markers. The methods for analyzing the SNPs from the huge dataset of SNPs, containing both functional and neutral SNPs is challenging by the experimental approach on every SNPs to determine their biological significance. To explore the functional significance of genetic mutation (SNPs), we adopted combined sequence and sequence-structure-based SNP analysis algorithm. Out of 3864 SNPs reported in dbSNP, we found 108 missense SNPs in the coding region and remaining in the non-coding region. In this study, we are reporting only those SNPs in coding region to be deleterious when three or more tools are predicted to be deleterious and which have high RMSD from the native structure. Based on these analyses, we have identified two SNPs of REN gene, eight SNPs of AGT gene, three SNPs of ACE gene, two SNPs of AT1R gene, three SNPs of CYP11B2 gene and three SNPs of CMA1 gene in the coding region were found to be deleterious. Further this type of study will be helpful in reducing the cost and time for identification of potential SNP and also helpful in selecting potential SNP for experimental study out of SNP pool.

  7. Impact of SNPs on Protein Phosphorylation Status in Rice (Oryza sativa L.).

    PubMed

    Lin, Shoukai; Chen, Lijuan; Tao, Huan; Huang, Jian; Xu, Chaoqun; Li, Lin; Ma, Shiwei; Tian, Tian; Liu, Wei; Xue, Lichun; Ai, Yufang; He, Huaqin

    2016-11-11

    Single nucleotide polymorphisms (SNPs) are widely used in functional genomics and genetics research work. The high-quality sequence of rice genome has provided a genome-wide SNP and proteome resource. However, the impact of SNPs on protein phosphorylation status in rice is not fully understood. In this paper, we firstly updated rice SNP resource based on the new rice genome Ver. 7.0, then systematically analyzed the potential impact of Non-synonymous SNPs (nsSNPs) on the protein phosphorylation status. There were 3,897,312 SNPs in Ver. 7.0 rice genome, among which 9.9% was nsSNPs. Whilst, a total 2,508,261 phosphorylated sites were predicted in rice proteome. Interestingly, we observed that 150,197 (39.1%) nsSNPs could influence protein phosphorylation status, among which 52.2% might induce changes of protein kinase (PK) types for adjacent phosphorylation sites. We constructed a database, SNP_rice, to deposit the updated rice SNP resource and phosSNPs information. It was freely available to academic researchers at http://bioinformatics.fafu.edu.cn. As a case study, we detected five nsSNPs that potentially influenced heterotrimeric G proteins phosphorylation status in rice, indicating that genetic polymorphisms showed impact on the signal transduction by influencing the phosphorylation status of heterotrimeric G proteins. The results in this work could be a useful resource for future experimental identification and provide interesting information for better rice breeding.

  8. Prediction of Disease Causing Non-Synonymous SNPs by the Artificial Neural Network Predictor NetDiseaseSNP

    PubMed Central

    Johansen, Morten Bo; Izarzugaza, Jose M. G.; Brunak, Søren; Petersen, Thomas Nordahl; Gupta, Ramneek

    2013-01-01

    We have developed a sequence conservation-based artificial neural network predictor called NetDiseaseSNP which classifies nsSNPs as disease-causing or neutral. Our method uses the excellent alignment generation algorithm of SIFT to identify related sequences and a combination of 31 features assessing sequence conservation and the predicted surface accessibility to produce a single score which can be used to rank nsSNPs based on their potential to cause disease. NetDiseaseSNP classifies successfully disease-causing and neutral mutations. In addition, we show that NetDiseaseSNP discriminates cancer driver and passenger mutations satisfactorily. Our method outperforms other state-of-the-art methods on several disease/neutral datasets as well as on cancer driver/passenger mutation datasets and can thus be used to pinpoint and prioritize plausible disease candidates among nsSNPs for further investigation. NetDiseaseSNP is publicly available as an online tool as well as a web service: http://www.cbs.dtu.dk/services/NetDiseaseSNP PMID:23935863

  9. Whole genome sequencing in the search for genes associated with the control of SIV infection in the Mauritian macaque model.

    PubMed

    de Manuel, Marc; Shiina, Takashi; Suzuki, Shingo; Dereuddre-Bosquet, Nathalie; Garchon, Henri-Jean; Tanaka, Masayuki; Congy-Jolivet, Nicolas; Aarnink, Alice; Le Grand, Roger; Marques-Bonet, Tomas; Blancher, Antoine

    2018-05-08

    In the Mauritian macaque experimentally inoculated with SIV, gene polymorphisms potentially associated with the plasma virus load at a set point, approximately 100 days post inoculation, were investigated. Among the 42 animals inoculated with 50 AID 50 of the same strain of SIV, none of which received any preventive or curative treatment, nine individuals were selected: three with a plasma virus load (PVL) among the lowest, three with intermediate PVL values and three among the highest PVL values. The complete genomes of these nine animals were then analyzed. Initially, attention was focused on variants with a potential functional impact on protein encoding genes (non-synonymous SNPs (NS-SNPs) and splicing variants). Thus, 424 NS-SNPs possibly associated with PVL were detected. The 424 candidates SNPs were genotyped in these 42 SIV experimentally infected animals (including the nine animals subjected to whole genome sequencing). The genes containing variants most probably associated with PVL at a set time point are analyzed herein.

  10. Common genetic variants of the ion channel transient receptor potential membrane melastatin 6 and 7 (TRPM6 and TRPM7), magnesium intake, and risk of type 2 diabetes in women.

    PubMed

    Song, Yiqing; Hsu, Yi-Hsiang; Niu, Tianhua; Manson, Joann E; Buring, Julie E; Liu, Simin

    2009-01-17

    Ion channel transient receptor potential membrane melastatin 6 and 7 (TRPM6 and TRPM7) play a central role in magnesium homeostasis, which is critical for maintaining glucose and insulin metabolism. However, it is unclear whether common genetic variation in TRPM6 and TRPM7 contributes to risk of type 2 diabetes. We conducted a nested case-control study in the Women's Health Study. During a median of 10 years of follow-up, 359 incident diabetes cases were diagnosed and matched by age and ethnicity with 359 controls. We analyzed 20 haplotype-tagging single nucleotide polymorphisms (SNPs) in TRPM6 and 5 common SNPs in TRPM7 for their association with diabetes risk. Overall, there was no robust and significant association between any single SNP and diabetes risk. Neither was there any evidence of association between common TRPM6 and TRPM7 haplotypes and diabetes risk. Our haplotype analyses suggested a significant risk of type 2 diabetes among carriers of both the rare alleles from two non-synomous SNPs in TRPM6 (Val1393Ile in exon 26 [rs3750425] and Lys1584Glu in exon 27 [rs2274924]) when their magnesium intake was lower than 250 mg per day. Compared with non-carriers, women who were carriers of the haplotype 1393Ile-1584Glu had an increased risk of type 2 diabetes (OR, 4.92, 95% CI, 1.05-23.0) only when they had low magnesium intake (<250 mg/day). Our results provide suggestive evidence that two common non-synonymous TRPM6 coding region variants, Ile1393Val and Lys1584Glu polymorphisms, might confer susceptibility to type 2 diabetes in women with low magnesium intake. Further replication in large-scale studies is warranted.

  11. Genome-wide re-sequencing of multidrug-resistant Mycobacterium leprae Airaku-3.

    PubMed

    Singh, P; Benjak, A; Carat, S; Kai, M; Busso, P; Avanzi, C; Paniz-Mondolfi, A; Peter, C; Harshman, K; Rougemont, J; Matsuoka, M; Cole, S T

    2014-10-01

    Genotyping and molecular characterization of drug resistance mechanisms in Mycobacterium leprae enables disease transmission and drug resistance trends to be monitored. In the present study, we performed genome-wide analysis of Airaku-3, a multidrug-resistant strain with an unknown mechanism of resistance to rifampicin. We identified 12 unique non-synonymous single-nucleotide polymorphisms (SNPs) including two in the transporter-encoding ctpC and ctpI genes. In addition, two SNPs were found that improve the resolution of SNP-based genotyping, particularly for Venezuelan and South East Asian strains of M. leprae. © 2014 The Authors Clinical Microbiology and Infection © 2014 European Society of Clinical Microbiology and Infectious Diseases.

  12. Significance of 5,10-methylenetetrahydrofolate reductase gene variants in acute lymphoblastic leukemia in Indian population: an experimental, computational and meta-analysis.

    PubMed

    Bellampalli, Ravishankara; Phani, Nagaraja M; Bhat, Kamalakshi G; Prasad, Krishna; Bhaskaranand, Nalini; Guruprasad, Kanive P; Rai, Padmalatha S; Satyamoorthy, Kapaettu

    2015-05-01

    Acute lymphoblastic leukemia (ALL) arises due to several genetic alterations in progenitor cells, and methotrexate is frequently used as part of the treatment regimen. Although there is evidence for an effect of 5,10-methylenetetrahydrofolate reductase gene (MTHFR) C677T and A1298C variations on drug response in ALL, its risk association for ALL is still unresolved. In a case-control study of 203 patients with ALL and 246 controls and meta-analysis in the Indian population, we showed an insignificant association of MTHFR C677T and A1298C genotypes with childhood and adult ALL. Comprehensive in silico characterization of non-synonymous single nucleotide polymorphisms (nsSNPs) and SNPs of the 3' untranslated region (UTR) revealed nine nsSNPs as deleterious, and three SNPs in the 3'UTR could possibly alter the binding of miRNAs. The study revealed that several overlooked SNPs may contribute to the risk of ALL susceptibility and further studies of these SNPs with functional characterization in a large sample size are required to understand the significant role of MTHFR in ALL development.

  13. Transcriptomic investigation of meat tenderness in two Italian cattle breeds.

    PubMed

    Bongiorni, S; Gruber, C E M; Bueno, S; Chillemi, G; Ferrè, F; Failla, S; Moioli, B; Valentini, A

    2016-06-01

    Our objectives for this study were to understand the biological basis of meat tenderness and to provide an overview of the gene expression profiles related to meat quality as a tool for selection. Through deep mRNA sequencing, we analyzed gene expression in muscle tissues of two Italian cattle breeds: Maremmana and Chianina. We uncovered several differentially expressed genes that encode for proteins belonging to a family of tripartite motif proteins, which are involved in growth, cell differentiation and apoptosis, such as TRIM45, or play an essential role in regulating skeletal muscle differentiation and the regeneration of adult skeletal muscle, such as TRIM32. Other differentially expressed genes (SCN2B, SLC9A7 and KCNK3) emphasize the involvement of potassium-sodium pumps in tender meat. By mapping splice junctions in RNA-Seq reads, we found significant differences in gene isoform expression levels. The PRKAG3 gene, which is involved in the regulation of energy metabolism, showed four isoforms that were differentially expressed. This distinct pattern of PRKAG3 gene expression could indicate impaired glycogen storage in skeletal muscle, and consequently, this gene very likely has a role in the tenderization process. Furthermore, with this deep RNA-sequencing, we captured a high number of expressed SNPs, for example, we found 1462 homozygous SNPs showing the alternative allele with a 100% frequency when comparing tender and tough meat. SNPs were then classified into categories by their position and also by their effect on gene coding (174 non-synonymous polymorphisms) based on the available UMD_3.1 annotations. © 2016 Stichting International Foundation for Animal Genetics.

  14. Sequencing of the needle transcriptome from Norway spruce (Picea abies Karst L.) reveals lower substitution rates, but similar selective constraints in gymnosperms and angiosperms

    PubMed Central

    2012-01-01

    Background A detailed knowledge about spatial and temporal gene expression is important for understanding both the function of genes and their evolution. For the vast majority of species, transcriptomes are still largely uncharacterized and even in those where substantial information is available it is often in the form of partially sequenced transcriptomes. With the development of next generation sequencing, a single experiment can now simultaneously identify the transcribed part of a species genome and estimate levels of gene expression. Results mRNA from actively growing needles of Norway spruce (Picea abies) was sequenced using next generation sequencing technology. In total, close to 70 million fragments with a length of 76 bp were sequenced resulting in 5 Gbp of raw data. A de novo assembly of these reads, together with publicly available expressed sequence tag (EST) data from Norway spruce, was used to create a reference transcriptome. Of the 38,419 PUTs (putative unique transcripts) longer than 150 bp in this reference assembly, 83.5% show similarity to ESTs from other spruce species and of the remaining PUTs, 3,704 show similarity to protein sequences from other plant species, leaving 4,167 PUTs with limited similarity to currently available plant proteins. By predicting coding frames and comparing not only the Norway spruce PUTs, but also PUTs from the close relatives Picea glauca and Picea sitchensis to both Pinus taeda and Taxus mairei, we obtained estimates of synonymous and non-synonymous divergence among conifer species. In addition, we detected close to 15,000 SNPs of high quality and estimated gene expression differences between samples collected under dark and light conditions. Conclusions Our study yielded a large number of single nucleotide polymorphisms as well as estimates of gene expression on transcriptome scale. In agreement with a recent study we find that the synonymous substitution rate per year (0.6 × 10−09 and 1.1 × 10−09) is an order of magnitude smaller than values reported for angiosperm herbs. However, if one takes generation time into account, most of this difference disappears. The estimates of the dN/dS ratio (non-synonymous over synonymous divergence) reported here are in general much lower than 1 and only a few genes showed a ratio larger than 1. PMID:23122049

  15. Fine Mapping and Functional Analysis of the Multiple Sclerosis Risk Gene CD6

    PubMed Central

    Swaminathan, Bhairavi; Cuapio, Angélica; Alloza, Iraide; Matesanz, Fuencisla; Alcina, Antonio; García-Barcina, Maria; Fedetz, Maria; Fernández, Óscar; Lucas, Miguel; Órpez, Teresa; Pinto-Medel, Mª Jesus; Otaegui, David; Olascoaga, Javier; Urcelay, Elena; Ortiz, Miguel A.; Arroyo, Rafael; Oksenberg, Jorge R.; Antigüedad, Alfredo; Tolosa, Eva; Vandenbroeck, Koen

    2013-01-01

    CD6 has recently been identified and validated as risk gene for multiple sclerosis (MS), based on the association of a single nucleotide polymorphism (SNP), rs17824933, located in intron 1. CD6 is a cell surface scavenger receptor involved in T-cell activation and proliferation, as well as in thymocyte differentiation. In this study, we performed a haptag SNP screen of the CD6 gene locus using a total of thirteen tagging SNPs, of which three were non-synonymous SNPs, and replicated the recently reported GWAS SNP rs650258 in a Spanish-Basque collection of 814 controls and 823 cases. Validation of the six most strongly associated SNPs was performed in an independent collection of 2265 MS patients and 2600 healthy controls. We identified association of haplotypes composed of two non-synonymous SNPs [rs11230563 (R225W) and rs2074225 (A257V)] in the 2nd SRCR domain with susceptibility to MS (P max(T) permutation = 1×10−4). The effect of these haplotypes on CD6 surface expression and cytokine secretion was also tested. The analysis showed significantly different CD6 expression patterns in the distinct cell subsets, i.e. – CD4+ naïve cells, P = 0.0001; CD8+ naïve cells, P<0.0001; CD4+ and CD8+ central memory cells, P = 0.01 and 0.05, respectively; and natural killer T (NKT) cells, P = 0.02; with the protective haplotype (RA) showing higher expression of CD6. However, no significant changes were observed in natural killer (NK) cells, effector memory and terminally differentiated effector memory T cells. Our findings reveal that this new MS-associated CD6 risk haplotype significantly modifies expression of CD6 on CD4+ and CD8+ T cells. PMID:23638056

  16. Allelic expression mapping across cellular lineages to establish impact of non-coding SNPs

    PubMed Central

    Adoue, Veronique; Schiavi, Alicia; Light, Nicholas; Almlöf, Jonas Carlsson; Lundmark, Per; Ge, Bing; Kwan, Tony; Caron, Maxime; Rönnblom, Lars; Wang, Chuan; Chen, Shu-Huang; Goodall, Alison H; Cambien, Francois; Deloukas, Panos; Ouwehand, Willem H; Syvänen, Ann-Christine; Pastinen, Tomi

    2014-01-01

    Most complex disease-associated genetic variants are located in non-coding regions and are therefore thought to be regulatory in nature. Association mapping of differential allelic expression (AE) is a powerful method to identify SNPs with direct cis-regulatory impact (cis-rSNPs). We used AE mapping to identify cis-rSNPs regulating gene expression in 55 and 63 HapMap lymphoblastoid cell lines from a Caucasian and an African population, respectively, 70 fibroblast cell lines, and 188 purified monocyte samples and found 40–60% of these cis-rSNPs to be shared across cell types. We uncover a new class of cis-rSNPs, which disrupt footprint-derived de novo motifs that are predominantly bound by repressive factors and are implicated in disease susceptibility through overlaps with GWAS SNPs. Finally, we provide the proof-of-principle for a new approach for genome-wide functional validation of transcription factor–SNP interactions. By perturbing NFκB action in lymphoblasts, we identified 489 cis-regulated transcripts with altered AE after NFκB perturbation. Altogether, we perform a comprehensive analysis of cis-variation in four cell populations and provide new tools for the identification of functional variants associated to complex diseases. PMID:25326100

  17. mQTL-seq delineates functionally relevant candidate gene harbouring a major QTL regulating pod number in chickpea

    PubMed Central

    Das, Shouvik; Singh, Mohar; Srivastava, Rishi; Bajaj, Deepak; Saxena, Maneesha S.; Rana, Jai C.; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.

    2016-01-01

    The present study used a whole-genome, NGS resequencing-based mQTL-seq (multiple QTL-seq) strategy in two inter-specific mapping populations (Pusa 1103 × ILWC 46 and Pusa 256 × ILWC 46) to scan the major genomic region(s) underlying QTL(s) governing pod number trait in chickpea. Essentially, the whole-genome resequencing of low and high pod number-containing parental accessions and homozygous individuals (constituting bulks) from each of these two mapping populations discovered >8 million high-quality homozygous SNPs with respect to the reference kabuli chickpea. The functional significance of the physically mapped SNPs was apparent from the identified 2,264 non-synonymous and 23,550 regulatory SNPs, with 8–10% of these SNPs-carrying genes corresponding to transcription factors and disease resistance-related proteins. The utilization of these mined SNPs in Δ (SNP index)-led QTL-seq analysis and their correlation between two mapping populations based on mQTL-seq, narrowed down two (CaqaPN4.1: 867.8 kb and CaqaPN4.2: 1.8 Mb) major genomic regions harbouring robust pod number QTLs into the high-resolution short QTL intervals (CaqbPN4.1: 637.5 kb and CaqbPN4.2: 1.28 Mb) on chickpea chromosome 4. The integration of mQTL-seq-derived one novel robust QTL with QTL region-specific association analysis delineated the regulatory (C/T) and coding (C/A) SNPs-containing one pentatricopeptide repeat (PPR) gene at a major QTL region regulating pod number in chickpea. This target gene exhibited anther, mature pollen and pod-specific expression, including pronounced higher up-regulated (∼3.5-folds) transcript expression in high pod number-containing parental accessions and homozygous individuals of two mapping populations especially during pollen and pod development. The proposed mQTL-seq-driven combinatorial strategy has profound efficacy in rapid genome-wide scanning of potential candidate gene(s) underlying trait-associated high-resolution robust QTL(s), thereby expediting genomics-assisted breeding and genetic enhancement of crop plants, including chickpea. PMID:26685680

  18. Who's behind that mask and cape? The Asian leopard cat's Agouti (ASIP) allele likely affects coat colour phenotype in the Bengal cat breed.

    PubMed

    Gershony, L C; Penedo, M C T; Davis, B W; Murphy, W J; Helps, C R; Lyons, L A

    2014-12-01

    Coat colours and patterns are highly variable in cats and are determined mainly by several genes with Mendelian inheritance. A 2-bp deletion in agouti signalling protein (ASIP) is associated with melanism in domestic cats. Bengal cats are hybrids between domestic cats and Asian leopard cats (Prionailurus bengalensis), and the charcoal coat colouration/pattern in Bengals presents as a possible incomplete melanism. The complete coding region of ASIP was directly sequenced in Asian leopard, domestic and Bengal cats. Twenty-seven variants were identified between domestic and leopard cats and were investigated in Bengals and Savannahs, a hybrid with servals (Leptailurus serval). The leopard cat ASIP haplotype was distinguished from domestic cat by four synonymous and four non-synonymous exonic SNPs, as well as 19 intronic variants, including a 42-bp deletion in intron 4. Fifty-six of 64 reported charcoal cats were compound heterozygotes at ASIP, with leopard cat agouti (A(P) (be) ) and domestic cat non-agouti (a) haplotypes. Twenty-four Bengals had an additional unique haplotype (A2) for exon 2 that was not identified in leopard cats, servals or jungle cats (Felis chaus). The compound heterozygote state suggests the leopard cat allele, in combination with the recessive non-agouti allele, influences Bengal markings, producing a darker, yet not completely melanistic coat. This is the first validation of a leopard cat allele segregating in the Bengal breed and likely affecting their overall pelage phenotype. Genetic testing services need to be aware of the possible segregation of wild felid alleles in all assays performed on hybrid cats. © 2014 The Authors. Animal Genetics published by John Wiley & Sons Ltd on behalf of Stichting International Foundation for Animal Genetics.

  19. Variation in Genes Encoding the Neuroactive Steroid Synthetic Enzymes 5α-Reductase Type 1 and 3α-Reductase Type 2 is Associated with Alcohol Dependence

    PubMed Central

    Milivojevic, Verica; Kranzler, Henry R.; Gelernter, Joel; Burian, Linda; Covault, Jonathan

    2010-01-01

    Background Studies of alcohol effects in rodents and in vitro implicate endogenous neuroactive steroids as key mediators of alcohol effects at GABAA receptors. We used a case-control sample to test the association with alcohol dependence (AD) of single nucleotide polymorphisms (SNPs) in the genes encoding two key enzymes required for the generation of endogenous neuroactive steroids: 5α–reductase, type I (5α-R) and 3α-hydroxysteroid dehydrogenase, type 2 (3α-HSD), both of which are expressed in human brain. Methods We focused on markers previously associated with a biological phenotype. For 5α-R, we examined the synonymous SRD5A1 exon 1 SNP rs248793, which has been associated with the ratio of dihydrotestosterone to testosterone. For 3α-HSD, we examined the non-synonymous AKR1C3 SNP rs12529 (H5Q), which has been associated with bladder cancer. The SNPs were genotyped in a sample of 1,083 non-Hispanic Caucasians including 552 controls and 531 subjects with AD. Results The minor allele for both SNPs was more common among controls than subjects with AD: SRD5A1 rs248793 C-allele (χ2(1)=7.6, p=0.006) and AKR1C3 rs12529 G-allele (χ2(1)=14.6, p=0.0001). There was also an interaction of these alleles such that the “protective” effect of the minor allele at each marker for AD was conditional on the genotype of the second marker. Conclusions We found evidence of an association with AD of polymorphisms in two genes encoding neuroactive steroid biosynthetic enzymes, providing indirect evidence that neuroactive steroids are important mediators of alcohol effects in humans. PMID:21323680

  20. Identification and characterization of single nucleotide polymorphisms in 6 growth-correlated genes in porcine by denaturing high performance liquid chromatography.

    PubMed

    Liu, Dewu; Zhang, Yushan; Du, Yinjun; Yang, Guanfu; Zhang, Xiquan

    2007-06-01

    The growth-correlated genes that are part of the neuroendocrine growth axis play crucial roles in the regulation of growth and development of pig. The identification of genetic polymorphisms in these genes will enable the scientist to evaluate the biological relevance of such polymorphisms and to gain a better understanding of quantitative traits like growth. In the present study, seven pairs of primers were designed to obtain unknown sequences of growth-correlated genes, and other 25 pairs of primers were designed to identify single nucleotide polymorphisms (SNP) using the denaturing high-performance liquid chromatography (DHPLC) technology in four pig breeds (Duroc, Landrace, Lantang and Wuzhishan), significantly differing in growth and development characteristics. A total of 101 polymorphisms were discovered in 10,707 base pairs (bp) from six genes of the ghrelin (GHRL), leptin (LEP), insulin-like growth factor II (IGF-II), insulin-like growth factor binding protein 2 (IGFBP-2), insulin-like growth factor binding protein 3 (IGFBP-3), and somatostatin (SS). The observed average distances between the SNP in the 5'UTR, coding regions, introns and 3'UTR were 134, 521, 81 and 92 bp, respectively. Four SNPs were found in the coding regions of IGF-II, IGFBP-2 and LEP, respectively. Two synonymous mutations were obtained in IGF-II and LEP genes respectively, and two non-synonymous were found in IGFBP-2 and LEP genes, respectively. Seven other mutations were also observed. Thirty-two PCR-RFLP markers were found among 101 polymorphisms of the six genes. The SNP discovered in this study would provide suitable markers for association studies of candidate genes with growth related traits in pig.

  1. Effect prediction of identified SNPs linked to fruit quality and chilling injury in peach [Prunus persica (L.) Batsch].

    PubMed

    Martínez-García, Pedro J; Fresnedo-Ramírez, Jonathan; Parfitt, Dan E; Gradziel, Thomas M; Crisosto, Carlos H

    2013-01-01

    Single nucleotide polymorphisms (SNPs) are a fundamental source of genomic variation. Large SNP panels have been developed for Prunus species. Fruit quality traits are essential peach breeding program objectives since they determine consumer acceptance, fruit consumption, industry trends and cultivar adoption. For many cultivars, these traits are negatively impacted by cold storage, used to extend fruit market life. The major symptoms of chilling injury are lack of flavor, off flavor, mealiness, flesh browning, and flesh bleeding. A set of 1,109 SNPs was mapped previously and 67 were linked with these complex traits. The prediction of the effects associated with these SNPs on downstream products from the 'peach v1.0' genome sequence was carried out. A total of 2,163 effects were detected, 282 effects (non-synonymous, synonymous or stop codon gained) were located in exonic regions (13.04 %) and 294 placed in intronic regions (13.59 %). An extended list of genes and proteins that could be related to these traits was developed. Two SNP markers that explain a high percentage of the observed phenotypic variance, UCD_SNP_1084 and UCD_SNP_46, are associated with zinc finger (C3HC4-type RING finger) family protein and AOX1A (alternative oxidase 1a) protein groups, respectively. In addition, phenotypic variation suggests that the observed polymorphism for SNP UCD_SNP_1084 [A/G] mutation could be a candidate quantitative trait nucleotide affecting quantitative trait loci for mealiness. The interaction and expression of affected proteins could explain the variation observed in each individual and facilitate understanding of gene regulatory networks for fruit quality traits in peach.

  2. Natural variation in genes potentially involved in plant architecture and adaptation in switchgrass (Panicum virgatum L.).

    PubMed

    Bahri, Bochra A; Daverdin, Guillaume; Xu, Xiangyang; Cheng, Jan-Fang; Barry, Kerrie W; Brummer, E Charles; Devos, Katrien M

    2018-06-14

    Advances in genomic technologies have expanded our ability to accurately and exhaustively detect natural genomic variants that can be applied in crop improvement and to increase our knowledge of plant evolution and adaptation. Switchgrass (Panicum virgatum L.), an allotetraploid (2n = 4× = 36) perennial C4 grass (Poaceae family) native to North America and a feedstock crop for cellulosic biofuel production, has a large potential for genetic improvement due to its high genotypic and phenotypic variation. In this study, we analyzed single nucleotide polymorphism (SNP) variation in 372 switchgrass genotypes belonging to 36 accessions for 12 genes putatively involved in biomass production to investigate signatures of selection that could have led to ecotype differentiation and to population adaptation to geographic zones. A total of 11,682 SNPs were mined from ~ 15 Gb of sequence data, out of which 251 SNPs were retained after filtering. Population structure analysis largely grouped upland accessions into one subpopulation and lowland accessions into two additional subpopulations. The most frequent SNPs were in homozygous state within accessions. Sixty percent of the exonic SNPs were non-synonymous and, of these, 45% led to non-conservative amino acid changes. The non-conservative SNPs were largely in linkage disequilibrium with one haplotype being predominantly present in upland accessions while the other haplotype was commonly present in lowland accessions. Tajima's test of neutrality indicated that PHYB, a gene involved in photoperiod response, was under positive selection in the switchgrass population. PHYB carried a SNP leading to a non-conservative amino acid change in the PAS domain, a region that acts as a sensor for light and oxygen in signal transduction. Several non-conservative SNPs in genes potentially involved in plant architecture and adaptation have been identified and led to population structure and genetic differentiation of ecotypes in switchgrass. We suggest here that PHYB is a key gene involved in switchgrass natural selection. Further analyses are needed to determine whether any of the non-conservative SNPs identified play a role in the differential adaptation of upland and lowland switchgrass.

  3. A survey of single nucleotide polymorphisms identified from whole-genome sequencing and their functional effect in the porcine genome.

    PubMed

    Keel, B N; Nonneman, D J; Rohrer, G A

    2017-08-01

    Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a more significant effect on phenotypic variation than do other types of genetic variants. Hence, a comprehensive list of these functional variants would be of considerable interest in swine genomic studies, particularly those targeting fertility and production traits. Whole-genome sequence was obtained from 72 of the founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC). These animals included all 24 of the founding boars (12 Duroc and 12 Landrace) and 48 Yorkshire-Landrace composite sows. Sequence reads were mapped to the Sscrofa10.2 genome build, resulting in a mean of 6.1 fold (×) coverage per genome. A total of 22 342 915 high confidence SNPs were identified from the sequenced genomes. These included 21 million previously reported SNPs and 79% of the 62 163 SNPs on the PorcineSNP60 BeadChip assay. Variation was detected in the coding sequence or untranslated regions (UTRs) of 87.8% of the genes in the porcine genome: loss-of-function variants were predicted in 504 genes, 10 202 genes contained nonsynonymous variants, 10 773 had variation in UTRs and 13 010 genes contained synonymous variants. Approximately 139 000 SNPs were classified as loss-of-function, nonsynonymous or regulatory, which suggests that over 99% of the variation detected in our pigs could potentially be ignored, allowing us to focus on a much smaller number of functional SNPs during future analyses. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.

  4. Functional identification of the promoter of SLC4A5, a gene associated with cardiovascular and metabolic phenotypes in the HERITAGE Family Study.

    PubMed

    Stütz, Adrian M; Teran-Garcia, Margarita; Rao, D C; Rice, Treva; Bouchard, Claude; Rankinen, Tuomo

    2009-11-01

    The sodium bicarbonate cotransporter gene SLC4A5, associated earlier with cardiovascular phenotypes, was tested for associations in the HERITAGE Family Study, and possible mechanisms were investigated. Twelve tag-single nucleotide polymorphisms (SNPs) covering the SLC4A5 gene were analyzed in 276 Black and 503 White healthy, sedentary subjects. Associations were tested using a variance components-based (QTDT) method with data adjusted for age, sex and body size. In Whites, rs6731545 and rs7571842 were significantly associated with resting and submaximal exercise pulse pressure (PP) (0.0004

  5. Functional identification of the promoter of SLC4A5, a gene associated with cardiovascular and metabolic phenotypes in the HERITAGE Family Study

    PubMed Central

    Stütz, Adrian M; Teran-Garcia, Margarita; Rao, D C; Rice, Treva; Bouchard, Claude; Rankinen, Tuomo

    2009-01-01

    The sodium bicarbonate cotransporter gene SLC4A5, associated earlier with cardiovascular phenotypes, was tested for associations in the HERITAGE Family Study, and possible mechanisms were investigated. Twelve tag-single nucleotide polymorphisms (SNPs) covering the SLC4A5 gene were analyzed in 276 Black and 503 White healthy, sedentary subjects. Associations were tested using a variance components-based (QTDT) method with data adjusted for age, sex and body size. In Whites, rs6731545 and rs7571842 were significantly associated with resting and submaximal exercise pulse pressure (PP) (0.0004

  6. Mutations that Cause Human Disease: A Computational/Experimental Approach

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Beernink, P; Barsky, D; Pesavento, B

    International genome sequencing projects have produced billions of nucleotides (letters) of DNA sequence data, including the complete genome sequences of 74 organisms. These genome sequences have created many new scientific opportunities, including the ability to identify sequence variations among individuals within a species. These genetic differences, which are known as single nucleotide polymorphisms (SNPs), are particularly important in understanding the genetic basis for disease susceptibility. Since the report of the complete human genome sequence, over two million human SNPs have been identified, including a large-scale comparison of an entire chromosome from twenty individuals. Of the protein coding SNPs (cSNPs), approximatelymore » half leads to a single amino acid change in the encoded protein (non-synonymous coding SNPs). Most of these changes are functionally silent, while the remainder negatively impact the protein and sometimes cause human disease. To date, over 550 SNPs have been found to cause single locus (monogenic) diseases and many others have been associated with polygenic diseases. SNPs have been linked to specific human diseases, including late-onset Parkinson disease, autism, rheumatoid arthritis and cancer. The ability to predict accurately the effects of these SNPs on protein function would represent a major advance toward understanding these diseases. To date several attempts have been made toward predicting the effects of such mutations. The most successful of these is a computational approach called ''Sorting Intolerant From Tolerant'' (SIFT). This method uses sequence conservation among many similar proteins to predict which residues in a protein are functionally important. However, this method suffers from several limitations. First, a query sequence must have a sufficient number of relatives to infer sequence conservation. Second, this method does not make use of or provide any information on protein structure, which can be used to understand how an amino acid change affects the protein. The experimental methods that provide the most detailed structural information on proteins are X-ray crystallography and NMR spectroscopy. However, these methods are labor intensive and currently cannot be carried out on a genomic scale. Nonetheless, Structural Genomics projects are being pursued by more than a dozen groups and consortia worldwide and as a result the number of experimentally determined structures is rising exponentially. Based on the expectation that protein structures will continue to be determined at an ever-increasing rate, reliable structure prediction schemes will become increasingly valuable, leading to information on protein function and disease for many different proteins. Given known genetic variability and experimentally determined protein structures, can we accurately predict the effects of single amino acid substitutions? An objective assessment of this question would involve comparing predicted and experimentally determined structures, which thus far has not been rigorously performed. The completed research leveraged existing expertise at LLNL in computational and structural biology, as well as significant computing resources, to address this question.« less

  7. Population-based study of the association of variants in mismatch repair genes with prostate cancer risk and outcomes

    PubMed Central

    Langeberg, Wendy J.; Kwon, Erika M.; Koopmeiners, Joseph S.; Ostrander, Elaine A.; Stanford, Janet L.

    2009-01-01

    Background Mismatch repair (MMR) gene activity may be associated with prostate cancer (PC) risk and outcomes. This study evaluated whether single nucleotide polymorphisms (SNPs) in key MMR genes are related to PC outcomes. Methods Data from two population-based case-control studies of PC among Caucasian and African-American men residing in King County, Washington were combined for this analysis. Cases (n=1,458) were diagnosed with PC in 1993–96 or 2002–05 and identified via the Seattle-Puget Sound SEER cancer registry. Controls (n=1,351) were age-matched to cases and identified via random digit dialing. Logistic regression was used to assess the relationship between haplotype-tagging SNPs and PC risk and disease aggressiveness. Cox proportional hazards regression was used to assess the relationship between SNPs and PC recurrence and PC-specific death. Results Nineteen SNPs were evaluated in the key MMR genes: five in MLH1, 10 in MSH2, and 4 in PMS2. Among Caucasian men, one SNP in MLH1 (rs9852810) was associated with: overall PC risk (OR=1.21, 95% CI=1.02, 1.44; p=0.03), more aggressive PC (OR=1.49, 95% CI=1.15–1.91; p<0.01), and PC recurrence (HR=1.83, 95% CI=1.18, 2.86; p<0.01), but not PC-specific mortality. A non-synonymous coding SNP in MLH1, rs1799977 (I219V), was also found to be associated with more aggressive disease. These results did not remain significant after adjusting for multiple comparisons. Conclusion This population-based case-control study provides evidence for a possible association with a gene variant in MLH1 in relation to risk of overall PC, more aggressive disease, and PC recurrence, which warrants replication. PMID:20056646

  8. Genetic polymorphisms in Na+-taurocholate co-transporting polypeptide (NTCP) and ileal apical sodium-dependent bile acid transporter (ASBT) and ethnic comparisons of functional variants of NTCP among Asian populations.

    PubMed

    Pan, Wei; Song, Im-Sook; Shin, Ho-Jung; Kim, Min-Hye; Choi, Yeong-Lim; Lim, Su-Jeong; Kim, Woo-Young; Lee, Sang-Seop; Shin, Jae-Gook

    2011-06-01

    Genetic variants of Na(+)-taurocholate co-transporting polypeptide (NTCP; SLC10A1) and ileal apical sodium-dependent bile acid transporter (ASBT; SLC10A2), which greatly contribute to bile acid homeostasis, were extensively explored in the Korean population and functional variants of NTCP were compared among Asian populations. From direct DNA sequencing, six SNPs were identified in the SLC10A1 gene and 14 SNPs in the SLC10A2 gene. Three of seven coding variants were non-synonymous SNPs: two variants from SLC10A1 (A64T, S267F) and one from SLC10A2 (A171S). No linkage was analysed in the SLC10A1 gene because of low frequencies of genetic variants, and the SLC10A2 gene was composed of two separated linkage disequilibrium blocks contrary to the white population. The stably transfected NTCP-A64T variant showed significantly decreased uptakes of taurocholate and rosuvastatin compared with wild-type NTCP. The decreased taurocholate uptake and increased rosuvastatin uptake were shown in the NTCP-S267F variant. The allele frequencies of these functional variants were 1.0% and 3.1%, respectively, in a Korean population. However, NTCP-A64T was not found in Chinese and Vietnamese subjects. The frequency distribution of NTCP-S267F in Koreans was significantly lower than those in Chinese and Vietnamese populations. Our data suggest that NTCP-A64T and -S267F variants cause substrate-dependent functional change in vitro, and show ethnic difference in their allelic frequencies among Asian populations although the clinical relevance of these variants is remained to be evaluated.

  9. In silico SNP analysis of the breast cancer antigen NY-BR-1.

    PubMed

    Kosaloglu, Zeynep; Bitzer, Julia; Halama, Niels; Huang, Zhiqin; Zapatka, Marc; Schneeweiss, Andreas; Jäger, Dirk; Zörnig, Inka

    2016-11-18

    Breast cancer is one of the most common malignancies with increasing incidences every year and a leading cause of death among women. Although early stage breast cancer can be effectively treated, there are limited numbers of treatment options available for patients with advanced and metastatic disease. The novel breast cancer associated antigen NY-BR-1 was identified by SEREX analysis and is expressed in the majority (>70%) of breast tumors as well as metastases, in normal breast tissue, in testis and occasionally in prostate tissue. The biological function and regulation of NY-BR-1 is up to date unknown. We performed an in silico analysis on the genetic variations of the NY-BR-1 gene using data available in public SNP databases and the tools SIFT, Polyphen and Provean to find possible functional SNPs. Additionally, we considered the allele frequency of the found damaging SNPs and also analyzed data from an in-house sequencing project of 55 breast cancer samples for recurring SNPs, recorded in dbSNP. Over 2800 SNPs are recorded in the dbSNP and NHLBI ESP databases for the NY-BR-1 gene. Of these, 65 (2.07%) are synonymous SNPs, 191 (6.09%) are non-synoymous SNPs, and 2430 (77.48%) are noncoding intronic SNPs. As a result, 69 non-synoymous SNPs were predicted to be damaging by at least two, and 16 SNPs were predicted as damaging by all three of the used tools. The SNPs rs200639888, rs367841401 and rs377750885 were categorized as highly damaging by all three tools. Eight damaging SNPs are located in the ankyrin repeat domain (ANK), a domain known for its frequent involvement in protein-protein interactions. No distinctive features could be observed in the allele frequency of the analyzed SNPs. Considering these results we expect to gain more insights into the variations of the NY-BR-1 gene and their possible impact on giving rise to splice variants and therefore influence the function of NY-BR-1 in healthy tissue as well as in breast cancer.

  10. Dissecting the genetics of the human transcriptome identifies novel trait-related trans-eQTLs and corroborates the regulatory relevance of non-protein coding loci†

    PubMed Central

    Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus

    2015-01-01

    Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. PMID:26019233

  11. Unraveling patterns of site-to-site synonymous rates variation and associated gene properties of protein domains and families.

    PubMed

    Dimitrieva, Slavica; Anisimova, Maria

    2014-01-01

    In protein-coding genes, synonymous mutations are often thought not to affect fitness and therefore are not subject to natural selection. Yet increasingly, cases of non-neutral evolution at certain synonymous sites were reported over the last decade. To evaluate the extent and the nature of site-specific selection on synonymous codons, we computed the site-to-site synonymous rate variation (SRV) and identified gene properties that make SRV more likely in a large database of protein-coding gene families and protein domains. To our knowledge, this is the first study that explores the determinants and patterns of the SRV in real data. We show that the SRV is widespread in the evolution of protein-coding sequences, putting in doubt the validity of the synonymous rate as a standard neutral proxy. While protein domains rarely undergo adaptive evolution, the SRV appears to play important role in optimizing the domain function at the level of DNA. In contrast, protein families are more likely to evolve by positive selection, but are less likely to exhibit SRV. Stronger SRV was detected in genes with stronger codon bias and tRNA reusage, those coding for proteins with larger number of interactions or forming larger number of structures, located in intracellular components and those involved in typically conserved complex processes and functions. Genes with extreme SRV show higher expression levels in nearly all tissues. This indicates that codon bias in a gene, which often correlates with gene expression, may often be a site-specific phenomenon regulating the speed of translation along the sequence, consistent with the co-translational folding hypothesis. Strikingly, genes with SRV were strongly overrepresented for metabolic pathways and those associated with several genetic diseases, particularly cancers and diabetes.

  12. Effect of polymorphisms in the CSN3 (κ-casein) gene on milk production traits in Chinese Holstein Cattle.

    PubMed

    Alim, M A; Dong, T; Xie, Y; Wu, X P; Zhang, Yi; Zhang, Shengli; Sun, D X

    2014-11-01

    This study was designed to evaluate significant associations between single nucleotide polymorphisms (SNPs) and milk composition and milk production traits in Chinese Holstein cows. Six SNPs were identified in the κ-casein gene using pooled DNA sequencing. The identified SNPs were genotyped by Matrix-assisted laser desorption/ionization time of flight mass spectrometry (MALDI-TOF MS) methods from 507 individuals. Out of six, we identified three non-synonymous SNPs (g.10888T>C, g.10924C>A and g.10944A>G) that changed in the protein product. SIFT (Sorting_Intolerant_From_Tolerant) prediction score (0.01) demonstrated that protein changed Isoleucine > Threonine (g.10888T>C) will affect the phenotypes. Significant associations between identified SNPs and three yield traits (milk, protein and fat) and two composition traits (fat and protein percentages) were found whereas it did not reach significance for fat percentage in haplotypes association. Importantly, the significant SNPs in our results showed a large proportion of the phenotypic variation of milk protein yield and concentration. Our results suggest that CSN3 is an important candidate gene that influences milk production traits, and identified polymorphisms and haplotypes could be used as a genetic marker in programs of marker-assisted selection for the genetic improvement of milk production traits in dairy cattle.

  13. Genome-wide DNA polymorphisms in Kavuni, a traditional rice cultivar with nutritional and therapeutic properties.

    PubMed

    Rathinasabapathi, Pasupathi; Purushothaman, Natarajan; Parani, Madasamy

    2016-05-01

    Although rice genome was sequenced in the year 2002, efforts in resequencing the large number of available accessions, landraces, traditional cultivars, and improved varieties of this important food crop are limited. We have initiated resequencing of the traditional cultivars from India. Kavuni is an important traditional rice cultivar from South India that attracts premium price for its nutritional and therapeutic properties. Whole-genome sequencing of Kavuni using Illumina platform and SNPs analysis using Nipponbare reference genome identified 1 150 711 SNPs of which 377 381 SNPs were located in the genic regions. Non-synonymous SNPs (62 708) were distributed in 19 251 genes, and their number varied between 1 and 115 per gene. Large-effect DNA polymorphisms (7769) were present in 3475 genes. Pathway mapping of these polymorphisms revealed the involvement of genes related to carbohydrate metabolism, translation, protein-folding, and cell death. Analysis of the starch biosynthesis related genes revealed that the granule-bound starch synthase I gene had T/G SNPs at the first intron/exon junction and a two-nucleotide combination, which were reported to favour high amylose content and low glycemic index. The present study provided a valuable genomics resource to study the rice varieties with nutritional and medicinal properties.

  14. IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants

    PubMed Central

    Knecht, Carolin; Mort, Matthew; Junge, Olaf; Cooper, David N.; Krawczak, Michael

    2017-01-01

    Abstract The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/. PMID:28180317

  15. Targeted Deep Sequencing Identifies Rare ‘loss-of-function’ Variants in IFNGR1 for Risk of Atopic Dermatitis Complicated by Eczema Herpeticum

    PubMed Central

    Gao, Li; Rafaels, Nicholas M; Huang, Lili; Potee, Joseph; Ruczinski, Ingo; Beaty, Terri H.; Paller, Amy S.; Schneider, Lynda C.; Gallo, Rich; Hanifin, Jon M.; Beck, Lisa A.; Geha, Raif S.; Mathias, Rasika A.; Leung, Donald Y. M.

    2015-01-01

    Background A subset of atopic dermatitis (AD) is associated with increased susceptibility to eczema herpeticum (ADEH+). We previously reported that common single nucleotide polymorphisms (SNPs) in interferon-gamma (IFNG) and receptor 1 (IFNGR1) were associated with ADEH+ phenotype. Objective To interrogate the role of rare variants in IFN-pathway genes for risk of ADEH+. Methods We performed targeted sequencing of interferon-pathway genes (IFNG, IFNGR1, IFNAR1 and IL12RB1) in 228 European American (EA) AD patients selected according to their EH status and severity measured by Eczema Area and Severity Index (EASI). Replication genotyping was performed in independent samples of 219 EA and 333 African Americans (AA). Functional investigation of ‘loss-of-function’ variants was conducted using site-directed mutagenesis. Results We identified 494 single nucleotide variants (SNVs) encompassing 105kb of sequence, including 145 common, 349 (70.6%) rare (minor allele frequency (MAF) <5%) and 86 (17.4%) novel variants, of which 2.8% were coding-synonymous, 93.3% were non-coding (64.6% intronic), and 3.8% were missense. We identified six rare IFNGR1 missense including three damaging variants (Val14Met (V14M), Val61Ile and Tyr397Cys (Y397C)) conferring a higher risk for ADEH+ (P=0.031). Variants V14M and Y397C were confirmed to be deleterious leading to partial IFNGR1 deficiency. Seven common IFNGR1 SNPs, along with common protective haplotypes (2 to 7-SNPs) conferred a reduced risk of ADEH+ (P=0.015-0.002, P=0.0015-0.0004, respectively), and both SNP and haplotype associations were replicated in an independent AA sample (P=0.004-0.0001 and P=0.001-0.0001, respectively). Conclusion Our results provide evidence that both genetic variants in the gene encoding IFNGR1 are implicated in susceptibility to the ADEH+ phenotype. CAPSULE SUMMARY We provided the first evidence that rare functional IFNGR1 mutations contribute to a defective systemic IFN-γ immune response that accounts for the propensity of AD patients to disseminated viral skin infections. PMID:26343451

  16. Association between long non-coding RNA polymorphisms and cancer risk: a meta-analysis.

    PubMed

    Huang, Xin; Zhang, Weiyue; Shao, Zengwu

    2018-05-25

    Several studies have suggested that long non-coding RNA (lncRNA) gene polymorphisms are associated with cancer risk. In the present study, we conducted a meta-analysis related to studies on the association between lncRNA single-nucleotide polymorphisms (SNPs) and the overall risk of cancer. A total 12 SNPs in five common lncRNA genes were finally included in the meta-analysis. In the lncRNA antisense noncoding RNA in the INK4 locus (ANRIL), the rs1333048 A/C, rs4977574 A/G, and rs10757278 A/G polymorphisms, but not rs1333045 C/T, were correlated with overall cancer risk. Our study also demonstrated that other SNPs were correlated with overall cancer risk, namely, metastasis-associated lung adenocarcinoma transcript 1 (MALAT1, rs619586 A/G), HOXA distal transcript antisense RNA (HOTTIP, rs1859168 A/C) and highly up-regulated in liver cancer (HULC, rs7763881 A/C). Moreover, four prostate cancer‑associated non‑coding RNA 1 (PRNCR1, rs16901946 G/A, rs13252298 G/A, rs1016343 T/C, and rs1456315 G/A) SNPs were in association with cancer risk. No association was found between the PRNCR1 (rs7007694 C/T) SNP and the risk of cancer. In conclusion, our results suggest that several studied lncRNA SNPs are associated with overall cancer risk. Therefore, they might be potential predictive biomarkers for the risk of cancer. More studies based on larger sample sizes and more lncRNA SNPs are warranted to confirm these findings. ©2018 The Author(s).

  17. Accumulation of slightly deleterious mutations in the mitochondrial genome: a hallmark of animal domestication.

    PubMed

    Hughes, Austin L

    2013-02-15

    The hypothesis that domestication leads to a relaxation of purifying selection on mitochondrial (mt) genomes was tested by comparative analysis of mt genes from dog, pig, chicken, and silkworm. The three vertebrate species showed mt genome phylogenies in which domestic and wild isolates were intermingled, whereas the domestic silkworm (Bombyx mori) formed a distinct cluster nested within its closest wild relative (Bombyx mandarina). In spite of these differences in phylogenetic pattern, significantly greater proportions of nonsynonymous SNPs than of synonymous SNPs were unique to the domestic populations of all four species. Likewise, in all four species, significantly greater proportions of RNA-encoding SNPs than of synonymous SNPs were unique to the domestic populations. Thus, domestic populations were characterized by an excess of unique polymorphisms in two categories generally subject to purifying selection: nonsynonymous sites and RNA-encoding sites. Many of these unique polymorphisms thus seem likely to be slightly deleterious; the latter hypothesis was supported by the generally lower gene diversities of polymorphisms unique to domestic populations in comparison to those of polymorphisms shared by domestic and wild populations. Copyright © 2012 Elsevier B.V. All rights reserved.

  18. No association of the Arg51Gln and Leu72Met polymorphisms of the ghrelin gene and polycystic ovary syndrome.

    PubMed

    Wang, Kehua; Wang, Leiguang; Zhao, Yueran; Shi, Yuhua; Wang, Laicheng; Chen, Zi-Jiang

    2009-02-01

    Ghrelin plays a role in regulating glucose metabolism and energy balance. Polymorphisms in preproghrelin and ghrelin gene could be responsible for obesity, insulin resistance and low ghrelin levels observed in some individuals. The objective of this study was to evaluate the influence of two single-nucleotide polymorphisms (SNPs) of ghrelin gene on the clinical, the hormonal and metabolic features in women with polycystic ovary syndrome (PCOS) in a Chinese population. A large sample of Chinese PCOS (n = 271) women and a control group (n = 296) of healthy women matched for age were studied. Hormone and metabolic profiles were measured and blood samples were collected for genotype and allelic frequency analysis. Non-synonymous SNPs in the coding region (exon 2) of the preproghrelin gene (Arg51Gln (346 G>A) and Leu72Met (408 C>A) were studied using PCR and restriction fragment length polymorphism analysis. The polymorphism Arg51Gln was not found in the cohorts studied. The distribution of Leu72Met was similar in PCOS group and in healthy controls. There was no significant difference in age, BMI, waist-hip-ratio and levels of FSH, LH, estradiol, testosterone and prolactin between PCOS patients with different genotypes, and the level of plasma glucose and insulin was also similar. No association was found between Leu72Met and Arg51Gln polymorphisms in the ghrelin gene and PCOS in Chinese population.

  19. Integrating transcriptome and genome re-sequencing data to identify key genes and mutations affecting chicken eggshell qualities.

    PubMed

    Zhang, Quan; Zhu, Feng; Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua

    2015-01-01

    Eggshell damages lead to economic losses in the egg production industry and are a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways as revealed by gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate the transcriptome and genome re-sequencing to target the genetic variations which decreased the eggshell qualities. These findings further advance our understanding of eggshell calcification in the chicken uterus.

  20. Genome-Wide SNP Genotyping to Infer the Effects on Gene Functions in Tomato

    PubMed Central

    Hirakawa, Hideki; Shirasawa, Kenta; Ohyama, Akio; Fukuoka, Hiroyuki; Aoki, Koh; Rothan, Christophe; Sato, Shusei; Isobe, Sachiko; Tabata, Satoshi

    2013-01-01

    The genotype data of 7054 single nucleotide polymorphism (SNP) loci in 40 tomato lines, including inbred lines, F1 hybrids, and wild relatives, were collected using Illumina's Infinium and GoldenGate assay platforms, the latter of which was utilized in our previous study. The dendrogram based on the genotype data corresponded well to the breeding types of tomato and wild relatives. The SNPs were classified into six categories according to their positions in the genes predicted on the tomato genome sequence. The genes with SNPs were annotated by homology searches against the nucleotide and protein databases, as well as by domain searches, and they were classified into the functional categories defined by the NCBI's eukaryotic orthologous groups (KOG). To infer the SNPs' effects on the gene functions, the three-dimensional structures of the 843 proteins that were encoded by the genes with SNPs causing missense mutations were constructed by homology modelling, and 200 of these proteins were considered to carry non-synonymous amino acid substitutions in the predicted functional sites. The SNP information obtained in this study is available at the Kazusa Tomato Genomics Database (http://plant1.kazusa.or.jp/tomato/). PMID:23482505

  1. Differential Single Nucleotide Polymorphism-Based Analysis of an Outbreak Caused by Salmonella enterica Serovar Manhattan Reveals Epidemiological Details Missed by Standard Pulsed-Field Gel Electrophoresis

    PubMed Central

    Scaltriti, Erika; Sassera, Davide; Comandatore, Francesco; Morganti, Marina; Mandalari, Carmen; Gaiarsa, Stefano; Bandi, Claudio; Zehender, Gianguglielmo; Bolzoni, Luca; Casadei, Gabriele

    2015-01-01

    We retrospectively analyzed a rare Salmonella enterica serovar Manhattan outbreak that occurred in Italy in 2009 to evaluate the potential of new genomic tools based on differential single nucleotide polymorphism (SNP) analysis in comparison with the gold standard genotyping method, pulsed-field gel electrophoresis. A total of 39 isolates were analyzed from patients (n = 15) and food, feed, animal, and environmental sources (n = 24), resulting in five different pulsed-field gel electrophoresis (PFGE) profiles. Isolates epidemiologically related to the outbreak clustered within the same pulsotype, SXB_BS.0003, without any further differentiation. Thirty-three isolates were considered for genomic analysis based on different sets of SNPs, core, synonymous, nonsynonymous, as well as SNPs in different codon positions, by Bayesian and maximum likelihood algorithms. Trees generated from core and nonsynonymous SNPs, as well as SNPs at the second and first plus second codon positions detailed four distinct groups of isolates within the outbreak pulsotype, discriminating outbreak-related isolates of human and food origins. Conversely, the trees derived from synonymous and third-codon-position SNPs clustered food and human isolates together, indicating that all outbreak-related isolates constituted a single clone, which was in line with the epidemiological evidence. Further experiments are in place to extend this approach within our regional enteropathogen surveillance system. PMID:25653407

  2. Differential single nucleotide polymorphism-based analysis of an outbreak caused by Salmonella enterica serovar Manhattan reveals epidemiological details missed by standard pulsed-field gel electrophoresis.

    PubMed

    Scaltriti, Erika; Sassera, Davide; Comandatore, Francesco; Morganti, Marina; Mandalari, Carmen; Gaiarsa, Stefano; Bandi, Claudio; Zehender, Gianguglielmo; Bolzoni, Luca; Casadei, Gabriele; Pongolini, Stefano

    2015-04-01

    We retrospectively analyzed a rare Salmonella enterica serovar Manhattan outbreak that occurred in Italy in 2009 to evaluate the potential of new genomic tools based on differential single nucleotide polymorphism (SNP) analysis in comparison with the gold standard genotyping method, pulsed-field gel electrophoresis. A total of 39 isolates were analyzed from patients (n=15) and food, feed, animal, and environmental sources (n=24), resulting in five different pulsed-field gel electrophoresis (PFGE) profiles. Isolates epidemiologically related to the outbreak clustered within the same pulsotype, SXB_BS.0003, without any further differentiation. Thirty-three isolates were considered for genomic analysis based on different sets of SNPs, core, synonymous, nonsynonymous, as well as SNPs in different codon positions, by Bayesian and maximum likelihood algorithms. Trees generated from core and nonsynonymous SNPs, as well as SNPs at the second and first plus second codon positions detailed four distinct groups of isolates within the outbreak pulsotype, discriminating outbreak-related isolates of human and food origins. Conversely, the trees derived from synonymous and third-codon-position SNPs clustered food and human isolates together, indicating that all outbreak-related isolates constituted a single clone, which was in line with the epidemiological evidence. Further experiments are in place to extend this approach within our regional enteropathogen surveillance system. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  3. Genetic variation in eleven phase I drug metabolism genes in an ethnically diverse population.

    PubMed

    Solus, Joseph F; Arietta, Brenda J; Harris, James R; Sexton, David P; Steward, John Q; McMunn, Chara; Ihrie, Patrick; Mehall, Janelle M; Edwards, Todd L; Dawson, Elliott P

    2004-10-01

    The extent of genetic variation found in drug metabolism genes and its contribution to interindividual variation in response to medication remains incompletely understood. To better determine the identity and frequency of variation in 11 phase I drug metabolism genes, the exons and flanking intronic regions of the cytochrome P450 (CYP) isoenzyme genes CYP1A1, CYP1A2, CYP2A6, CYP2B6, CYP2C8, CYP2C9, CYP2C19, CYP2D6, CYP2E1, CYP3A4 and CYP3A5 were amplified from genomic DNA and sequenced. A total of 60 kb of bi-directional sequence was generated from each of 93 human DNAs, which included Caucasian, African-American and Asian samples. There were 388 different polymorphisms identified. These included 269 non-coding, 45 synonymous and 74 non-synonymous polymorphisms. Of these, 54% were novel and included 176 non-coding, 14 synonymous and 21 non-synonymous polymorphisms. Of the novel variants observed, 85 were represented by single occurrences of the minor allele in the sample set. Much of the variation observed was from low-frequency alleles. Comparatively, these genes are variation-rich. Calculations measuring genetic diversity revealed that while the values for the individual genes are widely variable, the overall nucleotide diversity of 7.7 x 10(-4) and polymorphism parameter of 11.5 x 10(-4) are higher than those previously reported for other gene sets. Several independent measurements indicate that these genes are under selective pressure, particularly for polymorphisms corresponding to non-synonymous amino acid changes. There is relatively little difference in measurements of diversity among the ethnic groups, but there are large differences among the genes and gene subfamilies themselves. Of the three CYP subfamilies involved in phase I drug metabolism (1, 2, and 3), subfamily 2 displays the highest levels of genetic diversity.

  4. Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.

    PubMed

    Sharma, Rekha; Ahlawat, Sonika; Maitra, A; Roy, Manoranjan; Mandakmale, S; Tantia, M S

    2013-12-10

    Bone morphogenetic proteins (BMPs) are members of the TGF-β (transforming growth factor-beta) superfamily, of which BMP4 is the most important due to its crucial role in follicular growth and differentiation, cumulus expansion and ovulation. Reproduction is a crucial trait in goat breeding and based on the important role of BMP4 gene in reproduction it was considered as a possible candidate gene for the prolificacy of goats. The objective of the present study was to detect polymorphism in intronic, exonic and 3' un-translated regions of BMP4 gene in Indian goats. Nine different goat breeds (Barbari, Beetal, Black Bengal, Malabari, Jakhrana (Twinning>40%), Osmanabadi, Sangamneri (Twinning 20-30%), Sirohi and Ganjam (Twinning<10%)) differing in prolificacy and geographic distribution were employed for polymorphism scanning. Cattle sequence (AC_000167.1) was used to design primers for the amplification of a targeted region followed by direct DNA sequencing to identify the genetic variations. Single nucleotide polymorphisms (SNPs) were not detected in exon 3, the intronic region and the 3' flanking region. A SNP (G1534A) was identified in exon 2. It was a non-synonymous mutation resulting in an arginine to lysine change in a corresponding protein sequence. G to A transition at the 1534 locus revealed two genotypes GG and GA in the nine investigated goat breeds. The GG genotype was predominant with a genotype frequency of 0.98. The GA genotype was present in the Black Bengal as well as Jakhrana breed with a genotype frequency of 0.02. A microsatellite was identified in the 3' flanking region, only 20 nucleotides downstream from the termination site of the coding region, as a short sequence with more than nineteen continuous and repeated CA dinucleotides. Since the gene is highly evolutionarily conserved, identification of a non-synonymous SNP (G1534A) in the coding region gains further importance. To our knowledge, this is the first report of a mutation in the coding region of the caprine BMP4 gene. But whether the reproduction trait of goat is associated with the BMP4 polymorphism, needs to be further defined by association studies in more populations so as to delineate an effect on it. © 2013 Elsevier B.V. All rights reserved.

  5. SNiPlay: a web-based tool for detection, management and analysis of SNPs. Application to grapevine diversity projects.

    PubMed

    Dereeper, Alexis; Nicolas, Stéphane; Le Cunff, Loïc; Bacilieri, Roberto; Doligez, Agnès; Peros, Jean-Pierre; Ruiz, Manuel; This, Patrice

    2011-05-05

    High-throughput re-sequencing, new genotyping technologies and the availability of reference genomes allow the extensive characterization of Single Nucleotide Polymorphisms (SNPs) and insertion/deletion events (indels) in many plant species. The rapidly increasing amount of re-sequencing and genotyping data generated by large-scale genetic diversity projects requires the development of integrated bioinformatics tools able to efficiently manage, analyze, and combine these genetic data with genome structure and external data. In this context, we developed SNiPlay, a flexible, user-friendly and integrative web-based tool dedicated to polymorphism discovery and analysis. It integrates:1) a pipeline, freely accessible through the internet, combining existing softwares with new tools to detect SNPs and to compute different types of statistical indices and graphical layouts for SNP data. From standard sequence alignments, genotyping data or Sanger sequencing traces given as input, SNiPlay detects SNPs and indels events and outputs submission files for the design of Illumina's SNP chips. Subsequently, it sends sequences and genotyping data into a series of modules in charge of various processes: physical mapping to a reference genome, annotation (genomic position, intron/exon location, synonymous/non-synonymous substitutions), SNP frequency determination in user-defined groups, haplotype reconstruction and network, linkage disequilibrium evaluation, and diversity analysis (Pi, Watterson's Theta, Tajima's D).Furthermore, the pipeline allows the use of external data (such as phenotype, geographic origin, taxa, stratification) to define groups and compare statistical indices.2) a database storing polymorphisms, genotyping data and grapevine sequences released by public and private projects. It allows the user to retrieve SNPs using various filters (such as genomic position, missing data, polymorphism type, allele frequency), to compare SNP patterns between populations, and to export genotyping data or sequences in various formats. Our experiments on grapevine genetic projects showed that SNiPlay allows geneticists to rapidly obtain advanced results in several key research areas of plant genetic diversity. Both the management and treatment of large amounts of SNP data are rendered considerably easier for end-users through automation and integration. Current developments are taking into account new advances in high-throughput technologies.SNiPlay is available at: http://sniplay.cirad.fr/.

  6. In silico prediction of a disease-associated STIL mutant and its affect on the recruitment of centromere protein J (CENPJ).

    PubMed

    Kumar, Ambuj; Rajendran, Vidya; Sethumadhavan, Rao; Purohit, Rituraj

    2012-01-01

    Human STIL (SCL/TAL1 interrupting locus) protein maintains centriole stability and spindle pole localisation. It helps in recruitment of CENPJ (Centromere protein J)/CPAP (centrosomal P4.1-associated protein) and other centrosomal proteins. Mutations in STIL protein are reported in several disorders, especially in deregulation of cell cycle cascades. In this work, we examined the non-synonymous single nucleotide polymorphisms (nsSNPs) reported in STIL protein for their disease association. Different SNP prediction tools were used to predict disease-associated nsSNPs. Our evaluation technique predicted rs147744459 (R242C) as a highly deleterious disease-associated nsSNP and its interaction behaviour with CENPJ protein. Molecular modelling, docking and molecular dynamics simulation were conducted to examine the structural consequences of the predicted disease-associated mutation. By molecular dynamic simulation we observed structural consequences of R242C mutation which affects interaction of STIL and CENPJ functional domains. The result obtained in this study will provide a biophysical insight into future investigations of pathological nsSNPs using a computational platform.

  7. SNPdbe: constructing an nsSNP functional impacts database.

    PubMed

    Schaefer, Christian; Meier, Alice; Rost, Burkhard; Bromberg, Yana

    2012-02-15

    Many existing databases annotate experimentally characterized single nucleotide polymorphisms (SNPs). Each non-synonymous SNP (nsSNP) changes one amino acid in the gene product (single amino acid substitution;SAAS). This change can either affect protein function or be neutral in that respect. Most polymorphisms lack experimental annotation of their functional impact. Here, we introduce SNPdbe-SNP database of effects, with predictions of computationally annotated functional impacts of SNPs. Database entries represent nsSNPs in dbSNP and 1000 Genomes collection, as well as variants from UniProt and PMD. SAASs come from >2600 organisms; 'human' being the most prevalent. The impact of each SAAS on protein function is predicted using the SNAP and SIFT algorithms and augmented with experimentally derived function/structure information and disease associations from PMD, OMIM and UniProt. SNPdbe is consistently updated and easily augmented with new sources of information. The database is available as an MySQL dump and via a web front end that allows searches with any combination of organism names, sequences and mutation IDs. http://www.rostlab.org/services/snpdbe.

  8. Genetic loci associated with heart rate variability and their effects on cardiac disease risk

    PubMed Central

    Nolte, Ilja M.; Munoz, M. Loretto; Tragante, Vinicius; Amare, Azmeraw T.; Jansen, Rick; Vaez, Ahmad; von der Heyde, Benedikt; Avery, Christy L.; Bis, Joshua C.; Dierckx, Bram; van Dongen, Jenny; Gogarten, Stephanie M.; Goyette, Philippe; Hernesniemi, Jussi; Huikari, Ville; Hwang, Shih-Jen; Jaju, Deepali; Kerr, Kathleen F.; Kluttig, Alexander; Krijthe, Bouwe P.; Kumar, Jitender; van der Laan, Sander W.; Lyytikäinen, Leo-Pekka; Maihofer, Adam X.; Minassian, Arpi; van der Most, Peter J.; Müller-Nurasyid, Martina; Nivard, Michel; Salvi, Erika; Stewart, James D.; Thayer, Julian F.; Verweij, Niek; Wong, Andrew; Zabaneh, Delilah; Zafarmand, Mohammad H.; Abdellaoui, Abdel; Albarwani, Sulayma; Albert, Christine; Alonso, Alvaro; Ashar, Foram; Auvinen, Juha; Axelsson, Tomas; Baker, Dewleen G.; de Bakker, Paul I. W.; Barcella, Matteo; Bayoumi, Riad; Bieringa, Rob J.; Boomsma, Dorret; Boucher, Gabrielle; Britton, Annie R.; Christophersen, Ingrid; Dietrich, Andrea; Ehret, George B.; Ellinor, Patrick T.; Eskola, Markku; Felix, Janine F.; Floras, John S.; Franco, Oscar H.; Friberg, Peter; Gademan, Maaike G. J.; Geyer, Mark A.; Giedraitis, Vilmantas; Hartman, Catharina A.; Hemerich, Daiane; Hofman, Albert; Hottenga, Jouke-Jan; Huikuri, Heikki; Hutri-Kähönen, Nina; Jouven, Xavier; Junttila, Juhani; Juonala, Markus; Kiviniemi, Antti M.; Kors, Jan A.; Kumari, Meena; Kuznetsova, Tatiana; Laurie, Cathy C.; Lefrandt, Joop D.; Li, Yong; Li, Yun; Liao, Duanping; Limacher, Marian C.; Lin, Henry J.; Lindgren, Cecilia M.; Lubitz, Steven A.; Mahajan, Anubha; McKnight, Barbara; zu Schwabedissen, Henriette Meyer; Milaneschi, Yuri; Mononen, Nina; Morris, Andrew P.; Nalls, Mike A.; Navis, Gerjan; Neijts, Melanie; Nikus, Kjell; North, Kari E.; O'Connor, Daniel T.; Ormel, Johan; Perz, Siegfried; Peters, Annette; Psaty, Bruce M.; Raitakari, Olli T.; Risbrough, Victoria B.; Sinner, Moritz F.; Siscovick, David; Smit, Johannes H.; Smith, Nicholas L.; Soliman, Elsayed Z.; Sotoodehnia, Nona; Staessen, Jan A.; Stein, Phyllis K.; Stilp, Adrienne M.; Stolarz-Skrzypek, Katarzyna; Strauch, Konstantin; Sundström, Johan; Swenne, Cees A.; Syvänen, Ann-Christine; Tardif, Jean-Claude; Taylor, Kent D.; Teumer, Alexander; Thornton, Timothy A.; Tinker, Lesley E.; Uitterlinden, André G.; van Setten, Jessica; Voss, Andreas; Waldenberger, Melanie; Wilhelmsen, Kirk C.; Willemsen, Gonneke; Wong, Quenna; Zhang, Zhu-Ming; Zonderman, Alan B.; Cusi, Daniele; Evans, Michele K.; Greiser, Halina K.; van der Harst, Pim; Hassan, Mohammad; Ingelsson, Erik; Järvelin, Marjo-Riitta; Kääb, Stefan; Kähönen, Mika; Kivimaki, Mika; Kooperberg, Charles; Kuh, Diana; Lehtimäki, Terho; Lind, Lars; Nievergelt, Caroline M.; O'Donnell, Chris J.; Oldehinkel, Albertine J.; Penninx, Brenda; Reiner, Alexander P.; Riese, Harriëtte; van Roon, Arie M.; Rioux, John D.; Rotter, Jerome I.; Sofer, Tamar; Stricker, Bruno H.; Tiemeier, Henning; Vrijkotte, Tanja G. M.; Asselbergs, Folkert W.; Brundel, Bianca J. J. M.; Heckbert, Susan R.; Whitsel, Eric A.; den Hoed, Marcel; Snieder, Harold; de Geus, Eco J. C.

    2017-01-01

    Reduced cardiac vagal control reflected in low heart rate variability (HRV) is associated with greater risks for cardiac morbidity and mortality. In two-stage meta-analyses of genome-wide association studies for three HRV traits in up to 53,174 individuals of European ancestry, we detect 17 genome-wide significant SNPs in eight loci. HRV SNPs tag non-synonymous SNPs (in NDUFA11 and KIAA1755), expression quantitative trait loci (eQTLs) (influencing GNG11, RGS6 and NEO1), or are located in genes preferentially expressed in the sinoatrial node (GNG11, RGS6 and HCN4). Genetic risk scores account for 0.9 to 2.6% of the HRV variance. Significant genetic correlation is found for HRV with heart rate (−0.74

  9. Exploration of structural stability in deleterious nsSNPs of the XPA gene: A molecular dynamics approach.

    PubMed

    Nagasundaram, N; Priya Doss, C George

    2011-01-01

    Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this approach, we have used the existing in silico methods to explore the mutation-structure-function relationship in the XPAgene. We used the Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), I-Mutant 2.0, and the Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function and evaluated the impact of mutation on protein stability by Molecular Dynamics simulations. By comparing the scores of all the four in silico methods, nsSNP with an ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our Molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPAgene. Based on the in silico methods score, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that deleterious nsSNP at position C108F would play a significant role in causing disease by the XPA gene. Our approach would present the application of in silicotools in understanding the functional variation from the perspective of structure, evolution, and phenotype.

  10. Identification of Novel Single Nucleotide Polymorphisms Associated with Acute Respiratory Distress Syndrome by Exome-Seq

    PubMed Central

    Shortt, Katherine; Chaudhary, Suman; Grigoryev, Dmitry; Heruth, Daniel P.; Venkitachalam, Lakshmi; Zhang, Li Q.; Ye, Shui Q.

    2014-01-01

    Acute respiratory distress syndrome (ARDS) is a lung condition characterized by impaired gas exchange with systemic release of inflammatory mediators, causing pulmonary inflammation, vascular leak and hypoxemia. Existing biomarkers have limited effectiveness as diagnostic and therapeutic targets. To identify disease-associating variants in ARDS patients, whole-exome sequencing was performed on 96 ARDS patients, detecting 1,382,399 SNPs. By comparing these exome data to those of the 1000 Genomes Project, we identified a number of single nucleotide polymorphisms (SNP) which are potentially associated with ARDS. 50,190SNPs were found in all case subgroups and controls, of which89 SNPs were associated with susceptibility. We validated three SNPs (rs78142040, rs9605146 and rs3848719) in additional ARDS patients to substantiate their associations with susceptibility, severity and outcome of ARDS. rs78142040 (C>T) occurs within a histone mark (intron 6) of the Arylsulfatase D gene. rs9605146 (G>A) causes a deleterious coding change (proline to leucine) in the XK, Kell blood group complex subunit-related family, member 3 gene. rs3848719 (G>A) is a synonymous SNP in the Zinc-Finger/Leucine-Zipper Co-Transducer NIF1 gene. rs78142040, rs9605146, and rs3848719 are associated significantly with susceptibility to ARDS. rs3848719 is associated with APACHE II score quartile. rs78142040 is associated with 60-day mortality in the overall ARDS patient population. Exome-seq is a powerful tool to identify potential new biomarkers for ARDS. We selectively validated three SNPs which have not been previously associated with ARDS and represent potential new genetic biomarkers for ARDS. Additional validation in larger patient populations and further exploration of underlying molecular mechanisms are warranted. PMID:25372662

  11. Polymorphisms of the artemisinin resistant marker (K13) in Plasmodium falciparum parasite populations of Grande Comore Island 10 years after artemisinin combination therapy.

    PubMed

    Huang, Bo; Deng, Changsheng; Yang, Tao; Xue, Linlu; Wang, Qi; Huang, Shiguang; Su, Xin-zhuan; Liu, Yajun; Zheng, Shaoqin; Guan, Yezhi; Xu, Qin; Zhou, Jiuyao; Yuan, Jie; Bacar, Afane; Abdallah, Kamal Said; Attoumane, Rachad; Mliva, Ahamada M S A; Zhong, Yanchun; Lu, Fangli; Song, Jianping

    2015-12-15

    Plasmodium falciparum malaria is a significant public health problem in Comoros, and artemisinin combination therapy (ACT) remains the first choice for treating acute uncomplicated P. falciparum. The emergence and spread of artemisinin-resistant P. falciparum in Southeast Asia, associated with mutations in K13-propeller gene, poses a potential threat to ACT efficacy. Detection of mutations in the P. falciparum K13-propeller gene may provide the first-hand information on changes in parasite susceptibility to artemisinin. The objective of this study is to determinate the prevalence of mutant K13-propeller gene among the P. falciparum isolates collected from Grande Comore Island, Union of Comoros, where ACT has been in use since 2004. A total of 207 P. falciparum clinical isolates were collected from the island during March 2006 and October 2007 (n = 118) and March 2013 and December 2014 (n = 89). All isolates were analysed for single nucleotide polymorphisms (SNPs) and haplotypes in the K13-propeller gene using nested PCR and DNA sequencing. Only three 2006-2007 samples carried SNPs in the K13-propeller gene, one having a synonymous (G538G) and the other having two non-synonymous (S477Y and D584E) substitutions leading to two mutated haplotypes (2.2%, 2/95). Three synonymous mutations (R471R, Y500Y, and G538G) (5.9%, 5/85) and 7 non-synonymous substitutions (21.2%, 18/85) with nine mutated haplotypes (18.8%, 16/85) were found in isolates from 2013 to 2014. However, none of the polymorphisms associated with artemisinin-resistance in Southeast Asia was detected from any of the parasites examined. This study showed increased K13-propeller gene diversity among P. falciparum populations on the Island over the course of 8 years (2006-2014). Nevertheless, none of the polymorphisms known to be associated with artemisinin resistance in Asia was detected in the parasite populations examined. Our data suggest that P. falciparum populations in Grande Comore are still effectively susceptible to artemisinin. Our results provide insights into P. falciparum populations regarding mutations in the gene associated with artemisinin resistance and will be useful for developing and updating anti-malarial guidance in Comoros.

  12. Dissecting the genetics of the human transcriptome identifies novel trait-related trans-eQTLs and corroborates the regulatory relevance of non-protein coding loci†.

    PubMed

    Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus

    2015-08-15

    Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. © The Author 2015. Published by Oxford University Press.

  13. Teicoplanin resistance in Staphylococcus haemolyticus is associated with mutations in histidine kinases VraS and WalK.

    PubMed

    Vimberg, Vladimir; Cavanagh, Jorunn Pauline; Benada, Oldřich; Kofroňová, Olga; Hjerde, Erik; Zieglerová, Leona; Balíková Novotná, Gabriela

    2018-03-01

    We investigated the genetic basis of glycopeptide resistance in laboratory-derived strains of S. haemolyticus with emphasis on differences between vancomycin and teicoplanin. The genomes of two stable teicoplanin-resistant laboratory mutants selected on vancomycin or teicoplanin were sequenced and compared to parental S. haemolyticus strain W2/124. Only the two non-synonymous mutations, VraS Q289K and WalK V550L were identified. No other mutations or genome rearrangements were detected. Increased cell wall thickness, resistance to lysostaphin-induced lysis and adaptation of cell growth rates specifically to teicoplanin were phenotypes observed in a sequenced strain with the VraS Q289K mutation. Neither of the VraS Q289K and WalK V550L mutations was present in the genomes of 121S. haemolyticus clinical isolates. However, all but two of the teicoplanin resistant strains carried non-synonymous SNPs in vraSRTU and walKR-YycHIJ operons pointing to their importance for the glycopeptide resistance. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. [Mutation analysis of seven patients with Waardenburg syndrome].

    PubMed

    Hao, Ziqi; Zhou, Yongan; Li, Pengli; Zhang, Quanbin; Li, Jiao; Wang, Pengfei; Li, Xiangshao; Feng, Yong

    2016-06-01

    To perform genetic analysis for 7 patients with Waardenburg syndrome. Potential mutation of MITF, PAX3, SOX10 and SNAI2 genes was screened by polymerase chain reaction and direct sequencing. Functions of non-synonymous polymorphisms were predicted with PolyPhen2 software. Seven mutations, including c.649-651delAGA (p.R217del), c.72delG (p.G24fs), c.185T>C (p.M62T), c.118C>T (p.Q40X), c.422T>C (p.L141P), c.640C>T (p.R214X) and c.28G>T(p.G43V), were detected in the patients. Among these, four mutations of the PAX3 gene (c.72delG, c.185T>C, c.118C>T and c.128G>T) and one SOX10 gene mutation (c.422T>C) were not reported previously. Three non-synonymous SNPs (c.185T>C, c.128G>T and c.422T>C) were predicted as harmful. Genetic mutations have been detected in all patients with Waardenburg syndrome.

  15. Identification and validation of single nucleotide polymorphisms in growth- and maturation-related candidate genes in sole (Solea solea L.).

    PubMed

    Diopere, Eveline; Hellemans, Bart; Volckaert, Filip A M; Maes, Gregory E

    2013-03-01

    Genomic methodologies applied in evolutionary and fisheries research have been of great benefit to understand the marine ecosystem and the management of natural resources. Although single nucleotide polymorphisms (SNPs) are attractive for the study of local adaptation, spatial stock management and traceability, and investigating the effects of fisheries-induced selection, they have rarely been exploited in non-model organisms. This is partly due to difficulties in finding and validating SNPs in species with limited or no genomic resources. Complementary to random genome-scan approaches, a targeted candidate gene approach has the potential to unveil pre-selected functional diversity and provides more in depth information on the action of selection at specific genes. For example genes can be under selective pressure due to climate change and sustained periods of heavy fishing pressure. In this study, we applied a candidate gene approach in sole (Solea solea L.), an important member of the demersal ecosystem. As consumption flatfish it is heavy exploited and has experienced associated life-history changes over the last 60years. To discover novel genetic polymorphisms in or around genes linked to important life history traits in sole, we screened a total of 76 candidate genes related to growth and maturation using a targeted resequencing approach. We identified in total 86 putative SNPs in 22 genes and validated 29 SNPs using a multiplex single-base extension genotyping assay. We found 22 informative SNPs, of which two represent non-synonymous mutations, potentially of functional relevance. These novel markers should be rapidly and broadly applicable in analyses of natural sole populations, as a measure of the evolutionary signature of overfishing and for initiatives on marker assisted selection. Copyright © 2012 Elsevier B.V. All rights reserved.

  16. Association of Genetic Variants of Small Non-Coding RNAs with Survival in Colorectal Cancer

    PubMed Central

    Pao, Jiunn-Bey; Lu, Te-Ling; Ting, Wen-Chien; Chen, Lu-Min; Bao, Bo-Ying

    2018-01-01

    Background: Single nucleotide polymorphisms (SNPs) of small non-coding RNAs (sncRNAs) can influence sncRNA function and target gene expression to mediate the risk of certain diseases. The aim of the present study was to evaluate the prognostic relevance of sncRNA SNPs for colorectal cancer, which has not been well characterized to date. Methods: We comprehensively examined 31 common SNPs of sncRNAs, and assessed the impact of these variants on survival in a cohort of 188 patients with colorectal cancer. Results: Three SNPs were significantly associated with survival of patients with colorectal cancer after correction for multiple testing, and two of the SNPs (hsa-mir-196a-2 rs11614913 and U85 rs714775) remained significant in multivariate analyses. Additional in silico analysis provided further evidence of this association, since the expression levels of the target genes of the hsa-miR-196a (HOXA7, HOXB8, and AKT1) were significantly correlated with colorectal cancer progression. Furthermore, Gene Ontology and Kyoto Encyclopedia of Genes and Genomes enrichment analyses indicated that hsa-miR-196a is associated with well-known oncogenic pathways, including cellular protein modification process, mitotic cell cycle, adherens junction, and extracellular matrix receptor interaction pathways. Conclusion: Our results suggest that SNPs of sncRNAs could play a critical role in cancer progression, and that hsa-miR-196a might be a valuable biomarker or therapeutic target for colorectal cancer patients. PMID:29483812

  17. Association between FTO polymorphism in exon 3 with carcass and meat quality traits in crossbred ducks.

    PubMed

    Gan, W; Song, Q; Zhang, N N; Xiong, X P; Wang, D M C; Li, L

    2015-06-18

    The fat mass and obesity-associated gene (FTO) is an excellent candidate gene that affects energy metabolism. Single nucleotide polymorphisms (SNPs) in FTO are associated with carcass and meat quality traits in pigs, cattle, and rabbits. The aim of this study was to investigate the association between novel SNPs in the FTO coding region and carcass and meat quality traits in 95 crossbred ducks, using DNA sequencing. We found two transitions G/A (SNP 387 and 473) within exon 3. SNP 387 was a synonymous mutation, whereas SNP 473 was a missense mutation. Association analysis suggested that SNP g.387G>A was significantly associated with all of the carcass traits measured, the intramuscular fat content (IMF), cooking yield (CY), pH values 45 min after slaughter (pH45m), drip losses from the breast muscle, and the leg muscle (P < 0.05). For SNP g.473G>A, the genotype AA exhibited greater leg muscle weight than the genotypes GG or AG (P < 0.05). The D value suggested that the two SNPs exhibited strong linkage disequilibrium. Three haplotypes (G1G2, G1A2, and A1A2) were significantly associated with IMF, CY, the a* value, and all of the carcass traits measured (P < 0.05). The results suggest that FTO is a candidate locus that affects carcass and meat quality traits in ducks.

  18. Integrating Transcriptome and Genome Re-Sequencing Data to Identify Key Genes and Mutations Affecting Chicken Eggshell Qualities

    PubMed Central

    Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua

    2015-01-01

    Eggshell damages lead to economic losses in the egg production industry and are a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways as reveled by gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate the transcriptome and genome re-sequencing to target the genetic variations which decreased the eggshell qualities. These findings further advance our understanding of eggshell calcification in the chicken uterus. PMID:25974068

  19. Endothelin Receptor B2 (EDNRB2) Gene Is Associated with Spot Plumage Pattern in Domestic Ducks (Anas platyrhynchos).

    PubMed

    Li, Ling; Li, Dan; Liu, Li; Li, Shijun; Feng, Yanping; Peng, Xiuli; Gong, Yanzhang

    2015-01-01

    Endothelin receptor B subtype 2 (EDNRB2) is a seven-transmembrane G-protein coupled receptor. In this study, we investigated EDNRB2 gene as a candidate gene for duck spot plumage pattern according to studies of chicken and Japanese quail. The entire coding region was cloned by the reverse transcription polymerase chain reaction (RT-PCR). Sequence analysis showed that duck EDNRB2 cDNA contained a 1311 bp open reading frame and encoded a putative protein of 436 amino acids residues. The transcript shared 89%-90% identity with the counterparts in other avian species. A phylogenetic tree based on amino acid sequences showed that duck EDNRB2 was evolutionary conserved in avian clade. The entire coding region of EDNRB2 were sequenced in 20 spot and 20 non-spot ducks, and 13 SNPs were identified. Two of them (c.940G>A and c.995G>A) were non-synonymous substitutions, and were genotyped in 647 ducks representing non-spot and spot phenotypes. The c.995G>A mutation, which results in the amino acid substitution of Arg332His, was completely associated with the spot phenotype: all 152 spot ducks were carriers of the AA genotype and the other 495 individuals with non-spot phenotype were carriers of GA or GG genotype, respectively. Segregation in 17 GA×GG and 22 GA×GA testing combinations confirmed this association since the segregation ratios and genotypes of the offspring were in agreement with the hypothesis. In order to investigate the underlying mechanism of the spot phenotype, MITF gene was used as cell type marker of melanocyte progenitor cells while TYR and TYRP1 gene were used as cell type markers of mature melanocytes. Transcripts of MITF, TYR and TYRP1 gene with expected size were identified in all pigmented skin tissues while PCR products were not obtained from non-pigmented skin tissues. It was inferred that melanocytes are absent in non-pigmented skin tissues of spot ducks.

  20. Genomic Footprints in Selected and Unselected Beef Cattle Breeds in Korea.

    PubMed

    Lim, Dajeong; Strucken, Eva M; Choi, Bong Hwan; Chai, Han Ha; Cho, Yong Min; Jang, Gul Won; Kim, Tae-Hun; Gondro, Cedric; Lee, Seung Hwan

    2016-01-01

    Korean Hanwoo cattle have been subjected to intensive artificial selection over the past four decades to improve meat production traits. Another three cattle varieties very closely related to Hanwoo reside in Korea (Jeju Black and Brindle) and in China (Yanbian). These breeds have not been part of a breeding scheme to improve production traits. Here, we compare the selected Hanwoo against these similar but presumed to be unselected populations to identify genomic regions that have been under recent selection pressure due to the breeding program. Rsb statistics were used to contrast the genomes of Hanwoo versus a pooled sample of the three unselected population (UN). We identified 37 significant SNPs (FDR corrected) in the HW/UN comparison and 21 known protein coding genes were within 1 MB to the identified SNPs. These genes were previously reported to affect traits important for meat production (14 genes), reproduction including mammary gland development (3 genes), coat color (2 genes), and genes affecting behavioral traits in a broader sense (2 genes). We subsequently sequenced (Illumina HiSeq 2000 platform) 10 individuals of the brown Hanwoo and the Chinese Yanbian to identify SNPs within the candidate genomic regions. Based on allele frequency differences, haplotype structures, and literature research, we singled out one non-synonymous SNP in the APP gene (APP: c.569C>T, Ala199Val) and predicted the mutational effect on the protein structure. We found that protein-protein interactions might be impaired due to increased exposed hydrophobic surfaces of the mutated protein. The APP gene has also been reported to affect meat tenderness in pigs and obesity in humans. Meat tenderness has been linked to intramuscular fat content, which is one of the main breeding goals for brown Hanwoo, potentially supporting a causal influence of the herein described nsSNP in the APP gene.

  1. LincSNP 2.0: an updated database for linking disease-associated SNPs to human long non-coding RNAs and their TFBSs.

    PubMed

    Ning, Shangwei; Yue, Ming; Wang, Peng; Liu, Yue; Zhi, Hui; Zhang, Yan; Zhang, Jizhou; Gao, Yue; Guo, Maoni; Zhou, Dianshuang; Li, Xin; Li, Xia

    2017-01-04

    We describe LincSNP 2.0 (http://bioinfo.hrbmu.edu.cn/LincSNP), an updated database that is used specifically to store and annotate disease-associated single nucleotide polymorphisms (SNPs) in human long non-coding RNAs (lncRNAs) and their transcription factor binding sites (TFBSs). In LincSNP 2.0, we have updated the database with more data and several new features, including (i) expanding disease-associated SNPs in human lncRNAs; (ii) identifying disease-associated SNPs in lncRNA TFBSs; (iii) updating LD-SNPs from the 1000 Genomes Project; and (iv) collecting more experimentally supported SNP-lncRNA-disease associations. Furthermore, we developed three flexible online tools to retrieve and analyze the data. Linc-Mart is a convenient way for users to customize their own data. Linc-Browse is a tool for all data visualization. Linc-Score predicts the associations between lncRNA and disease. In addition, we provided users a newly designed, user-friendly interface to search and download all the data in LincSNP 2.0 and we also provided an interface to submit novel data into the database. LincSNP 2.0 is a continually updated database and will serve as an important resource for investigating the functions and mechanisms of lncRNAs in human diseases. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Is α‐T catenin (VR22) an Alzheimer's disease risk gene?

    PubMed Central

    Bertram, Lars; Mullin, Kristina; Parkinson, Michele; Hsiao, Monica; Moscarillo, Thomas J; Wagner, Steven L; Becker, K David; Velicelebi, Gonul; Blacker, Deborah; Tanzi, Rudolph E

    2007-01-01

    Background Recently, conflicting reports have been published on the potential role of genetic variants in the α‐T catenin gene (VR22; CTNNA3) on the risk for Alzheimer's disease. In these papers, evidence for association is mostly observed in multiplex families with Alzheimer's disease, whereas case–control samples of sporadic Alzheimer's disease are predominantly negative. Methods After sequencing VR22 in multiplex families with Alzheimer's disease linked to chromosome 10q21, we identified a novel non‐synonymous (Ser596Asn; rs4548513) single nucleotide polymorphism (SNP). This and four non‐coding SNPs were assessed in two independent samples of families with Alzheimer's disease, one with 1439 subjects from 437 multiplex families with Alzheimer's disease and the other with 489 subjects from 217 discordant sibships. Results A weak association with the Ser596Asn SNP in the multiplex sample, predominantly in families with late‐onset Alzheimer's disease (p = 0.02), was observed. However, this association does not seem to contribute substantially to the chromosome 10 Alzheimer's disease linkage signal that we and others have reported previously. No evidence was found of association with any of the four additional SNPs tested in the multiplex families with Alzheimer's disease. Finally, the Ser596Asn change was not associated with the risk for Alzheimer's disease in the independent discordant sibship sample. Conclusions This is the first study to report evidence of an association between a potentially functional, non‐synonymous SNP in VR22 and the risk for Alzheimer's disease. As the underlying effects are probably small, and are only seen in families with multiple affected members, the population‐wide significance of this finding remains to be determined. PMID:17209133

  3. Molecular Characterization and Comparative Sequence Analysis of Defense-Related Gene, Oryza rufipogon Receptor-Like Protein Kinase 1

    PubMed Central

    Law, Yee-Song; Gudimella, Ranganath; Song, Beng-Kah; Ratnam, Wickneswari; Harikrishna, Jennifer Ann

    2012-01-01

    Many of the plant leucine rich repeat receptor-like kinases (LRR-RLKs) have been found to regulate signaling during plant defense processes. In this study, we selected and sequenced an LRR-RLK gene, designated as Oryza rufipogon receptor-like protein kinase 1 (OrufRPK1), located within yield QTL yld1.1 from the wild rice Oryza rufipogon (accession IRGC105491). A 2055 bp coding region and two exons were identified. Southern blotting determined OrufRPK1 to be a single copy gene. Sequence comparison with cultivated rice orthologs (OsI219RPK1, OsI9311RPK1 and OsJNipponRPK1, respectively derived from O. sativa ssp. indica cv. MR219, O. sativa ssp. indica cv. 9311 and O. sativa ssp. japonica cv. Nipponbare) revealed the presence of 12 single nucleotide polymorphisms (SNPs) with five non-synonymous substitutions, and 23 insertion/deletion sites. The biological role of the OrufRPK1 as a defense related LRR-RLK is proposed on the basis of cDNA sequence characterization, domain subfamily classification, structural prediction of extra cellular domains, cluster analysis and comparative gene expression. PMID:22942769

  4. Influence of ghrelin gene polymorphisms on hypertension and atherosclerotic disease.

    PubMed

    Berthold, Heiner K; Giannakidou, Eleni; Krone, Wilhelm; Trégouët, David-Alexandre; Gouni-Berthold, Ioanna

    2010-02-01

    Ghrelin is involved in several metabolic and cardiovascular processes. Recent evidence suggests its involvement in blood pressure regulation and hypertension. The aim of the study was to determine associations of single-nucleotide polymorphisms (SNPs) and haplotypes of the ghrelin gene (GHRL) with hypertension and atherosclerotic disease. Six GHRL SNPs (rs27647, rs26802, rs34911341, rs696217, rs4684677 and a -473G/A (with no assigned rsID)) were investigated in a sample of 1143 hypertensive subjects and 1489 controls of Caucasian origin. Both single-locus and haplotype association analyses were performed. In single-locus analyses, only the non-synonymous rs34911341 was associated with hypertension (odds ratio (OR)=1.95 (95% confidence interval (CI): 1.26-3.02), P=0.003). Six common haplotypes with frequency >1% were inferred from the studied GHRL SNPs, and their frequency distribution was significantly different between hypertensive subjects and controls (chi(2)=12.96 with 5 d.f. (degree of freedom), P=0.024). The effect of rs26802 was found to be significantly (P=0.017) modulated by other GHRL SNPs, as its C allele conferred either an increased risk (OR=1.30 (1.08-1.57), P=0.005) or a decreased risk (OR=0.50 (0.23-1.06), P=0.07) of hypertension according to the two different haplotypes on which it can be found. No association of GHRL SNPs or haplotypes with atherosclerotic disease was observed. In conclusion, we observed statistical evidence for association between GHRL SNPs and risk of hypertension.

  5. Genetic Association and Gene-Gene Interaction Analyses in African American Dialysis Patients With Nondiabetic Nephropathy

    PubMed Central

    Bostrom, Meredith A.; Kao, W.H. Linda; Li, Man; Abboud, Hanna E.; Adler, Sharon G.; Iyengar, Sudha K.; Kimmel, Paul L.; Hanson, Robert L.; Nicholas, Susanne B.; Rasooly, Rebekah S.; Sedor, John R.; Coresh, Josef; Kohn, Orly F.; Leehey, David J.; Thornley-Brown, Denyse; Bottinger, Erwin P.; Lipkowitz, Michael S.; Meoni, Lucy A.; Klag, Michael J.; Lu, Lingyi; Hicks, Pamela J.; Langefeld, Carl D.; Parekh, Rulan S.; Bowden, Donald W.; Freedman, Barry I.

    2011-01-01

    Background African Americans (AAs) have increased susceptibility to non-diabetic nephropathy relative to European Americans. Study Design Follow-up of a pooled genome-wide association study (GWAS) in AA dialysis patients with nondiabetic nephropathy; novel gene-gene interaction analyses. Setting & Participants Wake Forest sample: 962 AA nondiabetic nephropathy cases; 931 non-nephropathy controls. Replication sample: 668 Family Investigation of Nephropathy and Diabetes (FIND) AA nondiabetic nephropathy cases; 804 non-nephropathy controls. Predictors Individual genotyping of top 1420 pooled GWAS-associated single nucleotide polymorphisms (SNPs) and 54 SNPs in six nephropathy susceptibility genes. Outcomes APOL1 genetic association and additional candidate susceptibility loci interacting with, or independently from, APOL1. Results The strongest GWAS associations included two non-coding APOL1 SNPs, rs2239785 (odds ratio [OR], 0.33; dominant; p = 5.9 × 10−24) and rs136148 (OR, 0.54; additive; p = 1.1 × 10−7) with replication in FIND (p = 5.0 × 10−21 and 1.9 × 10−05, respectively). Rs2239785 remained significantly associated after controlling for the APOL1 G1 and G2 coding variants. Additional top hits included a CFH SNP(OR from meta-analysis in above 3367 AA cases and controls, 0.81; additive; p = 6.8 × 10−4). The 1420 SNPs were tested for interaction with APOL1 G1 and G2 variants. Several interactive SNPs were detected, the most significant was rs16854341 in the podocin gene (NPHS2) (p = 0.0001). Limitations Non-pooled GWAS have not been performed in AA nondiabetic nephropathy. Conclusions This follow-up of a pooled GWAS provides additional and independent evidence that APOL1 variants contribute to nondiabetic nephropathy in AAs and identified additional associated and interactive non-diabetic nephropathy susceptibility genes. PMID:22119407

  6. Genome-wide high-throughput SNP discovery and genotyping for understanding natural (functional) allelic diversity and domestication patterns in wild chickpea

    PubMed Central

    Bajaj, Deepak; Das, Shouvik; Badoni, Saurabh; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.

    2015-01-01

    We identified 82489 high-quality genome-wide SNPs from 93 wild and cultivated Cicer accessions through integrated reference genome- and de novo-based GBS assays. High intra- and inter-specific polymorphic potential (66–85%) and broader natural allelic diversity (6–64%) detected by genome-wide SNPs among accessions signify their efficacy for monitoring introgression and transferring target trait-regulating genomic (gene) regions/allelic variants from wild to cultivated Cicer gene pools for genetic improvement. The population-specific assignment of wild Cicer accessions pertaining to the primary gene pool are more influenced by geographical origin/phenotypic characteristics than species/gene-pools of origination. The functional significance of allelic variants (non-synonymous and regulatory SNPs) scanned from transcription factors and stress-responsive genes in differentiating wild accessions (with potential known sources of yield-contributing and stress tolerance traits) from cultivated desi and kabuli accessions, fine-mapping/map-based cloning of QTLs and determination of LD patterns across wild and cultivated gene-pools are suitably elucidated. The correlation between phenotypic (agromorphological traits) and molecular diversity-based admixed domestication patterns within six structured populations of wild and cultivated accessions via genome-wide SNPs was apparent. This suggests utility of whole genome SNPs as a potential resource for identifying naturally selected trait-regulating genomic targets/functional allelic variants adaptive to diverse agroclimatic regions for genetic enhancement of cultivated gene-pools. PMID:26208313

  7. Exploration of structural stability in deleterious nsSNPs of the XPA gene: A molecular dynamics approach

    PubMed Central

    NagaSundaram, N; Priya Doss, C George

    2011-01-01

    Background: Distinguishing the deleterious from the massive number of non-functional nsSNPs that occur within a single genome is a considerable challenge in mutation research. In this approach, we have used the existing in silico methods to explore the mutation-structure-function relationship in the XPAgene. Materials and Methods: We used the Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping (PolyPhen), I-Mutant 2.0, and the Protein Analysis THrough Evolutionary Relationships methods to predict the effects of deleterious nsSNPs on protein function and evaluated the impact of mutation on protein stability by Molecular Dynamics simulations. Results: By comparing the scores of all the four in silico methods, nsSNP with an ID rs104894131 at position C108F was predicted to be highly deleterious. We extended our Molecular dynamics approach to gain insight into the impact of this non-synonymous polymorphism on structural changes that may affect the activity of the XPAgene. Conclusion: Based on the in silico methods score, potential energy, root-mean-square deviation, and root-mean-square fluctuation, we predict that deleterious nsSNP at position C108F would play a significant role in causing disease by the XPA gene. Our approach would present the application of in silicotools in understanding the functional variation from the perspective of structure, evolution, and phenotype. PMID:22190868

  8. The genomes of three stocks comprising the most widely utilized live sporozoite Theileria parva vaccine exhibit very different degrees and patterns of sequence divergence.

    PubMed

    Norling, Martin; Bishop, Richard P; Pelle, Roger; Qi, Weihong; Henson, Sonal; Drábek, Elliott F; Tretina, Kyle; Odongo, David; Mwaura, Stephen; Njoroge, Thomas; Bongcam-Rudloff, Erik; Daubenberger, Claudia A; Silva, Joana C

    2015-09-24

    There are no commercially available vaccines against human protozoan parasitic diseases, despite the success of vaccination-induced long-term protection against infectious diseases. East Coast fever, caused by the protist Theileria parva, kills one million cattle each year in sub-Saharan Africa, and contributes significantly to hunger and poverty in the region. A highly effective, live, multi-isolate vaccine against T. parva exists, but its component isolates have not been characterized. Here we sequence and compare the three component T. parva stocks within this vaccine, the Muguga Cocktail, namely Muguga, Kiambu5 and Serengeti-transformed, aiming to identify genomic features that contribute to vaccine efficacy. We find that Serengeti-transformed, originally isolated from the wildlife carrier, the African Cape buffalo, is remarkably and unexpectedly similar to the Muguga isolate. The 420 detectable non-synonymous SNPs were distributed among only 53 genes, primarily subtelomeric antigens and antigenic families. The Kiambu5 isolate is considerably more divergent, with close to 40,000 SNPs relative to Muguga, including >8,500 non-synonymous mutations distributed among >1,700 (42.5 %) of the predicted genes. These genetic markers of the component stocks can be used to characterize the composition of new batches of the Muguga Cocktail. Differences among these three isolates, while extensive, represent only a small proportion of the genetic variation in the entire species. Given the efficacy of the Muguga Cocktail in inducing long-lasting protection against infections in the field, our results suggest that whole-organism vaccines against parasitic diseases can be highly efficacious despite considerable genome-wide differences relative to the isolates against which they protect.

  9. A candidate gene approach to study nematode resistance traits in naturally infected sheep.

    PubMed

    Wilkie, Hazel; Riggio, Valentina; Matika, Oswald; Nicol, Louise; Watt, Kathryn A; Sinclair, Rona; Sparks, Alexandra M; Nussey, Daniel H; Pemberton, Josephine M; Houston, Ross D; Hopkins, John

    2017-08-30

    Sheep naturally acquire a degree of resistant immunity to parasitic worm infection through repeated exposure. However, the immune response and clinical outcome vary greatly between animals. Genetic polymorphisms in genes integral to differential T helper cell polarization may contribute to variation in host response and disease outcome. A total of twelve single nucleotide polymorphisms (SNPs) were sequenced in IL23R, RORC2 and TBX21 from genomic DNA of Scottish Blackface lambs. Of the twelve SNPs, six were non-synonymous (missense), four were within the 3' UTRs and two were intronic. The association between nine of these SNPs and the traits of body weight, faecal egg count (FEC) and relative T. circumcincta L3-specific IgA antibody levels was assessed in a population of domestic Scottish Blackface ewe lambs and a population of free-living Soay ewe lambs both naturally infected with a mixture of nematodes. There were no significant associations identified between any of the SNPs and phenotypes recorded in either of the populations after adjustment for multiple testing (Bonferroni corrected P value≤0.002). In the Blackface lambs, there was a nominally significant association (P=0.007) between IL23R p.V324M and weight at 20 weeks. This association may be worthy of further investigation in a larger sample of sheep. Copyright © 2017. Published by Elsevier B.V.

  10. Genetic Divergence between Camellia sinensis and Its Wild Relatives Revealed via Genome-Wide SNPs from RAD Sequencing.

    PubMed

    Yang, Hua; Wei, Chao-Ling; Liu, Hong-Wei; Wu, Jun-Lan; Li, Zheng-Guo; Zhang, Liang; Jian, Jian-Bo; Li, Ye-Yun; Tai, Yu-Ling; Zhang, Jing; Zhang, Zheng-Zhu; Jiang, Chang-Jun; Xia, Tao; Wan, Xiao-Chun

    2016-01-01

    Tea is one of the most popular beverages across the world and is made exclusively from cultivars of Camellia sinensis. Many wild relatives of the genus Camellia that are closely related to C. sinensis are native to Southwest China. In this study, we first identified the distinct genetic divergence between C. sinensis and its wild relatives and provided a glimpse into the artificial selection of tea plants at a genome-wide level by analyzing 15,444 genomic SNPs that were identified from 18 cultivated and wild tea accessions using a high-throughput genome-wide restriction site-associated DNA sequencing (RAD-Seq) approach. Six distinct clusters were detected by phylogeny inferrence and principal component and genetic structural analyses, and these clusters corresponded to six Camellia species/varieties. Genetic divergence apparently indicated that C. taliensis var. bangwei is a semi-wild or transient landrace occupying a phylogenetic position between those wild and cultivated tea plants. Cultivated accessions exhibited greater heterozygosity than wild accessions, with the exception of C. taliensis var. bangwei. Thirteen genes with non-synonymous SNPs exhibited strong selective signals that were suggestive of putative artificial selective footprints for tea plants during domestication. The genome-wide SNPs provide a fundamental data resource for assessing genetic relationships, characterizing complex traits, comparing heterozygosity and analyzing putatitve artificial selection in tea plants.

  11. Linkage and association study of late-onset Alzheimer disease families linked to 9p21.3.

    PubMed

    Züchner, S; Gilbert, J R; Martin, E R; Leon-Guerrero, C R; Xu, P-T; Browning, C; Bronson, P G; Whitehead, P; Schmechel, D E; Haines, J L; Pericak-Vance, M A

    2008-11-01

    A chromosomal locus for late-onset Alzheimer disease (LOAD) has previously been mapped to 9p21.3. The most significant results were reported in a sample of autopsy-confirmed families. Linkage to this locus has been independently confirmed in AD families from a consanguineous Israeli-Arab community. In the present study we analyzed an expanded clinical sample of 674 late-onset AD families, independently ascertained by three different consortia. Sample subsets were stratified by site and autopsy-confirmation. Linkage analysis of a dense array of SNPs across the chromosomal locus revealed the most significant results in the 166 autopsy-confirmed families of the NIMH sample. Peak HLOD scores of 4.95 at D9S741 and 2.81 at the nearby SNP rs2772677 were obtained in a dominant model. The linked region included the cyclin-dependent kinase inhibitor 2A gene (CDKN2A), which has been suggested as an AD candidate gene. By re-sequencing all exons in the vicinity of CDKN2A in 48 AD cases, we identified and genotyped four novel SNPs, including a non-synonymous, a synonymous, and two variations located in untranslated RNA sequences. Family-based allelic and genotypic association analysis yielded significant results in CDKN2A (rs11515: PDT p = 0.003, genotype-PDT p = 0.014). We conclude that CDKN2A is a promising new candidate gene potentially contributing to AD susceptibility on chromosome 9p.

  12. PolyTB: A genomic variation map for Mycobacterium tuberculosis

    PubMed Central

    Coll, Francesc; Preston, Mark; Guerra-Assunção, José Afonso; Hill-Cawthorn, Grant; Harris, David; Perdigão, João; Viveiros, Miguel; Portugal, Isabel; Drobniewski, Francis; Gagneux, Sebastien; Glynn, Judith R.; Pain, Arnab; Parkhill, Julian; McNerney, Ruth; Martin, Nigel; Clark, Taane G.

    2014-01-01

    Summary Tuberculosis (TB) caused by Mycobacterium tuberculosis (Mtb) is the second major cause of death from an infectious disease worldwide. Recent advances in DNA sequencing are leading to the ability to generate whole genome information in clinical isolates of M. tuberculosis complex (MTBC). The identification of informative genetic variants such as phylogenetic markers and those associated with drug resistance or virulence will help barcode Mtb in the context of epidemiological, diagnostic and clinical studies. Mtb genomic datasets are increasingly available as raw sequences, which are potentially difficult and computer intensive to process, and compare across studies. Here we have processed the raw sequence data (>1500 isolates, eight studies) to compile a catalogue of SNPs (n = 74,039, 63% non-synonymous, 51.1% in more than one isolate, i.e. non-private), small indels (n = 4810) and larger structural variants (n = 800). We have developed the PolyTB web-based tool (http://pathogenseq.lshtm.ac.uk/polytb) to visualise the resulting variation and important meta-data (e.g. in silico inferred strain-types, location) within geographical map and phylogenetic views. This resource will allow researchers to identify polymorphisms within candidate genes of interest, as well as examine the genomic diversity and distribution of strains. PolyTB source code is freely available to researchers wishing to develop similar tools for their pathogen of interest. PMID:24637013

  13. Helicobacter pylori genetic diversification in the Mongolian gerbil model.

    PubMed

    Beckett, Amber C; Loh, John T; Chopra, Abha; Leary, Shay; Lin, Aung Soe; McDonnell, Wyatt J; Dixon, Beverly R E A; Noto, Jennifer M; Israel, Dawn A; Peek, Richard M; Mallal, Simon; Algood, Holly M Scott; Cover, Timothy L

    2018-01-01

    Helicobacter pylori requires genetic agility to infect new hosts and establish long-term colonization of changing gastric environments. In this study, we analyzed H. pylori genetic adaptation in the Mongolian gerbil model. This model is of particular interest because H. pylori -infected gerbils develop a high level of gastric inflammation and often develop gastric adenocarcinoma or gastric ulceration. We analyzed the whole genome sequences of H. pylori strains cultured from experimentally infected gerbils, in comparison to the genome sequence of the input strain. The mean annualized single nucleotide polymorphism (SNP) rate per site was 1.5e -5 , which is similar to the rates detected previously in H. pylori- infected humans. Many of the mutations occurred within or upstream of genes associated with iron-related functions ( fur , tonB1 , fecA2 , fecA3 , and frpB3 ) or encoding outer membrane proteins ( alpA, oipA, fecA2, fecA3, frpB3 and cagY ). Most of the SNPs within coding regions (86%) were non-synonymous mutations. Several deletion or insertion mutations led to disruption of open reading frames, suggesting that the corresponding gene products are not required or are deleterious during chronic H. pylori colonization of the gerbil stomach. Five variants (three SNPs and two deletions) were detected in isolates from multiple animals, which suggests that these mutations conferred a selective advantage. One of the mutations (FurR88H) detected in isolates from multiple animals was previously shown to confer increased resistance to oxidative stress, and we now show that this SNP also confers a survival advantage when H. pylori is co-cultured with neutrophils. Collectively, these analyses allow the identification of mutations that are positively selected during H. pylori colonization of the gerbil model.

  14. Single nucleotide polymorphisms in common bean: their discovery and genotyping using a multiplex detection system

    USDA-ARS?s Scientific Manuscript database

    Single-nucleotide Polymorphism (SNP) markers are by far the most common form of DNA polymorphism in a genome. The objectives of this study were to discover SNPs in common bean comparing sequences from coding and non-coding regions obtained from Genbank and genomic DNA and to compare sequencing resu...

  15. Detecting Adaptation in Protein-Coding Genes Using a Bayesian Site-Heterogeneous Mutation-Selection Codon Substitution Model.

    PubMed

    Rodrigue, Nicolas; Lartillot, Nicolas

    2017-01-01

    Codon substitution models have traditionally attempted to uncover signatures of adaptation within protein-coding genes by contrasting the rates of synonymous and non-synonymous substitutions. Another modeling approach, known as the mutation-selection framework, attempts to explicitly account for selective patterns at the amino acid level, with some approaches allowing for heterogeneity in these patterns across codon sites. Under such a model, substitutions at a given position occur at the neutral or nearly neutral rate when they are synonymous, or when they correspond to replacements between amino acids of similar fitness; substitutions from high to low (low to high) fitness amino acids have comparatively low (high) rates. Here, we study the use of such a mutation-selection framework as a null model for the detection of adaptation. Following previous works in this direction, we include a deviation parameter that has the effect of capturing the surplus, or deficit, in non-synonymous rates, relative to what would be expected under a mutation-selection modeling framework that includes a Dirichlet process approach to account for across-codon-site variation in amino acid fitness profiles. We use simulations, along with a few real data sets, to study the behavior of the approach, and find it to have good power with a low false-positive rate. Altogether, we emphasize the potential of recent mutation-selection models in the detection of adaptation, calling for further model refinements as well as large-scale applications. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  16. Clonal evolution in relapsed and refractory diffuse large B-cell lymphoma is characterized by high dynamics of subclones.

    PubMed

    Melchardt, Thomas; Hufnagl, Clemens; Weinstock, David M; Kopp, Nadja; Neureiter, Daniel; Tränkenschuh, Wolfgang; Hackl, Hubert; Weiss, Lukas; Rinnerthaler, Gabriel; Hartmann, Tanja N; Greil, Richard; Weigert, Oliver; Egle, Alexander

    2016-08-09

    Little information is available about the role of certain mutations for clonal evolution and the clinical outcome during relapse in diffuse large B-cell lymphoma (DLBCL). Therefore, we analyzed formalin-fixed-paraffin-embedded tumor samples from first diagnosis, relapsed or refractory disease from 28 patients using next-generation sequencing of the exons of 104 coding genes. Non-synonymous mutations were present in 74 of the 104 genes tested. Primary tumor samples showed a median of 8 non-synonymous mutations (range: 0-24) with the used gene set. Lower numbers of non-synonymous mutations in the primary tumor were associated with a better median OS compared with higher numbers (28 versus 15 months, p=0.031). We observed three patterns of clonal evolution during relapse of disease: large global change, subclonal selection and no or minimal change possibly suggesting preprogrammed resistance. We conclude that targeted re-sequencing is a feasible and informative approach to characterize the molecular pattern of relapse and it creates novel insights into the role of dynamics of individual genes.

  17. HERC1 polymorphisms: population-specific variations in haplotype composition.

    PubMed

    Yuasa, Isao; Umetsu, Kazuo; Nishimukai, Hiroaki; Fukumori, Yasuo; Harihara, Shinji; Saitou, Naruya; Jin, Feng; Chattopadhyay, Prasanta K; Henke, Lotte; Henke, Jürgen

    2009-08-01

    Human HERC1 is one of six HERC proteins and may play an important role in intracellular membrane trafficking. The human HERC1 gene is suggested to have been affected by local positive selection. To assess the global frequency distributions of coding and non-coding single nucleotide polymorphisms (SNPs) in the HERC1 gene, we developed a new simultaneous genotyping method for four SNPs, and applied this method to investigate 1213 individuals from 12 global populations. The results confirmed remarked differences in the allele and haplotype frequencies between East Asian and non-East Asian populations. One of the three common haplotypes observed was found to be characteristic of East Asians, who showed a relatively uniform distribution of haplotypes. Information on haplotypes would be useful for testing the function of polymorphisms in the HERC1 gene. This is the first study to investigate the distribution of HERC1 polymorphisms in various populations. (c) 2009 John Wiley & Sons, Ltd.

  18. In search of causal variants: refining disease association signals using cross-population contrasts.

    PubMed

    Saccone, Nancy L; Saccone, Scott F; Goate, Alison M; Grucza, Richard A; Hinrichs, Anthony L; Rice, John P; Bierut, Laura J

    2008-08-29

    Genome-wide association (GWA) using large numbers of single nucleotide polymorphisms (SNPs) is now a powerful, state-of-the-art approach to mapping human disease genes. When a GWA study detects association between a SNP and the disease, this signal usually represents association with a set of several highly correlated SNPs in strong linkage disequilibrium. The challenge we address is to distinguish among these correlated loci to highlight potential functional variants and prioritize them for follow-up. We implemented a systematic method for testing association across diverse population samples having differing histories and LD patterns, using a logistic regression framework. The hypothesis is that important underlying biological mechanisms are shared across human populations, and we can filter correlated variants by testing for heterogeneity of genetic effects in different population samples. This approach formalizes the descriptive comparison of p-values that has typified similar cross-population fine-mapping studies to date. We applied this method to correlated SNPs in the cholinergic nicotinic receptor gene cluster CHRNA5-CHRNA3-CHRNB4, in a case-control study of cocaine dependence composed of 504 European-American and 583 African-American samples. Of the 10 SNPs genotyped in the r2 > or = 0.8 bin for rs16969968, three demonstrated significant cross-population heterogeneity and are filtered from priority follow-up; the remaining SNPs include rs16969968 (heterogeneity p = 0.75). Though the power to filter out rs16969968 is reduced due to the difference in allele frequency in the two groups, the results nevertheless focus attention on a smaller group of SNPs that includes the non-synonymous SNP rs16969968, which retains a similar effect size (odds ratio) across both population samples. Filtering out SNPs that demonstrate cross-population heterogeneity enriches for variants more likely to be important and causative. Our approach provides an important and effective tool to help interpret results from the many GWA studies now underway.

  19. Vkorc1 sequencing suggests anticoagulant resistance in rats in New Zealand.

    PubMed

    Cowan, Phil E; Gleeson, Dianne M; Howitt, Robyn Lj; Ramón-Laca, Ana; Esther, Alexandra; Pelz, Hans-Joachim

    2017-01-01

    Anticoagulant toxins are used globally to control rats. Resistance of Rattus species to these toxins now occurs in at least 18 countries in Europe, America and Asia. Resistance is often associated with single nucleotide polymorphisms (SNPs) in the Vkorc1 gene. This study gives a first overview of the distribution and frequency of Vkorc1 SNPs in rats in New Zealand. New Zealand is unusual in having no native rodents but three species of introduced Rattus - norvegicus Berk., rattus L. and exulans Peale. Sequence variants occurred in at least one species of rat at all 30 of the sites sampled. Three new SNPs were identified, one in kiore and two in ship rats. No SNPs previously associated with resistance were found in Norway rats or kiore, but seven ship rats were heterozygous and one homozygous for the A74T variant. Its resultant Tyr25Phe mutation has previously been associated with resistance to both first- and second-generation anticoagulants in ship rats in Spain. This is the first evidence of potential resistance to anticoagulant toxins in rats in New Zealand. Further testing using blood clotting response times in dosed rats is needed to confirm resistance potentially conferred by the Tyr25Phe mutation. Assessment is also needed of the potential of the other non-synonymous variants (Ala14Val, Ala26Val) recorded in this study to confer resistance to anticoagulant toxins. © 2016 Society of Chemical Industry. © 2016 Society of Chemical Industry.

  20. Endothelin Receptor B2 (EDNRB2) Gene Is Associated with Spot Plumage Pattern in Domestic Ducks (Anas platyrhynchos)

    PubMed Central

    Li, Ling; Li, Dan; Liu, Li; Li, Shijun; Feng, Yanping; Peng, Xiuli; Gong, Yanzhang

    2015-01-01

    Endothelin receptor B subtype 2 (EDNRB2) is a seven-transmembrane G-protein coupled receptor. In this study, we investigated EDNRB2 gene as a candidate gene for duck spot plumage pattern according to studies of chicken and Japanese quail. The entire coding region was cloned by the reverse transcription polymerase chain reaction (RT-PCR). Sequence analysis showed that duck EDNRB2 cDNA contained a 1311bp open reading frame and encoded a putative protein of 436 amino acids residues. The transcript shared 89%-90% identity with the counterparts in other avian species. A phylogenetic tree based on amino acid sequences showed that duck EDNRB2 was evolutionary conserved in avian clade. The entire coding region of EDNRB2 were sequenced in 20 spot and 20 non-spot ducks, and 13 SNPs were identified. Two of them (c.940G>A and c.995G>A) were non-synonymous substitutions, and were genotyped in 647 ducks representing non-spot and spot phenotypes. The c.995G>A mutation, which results in the amino acid substitution of Arg332His, was completely associated with the spot phenotype: all 152 spot ducks were carriers of the AA genotype and the other 495 individuals with non-spot phenotype were carriers of GA or GG genotype, respectively. Segregation in 17 GA×GG and 22 GA×GA testing combinations confirmed this association since the segregation ratios and genotypes of the offspring were in agreement with the hypothesis. In order to investigate the underlying mechanism of the spot phenotype, MITF gene was used as cell type marker of melanocyte progenitor cells while TYR and TYRP1 gene were used as cell type markers of mature melanocytes. Transcripts of MITF, TYR and TYRP1 gene with expected size were identified in all pigmented skin tissues while PCR products were not obtained from non-pigmented skin tissues. It was inferred that melanocytes are absent in non-pigmented skin tissues of spot ducks. PMID:25955279

  1. Exploring synonymous codon usage preferences of disulfide-bonded and non-disulfide bonded cysteines in the E. coli genome.

    PubMed

    Song, Jiangning; Wang, Minglei; Burrage, Kevin

    2006-07-21

    High-quality data about protein structures and their gene sequences are essential to the understanding of the relationship between protein folding and protein coding sequences. Firstly we constructed the EcoPDB database, which is a high-quality database of Escherichia coli genes and their corresponding PDB structures. Based on EcoPDB, we presented a novel approach based on information theory to investigate the correlation between cysteine synonymous codon usages and local amino acids flanking cysteines, the correlation between cysteine synonymous codon usages and synonymous codon usages of local amino acids flanking cysteines, as well as the correlation between cysteine synonymous codon usages and the disulfide bonding states of cysteines in the E. coli genome. The results indicate that the nearest neighboring residues and their synonymous codons of the C-terminus have the greatest influence on the usages of the synonymous codons of cysteines and the usage of the synonymous codons has a specific correlation with the disulfide bond formation of cysteines in proteins. The correlations may result from the regulation mechanism of protein structures at gene sequence level and reflect the biological function restriction that cysteines pair to form disulfide bonds. The results may also be helpful in identifying residues that are important for synonymous codon selection of cysteines to introduce disulfide bridges in protein engineering and molecular biology. The approach presented in this paper can also be utilized as a complementary computational method and be applicable to analyse the synonymous codon usages in other model organisms.

  2. Genome Features of “Dark-Fly”, a Drosophila Line Reared Long-Term in a Dark Environment

    PubMed Central

    Zhou, Jun; Sugiyama, Yuzo; Nishimura, Osamu; Aizu, Tomoyuki; Toyoda, Atsushi; Fujiyama, Asao; Agata, Kiyokazu

    2012-01-01

    Organisms are remarkably adapted to diverse environments by specialized metabolisms, morphology, or behaviors. To address the molecular mechanisms underlying environmental adaptation, we have utilized a Drosophila melanogaster line, termed “Dark-fly”, which has been maintained in constant dark conditions for 57 years (1400 generations). We found that Dark-fly exhibited higher fecundity in dark than in light conditions, indicating that Dark-fly possesses some traits advantageous in darkness. Using next-generation sequencing technology, we determined the whole genome sequence of Dark-fly and identified approximately 220,000 single nucleotide polymorphisms (SNPs) and 4,700 insertions or deletions (InDels) in the Dark-fly genome compared to the genome of the Oregon-R-S strain, a control strain. 1.8% of SNPs were classified as non-synonymous SNPs (nsSNPs: i.e., they alter the amino acid sequence of gene products). Among them, we detected 28 nonsense mutations (i.e., they produce a stop codon in the protein sequence) in the Dark-fly genome. These included genes encoding an olfactory receptor and a light receptor. We also searched runs of homozygosity (ROH) regions as putative regions selected during the population history, and found 21 ROH regions in the Dark-fly genome. We identified 241 genes carrying nsSNPs or InDels in the ROH regions. These include a cluster of alpha-esterase genes that are involved in detoxification processes. Furthermore, analysis of structural variants in the Dark-fly genome showed the deletion of a gene related to fatty acid metabolism. Our results revealed unique features of the Dark-fly genome and provided a list of potential candidate genes involved in environmental adaptation. PMID:22432011

  3. PolyTB: a genomic variation map for Mycobacterium tuberculosis.

    PubMed

    Coll, Francesc; Preston, Mark; Guerra-Assunção, José Afonso; Hill-Cawthorn, Grant; Harris, David; Perdigão, João; Viveiros, Miguel; Portugal, Isabel; Drobniewski, Francis; Gagneux, Sebastien; Glynn, Judith R; Pain, Arnab; Parkhill, Julian; McNerney, Ruth; Martin, Nigel; Clark, Taane G

    2014-05-01

    Tuberculosis (TB) caused by Mycobacterium tuberculosis (Mtb) is the second major cause of death from an infectious disease worldwide. Recent advances in DNA sequencing are leading to the ability to generate whole genome information in clinical isolates of M. tuberculosis complex (MTBC). The identification of informative genetic variants such as phylogenetic markers and those associated with drug resistance or virulence will help barcode Mtb in the context of epidemiological, diagnostic and clinical studies. Mtb genomic datasets are increasingly available as raw sequences, which are potentially difficult and computer intensive to process, and compare across studies. Here we have processed the raw sequence data (>1500 isolates, eight studies) to compile a catalogue of SNPs (n = 74,039, 63% non-synonymous, 51.1% in more than one isolate, i.e. non-private), small indels (n = 4810) and larger structural variants (n = 800). We have developed the PolyTB web-based tool (http://pathogenseq.lshtm.ac.uk/polytb) to visualise the resulting variation and important meta-data (e.g. in silico inferred strain-types, location) within geographical map and phylogenetic views. This resource will allow researchers to identify polymorphisms within candidate genes of interest, as well as examine the genomic diversity and distribution of strains. PolyTB source code is freely available to researchers wishing to develop similar tools for their pathogen of interest. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.

  4. Protein functional features are reflected in the patterns of mRNA translation speed.

    PubMed

    López, Daniel; Pazos, Florencio

    2015-07-09

    The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency are also playing a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide-polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This support the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.

  5. Contrasting association of a non-synonymous leptin receptor gene polymorphism with Wegener's granulomatosis and Churg-Strauss syndrome.

    PubMed

    Wieczorek, Stefan; Holle, Julia U; Bremer, Jan P; Wibisono, David; Moosig, Frank; Fricke, Harald; Assmann, Gunter; Harper, Lorraine; Arning, Larissa; Gross, Wolfgang L; Epplen, Joerg T

    2010-05-01

    There is evidence that the leptin/ghrelin system is involved in T-cell regulation and plays a role in (auto)immune disorders such as SLE, RA and ANCA-associated vasculitides (AAVs). Here, we evaluate the genetic background of this system in WG. We screened variations in the genes encoding leptin, ghrelin and their receptors, the leptin receptor (LEPR) and the growth hormone secretagogue receptor (GHSR). Three single nucleotide polymorphisms (SNPs) in each gene region were analysed in 460 German WG cases and 878 ethnically matched healthy controls. A three-SNP haplotype of GHSR was significantly associated with WG [P = 0.0067; corrected P-value (P(c)) = 0.026; odds ratio (OR) = 1.30; 95% CI 1.08, 1.57], as was one non-synonymous SNP in LEPR (Lys656Asn, P = 0.0034; P(c) = 0.013; OR = 0.72; 95% CI 0.58, 0.90). These four SNPs were re-analysed in independent cohorts of 226 German WG cases and 519 controls. While the GHSR association was not confirmed, allele frequencies of the LEPR SNP were virtually identical to those from the initial cohorts. Analysis of this SNP in the combined WG and control panels revealed a significant association of the LEPR 656Lys allele with WG (P = 0.00032; P(c) = 0.0013; OR = 0.72; 95% CI 0.60, 0.86). Remarkably, the Lys656Asn SNP showed contrasting allele distribution in two cohorts of 108 and 88 German cases diagnosed with Churg-Strauss syndrome (CSS, combined P = 0.0067; OR = 1.41; 95% CI 1.10, 1.81), whereas identical allele frequencies were revealed when comparing British WG and microscopic polyangiitis cases. While GHSR has to be further evaluated, these data provide profound evidence for an association of the LEPR Lys656Asn SNP with AAV, resulting in opposing effects in WG and CSS.

  6. Sequence variations of the bovine prion protein gene (PRNP) in native Korean Hanwoo cattle

    PubMed Central

    Choi, Sangho

    2012-01-01

    Bovine spongiform encephalopathy (BSE) is one of the fatal neurodegenerative diseases known as transmissible spongiform encephalopathies (TSEs) caused by infectious prion proteins. Genetic variations correlated with susceptibility or resistance to TSE in humans and sheep have not been reported for bovine strains including those from Holstein, Jersey, and Japanese Black cattle. Here, we investigated bovine prion protein gene (PRNP) variations in Hanwoo cattle [Bos (B.) taurus coreanae], a native breed in Korea. We identified mutations and polymorphisms in the coding region of PRNP, determined their frequency, and evaluated their significance. We identified four synonymous polymorphisms and two non-synonymous mutations in PRNP, but found no novel polymorphisms. The sequence and number of octapeptide repeats were completely conserved, and the haplotype frequency of the coding region was similar to that of other B. taurus strains. When we examined the 23-bp and 12-bp insertion/deletion (indel) polymorphisms in the non-coding region of PRNP, Hanwoo cattle had a lower deletion allele and 23-bp del/12-bp del haplotype frequency than healthy and BSE-affected animals of other strains. Thus, Hanwoo are seemingly less susceptible to BSE than other strains due to the 23-bp and 12-bp indel polymorphisms. PMID:22705734

  7. Microarray-based SNP genotyping to identify genetic risk factors of triple-negative breast cancer (TNBC) in South Indian population.

    PubMed

    Aravind Kumar, M; Singh, Vineeta; Naushad, Shaik Mohammad; Shanker, Uday; Lakshmi Narasu, M

    2018-05-01

    In the view of aggressive nature of Triple-Negative Breast cancer (TNBC) due to the lack of receptors (ER, PR, HER2) and high incidence of drug resistance associated with it, a case-control association study was conducted to identify the contributing genetic risk factors for Triple-negative breast cancer (TNBC). A total of 30 TNBC patients and 50 age and gender-matched controls of Indian origin were screened for 9,00,000 SNP markers using microarray-based SNP genotyping approach. The initial PLINK association analysis (p < 0.01, MAF 0.14-0.44, OR 10-24) identified 28 non-synonymous SNPs and one stop gain mutation in the exonic region as possible determinants of TNBC risk. All the 29 SNPs were annotated using ANNOVAR. The interactions between these markers were evaluated using Multifactor dimensionality reduction (MDR) analysis. The interactions were in the following order: exm408776 > exm1278309 > rs316389 > rs1651654 > rs635538 > exm1292477. Recursive partitioning analysis (RPA) was performed to construct decision tree useful in predicting TNBC risk. As shown in this analysis, rs1651654 and exm585172 SNPs are found to be determinants of TNBC risk. Artificial neural network model was used to generate the Receiver operating characteristic curves (ROC), which showed high sensitivity and specificity (AUC-0.94) of these markers. To conclude, among the 9,00,000 SNPs tested, CCDC42 exm1292477, ANXA3 exm408776, SASH1 exm585172 are found to be the most significant genetic predicting factors for TNBC. The interactions among exm408776, exm1278309, rs316389, rs1651654, rs635538, exm1292477 SNPs inflate the risk for TNBC further. Targeted analysis of these SNPs and genes alone also will have similar clinical utility in predicting TNBC.

  8. Complete mitochondrial genome sequence from an endangered Indian snake, Python molurus molurus (Serpentes, Pythonidae).

    PubMed

    Dubey, Bhawna; Meganathan, P R; Haque, Ikramul

    2012-07-01

    This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising of 37 genes including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G% on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes, with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.

  9. Explaining the disease phenotype of intergenic SNP through predicted long range regulation

    PubMed Central

    Chen, Jingqi; Tian, Weidong

    2016-01-01

    Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or approximate to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it's likely that IGR daSNPs may contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. PMID:27280978

  10. Genomic and transcriptome analyses reveal that MAPK- and phosphatidylinositol-signaling pathways mediate tolerance to 5-hydroxymethyl-2-furaldehyde for industrial yeast Saccharomyces cerevisiae

    PubMed Central

    Zhou, Qian; Liu, Z. Lewis; Ning, Kang; Wang, Anhui; Zeng, Xiaowei; Xu, Jian

    2014-01-01

    The industrial yeast Saccharomyces cerevisiae is a traditional ethanologenic agent and a promising biocatalyst for advanced biofuels production using lignocellulose mateials. Here we present the genomic background of type strain NRRL Y-12632 and its transcriptomic response to 5-hydroxymethyl-2-furaldehyde (HMF), a commonly encountered toxic compound liberated from lignocellulosic-biomass pretreatment, in dissecting the genomic mechanisms of yeast tolerance. Compared with the genome of laboratory model strain S288C, we identified more than 32,000 SNPs in Y-12632 with 23,000 missense and nonsense SNPs. Enriched sequence mutations occurred for genes involved in MAPK- and phosphatidylinositol (PI)- signaling pathways in strain Y-12632, with 41 and 13 genes containing non-synonymous SNPs, respectively. Many of these mutated genes displayed consistent up-regulated signature expressions in response to challenges of 30 mM HMF. Analogous single-gene deletion mutations of these genes showed significantly sensitive growth response on a synthetic medium containing 20 mM HMF. Our results suggest at least three MAPK-signaling pathways, especially for the cell-wall integrity pathway, and PI-signaling pathways to be involved in mediation of yeast tolerance against HMF in industrial yeast Saccharomyces cerevisiae. Higher levels of sequence variations were also observed for genes involved in purine and pyrimidine metabolism pathways. PMID:25296911

  11. A genomic scale map of genetic diversity in Trypanosoma cruzi

    PubMed Central

    2012-01-01

    Background Trypanosoma cruzi, the causal agent of Chagas Disease, affects more than 16 million people in Latin America. The clinical outcome of the disease results from a complex interplay between environmental factors and the genetic background of both the human host and the parasite. However, knowledge of the genetic diversity of the parasite, is currently limited to a number of highly studied loci. The availability of a number of genomes from different evolutionary lineages of T. cruzi provides an unprecedented opportunity to look at the genetic diversity of the parasite at a genomic scale. Results Using a bioinformatic strategy, we have clustered T. cruzi sequence data available in the public domain and obtained multiple sequence alignments in which one or two alleles from the reference CL-Brener were included. These data covers 4 major evolutionary lineages (DTUs): TcI, TcII, TcIII, and the hybrid TcVI. Using these set of alignments we have identified 288,957 high quality single nucleotide polymorphisms and 1,480 indels. In a reduced re-sequencing study we were able to validate ~ 97% of high-quality SNPs identified in 47 loci. Analysis of how these changes affect encoded protein products showed a 0.77 ratio of synonymous to non-synonymous changes in the T. cruzi genome. We observed 113 changes that introduce or remove a stop codon, some causing significant functional changes, and a number of tri-allelic and tetra-allelic SNPs that could be exploited in strain typing assays. Based on an analysis of the observed nucleotide diversity we show that the T. cruzi genome contains a core set of genes that are under apparent purifying selection. Interestingly, orthologs of known druggable targets show statistically significant lower nucleotide diversity values. Conclusions This study provides the first look at the genetic diversity of T. cruzi at a genomic scale. The analysis covers an estimated ~ 60% of the genetic diversity present in the population, providing an essential resource for future studies on the development of new drugs and diagnostics, for Chagas Disease. These data is available through the TcSNP database (http://snps.tcruzi.org). PMID:23270511

  12. Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analysis analyses of pooled RNA-Seq samples in rainbow trout

    USDA-ARS?s Scientific Manuscript database

    Coding/functional SNPs change the biological function of a gene and, therefore, could serve as “large-effect” genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, mus...

  13. Large-scale analyses of synonymous substitution rates can be sensitive to assumptions about the process of mutation.

    PubMed

    Aris-Brosou, Stéphane; Bielawski, Joseph P

    2006-08-15

    A popular approach to examine the roles of mutation and selection in the evolution of genomes has been to consider the relationship between codon bias and synonymous rates of molecular evolution. A significant relationship between these two quantities is taken to indicate the action of weak selection on substitutions among synonymous codons. The neutral theory predicts that the rate of evolution is inversely related to the level of functional constraint. Therefore, selection against the use of non-preferred codons among those coding for the same amino acid should result in lower rates of synonymous substitution as compared with sites not subject to such selection pressures. However, reliably measuring the extent of such a relationship is problematic, as estimates of synonymous rates are sensitive to our assumptions about the process of molecular evolution. Previous studies showed the importance of accounting for unequal codon frequencies, in particular when synonymous codon usage is highly biased. Yet, unequal codon frequencies can be modeled in different ways, making different assumptions about the mutation process. Here we conduct a simulation study to evaluate two different ways of modeling uneven codon frequencies and show that both model parameterizations can have a dramatic impact on rate estimates and affect biological conclusions about genome evolution. We reanalyze three large data sets to demonstrate the relevance of our results to empirical data analysis.

  14. SNP discovery in common bean by restriction-associated DNA (RAD) sequencing for genetic diversity and population structure analysis.

    PubMed

    Valdisser, Paula Arielle M R; Pappas, Georgios J; de Menezes, Ivandilson P P; Müller, Bárbara S F; Pereira, Wendell J; Narciso, Marcelo G; Brondani, Claudio; Souza, Thiago L P O; Borba, Tereza C O; Vianello, Rosana P

    2016-06-01

    Researchers have made great advances into the development and application of genomic approaches for common beans, creating opportunities to driving more real and applicable strategies for sustainable management of the genetic resource towards plant breeding. This work provides useful polymorphic single-nucleotide polymorphisms (SNPs) for high-throughput common bean genotyping developed by RAD (restriction site-associated DNA) sequencing. The RAD tags were generated from DNA pooled from 12 common bean genotypes, including breeding lines of different gene pools and market classes. The aligned sequences identified 23,748 putative RAD-SNPs, of which 3357 were adequate for genotyping; 1032 RAD-SNPs with the highest ADT (assay design tool) score are presented in this article. The RAD-SNPs were structurally annotated in different coding (47.00 %) and non-coding (53.00 %) sequence components of genes. A subset of 384 RAD-SNPs with broad genome distribution was used to genotype a diverse panel of 95 common bean germplasms and revealed a successful amplification rate of 96.6 %, showing 73 % of polymorphic SNPs within the Andean group and 83 % in the Mesoamerican group. A slightly increased He (0.161, n = 21) value was estimated for the Andean gene pool, compared to the Mesoamerican group (0.156, n = 74). For the linkage disequilibrium (LD) analysis, from a group of 580 SNPs (289 RAD-SNPs and 291 BARC-SNPs) genotyped for the same set of genotypes, 70.2 % were in LD, decreasing to 0.10 %in the Andean group and 0.77 % in the Mesoamerican group. Haplotype patterns spanning 310 Mb of the genome (60 %) were characterized in samples from different origins. However, the haplotype frameworks were under-represented for the Andean (7.85 %) and Mesoamerican (5.55 %) gene pools separately. In conclusion, RAD sequencing allowed the discovery of hundreds of useful SNPs for broad genetic analysis of common bean germplasm. From now, this approach provides an excellent panel of molecular tools for whole genome analysis, allowing integrating and better exploring the common bean breeding practices.

  15. Two candidate genes (FTO and INSIG2) for fat accumulation in four canids: chromosome mapping, gene polymorphisms and association studies of body and skin weight of red foxes.

    PubMed

    Grzes, M; Szczerbal, I; Fijak-Nowak, H; Szydlowski, M; Switonski, M

    2011-01-01

    Fat accumulation is a polygenic trait which has a significant impact on human health and animal production. Obesity is also an increasingly serious problem in dog breeding. The FTO and INSIG2 are considered as candidate genes associated with predisposition for human obesity. In this report we present a comparative genomic analysis of these 2 genes in 4 species belonging to the family Canidae - the dog and 3 species which are kept in captivity for fur production, i.e. red fox, arctic fox and Chinese raccoon dog. We cytogenetically mapped these 2 loci by FISH and compared the entire coding sequence of INSIG2 and a fragment of the coding sequence of FTO. The FTO gene was assigned to the following chromosomes: CFA2q25 (dog), VVU2q21 (red fox), ALA8q25 (arctic fox) and NPP10q24-25 (Chinese raccoon dog), while the INSIG2 was mapped to CFA19q17, VVU5p14, ALA24q15 and NPP9q22, respectively. Altogether, 29 SNPs were identified (16 in INSIG2 and 13 in FTO) and among them 2 were missense substitutions in the dog (23C/T, Thr>Met in the FTO gene and 40C/A, Arg>Ser in INSIG2). The distribution of these 2 SNPs was studied in 14 dog breeds. Two synonymous SNPs, one in the FTO gene (-28T>C in the 5'-flanking region) and one in the INSIG2 (10175C>T in intron 2), were used for the association studies in red foxes (n = 390) and suggestive evidence was observed for their association with body weight (FTO, p < 0.08) and weight of raw skin (INSIG2, p < 0.05). These associations indicate that both genes are potential candidates for growth or adipose tissue accumulation in canids. We also suggest that the 2 missense substitutions found in dogs should be studied in terms of genetic predisposition to obesity. Copyright © 2011 S. Karger AG, Basel.

  16. Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

    PubMed Central

    Imanishi, Tadashi; Itoh, Takeshi; Suzuki, Yutaka; O'Donovan, Claire; Fukuchi, Satoshi; Koyanagi, Kanako O; Barrero, Roberto A; Tamura, Takuro; Yamaguchi-Kabata, Yumi; Tanino, Motohiko; Yura, Kei; Miyazaki, Satoru; Ikeo, Kazuho; Homma, Keiichi; Kasprzyk, Arek; Nishikawa, Tetsuo; Hirakawa, Mika; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Ashurst, Jennifer; Jia, Libin; Nakao, Mitsuteru; Thomas, Michael A; Mulder, Nicola; Karavidopoulou, Youla; Jin, Lihua; Kim, Sangsoo; Yasuda, Tomohiro; Lenhard, Boris; Eveno, Eric; Suzuki, Yoshiyuki; Yamasaki, Chisato; Takeda, Jun-ichi; Gough, Craig; Hilton, Phillip; Fujii, Yasuyuki; Sakai, Hiroaki; Tanaka, Susumu; Amid, Clara; Bellgard, Matthew; Bonaldo, Maria de Fatima; Bono, Hidemasa; Bromberg, Susan K; Brookes, Anthony J; Bruford, Elspeth; Carninci, Piero; Chelala, Claude; Couillault, Christine; de Souza, Sandro J.; Debily, Marie-Anne; Devignes, Marie-Dominique; Dubchak, Inna; Endo, Toshinori; Estreicher, Anne; Eyras, Eduardo; Fukami-Kobayashi, Kaoru; R. Gopinath, Gopal; Graudens, Esther; Hahn, Yoonsoo; Han, Michael; Han, Ze-Guang; Hanada, Kousuke; Hanaoka, Hideki; Harada, Erimi; Hashimoto, Katsuyuki; Hinz, Ursula; Hirai, Momoki; Hishiki, Teruyoshi; Hopkinson, Ian; Imbeaud, Sandrine; Inoko, Hidetoshi; Kanapin, Alexander; Kaneko, Yayoi; Kasukawa, Takeya; Kelso, Janet; Kersey, Paul; Kikuno, Reiko; Kimura, Kouichi; Korn, Bernhard; Kuryshev, Vladimir; Makalowska, Izabela; Makino, Takashi; Mano, Shuhei; Mariage-Samson, Regine; Mashima, Jun; Matsuda, Hideo; Mewes, Hans-Werner; Minoshima, Shinsei; Nagai, Keiichi; Nagasaki, Hideki; Nagata, Naoki; Nigam, Rajni; Ogasawara, Osamu; Ohara, Osamu; Ohtsubo, Masafumi; Okada, Norihiro; Okido, Toshihisa; Oota, Satoshi; Ota, Motonori; Ota, Toshio; Otsuki, Tetsuji; Piatier-Tonneau, Dominique; Poustka, Annemarie; Ren, Shuang-Xi; Saitou, Naruya; Sakai, Katsunaga; Sakamoto, Shigetaka; Sakate, Ryuichi; Schupp, Ingo; Servant, Florence; Sherry, Stephen; Shiba, Rie; Shimizu, Nobuyoshi; Shimoyama, Mary; Simpson, Andrew J; Soares, Bento; Steward, Charles; Suwa, Makiko; Suzuki, Mami; Takahashi, Aiko; Tamiya, Gen; Tanaka, Hiroshi; Taylor, Todd; Terwilliger, Joseph D; Unneberg, Per; Veeramachaneni, Vamsi; Watanabe, Shinya; Wilming, Laurens; Yasuda, Norikazu; Yoo, Hyang-Sook; Stodolsky, Marvin; Makalowski, Wojciech; Go, Mitiko; Nakai, Kenta; Takagi, Toshihisa; Kanehisa, Minoru; Sakaki, Yoshiyuki; Quackenbush, John; Okazaki, Yasushi; Hayashizaki, Yoshihide; Hide, Winston; Chakraborty, Ranajit; Nishikawa, Ken; Sugawara, Hideaki; Tateno, Yoshio; Chen, Zhu; Oishi, Michio; Tonellato, Peter; Apweiler, Rolf; Okubo, Kousaku; Wagner, Lukas; Wiemann, Stefan; Strausberg, Robert L; Isogai, Takao; Auffray, Charles; Nomura, Nobuo; Sugano, Sumio

    2004-01-01

    The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology. PMID:15103394

  17. Genetic Architecture of Natural Variation in Rice Nonphotochemical Quenching Capacity Revealed by Genome-Wide Association Study

    PubMed Central

    Wang, Quanxiu; Zhao, Hu; Jiang, Junpeng; Xu, Jiuyue; Xie, Weibo; Fu, Xiangkui; Liu, Chang; He, Yuqing; Wang, Gongwei

    2017-01-01

    The photoprotective processes conferred by nonphotochemical quenching (NPQ) serve fundamental roles in maintaining plant fitness and sustainable yield. So far, few loci have been reported to be involved in natural variation of NPQ capacity in rice (Oryza sativa), and the extents of variation explored are very limited. Here we conducted a genome-wide association study (GWAS) for NPQ capacity using a diverse worldwide collection of 529 O. sativa accessions. A total of 33 significant association loci were identified. To check the validity of the GWAS signals, three F2 mapping populations with parents selected from the association panel were constructed and assayed. All QTLs detected in mapping populations could correspond to at least one GWAS signal, indicating the GWAS results were quite reliable. OsPsbS1 was repeatedly detected and explained more than 40% of the variation in the whole association population in two years, and demonstrated to be a common major QTL in all three mapping populations derived from inter-group crosses. We revealed 43 single nucleotide polymorphisms (SNPs) and 7 insertions and deletions (InDels) within a 6,997-bp DNA fragment of OsPsbS1, but found no non-synonymous SNPs or InDels in the coding region, indicating the PsbS1 protein sequence is highly conserved. Haplotypes with the 2,674-bp insertion in the promoter region exhibited significantly higher NPQ values and higher expression levels of OsPsbS1. The OsPsbS1 RNAi plants and CRISPR/Cas9 mutants exhibited drastically decreased NPQ values. OsPsbS1 had specific and high-level expression in green tissues of rice. However, we didn't find significant function for OsPsbS2, the other rice PsbS homologue. Manipulation of the significant loci or candidate genes identified may enhance photoprotection and improve photosynthesis and yield in rice. PMID:29081789

  18. Genetic Architecture of Natural Variation in Rice Nonphotochemical Quenching Capacity Revealed by Genome-Wide Association Study.

    PubMed

    Wang, Quanxiu; Zhao, Hu; Jiang, Junpeng; Xu, Jiuyue; Xie, Weibo; Fu, Xiangkui; Liu, Chang; He, Yuqing; Wang, Gongwei

    2017-01-01

    The photoprotective processes conferred by nonphotochemical quenching (NPQ) serve fundamental roles in maintaining plant fitness and sustainable yield. So far, few loci have been reported to be involved in natural variation of NPQ capacity in rice ( Oryza sativa ), and the extents of variation explored are very limited. Here we conducted a genome-wide association study (GWAS) for NPQ capacity using a diverse worldwide collection of 529 O. sativa accessions. A total of 33 significant association loci were identified. To check the validity of the GWAS signals, three F2 mapping populations with parents selected from the association panel were constructed and assayed. All QTLs detected in mapping populations could correspond to at least one GWAS signal, indicating the GWAS results were quite reliable. OsPsbS1 was repeatedly detected and explained more than 40% of the variation in the whole association population in two years, and demonstrated to be a common major QTL in all three mapping populations derived from inter-group crosses. We revealed 43 single nucleotide polymorphisms (SNPs) and 7 insertions and deletions (InDels) within a 6,997-bp DNA fragment of OsPsbS1 , but found no non-synonymous SNPs or InDels in the coding region, indicating the PsbS1 protein sequence is highly conserved. Haplotypes with the 2,674-bp insertion in the promoter region exhibited significantly higher NPQ values and higher expression levels of OsPsbS1 . The OsPsbS1 RNAi plants and CRISPR/Cas9 mutants exhibited drastically decreased NPQ values. OsPsbS1 had specific and high-level expression in green tissues of rice. However, we didn't find significant function for OsPsbS2 , the other rice PsbS homologue. Manipulation of the significant loci or candidate genes identified may enhance photoprotection and improve photosynthesis and yield in rice.

  19. Rare versus common variants in pharmacogenetics: SLCO1B1 variation and methotrexate disposition

    PubMed Central

    Ramsey, Laura B.; Bruun, Gitte H.; Yang, Wenjian; Treviño, Lisa R.; Vattathil, Selina; Scheet, Paul; Cheng, Cheng; Rosner, Gary L.; Giacomini, Kathleen M.; Fan, Yiping; Sparreboom, Alex; Mikkelsen, Torben S.; Corydon, Thomas J.; Pui, Ching-Hon; Evans, William E.; Relling, Mary V.

    2012-01-01

    Methotrexate is used to treat autoimmune diseases and malignancies, including acute lymphoblastic leukemia (ALL). Inter-individual variation in clearance of methotrexate results in heterogeneous systemic exposure, clinical efficacy, and toxicity. In a genome-wide association study of children with ALL, we identified SLCO1B1 as harboring multiple common polymorphisms associated with methotrexate clearance. The extent of influence of rare versus common variants on pharmacogenomic phenotypes remains largely unexplored. We tested the hypothesis that rare variants in SLCO1B1 could affect methotrexate clearance and compared the influence of common versus rare variants in addition to clinical covariates on clearance. From deep resequencing of SLCO1B1 exons in 699 children, we identified 93 SNPs, 15 of which were non-synonymous (NS). Three of these NS SNPs were common, with a minor allele frequency (MAF) >5%, one had low frequency (MAF 1%–5%), and 11 were rare (MAF <1%). NS SNPs (common or rare) predicted to be functionally damaging were more likely to be found among patients with the lowest methotrexate clearance than patients with high clearance. We verified lower function in vitro of four SLCO1B1 haplotypes that were associated with reduced methotrexate clearance. In a multivariate stepwise regression analysis adjusting for other genetic and non-genetic covariates, SLCO1B1 variants accounted for 10.7% of the population variability in clearance. Of that variability, common NS variants accounted for the majority, but rare damaging NS variants constituted 17.8% of SLCO1B1's effects (1.9% of total variation) and had larger effect sizes than common NS variants. Our results show that rare variants are likely to have an important effect on pharmacogenetic phenotypes. PMID:22147369

  20. Explaining the disease phenotype of intergenic SNP through predicted long range regulation.

    PubMed

    Chen, Jingqi; Tian, Weidong

    2016-10-14

    Thousands of disease-associated SNPs (daSNPs) are located in intergenic regions (IGR), making it difficult to understand their association with disease phenotypes. Recent analysis found that non-coding daSNPs were frequently located in or approximate to regulatory elements, inspiring us to try to explain the disease phenotypes of IGR daSNPs through nearby regulatory sequences. Hence, after locating the nearest distal regulatory element (DRE) to a given IGR daSNP, we applied a computational method named INTREPID to predict the target genes regulated by the DRE, and then investigated their functional relevance to the IGR daSNP's disease phenotypes. 36.8% of all IGR daSNP-disease phenotype associations investigated were possibly explainable through the predicted target genes, which were enriched with, were functionally relevant to, or consisted of the corresponding disease genes. This proportion could be further increased to 60.5% if the LD SNPs of daSNPs were also considered. Furthermore, the predicted SNP-target gene pairs were enriched with known eQTL/mQTL SNP-gene relationships. Overall, it's likely that IGR daSNPs may contribute to disease phenotypes by interfering with the regulatory function of their nearby DREs and causing abnormal expression of disease genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. Polymorphisms in the Myostatin-1 gene and their association with growth traits in Ancherythroculter nigrocauda

    NASA Astrophysics Data System (ADS)

    Sun, Yanhong; Li, Qing; Wang, Guiying; Zhu, Dongmei; Chen, Jian; Li, Pei; Tong, Jingou

    2017-05-01

    Myostatin ( MSTN) is a member of the transforming growth factor-β gene superfamily that negatively regulates skeletal muscle development and growth. In the present study, partial genomic fragments of Myostatin-1 ( MSTN-1) in two commercial hatchery populations of Ancherythroculter nigrocauda, an economically important freshwater fish, were screened for single nucleotide polymorphisms (SNPs) and then genotyped by direct sequencing of PCR products. Five SNPs were identified in intron 1 and exon 2, including a non-synonymous mutation causing an amino acid change (Val to Ile) at position 180. Association analyses based on 300 individuals revealed that the g.1129T>C SNP locus was significantly associated with total length (TL), body length (BL), body height (BH) and body weight (BW) in 6- and 18-month-old populations, while the g.1289G>A locus was significantly associated with BH and BW in the 6-month-old population. Haplotype analyses revealed that fish with the genotype combinations TC/TC or TC/GA showed better growth performance. Our results suggest that g.1129T>C and g.1289G>A have positive effects on growth traits and may be candidate gene markers for marker-assisted selection in A. nigrocauda.

  2. [Medicinal plant DNA marker assisted breeding (Ⅱ) the assistant identification of SNPs assisted identification and breeding research of high yield Perilla frutescens new variety].

    PubMed

    Shen, Qi; Zhang, Dong; Sun, Wei; Zhang, Yu-Jun; Shang, Zhi-Wei; Chen, Shi-Lin

    2017-05-01

    Perilla frutescens is one of 60 kinds of food and medicine plants in the initial directory announced by health ministry of China. With the development of Perilla domain in recent , the breeding and application of good varieties has become the main bottleneck of its development. This study reported that applied to the system selection, add to marker-assisted method to breed perilla varieties. Through the whole genome sequencing and consistency matching, annotated the mutation locus according to genome data, and comparison analysis with Perilla common variants database, finally selected 30 non-synonymous mutation SNPs used as characteristic markers of Zhongyan Feishu No.1. those SNP marker were used as chosen standard of Perilla varieties. Finally breeding new perilla variety Zhongyan Feishu No.1, which possess to characters of the leaf and seed dual-used, high yield, high resistance, and could used to green fertilizer. The Zhongyan Feishu No.1 acquired the plant new varieties identification of Beijing city , the identification numbers is 2016054. Marker assisted identification guide new varieties breeding in plants, which can provide a new reference for breeding of medicinal plants. Copyright© by the Chinese Pharmaceutical Association.

  3. Genetic variants of the MAVS, MITA and MFN2 genes are not associated with leprosy in Han Chinese from Southwest China.

    PubMed

    Wang, Dong; Li, Guo-Dong; Zhang, Deng-Feng; Xu, Ling; Li, Xiao-An; Yu, Xiu-Feng; Long, Heng; Li, Yu-Ye; Yao, Yong-Gang

    2016-11-01

    Leprosy is a chronic infectious disease caused by Mycobacterium leprae (M. leprae), which has massive genomic decay and dependence on host metabolism. Accumulating evidence showed a crucial role of mitochondria in metabolism and innate immunity. We hypothesized that the mitochondrial-related antimicrobial/antiviral immune genes MAVS (mitochondrial antiviral signaling protein), MITA (mediator of IRF3 activation) and MFN2 (mitofusin 2) would confer a risk to leprosy. In this study, we performed a case-control study to analyze 11 tag and/or non-synonymous SNPs of the MAVS, MITA and MFN2 genes in 527 leprosy patients and 583 healthy individuals, and directly sequenced the three genes in 80 leprosy patients with a family history from Yunnan, Southwest China. We found no association between these SNPs and leprosy (including its subtypes) based on the frequencies of alleles, genotypes and haplotypes between the cases and controls. There was also no enrichment of potential pathogenic variants of the three genes in leprosy patients. Our results suggested that genetic variants of the MAVS, MITA and MFN2 genes might not affect the susceptibility to leprosy. Copyright © 2016 Elsevier B.V. All rights reserved.

  4. Characteristic mutations found in the ML0411 gene of Mycobacterium leprae isolated in Northeast Asian countries.

    PubMed

    Kai, M; Nakata, N; Matsuoka, M; Sekizuka, T; Kuroda, M; Makino, M

    2013-10-01

    Genome analysis of Mycobacterium leprae strain Kyoto-2 in this study revealed characteristic nucleotide substitutions in gene ML0411, compared to the reference genome M. leprae strain TN. The ML0411 gene of Kyoto-2 had six SNPs compared to that of TN. All SNPs in ML0411 were non-synonymous mutations that result in amino acid replacements. In addition, a seventh SNP was found 41 bp upstream of the start codon in the regulatory region. The seven SNP sites in the ML0411 region were investigated by sequencing in 36 M. leprae isolates from the Leprosy Research Center in Japan. The SNP pattern in 14 of the 36 isolates showed similarity to that of Kyoto-2. Determination of the standard SNP types within the 36 stocked isolates revealed that almost all of the Japanese strains belonged to SNP type III, with nucleotide substitutions at position 14676, 164275, and 2935685 of the M. leprae TN genome. The geographical distribution pattern of east Asian M. leprae isolates by discrimination of ML0411 SNPs was investigated and interestingly turned out to be similar to that of tandem repeat numbers of GACATC in the rpoT gene (3 copies or 4 copies), which has been established as a tool for M. leprae genotyping. All seven Korean M. leprae isolates examined in this study, as well as those derived from Honshu Island of Japan, showed 4 copies of the 6-base tandem repeat plus the ML0411 SNPs observed in M. leprae Kyoto-2. They are termed Northeast Asian (NA) strain of M. leprae. On the other hand, many of isolates derived from the Okinawa Islands of Japan and from the Philippines showed 3 copies of the 6-base tandem repeat in addition to the M. leprae TN ML0411 type of SNPs. These results demonstrate the existence of M. leprae strains in Northeast Asian region having characteristic SNP patterns. Copyright © 2013 Elsevier B.V. All rights reserved.

  5. Common Variants in Cardiac Ion Channel Genes are Associated with Sudden Cardiac Death

    PubMed Central

    Albert, Christine M.; MacRae, Calum A.; Chasman, Daniel I.; VanDenburgh, Martin; Buring, Julie E; Manson, JoAnn E; Cook, Nancy R; Newton-Cheh, Christopher

    2010-01-01

    Background Rare variants in cardiac ion channel genes are associated with sudden cardiac death (SCD) in rare primary arrhythmic syndromes; however, it is unknown whether common variation in these same genes may contribute to SCD risk at the population level. Methods and Results We examined the association between 147 single nucleotide polymorphisms (SNPs) (137 tag, 5 non-coding SNPs associated with QT interval duration and 5 nonsynonymous SNPs) in 5 cardiac ion channel genes, KCNQ1, KCNH2, SCN5A, KCNE1 and KCNE2 and sudden and/or arrhythmic death in a combined nested case-control analysis among 516 cases and 1522 matched controls of European ancestry enrolled in six prospective cohort studies. After accounting for multiple testing, two SNPs (rs2283222 located in intron 11 in KCNQ1 and rs11720524 located in intron 1 in SCN5A) remained significantly associated with sudden/arrhythmic death (FDR = 0.01 and 0.03 respectively). Each increasing copy of the major T allele of rs2283222 or the major C allele of rs1172052 was associated with an OR = 1.36 (95% CI 1.16-1.60, P=0.0002) and 1.30 (95% CI 1.12-1.51, P=0.0005) respectively. Control for cardiovascular risk factors and/or limiting the analysis to definite SCDs did not significantly alter these relationships. Conclusion In this combined analysis of 6 prospective cohort studies, two common intronic variants in KCNQ1 and SCN5A were associated with SCD in individuals of European ancestry. Further study in other populations and investigation into the functional abnormalities associated with non-coding variation in these genes may lead to important insights into predisposition to lethal arrhythmias. PMID:20400777

  6. Inheritance-mode specific pathogenicity prioritization (ISPP) for human protein coding genes.

    PubMed

    Hsu, Jacob Shujui; Kwan, Johnny S H; Pan, Zhicheng; Garcia-Barcelo, Maria-Mercè; Sham, Pak Chung; Li, Miaoxin

    2016-10-15

    Exome sequencing studies have facilitated the detection of causal genetic variants in yet-unsolved Mendelian diseases. However, the identification of disease causal genes among a list of candidates in an exome sequencing study is still not fully settled, and it is often difficult to prioritize candidate genes for follow-up studies. The inheritance mode provides crucial information for understanding Mendelian diseases, but none of the existing gene prioritization tools fully utilize this information. We examined the characteristics of Mendelian disease genes under different inheritance modes. The results suggest that Mendelian disease genes with autosomal dominant (AD) inheritance mode are more haploinsufficiency and de novo mutation sensitive, whereas those autosomal recessive (AR) genes have significantly more non-synonymous variants and regulatory transcript isoforms. In addition, the X-linked (XL) Mendelian disease genes have fewer non-synonymous and synonymous variants. As a result, we derived a new scoring system for prioritizing candidate genes for Mendelian diseases according to the inheritance mode. Our scoring system assigned to each annotated protein-coding gene (N = 18 859) three pathogenic scores according to the inheritance mode (AD, AR and XL). This inheritance mode-specific framework achieved higher accuracy (area under curve  = 0.84) in XL mode. The inheritance-mode specific pathogenicity prioritization (ISPP) outperformed other well-known methods including Haploinsufficiency, Recessive, Network centrality, Genic Intolerance, Gene Damage Index and Gene Constraint scores. This systematic study suggests that genes manifesting disease inheritance modes tend to have unique characteristics. ISPP is included in KGGSeq v1.0 (http://grass.cgs.hku.hk/limx/kggseq/), and source code is available from (https://github.com/jacobhsu35/ISPP.git). mxli@hku.hkSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  7. A genetic variant of the NTCP gene is associated with HBV infection status in a Chinese population.

    PubMed

    Yang, Jingmin; Yang, Yuan; Xia, Mingying; Wang, Lianghui; Zhou, Weiping; Yang, Yajun; Jiang, Yueming; Wang, Hongyang; Qian, Ji; Jin, Li; Wang, Xiaofeng

    2016-03-12

    To investigate whether genetic variants of the HBV receptor gene NTCP are associated with HBV infection in the Han Chinese population. We sequenced the entire 23 kb NTCP gene from 111 HBeAg-positive HBsAg carriers (PSE group), 110 HBeAg-negative HBsAg carriers (PS group), and 110 control subjects. Then, we performed association analyses of suggestively significant SNPs with HBV infection in 1075 controls, 1936 PSs and 639 PSEs. In total, 109 rare variants (74 novel) and 38 single nucleotide polymorphisms (SNPs, one novel) were screened. Of the seven non-synonymous rare variants, six were singletons and one was a double hit. All three damaging rare singletons presented exclusively in the PSE group. Of the five SNPs validated in all 3650 subjects, the T allele of rs4646287 was significantly decreased (p = 0.002) in the PS group (10.1%) and PSE group (8.1%) compared to the controls (10.9%) and was decreased to 7.4% in the PSE hepatocellular carcinoma (HCC) subgroup. Additionally, rs4646287-T was associated with a 0.68-fold (95% CI = 0.51-0.89, p = 0.006) decreased risk of PSE compared with the controls. The NTCP mRNA level was lower in HCC tissues in "CT + TT" carriers than in "CC" carriers. We found a genetic variant (rs4646287) located in intron 1 of NTCP that may be associated with increased risk of HBV infection in Han Chinese.

  8. Single-nucleotide polymorphisms and haplotypes of non-coding area in the CP gene are correlated with Parkinson's disease.

    PubMed

    Zhao, Na; Xiao, Jianqiu; Zheng, Zhiyong; Fei, Guoqiang; Zhang, Feng; Jin, Lirong; Zhong, Chunjiu

    2015-04-01

    Our previous studies have demonstrated that ceruloplasmin (CP) dysmetabolism is correlated with Parkinson's disease (PD). However, the causes of decreased serum CP levels in PD patients remain to be clarified. This study aimed to explore the potential association between genetic variants of the CP gene and PD. Clinical features, serum CP levels, and the CP gene (both promoter and coding regions) were analyzed in 60 PD patients and 50 controls. A luciferase reporter system was used to investigate the function of promoter single-nucleotide polymorphisms (SNPs). High-density comparative genomic hybridization microarrays were also used to detect large-scale copy-number variations in CP and an additional 47 genes involved in PD and/or copper/iron metabolism. The frequencies of eight SNPs (one intronic SNP and seven promoter SNPs of the CP gene) and their haplotypes were significantly different between PD patients, especially those with lowered serum CP levels, and controls. However, the luciferase reporter system revealed no significant effect of the risk haplotype on promoter activity of the CP gene. Neither these SNPs nor their haplotypes were correlated with the Hoehn and Yahr staging of PD. The results of this study suggest that common genetic variants of CP are associated with PD and further investigation is needed to explore their functions in PD.

  9. Adjusting for background mutation frequency biases improves the identification of cancer driver genes.

    PubMed

    Evans, Perry; Avey, Stefan; Kong, Yong; Krauthammer, Michael

    2013-09-01

    A common goal of tumor sequencing projects is finding genes whose mutations are selected for during tumor development. This is accomplished by choosing genes that have more non-synonymous mutations than expected from an estimated background mutation frequency. While this background frequency is unknown, it can be estimated using both the observed synonymous mutation frequency and the non-synonymous to synonymous mutation ratio. The synonymous mutation frequency can be determined across all genes or in a gene-specific manner. This choice introduces an interesting trade-off. A gene-specific frequency adjusts for an underlying mutation bias, but is difficult to estimate given missing synonymous mutation counts. Using a genome-wide synonymous frequency is more robust, but is less suited for adjusting biases. Studying four evaluation criteria for identifying genes with high non-synonymous mutation burden (reflecting preferential selection of expressed genes, genes with mutations in conserved bases, genes with many protein interactions, and genes that show loss of heterozygosity), we find that the gene-specific synonymous frequency is superior in the gene expression and protein interaction tests. In conclusion, the use of the gene-specific synonymous mutation frequency is well suited for assessing a gene's non-synonymous mutation burden.

  10. The polymorphisms of LCR, E6, and E7 of HPV-58 isolates in Yunnan, Southwest China.

    PubMed

    Xi, Juemin; Chen, Junying; Xu, Miaoling; Yang, Hongying; Wen, Songjiao; Pan, Yue; Wang, Xiaodan; Ye, Chao; Qiu, Lijuan; Sun, Qiangming

    2018-04-25

    Variations in HPV LCR/E6/E7 have been shown to be associated with the viral persistence and cervical cancer development. So far, there are few reports about the polymorphisms of the HPV-58 LCR/E6/E7 sequences in Southwest China. This study aims to characterize the gene polymorphisms of the HPV-58 LCR/E6/E7 sequences in women of Southwest China, and assess the effects of variations on the immune recognition of viral E6 and E7 antigens. Twelve LCR/E6/E7 of the HPV-58 isolates were amplified and sequenced. A neighbor-joining phylogenetic tree was constructed by MEGA 7.0, followed by the secondary structure prediction of the related proteins using PSIPRED v3.3. The selection pressure acting on the HPV-58 E6 and E7 coding regions was estimated by Bayes empirical Bayes analysis of PAML 4.8. Meanwhile, the MHC class-I and II binding peptides were predicted by the ProPred-I server and ProPred server. The transcription factor binding sites in the HPV-58 LCR were analyzed using the JASPAR database. Twenty nine SNPs (20 in the LCR, 3 in the E6, 6 in the E7) were identified at 27 nucleotide sites across the HPV-58 LCR/E6/E7. From the most variable to the least variable, the nucleotide variations were LCR > E7 > E6. The combinations of all the SNPs resulted in 11 unique sequences, which were clustered into the A lineage (7 belong to A1, 2 belong to A2, and 2 belong to A3). An insertion (TGTCAGTTTCCT) was found between the nucleotide sites 7280 and 7281 in 2 variants, and a deletion (TTTAT) was found between 7429 and 7433 in 1 variant. The most common non-synonymous substitution V77A in the E7 was observed in the sequences encoding the α-helix. 63G in the E7 was determined to be the only one positively selected site in the HPV-58 E6/E7 sequences. Six non-synonymous amino acid substitutions (including S71F and K93 N in the E6, and T20I, G41R, G63S/D, and V77A in the E7) were affecting multiple putative epitopes for both CD4 + and CD8 + T-cells. In the LCR, C7265G and C7266T were the most variable sites and were the potential binding sites for the transcription factor SOX10. These results provide an insight into the intrinsic geographical relatedness and biological differences of the HPV-58 variants, and contribute to further research on the HPV-58 epidemiology, carcinogenesis, and therapeutic vaccine development.

  11. Identification of a splicing variant that regulates type 2 diabetes risk factor CDKAL1 level by a coding-independent mechanism in human.

    PubMed

    Zhou, Bo; Wei, Fan-Yan; Kanai, Narumi; Fujimura, Atsushi; Kaitsuka, Taku; Tomizawa, Kazuhito

    2014-09-01

    Single-nucleotide polymorphisms (SNPs) in CDKAL1 have been associated with the development of type 2 diabetes (T2D). CDKAL1 catalyzes 2-methylthio modification of adenosine at position 37 of tRNA(Lys)(UUU). A deficit of this modification causes aberrant protein synthesis, and is associated with impairment of insulin secretion in both mouse model and human. However, it is unknown whether the T2D-associated SNPs in CDKAL1 are associated with downregulation of CDKAL1 by regulating the gene expression. Here, we report a specific splicing variant of CDKAL1 termed CDKAL1-v1 that is markedly lower in individuals carrying risk SNPs of CDKAL1. Interestingly, CDKAL1-v1 is a non-coding transcript, which regulates the CDKAL1 level by competitive binding to a CDKAL1-targeting miRNA. By direct editing of the genome, we further show that the nucleotides around the SNP regions are critical for the alternative splicing of CDKAL1-v1. These findings reveal that the T2D-associated SNPs in CDKAL1 reduce CDKAL1-v1 levels by impairing splicing, which in turn increases miRNA-mediated suppression of CDKAL1. Our results suggest that CDKAL1-v1-mediated suppression of CDKAL1 might underlie the pathogenesis of T2D in individuals carrying the risk SNPs. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  12. Prediction and analysis of three gene families related to leaf rust (Puccinia triticina) resistance in wheat (Triticum aestivum L.).

    PubMed

    Peng, Fred Y; Yang, Rong-Cai

    2017-06-20

    The resistance to leaf rust (Lr) caused by Puccinia triticina in wheat (Triticum aestivum L.) has been well studied over the past decades with over 70 Lr genes being mapped on different chromosomes and numerous QTLs (quantitative trait loci) being detected or mapped using DNA markers. Such resistance is often divided into race-specific and race-nonspecific resistance. The race-nonspecific resistance can be further divided into resistance to most or all races of the same pathogen and resistance to multiple pathogens. At the molecular level, these three types of resistance may cover across the whole spectrum of pathogen specificities that are controlled by genes encoding different protein families in wheat. The objective of this study is to predict and analyze genes in three such families: NBS-LRR (nucleotide-binding sites and leucine-rich repeats or NLR), START (Steroidogenic Acute Regulatory protein [STaR] related lipid-transfer) and ABC (ATP-Binding Cassette) transporter. The focus of the analysis is on the patterns of relationships between these protein-coding genes within the gene families and QTLs detected for leaf rust resistance. We predicted 526 ABC, 1117 NLR and 144 START genes in the hexaploid wheat genome through a domain analysis of wheat proteome. Of the 1809 SNPs from leaf rust resistance QTLs in seedling and adult stages of wheat, 126 SNPs were found within coding regions of these genes or their neighborhood (5 Kb upstream from transcription start site [TSS] or downstream from transcription termination site [TTS] of the genes). Forty-three of these SNPs for adult resistance and 18 SNPs for seedling resistance reside within coding or neighboring regions of the ABC genes whereas 14 SNPs for adult resistance and 29 SNPs for seedling resistance reside within coding or neighboring regions of the NLR gene. Moreover, we found 17 nonsynonymous SNPs for adult resistance and five SNPs for seedling resistance in the ABC genes, and five nonsynonymous SNPs for adult resistance and six SNPs for seedling resistance in the NLR genes. Most of these coding SNPs were predicted to alter encoded amino acids and such information may serve as a starting point towards more thorough molecular and functional characterization of the designated Lr genes. Using the primer sequences of 99 known non-SNP markers from leaf rust resistance QTLs, we found candidate genes closely linked to these markers, including Lr34 with distances to its two gene-specific markers being 1212 bases (to cssfr1) and 2189 bases (to cssfr2). This study represents a comprehensive analysis of ABC, NLR and START genes in the hexaploid wheat genome and their physical relationships with QTLs for leaf rust resistance at seedling and adult stages. Our analysis suggests that the ABC (and START) genes are more likely to be co-located with QTLs for race-nonspecific, adult resistance whereas the NLR genes are more likely to be co-located with QTLs for race-specific resistance that would be often expressed at the seedling stage. Though our analysis was hampered by inaccurate or unknown physical positions of numerous QTLs due to the incomplete assembly of the complex hexaploid wheat genome that is currently available, the observed associations between (i) QTLs for race-specific resistance and NLR genes and (ii) QTLs for nonspecific resistance and ABC genes will help discover SNP variants for leaf rust resistance at seedling and adult stages. The genes containing nonsynonymous SNPs are promising candidates that can be investigated in future studies as potential new sources of leaf rust resistance in wheat breeding.

  13. Non-synonymous single nucleotide polymorphisms in the watermelon eIF4E gene are closely associated with resistance to zucchini yellow mosaic virus.

    PubMed

    Ling, Kai-Shu; Harris, Karen R; Meyer, Jenelle D F; Levi, Amnon; Guner, Nihat; Wehner, Todd C; Bendahmane, Abdelhafid; Havey, Michael J

    2009-12-01

    Zucchini yellow mosaic virus (ZYMV) is one of the most economically important potyviruses infecting cucurbit crops worldwide. Using a candidate gene approach, we cloned and sequenced eIF4E and eIF(iso)4E gene segments in watermelon. Analysis of the nucleotide sequences between the ZYMV-resistant watermelon plant introduction PI 595203 (Citrullus lanatus var. lanatus) and the ZYMV-susceptible watermelon cultivar 'New Hampshire Midget' ('NHM') showed the presence of single nucleotide polymorphisms (SNPs). Initial analysis of the identified SNPs in association studies indicated that SNPs in the eIF4E, but not eIF(iso)4E, were closely associated to the phenotype of ZYMV-resistance in 70 F(2) and 114 BC(1R) progenies. Subsequently, we focused our efforts in obtaining the entire genomic sequence of watermelon eIF4E. Three SNPs were identified between PI 595203 and NHM. One of the SNPs (A241C) was in exon 1 and the other two SNPs (C309A and T554G) were in the first intron of the gene. SNP241 which resulted in an amino acid substitution (proline to threonine) was shown to be located in the critical cap recognition and binding area, similar to that of several plant species resistance to potyviruses. Analysis of a cleaved amplified polymorphism sequence (CAPS) marker derived from this SNP in F(2) and BC(1R) populations demonstrated a cosegregation between the CAPS-2 marker and their ZYMV resistance or susceptibility phenotype. When we investigated whether such SNP mutation in the eIF4E was also conserved in several other PIs of C. lanatus var. citroides, we identified a different SNP (A171G) resulting in another amino acid substitution (D71G) from four ZYMV-resistant C. lanatus var. citroides (PI 244018, PI 482261, PI 482299, and PI 482322). Additional CAPS markers were also identified. Availability of all these CAPS markers will enable marker-aided breeding of watermelon for ZYMV resistance.

  14. Haplotypes of CYP3A4 and their close linkage with CYP3A5 haplotypes in a Japanese population.

    PubMed

    Fukushima-Uesaka, Hiromi; Saito, Yoshiro; Watanabe, Hidemi; Shiseki, Kisho; Saeki, Mayumi; Nakamura, Takahiro; Kurose, Kouichi; Sai, Kimie; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Kitakaze, Masafumi; Hanai, Sotaro; Nakajima, Toshiharu; Matsumoto, Kenji; Saito, Hirohisa; Goto, Yu-ichi; Kimura, Hideo; Katoh, Masaaki; Sugai, Kenji; Minami, Narihiro; Shirao, Kuniaki; Tamura, Tomohide; Yamamoto, Noboru; Minami, Hironobu; Ohtsu, Atsushi; Yoshida, Teruhiko; Saijo, Nagahiro; Kitamura, Yutaka; Kamatani, Naoyuki; Ozawa, Shogo; Sawada, Jun-ichi

    2004-01-01

    In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A4 in a Japanese population, the distal enhancer and proximal promoter regions, all exons, and the surrounding introns were sequenced from genomic DNA of 416 Japanese subjects. We found 24 SNPs, including 17 novel ones: two in the distal enhancer, four in the proximal promoter, one in the 5'-untranslated region (UTR), seven in the introns, and three in the 3'-UTR. The most common SNP was c.1026+12G>A (IVS10+12G>A), with a 0.249 frequency. Four non-synonymous SNPs, c.554C>G (p.T185S, CYP3A4(*)16), c.830_831insA (p.E277fsX8, (*)6), c.878T>C (p.L293P, (*)18), and c.1088 C>T (p.T363M, (*)11) were found with frequencies of 0.014, 0.001, 0.028, and 0.002, respectively. No SNP was found in the known nuclear transcriptional factor-binding sites in the enhancer and promoter regions. Using these 24 SNPs, 16 haplotypes were unambiguously identified, and nine haplotypes were inferred by aid of an expectation-maximization-based program. In addition, using data from 186 subjects enabled a close linkage to be found between CYP3A4 and CYP3A5 SNPs, especially among the SNPs at c.1026+12 in CYP3A4 and c.219-237 (IVS3-237, a key SNP site for CYP3A5(*)3), c.865+77 (IVS9+77) and c.1523 in CYP3A5. This result suggested that CYP3A4 and CYP3A5 are within the same gene block. Haplotype analysis between CYP3A4 and CYP3A5 revealed several major haplotype combinations in the CYP3A4-CYP3A5 block. Our findings provide fundamental and useful information for genotyping CYP3A4 (and CYP3A5) in the Japanese, and probably Asian populations. Copyright 2003 Wiley-Liss, Inc.

  15. A house finch (Haemorhous mexicanus) spleen transcriptome reveals intra- and interspecific patterns of gene expression, alternative splicing and genetic diversity in passerines.

    PubMed

    Zhang, Qu; Hill, Geoffrey E; Edwards, Scott V; Backström, Niclas

    2014-04-24

    With its plumage color dimorphism and unique history in North America, including a recent population expansion and an epizootic of Mycoplasma gallisepticum (MG), the house finch (Haemorhous mexicanus) is a model species for studying sexual selection, plumage coloration and host-parasite interactions. As part of our ongoing efforts to make available genomic resources for this species, here we report a transcriptome assembly derived from genes expressed in spleen. We characterize transcriptomes from two populations with different histories of demography and disease exposure: a recently founded population in the eastern US that has been exposed to MG for over a decade and a native population from the western range that has never been exposed to MG. We utilize this resource to quantify conservation in gene expression in passerine birds over approximately 50 MY by comparing splenic expression profiles for 9,646 house finch transcripts and those from zebra finch and find that less than half of all genes expressed in spleen in either species are expressed in both species. Comparative gene annotations from several vertebrate species suggest that the house finch transcriptomes contain ~15 genes not yet found in previously sequenced vertebrate genomes. The house finch transcriptomes harbour ~85,000 SNPs, ~20,000 of which are non-synonymous. Although not yet validated by biological or technical replication, we identify a set of genes exhibiting differences between populations in gene expression (n = 182; 2% of all transcripts), allele frequencies (76 FST ouliers) and alternative splicing as well as genes with several fixed non-synonymous substitutions; this set includes genes with functions related to double-strand break repair and immune response. The two house finch spleen transcriptome profiles will add to the increasing data on genome and transcriptome sequence information from natural populations. Differences in splenic expression between house finch and zebra finch imply either significant evolutionary turnover of splenic expression patterns or different physiological states of the individuals examined. The transcriptome resource will enhance the potential to annotate an eventual house finch genome, and the set of gene-based high-quality SNPs will help clarify the genetic underpinnings of host-pathogen interactions and sexual selection.

  16. Long Non-Coding RNAs Differentially Expressed between Normal versus Primary Breast Tumor Tissues Disclose Converse Changes to Breast Cancer-Related Protein-Coding Genes

    PubMed Central

    Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U.; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N.; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O.

    2014-01-01

    Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes. PMID:25264628

  17. Long non-coding RNAs differentially expressed between normal versus primary breast tumor tissues disclose converse changes to breast cancer-related protein-coding genes.

    PubMed

    Reiche, Kristin; Kasack, Katharina; Schreiber, Stephan; Lüders, Torben; Due, Eldri U; Naume, Bjørn; Riis, Margit; Kristensen, Vessela N; Horn, Friedemann; Børresen-Dale, Anne-Lise; Hackermüller, Jörg; Baumbusch, Lars O

    2014-01-01

    Breast cancer, the second leading cause of cancer death in women, is a highly heterogeneous disease, characterized by distinct genomic and transcriptomic profiles. Transcriptome analyses prevalently assessed protein-coding genes; however, the majority of the mammalian genome is expressed in numerous non-coding transcripts. Emerging evidence supports that many of these non-coding RNAs are specifically expressed during development, tumorigenesis, and metastasis. The focus of this study was to investigate the expression features and molecular characteristics of long non-coding RNAs (lncRNAs) in breast cancer. We investigated 26 breast tumor and 5 normal tissue samples utilizing a custom expression microarray enclosing probes for mRNAs as well as novel and previously identified lncRNAs. We identified more than 19,000 unique regions significantly differentially expressed between normal versus breast tumor tissue, half of these regions were non-coding without any evidence for functional open reading frames or sequence similarity to known proteins. The identified non-coding regions were primarily located in introns (53%) or in the intergenic space (33%), frequently orientated in antisense-direction of protein-coding genes (14%), and commonly distributed at promoter-, transcription factor binding-, or enhancer-sites. Analyzing the most diverse mRNA breast cancer subtypes Basal-like versus Luminal A and B resulted in 3,025 significantly differentially expressed unique loci, including 682 (23%) for non-coding transcripts. A notable number of differentially expressed protein-coding genes displayed non-synonymous expression changes compared to their nearest differentially expressed lncRNA, including an antisense lncRNA strongly anticorrelated to the mRNA coding for histone deacetylase 3 (HDAC3), which was investigated in more detail. Previously identified chromatin-associated lncRNAs (CARs) were predominantly downregulated in breast tumor samples, including CARs located in the protein-coding genes for CALD1, FTX, and HNRNPH1. In conclusion, a number of differentially expressed lncRNAs have been identified with relation to cancer-related protein-coding genes.

  18. Identification of proteins that interact with TANK binding kinase 1 and testing for mutations associated with glaucoma.

    PubMed

    Seo, Seongjin; Solivan-Timpe, Frances; Roos, Ben R; Robin, Alan L; Stone, Edwin M; Kwon, Young H; Alward, Wallace L M; Fingert, John H

    2013-02-01

    Copy number variations (duplications) of TANK binding kinase 1 (TBK1) have been associated with normal tension glaucoma (NTG), a common cause of blindness worldwide. Mutations in other genes involved in autophagy (TLR4 and OPTN) have been associated with NTG. Here we report searching for additional proteins involved in autophagy that may also have roles in NTG. HEK-293T cells were transfected to produce synthetic TBK1 protein with FLAG and S tags. Proteins that associate with TBK1 were isolated from HEK-293T lysates using tandem affinity purification (TAP) and polyacrylamide gel electrophoresis (PAGE). Isolated proteins were identified with mass spectrometry. A cohort of 148 NTG patients and 77 controls from Iowa were tested for glaucoma-causing mutations in genes that encode identified proteins that interact with TBK1 using high resolution melt (HRM) analysis and DNA sequencing. TAP studies show that three proteins expressed in HEK-293T cells (NAP1, TANK and TBKBP1) interact with TBK1. Testing cohorts of NTG and normal controls for disease-causing mutations in TANK, identified a total of nine unique variants including three non-synonymous changes, one synonymous changes and five intronic changes. When analyzed alone or as a group, the non-synonymous TBK1 coding sequence changes were not associated with either NTG or primary open angle glaucoma. TAP showed that NAP1, TANK and TBKBP1 interact with TBK1 and are good candidates for contributing to NTG. A mutation screen of TANK detected three non-synonymous variants. Although, it remains possible that one or more of these TANK mutations may have a role in NTG, the data in this report do not provide statistical support for an association between TANK variants and NTG.

  19. Prospecting for pig single nucleotide polymorphisms in the human genome: have we struck gold?

    PubMed

    Grapes, L; Rudd, S; Fernando, R L; Megy, K; Rocha, D; Rothschild, M F

    2006-06-01

    Gene-to-gene variation in the frequency of single nucleotide polymorphisms (SNPs) has been observed in humans, mice, rats, primates and pigs, but a relationship across species in this variation has not been described. Here, the frequency of porcine coding SNPs (cSNPs) identified by in silico methods, and the frequency of murine cSNPs, were compared with the frequency of human cSNPs across homologous genes. From 150,000 porcine expressed sequence tag (EST) sequences, a total of 452 SNP-containing sequence clusters were found, totalling 1394 putative SNPs. All the clustered porcine EST annotations and SNP data have been made publicly available at http://sputnik.btk.fi/project?name=swine. Human and murine cSNPs were identified from dbSNP and were characterized as either validated or total number of cSNPs (validated plus non-validated) for comparison purposes. The correlation between in silico pig cSNP and validated human cSNP densities was found to be 0.77 (p < 0.00001) for a set of 25 homologous genes, while a correlation of 0.48 (p < 0.0005) was found for a primarily random sample of 50 homologous human and mouse genes. This is the first evidence of conserved gene-to-gene variability in cSNP frequency across species and indicates that site-directed screening of porcine genes that are homologous to cSNP-rich human genes may rapidly advance cSNP discovery in pigs.

  20. Color differences among feral pigeons (Columba livia) are not attributable to sequence variation in the coding region of the melanocortin-1 receptor gene (MC1R)

    PubMed Central

    2013-01-01

    Background Genetic variation at the melanocortin-1 receptor (MC1R) gene is correlated with melanin color variation in many birds. Feral pigeons (Columba livia) show two major melanin-based colorations: a red coloration due to pheomelanic pigment and a black coloration due to eumelanic pigment. Furthermore, within each color type, feral pigeons display continuous variation in the amount of melanin pigment present in the feathers, with individuals varying from pure white to a full dark melanic color. Coloration is highly heritable and it has been suggested that it is under natural or sexual selection, or both. Our objective was to investigate whether MC1R allelic variants are associated with plumage color in feral pigeons. Findings We sequenced 888 bp of the coding sequence of MC1R among pigeons varying both in the type, eumelanin or pheomelanin, and the amount of melanin in their feathers. We detected 10 non-synonymous substitutions and 2 synonymous substitution but none of them were associated with a plumage type. It remains possible that non-synonymous substitutions that influence coloration are present in the short MC1R fragment that we did not sequence but this seems unlikely because we analyzed the entire functionally important region of the gene. Conclusions Our results show that color differences among feral pigeons are probably not attributable to amino acid variation at the MC1R locus. Therefore, variation in regulatory regions of MC1R or variation in other genes may be responsible for the color polymorphism of feral pigeons. PMID:23915680

  1. Comparative analysis of myostatin gene and promoter sequences of Qinchuan and Red Angus cattle.

    PubMed

    He, Y L; Wu, Y H; Quan, F S; Liu, Y G; Zhang, Y

    2013-09-04

    To better understand the function of the myostatin gene and its promoter region in bovine, we amplified and sequenced the myostatin gene and promoter from the blood of Qinchuan and Red Angus cattle by using polymerase chain reaction. The sequences of Qinchuan and Red Angus cattle were compared with those of other cattle breeds available in GenBank. Exon splice sites were confirmed by mRNA sequencing. Compared to the published sequence (GenBank accession No. AF320998), 69 single nucleotide polymorphisms (SNPs) were identified in the Qinchuan myostatin gene, only one of which was an insertion mutation in Qinchuan cattle. There was a 16-bp insertion in the first 705-bp intron in 3 Qinchuan cattle. A total of 7 SNPs were identified in exon 3, in which the mutation occurred in the third base of the codon and was synonymous. On comparing the Qinchuan myostatin gene sequence to that of Red Angus cattle, a total of 50 SNPs were identified in the first and third exons. In addition, there were 18 SNPs identified in the Qinchuan cattle promoter region compared with those of other cattle compared to the Red Angus cattle myostatin promoter region. breeds (GenBank accession No. AF348479), but only 14 SNPs when compared to the Red Angus cattle myostatin promoter region.

  2. Two novel polymorphisms of bovine SIRT2 gene are associated with higher body weight in Nanyang cattle.

    PubMed

    Sun, Xiaomei; Li, Mingxun; Hao, Dan; Hua, Liushuai; Lan, Xianyong; Lei, Chuzhao; Hu, Shenrong; Qi, Xinglei; Chen, Hong

    2015-03-01

    Identification of polymorphisms associated with economic traits is important for successful marker-assisted selection in cattle breeding. The family of mammalian sirtuin regulates many biological functions, such as life span extension and energy metabolism. SIRT2, a most abundant sirtuin in adipocytes, acts as a crucial regulator of adipogenic differentiation and plays a key role in controlling adipose tissue function and mass. Here we investigated single nucleotide polymorphisms (SNPs) of bovine SIRT2 in 1226 cattle from five breeds and further evaluated the effects of identified SNPs on economically important traits of Nanyang cattle. Our results revealed four novel SNPs in bovine SIRT2, one was located in intronic region and the other three were synonymous mutations. Linkage disequilibrium and haplotype analyses based on the identified SNPs showed obvious difference between crossbred breed and the other four beef breeds. Association analyses demonstrated that SNPs g.17333C > T and g.17578A > G have a significantly effect on 18-months-old body weight of Nanyang population. Animals with combined genotype TTGG at the above two loci exhibited especially higher body weight. Our data for the first time demonstrated that polymorphisms in bovine SIRT2 are associated with economic traits of Nanyang cattle, which will be helpful for future cattle selection practices.

  3. Genetic Architecture of Natural Variation in Rice Chlorophyll Content Revealed by a Genome-Wide Association Study.

    PubMed

    Wang, Quanxiu; Xie, Weibo; Xing, Hongkun; Yan, Ju; Meng, Xiangzhou; Li, Xinglei; Fu, Xiangkui; Xu, Jiuyue; Lian, Xingming; Yu, Sibin; Xing, Yongzhong; Wang, Gongwei

    2015-06-01

    Chlorophyll content is one of the most important physiological traits as it is closely related to leaf photosynthesis and crop yield potential. So far, few genes have been reported to be involved in natural variation of chlorophyll content in rice (Oryza sativa) and the extent of variations explored is very limited. We conducted a genome-wide association study (GWAS) using a diverse worldwide collection of 529 O. sativa accessions. A total of 46 significant association loci were identified. Three F2 mapping populations with parents selected from the association panel were tested for validation of GWAS signals. We clearly demonstrated that Grain number, plant height, and heading date7 (Ghd7) was a major locus for natural variation of chlorophyll content at the heading stage by combining evidence from near-isogenic lines and transgenic plants. The enhanced expression of Ghd7 decreased the chlorophyll content, mainly through down-regulating the expression of genes involved in the biosynthesis of chlorophyll and chloroplast. In addition, Narrow leaf1 (NAL1) corresponded to one significant association region repeatedly detected over two years. We revealed a high degree of polymorphism in the 5' UTR and four non-synonymous SNPs in the coding region of NAL1, and observed diverse effects of the major haplotypes. The loci or candidate genes identified would help to fine-tune and optimize the antenna size of canopies in rice breeding. Copyright © 2015 The Author. Published by Elsevier Inc. All rights reserved.

  4. Identification and characterization of transcript polymorphisms in soybean lines varying in oil composition and content.

    PubMed

    Goettel, Wolfgang; Xia, Eric; Upchurch, Robert; Wang, Ming-Li; Chen, Pengyin; An, Yong-Qiang Charles

    2014-04-23

    Variation in seed oil composition and content among soybean varieties is largely attributed to differences in transcript sequences and/or transcript accumulation of oil production related genes in seeds. Discovery and analysis of sequence and expression variations in these genes will accelerate soybean oil quality improvement. In an effort to identify these variations, we sequenced the transcriptomes of soybean seeds from nine lines varying in oil composition and/or total oil content. Our results showed that 69,338 distinct transcripts from 32,885 annotated genes were expressed in seeds. A total of 8,037 transcript expression polymorphisms and 50,485 transcript sequence polymorphisms (48,792 SNPs and 1,693 small Indels) were identified among the lines. Effects of the transcript polymorphisms on their encoded protein sequences and functions were predicted. The studies also provided independent evidence that the lack of FAD2-1A gene activity and a non-synonymous SNP in the coding sequence of FAB2C caused elevated oleic acid and stearic acid levels in soybean lines M23 and FAM94-41, respectively. As a proof-of-concept, we developed an integrated RNA-seq and bioinformatics approach to identify and functionally annotate transcript polymorphisms, and demonstrated its high effectiveness for discovery of genetic and transcript variations that result in altered oil quality traits. The collection of transcript polymorphisms coupled with their predicted functional effects will be a valuable asset for further discovery of genes, gene variants, and functional markers to improve soybean oil quality.

  5. Capturing the Biofuel Wellhead and Powerhouse: The Chloroplast and Mitochondrial Genomes of the Leguminous Feedstock Tree Pongamia pinnata

    PubMed Central

    Kazakoff, Stephen H.; Imelfort, Michael; Edwards, David; Koehorst, Jasper; Biswas, Bandana; Batley, Jacqueline; Scott, Paul T.; Gresshoff, Peter M.

    2012-01-01

    Pongamia pinnata (syn. Millettia pinnata) is a novel, fast-growing arboreal legume that bears prolific quantities of oil-rich seeds suitable for the production of biodiesel and aviation biofuel. Here, we have used Illumina® ‘Second Generation DNA Sequencing (2GS)’ and a new short-read de novo assembler, SaSSY, to assemble and annotate the Pongamia chloroplast (152,968 bp; cpDNA) and mitochondrial (425,718 bp; mtDNA) genomes. We also show that SaSSY can be used to accurately assemble 2GS data, by re-assembling the Lotus japonicus cpDNA and in the process assemble its mtDNA (380,861 bp). The Pongamia cpDNA contains 77 unique protein-coding genes and is almost 60% gene-dense. It contains a 50 kb inversion common to other legumes, as well as a novel 6.5 kb inversion that is responsible for the non-disruptive, re-orientation of five protein-coding genes. Additionally, two copies of an inverted repeat firmly place the species outside the subclade of the Fabaceae lacking the inverted repeat. The Pongamia and L. japonicus mtDNA contain just 33 and 31 unique protein-coding genes, respectively, and like other angiosperm mtDNA, have expanded intergenic and multiple repeat regions. Through comparative analysis with Vigna radiata we measured the average synonymous and non-synonymous divergence of all three legume mitochondrial (1.59% and 2.40%, respectively) and chloroplast (8.37% and 8.99%, respectively) protein-coding genes. Finally, we explored the relatedness of Pongamia within the Fabaceae and showed the utility of the organellar genome sequences by mapping transcriptomic data to identify up- and down-regulated stress-responsive gene candidates and confirm in silico predicted RNA editing sites. PMID:23272141

  6. Capturing the biofuel wellhead and powerhouse: the chloroplast and mitochondrial genomes of the leguminous feedstock tree Pongamia pinnata.

    PubMed

    Kazakoff, Stephen H; Imelfort, Michael; Edwards, David; Koehorst, Jasper; Biswas, Bandana; Batley, Jacqueline; Scott, Paul T; Gresshoff, Peter M

    2012-01-01

    Pongamia pinnata (syn. Millettia pinnata) is a novel, fast-growing arboreal legume that bears prolific quantities of oil-rich seeds suitable for the production of biodiesel and aviation biofuel. Here, we have used Illumina® 'Second Generation DNA Sequencing (2GS)' and a new short-read de novo assembler, SaSSY, to assemble and annotate the Pongamia chloroplast (152,968 bp; cpDNA) and mitochondrial (425,718 bp; mtDNA) genomes. We also show that SaSSY can be used to accurately assemble 2GS data, by re-assembling the Lotus japonicus cpDNA and in the process assemble its mtDNA (380,861 bp). The Pongamia cpDNA contains 77 unique protein-coding genes and is almost 60% gene-dense. It contains a 50 kb inversion common to other legumes, as well as a novel 6.5 kb inversion that is responsible for the non-disruptive, re-orientation of five protein-coding genes. Additionally, two copies of an inverted repeat firmly place the species outside the subclade of the Fabaceae lacking the inverted repeat. The Pongamia and L. japonicus mtDNA contain just 33 and 31 unique protein-coding genes, respectively, and like other angiosperm mtDNA, have expanded intergenic and multiple repeat regions. Through comparative analysis with Vigna radiata we measured the average synonymous and non-synonymous divergence of all three legume mitochondrial (1.59% and 2.40%, respectively) and chloroplast (8.37% and 8.99%, respectively) protein-coding genes. Finally, we explored the relatedness of Pongamia within the Fabaceae and showed the utility of the organellar genome sequences by mapping transcriptomic data to identify up- and down-regulated stress-responsive gene candidates and confirm in silico predicted RNA editing sites.

  7. Analysis of the mitochondrial genome of cheetahs (Acinonyx jubatus) with neurodegenerative disease.

    PubMed

    Burger, Pamela A; Steinborn, Ralf; Walzer, Christian; Petit, Thierry; Mueller, Mathias; Schwarzenberger, Franz

    2004-08-18

    The complete mitochondrial genome of Acinonyx jubatus was sequenced and mitochondrial DNA (mtDNA) regions were screened for polymorphisms as candidates for the cause of a neurodegenerative demyelinating disease affecting captive cheetahs. The mtDNA reference sequences were established on the basis of the complete sequences of two diseased and two nondiseased animals as well as partial sequences of 26 further individuals. The A. jubatus mitochondrial genome is 17,047-bp long and shows a high sequence similarity (91%) to the domestic cat. Based on single nucleotide polymorphisms (SNPs) in the control region (CR) and pedigree information, the 18 myelopathic and 12 non-myelopathic cheetahs included in this study were classified into haplotypes I, II and III. In view of the phenotypic comparability of the neurodegenerative disease observed in cheetahs and human mtDNA-associated diseases, specific coding regions including the tRNAs leucine UUR, lysine, serine UCN, and partial complex I and V sequences were screened. We identified a heteroplasmic and a homoplasmic SNP at codon 507 in the subunit 5 (MTND5) of complex I. The heteroplasmic haplotype I-specific valine to methionine substitution represents a nonconservative amino acid change and was found in 11 myelopathic and eight non-myelopathic cheetahs with levels ranging from 29% to 79%. The homoplasmic conservative amino acid substitution valine to alanine was identified in two myelopathic animals of haplotype II. In addition, a synonymous SNP in the codon 76 of the MTND4L gene was found in the single haplotype III animal. The amino acid exchanges in the MTND5 gene were not associated with the occurrence of neurodegenerative disease in captive cheetahs.

  8. Polymorphisms in ERAP1 and ERAP2 are shared by Caninae and segregate within and between random- and pure-breeds of dogs.

    PubMed

    Pedersen, N C; Dhanota, J K; Liu, H

    2016-10-15

    Specific polymorphisms in the endoplasmic reticulum amino peptidase genes ERAP1 and ERAP2, when present with certain MHC class receptor types, have been associated with increased risk for specific cancers, infectious diseases and autoimmune disorders in humans. This increased risk has been linked to distinct polymorphisms in both ERAPs and MHC class I receptors that affect the way cell-generated peptides are screened for antigenicity. The incidence of cancer, infectious disease and autoimmune disorders differ greatly among pure breeds of dogs as it does in humans and it is possible that this heightened susceptibility is also due to specific polymorphisms in ERAP1 and ERAP2. In order to determine if such polymorphisms exist, the ERAP1 and ERAP2 genes of 10 dogs of nine diverse breeds were sequenced and SNPs causing synonymous or non-synonymous amino acid changes, deletions or insertions were identified. Eight ERAP1 and 10 ERAP2 SNPs were used to create a Sequenom MassARRAY iPLEX based test panel which defined 24 ERAP1, 36 ERAP2 and 128 ERAP1/2 haplotypes. The prevalence of these haplotypes was then measured among dog, wolf, coyote, jackal and red fox populations. Some haplotypes were species specific, while others were shared across species, especially between dog, wolf, coyote and jackal. The prevalence of these haplotypes was then compared among various canid populations, and in particular between various populations of random- and pure-bred dogs. Human-directed positive selection has led to loss of ERAP diversity and segregation of certain haplotypes among various dog breeds. A phylogenetic tree generated from 45 of the most common ERAP1/2 haplotypes demonstrated three distinct clades, all of which were rooted with haplotypes either shared among species or specific to contemporary dogs, coyote and wolf. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  9. Exome sequencing identifies rare LDLR and APOA5 alleles conferring risk for myocardial infarction.

    PubMed

    Do, Ron; Stitziel, Nathan O; Won, Hong-Hee; Jørgensen, Anders Berg; Duga, Stefano; Angelica Merlini, Pier; Kiezun, Adam; Farrall, Martin; Goel, Anuj; Zuk, Or; Guella, Illaria; Asselta, Rosanna; Lange, Leslie A; Peloso, Gina M; Auer, Paul L; Girelli, Domenico; Martinelli, Nicola; Farlow, Deborah N; DePristo, Mark A; Roberts, Robert; Stewart, Alexander F R; Saleheen, Danish; Danesh, John; Epstein, Stephen E; Sivapalaratnam, Suthesh; Hovingh, G Kees; Kastelein, John J; Samani, Nilesh J; Schunkert, Heribert; Erdmann, Jeanette; Shah, Svati H; Kraus, William E; Davies, Robert; Nikpay, Majid; Johansen, Christopher T; Wang, Jian; Hegele, Robert A; Hechter, Eliana; Marz, Winfried; Kleber, Marcus E; Huang, Jie; Johnson, Andrew D; Li, Mingyao; Burke, Greg L; Gross, Myron; Liu, Yongmei; Assimes, Themistocles L; Heiss, Gerardo; Lange, Ethan M; Folsom, Aaron R; Taylor, Herman A; Olivieri, Oliviero; Hamsten, Anders; Clarke, Robert; Reilly, Dermot F; Yin, Wu; Rivas, Manuel A; Donnelly, Peter; Rossouw, Jacques E; Psaty, Bruce M; Herrington, David M; Wilson, James G; Rich, Stephen S; Bamshad, Michael J; Tracy, Russell P; Cupples, L Adrienne; Rader, Daniel J; Reilly, Muredach P; Spertus, John A; Cresci, Sharon; Hartiala, Jaana; Tang, W H Wilson; Hazen, Stanley L; Allayee, Hooman; Reiner, Alex P; Carlson, Christopher S; Kooperberg, Charles; Jackson, Rebecca D; Boerwinkle, Eric; Lander, Eric S; Schwartz, Stephen M; Siscovick, David S; McPherson, Ruth; Tybjaerg-Hansen, Anne; Abecasis, Goncalo R; Watkins, Hugh; Nickerson, Deborah A; Ardissino, Diego; Sunyaev, Shamil R; O'Donnell, Christopher J; Altshuler, David; Gabriel, Stacey; Kathiresan, Sekar

    2015-02-05

    Myocardial infarction (MI), a leading cause of death around the world, displays a complex pattern of inheritance. When MI occurs early in life, genetic inheritance is a major component to risk. Previously, rare mutations in low-density lipoprotein (LDL) genes have been shown to contribute to MI risk in individual families, whereas common variants at more than 45 loci have been associated with MI risk in the population. Here we evaluate how rare mutations contribute to early-onset MI risk in the population. We sequenced the protein-coding regions of 9,793 genomes from patients with MI at an early age (≤50 years in males and ≤60 years in females) along with MI-free controls. We identified two genes in which rare coding-sequence mutations were more frequent in MI cases versus controls at exome-wide significance. At low-density lipoprotein receptor (LDLR), carriers of rare non-synonymous mutations were at 4.2-fold increased risk for MI; carriers of null alleles at LDLR were at even higher risk (13-fold difference). Approximately 2% of early MI cases harbour a rare, damaging mutation in LDLR; this estimate is similar to one made more than 40 years ago using an analysis of total cholesterol. Among controls, about 1 in 217 carried an LDLR coding-sequence mutation and had plasma LDL cholesterol > 190 mg dl(-1). At apolipoprotein A-V (APOA5), carriers of rare non-synonymous mutations were at 2.2-fold increased risk for MI. When compared with non-carriers, LDLR mutation carriers had higher plasma LDL cholesterol, whereas APOA5 mutation carriers had higher plasma triglycerides. Recent evidence has connected MI risk with coding-sequence mutations at two genes functionally related to APOA5, namely lipoprotein lipase and apolipoprotein C-III (refs 18, 19). Combined, these observations suggest that, as well as LDL cholesterol, disordered metabolism of triglyceride-rich lipoproteins contributes to MI risk.

  10. Polymorphisms in the bovine ghrelin precursor (GHRL) and Syndecan-1 (SDC1) genes that are associated with growth traits in cattle.

    PubMed

    Sun, Jiajie; Jin, Qijiang; Zhang, Chunlei; Fang, Xingtang; Gu, Chuanwen; Lei, Chuzhao; Wang, Juqiang; Chen, Hong

    2011-06-01

    Transgenically expressed Syndecan-1 was found in the hypothalamic nuclei that control energy balance, and was associated with maturity-onset obesity, while ghrelin has been shown to play important roles in the control of food intake, gastric acid secretion, energy homeostasis, and glucose and lipid metabolism. However, the roles of genetic variations of Syndecan-1 and ghrelin on growth trait have few been reported in cattle. Herein, five Chinese cattle breeds were analyzed by PCR-SSCP and DNA sequencing methods. The bovine ghrelin gene showed eleven SNPs g.[267G>A, 271G>A, 290C>T, 326A>G, 327T>C, 420C>A, 569A>G, 945C>T, 993C>T, 4491A>G, 4644G>A] and three SNPs g.[420C>A, 569 A>G, 945C>T] were firstly detected in cattle. The bovine Syndecan-1 gene showed two SNPs. One SNP showed a transition C>G at position 21514, resulting in a synonymous mutation p.G(GGC)169G(GGG) and another showed a transversion C>T at position 22591, resulting in a synonymous mutation p.D(GAC)283D(GAT). In ghrelin gene, no significant associations were revealed between any variant sites and body weight, average daily gain, body sizes for different growth periods (6, 12, 18, and 24 months old), as well as for the milk yield at 305 days, milk protein rate and milk fat percentage. However, the polymorphism of Syndecan-1 gene was significantly associated with bovine birth weight and body length. Hence, we first suggested that Syndecan-1 gene could be regarded as molecular marker for superior birth weight and body length.

  11. LAMB1 polymorphism is associated with autism symptom severity in Korean autism spectrum disorder patients.

    PubMed

    Kim, Young Jong; Park, Jin Kyung; Kang, Won Sub; Kim, Su Kang; Park, Hae Jeong; Nam, Min; Kim, Jong Woo

    2015-01-01

    LAMB1 encodes laminin beta-1, which is expressed during early development of the human nervous system, and could be involved in the pathogenesis of neurodevelopmental disorders. In our study, we aimed to investigate whether single nucleotide polymorphisms (SNPs) in LAMB1 were associated with autism spectrum disorder (ASD) and with related clinical severities of ASD. Two coding SNPs (rs20556 and rs25659) and two intronic SNPs (rs2158836 and rs2237659) were compared between 180 patients with ASD and 147 healthy control subjects using direct sequencing. The Korean version of the Childhood Autism Rating Scale (K-CARS) was used to assess clinical severities. Multiple logistic regression models were employed to analyze genetic data, and associations with symptom severity were tested with the Kruskal-Wallis and the Mann-Whitney U tests. None of the four examined SNPs was associated with ASD risk. However, the GG genotype of rs2158836 was associated with more severe symptoms for the "object use" and "non-verbal communication" measures. The results of our study suggest the association between rs2158836 polymorphisms and symptom severity in ASD.

  12. Report on the Project for Establishment of the Standardized Korean Laboratory Terminology Database, 2015.

    PubMed

    Jung, Bo Kyeung; Kim, Jeeyong; Cho, Chi Hyun; Kim, Ju Yeon; Nam, Myung Hyun; Shin, Bong Kyung; Rho, Eun Youn; Kim, Sollip; Sung, Heungsup; Kim, Shinyoung; Ki, Chang Seok; Park, Min Jung; Lee, Kap No; Yoon, Soo Young

    2017-04-01

    The National Health Information Standards Committee was established in 2004 in Korea. The practical subcommittee for laboratory test terminology was placed in charge of standardizing laboratory medicine terminology in Korean. We aimed to establish a standardized Korean laboratory terminology database, Korea-Logical Observation Identifier Names and Codes (K-LOINC) based on former products sponsored by this committee. The primary product was revised based on the opinions of specialists. Next, we mapped the electronic data interchange (EDI) codes that were revised in 2014, to the corresponding K-LOINC. We established a database of synonyms, including the laboratory codes of three reference laboratories and four tertiary hospitals in Korea. Furthermore, we supplemented the clinical microbiology section of K-LOINC using an alternative mapping strategy. We investigated other systems that utilize laboratory codes in order to investigate the compatibility of K-LOINC with statistical standards for a number of tests. A total of 48,990 laboratory codes were adopted (21,539 new and 16,330 revised). All of the LOINC synonyms were translated into Korean, and 39,347 Korean synonyms were added. Moreover, 21,773 synonyms were added from reference laboratories and tertiary hospitals. Alternative strategies were established for mapping within the microbiology domain. When we applied these to a smaller hospital, the mapping rate was successfully increased. Finally, we confirmed K-LOINC compatibility with other statistical standards, including a newly proposed EDI code system. This project successfully established an up-to-date standardized Korean laboratory terminology database, as well as an updated EDI mapping to facilitate the introduction of standard terminology into institutions. © 2017 The Korean Academy of Medical Sciences.

  13. Insecticide-driven patterns of genetic variation in the dengue vector Aedes aegypti in Martinique Island.

    PubMed

    Marcombe, Sébastien; Paris, Margot; Paupy, Christophe; Bringuier, Charline; Yebakima, André; Chandre, Fabrice; David, Jean-Philippe; Corbel, Vincent; Despres, Laurence

    2013-01-01

    Effective vector control is currently challenged worldwide by the evolution of resistance to all classes of chemical insecticides in mosquitoes. In Martinique, populations of the dengue vector Aedes aegypti have been intensively treated with temephos and deltamethrin insecticides over the last fifty years, resulting in heterogeneous levels of resistance across the island. Resistance spreading depends on standing genetic variation, selection intensity and gene flow among populations. To determine gene flow intensity, we first investigated neutral patterns of genetic variability in sixteen populations representative of the many environments found in Martinique and experiencing various levels of insecticide pressure, using 6 microsatellites. Allelic richness was lower in populations resistant to deltamethrin, and consanguinity was higher in populations resistant to temephos, consistent with a negative effect of insecticide pressure on neutral genetic diversity. The global genetic differentiation was low, suggesting high gene flow among populations, but significant structure was found, with a pattern of isolation-by-distance at the global scale. Then, we investigated adaptive patterns of divergence in six out of the 16 populations using 319 single nucleotide polymorphisms (SNPs). Five SNP outliers displaying levels of genetic differentiation out of neutral expectations were detected, including the kdr-V1016I mutation in the voltage-gated sodium channel gene. Association tests revealed a total of seven SNPs associated with deltamethrin resistance. Six other SNPs were associated with temephos resistance, including two non-synonymous substitutions in an alkaline phosphatase and in a sulfotransferase respectively. Altogether, both neutral and adaptive patterns of genetic variation in mosquito populations appear to be largely driven by insecticide pressure in Martinique.

  14. Association study for the role of Matrix metalloproteinases 2 and 3 gene polymorphisms in dental caries susceptibility.

    PubMed

    Karayasheva, Dobrina; Glushkova, Maria; Boteva, Ekaterina; Mitev, Vanyo; Kadiyska, Tanya

    2016-08-01

    Various exogenous and endogenous risk factors have been described as contributing to dental caries susceptibility. In the last decade it has been established that both pro and active forms of host derived Matrix metalloproteinases (MMPs) are present in the oral cavity. MMPs role in caries development has been hypothesized. The aim of this study was to analyse MMP2 (rs2287074) and MMP3 (rs679620) single nucleotide polymorphisms (SNPs) and their role in caries susceptibility. The two SNPs were analysed by PCR- restriction fragment length polymorphism (RFLP) in a sample of 102 ethnic Bulgarian volunteers (42 males and 60 females), all students in Sofia Medical University. Statistical analysis of the MMP2 SNP showed significant differences for the genotype frequencies between the caries free (CF, DMFT=0) and low caries experience (LCE, DMFT≤5) groups. Analysis for the non-synonymous MMP3 SNP found significant differences between both CF vs caries experience groups (LCE+ high caries experience (HCE, DMFT≥5)) and LCE vs HCE groups. The presence of allele G decreased the risk of HCE about 4 times. MMP2 and MMP3 genes are likely to be involved in caries susceptibility in our population. However, as dental caries is a multifactorial disorder and several genes are likely to have influence on it, it is reasonable to expect that SNPs, even those proven to be functional like rs679620, potentially play a significant, but not major role in the disease outcome. Copyright © 2016 Elsevier Ltd. All rights reserved.

  15. Computational insights of K1444N substitution in GAP-related domain of NF1 gene associated with neurofibromatosis type 1 disease: a molecular modeling and dynamics approach.

    PubMed

    Agrahari, Ashish Kumar; Muskan, Meghana; George Priya Doss, C; Siva, R; Zayed, Hatem

    2018-05-27

    The NF1 gene encodes for neurofibromin protein, which is ubiquitously expressed, but most highly in the central nervous system. Non-synonymous SNPs (nsSNPs) in the NF1 gene were found to be associated with Neurofibromatosis Type 1 disease, which is characterized by the growth of tumors along nerves in the skin, brain, and other parts of the body. In this study, we used several in silico predictions tools to analyze 16 nsSNPs in the RAS-GAP domain of neurofibromin, the K1444N (K1423N) mutation was predicted as the most pathogenic. The comparative molecular dynamic simulation (MDS; 50 ns) between the wild type and the K1444N (K1423N) mutant suggested a significant change in the electrostatic potential. In addition, the RMSD, RMSF, Rg, hydrogen bonds, and PCA analysis confirmed the loss of flexibility and increase in compactness of the mutant protein. Further, SASA analysis revealed exchange between hydrophobic and hydrophilic residues from the core of the RAS-GAP domain to the surface of the mutant domain, consistent with the secondary structure analysis that showed significant alteration in the mutant protein conformation. Our data concludes that the K1444N (K1423N) mutant lead to increasing the rigidity and compactness of the protein. This study provides evidence of the benefits of the computational tools in predicting the pathogenicity of genetic mutations and suggests the application of MDS and different in silico prediction tools for variant assessment and classification in genetic clinics.

  16. Associations between polymorphisms in the antiviral TRIM genes and measles vaccine immunity.

    PubMed

    Ovsyannikova, Inna G; Haralambieva, Iana H; Vierkant, Robert A; O'Byrne, Megan M; Poland, Gregory A

    2013-06-01

    The role of polymorphisms within the antiviral tripartite motif (TRIM) genes in measles vaccine adaptive immune responses was examined. A limited association was found between TRIM5 (rs7122620) and TRIM25 (rs205499) gene polymorphisms and measles-specific antibody levels. However, many associations were found between TRIM gene SNPs and variations in cellular responses (IFN-γ Elispot and secreted cytokines IL-2, IL-6, IL-10, IFN-γ, and TNF-α). TRIM22 rs2291841 was significantly associated with an increased IFN-γ Elispot response (35 vs. 102 SFC per 2×10(5)PBMC, p=0.009, q=0.71) in Caucasians. A non-synonymous TRIM25 rs205498 (in LD with other SNPs, r(2)≥0.56), as well as the TRIM25 AAAGGAAAGGAGT haplotype, was associated with a decreased IFN-γ Elispot response (t-statistic -2.32, p=0.02) in African-Americans. We also identified polymorphisms in the TRIM5, TRIM22, and TRIM25 genes that were associated with significant differences in cytokine responses. Additional studies are necessary to replicate our findings and to examine the functional consequences of these associations. Copyright © 2013 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  17. In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation

    PubMed Central

    Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F.; Sampson, Juliana K.; Khalid, Haniya; Sheth, Nihar U.; Batalo, Michael; Serrano, Myrna G.; Roberts, Catherine H.; Hess, Michael L.; Buck, Gregory A.; Neale, Michael C.; Manjili, Masoud H.; Toor, Amir Ahmed

    2014-01-01

    Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor–recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential. PMID:25414699

  18. In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation.

    PubMed

    Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F; Sampson, Juliana K; Khalid, Haniya; Sheth, Nihar U; Batalo, Michael; Serrano, Myrna G; Roberts, Catherine H; Hess, Michael L; Buck, Gregory A; Neale, Michael C; Manjili, Masoud H; Toor, Amir Ahmed

    2014-01-01

    Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor-recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential.

  19. Significance of genetic variants in DLC1 and their association with hepatocellular carcinoma

    PubMed Central

    XIE, CHENG-RONG; SUN, HONG-GUANG; SUN, YU; ZHAO, WEN-XIU; ZHANG, SHENG; WANG, XIAO-MIN; YIN, ZHEN-YU

    2015-01-01

    DLC1 has been shown to be downregulated or absent in hepatocellular carcinoma (HCC) and is associated with tumorigenesis and development. However, only a small number of studies have focused on genetic variations of DLC1. The present study performed exon sequencing for the DLC1 gene in HCC tissue samples from 105 patients to identify functional genetic variation of DLC1 and its association with HCC susceptibility, clinicopathological features and prognosis. A novel missense mutation and four non-synonymous single nucleotide polymorphisms (SNPs; rs3816748, rs11203495, rs3816747 and rs532841) were identified. A significant correlation of rs3816747 polymorphisms with HCC susceptibility was identified. Compared to individuals with the GG genotype of rs3816747, those with the GA (odds ratio (OR)=0.486; P=0.037) or GA+AA genotype (OR=0.51; P=0.039) were associated with a significantly decreased HCC risk. Furthermore, patients with the GC+CC genotype of rs3816748, the TC+CC genotype of rs11203495 or the GA+AA genotype of rs3816747 had small-sized tumors compared with those carrying the wild-type genotype. No significant association of DLC1 SNPs with the patients' prognosis was found. These results indicated that genetic variations in the DLC1 gene may confer a risk for HCC. PMID:26095787

  20. Metabolic Interactions of Purine Derivatives with Human ABC Transporter ABCG2: Genetic Testing to Assess Gout Risk.

    PubMed

    Ishikawa, Toshihisa; Aw, Wanping; Kaneko, Kiyoko

    2013-11-04

    In mammals, excess purine nucleosides are removed from the body by breakdown in the liver and excretion from the kidneys. Uric acid is the end product of purine metabolism in humans. Two-thirds of uric acid in the human body is normally excreted through the kidney, whereas one-third undergoes uricolysis (decomposition of uric acid) in the gut. Elevated serum uric acid levels result in gout and could be a risk factor for cardiovascular disease and diabetes. Recent studies have shown that human ATP-binding cassette transporter ABCG2 plays a role of renal excretion of uric acid. Two non-synonymous single nucleotide polymorphisms (SNPs), i.e., 421C>A (major) and 376C>T (minor), in the ABCG2 gene result in impaired transport activity, owing to ubiquitination-mediated proteosomal degradation and truncation of ABCG2, respectively. These genetic polymorphisms are associated with hyperuricemia and gout. Allele frequencies of those SNPs are significantly higher in Asian populations than they are in African and Caucasian populations. A rapid and isothermal genotyping method has been developed to detect the SNP 421C>A, where one drop of peripheral blood is sufficient for the detection. Development of simple genotyping methods would serve to improve prevention and early therapeutic intervention for high-risk individuals in personalized healthcare.

  1. Development of 101 novel EST-derived single nucleotide polymorphism markers for Zhikong scallop ( Chlamys farreri)

    NASA Astrophysics Data System (ADS)

    Li, Jiqin; Bao, Zhenmin; Li, Ling; Wang, Xiaojian; Wang, Shi; Hu, Xiaoli

    2013-09-01

    Zhikong scallop ( Chlamys farreri) is an important maricultured species in China. Many researches on this species, such as population genetics and QTL fine-mapping, need a large number of molecular markers. In this study, based on the expressed sequence tags (EST), a total of 300 putative single nucleotide polymorphisms (SNPs) were selected and validated using high resolution melting (HRM) technology with unlabeled probe. Of them, 101 (33.7%) were found to be polymorphic in 48 individuals from 4 populations. Further evaluation with 48 individuals from Qingdao population showed that all the polymorphic loci had two alleles with the minor allele frequency ranged from 0.046 to 0.500. The observed and expected heterozygosities ranged from 0.000 to 0.925 and from 0.089 to 0.505, respectively. Fifteen loci deviated significantly from Hardy-Weinberg equilibrium and significant linkage disequilibrate was detected in one pair of markers. BLASTx gave significant hits for 72 of the 101 polymorphic SNP-containing ESTs. Thirty four polymorphic SNP loci were predicted to be non-synonymous substitutions as they caused either the change of codons (33 SNPs) or pretermination of translation (1 SNP). The markers developed can be used for the population studies and genetic improvement on Zhikong scallop.

  2. Polymorphism in the interleukin-7 receptor-alpha and outcome after allogeneic hematopoietic cell transplantation with matched unrelated donor.

    PubMed

    Shamim, Z; Spellman, S; Haagenson, M; Wang, T; Lee, S J; Ryder, L P; Müller, K

    2013-08-01

    Interleukin-7 (IL-7) is essential for T cell development in the thymus and maintenance of peripheral T cells. The α-chain of the IL-7R is polymorphic with the existence of SNPs that give rise to non-synonymous amino acid substitutions. We previously found an association between donor genotypes and increased treatment-related mortality (TRM) (rs1494555G) and acute graft versus host disease (aGvHD) (rs1494555G and rs1494558T) after hematopoietic cell transplantation (HCT). Some studies have confirmed an association between rs6897932C and multiple sclerosis. In this study, we evaluated the prognostic significance of IL-7Rα SNP genotypes in 590-recipient/donor pairs that received HLA-matched unrelated donor HCT for haematological malignancies. Consistent with the primary studies, the rs1494555GG and rs1494558TT genotypes of the donor were associated with aGvHD and chronic GvHD in the univariate analysis. The Tallele of rs6897932 was suggestive of an association with increased frequency of relapse by univariate analysis (P = 0.017) and multivariate analysis (P = 0.015). In conclusion, this study provides further evidence of a role of the IL-7 pathway and IL-7Rα SNPs in HCT. © 2013 John Wiley & Sons Ltd.

  3. Codon Usage Selection Can Bias Estimation of the Fraction of Adaptive Amino Acid Fixations.

    PubMed

    Matsumoto, Tomotaka; John, Anoop; Baeza-Centurion, Pablo; Li, Boyang; Akashi, Hiroshi

    2016-06-01

    A growing number of molecular evolutionary studies are estimating the proportion of adaptive amino acid substitutions (α) from comparisons of ratios of polymorphic and fixed DNA mutations. Here, we examine how violations of two of the model assumptions, neutral evolution of synonymous mutations and stationary base composition, affect α estimation. We simulated the evolution of coding sequences assuming weak selection on synonymous codon usage bias and neutral protein evolution, α = 0. We show that weak selection on synonymous mutations can give polymorphism/divergence ratios that yield α-hat (estimated α) considerably larger than its true value. Nonstationary evolution (changes in population size, selection, or mutation) can exacerbate such biases or, in some scenarios, give biases in the opposite direction, α-hat < α. These results demonstrate that two factors that appear to be prevalent among taxa, weak selection on synonymous mutations and non-steady-state nucleotide composition, should be considered when estimating α. Estimates of the proportion of adaptive amino acid fixations from large-scale analyses of Drosophila melanogaster polymorphism and divergence data are positively correlated with codon usage bias. Such patterns are consistent with α-hat inflation from weak selection on synonymous mutations and/or mutational changes within the examined gene trees. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  4. Transcriptome and Biochemical Analysis of a Flower Color Polymorphism in Silene littorea (Caryophyllaceae)

    PubMed Central

    Casimiro-Soriguer, Inés; Narbona, Eduardo; Buide, M. L.; del Valle, José C.; Whittall, Justen B.

    2016-01-01

    Flower color polymorphisms are widely used as model traits from genetics to ecology, yet determining the biochemical and molecular basis can be challenging. Anthocyanin-based flower color variations can be caused by at least 12 structural and three regulatory genes in the anthocyanin biosynthetic pathway (ABP). We use mRNA-Seq to simultaneously sequence and estimate expression of these candidate genes in nine samples of Silene littorea representing three color morphs (dark pink, light pink and white) across three developmental stages in hopes of identifying the cause of flower color variation. We identified 29 putative paralogs for the 15 candidate genes in the ABP. We assembled complete coding sequences for 16 structural loci and nine of ten regulatory loci. Among these 29 putative paralogs, we identified 622 SNPs, yet only nine synonymous SNPs in Ans had allele frequencies that differentiated pigmented petals (dark pink and light pink) from white petals. These Ans allele frequency differences were further investigated with an expanded sequencing survey of 38 individuals, yet no SNPs consistently differentiated the color morphs. We also found one locus, F3h1, with strong differential expression between pigmented and white samples (>42x). This may be caused by decreased expression of Myb1a in white petal buds. Myb1a in S. littorea is a regulatory locus closely related to Subgroup 7 Mybs known to regulate F3h and other loci in the first half of the ABP in model species. We then compare the mRNA-Seq results with petal biochemistry which revealed cyanidin as the primary anthocyanin and five flavonoid intermediates. Concentrations of three of the flavonoid intermediates were significantly lower in white petals than in pigmented petals (rutin, quercetin and isovitexin). The biochemistry results for rutin, quercetin, luteolin and apigenin are consistent with the transcriptome results suggesting a blockage at F3h, possibly caused by downregulation of Myb1a. PMID:26973662

  5. Decoding Mechanisms by which Silent Codon Changes Influence Protein Biogenesis and Function

    PubMed Central

    Bali, Vedrana; Bebok, Zsuzsanna

    2015-01-01

    Scope Synonymous codon usage has been a focus of investigation since the discovery of the genetic code and its redundancy. The occurrences of synonymous codons vary between species and within genes of the same genome, known as codon usage bias. Today, bioinformatics and experimental data allow us to compose a global view of the mechanisms by which the redundancy of the genetic code contributes to the complexity of biological systems from affecting survival in prokaryotes, to fine tuning the structure and function of proteins in higher eukaryotes. Studies analyzing the consequences of synonymous codon changes in different organisms have revealed that they impact nucleic acid stability, protein levels, structure and function without altering amino acid sequence. As such, synonymous mutations inevitably contribute to the pathogenesis of complex human diseases. Yet, fundamental questions remain unresolved regarding the impact of silent mutations in human disorders. In the present review we describe developments in this area concentrating on mechanisms by which synonymous mutations may affect protein function and human health. Purpose This synopsis illustrates the significance of synonymous mutations in disease pathogenesis. We review the different steps of gene expression affected by silent mutations, and assess the benefits and possible harmful effects of codon optimization applied in the development of therapeutic biologics. Physiological and medical relevance Understanding mechanisms by which synonymous mutations contribute to complex diseases such as cancer, neurodegeneration and genetic disorders, including the limitations of codon-optimized biologics, provides insight concerning interpretation of silent variants and future molecular therapies. PMID:25817479

  6. A cautionary tale: the non-causal association between type 2 diabetes risk SNP, rs7756992, and levels of non-coding RNA, CDKAL1-v1.

    PubMed

    Locke, Jonathan M; Wei, Fan-Yan; Tomizawa, Kazuhito; Weedon, Michael N; Harries, Lorna W

    2015-04-01

    Intronic single nucleotide polymorphisms (SNPs) in the CDKAL1 gene are associated with risk of developing type 2 diabetes. A strong correlation between risk alleles and lower levels of the non-coding RNA, CDKAL1-v1, has recently been reported in whole blood extracted from Japanese individuals. We sought to replicate this association in two independent cohorts: one using whole blood from white UK-resident individuals, and one using a collection of human pancreatic islets, a more relevant tissue type to study with respect to the aetiology of diabetes. Levels of CDKAL1-v1 were measured by real-time PCR using RNA extracted from human whole blood (n = 70) and human pancreatic islets (n = 48). Expression with respect to genotype was then determined. In a simple linear regression model, expression of CDKAL1-v1 was associated with the lead type 2 diabetes-associated SNP, rs7756992, in whole blood and islets. However, these associations were abolished or substantially reduced in multiple regression models taking into account rs9366357 genotype: a moderately linked SNP explaining a much larger amount of the variation in CDKAL1-v1 levels, but not strongly associated with risk of type 2 diabetes. Contrary to previous findings, we provide evidence against a role for dysregulated expression of CDKAL1-v1 in mediating the association between intronic SNPs in CDKAL1 and susceptibility to type 2 diabetes. The results of this study illustrate how caution should be exercised when inferring causality from an association between disease-risk genotype and non-coding RNA expression.

  7. Potential relationship between single nucleotide polymorphisms used in forensic genetics and diseases or other traits in European population.

    PubMed

    Pombar-Gomez, Maria; Lopez-Lopez, Elixabet; Martin-Guerrero, Idoia; Garcia-Orad Carles, Africa; de Pancorbo, Marian M

    2015-05-01

    Single nucleotide polymorphisms (SNPs) are an interesting option to facilitate the analysis of highly degraded DNA by allowing the reduction of the size of the DNA amplicons. The SNPforID 52-plex panel is a clear example of the use of non-coding SNPs in forensic genetics. However, nonstop advances in studies of genetic polymorphisms are leading to the discovery of new associations between SNPs and diseases. The aim of this study was to perform a comprehensive review of the state of association between the 52 SNPs in the 52-plex panel and diseases or other traits related to their treatment, such as drug response characters. In order to achieve this goal, we have conducted a bioinformatic search for each SNP included in the panel and the SNPs in linkage disequilibrium (LD) with them in the European population (r (2)  > 0.8). A total of 424 SNPs (52 in the panel and 372 in LD) were investigated in PubMed, Scopus, and dbSNP databases. Our results show that three SNPs in the SNPforID 52-plex panel (rs2107612, rs1979255, rs1463729) have been associated with diseases such as hypertension or macular degeneration, as well as drug response. Similarly, three out of the 372 SNPs in LD (rs2107614, r (2)  = 0.859; rs765250, r (2)  = 0.858; rs11064560, r (2)  = 0,887) are also associated with various pathologies. In view of these results, we propose the need for a periodic review of the SNPs used in forensic genetics in order to keep their associations with diseases or related phenotypes updated and to evaluate their continuity in forensic panels for avoiding legal and ethical conflicts.

  8. Linkage Analysis in Autoimmune Addison's Disease: NFATC1 as a Potential Novel Susceptibility Locus.

    PubMed

    Mitchell, Anna L; Bøe Wolff, Anette; MacArthur, Katie; Weaver, Jolanta U; Vaidya, Bijay; Erichsen, Martina M; Darlay, Rebecca; Husebye, Eystein S; Cordell, Heather J; Pearce, Simon H S

    2015-01-01

    Autoimmune Addison's disease (AAD) is a rare, highly heritable autoimmune endocrinopathy. It is possible that there may be some highly penetrant variants which confer disease susceptibility that have yet to be discovered. DNA samples from 23 multiplex AAD pedigrees from the UK and Norway (50 cases, 67 controls) were genotyped on the Affymetrix SNP 6.0 array. Linkage analysis was performed using Merlin. EMMAX was used to carry out a genome-wide association analysis comparing the familial AAD cases to 2706 UK WTCCC controls. To explore some of the linkage findings further, a replication study was performed by genotyping 64 SNPs in two of the four linked regions (chromosomes 7 and 18), on the Sequenom iPlex platform in three European AAD case-control cohorts (1097 cases, 1117 controls). The data were analysed using a meta-analysis approach. In a parametric analysis, applying a rare dominant model, loci on chromosomes 7, 9 and 18 had LOD scores >2.8. In a non-parametric analysis, a locus corresponding to the HLA region on chromosome 6, known to be associated with AAD, had a LOD score >3.0. In the genome-wide association analysis, a SNP cluster on chromosome 2 and a pair of SNPs on chromosome 6 were associated with AAD (P <5x10-7). A meta-analysis of the replication study data demonstrated that three chromosome 18 SNPs were associated with AAD, including a non-synonymous variant in the NFATC1 gene. This linkage study has implicated a number of novel chromosomal regions in the pathogenesis of AAD in multiplex AAD families and adds further support to the role of HLA in AAD. The genome-wide association analysis has also identified a region of interest on chromosome 2. A replication study has demonstrated that the NFATC1 gene is worthy of future investigation, however each of the regions identified require further, systematic analysis.

  9. Genome-Wide Sequence Variation Identification and Floral-Associated Trait Comparisons Based on the Re-sequencing of the ‘Nagafu No. 2’ and ‘Qinguan’ Varieties of Apple (Malus domestica Borkh.)

    PubMed Central

    Xing, Libo; Zhang, Dong; Song, Xiaomin; Weng, Kai; Shen, Yawen; Li, Youmei; Zhao, Caiping; Ma, Juanjuan; An, Na; Han, Mingyu

    2016-01-01

    Apple (Malus domestica Borkh.) is a commercially important fruit worldwide. Detailed information on genomic DNA polymorphisms, which are important for understanding phenotypic traits, is lacking for the apple. We re-sequenced two elite apple varieties, ‘Nagafu No. 2’ and ‘Qinguan,’ which have different characteristics. We identified many genomic variations, including 2,771,129 single nucleotide polymorphisms (SNPs), 82,663 structural variations (SVs), and 1,572,803 insertion/deletions (INDELs) in ‘Nagafu No. 2’ and 2,262,888 SNPs, 63,764 SVs, and 1,294,060 INDELs in ‘Qinguan.’ The ‘SNP,’ ‘INDEL,’ and ‘SV’ distributions were non-random, with variation-rich or -poor regions throughout the genomes. In ‘Nagafu No. 2’ and ‘Qinguan’ there were 171,520 and 147,090 non-synonymous SNPs spanning 23,111 and 21,400 genes, respectively; 3,963 and 3,196 SVs in 3,431 and 2,815 genes, respectively; and 1,834 and 1,451 INDELs in 1,681 and 1,345 genes, respectively. Genetic linkage maps of 190 flowering genes associated with multiple flowering pathways in ‘Nagafu No. 2,’ ‘Qinguan,’ and ‘Golden Delicious,’ identified complex regulatory mechanisms involved in floral induction, flower bud formation, and flowering characteristics, which might reflect the genetic variation of the flowering genes. Expression profiling of key flowering genes in buds and leaves suggested that the photoperiod and autonomous flowering pathways are major contributors to the different floral-associated traits between ‘Nagafu No. 2’ and ‘Qinguan.’ The genome variation data provided a foundation for the further exploration of apple diversity and gene–phenotype relationships, and for future research on molecular breeding to improve apple and related species. PMID:27446138

  10. Genetic diversity and natural selection of Plasmodium knowlesi merozoite surface protein 1 paralog gene in Malaysia.

    PubMed

    Ahmed, Md Atique; Fauzi, Muh; Han, Eun-Taek

    2018-03-14

    Human infections due to the monkey malaria parasite Plasmodium knowlesi is on the rise in most Southeast Asian countries specifically Malaysia. The C-terminal 19 kDa domain of PvMSP1P is a potential vaccine candidate, however, no study has been conducted in the orthologous gene of P. knowlesi. This study investigates level of polymorphisms, haplotypes and natural selection of full-length pkmsp1p in clinical samples from Malaysia. A total of 36 full-length pkmsp1p sequences along with the reference H-strain and 40 C-terminal pkmsp1p sequences from clinical isolates of Malaysia were downloaded from published genomes. Genetic diversity, polymorphism, haplotype and natural selection were determined using DnaSP 5.10 and MEGA 5.0 software. Genealogical relationships were determined using haplotype network tree in NETWORK software v5.0. Population genetic differentiation index (F ST ) and population structure of parasite was determined using Arlequin v3.5 and STRUCTURE v2.3.4 software. Comparison of 36 full-length pkmsp1p sequences along with the H-strain identified 339 SNPs (175 non-synonymous and 164 synonymous substitutions). The nucleotide diversity across the full-length gene was low compared to its ortholog pvmsp1p. The nucleotide diversity was higher toward the N-terminal domains (pkmsp1p-83 and 30) compared to the C-terminal domains (pkmsp1p-38, 33 and 19). Phylogenetic analysis of full-length genes identified 2 distinct clusters of P. knowlesi from Malaysian Borneo. The 40 pkmsp1p-19 sequences showed low polymorphisms with 16 polymorphisms leading to 18 haplotypes. In total there were 10 synonymous and 6 non-synonymous substitutions and 12 cysteine residues were intact within the two EGF domains. Evidence of strong purifying selection was observed within the full-length sequences as well in all the domains. Shared haplotypes of 40 pkmsp1p-19 were identified within Malaysian Borneo haplotypes. This study is the first to report on the genetic diversity and natural selection of pkmsp1p. A low level of genetic diversity and strong evidence of negative selection was detected and observed in all the domains of pkmsp1p of P. knowlesi indicating functional constrains. Shared haplotypes were identified within pkmsp1p-19 highlighting further evaluation using larger number of clinical samples from Malaysia.

  11. PGen: large-scale genomic variations analysis workflow and browser in SoyKB.

    PubMed

    Liu, Yang; Khan, Saad M; Wang, Juexin; Rynge, Mats; Zhang, Yuanxun; Zeng, Shuai; Chen, Shiyuan; Maldonado Dos Santos, Joao V; Valliyodan, Babu; Calyam, Prasad P; Merchant, Nirav; Nguyen, Henry T; Xu, Dong; Joshi, Trupti

    2016-10-06

    With the advances in next-generation sequencing (NGS) technology and significant reductions in sequencing costs, it is now possible to sequence large collections of germplasm in crops for detecting genome-scale genetic variations and to apply the knowledge towards improvements in traits. To efficiently facilitate large-scale NGS resequencing data analysis of genomic variations, we have developed "PGen", an integrated and optimized workflow using the Extreme Science and Engineering Discovery Environment (XSEDE) high-performance computing (HPC) virtual system, iPlant cloud data storage resources and Pegasus workflow management system (Pegasus-WMS). The workflow allows users to identify single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), perform SNP annotations and conduct copy number variation analyses on multiple resequencing datasets in a user-friendly and seamless way. We have developed both a Linux version in GitHub ( https://github.com/pegasus-isi/PGen-GenomicVariations-Workflow ) and a web-based implementation of the PGen workflow integrated within the Soybean Knowledge Base (SoyKB), ( http://soykb.org/Pegasus/index.php ). Using PGen, we identified 10,218,140 single-nucleotide polymorphisms (SNPs) and 1,398,982 indels from analysis of 106 soybean lines sequenced at 15X coverage. 297,245 non-synonymous SNPs and 3330 copy number variation (CNV) regions were identified from this analysis. SNPs identified using PGen from additional soybean resequencing projects adding to 500+ soybean germplasm lines in total have been integrated. These SNPs are being utilized for trait improvement using genotype to phenotype prediction approaches developed in-house. In order to browse and access NGS data easily, we have also developed an NGS resequencing data browser ( http://soykb.org/NGS_Resequence/NGS_index.php ) within SoyKB to provide easy access to SNP and downstream analysis results for soybean researchers. PGen workflow has been optimized for the most efficient analysis of soybean data using thorough testing and validation. This research serves as an example of best practices for development of genomics data analysis workflows by integrating remote HPC resources and efficient data management with ease of use for biological users. PGen workflow can also be easily customized for analysis of data in other species.

  12. Hypoxia adaptations in the grey wolf (Canis lupus chanco) from Qinghai-Tibet Plateau.

    PubMed

    Zhang, Wenping; Fan, Zhenxin; Han, Eunjung; Hou, Rong; Zhang, Liang; Galaverni, Marco; Huang, Jie; Liu, Hong; Silva, Pedro; Li, Peng; Pollinger, John P; Du, Lianming; Zhang, XiuyYue; Yue, Bisong; Wayne, Robert K; Zhang, Zhihe

    2014-07-01

    The Tibetan grey wolf (Canis lupus chanco) occupies habitats on the Qinghai-Tibet Plateau, a high altitude (>3000 m) environment where low oxygen tension exerts unique selection pressure on individuals to adapt to hypoxic conditions. To identify genes involved in hypoxia adaptation, we generated complete genome sequences of nine Chinese wolves from high and low altitude populations at an average coverage of 25× coverage. We found that, beginning about 55,000 years ago, the highland Tibetan grey wolf suffered a more substantial population decline than lowland wolves. Positively selected hypoxia-related genes in highland wolves are enriched in the HIF signaling pathway (P = 1.57E-6), ATP binding (P = 5.62E-5), and response to an oxygen-containing compound (P≤5.30E-4). Of these positively selected hypoxia-related genes, three genes (EPAS1, ANGPT1, and RYR2) had at least one specific fixed non-synonymous SNP in highland wolves based on the nine genome data. Our re-sequencing studies on a large panel of individuals showed a frequency difference greater than 58% between highland and lowland wolves for these specific fixed non-synonymous SNPs and a high degree of LD surrounding the three genes, which imply strong selection. Past studies have shown that EPAS1 and ANGPT1 are important in the response to hypoxic stress, and RYR2 is involved in heart function. These three genes also exhibited significant signals of natural selection in high altitude human populations, which suggest similar evolutionary constraints on natural selection in wolves and humans of the Qinghai-Tibet Plateau.

  13. Nucleotide polymorphisms in the bovine lymphotoxin A gene and their distribution among Bos indicus zebu cattle breeds.

    PubMed

    Behl, Jyotsna Dhingra; Mishra, Priyanka; Verma, N K; Niranjan, S K; Dangi, P S; Sharma, Rekha; Behl, Rahul

    2016-03-15

    The present study was undertaken to characterize the genetic variation present in lymphoxin A gene (LTA gene) encoding for the lymphotoxin A protein also known as tumor necrosis factor beta, a cytokine produced by lymphocytes, known to be cytotoxic for a wide range of tumor cells both in vitro and in vivo, and, which is essential for normal immunological development; in 40 animals of 5 diverse Bos indicus Indian zebu cattle breeds. These breeds survive under the harsh and tough tropical climatic conditions of various parts of the Indian subcontinent. The LTA gene in the present study was observed to contain 33 SNPs and 3 small insertion/deletion polymorphisms. Four SNPs occurred in the coding regions of the gene viz. g.1327A>G and g.1400C>T in exon 2 and g.1840C>T and g.1942C>T in exon 3, of which the SNP g.1327A>G in exon 2 resulted in a non-synonymous amino acid change G38D. This amino acid change was however predicted not be affecting the protein function in any manner. The gene contained putative transcription factor binding sites for the c-Re1 and for Pax-4 transcription factors. A putative promoter region was also predicted on the reverse DNA strand from position 894 to 644. Several repeat elements and microsatellite repeats were detected to be occurring across the 3.2kb LTA gene sequence. The study showed the occurrence of 40 genotypes and 48 most probable haplotypes. The genotypes at the observed SNP positions in the LTA gene were in near Hardy-Weinberg equilibrium. A negative Tajima's D value that was not significant statistically at P>0.10 indicated that the neutral mutation hypothesis could not be excluded. The genetic variations observed in the LTA gene in the present study have not been reported earlier and these could possibly be used as molecular markers for further studies involving association of the gene variability with disease resistance/tolerance traits. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. Sequence Analysis of APOA5 Among the Kuwaiti Population Identifies Association of rs2072560, rs2266788, and rs662799 With TG and VLDL Levels

    PubMed Central

    Jasim, Anfal A.; Al-Bustan, Suzanne A.; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda

    2018-01-01

    Common variants of Apolipoprotein A5 (APOA5) have been associated with lipid levels yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. Sanger method was used in the direct sequencing of the full 3.7 Kb APOA5 and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3′ UTR 3222 C>T polymorphism at genomic position 116660500 based on human genome assembly GRCh37/hg:19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism. PMID:29686695

  15. Sequence Analysis of APOA5 Among the Kuwaiti Population Identifies Association of rs2072560, rs2266788, and rs662799 With TG and VLDL Levels.

    PubMed

    Jasim, Anfal A; Al-Bustan, Suzanne A; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda

    2018-01-01

    Common variants of Apolipoprotein A5 ( APOA 5) have been associated with lipid levels yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. Sanger method was used in the direct sequencing of the full 3.7 Kb APOA5 and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3' UTR 3222 C>T polymorphism at genomic position 116660500 based on human genome assembly GRCh37/hg:19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism.

  16. New genetic variants of LATS1 detected in urinary bladder and colon cancer.

    PubMed

    Saadeldin, Mona K; Shawer, Heba; Mostafa, Ahmed; Kassem, Neemat M; Amleh, Asma; Siam, Rania

    2014-01-01

    LATS1, the large tumor suppressor 1 gene, encodes for a serine/threonine kinase protein and is implicated in cell cycle progression. LATS1 is down-regulated in various human cancers, such as breast cancer, and astrocytoma. Point mutations in LATS1 were reported in human sarcomas. Additionally, loss of heterozygosity of LATS1 chromosomal region predisposes to breast, ovarian, and cervical tumors. In the current study, we investigated LATS1 genetic variations including single nucleotide polymorphisms (SNPs), in 28 Egyptian patients with either urinary bladder or colon cancers. The LATS1 gene was amplified and sequenced and the expression of LATS1 at the RNA level was assessed in 12 urinary bladder cancer samples. We report, the identification of a total of 29 variants including previously identified SNPs within LATS1 coding and non-coding sequences. A total of 18 variants were novel. Majority of the novel variants, 13, were mapped to intronic sequences and un-translated regions of the gene. Four of the five novel variants located in the coding region of the gene, represented missense mutations within the serine/threonine kinase catalytic domain. Interestingly, LATS1 RNA steady state levels was lost in urinary bladder cancerous tissue harboring four specific SNPs (16045 + 41736 + 34614 + 56177) positioned in the 5'UTR, intron 6, and two silent mutations within exon 4 and exon 8, respectively. This study identifies novel single-base-sequence alterations in the LATS1 gene. These newly identified variants could potentially be used as novel diagnostic or prognostic tools in cancer.

  17. In silico screening of the chicken genome for overlaps between genomic regions: microRNA genes, coding and non-coding transcriptional units, QTL, and genetic variations.

    PubMed

    Zorc, Minja; Kunej, Tanja

    2016-05-01

    MicroRNAs (miRNAs) are a class of non-coding RNAs involved in posttranscriptional regulation of target genes. Regulation requires complementarity between target mRNA and the mature miRNA seed region, responsible for their recognition and binding. It has been estimated that each miRNA targets approximately 200 genes, and genetic variability of miRNA genes has been reported to affect phenotypic variability and disease susceptibility in humans, livestock species, and model organisms. Polymorphisms in miRNA genes could therefore represent biomarkers for phenotypic traits in livestock animals. In our previous study, we collected polymorphisms within miRNA genes in chicken. In the present study, we identified miRNA-related genomic overlaps to prioritize genomic regions of interest for further functional studies and biomarker discovery. Overlapping genomic regions in chicken were analyzed using the following bioinformatics tools and databases: miRNA SNiPer, Ensembl, miRBase, NCBI Blast, and QTLdb. Out of 740 known pre-miRNA genes, 263 (35.5 %) contain polymorphisms; among them, 35 contain more than three polymorphisms The most polymorphic miRNA genes in chicken are gga-miR-6662, containing 23 single nucleotide polymorphisms (SNPs) within the pre-miRNA region, including five consecutive SNPs, and gga-miR-6688, containing ten polymorphisms including three consecutive polymorphisms. Several miRNA-related genomic hotspots have been revealed in chicken genome; polymorphic miRNA genes are located within protein-coding and/or non-coding transcription units and quantitative trait loci (QTL) associated with production traits. The present study includes the first description of an exonic miRNA in a chicken genome, an overlap between the miRNA gene and the exon of the protein-coding gene (gga-miR-6578/HADHB), and the first report of a missense polymorphism located within a mature miRNA seed region. Identified miRNA-related genomic hotspots in chicken can serve researchers as a starting point for further functional studies and association studies with poultry production and health traits and the basis for systematic screening of exonic miRNAs and missense/miRNA seed polymorphisms in other genomes.

  18. A house finch (Haemorhous mexicanus) spleen transcriptome reveals intra- and interspecific patterns of gene expression, alternative splicing and genetic diversity in passerines

    PubMed Central

    2014-01-01

    Background With its plumage color dimorphism and unique history in North America, including a recent population expansion and an epizootic of Mycoplasma gallisepticum (MG), the house finch (Haemorhous mexicanus) is a model species for studying sexual selection, plumage coloration and host-parasite interactions. As part of our ongoing efforts to make available genomic resources for this species, here we report a transcriptome assembly derived from genes expressed in spleen. Results We characterize transcriptomes from two populations with different histories of demography and disease exposure: a recently founded population in the eastern US that has been exposed to MG for over a decade and a native population from the western range that has never been exposed to MG. We utilize this resource to quantify conservation in gene expression in passerine birds over approximately 50 MY by comparing splenic expression profiles for 9,646 house finch transcripts and those from zebra finch and find that less than half of all genes expressed in spleen in either species are expressed in both species. Comparative gene annotations from several vertebrate species suggest that the house finch transcriptomes contain ~15 genes not yet found in previously sequenced vertebrate genomes. The house finch transcriptomes harbour ~85,000 SNPs, ~20,000 of which are non-synonymous. Although not yet validated by biological or technical replication, we identify a set of genes exhibiting differences between populations in gene expression (n = 182; 2% of all transcripts), allele frequencies (76 FST ouliers) and alternative splicing as well as genes with several fixed non-synonymous substitutions; this set includes genes with functions related to double-strand break repair and immune response. Conclusions The two house finch spleen transcriptome profiles will add to the increasing data on genome and transcriptome sequence information from natural populations. Differences in splenic expression between house finch and zebra finch imply either significant evolutionary turnover of splenic expression patterns or different physiological states of the individuals examined. The transcriptome resource will enhance the potential to annotate an eventual house finch genome, and the set of gene-based high-quality SNPs will help clarify the genetic underpinnings of host-pathogen interactions and sexual selection. PMID:24758272

  19. A SNP panel and online tool for checking genotype concordance through comparing QR codes.

    PubMed

    Du, Yonghong; Martin, Joshua S; McGee, John; Yang, Yuchen; Liu, Eric Yi; Sun, Yingrui; Geihs, Matthias; Kong, Xuejun; Zhou, Eric Lingfeng; Li, Yun; Huang, Jie

    2017-01-01

    In the current precision medicine era, more and more samples get genotyped and sequenced. Both researchers and commercial companies expend significant time and resources to reduce the error rate. However, it has been reported that there is a sample mix-up rate of between 0.1% and 1%, not to mention the possibly higher mix-up rate during the down-stream genetic reporting processes. Even on the low end of this estimate, this translates to a significant number of mislabeled samples, especially over the projected one billion people that will be sequenced within the next decade. Here, we first describe a method to identify a small set of Single nucleotide polymorphisms (SNPs) that can uniquely identify a personal genome, which utilizes allele frequencies of five major continental populations reported in the 1000 genomes project and the ExAC Consortium. To make this panel more informative, we added four SNPs that are commonly used to predict ABO blood type, and another two SNPs that are capable of predicting sex. We then implement a web interface (http://qrcme.tech), nicknamed QRC (for QR code based Concordance check), which is capable of extracting the relevant ID SNPs from a raw genetic data, coding its genotype as a quick response (QR) code, and comparing QR codes to report the concordance of underlying genetic datasets. The resulting 80 fingerprinting SNPs represent a significant decrease in complexity and the number of markers used for genetic data labelling and tracking. Our method and web tool is easily accessible to both researchers and the general public who consider the accuracy of complex genetic data as a prerequisite towards precision medicine.

  20. A SNP panel and online tool for checking genotype concordance through comparing QR codes

    PubMed Central

    Du, Yonghong; Martin, Joshua S.; McGee, John; Yang, Yuchen; Liu, Eric Yi; Sun, Yingrui; Geihs, Matthias; Kong, Xuejun; Zhou, Eric Lingfeng; Li, Yun

    2017-01-01

    In the current precision medicine era, more and more samples get genotyped and sequenced. Both researchers and commercial companies expend significant time and resources to reduce the error rate. However, it has been reported that there is a sample mix-up rate of between 0.1% and 1%, not to mention the possibly higher mix-up rate during the down-stream genetic reporting processes. Even on the low end of this estimate, this translates to a significant number of mislabeled samples, especially over the projected one billion people that will be sequenced within the next decade. Here, we first describe a method to identify a small set of Single nucleotide polymorphisms (SNPs) that can uniquely identify a personal genome, which utilizes allele frequencies of five major continental populations reported in the 1000 genomes project and the ExAC Consortium. To make this panel more informative, we added four SNPs that are commonly used to predict ABO blood type, and another two SNPs that are capable of predicting sex. We then implement a web interface (http://qrcme.tech), nicknamed QRC (for QR code based Concordance check), which is capable of extracting the relevant ID SNPs from a raw genetic data, coding its genotype as a quick response (QR) code, and comparing QR codes to report the concordance of underlying genetic datasets. The resulting 80 fingerprinting SNPs represent a significant decrease in complexity and the number of markers used for genetic data labelling and tracking. Our method and web tool is easily accessible to both researchers and the general public who consider the accuracy of complex genetic data as a prerequisite towards precision medicine. PMID:28926565

  1. [A preliminary study on the molecular characteristics of D-cycloserine resistance of Mycobacterium tuberculosis].

    PubMed

    Li, C; Li, G L; Luo, Q; Li, S J; Wang, R B; Lou, Y L; Lyu, J X; Wan, K L

    2017-02-10

    Objective: To investigate the relationship between D-cycloserine resistance and the gene mutations of alrA , ddlA and cycA of Mycobacterium ( M. ) tuberculosis , as well as the association between D-cycloserine resistance and spoligotyping genotyping. Methods: A total of 145 M. tuberculosis strains were selected from the strain bank. D-cycloserine resistant phenotypes of the strains were determined by the proportion method and the minimal inhibitory concentration was determined by resazurin microtiter assay. PCR amplification and DNA direct sequencing methods were used for the analysis of gene mutations. Relationship between the resistance phenotype and genotype was analyzed by chi -square test. Results: Of the 145 clinically collected strains, 24 (16.6%) of them were D-cycloserine resistant and 121 (83.4%) were sensitive. There were only synonymous mutations noticed on alrA , ddlA and cycA in sensitive strains. Of the 24 D-cycloserine resistant strains, 3 (12.5%) isolates' cycA and 1 (4.2%) isolates' alrA happened to be non-synonymous mutations, in which the codes were 188, 318 and 508 of cycA , and 261 of alrA , respectively. Results on drug sensitivity tests confirmed the minimal inhibitory concentration of the mutant strains were all increased to some degrees. The D-cycloserine resistant rates of 88 Beijing genotype and 57 non-Beijing genotype strains were 20.5% and 10.5% , respectively, but with no statistically significant difference ( χ (2) =2.47, P >0.05). Conclusions: The non-synonymous mutations of alrA and cycA might contribute to one of the mechanisms of M. tuberculosis D-cycloserine resistance. M. tuberculosis Beijing genotype or non-Beijing genotype was not considered to be associated with the D-cycloserine resistance.

  2. Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort

    PubMed Central

    Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Masters, Bettie Sue Siler; Martásek, Pavel

    2015-01-01

    Background Gene polymorphisms encoding the enzyme NADPH–cytochrome P450 oxidoreductase (POR) contribute to inter-individual differences in drug response. Aim To estimate polymorphic allele frequencies of the POR gene in a Czech Slavic population. Materials & Methods The gene POR was analyzed in 322 Czech Slavic individuals from a control cohort by sequencing and HRM analysis. Results Twenty-five SNP genetic variations were identified. Of these variants, 7 were new, unreported SNPs, including two SNPs in the 5´flanking region (g.4965 C>T and g.4994 G>T), one intronic variant (c.1899 −20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared to wild type. Conclusion New POR variant identification indicates that the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYPs in the endoplasmic reticulum. PMID:25712184

  3. Role of Exonic Variation in Chemokine Receptor Genes on AIDS: CCRL2 F167Y Association with Pneumocystis Pneumonia

    PubMed Central

    An, Ping; Li, Rongling; Wang, Ji Ming; Yoshimura, Teizo; Takahashi, Munehisa; Samudralal, Ram; O'Brien, Stephen J.; Phair, John; Goedert, James J.; Kirk, Gregory D.; Troyer, Jennifer L.; Sezgin, Efe; Buchbinder, Susan P.; Donfield, Sharyne; Nelson, George W.; Winkler, Cheryl A.

    2011-01-01

    Chromosome 3p21–22 harbors two clusters of chemokine receptor genes, several of which serve as major or minor coreceptors of HIV-1. Although the genetic association of CCR5 and CCR2 variants with HIV-1 pathogenesis is well known, the role of variation in other nearby chemokine receptor genes remain unresolved. We genotyped exonic single nucleotide polymorphisms (SNPs) in chemokine receptor genes: CCR3, CCRL2, and CXCR6 (at 3p21) and CCR8 and CX3CR1 (at 3p22), the majority of which were non-synonymous. The individual SNPs were tested for their effects on disease progression and outcomes in five treatment-naïve HIV-1/AIDS natural history cohorts. In addition to the known CCR5 and CCR2 associations, significant associations were identified for CCR3, CCR8, and CCRL2 on progression to AIDS. A multivariate survival analysis pointed to a previously undetected association of a non-conservative amino acid change F167Y in CCRL2 with AIDS progression: 167F is associated with accelerated progression to AIDS (RH = 1.90, P = 0.002, corrected). Further analysis indicated that CCRL2-167F was specifically associated with more rapid development of pneumocystis pneumonia (PCP) (RH = 2.84, 95% CI 1.28–6.31) among four major AIDS–defining conditions. Considering the newly defined role of CCRL2 in lung dendritic cell trafficking, this atypical chemokine receptor may affect PCP through immune regulation and inducing inflammation. PMID:22046140

  4. Genomic Locus Modulating IOP in the BXD RI Mouse Strains

    PubMed Central

    King, Rebecca; Li, Ying; Wang, Jiaxing; Struebing, Felix L.; Geisert, Eldon E.

    2018-01-01

    Intraocular pressure (IOP) is the primary risk factor for developing glaucoma, yet little is known about the contribution of genomic background to IOP regulation. The present study leverages an array of systems genetics tools to study genomic factors modulating normal IOP in the mouse. The BXD recombinant inbred (RI) strain set was used to identify genomic loci modulating IOP. We measured the IOP in a total of 506 eyes from 38 different strains. Strain averages were subjected to conventional quantitative trait analysis by means of composite interval mapping. Candidate genes were defined, and immunohistochemistry and quantitative PCR (qPCR) were used for validation. Of the 38 BXD strains examined the mean IOP ranged from a low of 13.2mmHg to a high of 17.1mmHg. The means for each strain were used to calculate a genome wide interval map. One significant quantitative trait locus (QTL) was found on Chr.8 (96 to 103 Mb). Within this 7 Mb region only 4 annotated genes were found: Gm15679, Cdh8, Cdh11 and Gm8730. Only two genes (Cdh8 and Cdh11) were candidates for modulating IOP based on the presence of non-synonymous SNPs. Further examination using SIFT (Sorting Intolerant From Tolerant) analysis revealed that the SNPs in Cdh8 (Cadherin 8) were predicted to not change protein function; while the SNPs in Cdh11 (Cadherin 11) would not be tolerated, affecting protein function. Furthermore, immunohistochemistry demonstrated that CDH11 is expressed in the trabecular meshwork of the mouse. We have examined the genomic regulation of IOP in the BXD RI strain set and found one significant QTL on Chr. 8. Within this QTL, there is one good candidate gene, Cdh11. PMID:29496776

  5. Diversity and population structure of northern switchgrass as revealed through exome capture sequencing

    DOE PAGES

    Evans, Joseph; Crisovan, Emily; Barry, Kerrie; ...

    2015-10-01

    Panicum virgatum L. (switchgrass) is a polyploid, perennial grass species that is native to North America, and is being developed as a future biofuel feedstock crop. Switchgrass is present primarily in two ecotypes: a northern upland ecotype, composed of tetraploid and octoploid accessions, and a southern lowland ecotype, composed of primarily tetraploid accessions. We employed high-coverage exome capture sequencing (~2.4 Tb) to genotype 537 individuals from 45 upland and 21 lowland populations. From these data, we identified ~27 million single-nucleotide polymorphisms (SNPs), of which 1 590 653 high-confidence SNPs were used in downstream analyses of diversity within and between themore » populations. From the 66 populations, we identified five primary population groups within the upland and lowland ecotypes, a result that was further supported through genetic distance analysis. We identified conserved, ecotype-restricted, non-synonymous SNPs that are predicted to affect the protein function of CONSTANS (CO) and EARLY HEADING DATE 1 (EHD1), key genes involved in flowering, which may contribute to the phenotypic differences between the two ecotypes. We also identified, relative to the near-reference Kanlow population, 17 228 genes present in more copies than in the reference genome (up-CNVs), 112 630 genes present in fewer copies than in the reference genome (down-CNVs) and 14 430 presence/absence variants (PAVs), affecting a total of 9979 genes, including two upland-specific CNV clusters. In total, 45 719 genes were affected by an SNP, CNV, or PAV across the panel, providing a firm foundation to identify functional variation associated with phenotypic traits of interest for biofuel feedstock production.« less

  6. Diversity and population structure of northern switchgrass as revealed through exome capture sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Evans, Joseph; Crisovan, Emily; Barry, Kerrie

    Panicum virgatum L. (switchgrass) is a polyploid, perennial grass species that is native to North America, and is being developed as a future biofuel feedstock crop. Switchgrass is present primarily in two ecotypes: a northern upland ecotype, composed of tetraploid and octoploid accessions, and a southern lowland ecotype, composed of primarily tetraploid accessions. We employed high-coverage exome capture sequencing (~2.4 Tb) to genotype 537 individuals from 45 upland and 21 lowland populations. From these data, we identified ~27 million single-nucleotide polymorphisms (SNPs), of which 1 590 653 high-confidence SNPs were used in downstream analyses of diversity within and between themore » populations. From the 66 populations, we identified five primary population groups within the upland and lowland ecotypes, a result that was further supported through genetic distance analysis. We identified conserved, ecotype-restricted, non-synonymous SNPs that are predicted to affect the protein function of CONSTANS (CO) and EARLY HEADING DATE 1 (EHD1), key genes involved in flowering, which may contribute to the phenotypic differences between the two ecotypes. We also identified, relative to the near-reference Kanlow population, 17 228 genes present in more copies than in the reference genome (up-CNVs), 112 630 genes present in fewer copies than in the reference genome (down-CNVs) and 14 430 presence/absence variants (PAVs), affecting a total of 9979 genes, including two upland-specific CNV clusters. In total, 45 719 genes were affected by an SNP, CNV, or PAV across the panel, providing a firm foundation to identify functional variation associated with phenotypic traits of interest for biofuel feedstock production.« less

  7. Intergenic disease-associated regions are abundant in novel transcripts.

    PubMed

    Bartonicek, N; Clark, M B; Quek, X C; Torpy, J R; Pritchard, A L; Maag, J L V; Gloss, B S; Crawford, J; Taft, R J; Hayward, N K; Montgomery, G W; Mattick, J S; Mercer, T R; Dinger, M E

    2017-12-28

    Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. This resource of previously unreported transcripts in disease-associated regions ( http://gwas-captureseq.dingerlab.org ) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets.

  8. Geographic Structuring of the Plasmodium falciparum Sarco(endo)plasmic Reticulum Ca2+ ATPase (PfSERCA) Gene Diversity

    PubMed Central

    Pinto, João; Gribaldo, Simonetta; Legrand, Eric; Niang, Makhtar; Kim, Nimol; Pharath, Lim; Volnay, Béatrice; Ekala, Marie Therese; Bouchier, Christiane; Fandeur, Thierry; Berzosa, Pedro; Benito, Agustin; Ferreira, Isabel Dinis; Ferreira, Cynthia; Vieira, Pedro Paulo; Alecrim, Maria das Graças; Mercereau-Puijalon, Odile; Cravo, Pedro

    2010-01-01

    Artemisinin, a thapsigargin-like sesquiterpene has been shown to inhibit the Plasmodium falciparum sarco/endoplasmic reticulum calcium-ATPase PfSERCA. To collect baseline pfserca sequence information before field deployment of Artemisinin-based Combination therapies that may select mutant parasites, we conducted a sequence analysis of 100 isolates from multiple sites in Africa, Asia and South America. Coding sequence diversity was large, with 29 mutated codons, including 32 SNPs (average of one SNP/115 bp), of which 19 were novel mutations. Most SNP detected in this study were clustered within a region in the cytosolic head of the protein. The PfSERCA functional domains were very well conserved, with non synonymous mutations located outside the functional domains, except for the S769N mutation associated in French Guiana with elevated IC50 for artemether. The S769N mutation is located close to the hinge of the headpiece, which in other species modulates calcium affinity and in consequence efficacy of inhibitors, possibly linking calcium homeostasis to drug resistance. Genetic diversity was highest in Senegal, Brazil and French Guiana, and few mutations were identified in Asia. Population genetic analysis was conducted for a partial fragment of the gene encompassing nucleotide coordinates 87-2862 (unambiguous sequence available for 96 isolates). This supported a geographic clustering, with a separation between Old and New World samples and one dominant ancestral haplotype. Genetic drift alone cannot explain the observed polymorphism, suggesting that other evolutionary mechanisms are operating. One possible contributor could be the frequency of haemoglobinopathies that are associated with calcium dysregulation in the erythrocyte. PMID:20195531

  9. Identification and characterization of transcript polymorphisms in soybean lines varying in oil composition and content

    PubMed Central

    2014-01-01

    Background Variation in seed oil composition and content among soybean varieties is largely attributed to differences in transcript sequences and/or transcript accumulation of oil production related genes in seeds. Discovery and analysis of sequence and expression variations in these genes will accelerate soybean oil quality improvement. Results In an effort to identify these variations, we sequenced the transcriptomes of soybean seeds from nine lines varying in oil composition and/or total oil content. Our results showed that 69,338 distinct transcripts from 32,885 annotated genes were expressed in seeds. A total of 8,037 transcript expression polymorphisms and 50,485 transcript sequence polymorphisms (48,792 SNPs and 1,693 small Indels) were identified among the lines. Effects of the transcript polymorphisms on their encoded protein sequences and functions were predicted. The studies also provided independent evidence that the lack of FAD2-1A gene activity and a non-synonymous SNP in the coding sequence of FAB2C caused elevated oleic acid and stearic acid levels in soybean lines M23 and FAM94-41, respectively. Conclusions As a proof-of-concept, we developed an integrated RNA-seq and bioinformatics approach to identify and functionally annotate transcript polymorphisms, and demonstrated its high effectiveness for discovery of genetic and transcript variations that result in altered oil quality traits. The collection of transcript polymorphisms coupled with their predicted functional effects will be a valuable asset for further discovery of genes, gene variants, and functional markers to improve soybean oil quality. PMID:24755115

  10. Functional analysis of four naturally occurring variants of human constitutive androstane receptor.

    PubMed

    Ikeda, Shinobu; Kurose, Kouichi; Jinno, Hideto; Sai, Kimie; Ozawa, Shogo; Hasegawa, Ryuichi; Komamura, Kazuo; Kotake, Takeshi; Morishita, Hideki; Kamakura, Shiro; Kitakaze, Masafumi; Tomoike, Hitonobu; Tamura, Tomohide; Yamamoto, Noboru; Kunitoh, Hideo; Yamada, Yasuhide; Ohe, Yuichiro; Shimada, Yasuhiro; Shirao, Kuniaki; Kubota, Kaoru; Minami, Hironobu; Ohtsu, Atsushi; Yoshida, Teruhiko; Saijo, Nagahiro; Saito, Yoshiro; Sawada, Jun-ichi

    2005-01-01

    The human constitutive androstane receptor (CAR, NR1I3) is a member of the orphan nuclear receptor superfamily that plays an important role in the control of drug metabolism and disposition. In this study, we sequenced all the coding exons of the NR1I3 gene for 334 Japanese subjects. We identified three novel single nucleotide polymorphisms (SNPs) that induce non-synonymous alterations of amino acids (His246Arg, Leu308Pro, and Asn323Ser) residing in the ligand-binding domain of CAR, in addition to the Val133Gly variant, which was another CAR variant identified in our previous study. We performed functional analysis of these four naturally occurring CAR variants in COS-7 cells using a CYP3A4 promoter/enhancer reporter gene that includes the CAR responsive elements. The His246Arg variant caused marked reductions in both transactivation of the reporter gene and in the response to 6-(4-chlorophenyl)imidazo[2,1-b][1,3]thiazole-5-carbaldehyde O-(3,4-dichlorobenzyl)oxime (CITCO), which is a human CAR-specific agonist. The transactivation ability of the Leu308Pro variant was also significantly decreased, but its responsiveness to CITCO was not abrogated. The transactivation ability and CITCO response of the Val133Gly and Asn323Ser variants did not change as compared to the wild-type CAR. These data suggest that the His246Arg and Leu308Pro variants, especially His246Arg, may influence the expression of drug-metabolizing enzymes and transporters that are transactivated by CAR.

  11. Exome sequencing identifies complex I NDUFV2 mutations as a novel cause of Leigh syndrome.

    PubMed

    Cameron, Jessie M; MacKay, Nevena; Feigenbaum, Annette; Tarnopolsky, Mark; Blaser, Susan; Robinson, Brian H; Schulze, Andreas

    2015-09-01

    Two siblings with hypertrophic cardiomyopathy and brain atrophy were diagnosed with Complex I deficiency based on low enzyme activity in muscle and high lactate/pyruvate ratio in fibroblasts. Whole exome sequencing results of fibroblast gDNA from one sibling was narrowed down to 190 SNPs or In/Dels in 185 candidate genes by selecting non-synonymous coding sequence base pair changes that were not present in the SNP database. Two compound heterozygous mutations were identified in both siblings in NDUFV2, encoding the 24 kDa subunit of Complex I. The intronic mutation (c.IVS2 + 1delGTAA) is disease causing and has been reported before. The other mutation is novel (c.669_670insG, p.Ser224Valfs*3) and predicted to cause a pathogenic frameshift in the protein. Subsequent investigation of 10 probands with complex I deficiency from different families revealed homozygosity for the intronic c.IVS2 + 1delGTAA mutation in a second, consanguineous family. In this family three of five siblings were affected. Interestingly, they presented with Leigh syndrome but no cardiac involvement. The same genotype had been reported previously in a two families but presenting with hypertrophic cardiomyopathy, trunk hypotonia and encephalopathy. We have identified NDUFV2 mutations in two families with Complex I deficiency, including a novel mutation. The diagnosis of Leigh syndrome expands the clinical phenotypes associated with the c.IVS2 + 1delGTAA mutation in this gene. Copyright © 2015 European Paediatric Neurology Society. Published by Elsevier Ltd. All rights reserved.

  12. A coding single-nucleotide polymorphism in lysine demethylase KDM4A associates with increased sensitivity to mTOR inhibitors.

    PubMed

    Van Rechem, Capucine; Black, Joshua C; Greninger, Patricia; Zhao, Yang; Donado, Carlos; Burrowes, Paul D; Ladd, Brendon; Christiani, David C; Benes, Cyril H; Whetstine, Johnathan R

    2015-03-01

    SNPs occur within chromatin-modulating factors; however, little is known about how these variants within the coding sequence affect cancer progression or treatment. Therefore, there is a need to establish their biochemical and/or molecular contribution, their use in subclassifying patients, and their impact on therapeutic response. In this report, we demonstrate that coding SNP-A482 within the lysine tridemethylase gene KDM4A/JMJD2A has different allelic frequencies across ethnic populations, associates with differential outcome in patients with non-small cell lung cancer (NSCLC), and promotes KDM4A protein turnover. Using an unbiased drug screen against 87 preclinical and clinical compounds, we demonstrate that homozygous SNP-A482 cells have increased mTOR inhibitor sensitivity. mTOR inhibitors significantly reduce SNP-A482 protein levels, which parallels the increased drug sensitivity observed with KDM4A depletion. Our data emphasize the importance of using variant status as candidate biomarkers and highlight the importance of studying SNPs in chromatin modifiers to achieve better targeted therapy. This report documents the first coding SNP within a lysine demethylase that associates with worse outcome in patients with NSCLC. We demonstrate that this coding SNP alters the protein turnover and associates with increased mTOR inhibitor sensitivity, which identifies a candidate biomarker for mTOR inhibitor therapy and a therapeutic target for combination therapy. ©2015 American Association for Cancer Research.

  13. Deep Resequencing of GWAS Loci Identifies Rare Variants in CARD9, IL23R and RNF186 That Are Associated with Ulcerative Colitis

    PubMed Central

    Boucher, Gabrielle; Lo, Ken Sin; Rivas, Manuel A.; Stevens, Christine; Alikashani, Azadeh; Ladouceur, Martin; Ellinghaus, David; Törkvist, Leif; Goel, Gautam; Lagacé, Caroline; Annese, Vito; Bitton, Alain; Begun, Jakob; Brant, Steve R.; Bresso, Francesca; Cho, Judy H.; Duerr, Richard H.; Halfvarson, Jonas; McGovern, Dermot P. B.; Radford-Smith, Graham; Schreiber, Stefan; Schumm, Philip L.; Sharma, Yashoda; Silverberg, Mark S.; Weersma, Rinse K.; D'Amato, Mauro; Vermeire, Severine; Franke, Andre; Lettre, Guillaume; Xavier, Ramnik J.; Daly, Mark J.; Rioux, John D.

    2013-01-01

    Genome-wide association studies and follow-up meta-analyses in Crohn's disease (CD) and ulcerative colitis (UC) have recently identified 163 disease-associated loci that meet genome-wide significance for these two inflammatory bowel diseases (IBD). These discoveries have already had a tremendous impact on our understanding of the genetic architecture of these diseases and have directed functional studies that have revealed some of the biological functions that are important to IBD (e.g. autophagy). Nonetheless, these loci can only explain a small proportion of disease variance (∼14% in CD and 7.5% in UC), suggesting that not only are additional loci to be found but that the known loci may contain high effect rare risk variants that have gone undetected by GWAS. To test this, we have used a targeted sequencing approach in 200 UC cases and 150 healthy controls (HC), all of French Canadian descent, to study 55 genes in regions associated with UC. We performed follow-up genotyping of 42 rare non-synonymous variants in independent case-control cohorts (totaling 14,435 UC cases and 20,204 HC). Our results confirmed significant association to rare non-synonymous coding variants in both IL23R and CARD9, previously identified from sequencing of CD loci, as well as identified a novel association in RNF186. With the exception of CARD9 (OR = 0.39), the rare non-synonymous variants identified were of moderate effect (OR = 1.49 for RNF186 and OR = 0.79 for IL23R). RNF186 encodes a protein with a RING domain having predicted E3 ubiquitin-protein ligase activity and two transmembrane domains. Importantly, the disease-coding variant is located in the ubiquitin ligase domain. Finally, our results suggest that rare variants in genes identified by genome-wide association in UC are unlikely to contribute significantly to the overall variance for the disease. Rather, these are expected to help focus functional studies of the corresponding disease loci. PMID:24068945

  14. Prevalence and Molecular Characterization of Glucose-6-Phosphate Dehydrogenase Deficiency at the China-Myanmar Border.

    PubMed

    Li, Qing; Yang, Fang; Liu, Rong; Luo, Lan; Yang, Yuling; Zhang, Lu; Liu, Huaie; Zhang, Wen; Fan, Zhixiang; Yang, Zhaoqing; Cui, Liwang; He, Yongshu

    2015-01-01

    Glucose-6-phosphate dehydrogenase (G6PD) deficiency is an X-linked hereditary disease that predisposes red blood cells to oxidative damage. G6PD deficiency is particularly prevalent in historically malaria-endemic areas. Use of primaquine for malaria treatment may result in severe hemolysis in G6PD deficient patients. In this study, we systematically evaluated the prevalence of G6PD deficiency in the Kachin (Jingpo) ethnic group along the China-Myanmar border and determined the underlying G6PD genotypes. We surveyed G6PD deficiency in 1770 adult individuals (671 males and 1099 females) of the Kachin ethnicity using a G6PD fluorescent spot test. The overall prevalence of G6PD deficiency in the study population was 29.6% (523/1770), among which 27.9% and 30.6% were males and females, respectively. From these G6PD deficient samples, 198 unrelated individuals (147 females and 51 males) were selected for genotyping at 11 known G6PD single nucleotide polymorphisms (SNPs) in Southeast Asia (ten in exons and one in intron 11) using a multiplex SNaPshot assay. Mutations with known association to a deficient phenotype were detected in 43.9% (87/198) of cases, intronic and synonymous mutations were detected alone in 34.8% (69/198) cases and no mutation were found in 21.2% (42/198) cases. Five non-synonymous mutations, Mahidol 487G>A, Kaiping 1388G>A, Canton 1376G>T, Chinese 4 392G>T, and Viangchan 871G>A were detected. Of the 87 cases with known deficient mutations, the Mahidol variant was the most common (89.7%; 78/87), followed by the Kaiping (8.0%; 7/87) and the Viangchan (2.2%; 2/87) variants. The Canton and Chinese 4 variants were found in 1.1% of these 87 cases. Among them, two females carried the Mahidol/Viangchan and Mahidol/Kaiping double mutations, respectively. Interestingly, the silent SNPs 1311C>T and IVS11nt93T>C both occurred in the same 95 subjects with frequencies at 56.4% and 23.5% in tested females and males, respectively (P<0.05). It is noteworthy that 24 subjects carrying the Mahidol mutation and two carrying the Kaiping mutation also carried the 1311C>T/IVS11nt93T>C SNPs. Further studies are needed to determine the enzyme levels of the G6PD deficient people and presence of additional G6PD mutations in the study population.

  15. Development of a set of SNP markers present in expressed genes of the apple.

    PubMed

    Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S

    2008-11-01

    Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.

  16. A non-synonymous SNP within the isopentenyl transferase 2 locus is associated with kernel weight in Chinese maize inbreds (Zea mays L.).

    PubMed

    Weng, Jianfeng; Li, Bo; Liu, Changlin; Yang, Xiaoyan; Wang, Hongwei; Hao, Zhuanfang; Li, Mingshun; Zhang, Degui; Ci, Xiaoke; Li, Xinhai; Zhang, Shihuang

    2013-07-05

    Kernel weight, controlled by quantitative trait loci (QTL), is an important component of grain yield in maize. Cytokinins (CKs) participate in determining grain morphology and final grain yield in crops. ZmIPT2, which is expressed mainly in the basal transfer cell layer, endosperm, and embryo during maize kernel development, encodes an isopentenyl transferase (IPT) that is involved in CK biosynthesis. The coding region of ZmIPT2 was sequenced across a panel of 175 maize inbred lines that are currently used in Chinese maize breeding programs. Only 16 single nucleotide polymorphisms (SNPs) and seven haplotypes were detected among these inbred lines. Nucleotide diversity (π) within the ZmIPT2 window and coding region were 0.347 and 0.0047, respectively, and they were significantly lower than the mean nucleotide diversity value of 0.372 for maize Chromosome 2 (P < 0.01). Association mapping revealed that a single nucleotide change from cytosine (C) to thymine (T) in the ZmIPT2 coding region, which converted a proline residue into a serine residue, was significantly associated with hundred kernel weight (HKW) in three environments (P <0.05), and explained 4.76% of the total phenotypic variation. In vitro characterization suggests that the dimethylallyl diphospate (DMAPP) IPT activity of ZmIPT2-T is higher than that of ZmIPT2-C, as the amounts of adenosine triphosphate (ATP), adenosine diphosphate (ADP), and adenosine monophosphate (AMP) consumed by ZmIPT2-T were 5.48-, 2.70-, and 1.87-fold, respectively, greater than those consumed by ZmIPT2-C. The effects of artificial selection on the ZmIPT2 coding region were evaluated using Tajima's D tests across six subgroups of Chinese maize germplasm, with the most frequent favorable allele identified in subgroup PB (Partner B). These results showed that ZmIPT2, which is associated with kernel weight, was subjected to artificial selection during the maize breeding process. ZmIPT2-T had higher IPT activity than ZmIPT2-C, and this favorable allele for kernel weight could be used in molecular marker-assisted selection for improvement of grain yield components in Chinese maize breeding programs.

  17. Pharmacogenetic screening for polymorphisms in drug-metabolizing enzymes and drug transporters in a Dutch population.

    PubMed

    Bosch, T M; Doodeman, V D; Smits, P H M; Meijerman, I; Schellens, J H M; Beijnen, J H

    2006-01-01

    A possible explanation for the wide interindividual variability in toxicity and efficacy of drug therapy is variation in genes encoding drug-metabolizing enzymes and drug transporters. The allelic frequency of these genetic variants, linkage disequilibrium (LD), and haplotype of these polymorphisms are important parameters in determining the genetic differences between patients. The aim of this study was to explore the frequencies of polymorphisms in drug-metabolizing enzymes (CYP1A1, CYP2C9, CYP2C19, CYP3A4, CYP2D6, CYP3A5, DPYD, UGT1A1, GSTM1, GSTP1, GSTT1) and drug transporters (ABCB1[MDR1] and ABCC2[MRP2]), and to investigate the LD and perform haplotype analysis of these polymorphisms in a Dutch population. Blood samples were obtained from 100 healthy volunteers and genomic DNA was isolated and amplified by PCR. The amplification products were sequenced and analyzed for the presence of polymorphisms by sequence alignment. In the study population, we identified 13 new single nucleotide polymorphisms (SNPs) in Caucasians and three new SNPs in non-Caucasians, in addition to previously recognized SNPs. Three of the new SNPs were found within exons, of which two resulted in amino acid changes (A428T in CYP2C9 resulting in the amino acid substitution D143V; and C4461T in ABCC2 in a non-Caucasian producing the amino acid change T1476M). Several LDs and haplotypes were found in the Caucasian individuals. In this Dutch population, the frequencies of 16 new SNPs and those of previously recognized SNPs were determined in genes coding for drug-metabolizing enzymes and drug transporters. Several LDs and haplotypes were also inferred. These data are important for further research to help explain the interindividual pharmacokinetic and pharmacodynamic variability in response to drug therapy.

  18. DR-GAS: a database of functional genetic variants and their phosphorylation states in human DNA repair systems.

    PubMed

    Sehgal, Manika; Singh, Tiratha Raj

    2014-04-01

    We present DR-GAS(1), a unique, consolidated and comprehensive DNA repair genetic association studies database of human DNA repair system. It presents information on repair genes, assorted mechanisms of DNA repair, linkage disequilibrium, haplotype blocks, nsSNPs, phosphorylation sites, associated diseases, and pathways involved in repair systems. DNA repair is an intricate process which plays an essential role in maintaining the integrity of the genome by eradicating the damaging effect of internal and external changes in the genome. Hence, it is crucial to extensively understand the intact process of DNA repair, genes involved, non-synonymous SNPs which perhaps affect the function, phosphorylated residues and other related genetic parameters. All the corresponding entries for DNA repair genes, such as proteins, OMIM IDs, literature references and pathways are cross-referenced to their respective primary databases. DNA repair genes and their associated parameters are either represented in tabular or in graphical form through images elucidated by computational and statistical analyses. It is believed that the database will assist molecular biologists, biotechnologists, therapeutic developers and other scientific community to encounter biologically meaningful information, and meticulous contribution of genetic level information towards treacherous diseases in human DNA repair systems. DR-GAS is freely available for academic and research purposes at: http://www.bioinfoindia.org/drgas. Copyright © 2014 Elsevier B.V. All rights reserved.

  19. Characterization of Heterobasidion occidentale transcriptomes reveals candidate genes and DNA polymorphisms for virulence variations.

    PubMed

    Liu, Jun-Jun; Shamoun, Simon Francis; Leal, Isabel; Kowbel, Robert; Sumampong, Grace; Zamany, Arezoo

    2018-05-01

    Characterization of genes involved in differentiation of pathogen species and isolates with variations of virulence traits provides valuable information to control tree diseases for meeting the challenges of sustainable forest health and phytosanitary trade issues. Lack of genetic knowledge and genomic resources hinders novel gene discovery, molecular mechanism studies and development of diagnostic tools in the management of forest pathogens. Here, we report on transcriptome profiling of Heterobasidion occidentale isolates with contrasting virulence levels. Comparative transcriptomic analysis identified orthologous groups exclusive to H. occidentale and its isolates, revealing biological processes involved in the differentiation of isolates. Further bioinformatics analyses identified an H. occidentale secretome, CYPome and other candidate effectors, from which genes with species- and isolate-specific expression were characterized. A large proportion of differentially expressed genes were revealed to have putative activities as cell wall modification enzymes and transcription factors, suggesting their potential roles in virulence and fungal pathogenesis. Next, large numbers of simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were detected, including more than 14 000 interisolate non-synonymous SNPs. These polymorphic loci and species/isolate-specific genes may contribute to virulence variations and provide ideal DNA markers for development of diagnostic tools and investigation of genetic diversity. © 2018 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.

  20. A meta-analysis of public microarray data identifies biological regulatory networks in Parkinson's disease.

    PubMed

    Su, Lining; Wang, Chunjie; Zheng, Chenqing; Wei, Huiping; Song, Xiaoqing

    2018-04-13

    Parkinson's disease (PD) is a long-term degenerative disease that is caused by environmental and genetic factors. The networks of genes and their regulators that control the progression and development of PD require further elucidation. We examine common differentially expressed genes (DEGs) from several PD blood and substantia nigra (SN) microarray datasets by meta-analysis. Further we screen the PD-specific genes from common DEGs using GCBI. Next, we used a series of bioinformatics software to analyze the miRNAs, lncRNAs and SNPs associated with the common PD-specific genes, and then identify the mTF-miRNA-gene-gTF network. Our results identified 36 common DEGs in PD blood studies and 17 common DEGs in PD SN studies, and five of the genes were previously known to be associated with PD. Further study of the regulatory miRNAs associated with the common PD-specific genes revealed 14 PD-specific miRNAs in our study. Analysis of the mTF-miRNA-gene-gTF network about PD-specific genes revealed two feed-forward loops: one involving the SPRK2 gene, hsa-miR-19a-3p and SPI1, and the second involving the SPRK2 gene, hsa-miR-17-3p and SPI. The long non-coding RNA (lncRNA)-mediated regulatory network identified lncRNAs associated with PD-specific genes and PD-specific miRNAs. Moreover, single nucleotide polymorphism (SNP) analysis of the PD-specific genes identified two significant SNPs, and SNP analysis of the neurodegenerative disease-specific genes identified seven significant SNPs. Most of these SNPs are present in the 3'-untranslated region of genes and are controlled by several miRNAs. Our study identified a total of 53 common DEGs in PD patients compared with healthy controls in blood and brain datasets and five of these genes were previously linked with PD. Regulatory network analysis identified PD-specific miRNAs, associated long non-coding RNA and feed-forward loops, which contribute to our understanding of the mechanisms underlying PD. The SNPs identified in our study can determine whether a genetic variant is associated with PD. Overall, these findings will help guide our study of the complex molecular mechanism of PD.

  1. regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.

    PubMed

    Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong

    2017-09-01

    While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.

  2. Population-specific variation in haplotype composition and heterozygosity at the POLB locus.

    PubMed

    Yamtich, Jennifer; Speed, William C; Straka, Eva; Kidd, Judith R; Sweasy, Joann B; Kidd, Kenneth K

    2009-05-01

    DNA polymerase beta plays a central role in base excision repair (BER), which removes large numbers of endogenous DNA lesions from each cell on a daily basis. Little is currently known about germline polymorphisms within the POLB locus, making it difficult to study the association of variants at this locus with human diseases such as cancer. Yet, approximately thirty percent of human tumor types show variants of DNA polymerase beta. We have assessed the global frequency distributions of coding and common non-coding SNPs in and flanking the POLB gene for a total of 14 sites typed in approximately 2400 individuals from anthropologically defined human populations worldwide. We have found a marked difference between haplotype frequencies in African populations and in non-African populations.

  3. Semantic Relationships between Contextual Synonyms

    ERIC Educational Resources Information Center

    Zeng, Xian-mo

    2007-01-01

    Contextual synonym is a linguistic phenomenon often applied but rarely discussed. This paper is to discuss the semantic relationships between contextual synonyms and the requirements under which words can be used as contextual synonyms between each other. The three basic relationships are embedment, intersection and non-coherence. The requirements…

  4. Evolutionary and functional mitogenomics associated with the genetic restoration of the Florida panther

    USGS Publications Warehouse

    Ochoa, Alexander; Onorato, David P.; Fitak, Robert R.; Roelke-Parker, Melody; Culver, Melanie

    2017-01-01

    Florida panthers are endangered pumas that currently persist in reduced patches of habitat in South Florida, USA. We performed mitogenome reference-based assemblies for most parental lines of the admixed Florida panthers that resulted from the introduction of female Texas pumas into South Florida in 1995. With the addition of 2 puma mitogenomes, we characterized 174 single nucleotide polymorphisms (SNPs) across 12 individuals. We defined 5 haplotypes (Pco1–Pco5), one of which (Pco1) had a geographic origin exclusive to Costa Rica and Panama and was possibly introduced into the Everglades National Park, Florida, prior to 1995. Haplotype Pco2 was native to Florida. Haplotypes Pco3 and Pco4 were exclusive to Texas, whereas haplotype Pco5 had an undetermined geographic origin. Phylogenetic inference suggests that haplotypes Pco1–Pco4 diverged ~202000 (95% HPDI = 83000–345000) years ago and that haplotypes Pco2–Pco4 diverged ~61000 (95% HPDI = 9000–127000) years ago. These results are congruent with a south-to-north continental expansion and with a recent North American colonization by pumas. Furthermore, pumas may have migrated from Texas to Florida no earlier than ~44000 (95% HPDI = 2000–98000) years ago. Synonymous mutations presented a greater mean substitution rate than other mitochondrial functional regions: nonsynonymous mutations, tRNAs, rRNAs, and control region. Similarly, all protein-coding genes were under predominant negative selection constraints. We directly and indirectly assessed the presence of potential deleterious SNPs in the ND2 and ND5 genes in Florida panthers prior to and as a consequence of the introduction of Texas pumas. Screenings for such variants are recommended in extant Florida panthers.

  5. Comprehensive Analysis of Non-Synonymous Natural Variants of G Protein-Coupled Receptors.

    PubMed

    Kim, Hee Ryung; Duc, Nguyen Minh; Chung, Ka Young

    2018-03-01

    G protein-coupled receptors (GPCRs) are the largest superfamily of transmembrane receptors and have vital signaling functions in various organs. Because of their critical roles in physiology and pathology, GPCRs are the most commonly used therapeutic target. It has been suggested that GPCRs undergo massive genetic variations such as genetic polymorphisms and DNA insertions or deletions. Among these genetic variations, non-synonymous natural variations change the amino acid sequence and could thus alter GPCR functions such as expression, localization, signaling, and ligand binding, which may be involved in disease development and altered responses to GPCR-targeting drugs. Despite the clinical importance of GPCRs, studies on the genotype-phenotype relationship of GPCR natural variants have been limited to a few GPCRs such as β-adrenergic receptors and opioid receptors. Comprehensive understanding of non-synonymous natural variations within GPCRs would help to predict the unknown genotype-phenotype relationship and yet-to-be-discovered natural variants. Here, we analyzed the non-synonymous natural variants of all non-olfactory GPCRs available from a public database, UniProt. The results suggest that non-synonymous natural variations occur extensively within the GPCR superfamily especially in the N-terminus and transmembrane domains. Within the transmembrane domains, natural variations observed more frequently in the conserved residues, which leads to disruption of the receptor function. Our analysis also suggests that only few non-synonymous natural variations have been studied in efforts to link the variations with functional consequences.

  6. Endurance Exercise Ability in the Horse: A Trait with Complex Polygenic Determinism

    PubMed Central

    Ricard, Anne; Robert, Céline; Blouin, Christine; Baste, Fanny; Torquet, Gwendoline; Morgenthaler, Caroline; Rivière, Julie; Mach, Nuria; Mata, Xavier; Schibler, Laurent; Barrey, Eric

    2017-01-01

    Endurance horses are able to run at more than 20 km/h for 160 km (in bouts of 30–40 km). This level of performance is based on intense aerobic metabolism, effective body heat dissipation and the ability to endure painful exercise. The known heritabilities of endurance performance and exercise-related physiological traits in Arabian horses suggest that adaptation to extreme endurance exercise is influenced by genetic factors. The objective of the present genome-wide association study (GWAS) was to identify single nucleotide polymorphisms (SNPs) related to endurance racing performance in 597 Arabian horses. The performance traits studied were the total race distance, average race speed and finishing status (qualified, eliminated or retired). We used three mixed models that included a fixed allele or genotype effect and a random, polygenic effect. Quantile-quantile plots were acceptable, and the regression coefficients for actual vs. expected log10 p-values ranged from 0.865 to 1.055. The GWAS revealed five significant quantitative trait loci (QTL) corresponding to 6 SNPs on chromosomes 6, 1, 7, 16, and 29 (two SNPs) with corrected p-values from 1.7 × 10−6 to 1.8 × 10−5. Annotation of these 5 QTL revealed two genes: sortilin-related VPS10-domain-containing receptor 3 (SORCS3) on chromosome 1 is involved in protein trafficking, and solute carrier family 39 member 12 (SLC39A12) on chromosome 29 is active in zinc transport and cell homeostasis. These two coding genes could be involved in neuronal tissues (CNS). The other QTL on chromosomes 6, 7, and 16 may be involved in the regulation of the gene expression through non-coding RNAs, CpG islands and transcription factor binding sites. On chromosome 6, a new candidate equine long non-coding RNA (KCNQ1OT1 ortholog: opposite antisense transcript 1 of potassium voltage-gated channel subfamily Q member 1 gene) was predicted in silico and validated by RT-qPCR in primary cultures of equine myoblasts and fibroblasts. This lncRNA could be one element of the cardiac rhythm regulation. Our GWAS revealed that equine performance during endurance races is a complex polygenic trait, and is partially governed by at least 5 QTL: two coding genes involved in neuronal tissues and three other loci with many regulatory functions such as slowing down heart rate. PMID:28702049

  7. Endurance Exercise Ability in the Horse: A Trait with Complex Polygenic Determinism.

    PubMed

    Ricard, Anne; Robert, Céline; Blouin, Christine; Baste, Fanny; Torquet, Gwendoline; Morgenthaler, Caroline; Rivière, Julie; Mach, Nuria; Mata, Xavier; Schibler, Laurent; Barrey, Eric

    2017-01-01

    Endurance horses are able to run at more than 20 km/h for 160 km (in bouts of 30-40 km). This level of performance is based on intense aerobic metabolism, effective body heat dissipation and the ability to endure painful exercise. The known heritabilities of endurance performance and exercise-related physiological traits in Arabian horses suggest that adaptation to extreme endurance exercise is influenced by genetic factors. The objective of the present genome-wide association study (GWAS) was to identify single nucleotide polymorphisms (SNPs) related to endurance racing performance in 597 Arabian horses. The performance traits studied were the total race distance, average race speed and finishing status (qualified, eliminated or retired). We used three mixed models that included a fixed allele or genotype effect and a random, polygenic effect. Quantile-quantile plots were acceptable, and the regression coefficients for actual vs. expected log 10 p -values ranged from 0.865 to 1.055. The GWAS revealed five significant quantitative trait loci (QTL) corresponding to 6 SNPs on chromosomes 6, 1, 7, 16, and 29 (two SNPs) with corrected p -values from 1.7 × 10 -6 to 1.8 × 10 -5 . Annotation of these 5 QTL revealed two genes: sortilin-related VPS10-domain-containing receptor 3 ( SORCS3 ) on chromosome 1 is involved in protein trafficking, and solute carrier family 39 member 12 ( SLC39A12 ) on chromosome 29 is active in zinc transport and cell homeostasis. These two coding genes could be involved in neuronal tissues (CNS). The other QTL on chromosomes 6, 7, and 16 may be involved in the regulation of the gene expression through non-coding RNAs, CpG islands and transcription factor binding sites. On chromosome 6, a new candidate equine long non-coding RNA ( KCNQ1OT1 ortholog: opposite antisense transcript 1 of potassium voltage-gated channel subfamily Q member 1 gene) was predicted in silico and validated by RT-qPCR in primary cultures of equine myoblasts and fibroblasts. This lncRNA could be one element of the cardiac rhythm regulation. Our GWAS revealed that equine performance during endurance races is a complex polygenic trait, and is partially governed by at least 5 QTL: two coding genes involved in neuronal tissues and three other loci with many regulatory functions such as slowing down heart rate.

  8. Polymorphism discovery and allele frequency estimation using high-throughput DNA sequencing of target-enriched pooled DNA samples

    PubMed Central

    2012-01-01

    Background The central role of the somatotrophic axis in animal post-natal growth, development and fertility is well established. Therefore, the identification of genetic variants affecting quantitative traits within this axis is an attractive goal. However, large sample numbers are a pre-requisite for the identification of genetic variants underlying complex traits and although technologies are improving rapidly, high-throughput sequencing of large numbers of complete individual genomes remains prohibitively expensive. Therefore using a pooled DNA approach coupled with target enrichment and high-throughput sequencing, the aim of this study was to identify polymorphisms and estimate allele frequency differences across 83 candidate genes of the somatotrophic axis, in 150 Holstein-Friesian dairy bulls divided into two groups divergent for genetic merit for fertility. Results In total, 4,135 SNPs and 893 indels were identified during the resequencing of the 83 candidate genes. Nineteen percent (n = 952) of variants were located within 5' and 3' UTRs. Seventy-two percent (n = 3,612) were intronic and 9% (n = 464) were exonic, including 65 indels and 236 SNPs resulting in non-synonymous substitutions (NSS). Significant (P < 0.01) mean allele frequency differentials between the low and high fertility groups were observed for 720 SNPs (58 NSS). Allele frequencies for 43 of the SNPs were also determined by genotyping the 150 individual animals (Sequenom® MassARRAY). No significant differences (P > 0.1) were observed between the two methods for any of the 43 SNPs across both pools (i.e., 86 tests in total). Conclusions The results of the current study support previous findings of the use of DNA sample pooling and high-throughput sequencing as a viable strategy for polymorphism discovery and allele frequency estimation. Using this approach we have characterised the genetic variation within genes of the somatotrophic axis and related pathways, central to mammalian post-natal growth and development and subsequent lactogenesis and fertility. We have identified a large number of variants segregating at significantly different frequencies between cattle groups divergent for calving interval plausibly harbouring causative variants contributing to heritable variation. To our knowledge, this is the first report describing sequencing of targeted genomic regions in any livestock species using groups with divergent phenotypes for an economically important trait. PMID:22235840

  9. Genetic Associations with Plasma B12, B6, and Folate Levels in an Ischemic Stroke Population from the Vitamin Intervention for Stroke Prevention (VISP) Trial.

    PubMed

    Keene, Keith L; Chen, Wei-Min; Chen, Fang; Williams, Stephen R; Elkhatib, Stacey D; Hsu, Fang-Chi; Mychaleckyj, Josyf C; Doheny, Kimberly F; Pugh, Elizabeth W; Ling, Hua; Laurie, Cathy C; Gogarten, Stephanie M; Madden, Ebony B; Worrall, Bradford B; Sale, Michele M

    2014-01-01

    B vitamins play an important role in homocysteine metabolism, with vitamin deficiencies resulting in increased levels of homocysteine and increased risk for stroke. We performed a genome-wide association study (GWAS) in 2,100 stroke patients from the Vitamin Intervention for Stroke Prevention (VISP) trial, a clinical trial designed to determine whether the daily intake of high-dose folic acid, vitamins B6, and B12 reduce recurrent cerebral infarction. Extensive quality control (QC) measures resulted in a total of 737,081 SNPs for analysis. Genome-wide association analyses for baseline quantitative measures of folate, Vitamins B12, and B6 were completed using linear regression approaches, implemented in PLINK. Six associations met or exceeded genome-wide significance (P ≤ 5 × 10(-08)). For baseline Vitamin B12, the strongest association was observed with a non-synonymous SNP (nsSNP) located in the CUBN gene (P = 1.76 × 10(-13)). Two additional CUBN intronic SNPs demonstrated strong associations with B12 (P = 2.92 × 10(-10) and 4.11 × 10(-10)), while a second nsSNP, located in the TCN1 gene, also reached genome-wide significance (P = 5.14 × 10(-11)). For baseline measures of Vitamin B6, we identified genome-wide significant associations for SNPs at the ALPL locus (rs1697421; P = 7.06 × 10(-10) and rs1780316; P = 2.25 × 10(-08)). In addition to the six genome-wide significant associations, nine SNPs (two for Vitamin B6, six for Vitamin B12, and one for folate measures) provided suggestive evidence for association (P ≤ 10(-07)). Our GWAS study has identified six genome-wide significant associations, nine suggestive associations, and successfully replicated 5 of 16 SNPs previously reported to be associated with measures of B vitamins. The six genome-wide significant associations are located in gene regions that have shown previous associations with measures of B vitamins; however, four of the nine suggestive associations represent novel finding and warrant further investigation in additional populations.

  10. Codon Optimizing for Increased Membrane Protein Production: A Minimalist Approach.

    PubMed

    Mirzadeh, Kiavash; Toddo, Stephen; Nørholm, Morten H H; Daley, Daniel O

    2016-01-01

    Reengineering a gene with synonymous codons is a popular approach for increasing production levels of recombinant proteins. Here we present a minimalist alternative to this method, which samples synonymous codons only at the second and third positions rather than the entire coding sequence. As demonstrated with two membrane-embedded transporters in Escherichia coli, the method was more effective than optimizing the entire coding sequence. The method we present is PCR based and requires three simple steps: (1) the design of two PCR primers, one of which is degenerate; (2) the amplification of a mini-library by PCR; and (3) screening for high-expressing clones.

  11. QTL-seq approach identified genomic regions and diagnostic markers for rust and late leaf spot resistance in groundnut (Arachis hypogaea L.).

    PubMed

    Pandey, Manish K; Khan, Aamir W; Singh, Vikas K; Vishwakarma, Manish K; Shasidhar, Yaduru; Kumar, Vinay; Garg, Vanika; Bhat, Ramesh S; Chitikineni, Annapurna; Janila, Pasupuleti; Guo, Baozhu; Varshney, Rajeev K

    2017-08-01

    Rust and late leaf spot (LLS) are the two major foliar fungal diseases in groundnut, and their co-occurrence leads to significant yield loss in addition to the deterioration of fodder quality. To identify candidate genomic regions controlling resistance to rust and LLS, whole-genome resequencing (WGRS)-based approach referred as 'QTL-seq' was deployed. A total of 231.67 Gb raw and 192.10 Gb of clean sequence data were generated through WGRS of resistant parent and the resistant and susceptible bulks for rust and LLS. Sequence analysis of bulks for rust and LLS with reference-guided resistant parent assembly identified 3136 single-nucleotide polymorphisms (SNPs) for rust and 66 SNPs for LLS with the read depth of ≥7 in the identified genomic region on pseudomolecule A03. Detailed analysis identified 30 nonsynonymous SNPs affecting 25 candidate genes for rust resistance, while 14 intronic and three synonymous SNPs affecting nine candidate genes for LLS resistance. Subsequently, allele-specific diagnostic markers were identified for three SNPs for rust resistance and one SNP for LLS resistance. Genotyping of one RIL population (TAG 24 × GPBD 4) with these four diagnostic markers revealed higher phenotypic variation for these two diseases. These results suggest usefulness of QTL-seq approach in precise and rapid identification of candidate genomic regions and development of diagnostic markers for breeding applications. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  12. Targeted deep sequencing identifies rare loss-of-function variants in IFNGR1 for risk of atopic dermatitis complicated by eczema herpeticum.

    PubMed

    Gao, Li; Bin, Lianghua; Rafaels, Nicholas M; Huang, Lili; Potee, Joseph; Ruczinski, Ingo; Beaty, Terri H; Paller, Amy S; Schneider, Lynda C; Gallo, Rich; Hanifin, Jon M; Beck, Lisa A; Geha, Raif S; Mathias, Rasika A; Barnes, Kathleen C; Leung, Donald Y M

    2015-12-01

    A subset of atopic dermatitis is associated with increased susceptibility to eczema herpeticum (ADEH+). We previously reported that common single nucleotide polymorphisms (SNPs) in the IFN-γ (IFNG) and IFN-γ receptor 1 (IFNGR1) genes were associated with the ADEH+ phenotype. We sought to interrogate the role of rare variants in interferon pathway genes for the risk of ADEH+. We performed targeted sequencing of interferon pathway genes (IFNG, IFNGR1, IFNAR1, and IL12RB1) in 228 European American patients with AD selected according to their eczema herpeticum status, and severity was measured by using the Eczema Area and Severity Index. Replication genotyping was performed in independent samples of 219 European American and 333 African American subjects. Functional investigation of loss-of-function variants was conducted by using site-directed mutagenesis. We identified 494 single nucleotide variants encompassing 105 kb of sequence, including 145 common, 349 (70.6%) rare (minor allele frequency <5%), and 86 (17.4%) novel variants, of which 2.8% were coding synonymous, 93.3% were noncoding (64.6% intronic), and 3.8% were missense. We identified 6 rare IFNGR1 missense variants, including 3 damaging variants (Val14Met [V14M], Val61Ile, and Tyr397Cys [Y397C]) conferring a higher risk for ADEH+ (P = .031). Variants V14M and Y397C were confirmed to be deleterious, leading to partial IFNGR1 deficiency. Seven common IFNGR1 SNPs, along with common protective haplotypes (2-7 SNPs), conferred a reduced risk of ADEH+ (P = .015-.002 and P = .0015-.0004, respectively), and both SNP and haplotype associations were replicated in an independent African American sample (P = .004-.0001 and P = .001-.0001, respectively). Our results provide evidence that both genetic variants in the gene encoding IFNGR1 are implicated in susceptibility to the ADEH+ phenotype. Copyright © 2015 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.

  13. Single nucleotide polymorphism-specific regulation of matrix metalloproteinase-9 by multiple miRNAs targeting the coding exon

    PubMed Central

    Duellman, Tyler; Warren, Christopher; Yang, Jay

    2014-01-01

    Microribonucleic acids (miRNAs) work with exquisite specificity and are able to distinguish a target from a non-target based on a single nucleotide mismatch in the core nucleotide domain. We questioned whether miRNA regulation of gene expression could occur in a single nucleotide polymorphism (SNP)-specific manner, manifesting as a post-transcriptional control of expression of genetic polymorphisms. In our recent study of the functional consequences of matrix metalloproteinase (MMP)-9 SNPs, we discovered that expression of a coding exon SNP in the pro-domain of the protein resulted in a profound decrease in the secreted protein. This missense SNP results in the N38S amino acid change and a loss of an N-glycosylation site. A systematic study demonstrated that the loss of secreted protein was due not to the loss of an N-glycosylation site, but rather an SNP-specific targeting by miR-671-3p and miR-657. Bioinformatics analysis identified 41 SNP-specific miRNA targeting MMP-9 SNPs, mostly in the coding exon and an extension of the analysis to chromosome 20, where the MMP-9 gene is located, suggesting that SNP-specific miRNAs targeting the coding exon are prevalent. This selective post-transcriptional regulation of a target messenger RNA harboring genetic polymorphisms by miRNAs offers an SNP-dependent post-transcriptional regulatory mechanism, allowing for polymorphic-specific differential gene regulation. PMID:24627221

  14. High-resolution melting analysis of the single nucleotide polymorphism hot-spot region in the rpoB gene as an indicator of reduced susceptibility to rifaximin in Clostridium difficile.

    PubMed

    Pecavar, Verena; Blaschitz, Marion; Hufnagl, Peter; Zeinzinger, Josef; Fiedler, Anita; Allerberger, Franz; Maass, Matthias; Indra, Alexander

    2012-06-01

    Clostridium difficile, a Gram-positive, spore-forming, anaerobic bacterium, is the main causative agent of hospital-acquired diarrhoea worldwide. In addition to metronidazole and vancomycin, rifaximin, a rifamycin derivative, is a promising antibiotic for the treatment of recurring C. difficile infections (CDI). However, exposure of C. difficile to this antibiotic has led to the development of rifaximin-resistance due to point mutations in the β-subunit of the RNA polymerase (rpoB) gene. In the present study, 348 C. difficile strains with known PCR-ribotypes were investigated for respective single nucleotide polymorphisms (SNPs) within the proposed rpoB hot-spot region by using high-resolution melting (HRM) analysis. This method allows the detection of SNPs by comparing the altered melting behaviour of dsDNA with that of wild-type DNA. Discrimination between wild-type and mutant strains was enhanced by creating heteroduplexes by mixing sample DNA with wild-type DNA, leading to characteristic melting curve shapes from samples containing SNPs in the respective rpoB section. In the present study, we were able to identify 16 different rpoB sequence-types (ST) by sequencing analysis of a 325 bp fragment. The 16 PCR STs displayed a total of 24 different SNPs. Fifteen of these 24 SNPs were located within the proposed 151 bp SNP hot-spot region, resulting in 11 different HRM curve profiles (CP). Eleven SNPs (seven of which were within the proposed hot-spot region) led to amino acid substitutions associated with reduced susceptibility to rifaximin and 13 SNPs (eight of which were within the hot-spot region) were synonymous. This investigation clearly demonstrates that HRM analysis of the proposed SNP hot-spot region in the rpoB gene of C. difficile is a fast and cost-effective method for the identification of C. difficile samples with reduced susceptibility to rifaximin and even allows simultaneous SNP subtyping of the respective C. difficile isolates.

  15. Polymorphisms of the bovine DKK2 and their associations with body measurement traits and meat quality traits in Qinchuan cattle.

    PubMed

    Zhan, Xiaoli; Gao, Jianbin; Huangfu, Yifan; Fu, Changzhen; Zan, Linsen

    2013-12-01

    The objective of this research were to detect bovine Dickkopf 2 (DKK2) gene polymorphism and analyze their associations with body measurement traits (BMT) and meat quality traits (MQT) of animals. Blood samples were taken from a total of 541 Qinchuan cattle aged from 18 to 24 months. Polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) was employed to find out DKK2 single-polymorphism nucleotide (SNPs) and to explore their possible association with BMT and MQT. Sequence analysis of DKK2 gene revealed 2 SNPs (C29 T and A169C) in 5' untranslated region (5'UTR) of exon 1.C29T and A164T SNPs are both synonymous mutation, which showed 2 genotypes namely (CC, CT) and (AA and AC), respectively. Association analysis of polymorphism with body measurement and meat quality traits at the two locus showed that there were significant effects on CT, BL, RL, PBW, BFT, LMA, and IFC. These results suggest that the DKK2 gene might have potential effects on BMT and MQT in Qinchuan cattle population and could be used for marker-assisted selection.

  16. Application of whole genome sequence data in analyzing the molecular epidemiology of Shiga toxin-producing Escherichia coli O157:H7/H.

    PubMed

    Yokoyama, Eiji; Hirai, Shinichiro; Ishige, Taichiro; Murakami, Satoshi

    2018-01-02

    Seventeen clusters of Shiga toxin-producing Escherichia coli O157:H7/- (O157) strains, determined by cluster analysis of pulsed-field gel electrophoresis patterns, were analyzed using whole genome sequence (WGS) data to investigate this pathogen's molecular epidemiology. The 17 clusters included 136 strains containing strains from nine outbreaks, with each outbreak caused by a single source contaminated with the organism, as shown by epidemiological contact surveys. WGS data of these strains were used to identify single nucleotide polymorphisms (SNPs) by two methods: short read data were directly mapped to a reference genome (mapping derived SNPs) and common SNPs between the mapping derived SNPs and SNPs in assembled data of short read data (common SNPs). Among both SNPs, those that were detected in genes with a gap were excluded to remove ambiguous SNPs from further analysis. The effectiveness of both SNPs was investigated among all the concatenated SNPs that were detected (whole SNP set); SNPs were divided into three categories based on the genes in which they were located (i.e., backbone SNP set, O-island SNP set, and mobile element SNP set); and SNPs in non-coding regions (intergenic region SNP set). When SNPs from strains isolated from the nine single source derived outbreaks were analyzed using an unweighted pair group method with arithmetic mean tree (UPGMA) and a minimum spanning tree (MST), the maximum pair-wise distances of the backbone SNP set of the mapping derived SNPs were significantly smaller than those of the whole and intergenic region SNP set on both UPGMAs and MSTs. This significant difference was also observed when the backbone SNP set of the common SNPs were examined (Steel-Dwass test, P≤0.01). When the maximum pair-wise distances were compared between the mapping derived and common SNPs, significant differences were observed in those of the whole, mobile element, and intergenic region SNP set (Wilcoxon signed rank test, P≤0.01). When all the strains included in one complex on an MST or one cluster on a UPGMA were designated as the same genotype, the values of the Hunter-Gaston Discriminatory Power Index for the backbone SNP set of the mapping derived and common SNPs were higher than those of other SNP sets. In contrast, the mobile element SNP set could not robustly subdivide lineage I strains of tested O157 strains using both the mapping derived and common SNPs. These results suggested that the backbone SNP set were the most effective for analysis of WGS data for O157 in enabling an appropriation of its molecular epidemiology. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. PredictProtein—an open resource for online prediction of protein structural and functional features

    PubMed Central

    Yachdav, Guy; Kloppmann, Edda; Kajan, Laszlo; Hecht, Maximilian; Goldberg, Tatyana; Hamp, Tobias; Hönigschmid, Peter; Schafferhans, Andrea; Roos, Manfred; Bernhofer, Michael; Richter, Lothar; Ashkenazy, Haim; Punta, Marco; Schlessinger, Avner; Bromberg, Yana; Schneider, Reinhard; Vriend, Gerrit; Sander, Chris; Ben-Tal, Nir; Rost, Burkhard

    2014-01-01

    PredictProtein is a meta-service for sequence analysis that has been predicting structural and functional features of proteins since 1992. Queried with a protein sequence it returns: multiple sequence alignments, predicted aspects of structure (secondary structure, solvent accessibility, transmembrane helices (TMSEG) and strands, coiled-coil regions, disulfide bonds and disordered regions) and function. The service incorporates analysis methods for the identification of functional regions (ConSurf), homology-based inference of Gene Ontology terms (metastudent), comprehensive subcellular localization prediction (LocTree3), protein–protein binding sites (ISIS2), protein–polynucleotide binding sites (SomeNA) and predictions of the effect of point mutations (non-synonymous SNPs) on protein function (SNAP2). Our goal has always been to develop a system optimized to meet the demands of experimentalists not highly experienced in bioinformatics. To this end, the PredictProtein results are presented as both text and a series of intuitive, interactive and visually appealing figures. The web server and sources are available at http://ppopen.rostlab.org. PMID:24799431

  18. Meta-analysis identifies 29 additional ulcerative colitis risk loci, increasing the number of confirmed associations to 47.

    PubMed

    Anderson, Carl A; Boucher, Gabrielle; Lees, Charlie W; Franke, Andre; D'Amato, Mauro; Taylor, Kent D; Lee, James C; Goyette, Philippe; Imielinski, Marcin; Latiano, Anna; Lagacé, Caroline; Scott, Regan; Amininejad, Leila; Bumpstead, Suzannah; Baidoo, Leonard; Baldassano, Robert N; Barclay, Murray; Bayless, Theodore M; Brand, Stephan; Büning, Carsten; Colombel, Jean-Frédéric; Denson, Lee A; De Vos, Martine; Dubinsky, Marla; Edwards, Cathryn; Ellinghaus, David; Fehrmann, Rudolf S N; Floyd, James A B; Florin, Timothy; Franchimont, Denis; Franke, Lude; Georges, Michel; Glas, Jürgen; Glazer, Nicole L; Guthery, Stephen L; Haritunians, Talin; Hayward, Nicholas K; Hugot, Jean-Pierre; Jobin, Gilles; Laukens, Debby; Lawrance, Ian; Lémann, Marc; Levine, Arie; Libioulle, Cecile; Louis, Edouard; McGovern, Dermot P; Milla, Monica; Montgomery, Grant W; Morley, Katherine I; Mowat, Craig; Ng, Aylwin; Newman, William; Ophoff, Roel A; Papi, Laura; Palmieri, Orazio; Peyrin-Biroulet, Laurent; Panés, Julián; Phillips, Anne; Prescott, Natalie J; Proctor, Deborah D; Roberts, Rebecca; Russell, Richard; Rutgeerts, Paul; Sanderson, Jeremy; Sans, Miquel; Schumm, Philip; Seibold, Frank; Sharma, Yashoda; Simms, Lisa A; Seielstad, Mark; Steinhart, A Hillary; Targan, Stephan R; van den Berg, Leonard H; Vatn, Morten; Verspaget, Hein; Walters, Thomas; Wijmenga, Cisca; Wilson, David C; Westra, Harm-Jan; Xavier, Ramnik J; Zhao, Zhen Z; Ponsioen, Cyriel Y; Andersen, Vibeke; Torkvist, Leif; Gazouli, Maria; Anagnou, Nicholas P; Karlsen, Tom H; Kupcinskas, Limas; Sventoraityte, Jurgita; Mansfield, John C; Kugathasan, Subra; Silverberg, Mark S; Halfvarson, Jonas; Rotter, Jerome I; Mathew, Christopher G; Griffiths, Anne M; Gearry, Richard; Ahmad, Tariq; Brant, Steven R; Chamaillard, Mathias; Satsangi, Jack; Cho, Judy H; Schreiber, Stefan; Daly, Mark J; Barrett, Jeffrey C; Parkes, Miles; Annese, Vito; Hakonarson, Hakon; Radford-Smith, Graham; Duerr, Richard H; Vermeire, Séverine; Weersma, Rinse K; Rioux, John D

    2011-03-01

    Genome-wide association studies and candidate gene studies in ulcerative colitis have identified 18 susceptibility loci. We conducted a meta-analysis of six ulcerative colitis genome-wide association study datasets, comprising 6,687 cases and 19,718 controls, and followed up the top association signals in 9,628 cases and 12,917 controls. We identified 29 additional risk loci (P < 5 × 10(-8)), increasing the number of ulcerative colitis-associated loci to 47. After annotating associated regions using GRAIL, expression quantitative trait loci data and correlations with non-synonymous SNPs, we identified many candidate genes that provide potentially important insights into disease pathogenesis, including IL1R2, IL8RA-IL8RB, IL7R, IL12B, DAP, PRDM1, JAK2, IRF5, GNA12 and LSP1. The total number of confirmed inflammatory bowel disease risk loci is now 99, including a minimum of 28 shared association signals between Crohn's disease and ulcerative colitis.

  19. High throughput SNP discovery and genotyping in hexaploid wheat.

    PubMed

    Rimbert, Hélène; Darrier, Benoît; Navarro, Julien; Kitt, Jonathan; Choulet, Frédéric; Leveugle, Magalie; Duarte, Jorge; Rivière, Nathalie; Eversole, Kellye; Le Gouis, Jacques; Davassi, Alessandro; Balfourier, François; Le Paslier, Marie-Christine; Berard, Aurélie; Brunel, Dominique; Feuillet, Catherine; Poncet, Charles; Sourdille, Pierre; Paux, Etienne

    2018-01-01

    Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research.

  20. Shannon Entropy of the Canonical Genetic Code

    NASA Astrophysics Data System (ADS)

    Nemzer, Louis

    The probability that a non-synonymous point mutation in DNA will adversely affect the functionality of the resultant protein is greatly reduced if the substitution is conservative. In that case, the amino acid coded by the mutated codon has similar physico-chemical properties to the original. Many simplified alphabets, which group the 20 common amino acids into families, have been proposed. To evaluate these schema objectively, we introduce a novel, quantitative method based on the inherent redundancy in the canonical genetic code. By calculating the Shannon information entropy carried by 1- or 2-bit messages, groupings that best leverage the robustness of the code are identified. The relative importance of properties related to protein folding - like hydropathy and size - and function, including side-chain acidity, can also be estimated. In addition, this approach allows us to quantify the average information value of nucleotide codon positions, and explore the physiological basis for distinguishing between transition and transversion mutations. Supported by NSU PFRDG Grant #335347.

  1. Structural and functional effects of nucleotide variation on the human TB drug metabolizing enzyme arylamine N-acetyltransferase 1.

    PubMed

    Cloete, Ruben; Akurugu, Wisdom A; Werely, Cedric J; van Helden, Paul D; Christoffels, Alan

    2017-08-01

    The human arylamine N-acetyltransferase 1 (NAT1) enzyme plays a vital role in determining the duration of action of amine-containing drugs such as para-aminobenzoic acid (PABA) by influencing the balance between detoxification and metabolic activation of these drugs. Recently, four novel single nucleotide polymorphisms (SNPs) were identified within a South African mixed ancestry population. Modeling the effects of these SNPs within the structural protein was done to assess possible structure and function changes in the enzyme. The use of molecular dynamics simulations and stability predictions indicated less thermodynamically stable protein structures containing E264K and V231G, while the N245I change showed a stabilizing effect. Coincidently the N245I change displayed a similar free energy landscape profile to the known R64W amino acid substitution (slow acetylator), while the R242M displayed a similar profile to the published variant, I263V (proposed fast acetylator), and the wild type protein structure. Similarly, principal component analysis indicated that two amino acid substitutions (E264K and V231G) occupied less conformational clusters of folded states as compared to the WT and were found to be destabilizing (may affect protein function). However, two of the four novel SNPs that result in amino acid changes: (V231G and N245I) were predicted by both SIFT and POLYPHEN-2 algorithms to affect NAT1 protein function, while two other SNPs that result in R242M and E264K substitutions showed contradictory results based on SIFT and POLYPHEN-2 analysis. In conclusion, the structural methods were able to verify that two non-synonymous substitutions (E264K and V231G) can destabilize the protein structure, and are in agreement with mCSM predictions, and should therefore be experimentally tested for NAT1 activity. These findings could inform a strategy of incorporating genotypic data (i.e., functional SNP alleles) with phenotypic information (slow or fast acetylator) to better prescribe effective treatment using drugs metabolized by NAT1. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  2. A PYY Q62P variant linked to human obesity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ahituv, Nadav; Kavaslar, Nihan; Schackwitz, Wendy

    2005-06-27

    Members of the pancreatic polypeptide family and the irreceptors have been implicated in the control of food intake in rodents and humans. To investigate whether nucleotide changes in these candidate genes result in abnormal weight in humans, we sequenced the coding exons and splice sites of seven family members (NPY, PYY, PPY, NPY1R, NPY2R, NPY4R, and NPY5R) in a large cohort of extremely obese (n=379) and lean (n=378) individuals. In total we found eleven rare non-synonymous variants, four of which exhibited familial segregation, NPY1R L53P and PPY P63L with leanness and NPY2R D42G and PYY Q62P with obesity. Functional analysismore » of the obese variants revealed NPY2R D42G to have reduced cell surface expression, while previous cell culture based studies indicated variant PYY Q62P to have altered receptor binding selectivity and we show that it fails to reduce food intake through mouse peptide injection experiments. These results support that rare non-synonymous variants within these genes can alter susceptibility to human body mass index extremes.« less

  3. Enterobacter aerogenes Hormaeche and Edwards 1960 (Approved Lists 1980) and Klebsiella mobilis Bascomb et al. 1971 (Approved Lists 1980) share the same nomenclatural type (ATCC 13048) on the Approved Lists and are homotypic synonyms, with consequences for the name Klebsiella mobilis Bascomb et al. 1971 (Approved Lists 1980).

    PubMed

    Tindall, B J; Sutton, G; Garrity, G M

    2017-02-01

    Enterobacter aerogenes Hormaeche and Edwards 1960 (Approved Lists 1980) and Klebsiella mobilis Bascomb et al. 1971 (Approved Lists 1980) were placed on the Approved Lists of Bacterial Names and were based on the same nomenclatural type, ATCC 13048. Consequently they are to be treated as homotypic synonyms. However, the names of homotypic synonyms at the rank of species normally are based on the same epithet. Examination of the Rules of the International Code of Nomenclature of Bacteria in force at the time indicates that the epithet mobilis in Klebsiella mobilis Bascomb et al. 1971 (Approved Lists 1980) was illegitimate at the time the Approved Lists were published and according to the Rules of the current International Code of Nomenclature of Prokaryotes continues to be illegitimate.

  4. Genome-wide association analyses of child genotype effects and parent-of-origin effects in specific language impairment

    PubMed Central

    Nudel, R; Simpson, N H; Baird, G; O’Hare, A; Conti-Ramsden, G; Bolton, P F; Hennessy, E R; Ring, S M; Davey Smith, G; Francks, C; Paracchini, S; Monaco, A P; Fisher, S E; Newbury, D F

    2014-01-01

    Specific language impairment (SLI) is a neurodevelopmental disorder that affects linguistic abilities when development is otherwise normal. We report the results of a genome-wide association study of SLI which included parent-of-origin effects and child genotype effects and used 278 families of language-impaired children. The child genotype effects analysis did not identify significant associations. We found genome-wide significant paternal parent-of-origin effects on chromosome 14q12 (P = 3.74 × 10−8) and suggestive maternal parent-of-origin effects on chromosome 5p13 (P = 1.16 × 10−7). A subsequent targeted association of six single-nucleotide-polymorphisms (SNPs) on chromosome 5 in 313 language-impaired individuals and their mothers from the ALSPAC cohort replicated the maternal effects, albeit in the opposite direction (P = 0.001); as fathers’ genotypes were not available in the ALSPAC study, the replication analysis did not include paternal parent-of-origin effects. The paternally-associated SNP on chromosome 14 yields a non-synonymous coding change within the NOP9 gene. This gene encodes an RNA-binding protein that has been reported to be significantly dysregulated in individuals with schizophrenia. The region of maternal association on chromosome 5 falls between the PTGER4 and DAB2 genes, in a region previously implicated in autism and ADHD. The top SNP in this association locus is a potential expression QTL of ARHGEF19 (also called WGEF) on chromosome 1. Members of this protein family have been implicated in intellectual disability. In summary, this study implicates parent-of-origin effects in language impairment, and adds an interesting new dimension to the emerging picture of shared genetic etiology across various neurodevelopmental disorders. PMID:24571439

  5. Resequencing of the vesicular glutamate transporter 2 gene (VGLUT2) reveals some rare genetic variants that may increase the genetic burden in schizophrenia.

    PubMed

    Shen, Yu-Chih; Liao, Ding-Lieh; Lu, Chao-Lin; Chen, Jen-Yeu; Liou, Ying-Jay; Chen, Tzu-Ting; Chen, Chia-Hsiang

    2010-08-01

    Vesicular glutamate transporters (VGLUT1-3) package glutamate into vesicles in the presynaptic terminal and regulate the release of glutamate. In mesencephalic dopamine neuron culture, the majority of isolated dopamine neurons express VGLUT2, but not VGLUT1 or 3, have been demonstrated. As related to the dysregulated glutamatergic hypothesis of schizophrenia, the gene encoding VGLUT2 is the most plausible candidate involved in the pathogenesis of this illness. We searched for genetic variants in the promoter region and 12 exons (including UTR ends) of the VGLUT2 gene using direct sequencing in a sample of Han Chinese schizophrenic patients (n=375) and non-psychotic controls (n=366) from Taiwan, and conducted a case-control association study. We identified 8 common SNPs in the VGLUT2 gene. SNP and haplotype-based analyses showed no association with schizophrenia. Besides, we identified 9 rare variants in 13 out of 375 patients, including 3 variants located at the promoter region, 2 synonymous variants located at protein coding regions, and 4 variants located at UTR ends. No rare variants were found in the control subjects. Collectively, these rare variants were significantly overrepresented in the patient group (3.5% versus 0, p value of Fisher's exact test=2.3x10(-5)), suggesting they may contribute to the pathogenesis of schizophrenia. Although the functional significance of these rare variants remains to be characterized, our study may lend support to the multiple rare mutations hypothesis of schizophrenia, and may provide genetic clues to indicate the involvement of the glutamate transmission pathway in the pathogenesis of schizophrenia. Copyright 2010 Elsevier B.V. All rights reserved.

  6. Genome-Wide Association Study Identifies Common Genetic Variants Associated with Salivary Gland Carcinoma and its Subtypes

    PubMed Central

    Xu, Li; Tang, Hongwei; Chen, Diane W.; El-Naggar, Adel K.; Wei, Peng; Sturgis, Erich M.

    2015-01-01

    BACKGROUND Salivary gland carcinomas (SGCs) are a rare malignancy with unknown etiology. We aimed to identify genetic variants modifying risk of SGC and its major subtypes, adenoid cystic carcinoma (ACCA) and mucoepidermoid carcinoma (MECA). METHODS We conducted a genome-wide association study in 309 well-defined SGC cases and 535 cancer-free controls. We performed a SNP-level discovery study in non-Hispanic whites followed by a replication study in Hispanics. A logistic regression was applied to calculate odds ratios (ORs) and 95% confidence intervals (95%CIs). A meta-analysis was conducted of the results. RESULTS Genome-wide significant association with SGC in non-Hispanic whites was detected at coding SNPs in CHRNA2 (OR=8.55, 95%CI: 4.53–16.13, P = 3.6 × 10−11), OR4F15 (OR=5.26, 95%CI: 3.13–8.83, P = 3.5 × 10−10), ZNF343 (OR=3.28, 95%CI: 2.12–5.07, P = 9.1 × 10−8), and PARP4 (OR=2.00, 95%CI: 1.54–2.59, P = 1.7 × 10−7). Meta-analysis of the non-Hispanic white and Hispanic cohorts identified another genome-wide significant SNP in ELL2 (meta-OR=1.86, 95%CI: 1.48–2.34, P = 1.3 × 10−7). Risk alleles largely enriched in MECA, where the SNPs in CHRNA2, OR4F15, and ZNF343 had ORs of 15.71 (95%CI: 6.59–37.47, P = 5.2 × 10−10), 15.60 (95%CI: 6.50–37.41, P = 7.5 × 10−10), and 6.49 (95%CI: 3.36–12.52, P = 2.5 × 10−8), respectively. None of these SNPs retained significant association with ACCA. CONCLUSIONS These findings, for the first time, identify a panel of SNPs associated with SGC risk. Confirmation of these findings along with functional analysis of identified SNPs are needed. PMID:25823930

  7. Energy efficiency trade-offs drive nucleotide usage in transcribed regions

    PubMed Central

    Chen, Wei-Hua; Lu, Guanting; Bork, Peer; Hu, Songnian; Lercher, Martin J.

    2016-01-01

    Efficient nutrient usage is a trait under universal selection. A substantial part of cellular resources is spent on making nucleotides. We thus expect preferential use of cheaper nucleotides especially in transcribed sequences, which are often amplified thousand-fold compared with genomic sequences. To test this hypothesis, we derive a mutation-selection-drift equilibrium model for nucleotide skews (strand-specific usage of ‘A' versus ‘T' and ‘G' versus ‘C'), which explains nucleotide skews across 1,550 prokaryotic genomes as a consequence of selection on efficient resource usage. Transcription-related selection generally favours the cheaper nucleotides ‘U' and ‘C' at synonymous sites. However, the information encoded in mRNA is further amplified through translation. Due to unexpected trade-offs in the codon table, cheaper nucleotides encode on average energetically more expensive amino acids. These trade-offs apply to both strand-specific nucleotide usage and GC content, causing a universal bias towards the more expensive nucleotides ‘A' and ‘G' at non-synonymous coding sites. PMID:27098217

  8. Deciphering the recent phylogenetic expansion of the originally deeply rooted Mycobacterium tuberculosis lineage 7.

    PubMed

    Yimer, Solomon A; Namouchi, Amine; Zegeye, Ephrem Debebe; Holm-Hansen, Carol; Norheim, Gunnstein; Abebe, Markos; Aseffa, Abraham; Tønjum, Tone

    2016-06-30

    A deeply rooted phylogenetic lineage of Mycobacterium tuberculosis (M. tuberculosis) termed lineage 7 was discovered in Ethiopia. Whole genome sequencing of 30 lineage 7 strains from patients in Ethiopia was performed. Intra-lineage genome variation was defined and unique characteristics identified with a focus on genes involved in DNA repair, recombination and replication (3R genes). More than 800 mutations specific to M. tuberculosis lineage 7 strains were identified. The proportion of non-synonymous single nucleotide polymorphisms (nsSNPs) in 3R genes was higher after the recent expansion of M. tuberculosis lineage 7 strain started. The proportion of nsSNPs in genes involved in inorganic ion transport and metabolism was significantly higher before the expansion began. A total of 22346 bp deletions were observed. Lineage 7 strains also exhibited a high number of mutations in genes involved in carbohydrate transport and metabolism, transcription, energy production and conversion. We have identified unique genomic signatures of the lineage 7 strains. The high frequency of nsSNP in 3R genes after the phylogenetic expansion may have contributed to recent variability and adaptation. The abundance of mutations in genes involved in inorganic ion transport and metabolism before the expansion period may indicate an adaptive response of lineage 7 strains to enable survival, potentially under environmental stress exposure. As lineage 7 strains originally were phylogenetically deeply rooted, this may indicate fundamental adaptive genomic pathways affecting the fitness of M. tuberculosis as a species.

  9. South-East Asian strains of Plasmodium falciparum display higher ratio of non-synonymous to synonymous polymorphisms compared to African strains

    PubMed Central

    Singh, Gajinder Pal; Sharma, Amit

    2016-01-01

    Resistance to frontline anti-malarial drugs, including artemisinin, has repeatedly arisen in South-East Asia, but the reasons for this are not understood. Here we test whether evolutionary constraints on Plasmodium falciparum strains from South-East Asia differ from African strains. We find a significantly higher ratio of non-synonymous to synonymous polymorphisms in P. falciparum from South-East Asia compared to Africa, suggesting differences in the selective constraints on P. falciparum genome in these geographical regions. Furthermore, South-East Asian strains showed a higher proportion of non-synonymous polymorphism at conserved positions, suggesting reduced negative selection. There was a lower rate of mixed infection by multiple genotypes in samples from South-East Asia compared to Africa. We propose that a lower mixed infection rate in South-East Asia reduces intra-host competition between the parasite clones, reducing the efficiency of natural selection. This might increase the probability of fixation of fitness-reducing mutations including drug resistant ones. PMID:27853513

  10. Association analysis identifies 65 new breast cancer risk loci

    PubMed Central

    Lemaçon, Audrey; Soucy, Penny; Glubb, Dylan; Rostamianfar, Asha; Bolla, Manjeet K.; Wang, Qin; Tyrer, Jonathan; Dicks, Ed; Lee, Andrew; Wang, Zhaoming; Allen, Jamie; Keeman, Renske; Eilber, Ursula; French, Juliet D.; Chen, Xiao Qing; Fachal, Laura; McCue, Karen; McCart Reed, Amy E.; Ghoussaini, Maya; Carroll, Jason; Jiang, Xia; Finucane, Hilary; Adams, Marcia; Adank, Muriel A.; Ahsan, Habibul; Aittomäki, Kristiina; Anton-Culver, Hoda; Antonenkova, Natalia N.; Arndt, Volker; Aronson, Kristan J.; Arun, Banu; Auer, Paul L.; Bacot, François; Barrdahl, Myrto; Baynes, Caroline; Beckmann, Matthias W.; Behrens, Sabine; Benitez, Javier; Bermisheva, Marina; Bernstein, Leslie; Blomqvist, Carl; Bogdanova, Natalia V.; Bojesen, Stig E.; Bonanni, Bernardo; Børresen-Dale, Anne-Lise; Brand, Judith S.; Brauch, Hiltrud; Brennan, Paul; Brenner, Hermann; Brinton, Louise; Broberg, Per; Brock, Ian W.; Broeks, Annegien; Brooks-Wilson, Angela; Brucker, Sara Y.; Brüning, Thomas; Burwinkel, Barbara; Butterbach, Katja; Cai, Qiuyin; Cai, Hui; Caldés, Trinidad; Canzian, Federico; Carracedo, Angel; Carter, Brian D.; Castelao, Jose E.; Chan, Tsun L.; Cheng, Ting-Yuan David; Chia, Kee Seng; Choi, Ji-Yeob; Christiansen, Hans; Clarke, Christine L.; Collée, Margriet; Conroy, Don M.; Cordina-Duverger, Emilie; Cornelissen, Sten; Cox, David G; Cox, Angela; Cross, Simon S.; Cunningham, Julie M.; Czene, Kamila; Daly, Mary B.; Devilee, Peter; Doheny, Kimberly F.; Dörk, Thilo; dos-Santos-Silva, Isabel; Dumont, Martine; Durcan, Lorraine; Dwek, Miriam; Eccles, Diana M.; Ekici, Arif B.; Eliassen, A. Heather; Ellberg, Carolina; Elvira, Mingajeva; Engel, Christoph; Eriksson, Mikael; Fasching, Peter A.; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Flyger, Henrik; Fritschi, Lin; Gaborieau, Valerie; Gabrielson, Marike; Gago-Dominguez, Manuela; Gao, Yu-Tang; Gapstur, Susan M.; García-Sáenz, José A.; Gaudet, Mia M.; Georgoulias, Vassilios; Giles, Graham G.; Glendon, Gord; Goldberg, Mark S.; Goldgar, David E.; González-Neira, Anna; Grenaker Alnæs, Grethe I.; Grip, Mervi; Gronwald, Jacek; Grundy, Anne; Guénel, Pascal; Haeberle, Lothar; Hahnen, Eric; Haiman, Christopher A.; Håkansson, Niclas; Hamann, Ute; Hamel, Nathalie; Hankinson, Susan; Harrington, Patricia; Hart, Steven N.; Hartikainen, Jaana M.; Hartman, Mikael; Hein, Alexander; Heyworth, Jane; Hicks, Belynda; Hillemanns, Peter; Ho, Dona N.; Hollestelle, Antoinette; Hooning, Maartje J.; Hoover, Robert N.; Hopper, John L.; Hou, Ming-Feng; Hsiung, Chia-Ni; Huang, Guanmengqian; Humphreys, Keith; Ishiguro, Junko; Ito, Hidemi; Iwasaki, Motoki; Iwata, Hiroji; Jakubowska, Anna; Janni, Wolfgang; John, Esther M.; Johnson, Nichola; Jones, Kristine; Jones, Michael; Jukkola-Vuorinen, Arja; Kaaks, Rudolf; Kabisch, Maria; Kaczmarek, Katarzyna; Kang, Daehee; Kasuga, Yoshio; Kerin, Michael J.; Khan, Sofia; Khusnutdinova, Elza; Kiiski, Johanna I.; Kim, Sung-Won; Knight, Julia A.; Kosma, Veli-Matti; Kristensen, Vessela N.; Krüger, Ute; Kwong, Ava; Lambrechts, Diether; Marchand, Loic Le; Lee, Eunjung; Lee, Min Hyuk; Lee, Jong Won; Lee, Chuen Neng; Lejbkowicz, Flavio; Li, Jingmei; Lilyquist, Jenna; Lindblom, Annika; Lissowska, Jolanta; Lo, Wing-Yee; Loibl, Sibylle; Long, Jirong; Lophatananon, Artitaya; Lubinski, Jan; Luccarini, Craig; Lux, Michael P.; Ma, Edmond S.K.; MacInnis, Robert J.; Maishman, Tom; Makalic, Enes; Malone, Kathleen E; Kostovska, Ivana Maleva; Mannermaa, Arto; Manoukian, Siranoush; Manson, JoAnn E.; Margolin, Sara; Mariapun, Shivaani; Martinez, Maria Elena; Matsuo, Keitaro; Mavroudis, Dimitrios; McKay, James; McLean, Catriona; Meijers-Heijboer, Hanne; Meindl, Alfons; Menéndez, Primitiva; Menon, Usha; Meyer, Jeffery; Miao, Hui; Miller, Nicola; Mohd Taib, Nur Aishah; Muir, Kenneth; Mulligan, Anna Marie; Mulot, Claire; Neuhausen, Susan L.; Nevanlinna, Heli; Neven, Patrick; Nielsen, Sune F.; Noh, Dong-Young; Nordestgaard, Børge G.; Norman, Aaron; Olopade, Olufunmilayo I.; Olson, Janet E.; Olsson, Håkan; Olswold, Curtis; Orr, Nick; Pankratz, V. Shane; Park, Sue K.; Park-Simon, Tjoung-Won; Lloyd, Rachel; Perez, Jose I.A.; Peterlongo, Paolo; Peto, Julian; Phillips, Kelly-Anne; Pinchev, Mila; Plaseska-Karanfilska, Dijana; Prentice, Ross; Presneau, Nadege; Prokofieva, Darya; Pugh, Elizabeth; Pylkäs, Katri; Rack, Brigitte; Radice, Paolo; Rahman, Nazneen; Rennert, Gadi; Rennert, Hedy S.; Rhenius, Valerie; Romero, Atocha; Romm, Jane; Ruddy, Kathryn J; Rüdiger, Thomas; Rudolph, Anja; Ruebner, Matthias; Rutgers, Emiel J. Th.; Saloustros, Emmanouil; Sandler, Dale P.; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Daniel F.; Schmutzler, Rita K.; Schneeweiss, Andreas; Schoemaker, Minouk J.; Schumacher, Fredrick; Schürmann, Peter; Scott, Rodney J.; Scott, Christopher; Seal, Sheila; Seynaeve, Caroline; Shah, Mitul; Sharma, Priyanka; Shen, Chen-Yang; Sheng, Grace; Sherman, Mark E.; Shrubsole, Martha J.; Shu, Xiao-Ou; Smeets, Ann; Sohn, Christof; Southey, Melissa C.; Spinelli, John J.; Stegmaier, Christa; Stewart-Brown, Sarah; Stone, Jennifer; Stram, Daniel O.; Surowy, Harald; Swerdlow, Anthony; Tamimi, Rulla; Taylor, Jack A.; Tengström, Maria; Teo, Soo H.; Terry, Mary Beth; Tessier, Daniel C.; Thanasitthichai, Somchai; Thöne, Kathrin; Tollenaar, Rob A.E.M.; Tomlinson, Ian; Tong, Ling; Torres, Diana; Truong, Thérèse; Tseng, Chiu-chen; Tsugane, Shoichiro; Ulmer, Hans-Ulrich; Ursin, Giske; Untch, Michael; Vachon, Celine; van Asperen, Christi J.; Van Den Berg, David; van den Ouweland, Ans M.W.; van der Kolk, Lizet; van der Luijt, Rob B.; Vincent, Daniel; Vollenweider, Jason; Waisfisz, Quinten; Wang-Gohrke, Shan; Weinberg, Clarice R.; Wendt, Camilla; Whittemore, Alice S.; Wildiers, Hans; Willett, Walter; Winqvist, Robert; Wolk, Alicja; Wu, Anna H.; Xia, Lucy; Yamaji, Taiki; Yang, Xiaohong R.; Yip, Cheng Har; Yoo, Keun-Young; Yu, Jyh-Cherng; Zheng, Wei; Zheng, Ying; Zhu, Bin; Ziogas, Argyrios; Ziv, Elad; Lakhani, Sunil R.; Antoniou, Antonis C.; Droit, Arnaud; Andrulis, Irene L.; Amos, Christopher I.; Couch, Fergus J.; Pharoah, Paul D.P.; Chang-Claude, Jenny; Hall, Per; Hunter, David J.; Milne, Roger L.; García-Closas, Montserrat; Schmidt, Marjanka K.; Chanock, Stephen J.; Dunning, Alison M.; Edwards, Stacey L.; Bader, Gary D.; Chenevix-Trench, Georgia; Simard, Jacques; Kraft, Peter; Easton, Douglas F.

    2017-01-01

    Breast cancer risk is influenced by rare coding variants in susceptibility genes such as BRCA1 and many common, mainly non-coding variants. However, much of the genetic contribution to breast cancer risk remains unknown. We report results from a genome-wide association study (GWAS) of breast cancer in 122,977 cases and 105,974 controls of European ancestry and 14,068 cases and 13,104 controls of East Asian ancestry1. We identified 65 new loci associated with overall breast cancer at p<5x10-8. The majority of credible risk SNPs in the new loci fall in distal regulatory elements, and by integrating in-silico data to predict target genes in breast cells at each locus, we demonstrate a strong overlap between candidate target genes and somatic driver genes in breast tumours. We also find that heritability of breast cancer due to all SNPs in regulatory features was 2-5-fold enriched relative to the genome-wide average, with strong enrichment for particular transcription factor binding sites. These results provide further insight into genetic susceptibility to breast cancer and will improve the utility of genetic risk scores for individualized screening and prevention. PMID:29059683

  11. An integrated map of genetic variation from 1,092 human genomes

    PubMed Central

    2012-01-01

    Summary Through characterising the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help understand the genetic contribution to disease. We describe the genomes of 1,092 individuals from 14 populations, constructed using a combination of low-coverage whole-genome and exome sequencing. By developing methodologies to integrate information across multiple algorithms and diverse data sources we provide a validated haplotype map of 38 million SNPs, 1.4 million indels and over 14 thousand larger deletions. We show that individuals from different populations carry different profiles of rare and common variants and that low-frequency variants show substantial geographic differentiation, which is further increased by the action of purifying selection. We show that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways and that each individual harbours hundreds of rare non-coding variants at conserved sites, such as transcription-factor-motif disrupting changes. This resource, which captures up to 98% of accessible SNPs at a frequency of 1% in populations of medical genetics focus, enables analysis of common and low-frequency variants in individuals from diverse, including admixed, populations. PMID:23128226

  12. Complete genome analysis of dengue virus type 3 isolated from the 2013 dengue outbreak in Yunnan, China.

    PubMed

    Wang, Xiaodan; Ma, Dehong; Huang, Xinwei; Li, Lihua; Li, Duo; Zhao, Yujiao; Qiu, Lijuan; Pan, Yue; Chen, Junying; Xi, Juemin; Shan, Xiyun; Sun, Qiangming

    2017-06-15

    In the past few decades, dengue has spread rapidly and is an emerging disease in China. An unexpected dengue outbreak occurred in Xishuangbanna, Yunnan, China, resulting in 1331 patients in 2013. In order to obtain the complete genome information and perform mutation and evolutionary analysis of causative agent related to this largest outbreak of dengue fever. The viruses were isolated by cell culture and evaluated by genome sequence analysis. Phylogenetic trees were then constructed by Neighbor-Joining methods (MEGA6.0), followed by analysis of nucleotide mutation and amino acid substitution. The analysis of the diversity of secondary structure for E and NS1 protein were also performed. Then selection pressures acting on the coding sequences were estimated by PAML software. The complete genome sequences of two isolated strains (YNSW1, YNSW2) were 10,710 and 10,702 nucleotides in length, respectively. Phylogenetic analysis revealed both strain were classified as genotype II of DENV-3. The results indicated that both isolated strains of Xishuangbanna in 2013 and Laos 2013 stains (KF816161.1, KF816158.1, LC147061.1, LC147059.1, KF816162.1) were most similar to Bangladesh (AY496873.2) in 2002. After comparing with the DENV-3SS (H87) 62 amino acid substitutions were identified in translated regions, and 38 amino acid substitutions were identified in translated regions compared with DENV-3 genotype II stains Bangladesh (AY496873.2). 27(YNSW1) or 28(YNSW2) single nucleotide changes were observed in structural protein sequences with 7(YNSW1) or 8(YNSW2) non-synonymous mutations compared with AY496873.2. Of them, 4 non-synonymous mutations were identified in E protein sequences with (2 in the β-sheet, 2 in the coil). Meanwhile, 117(YNSW1) or 115 (YNSW2) single nucleotide changes were observed in non-structural protein sequences with 31(YNSW1) or 30 (YNSW2) non-synonymous mutations. Particularly, 14 single nucleotide changes were observed in NS1 sequences with 4/14 non-synonymous substitutions (4 in the coil). Selection pressure analysis revealed no positive selection in the amino acid sites of the genes encoding for structural and non-structural proteins. This study may help understand the intrinsic geographical relatedness of dengue virus 3 and contributes further to research on their infectivity, pathogenicity and vaccine development. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Engineered single nucleotide polymorphisms in the mosquito MEK docking site alter Plasmodium berghei development in Anopheles gambiae.

    PubMed

    Brenton, Ashley A; Souvannaseng, Lattha; Cheung, Kong; Anishchenko, Michael; Brault, Aaron C; Luckhart, Shirley

    2014-06-23

    Susceptibility to Plasmodium infection in Anopheles gambiae has been proposed to result from naturally occurring polymorphisms that alter the strength of endogenous innate defenses. Despite the fact that some of these mutations are known to introduce non-synonymous substitutions in coding sequences, these mutations have largely been used to rationalize knockdown of associated target proteins to query the effects on parasite development in the mosquito host. Here, we assay the effects of engineered mutations on an immune signaling protein target that is known to control parasite sporogonic development. By this proof-of-principle work, we have established that naturally occurring mutations can be queried for their effects on mosquito protein function and on parasite development and that this important signaling pathway can be genetically manipulated to enhance mosquito resistance. We introduced SNPs into the A. gambiae MAPK kinase MEK to alter key residues in the N-terminal docking site (D-site), thus interfering with its ability to interact with the downstream kinase target ERK. ERK phosphorylation levels in vitro and in vivo were evaluated to confirm the effects of MEK D-site mutations. In addition, overexpression of various MEK D-site alleles was used to assess P. berghei infection in A. gambiae. The MEK D-site contains conserved lysine residues predicted to mediate protein-protein interaction with ERK. As anticipated, each of the D-site mutations (K3M, K6M) suppressed ERK phosphorylation and this inhibition was significant when both mutations were present. Tissue-targeted overexpression of alleles encoding MEK D-site polymorphisms resulted in reduced ERK phosphorylation in the midgut of A. gambiae. Furthermore, as expected, inhibition of MEK-ERK signaling due to D-site mutations resulted in reduction in P. berghei development relative to infection in the presence of overexpressed catalytically active MEK. MEK-ERK signaling in A. gambiae, as in model organisms and humans, depends on the integrity of conserved key residues within the MEK D-site. Disruption of signal transmission via engineered SNPs provides a purposeful proof-of-principle model for the study of naturally occurring mutations that may be associated with mosquito resistance to parasite infection as well as an alternative genetic basis for manipulation of this important immune signaling pathway.

  14. Molecular characterization of GPR50 gene and study of its comparative genetic variability in sheep breeds adapted to different thermo-contrasting climatic regimens

    NASA Astrophysics Data System (ADS)

    Saxena, Vijay Kumar; Kumar, Davendra; Naqvi, S. M. K.

    2017-04-01

    GPR50, formerly known as a melatonin-related receptor, is one of the three subtypes of melatonin receptor subfamily, together with MTNR1A and MTNR1B. GPR50, despite its high identity with the melatonin receptor family, does not bind melatonin and is considered to be an ortholog of MTNR1C in mammals. GPR50-expressing cells have been found in the dorsomedial nucleus of the hypothalamus, the periventricular nucleus, and the median eminence. Genetic and functional evidence have been recently investigated linking GPR50 to adaptive thermogenesis and torpor, but still, it is an orphan receptor and is yet to be studied conclusively. The aims of the study were to characterize the GPR50 gene of sheep and to study the sequence variability of the gene in Indian sheep breeds of two different thermo-varied agroclimatic conditions. Genomic DNA isolation was done and a 791-bp sequence was amplified using self-designed primers and SNP profiling done out of samples of all the breeds to study the relative frequency of SNPs in each of the breed. Five important non-synonymous mutations were observed in the various breeds studied. T698G, G1097A, G1270A, G1318A, and C1334G lead to the following substitution: valine by glycine, arginine by glutamine, threonine by alanine, isoleucine by valine, and serine by cytosine, respectively. Two synonymous mutations (T663G and C888T) were also observed in some of the studied breeds. G1270A and C888T were the most prevalent SNPs observed in nearly all of the breeds. C888T SNPs were observed in higher prevalence in Chokla, Marwari, and Magra in comparison to Gaddi and Bharat Merino. A PolyPhen-2 analysis, which is used to assess the potential damaging nature of an SNP, revealed that mutation T698G and G1270A were benign while G1097A, G1318A, and C1334G were damaging with a score of 0.987, 0.993, and 0.739, respectively. A 3-D homology model of the protein was prepared using c4zwjA (UniProt sequence ID) as a template using the online version of Phyre2 protein modeling software. The structure demonstrated closed similarity with other G-coupled receptor and it had a 45 % α-helical content. G1270A and C888T may be taken up for SNP correlation in a larger population study for their association with heat stress protection.

  15. SNP discovery in the bovine milk transcriptome using RNA-Seq technology.

    PubMed

    Cánovas, Angela; Rincon, Gonzalo; Islas-Trejo, Alma; Wickramasinghe, Saumya; Medrano, Juan F

    2010-12-01

    High-throughput sequencing of RNA (RNA-Seq) was developed primarily to analyze global gene expression in different tissues. However, it also is an efficient way to discover coding SNPs. The objective of this study was to perform a SNP discovery analysis in the milk transcriptome using RNA-Seq. Seven milk samples from Holstein cows were analyzed by sequencing cDNAs using the Illumina Genome Analyzer system. We detected 19,175 genes expressed in milk samples corresponding to approximately 70% of the total number of genes analyzed. The SNP detection analysis revealed 100,734 SNPs in Holstein samples, and a large number of those corresponded to differences between the Holstein breed and the Hereford bovine genome assembly Btau4.0. The number of polymorphic SNPs within Holstein cows was 33,045. The accuracy of RNA-Seq SNP discovery was tested by comparing SNPs detected in a set of 42 candidate genes expressed in milk that had been resequenced earlier using Sanger sequencing technology. Seventy of 86 SNPs were detected using both RNA-Seq and Sanger sequencing technologies. The KASPar Genotyping System was used to validate unique SNPs found by RNA-Seq but not observed by Sanger technology. Our results confirm that analyzing the transcriptome using RNA-Seq technology is an efficient and cost-effective method to identify SNPs in transcribed regions. This study creates guidelines to maximize the accuracy of SNP discovery and prevention of false-positive SNP detection, and provides more than 33,000 SNPs located in coding regions of genes expressed during lactation that can be used to develop genotyping platforms to perform marker-trait association studies in Holstein cattle.

  16. New insights on the evolution of Leafy cotyledon1 (LEC1) type genes in vascular plants.

    PubMed

    Cagliari, Alexandro; Turchetto-Zolet, Andreia Carina; Korbes, Ana Paula; Maraschin, Felipe Dos Santos; Margis, Rogerio; Margis-Pinheiro, Marcia

    2014-01-01

    NF-Y is a conserved oligomeric transcription factor found in all eukaryotes. In plants, this regulator evolved with a broad diversification of the genes coding for its three subunits (NF-YA, NF-YB and NF-YC). The NF-YB members can be divided into Leafy Cotyledon1 (LEC1) and non-LEC1 types. Here we presented a comparative genomic study using phylogenetic analyses to validate an evolutionary model for the origin of LEC-type genes in plants and their emergence from non-LEC1-type genes. We identified LEC1-type members in all vascular plant genomes, but not in amoebozoa, algae, fungi, metazoa and non-vascular plant representatives, which present exclusively non-LEC1-type genes as constituents of their NF-YB subunits. The non-synonymous to synonymous nucleotide substitution rates (Ka/Ks) between LEC1 and non-LEC1-type genes indicate the presence of positive selection acting on LEC1-type members to the fixation of LEC1-specific amino acid residues. The phylogenetic analyses demonstrated that plant LEC1-type genes are evolutionary divergent from the non-LEC1-type genes of plants, fungi, amoebozoa, algae and animals. Our results point to a scenario in which LEC1-type genes have originated in vascular plants after gene expansion in plants. We suggest that processes of neofunctionalization and/or subfunctionalization were responsible for the emergence of a versatile role for LEC1-type genes in vascular plants, especially in seed plants. LEC1-type genes besides being phylogenetic divergent also present different expression profile when compared with non-LEC1-type genes. Altogether, our data provide new insights about the LEC1 and non-LEC1 evolutionary relationship during the vascular plant evolution. Copyright © 2014 Elsevier Inc. All rights reserved.

  17. Genetic polymorphisms related to efficacy and overuse of triptans in chronic migraine.

    PubMed

    Gentile, Giovanna; Borro, Marina; Lala, Noemi; Missori, Serena; Simmaco, Maurizio; Martelletti, Paolo

    2010-10-01

    Migraine is a common type of headache and its most severe attacks are usually treated with triptans, the efficacy of which is extremely variable. Several SNPs in genes involved in metabolism and target mechanisms of triptans have been described. To define an association between genetic profile and triptan response, we classified a migrainous population on the basis of triptan response and characterized it for polymorphisms in the genes coding for monoamine oxidase A, G protein β3 and the cytochrome CYP1A2. Analysis of the association between genotypic and allelic frequencies of the analyzed SNPs and the grade of response to triptan administration showed a significant correlation for MAOA uVNTR polymorphism. Further stratification of patients in abuser and non-abuser groups revealed a significant association with triptan overuse and, within the abusers, with drug response to the CYP1A2*1F variant.

  18. Genome sequence, comparative analysis and haplotype structure of the domestic dog.

    PubMed

    Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S

    2005-12-08

    Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.

  19. Genomic Characterization of Phenylalanine Ammonia Lyase Gene in Buckwheat

    PubMed Central

    Thiyagarajan, Karthikeyan; Vitali, Fabio; Tolaini, Valentina; Galeffi, Patrizia; Cantale, Cristina; Vikram, Prashant; Singh, Sukhwinder; De Rossi, Patrizia; Nobili, Chiara; Procacci, Silvia; Del Fiore, Antonella; Antonini, Alessandro; Presenti, Ombretta; Brunori, Andrea

    2016-01-01

    Phenylalanine Ammonia Lyase (PAL) gene which plays a key role in bio-synthesis of medicinally important compounds, Rutin/quercetin was sequence characterized for its efficient genomics application. These compounds possessing anti-diabetic and anti-cancer properties and are predominantly produced by Fagopyrum spp. In the present study, PAL gene was sequenced from three Fagopyrum spp. (F. tataricum, F. esculentum and F. dibotrys) and showed the presence of three SNPs and four insertion/deletions at intra and inter specific level. Among them, the potential SNP (position 949th bp G>C) with Parsimony Informative Site was selected and successfully utilised to individuate the zygosity/allelic variation of 16 F. tataricum varieties. Insertion mutations were identified in coding region, which resulted the change of a stretch of 39 amino acids on the putative protein. Our Study revealed that autogamous species (F. tataricum) has lower frequency of observed SNPs as compared to allogamous species (F. dibotrys and F. esculentum). The identified SNPs in F. tataricum didn’t result to amino acid change, while in other two species it caused both conservative and non-conservative variations. Consistent pattern of SNPs across the species revealed their phylogenetic importance. We found two groups of F. tataricum and one of them was closely related with F. dibotrys. Sequence characterization information of PAL gene reported in present investigation can be utilized in genetic improvement of buckwheat in reference to its medicinal value. PMID:26990297

  20. Adaptive evolution of the matrix extracellular phosphoglycoprotein in mammals

    PubMed Central

    2011-01-01

    Background Matrix extracellular phosphoglycoprotein (MEPE) belongs to a family of small integrin-binding ligand N-linked glycoproteins (SIBLINGs) that play a key role in skeleton development, particularly in mineralization, phosphate regulation and osteogenesis. MEPE associated disorders cause various physiological effects, such as loss of bone mass, tumors and disruption of renal function (hypophosphatemia). The study of this developmental gene from an evolutionary perspective could provide valuable insights on the adaptive diversification of morphological phenotypes in vertebrates. Results Here we studied the adaptive evolution of the MEPE gene in 26 Eutherian mammals and three birds. The comparative genomic analyses revealed a high degree of evolutionary conservation of some coding and non-coding regions of the MEPE gene across mammals indicating a possible regulatory or functional role likely related with mineralization and/or phosphate regulation. However, the majority of the coding region had a fast evolutionary rate, particularly within the largest exon (1467 bp). Rodentia and Scandentia had distinct substitution rates with an increased accumulation of both synonymous and non-synonymous mutations compared with other mammalian lineages. Characteristics of the gene (e.g. biochemical, evolutionary rate, and intronic conservation) differed greatly among lineages of the eight mammalian orders. We identified 20 sites with significant positive selection signatures (codon and protein level) outside the main regulatory motifs (dentonin and ASARM) suggestive of an adaptive role. Conversely, we find three sites under selection in the signal peptide and one in the ASARM motif that were supported by at least one selection model. The MEPE protein tends to accumulate amino acids promoting disorder and potential phosphorylation targets. Conclusion MEPE shows a high number of selection signatures, revealing the crucial role of positive selection in the evolution of this SIBLING member. The selection signatures were found mainly outside the functional motifs, reinforcing the idea that other regions outside the dentonin and the ASARM might be crucial for the function of the protein and future studies should be undertaken to understand its importance. PMID:22103247

  1. High throughput SNP discovery and genotyping in hexaploid wheat

    PubMed Central

    Navarro, Julien; Kitt, Jonathan; Choulet, Frédéric; Leveugle, Magalie; Duarte, Jorge; Rivière, Nathalie; Eversole, Kellye; Le Gouis, Jacques; Davassi, Alessandro; Balfourier, François; Le Paslier, Marie-Christine; Berard, Aurélie; Brunel, Dominique; Feuillet, Catherine; Poncet, Charles; Sourdille, Pierre

    2018-01-01

    Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research. PMID:29293495

  2. Genome-Wide Association Study of Serum 25-Hydroxyvitamin D in US Women.

    PubMed

    O'Brien, Katie M; Sandler, Dale P; Shi, Min; Harmon, Quaker E; Taylor, Jack A; Weinberg, Clarice R

    2018-01-01

    Genetic factors likely influence individuals' concentrations of 25-hydroxyvitamin D [25(OH)D], a biomarker of vitamin D exposure previously linked to reduced risk of several chronic diseases. We conducted a genome-wide association study of serum 25(OH)D (assessed using liquid chromatography-tandem mass spectrometry) and 386,449 single nucleotide polymorphisms (SNPs). Our sample consisted of 1,829 participants randomly selected from the Sister Study, a cohort of women who had a sister with breast cancer but had never had breast cancer themselves. 19,741 SNPs were associated with 25(OH)D ( p < 0.05). We re-assessed these hits in an independent sample of 1,534 participants who later developed breast cancer. After pooling, 32 SNPs had genome-wide significant associations ( p < 5 × 10 -8 ). These were located in or near GC , the vitamin D binding protein, or CYP2R1 , a cytochrome P450 enzyme that hydroxylates vitamin D to form 25(OH)D. The top hit was rs4588, a missense GC polymorphism associated with a 3.5 ng/mL decrease in 25(OH)D per copy of the minor allele (95% confidence interval [CI]: -4.1, -3.0; p = 4.5 × 10 -38 ). The strongest SNP near CYP2R1 was rs12794714, a synonymous variant ( p = 3.8 × 10 -12 ; β = 1.8 ng/mL decrease in 25(OH)D per minor allele [CI: -2.2, -1.3]). Serum 25(OH)D concentrations from samples collected from some participants 3-10 years after baseline (811 cases, 780 non-cases) were also strongly associated with both loci. These findings augment our understanding of genetic influences on 25(OH)D and the possible role of vitamin D binding proteins and cytochrome P450 enzymes in determining measured levels. These results may help to identify individuals genetically predisposed to vitamin D insufficiency.

  3. Nucleotide Excision Repair Gene Polymorphisms, Meat Intake and Colon Cancer Risk

    PubMed Central

    Steck, Susan E.; Butler, Lesley M.; Keku, Temitope; Antwi, Samuel; Galanko, Joseph; Sandler, Robert S.; Hu, Jennifer J.

    2014-01-01

    Purpose Much of the DNA damage from colon cancer-related carcinogens, including heterocyclic amines (HCA) and polycyclic aromatic hydrocarbons (PAH) from red meat cooked at high temperature, are repaired by the nucleotide excision repair (NER) pathway. Thus, we examined whether NER non-synonymous single nucleotide polymorphisms (nsSNPs) modified the association between red meat intake and colon cancer risk. Methods The study consists of 244 African-American and 311 white colon cancer cases and population-based controls (331 African Americans and 544 whites) recruited from 33 counties in North Carolina from 1996 to 2000. Information collected by food frequency questionnaire on meat intake and preparation methods were used to estimate HCA and benzo(a)pyrene (BaP, a PAH) intake. We tested 7 nsSNPs in 5 NER genes: XPC A499V and K939Q, XPD D312N and K751Q, XPF R415Q, XPG D1104H, and RAD23B A249V. Adjusted odds ratios (OR) and 95% confidence intervals (CI) were calculated using unconditional logistic regression. Results Among African Americans, we observed a statistically significant positive association between colon cancer risk and XPC 499 AV+VV genotype (OR=1.7, 95% CI: 1.1, 2.7, AA as referent), and an inverse association with XPC 939 QQ (OR=0.3, 95%CI: 0.2, 0.8, KK as referent). These associations were not observed among whites. For both races combined, there was interaction between the XPC 939 genotype, well-done red meat intake and colon cancer risk (OR=1.5, 95% CI=1.0, 2.2 for high well-done red meat and KK genotype as compared to low well-done red meat and KK genotype, pinteraction =0.05). Conclusions Our data suggest that NER nsSNPs are associated with colon cancer risk and may modify the association between well-done red meat intake and colon cancer risk. PMID:24607854

  4. SNPs Altering Ammonium Transport Activity of Human Rhesus Factors Characterized by a Yeast-Based Functional Assay

    PubMed Central

    Deschuyteneer, Aude; Boeckstaens, Mélanie; De Mees, Christelle; Van Vooren, Pascale; Wintjens, René; Marini, Anna Maria

    2013-01-01

    Proteins of the conserved Mep-Amt-Rh family, including mammalian Rhesus factors, mediate transmembrane ammonium transport. Ammonium is an important nitrogen source for the biosynthesis of amino acids but is also a metabolic waste product. Its disposal in urine plays a critical role in the regulation of the acid/base homeostasis, especially with an acid diet, a trait of Western countries. Ammonium accumulation above a certain concentration is however pathologic, the cytotoxicity causing fatal cerebral paralysis in acute cases. Alteration in ammonium transport via human Rh proteins could have clinical outcomes. We used a yeast-based expression assay to characterize human Rh variants resulting from non synonymous single nucleotide polymorphisms (nsSNPs) with known or unknown clinical phenotypes and assessed their ammonium transport efficiency, protein level, localization and potential trans-dominant impact. The HsRhAG variants (I61R, F65S) associated to overhydrated hereditary stomatocytosis (OHSt), a disease affecting erythrocytes, proved affected in intrinsic bidirectional ammonium transport. Moreover, this study reveals that the R202C variant of HsRhCG, the orthologue of mouse MmRhcg required for optimal urinary ammonium excretion and blood pH control, shows an impaired inherent ammonium transport activity. Urinary ammonium excretion was RHcg gene-dose dependent in mouse, highlighting MmRhcg as a limiting factor. HsRhCGR202C may confer susceptibility to disorders leading to metabolic acidosis for instance. Finally, the analogous R211C mutation in the yeast ScMep2 homologue also impaired intrinsic activity consistent with a conserved functional role of the preserved arginine residue. The yeast expression assay used here constitutes an inexpensive, fast and easy tool to screen nsSNPs reported by high throughput sequencing or individual cases for functional alterations in Rh factors revealing potential causal variants. PMID:23967154

  5. ICAM-1-related long non-coding RNA: promoter analysis and expression in human retinal endothelial cells.

    PubMed

    Lumsden, Amanda L; Ma, Yuefang; Ashander, Liam M; Stempel, Andrew J; Keating, Damien J; Smith, Justine R; Appukuttan, Binoy

    2018-05-09

    Regulation of intercellular adhesion molecule (ICAM)-1 in retinal endothelial cells is a promising druggable target for retinal vascular diseases. The ICAM-1-related (ICR) long non-coding RNA stabilizes ICAM-1 transcript, increasing protein expression. However, studies of ICR involvement in disease have been limited as the promoter is uncharacterized. To address this issue, we undertook a comprehensive in silico analysis of the human ICR gene promoter region. We used genomic evolutionary rate profiling to identify a 115 base pair (bp) sequence within 500 bp upstream of the transcription start site of the annotated human ICR gene that was conserved across 25 eutherian genomes. A second constrained sequence upstream of the orthologous mouse gene (68 bp; conserved across 27 Eutherian genomes including human) was also discovered. Searching these elements identified 33 matrices predictive of binding sites for transcription factors known to be responsive to a broad range of pathological stimuli, including hypoxia, and metabolic and inflammatory proteins. Five phenotype-associated single nucleotide polymorphisms (SNPs) in the immediate vicinity of these elements included four SNPs (i.e. rs2569693, rs281439, rs281440 and rs11575074) predicted to impact binding motifs of transcription factors, and thus the expression of ICR and ICAM-1 genes, with potential to influence disease susceptibility. We verified that human retinal endothelial cells expressed ICR, and observed induction of expression by tumor necrosis factor-α.

  6. Next generation sequencing gives an insight into the characteristics of highly selected breeds versus non-breed horses in the course of domestication.

    PubMed

    Metzger, Julia; Tonda, Raul; Beltran, Sergi; Agueda, Lídia; Gut, Marta; Distl, Ottmar

    2014-07-04

    Domestication has shaped the horse and lead to a group of many different types. Some have been under strong human selection while others developed in close relationship with nature. The aim of our study was to perform next generation sequencing of breed and non-breed horses to provide an insight into genetic influences on selective forces. Whole genome sequencing of five horses of four different populations revealed 10,193,421 single nucleotide polymorphisms (SNPs) and 1,361,948 insertion/deletion polymorphisms (indels). In comparison to horse variant databases and previous reports, we were able to identify 3,394,883 novel SNPs and 868,525 novel indels. We analyzed the distribution of individual variants and found significant enrichment of private mutations in coding regions of genes involved in primary metabolic processes, anatomical structures, morphogenesis and cellular components in non-breed horses and in contrast to that private mutations in genes affecting cell communication, lipid metabolic process, neurological system process, muscle contraction, ion transport, developmental processes of the nervous system and ectoderm in breed horses. Our next generation sequencing data constitute an important first step for the characterization of non-breed in comparison to breed horses and provide a large number of novel variants for future analyses. Functional annotations suggest specific variants that could play a role for the characterization of breed or non-breed horses.

  7. Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort.

    PubMed

    Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Siler Masters, Bettie Sue; Martásek, Pavel

    2015-01-01

    Estimating polymorphic allele frequencies of the NADPH-CYP450 oxidoreductase (POR) gene in a Czech Slavic population. The POR gene was analyzed in 322 individuals from a control cohort by sequencing and high resolution melting analysis. We identified seven unreported SNP genetic variations, including two SNPs in the 5' flanking region (g.4965C>T and g.4994G>T), one intronic variant (c.1899-20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared with wild-type. New POR variant identification indicates the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYP450s in the endoplasmic reticulum. Original submitted 15 September 2014; Revision submitted 17 November 2014.

  8. Prenatal exposure to drinking-water chlorination by-products, cytochrome P450 gene polymorphisms and small-for-gestational-age neonates.

    PubMed

    Bonou, Samuella G; Levallois, Patrick; Giguère, Yves; Rodriguez, Manuel; Bureau, Alexandre

    2017-10-01

    Genetic susceptibility may modulate chlorination by-products (CBPs) effects on fetal growth, especially genes coding for the cytochrome P450 involved in the metabolism of CBPs and steroidogenesis. In a case-control study of 1432 mother-child pairs, we assessed the association between maternal and child single nucleotide polymorphisms (SNPs) within CYP1A2, CYP2A6, CYP2D6 and CYP17A1 genes and small-for-gestational-age neonates (SGA<10th percentile) as well as interaction between these SNPs and maternal exposure to trihalomethanes or haloacetic acids (HAAs) during the third trimester of pregnancy. Interactions were found between mother and neonate carrying CYP17A1 rs4919687A and rs743572G alleles and maternal exposure to total trihalomethanes or five regulated HAAs species. However, these interactions became non statistically significant after correction for multiple testing. There is some evidence, albeit weak, of a potential effect modification of the association between CBPs and SGA by SNPs in CYP17A1 gene. Further studies are needed to validate these observations. Copyright © 2017 Elsevier Inc. All rights reserved.

  9. An integrated in silico approach for functional and structural impact of non- synonymous SNPs in the MYH1 gene in Jeju Native Pigs.

    PubMed

    Ghosh, Mrinmoy; Sodhi, Simrinder Singh; Sharma, Neelesh; Mongre, Raj Kumar; Kim, Nameun; Singh, Amit Kumar; Lee, Sung Jin; Kim, Dae Cheol; Kim, Sung Woo; Lee, Hak Kyo; Song, Ki-Duk; Jeong, Dong Kee

    2016-02-04

    This study was performed to identify the non- synonymous polymorphisms in the myosin heavy chain 1 gene (MYH1) association with skeletal muscle development in economically important Jeju Native Pig (JNP) and Berkshire breeds. Herein, we present an in silico analysis, with a focus on (a) in silico approaches to predict the functional effect of non-synonymous SNP (nsSNP) in MYH1 on growth, and (b) molecular docking and dynamic simulation of MYH1 to predict the effects of those nsSNP on protein-protein association. The NextGENe (V 2.3.4.) tool was used to identify the variants in MYH1 from JNP and Berkshire using RNA seq. Gene ontology analysis of MYH1 revealed significant association with muscle contraction and muscle organ development. The 95 % confidence intervals clearly indicate that the mRNA expression of MYH1 is significantly higher in the Berkshire longissimus dorsi muscle samples than JNP breed. Concordant in silico analysis of MYH1, the open-source software tools identified 4 potential nsSNP (L884T, K972C, N981G, and Q1285C) in JNP and 1 nsSNP (H973G) in Berkshire pigs. Moreover, protein-protein interactions were studied to investigate the effect of MYH1 mutations on association with hub proteins, and MYH1 was found to be closely associated with the protein myosin light chain, phosphorylatable, fast skeletal muscle MYLPF. The results of molecular docking studies on MYH1 (native and 4 mutants) and MYLFP demonstrated that the native complex showed higher electrostatic energy (-466.5 Kcal mol(-1)), van der Walls energy (-87.3 Kcal mol(-1)), and interaction energy (-835.7 Kcal mol(-1)) than the mutant complexes. Furthermore, the molecular dynamic simulation revealed that the native complex yielded a higher root-mean-square deviation (0.2-0.55 nm) and lower root-mean-square fluctuation (approximately 0.08-0.3 nm) as compared to the mutant complexes. The results suggest that the variants at L884T, K972C, N981G, and Q1285C in MYH1 in JNP might represent a cause for the poor growth performance for this breed. This study is a pioneering in-depth in silico analysis of polymorphic MYH1 and will serve as a valuable resource for further targeted molecular diagnosis and population-based studies conducted for improving the growth performance of JNP.

  10. Molecular evolution of bovine Toll-like receptor 2 suggests substitutions of functional relevance.

    PubMed

    Jann, Oliver C; Werling, Dirk; Chang, Jung-Su; Haig, David; Glass, Elizabeth J

    2008-10-20

    There is accumulating evidence that polymorphism in Toll-like receptor (TLR) genes might be associated with disease resistance or susceptibility traits in livestock. Polymorphic sites affecting TLR function should exhibit signatures of positive selection, identified as a high ratio of non-synonymous to synonymous nucleotide substitutions (omega). Phylogeny based models of codon substitution based on estimates of omega for each amino acid position can therefore offer a valuable tool to predict sites of functional relevance. We have used this approach to identify such polymorphic sites within the bovine TLR2 genes from ten Bos indicus and Bos taurus cattle breeds. By analysing TLR2 gene phylogeny in a set of mammalian species and a subset of ruminant species we have estimated the selective pressure on individual sites and domains and identified polymorphisms at sites of putative functional importance. The omega were highest in the mammalian TLR2 domains thought to be responsible for ligand binding and lowest in regions responsible for heterodimerisation with other TLR-related molecules. Several positively-selected sites were detected in or around ligand-binding domains. However a comparison of the ruminant subset of TLR2 sequences with the whole mammalian set of sequences revealed that there has been less selective pressure among ruminants than in mammals as a whole. This suggests that there have been functional changes during ruminant evolution. Twenty newly-discovered non-synonymous polymorphic sites were identified in cattle. Three of them were localised at positions shaped by positive selection in the ruminant dataset (Leu227Phe, His305Pro, His326Gln) and in domains involved in the recognition of ligands. His326Gln is of particular interest as it consists of an exchange of differentially-charged amino acids at a position which has previously been shown to be crucial for ligand binding in human TLR2. Within bovine TLR2, polymorphisms at amino acid positions 227, 305 and 326 map to functionally important sites of TLR2 and should be considered as candidate SNPs for immune related traits in cattle. A final proof of their functional relevance requires further studies to determine their functional effect on the immune response after stimulation with relevant ligands and/or their association with immune related traits in animals.

  11. Empirical characteristics of family-based linkage to a complex trait: the ADIPOQ region and adiponectin levels.

    PubMed

    Hellwege, Jacklyn N; Palmer, Nicholette D; Mark Brown, W; Brown, Mark W; Ziegler, Julie T; Sandy An, S; An, Sandy S; Guo, Xiuqing; Ida Chen, Y-D; Chen, Ida Y-D; Taylor, Kent; Hawkins, Gregory A; Ng, Maggie C Y; Speliotes, Elizabeth K; Lorenzo, Carlos; Norris, Jill M; Rotter, Jerome I; Wagenknecht, Lynne E; Langefeld, Carl D; Bowden, Donald W

    2015-02-01

    We previously identified a low-frequency (1.1 %) coding variant (G45R; rs200573126) in the adiponectin gene (ADIPOQ) which was the basis for a multipoint microsatellite linkage signal (LOD = 8.2) for plasma adiponectin levels in Hispanic families. We have empirically evaluated the ability of data from targeted common variants, exome chip genotyping, and genome-wide association study data to detect linkage and association to adiponectin protein levels at this locus. Simple two-point linkage and association analyses were performed in 88 Hispanic families (1,150 individuals) using 10,958 SNPs on chromosome 3. Approaches were compared for their ability to map the functional variant, G45R, which was strongly linked (two-point LOD = 20.98) and powerfully associated (p value = 8.1 × 10(-50)). Over 450 SNPs within a broad 61 Mb interval around rs200573126 showed nominal evidence of linkage (LOD > 3) but only four other SNPs in this region were associated with p values < 1.0 × 10(-4). When G45R was accounted for, the maximum LOD score across the interval dropped to 4.39 and the best p value was 1.1 × 10(-5). Linked and/or associated variants ranged in frequency (0.0018-0.50) and type (coding, non-coding) and had little detectable linkage disequilibrium with rs200573126 (r (2) < 0.20). In addition, the two-point linkage approach empirically outperformed multipoint microsatellite and multipoint SNP analysis. In the absence of data for rs200573126, family-based linkage analysis using a moderately dense SNP dataset, including both common and low-frequency variants, resulted in stronger evidence for an adiponectin locus than association data alone. Thus, linkage analysis can be a useful tool to facilitate identification of high-impact genetic variants.

  12. Identifying Genetic Signatures of Natural Selection Using Pooled Population Sequencing in Picea abies

    PubMed Central

    Chen, Jun; Källman, Thomas; Ma, Xiao-Fei; Zaina, Giusi; Morgante, Michele; Lascoux, Martin

    2016-01-01

    The joint inference of selection and past demography remain a costly and demanding task. We used next generation sequencing of two pools of 48 Norway spruce mother trees, one corresponding to the Fennoscandian domain, and the other to the Alpine domain, to assess nucleotide polymorphism at 88 nuclear genes. These genes are candidate genes for phenological traits, and most belong to the photoperiod pathway. Estimates of population genetic summary statistics from the pooled data are similar to previous estimates, suggesting that pooled sequencing is reliable. The nonsynonymous SNPs tended to have both lower frequency differences and lower FST values between the two domains than silent ones. These results suggest the presence of purifying selection. The divergence between the two domains based on synonymous changes was around 5 million yr, a time similar to a recent phylogenetic estimate of 6 million yr, but much larger than earlier estimates based on isozymes. Two approaches, one of them novel and that considers both FST and difference in allele frequencies between the two domains, were used to identify SNPs potentially under diversifying selection. SNPs from around 20 genes were detected, including genes previously identified as main target for selection, such as PaPRR3 and PaGI. PMID:27172202

  13. Identifying Genetic Signatures of Natural Selection Using Pooled Population Sequencing in Picea abies.

    PubMed

    Chen, Jun; Källman, Thomas; Ma, Xiao-Fei; Zaina, Giusi; Morgante, Michele; Lascoux, Martin

    2016-07-07

    The joint inference of selection and past demography remain a costly and demanding task. We used next generation sequencing of two pools of 48 Norway spruce mother trees, one corresponding to the Fennoscandian domain, and the other to the Alpine domain, to assess nucleotide polymorphism at 88 nuclear genes. These genes are candidate genes for phenological traits, and most belong to the photoperiod pathway. Estimates of population genetic summary statistics from the pooled data are similar to previous estimates, suggesting that pooled sequencing is reliable. The nonsynonymous SNPs tended to have both lower frequency differences and lower FST values between the two domains than silent ones. These results suggest the presence of purifying selection. The divergence between the two domains based on synonymous changes was around 5 million yr, a time similar to a recent phylogenetic estimate of 6 million yr, but much larger than earlier estimates based on isozymes. Two approaches, one of them novel and that considers both FST and difference in allele frequencies between the two domains, were used to identify SNPs potentially under diversifying selection. SNPs from around 20 genes were detected, including genes previously identified as main target for selection, such as PaPRR3 and PaGI. Copyright © 2016 Chen et al.

  14. Phylogenetic and population-based approaches to mitogenome variation do not support association with male infertility.

    PubMed

    Gómez-Carballa, Alberto; Pardo-Seco, Jacobo; Martinón-Torres, Federico; Salas, Antonio

    2017-03-01

    Infertility has a complex multifactorial etiology and a high prevalence worldwide. Several studies have pointed to variation in the mitochondrial DNA (mtDNA) molecule as a factor responsible for the different disease phenotypes related to infertility. We analyzed 53 mitogenomes of infertile males from Galicia (northwest Spain), and these haplotypes were meta-analyzed phylogenetically with 43 previously reported from Portugal. Taking advantage of the large amount of information available, we additionally carried out association tests between patient mtDNA single-nucleotide polymorphisms (mtSNPs) and haplogroups against Iberian matched controls retrieved from The 1000 Genomes Project and the literature. Phylogenetic and association analyses did not reveal evidence of association between mtSNPs/haplogroups and infertility. Ratios and patterns in patients of nonsynonymous/synonymous changes, and variation at homoplasmic, heteroplasmic and private variants, fall within expected values for healthy individuals. Moreover, the haplogroup background of patients was variable and fits well with patterns typically observed in healthy western Europeans. We did not find evidence of association of mtSNPs or haplogroups pointing to a role for mtDNA in male infertility. A thorough review of the literature on mtDNA variation and infertility revealed contradictory findings and methodological and theoretical problems that overall undermine previous positive findings.

  15. Unlocking the vault: next generation museum population genomics

    PubMed Central

    Bi, Ke; Linderoth, Tyler; Vanderpool, Dan; Good, Jeffrey M; Nielsen, Rasmus; Moritz, Craig

    2014-01-01

    Natural history museum collections provide unique resources for understanding how species respond to environmental change, including the abrupt, anthropogenic climate change of the past century. Ideally, researchers would conduct genome-scale screening of museum specimens to explore the evolutionary consequences of environmental changes, but to date such analyses have been severely limited by the numerous challenges of working with the highly degraded DNA typical of historic samples. Here we circumvent these challenges by using custom, multiplexed, exon-capture to enrich and sequence ~11,000 exons (~4Mb) from early 20TH century museum skins. We used this approach to test for changes in genomic diversity accompanying a climate-related range retraction in the alpine chipmunks (Tamias alpinus) in the high Sierra Nevada area of California, USA. We developed robust bioinformatic pipelines that rigorously detect and filter-out base misincorporations in DNA derived from skins, most of which likely resulted from post-mortem damage. Furthermore, to accommodate genotyping uncertainties associated with low-medium coverage data, we applied a recently developed probabilistic method to call SNPs and estimate allele frequencies and the joint site frequency spectrum. Our results show increased genetic subdivision following range retraction, but no change in overall genetic diversity at either non-synonymous or synonymous sites. This case study showcases the advantages of integrating emerging genomic and statistical tools in museum collection-based population genomic applications. Such technical advances greatly enhance the value of museum collections, even where a pre-existing reference is lacking, and points to a broad range of potential applications in evolutionary and conservation biology. PMID:24118668

  16. E6 and E7 Gene Polymorphisms in Human Papillomavirus Types-58 and 33 Identified in Southwest China

    PubMed Central

    Wen, Qiang; Wang, Tao; Mu, Xuemei; Chenzhang, Yuwei; Cao, Man

    2017-01-01

    Cancer of the cervix is associated with infection by certain types of human papillomavirus (HPV). The gene variants differ in immune responses and oncogenic potential. The E6 and E7 proteins encoded by high-risk HPV play a key role in cellular transformation. HPV-33 and HPV-58 types are highly prevalent among Chinese women. To study the gene intratypic variations, polymorphisms and positive selections of HPV-33 and HPV-58 E6/E7 in southwest China, HPV-33 (E6, E7: n = 216) and HPV-58 (E6, E7: n = 405) E6 and E7 genes were sequenced and compared to others submitted to GenBank. Phylogenetic trees were constructed by Maximum-likelihood and the Kimura 2-parameters methods by MEGA 6 (Molecular Evolutionary Genetics Analysis version 6.0). The diversity of secondary structure was analyzed by PSIPred software. The selection pressures acting on the E6/E7 genes were estimated by PAML 4.8 (Phylogenetic Analyses by Maximun Likelihood version4.8) software. The positive sites of HPV-33 and HPV-58 E6/E7 were contrasted by ClustalX 2.1. Among 216 HPV-33 E6 sequences, 8 single nucleotide mutations were observed with 6/8 non-synonymous and 2/8 synonymous mutations. The 216 HPV-33 E7 sequences showed 3 single nucleotide mutations that were non-synonymous. The 405 HPV-58 E6 sequences revealed 8 single nucleotide mutations with 4/8 non-synonymous and 4/8 synonymous mutations. Among 405 HPV-58 E7 sequences, 13 single nucleotide mutations were observed with 10/13 non-synonymous mutations and 3/13 synonymous mutations. The selective pressure analysis showed that all HPV-33 and 4/6 HPV-58 E6/E7 major non-synonymous mutations were sites of positive selection. All variations were observed in sites belonging to major histocompatibility complex and/or B-cell predicted epitopes. K93N and R145 (I/N) were observed in both HPV-33 and HPV-58 E6. PMID:28141822

  17. Absence of coding mutations in the X-linked genes neuroligin 3 and neuroligin 4 in individuals with autism from the IMGSAC collection.

    PubMed

    Blasi, Francesca; Bacchelli, Elena; Pesaresi, Giulia; Carone, Simona; Bailey, Anthony J; Maestrini, Elena

    2006-04-05

    Neuroligin abnormalities have been recently implicated in the aetiology of autism spectrum disorders (ASD), given the finding of point mutations in the two X-linked genes NLGN3 and NLGN4X and the important role of neuroligins in synaptogenesis. To enquire on the relevance and frequency of neuroligin mutations in ASD, we performed a mutation screening of NLGN3 and NLGN4X in a sample of 124 autism probands from the International Molecular Genetic Study of Autism Consortium (IMGSAC). We identified a new non-synonymous variant in NLGN3 (Thr632Ala), which is likely to be a rare polymorphism. Our data indicate that coding mutations in these genes are very rarely associated to ASD. Copyright 2006 Wiley-Liss, Inc.

  18. Black-tie dress code: two new species of the genus Toxomerus (Diptera, Syrphidae)

    PubMed Central

    Mengual, Ximo

    2011-01-01

    Abstract Toxomerus hauseri Mengual sp. n. and Toxomerus picudus Mengual sp. n. are described from Peru and Ecuador respectively. Toxomerus circumcintus (Enderlein, 1938) is treated as a valid species and not considered synonym of Toxomerus marginatus, and Toxomerus ovatus (Hull, 1942) is considered junior synonym of Toxomerus nitidus (Schiner, 1868). An identification key for the Toxomerus species with dark abdomens is given along with diagnoses for each studied species. PMID:22144857

  19. Language impairment in a case of a complex chromosomal rearrangement with a breakpoint downstream of FOXP2.

    PubMed

    Moralli, Daniela; Nudel, Ron; Chan, May T M; Green, Catherine M; Volpi, Emanuela V; Benítez-Burraco, Antonio; Newbury, Dianne F; García-Bellido, Paloma

    2015-01-01

    We report on a young female, who presents with a severe speech and language disorder and a balanced de novo complex chromosomal rearrangement, likely to have resulted from a chromosome 7 pericentromeric inversion, followed by a chromosome 7 and 11 translocation. Using molecular cytogenetics, we mapped the four breakpoints to 7p21.1-15.3 (chromosome position: 20,954,043-21,001,537, hg19), 7q31 (chromosome position: 114,528,369-114,556,605, hg19), 7q21.3 (chromosome position: 93,884,065-93,933,453, hg19) and 11p12 (chromosome position: 38,601,145-38,621,572, hg19). These regions contain only non-coding transcripts (ENSG00000232790 on 7p21.1 and TCONS_00013886, TCONS_00013887, TCONS_00014353, TCONS_00013888 on 7q21) indicating that no coding sequences are directly disrupted. The breakpoint on 7q31 mapped 200 kb downstream of FOXP2, a well-known language gene. No splice site or non-synonymous coding variants were found in the FOXP2 coding sequence. We were unable to detect any changes in the expression level of FOXP2 in fibroblast cells derived from the proband, although this may be the result of the low expression level of FOXP2 in these cells. We conclude that the phenotype observed in this patient either arises from a subtle change in FOXP2 regulation due to the disruption of a downstream element controlling its expression, or from the direct disruption of non-coding RNAs.

  20. SNPs in putative regulatory regions identified by human mouse comparative sequencing and transcription factor binding site data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Banerjee, Poulabi; Bahlo, Melanie; Schwartz, Jody R.

    2002-01-01

    Genome wide disease association analysis using SNPs is being explored as a method for dissecting complex genetic traits and a vast number of SNPs have been generated for this purpose. As there are cost and throughput limitations of genotyping large numbers of SNPs and statistical issues regarding the large number of dependent tests on the same data set, to make association analysis practical it has been proposed that SNPs should be prioritized based on likely functional importance. The most easily identifiable functional SNPs are coding SNPs (cSNPs) and accordingly cSNPs have been screened in a number of studies. SNPs inmore » gene regulatory sequences embedded in noncoding DNA are another class of SNPs suggested for prioritization due to their predicted quantitative impact on gene expression. The main challenge in evaluating these SNPs, in contrast to cSNPs is a lack of robust algorithms and databases for recognizing regulatory sequences in noncoding DNA. Approaches that have been previously used to delineate noncoding sequences with gene regulatory activity include cross-species sequence comparisons and the search for sequences recognized by transcription factors. We combined these two methods to sift through mouse human genomic sequences to identify putative gene regulatory elements and subsequently localized SNPs within these sequences in a 1 Megabase (Mb) region of human chromosome 5q31, orthologous to mouse chromosome 11 containing the Interleukin cluster.« less

  1. CsSNP: A Web-Based Tool for the Detecting of Comparative Segments SNPs.

    PubMed

    Wang, Yi; Wang, Shuangshuang; Zhou, Dongjie; Yang, Shuai; Xu, Yongchao; Yang, Chao; Yang, Long

    2016-07-01

    SNP (single nucleotide polymorphism) is a popular tool for the study of genetic diversity, evolution, and other areas. Therefore, it is necessary to develop a convenient, utility, robust, rapid, and open source detecting-SNP tool for all researchers. Since the detection of SNPs needs special software and series steps including alignment, detection, analysis and present, the study of SNPs is limited for nonprofessional users. CsSNP (Comparative segments SNP, http://biodb.sdau.edu.cn/cssnp/ ) is a freely available web tool based on the Blat, Blast, and Perl programs to detect comparative segments SNPs and to show the detail information of SNPs. The results are filtered and presented in the statistics figure and a Gbrowse map. This platform contains the reference genomic sequences and coding sequences of 60 plant species, and also provides new opportunities for the users to detect SNPs easily. CsSNP is provided a convenient tool for nonprofessional users to find comparative segments SNPs in their own sequences, and give the users the information and the analysis of SNPs, and display these data in a dynamic map. It provides a new method to detect SNPs and may accelerate related studies.

  2. Identification of susceptible genes for complex chronic diseases based on disease risk functional SNPs and interaction networks.

    PubMed

    Li, Wan; Zhu, Lina; Huang, Hao; He, Yuehan; Lv, Junjie; Li, Weimin; Chen, Lina; He, Weiming

    2017-10-01

    Complex chronic diseases are caused by the effects of genetic and environmental factors. Single nucleotide polymorphisms (SNPs), one common type of genetic variations, played vital roles in diseases. We hypothesized that disease risk functional SNPs in coding regions and protein interaction network modules were more likely to contribute to the identification of disease susceptible genes for complex chronic diseases. This could help to further reveal the pathogenesis of complex chronic diseases. Disease risk SNPs were first recognized from public SNP data for coronary heart disease (CHD), hypertension (HT) and type 2 diabetes (T2D). SNPs in coding regions that were classified into nonsense and missense by integrating several SNP functional annotation databases were treated as functional SNPs. Then, regions significantly associated with each disease were screened using random permutations for disease risk functional SNPs. Corresponding to these regions, 155, 169 and 173 potential disease susceptible genes were identified for CHD, HT and T2D, respectively. A disease-related gene product interaction network in environmental context was constructed for interacting gene products of both disease genes and potential disease susceptible genes for these diseases. After functional enrichment analysis for disease associated modules, 5 CHD susceptible genes, 7 HT susceptible genes and 3 T2D susceptible genes were finally identified, some of which had pleiotropic effects. Most of these genes were verified to be related to these diseases in literature. This was similar for disease genes identified from another method proposed by Lee et al. from a different aspect. This research could provide novel perspectives for diagnosis and treatment of complex chronic diseases and susceptible genes identification for other diseases. Copyright © 2017 Elsevier Inc. All rights reserved.

  3. Junk DNA and the long non-coding RNA twist in cancer genetics

    PubMed Central

    Ling, Hui; Vincent, Kimberly; Pichler, Martin; Fodde, Riccardo; Berindan-Neagoe, Ioana; Slack, Frank J.; Calin, George A

    2015-01-01

    The central dogma of molecular biology states that the flow of genetic information moves from DNA to RNA to protein. However, in the last decade this dogma has been challenged by new findings on non-coding RNAs (ncRNAs) such as microRNAs (miRNAs). More recently, long non-coding RNAs (lncRNAs) have attracted much attention due to their large number and biological significance. Many lncRNAs have been identified as mapping to regulatory elements including gene promoters and enhancers, ultraconserved regions, and intergenic regions of protein-coding genes. Yet, the biological function and molecular mechanisms of lncRNA in human diseases in general and cancer in particular remain largely unknown. Data from the literature suggest that lncRNA, often via interaction with proteins, functions in specific genomic loci or use their own transcription loci for regulatory activity. In this review, we summarize recent findings supporting the importance of DNA loci in lncRNA function, and the underlying molecular mechanisms via cis or trans regulation, and discuss their implications in cancer. In addition, we use the 8q24 genomic locus, a region containing interactive SNPs, DNA regulatory elements and lncRNAs, as an example to illustrate how single nucleotide polymorphism (SNP) located within lncRNAs may be functionally associated with the individual’s susceptibility to cancer. PMID:25619839

  4. Exome sequencing for simultaneous mutation screening in children with hemophagocytic lymphohistiocytosis.

    PubMed

    Mukda, Ekchol; Trachoo, Objoon; Pasomsub, Ekawat; Tiyasirichokchai, Rawiphorn; Iemwimangsa, Nareenart; Sosothikul, Darintr; Chantratita, Wasun; Pakakasama, Samart

    2017-08-01

    In the present study, we used exome sequencing to analyze PRF1, UNC13D, STX11, and STXBP2, as well as genes associated with primary immunodeficiency disease (RAB27A, LYST, AP3B1, SH2D1A, ITK, CD27, XIAP, and MAGT1) in Thai children with hemophagocytic lymphohistiocytosis (HLH). We performed mutation analysis of HLH-associated genes in 25 Thai children using an exome sequencing method. Genetic variations found within these target genes were compared to exome sequencing data from 133 healthy individuals. Variants identified with minor allele frequencies <5% and novel mutations were confirmed using Sanger sequencing. Exome sequencing data revealed 101 non-synonymous single nucleotide polymorphisms (SNPs) in all subjects. These SNPs were classified as pathogenic (n = 1), likely pathogenic (n = 16), variant of unknown significance (n = 12), or benign variant (n = 72). Homozygous, compound heterozygous, and double-gene heterozygous variants, involving mutations in PRF1 (n = 3), UNC13D (n = 2), STXBP2 (n = 3), LYST (n = 3), XIAP (n = 2), AP3B1 (n = 1), RAB27A (n = 1), and MAGT1 (n = 1), were demonstrated in 12 patients. Novel mutations were found in most patients in this study. In conclusion, exome sequencing demonstrated the ability to identify rare genetic variants in HLH patients. This method is useful in the detection of mutations in multi-gene associated diseases.

  5. A Genome-wide Combinatorial Strategy Dissects Complex Genetic Architecture of Seed Coat Color in Chickpea

    PubMed Central

    Bajaj, Deepak; Das, Shouvik; Upadhyaya, Hari D.; Ranjan, Rajeev; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. Laxmipathi; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

    2015-01-01

    The study identified 9045 high-quality SNPs employing both genome-wide GBS- and candidate gene-based SNP genotyping assays in 172, including 93 cultivated (desi and kabuli) and 79 wild chickpea accessions. The GWAS in a structured population of 93 sequenced accessions detected 15 major genomic loci exhibiting significant association with seed coat color. Five seed color-associated major genomic loci underlying robust QTLs mapped on a high-density intra-specific genetic linkage map were validated by QTL mapping. The integration of association and QTL mapping with gene haplotype-specific LD mapping and transcript profiling identified novel allelic variants (non-synonymous SNPs) and haplotypes in a MATE secondary transporter gene regulating light/yellow brown and beige seed coat color differentiation in chickpea. The down-regulation and decreased transcript expression of beige seed coat color-associated MATE gene haplotype was correlated with reduced proanthocyanidins accumulation in the mature seed coats of beige than light/yellow brown seed colored desi and kabuli accessions for their coloration/pigmentation. This seed color-regulating MATE gene revealed strong purifying selection pressure primarily in LB/YB seed colored desi and wild Cicer reticulatum accessions compared with the BE seed colored kabuli accessions. The functionally relevant molecular tags identified have potential to decipher the complex transcriptional regulatory gene function of seed coat coloration and for understanding the selective sweep-based seed color trait evolutionary pattern in cultivated and wild accessions during chickpea domestication. The genome-wide integrated approach employed will expedite marker-assisted genetic enhancement for developing cultivars with desirable seed coat color types in chickpea. PMID:26635822

  6. Clinical characteristics and genetic analysis of childhood acute lymphoblastic leukemia with hemophagocytic lymphohistiocytosis: a Japanese retrospective study by the Kyushu-Yamaguchi Children's Cancer Study Group.

    PubMed

    Moritake, Hiroshi; Kamimura, Sachiyo; Nunoi, Hiroyuki; Nakayama, Hideki; Suminoe, Aiko; Inada, Hiroko; Inagaki, Jiro; Yanai, Fumio; Okamoto, Yasuhiro; Shinkoda, Yuichi; Shimomura, Maiko; Itonaga, Nobuyoshi; Hotta, Noriko; Hidaka, Yasufumi; Ohara, Osamu; Yanagimachi, Masakatsu; Nakajima, Noriko; Okamura, Jun; Kawano, Yoshifumi

    2014-07-01

    This present study sought to analyze acute lymphoblastic leukemia (ALL) patients with hemophagocytic lymphohistiocytosis (HLH) registered in Kyushu-Yamaguchi Children's Cancer Study Group studies conducted between 1996 and 2007. Four of 357 patients, including two of 318 patients with B cell precursor acute lymphoblastic leukemia (BCP-ALL) and two of 39 of those with T cell acute lymphoblastic leukemia (T-ALL), were identified. HLH was observed more frequently in the T-ALL patients than in the BCP-ALL patients (P = 0.061). The mean age of 13.0 years at the diagnosis of leukemia in the HLH + ALL group was significantly higher than the 6.05 years observed in the remaining ALL groups (P = 0.001). A female predisposition was noted, as all four patients were female (P = 0.043). In two of four patients, the leukemic cells exhibited deletions on the long arm of chromosome 6 (P = 0.003). Three patients suffered from HLH during maintenance therapy. Parvovirus B19 infection and cytomegalovirus reactivation were identified as causes of HLH in one and two patients, respectively. All four patients are currently in complete remission, although one developed relapse of leukemia after receiving maintenance therapy. Based on the genetic analyses, non-synonymous single nucleotide polymorphisms (SNPs) in UNC13D, syntaxin 11, and STXBP2 were identified in all patients. Clinicians should therefore be aware of the risk of HLH during maintenance therapy, especially in older T-ALL patients with SNPs in familial HLH causative genes.

  7. In Silico and In Vitro Investigations of the Mutability of Disease-Causing Missense Mutation Sites in Spermine Synthase

    PubMed Central

    Zhang, Zhe; Norris, Joy; Schwartz, Charles; Alexov, Emil

    2011-01-01

    Background Spermine synthase (SMS) is a key enzyme controlling the concentration of spermidine and spermine in the cell. The importance of SMS is manifested by the fact that single missense mutations were found to cause Snyder-Robinson Syndrome (SRS). At the same time, currently there are no non-synonymous single nucleoside polymorphisms, nsSNPs (harmless mutations), found in SMS, which may imply that the SMS does not tolerate amino acid substitutions, i.e. is not mutable. Methodology/Principal Findings To investigate the mutability of the SMS, we carried out in silico analysis and in vitro experiments of the effects of amino acid substitutions at the missense mutation sites (G56, V132 and I150) that have been shown to cause SRS. Our investigation showed that the mutation sites have different degree of mutability depending on their structural micro-environment and involvement in the function and structural integrity of the SMS. It was found that the I150 site does not tolerate any mutation, while V132, despite its key position at the interface of SMS dimer, is quite mutable. The G56 site is in the middle of the spectra, but still quite sensitive to charge residue replacement. Conclusions/Significance The performed analysis showed that mutability depends on the detail of the structural and functional factors and cannot be predicted based on conservation of wild type properties alone. Also, harmless nsSNPs can be expected to occur even at sites at which missense mutations were found to cause diseases. PMID:21647366

  8. Von Willebrand Factor Gene Variants Associate with Herpes simplex Encephalitis.

    PubMed

    Abdelmagid, Nada; Bereczky-Veress, Biborka; Atanur, Santosh; Musilová, Alena; Zídek, Václav; Saba, Laura; Warnecke, Andreas; Khademi, Mohsen; Studahl, Marie; Aurelius, Elisabeth; Hjalmarsson, Anders; Garcia-Diaz, Ana; Denis, Cécile V; Bergström, Tomas; Sköldenberg, Birgit; Kockum, Ingrid; Aitman, Timothy; Hübner, Norbert; Olsson, Tomas; Pravenec, Michal; Diez, Margarita

    2016-01-01

    Herpes simplex encephalitis (HSE) is a rare complication of Herpes simplex virus type-1 infection. It results in severe parenchymal damage in the brain. Although viral latency in neurons is very common in the population, it remains unclear why certain individuals develop HSE. Here we explore potential host genetic variants predisposing to HSE. In order to investigate this we used a rat HSE model comparing the HSE susceptible SHR (Spontaneously Hypertensive Rats) with the asymptomatic infection of BN (Brown Norway). Notably, both strains have HSV-1 spread to the CNS at four days after infection. A genome wide linkage analysis of 29 infected HXB/BXH RILs (recombinant inbred lines-generated from the prior two strains), displayed variable susceptibility to HSE enabling the definition of a significant QTL (quantitative trait locus) named Hse6 towards the end of chromosome 4 (160.89-174Mb) containing the Vwf (von Willebrand factor) gene. This was the only gene in the QTL with both cis-regulation in the brain and included several non-synonymous SNPs (single nucleotide polymorphism). Intriguingly, in human chromosome 12 several SNPs within the intronic region between exon 43 and 44 of the VWF gene were associated with human HSE pathogenesis. In particular, rs917859 is nominally associated with an odds ratio of 1.5 (95% CI 1.11-2.02; p-value = 0.008) after genotyping in 115 HSE cases and 428 controls. Although there are possibly several genetic and environmental factors involved in development of HSE, our study identifies variants of the VWF gene as candidates for susceptibility in experimental and human HSE.

  9. Von Willebrand Factor Gene Variants Associate with Herpes simplex Encephalitis

    PubMed Central

    Atanur, Santosh; Musilová, Alena; Zídek, Václav; Saba, Laura; Warnecke, Andreas; Khademi, Mohsen; Studahl, Marie; Aurelius, Elisabeth; Hjalmarsson, Anders; Garcia-Diaz, Ana; Denis, Cécile V.; Bergström, Tomas; Sköldenberg, Birgit; Kockum, Ingrid; Aitman, Timothy; Hübner, Norbert; Olsson, Tomas; Pravenec, Michal; Diez, Margarita

    2016-01-01

    Herpes simplex encephalitis (HSE) is a rare complication of Herpes simplex virus type-1 infection. It results in severe parenchymal damage in the brain. Although viral latency in neurons is very common in the population, it remains unclear why certain individuals develop HSE. Here we explore potential host genetic variants predisposing to HSE. In order to investigate this we used a rat HSE model comparing the HSE susceptible SHR (Spontaneously Hypertensive Rats) with the asymptomatic infection of BN (Brown Norway). Notably, both strains have HSV-1 spread to the CNS at four days after infection. A genome wide linkage analysis of 29 infected HXB/BXH RILs (recombinant inbred lines—generated from the prior two strains), displayed variable susceptibility to HSE enabling the definition of a significant QTL (quantitative trait locus) named Hse6 towards the end of chromosome 4 (160.89–174Mb) containing the Vwf (von Willebrand factor) gene. This was the only gene in the QTL with both cis-regulation in the brain and included several non-synonymous SNPs (single nucleotide polymorphism). Intriguingly, in human chromosome 12 several SNPs within the intronic region between exon 43 and 44 of the VWF gene were associated with human HSE pathogenesis. In particular, rs917859 is nominally associated with an odds ratio of 1.5 (95% CI 1.11–2.02; p-value = 0.008) after genotyping in 115 HSE cases and 428 controls. Although there are possibly several genetic and environmental factors involved in development of HSE, our study identifies variants of the VWF gene as candidates for susceptibility in experimental and human HSE. PMID:27224245

  10. RNA editing differently affects protein-coding genes in D. melanogaster and H. sapiens.

    PubMed

    Grassi, Luigi; Leoni, Guido; Tramontano, Anna

    2015-07-14

    When an RNA editing event occurs within a coding sequence it can lead to a different encoded amino acid. The biological significance of these events remains an open question: they can modulate protein functionality, increase the complexity of transcriptomes or arise from a loose specificity of the involved enzymes. We analysed the editing events in coding regions that produce or not a change in the encoded amino acid (nonsynonymous and synonymous events, respectively) in D. melanogaster and in H. sapiens and compared them with the appropriate random models. Interestingly, our results show that the phenomenon has rather different characteristics in the two organisms. For example, we confirm the observation that editing events occur more frequently in non-coding than in coding regions, and report that this effect is much more evident in H. sapiens. Additionally, in this latter organism, editing events tend to affect less conserved residues. The less frequently occurring editing events in Drosophila tend to avoid drastic amino acid changes. Interestingly, we find that, in Drosophila, changes from less frequently used codons to more frequently used ones are favoured, while this is not the case in H. sapiens.

  11. Genome-wide association study identifies common genetic variants associated with salivary gland carcinoma and its subtypes.

    PubMed

    Xu, Li; Tang, Hongwei; Chen, Diane W; El-Naggar, Adel K; Wei, Peng; Sturgis, Erich M

    2015-07-15

    Salivary gland carcinomas (SGCs) are a rare malignancy with unknown etiology. The objective of the current study was to identify genetic variants modifying the risk of SGC and its major subtypes: adenoid cystic carcinoma and mucoepidermoid carcinoma. The authors conducted a genome-wide association study in 309 well-defined SGC cases and 535 cancer-free controls. A single-nucleotide polymorphism (SNP)-level discovery study was performed in non-Hispanic white individuals followed by a replication study in Hispanic individuals. A logistic regression analysis was applied to calculate odds ratios (ORs) and 95% confidence intervals (95% CIs). A meta-analysis of the results was conducted. A genome-wide significant association with SGC in non-Hispanic white individuals was detected at coding SNPs in CHRNA2 (cholinergic receptor, nicotinic, alpha 2 [neuronal]) (OR, 8.55; 95% CI, 4.53-16.13 [P = 3.6 × 10(-11)]), OR4F15 (olfactory receptor, family 4, subfamily F, member 15) (OR, 5.26; 95% CI, 3.13-8.83 [P = 3.5 × 10(-10)]), ZNF343 (zinc finger protein 343) (OR, 3.28; 95% CI, 2.12-5.07 [P = 9.1 × 10(-8)]), and PARP4 (poly(ADP-ribose) polymerase family, member 4) (OR, 2.00; 95% CI, 1.54-2.59 [P = 1.7 × 10(-7)]). Meta-analysis of the non-Hispanic white and Hispanic cohorts identified another genome-wide significant SNP in ELL2 (meta-OR, 1.86; 95% CI, 1.48-2.34 [P = 1.3 × 10(-7)]). Risk alleles were largely enriched in mucoepidermoid carcinoma, in which the SNPs in CHRNA2, OR4F15, and ZNF343 had ORs of 15.71 (95% CI, 6.59-37.47 [P = 5.2 × 10(-10)]), 15.60 (95% CI, 6.50-37.41 [P = 7.5 × 10(-10)]), and 6.49 (95% CI, 3.36-12.52 [P = 2.5 × 10(-8)]), respectively. None of these SNPs retained a significant association with adenoid cystic carcinoma. To the best of the authors' knowledge, the current study is the first to identify a panel of SNPs associated with the risk of SGC. Confirmation of these findings along with functional analysis of identified SNPs are needed. © 2015 American Cancer Society.

  12. High-confidence assessment of functional impact of human mitochondrial non-synonymous genome variations by APOGEE.

    PubMed

    Castellana, Stefano; Fusilli, Caterina; Mazzoccoli, Gianluigi; Biagini, Tommaso; Capocefalo, Daniele; Carella, Massimo; Vescovi, Angelo Luigi; Mazza, Tommaso

    2017-06-01

    24,189 are all the possible non-synonymous amino acid changes potentially affecting the human mitochondrial DNA. Only a tiny subset was functionally evaluated with certainty so far, while the pathogenicity of the vast majority was only assessed in-silico by software predictors. Since these tools proved to be rather incongruent, we have designed and implemented APOGEE, a machine-learning algorithm that outperforms all existing prediction methods in estimating the harmfulness of mitochondrial non-synonymous genome variations. We provide a detailed description of the underlying algorithm, of the selected and manually curated training and test sets of variants, as well as of its classification ability.

  13. Genomic analysis of the aconidial and high-performance protein producer, industrially relevant Aspergillus niger SH2 strain.

    PubMed

    Yin, Chao; Wang, Bin; He, Pan; Lin, Ying; Pan, Li

    2014-05-15

    Aspergillus niger is usually regarded as a beneficial species widely used in biotechnological industry. Obtaining the genome sequence of the widely used aconidial A. niger SH2 strain is of great importance to understand its unusual production capability. In this study we assembled a high-quality genome sequence of A. niger SH2 with approximately 11,517 ORFs. Relatively high proportion of genes enriched for protein expression related FunCat items verify its efficient capacity in protein production. Furthermore, genome-wide comparative analysis between A. niger SH2 and CBS513.88 reveals insights into unique properties of A. niger SH2. A. niger SH2 lacks the gene related with the initiation of asexual sporulation (PrpA), leading to its distinct aconidial phenotype. Frame shift mutations and non-synonymous SNPs in genes of cell wall integrity signaling, β-1,3-glucan synthesis and chitin synthesis influence its cell wall development which is important for its hyphal fragmentation during industrial high-efficiency protein production. Copyright © 2014 Elsevier B.V. All rights reserved.

  14. Are Synonymous Substitutions in Flowering Plant Mitochondria Neutral?

    PubMed

    Wynn, Emily L; Christensen, Alan C

    2015-10-01

    Angiosperm mitochondrial genes appear to have very low mutation rates, while non-gene regions expand, diverge, and rearrange quickly. One possible explanation for this disparity is that synonymous substitutions in plant mitochondrial genes are not truly neutral and selection keeps their occurrence low. If this were true, the explanation for the disparity in mutation rates in genes and non-genes needs to consider selection as well as mechanisms of DNA repair. Rps14 is co-transcribed with cob and rpl5 in most plant mitochondrial genomes, but in some genomes, rps14 has been duplicated to the nucleus leaving a pseudogene in the mitochondria. This provides an opportunity to compare neutral substitution rates in pseudogenes with synonymous substitution rates in the orthologs. Genes and pseudogenes of rps14 have been aligned among different species and the mutation rates have been calculated. Neutral substitution rates in pseudogenes and synonymous substitution rates in genes are significantly different, providing evidence that synonymous substitutions in plant mitochondrial genes are not completely neutral. The non-neutrality is not sufficient to completely explain the exceptionally low mutation rates in land plant mitochondrial genomes, but selective forces appear to play a small role.

  15. Streptomyces ciscaucasicus Sveshnikova et al. 1983 is a later subjective synonym of Streptomyces canus Heinemann et al. 1953.

    PubMed

    Kämpfer, Peter; Rückert, Christian; Blom, Jochen; Goesmann, Alexander; Wink, Joachim; Kalinowski, Jörn; Glaeser, Stefanie P

    2018-01-01

    Streptomyces canuswas described in 1953 and the name was listed in the Approved List of Bacterial Names in 1980. Three years later, Streptomyces ciscaucasicus was published and the name was subsequently validated in Validation List no. 22 in 1986. On the basis of genome comparison and multilocus sequence analysis of the type strains of Streptomyces canus and Streptomyces ciscaucasicus it can now be shown that these two species despite some phenotypic differences are subjective synonyms. In such a case Rule 24 of the Bacteriological Code applies, in which priority of names is determined by the date of the original publication. Hence, we propose that S. ciscaucasicus is a later subjective synonym of S. canus.

  16. Widespread signatures of local mRNA folding structure selection in four Dengue virus serotypes

    PubMed Central

    2015-01-01

    Background It is known that mRNA folding can affect and regulate various gene expression steps both in living organisms and in viruses. Previous studies have recognized functional RNA structures in the genome of the Dengue virus. However, these studies usually focused either on the viral untranslated regions or on very specific and limited regions at the beginning of the coding sequences, in a limited number of strains, and without considering evolutionary selection. Results Here we performed the first large scale comprehensive genomics analysis of selection for local mRNA folding strength in the Dengue virus coding sequences, based on a total of 1,670 genomes and 4 serotypes. Our analysis identified clusters of positions along the coding regions that may undergo a conserved evolutionary selection for strong or weak local folding maintained across different viral variants. Specifically, 53-66 clusters for strong folding and 49-73 clusters for weak folding (depending on serotype) aggregated of positions with a significant conservation of folding energy signals (related to partially overlapping local genomic regions) were recognized. In addition, up to 7% of these positions were found to be conserved in more than 90% of the viral genomes. Although some of the identified positions undergo frequent synonymous / non-synonymous substitutions, the selection for folding strength therein is preserved, and thus cannot be trivially explained based on sequence conservation alone. Conclusions The fact that many of the positions with significant folding related signals are conserved among different Dengue variants suggests that a better understanding of the mRNA structures in the corresponding regions may promote the development of prospective anti- Dengue vaccination strategies. The comparative genomics approach described here can be employed in the future for detecting functional regions in other pathogens with very high mutations rates. PMID:26449467

  17. In-depth genome characterization of a Brazilian common bean core collection using DArTseq high-density SNP genotyping.

    PubMed

    Valdisser, Paula A M R; Pereira, Wendell J; Almeida Filho, Jâneo E; Müller, Bárbara S F; Coelho, Gesimária R C; de Menezes, Ivandilson P P; Vianna, João P G; Zucchi, Maria I; Lanna, Anna C; Coelho, Alexandre S G; de Oliveira, Jaison P; Moraes, Alessandra da Cunha; Brondani, Claudio; Vianello, Rosana P

    2017-05-30

    Common bean is a legume of social and nutritional importance as a food crop, cultivated worldwide especially in developing countries, accounting for an important source of income for small farmers. The availability of the complete sequences of the two common bean genomes has dramatically accelerated and has enabled new experimental strategies to be applied for genetic research. DArTseq has been widely used as a method of SNP genotyping allowing comprehensive genome coverage with genetic applications in common bean breeding programs. Using this technology, 6286 SNPs (1 SNP/86.5 Kbp) were genotyped in genic (43.3%) and non-genic regions (56.7%). Genetic subdivision associated to the common bean gene pools (K = 2) and related to grain types (K = 3 and K = 5) were reported. A total of 83% and 91% of all SNPs were polymorphic within the Andean and Mesoamerican gene pools, respectively, and 26% were able to differentiate the gene pools. Genetic diversity analysis revealed an average H E of 0.442 for the whole collection, 0.102 for Andean and 0.168 for Mesoamerican gene pools (F ST  = 0.747 between gene pools), 0.440 for the group of cultivars and lines, and 0.448 for the group of landrace accessions (F ST  = 0.002 between cultivar/line and landrace groups). The SNP effects were predicted with predominance of impact on non-coding regions (77.8%). SNPs under selection were identified within gene pools comparing landrace and cultivar/line germplasm groups (Andean: 18; Mesoamerican: 69) and between the gene pools (59 SNPs), predominantly on chromosomes 1 and 9. The LD extension estimate corrected for population structure and relatedness (r 2 SV ) was ~ 88 kbp, while for the Andean gene pool was ~ 395 kbp, and for the Mesoamerican was ~ 130 kbp. For common bean, DArTseq provides an efficient and cost-effective strategy of generating SNPs for large-scale genome-wide studies. The DArTseq resulted in an operational panel of 560 polymorphic SNPs in linkage equilibrium, providing high genome coverage. This SNP set could be used in genotyping platforms with many applications, such as population genetics, phylogeny relation between common bean varieties and support to molecular breeding approaches.

  18. Association of twelve polymorphisms in three onco-lncRNA genes with hepatocellular cancer risk and prognosis: A case-control study

    PubMed Central

    Wang, Ben-Gang; Xu, Qian; Lv, Zhi; Fang, Xin-Xin; Ding, Han-Xi; Wen, Jing; Yuan, Yuan

    2018-01-01

    AIM To evaluate the association of 12 tag single nucleotide polymorphisms (tagSNPs) in three onco-long non-coding RNA (lncRNA) genes (HOTTIP, CCAT2, MALAT1) with the risk and prognosis of hepatocellular cancer (HCC). METHODS Twelve tagSNPs covering the three onco-lncRNAs were genotyped by the KASP method in a total of 1338 samples, including 521 HCC patients and frequency-matched 817 controls. The samples were obtained from an unrelated Chinese population at the First Hospital of China Medical University from 2012-2015. The expression quantitative trait loci (eQTL) analyses were conducted to explore further the potential function of the promising SNPs. RESULTS Three SNPs in HOTTIP, one promoter SNP in MALAT1, and one haplotype of HOTTIP were associated with HCC risk. The HOTTIP rs17501292, rs2067087, and rs17427960 SNPs were increased to 1.55-, 1.20-, and 1.18-fold HCC risk under allelic models (P = 0.012, 0.017 and 0.049, respectively). MALAT1 rs4102217 SNP was increased to a 1.32-fold HCC risk under dominant models (P = 0.028). In addition, the two-way interaction of HOTTIP rs17501292-MALAT1 rs619586 polymorphisms showed a decreased effect on HCC risk (Pinteraction = 0.028, OR = 0.30) and epistasis with each other. HOTTIP rs3807598 variant genotype showed significantly longer survival time in HBV negative subgroup (P = 0.049, HR = 0.12), and MALAT1 rs591291 showed significantly better prognosis in female and HBV negative subgroups (P = 0.022, HR = 0.37; P = 0.042, HR = 0.25, respectively). In the study, no significant effect was observed in eQTL analysis. CONCLUSION Specific lncRNA (HOTTIP and MALAT1) SNPs have potential to be biomarkers for HCC risk and prognosis. PMID:29930469

  19. Identification and functional characterization of genetic variants of human organic cation transporters in a Korean population.

    PubMed

    Kang, Ho-Jin; Song, Im-Sook; Shin, Ho Jung; Kim, Woo-Young; Lee, Choong-Hee; Shim, Joo-Cheol; Zhou, Hong-Hao; Lee, Sang Seop; Shin, Jae-Gook

    2007-04-01

    Genetic variants of three human organic cation transporter genes (hOCTs) were extensively explored in a Korean population. The functional changes of hOCT2 variants were evaluated in vitro, and those genetic polymorphisms of hOCTs were compared among different ethnic populations. From direct DNA sequencing, 7 of 13 coding variants were nonsynonymous single-nucleotide polymorphisms (SNPs), including four variants from hOCT1 (F160L, P283L, P341L, and M408V) and three from hOCT2 (T199I, T201M, and A270S), whereas 6 were synonymous SNPs. The linkage disequilibrium analysis presented for three independent LD blocks for each hOCT gene showed no significant linkage among all three hOCT genes. The transporter activities of MDCK cells that overexpress the hOCT2-T199I, -T201M, and -A270S variants showed significantly decreased uptake of [(3)H]methyl-4-phenylpyridinium acetate (MPP(+)) or [(14)C]tetraethylammonium compared with those cells that overexpress wild-type hOCT2, and the estimated kinetic parameters of these variants for [(3)H]MPP(+) uptake in oocytes showed a 2- to 5-fold increase in K(m) values and a 10- to 20-fold decrease in V(max) values. The allele frequencies of the five functional variants hOCT1-P283L, -P341L, and hOCT2-T199I, -T201M, and -A270S were 1.3, 17, 0.7, 0.7, and 11%, respectively, in a Korean population; the frequency distributions of these variants were not significantly different from those of Chinese and Vietnamese populations. These findings suggest that genetic variants of hOCTs are not linked among three genes in a Korean population, and several of the hOCT genetic variants cause decreased transport activity in vitro compared with the wild type, although the clinical relevance of these variants remains to be evaluated.

  20. Genetic variants in the mTOR pathway and interaction with body size and weight gain on breast cancer risk in African-American and European-American women

    PubMed Central

    Cheng, Ting-Yuan David; Shankar, Jyoti; Zirpoli, Gary; Roberts, Michelle R.; Hong, Chi-Chen; Bandera, Elisa V.; Ambrosone, Christine B.; Yao, Song

    2016-01-01

    Purpose Positive energy imbalance and growth factors linked to obesity promote the phosphatidylinositol 3-kinase/AKT/mammalian target of rapamycin (mTOR) pathway. As the obesity-breast cancer associations differ between European-American (EA) and African-American (AA) women, we investigated genetic variants in the mTOR pathway and breast cancer risk in these two racial groups. Methods We examined 400 single nucleotide polymorphisms (SNPs) in 31 mTOR pathway genes in the Women’s Circle of Health Study with 1263 incident breast cancers (645 EA, 618 AA) and 1382 controls (641 EA, 741 AA). Multivariable logistic regression was performed separately within racial groups. Effect modification was assessed for measured body size and weight gain since age 20. Results In EA women, variants in FRAP1 rs12125777 (intron), PRR5L rs3740958 (synonymous-coding), and CDKAL1 rs9368197 (intron) were associated with increased breast cancer risk, while variants in RPTOR rs9900506 (intron) were associated with decreased risk (nominal P-trend for functional and FRAP1 SNPs or P adjusted for correlated test [PACT] <0.05). For AA women, variants in RPTOR rs3817293 (intron), PIK3R1 rs7713645 (intron), and CDKAL1 rs9368197 were associated with decreased breast cancer risk. The significance for FRAP1 rs12125777 and RPTOR rs9900506 in EA women did not hold after correction for multiple comparisons. The risk associated with FRAP1 rs12125777 was higher among EAs who had body mass index ≥30 kg/m2 (odds ratio=7.69, 95% CI=2.11–28.0; P-interaction=0.007) and gained weight ≥35 lb. since age 20 (odds ratio=3.34, 95% CI=1.42–7.85; P-interaction=0.021), compared to their counterparts. Conclusions The mTOR pathway may be involved in breast cancer carcinogenesis differently for EA and AA women. PMID:27314662

  1. Identification of the Q969R gain-of-function polymorphism in the gene encoding porcine NLRP3 and its distribution in pigs of Asian and European origin.

    PubMed

    Tohno, Masanori; Shinkai, Hiroki; Toki, Daisuke; Okumura, Naohiko; Tajima, Kiyoshi; Uenishi, Hirohide

    2016-10-01

    The nucleotide-binding domain, leucine-rich-containing family, pyrin-domain containing-3 (NLRP3) inflammasome comprises the major components caspase-1, apoptosis-associated speck-like protein containing a caspase recruitment domain (ASC), and NLRP3. NLRP3 plays important roles in maintaining immune homeostasis mediated by intestinal microorganisms and in the immunostimulatory properties of vaccine adjuvants used to induce an immune response. In the present study, we first cloned a complementary DNA (cDNA) encoding porcine ASC because its genomic sequence was not completely determined. The availability of the ASC cDNA enabled us to reconstitute porcine NLRP3 inflammasomes using an in vitro system that led to the identification of the immune functions of porcine NLRP3 and ASC based on the production of interleukin-1β (IL-1β). Further, we identified six synonymous and six nonsynonymous single-nucleotide polymorphisms (SNPs) in the coding sequence of NLRP3 of six breeds of pigs, including major commercial breeds. Among the nonsynonymous SNPs, the Q969R polymorphism is associated with an increased release of IL-1β compared with other porcine NLRP3 variants, indicating that this polymorphism represents a gain-of-function mutation. This allele was detected in 100 % of the analyzed Chinese Jinhua and Japanese wild boars, suggesting that the allele is maintained in the major commercial native European breeds Landrace, Large White, and Berkshire. These findings represent an important contribution to our knowledge of the diversity of NLRP3 nucleotide sequences among various pig populations. Moreover, efforts to exploit the gain of function induced by the Q969R polymorphism promise to improve pig breeding and husbandry by conferring enhanced resistance to pathogens as well as contributing to vaccine efficacy.

  2. The complete mitochondrial genome of dhole Cuon alpinus: phylogenetic analysis and dating evolutionary divergence within Canidae.

    PubMed

    Zhang, Honghai; Chen, Lei

    2011-03-01

    The dhole (Cuon alpinus) is the only existent species in the genus Cuon (Carnivora: Canidae). In the present study, the complete mitochondrial genome of the dhole was sequenced. The total length is 16672 base pairs which is the shortest in Canidae. Sequence analysis revealed that most mitochondrial genomic functional regions were highly consistent among canid animals except the CSB domain of the control region. The difference in length among the Canidae mitochondrial genome sequences is mainly due to the number of short segments of tandem repeated in the CSB domain. Phylogenetic analysis was progressed based on the concatenated data set of 14 mitochondrial genes of 8 canid animals by using maximum parsimony (MP), maximum likelihood (ML) and Bayesian (BI) inference methods. The genera Vulpes and Nyctereutes formed a sister group and split first within Canidae, followed by that in the Cuon. The divergence in the genus Canis was the latest. The divarication of domestic dogs after that of the Canis lupus laniger is completely supported by all the three topologies. Pairwise sequence divergence data of different mitochondrial genes among canid animals were also determined. Except for the synonymous substitutions in protein-coding genes, the control region exhibits the highest sequence divergences. The synonymous rates are approximately two to six times higher than those of the non-synonymous sites except for a slightly higher rate in the non-synonymous substitution between Cuon alpinus and Vulpes vulpes. 16S rRNA genes have a slightly faster sequence divergence than 12S rRNA and tRNA genes. Based on nucleotide substitutions of tRNA genes and rRNA genes, the times since divergence between dhole and other canid animals, and between domestic dogs and three subspecies of wolves were evaluated. The result indicates that Vulpes and Nyctereutes have a close phylogenetic relationship and the divergence of Nyctereutes is a little earlier. The Tibetan wolf may be an archaic pedigree within wolf subspecies. The genetic distance between wolves and domestic dogs is less than that among different subspecies of wolves. The domestication of dogs was about 1.56-1.92 million years ago or even earlier.

  3. Polymorphisms of 20 regulatory proteins between Mycobacterium tuberculosis and Mycobacterium bovis.

    PubMed

    Bigi, María M; Blanco, Federico Carlos; Araújo, Flabio R; Thacker, Tyler C; Zumárraga, Martín J; Cataldi, Angel A; Soria, Marcelo A; Bigi, Fabiana

    2016-08-01

    Mycobacterium tuberculosis and Mycobacterium bovis are responsible for tuberculosis in humans and animals, respectively. Both species are closely related and belong to the Mycobacterium tuberculosis complex (MTC). M. tuberculosis is the most ancient species from which M. bovis and other members of the MTC evolved. The genome of M. bovis is over >99.95% identical to that of M. tuberculosis but with seven deletions ranging in size from 1 to 12.7 kb. In addition, 1200 single nucleotide mutations in coding regions distinguish M. bovis from M. tuberculosis. In the present study, we assessed 75 M. tuberculosis genomes and 23 M. bovis genomes to identify non-synonymous mutations in 202 coding sequences of regulatory genes between both species. We identified species-specific variants in 20 regulatory proteins and confirmed differential expression of hypoxia-related genes between M. bovis and M. tuberculosis. © 2016 The Societies and John Wiley & Sons Australia, Ltd.

  4. Sequence data and association statistics from 12,940 type 2 diabetes cases and controls.

    PubMed

    Flannick, Jason; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M; Agarwala, Vineeta; Gaulton, Kyle J; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Dennis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana Cn; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Altshuler, David; Burtt, Noël P; Florez, Jose C; Boehnke, Michael; McCarthy, Mark I

    2017-12-19

    To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.

  5. Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

    PubMed Central

    Jason, Flannick; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M.; Agarwala, Vineeta; Gaulton, Kyle J.; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J.; Rivas, Manuel A.; Perry, John R. B.; Sim, Xueling; Blackwell, Thomas W.; Robertson, Neil R.; Rayner, N William; Cingolani, Pablo; Locke, Adam E.; Tajes, Juan Fernandez; Highland, Heather M.; Dupuis, Josee; Chines, Peter S.; Lindgren, Cecilia M.; Hartl, Christopher; Jackson, Anne U.; Chen, Han; Huyghe, Jeroen R.; van de Bunt, Martijn; Pearson, Richard D.; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M.; Gamazon, Eric R.; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A.; Below, Jennifer E.; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L.; Pasko, Dorota; Parker, Stephen C. J.; Varga, Tibor V.; Green, Todd; Beer, Nicola L.; Day-Williams, Aaron G.; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J.; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P.; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F.; Han, Bok-Ghee; Jenkinson, Christopher P.; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C. Y.; Palmer, Nicholette D.; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E.; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D.; Neale, Benjamin M.; Purcell, Shaun; Butterworth, Adam S.; Howson, Joanna M. M.; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K. L.; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H. T.; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E.; Rybin, Dennis; Farook, Vidya S.; Fowler, Sharon P.; Freedman, Barry I.; Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J.; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K.; Puppala, Sobha; Scott, William R.; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A.; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C.; Mangino, Massimo; Bonnycastle, Lori L.; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L.; Herder, Christian; Groves, Christopher J.; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A.; Doney, Alex S. F.; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J.; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E.; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H.; Stirrups, Kathleen; Wood, Andrew R.; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O.; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P.; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B.; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N. A.; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M.; Syvänen, Ann-Christine; Bergman, Richard N.; Bharadwaj, Dwaipayan; Bottinger, Erwin P.; Cho, Yoon Shin; Chandak, Giriraj R.; Chan, Juliana CN; Chia, Kee Seng; Daly, Mark J.; Ebrahim, Shah B.; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A.; Lehman, Donna M.; Jia, Weiping; Ma, Ronald C. W.; Pollin, Toni I.; Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J. F.; Small, Kerrin S.; Ried, Janina S.; DeFronzo, Ralph A.; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J.; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W.; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R.; Gloyn, Anna L.; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D.; Hattersley, Andrew T.; Bowden, Donald W.; Collins, Francis S.; Atzmon, Gil; Chambers, John C.; Spector, Timothy D.; Laakso, Markku; Strom, Tim M.; Bell, Graeme I.; Blangero, John; Duggirala, Ravindranath; Tai, E. Shyong; McVean, Gilean; Hanis, Craig L.; Wilson, James G.; Seielstad, Mark; Frayling, Timothy M.; Meigs, James B.; Cox, Nancy J.; Sladek, Rob; Lander, Eric S.; Gabriel, Stacey; Mohlke, Karen L.; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J.; Morris, Andrew P.; Kang, Hyun Min; Altshuler, David; Burtt, Noël P.; Florez, Jose C.; Boehnke, Michael; McCarthy, Mark I.

    2017-01-01

    To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1–5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D. PMID:29257133

  6. Identification of a possible susceptibility locus for UVB-induced skin tanning phenotype in Korean females using genomewide association study.

    PubMed

    Kwak, Taek-Jong; Chang, Yun-Hee; Shin, Young-Ah; Shin, Jung-Min; Kim, Ji-Hye; Lim, Seul-Ki; Lee, Sang-Hwha; Lee, Min-Geol; Yoon, Tae-Jin; Kim, Chang-Deok; Lee, Jeung-Hoon; Koh, Jae Sook; Seo, Young Kyoung; Chang, Min-Youl; Lee, Young

    2015-12-01

    A two-stage genomewide association (GWA) analysis was conducted to investigate the genetic factors influencing ultraviolet (UV)-induced skin pigmentation in Korean females after UV exposure. Previously, a GWA study evaluating ~500 000 single nucleotide polymorphisms (SNPs) in 99 Korean females identified eight SNPs that were highly associated with tanning ability. To confirm these associations, we genotyped the SNPs in an independent replication study (112 Korean females). We found that a novel SNP in the intron of the WW domain-containing oxidoreductase (WWOX) gene yielded significant replicated associations with skin tanning ability (P-value = 1.16 × 10(-4) ). To understand the functional consequences of this locus located in the non-coding region, we investigated the role of WWOX in human melanocytes using a recombinant adenovirus expressing a microRNA specific for WWOX. Inhibition of WWOX expression significantly increased the expression and activity of tyrosinase in human melanocytes. Taken together, our results suggest that genetic variants in the intronic region of WWOX could be determinants in the UV-induced tanning ability of Korean females. WWOX represents a new candidate gene to evaluate the molecular basis of the UV-induced tanning ability in individuals. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  7. Ferret: a user-friendly Java tool to extract data from the 1000 Genomes Project.

    PubMed

    Limou, Sophie; Taverner, Andrew M; Winkler, Cheryl A

    2016-07-15

    The 1000 Genomes (1KG) Project provides a near-comprehensive resource on human genetic variation in worldwide reference populations. 1KG variants can be accessed through a browser and through the raw and annotated data that are regularly released on an ftp server. We developed Ferret, a user-friendly Java tool, to easily extract genetic variation information from these large and complex data files. From a locus, gene(s) or SNP(s) of interest, Ferret retrieves genotype data for 1KG SNPs and indels, and computes allelic frequencies for 1KG populations and optionally, for the Exome Sequencing Project populations. By converting the 1KG data into files that can be imported into popular pre-existing tools (e.g. PLINK and HaploView), Ferret offers a straightforward way, even for non-bioinformatics specialists, to manipulate, explore and merge 1KG data with the user's dataset, as well as visualize linkage disequilibrium pattern, infer haplotypes and design tagSNPs. Ferret tool and source code are publicly available at http://limousophie35.github.io/Ferret/ ferret@nih.gov Supplementary data are available at Bioinformatics online. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.

  8. Polymorphisms of the bovine MC3R gene and their associations with body measurement traits and meat quality traits in Qinchuan cattle.

    PubMed

    Yang, W-C; Wang, Y-N; Cui, A; Zan, L-S

    2015-10-05

    The melanocortin 3 receptor (MC3R) gene, which belongs to the rhodopsin-like family A of the G protein-coupled receptor family, plays a crucial role in feed efficiency and energy homeostasis. The aim of this study was to examine associations between bovine MC3R gene polymorphisms and body measurement traits (BMTs) and meat quality traits (MQTs). We identified three synonymous mutations (T429C, T537C, and T663C) in exon 1 of the MC3R gene in Chinese Qinchuan beef cattle (N = 271) by sequencing. D' and r(2) values revealed that these three SNPs were in strong linkage disequilibrium (LD) (r(2) > 0.33); the T429C and T537C SNPs were in complete LD (D' = 1 and r(2) = 1). Association analyses revealed that the SNPs were significantly associated with BMTs and MQTs in Qinchuan cattle. Individuals with the wild homozygotic genotypes g.TTTT and g.TT had significantly higher values of chest depth, heart girth, back fat thickness, intramuscular fat content, and loin muscle area than the mutant heterozygotic genotypes g.TCTC and g.TC. These results suggest that the MC3R gene affects MQTs in Qinchuan cattle, and that it may be a good candidate gene for marker-assisted selection.

  9. Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.

    PubMed

    Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y

    2013-02-27

    We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending on U (51.9%) or A (22.7%) than on C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.

  10. CIDR

    Science.gov Websites

    this process to the requirements of their study. Data from both reviewed and non-reviewed SNPs will be released. A technical filter will be applied to the non-reviewed SNPs to eliminate total assay failures

  11. Correlations of nucleotide substitution rates and base composition of mammalian coding sequences with protein structure.

    PubMed

    Chiusano, M L; D'Onofrio, G; Alvarez-Valin, F; Jabbari, K; Colonna, G; Bernardi, G

    1999-09-30

    We investigated the relationships between the nucleotide substitution rates and the predicted secondary structures in the three states representation (alpha-helix, beta-sheet, and coil). The analysis was carried out on 34 alignments, each of which comprised sequences belonging to at least four different mammalian orders. The rates of synonymous substitution were found to be significantly different in regions predicted to be alpha-helix, beta-sheet, or coil. Likewise, the nonsynonymous rates also differ, although expectedly at a lower extent, in the three types of secondary structure, suggesting that different selective constraints associated with the different structures are affecting in a similar way the synonymous and nonsynonymous rates. Moreover, the base composition of the third codon positions is different in coding sequence regions corresponding to different secondary structures of proteins.

  12. Association of genetic and non-genetic risk factors with the development of prostate cancer in Malaysian men.

    PubMed

    Munretnam, Khamsigan; Alex, Livy; Ramzi, Nurul Hanis; Chahil, Jagdish Kaur; Kavitha, I S; Hashim, Nikman Adli Nor; Lye, Say Hean; Velapasamy, Sharmila; Ler, Lian Wee

    2014-01-01

    There is growing global interest to stratify men into different levels of risk to developing prostate cancer, thus it is important to identify common genetic variants that confer the risk. Although many studies have identified more than a dozen common genetic variants which are highly associated with prostate cancer, none have been done in Malaysian population. To determine the association of such variants in Malaysian men with prostate cancer, we evaluated a panel of 768 SNPs found previously associated with various cancers which also included the prostate specific SNPs in a population based case control study (51 case subjects with prostate cancer and 51 control subjects) in Malaysian men of Malay, Chinese and Indian ethnicity. We identified 21 SNPs significantly associated with prostate cancer. Among these, 12 SNPs were strongly associated with increased risk of prostate cancer while remaining nine SNPs were associated with reduced risk. However, data analysis based on ethnic stratification led to only five SNPs in Malays and 3 SNPs in Chinese which remained significant. This could be due to small sample size in each ethnic group. Significant non-genetic risk factors were also identified for their association with prostate cancer. Our study is the first to investigate the involvement of multiple variants towards susceptibility for PC in Malaysian men using genotyping approach. Identified SNPs and non-genetic risk factors have a significant association with prostate cancer.

  13. Inherited variants in the inner centromere protein (INCENP) gene of the chromosomal passenger complex contribute to the susceptibility of ER-negative breast cancer

    PubMed Central

    Kabisch, Maria; Lorenzo Bermejo, Justo; Dünnebier, Thomas; Ying, Shibo; Michailidou, Kyriaki; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Shah, Mitul; Perkins, Barbara J.; Czene, Kamila; Darabi, Hatef; Eriksson, Mikael; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; Lambrechts, Diether; Neven, Patrick; Peeters, Stephanie; Weltens, Caroline; Couch, Fergus J.; Olson, Janet E.; Wang, Xianshu; Purrington, Kristen; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Peto, Julian; dos-Santos-Silva, Isabel; Johnson, Nichola; Fletcher, Olivia; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Schmidt, Marjanka K.; Broeks, Annegien; Cornelissen, Sten; Hogervorst, Frans B.L.; Li, Jingmei; Brand, Judith S.; Humphreys, Keith; Guénel, Pascal; Truong, Thérèse; Menegaux, Florence; Sanchez, Marie; Burwinkel, Barbara; Marmé, Frederik; Yang, Rongxi; Bugert, Peter; González-Neira, Anna; Benitez, Javier; Pilar Zamora, M.; Arias Perez, Jose I.; Cox, Angela; Cross, Simon S.; Reed, Malcolm W.R.; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Tchatchou, Sandrine; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Haiman, Christopher A.; Schumacher, Fredrick; Henderson, Brian E.; Le Marchand, Loic; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J.; Hollestelle, Antoinette; Kriege, Mieke; Koppert, Linetta B.; Hopper, John L.; Southey, Melissa C.; Tsimiklis, Helen; Apicella, Carmel; Slettedahl, Seth; Toland, Amanda E.; Vachon, Celine; Yannoukakos, Drakoulis; Giles, Graham G.; Milne, Roger L.; McLean, Catriona; Fasching, Peter A.; Ruebner, Matthias; Ekici, Arif B.; Beckmann, Matthias W.; Brenner, Hermann; Dieffenbach, Aida K.; Arndt, Volker; Stegmaier, Christa; Ashworth, Alan; Orr, Nicholas; Schoemaker, Minouk J.; Swerdlow, Anthony; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J.; Lissowska, Jolanta; Goldberg, Mark S.; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Radice, Paolo; Peterlongo, Paolo; Scuvera, Giulietta; Fortuzzi, Stefano; Bogdanova, Natalia; Dörk, Thilo; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline; Van Asperen, Christi J.; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Zheng, Wei; Shrubsole, Martha J.; Cai, Qiuyin; Torres, Diana; Anton-Culver, Hoda; Kristensen, Vessela; Bacot, François; Tessier, Daniel C.; Vincent, Daniel; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Maranian, Mel; Simard, Jacques; Chenevix-Trench, Georgia; Hall, Per; Pharoah, Paul D.P.; Dunning, Alison M.; Easton, Douglas F.; Hamann, Ute

    2015-01-01

    The chromosomal passenger complex (CPC) plays a pivotal role in the regulation of cell division. Therefore, inherited CPC variability could influence tumor development. The present candidate gene approach investigates the relationship between single nucleotide polymorphisms (SNPs) in genes encoding key CPC components and breast cancer risk. Fifteen SNPs in four CPC genes (INCENP, AURKB, BIRC5 and CDCA8) were genotyped in 88 911 European women from 39 case-control studies of the Breast Cancer Association Consortium. Possible associations were investigated in fixed-effects meta-analyses. The synonymous SNP rs1675126 in exon 7 of INCENP was associated with overall breast cancer risk [per A allele odds ratio (OR) 0.95, 95% confidence interval (CI) 0.92–0.98, P = 0.007] and particularly with estrogen receptor (ER)-negative breast tumors (per A allele OR 0.89, 95% CI 0.83–0.95, P = 0.0005). SNPs not directly genotyped were imputed based on 1000 Genomes. The SNPs rs1047739 in the 3ʹ untranslated region and rs144045115 downstream of INCENP showed the strongest association signals for overall (per T allele OR 1.03, 95% CI 1.00–1.06, P = 0.0009) and ER-negative breast cancer risk (per A allele OR 1.06, 95% CI 1.02–1.10, P = 0.0002). Two genotyped SNPs in BIRC5 were associated with familial breast cancer risk (top SNP rs2071214: per G allele OR 1.12, 95% CI 1.04–1.21, P = 0.002). The data suggest that INCENP in the CPC pathway contributes to ER-negative breast cancer susceptibility in the European population. In spite of a modest contribution of CPC-inherited variants to the total burden of sporadic and familial breast cancer, their potential as novel targets for breast cancer treatment should be further investigated. PMID:25586992

  14. Inherited variants in the inner centromere protein (INCENP) gene of the chromosomal passenger complex contribute to the susceptibility of ER-negative breast cancer.

    PubMed

    Kabisch, Maria; Lorenzo Bermejo, Justo; Dünnebier, Thomas; Ying, Shibo; Michailidou, Kyriaki; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Shah, Mitul; Perkins, Barbara J; Czene, Kamila; Darabi, Hatef; Eriksson, Mikael; Bojesen, Stig E; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Lambrechts, Diether; Neven, Patrick; Peeters, Stephanie; Weltens, Caroline; Couch, Fergus J; Olson, Janet E; Wang, Xianshu; Purrington, Kristen; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Peto, Julian; dos-Santos-Silva, Isabel; Johnson, Nichola; Fletcher, Olivia; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Schmidt, Marjanka K; Broeks, Annegien; Cornelissen, Sten; Hogervorst, Frans B L; Li, Jingmei; Brand, Judith S; Humphreys, Keith; Guénel, Pascal; Truong, Thérèse; Menegaux, Florence; Sanchez, Marie; Burwinkel, Barbara; Marmé, Frederik; Yang, Rongxi; Bugert, Peter; González-Neira, Anna; Benitez, Javier; Pilar Zamora, M; Arias Perez, Jose I; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Haiman, Christopher A; Schumacher, Fredrick; Henderson, Brian E; Le Marchand, Loic; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J; Hollestelle, Antoinette; Kriege, Mieke; Koppert, Linetta B; Hopper, John L; Southey, Melissa C; Tsimiklis, Helen; Apicella, Carmel; Slettedahl, Seth; Toland, Amanda E; Vachon, Celine; Yannoukakos, Drakoulis; Giles, Graham G; Milne, Roger L; McLean, Catriona; Fasching, Peter A; Ruebner, Matthias; Ekici, Arif B; Beckmann, Matthias W; Brenner, Hermann; Dieffenbach, Aida K; Arndt, Volker; Stegmaier, Christa; Ashworth, Alan; Orr, Nicholas; Schoemaker, Minouk J; Swerdlow, Anthony; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J; Lissowska, Jolanta; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Radice, Paolo; Peterlongo, Paolo; Scuvera, Giulietta; Fortuzzi, Stefano; Bogdanova, Natalia; Dörk, Thilo; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Van Asperen, Christi J; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Zheng, Wei; Shrubsole, Martha J; Cai, Qiuyin; Torres, Diana; Anton-Culver, Hoda; Kristensen, Vessela; Bacot, François; Tessier, Daniel C; Vincent, Daniel; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Maranian, Mel; Simard, Jacques; Chenevix-Trench, Georgia; Hall, Per; Pharoah, Paul D P; Dunning, Alison M; Easton, Douglas F; Hamann, Ute

    2015-02-01

    The chromosomal passenger complex (CPC) plays a pivotal role in the regulation of cell division. Therefore, inherited CPC variability could influence tumor development. The present candidate gene approach investigates the relationship between single nucleotide polymorphisms (SNPs) in genes encoding key CPC components and breast cancer risk. Fifteen SNPs in four CPC genes (INCENP, AURKB, BIRC5 and CDCA8) were genotyped in 88 911 European women from 39 case-control studies of the Breast Cancer Association Consortium. Possible associations were investigated in fixed-effects meta-analyses. The synonymous SNP rs1675126 in exon 7 of INCENP was associated with overall breast cancer risk [per A allele odds ratio (OR) 0.95, 95% confidence interval (CI) 0.92-0.98, P = 0.007] and particularly with estrogen receptor (ER)-negative breast tumors (per A allele OR 0.89, 95% CI 0.83-0.95, P = 0.0005). SNPs not directly genotyped were imputed based on 1000 Genomes. The SNPs rs1047739 in the 3' untranslated region and rs144045115 downstream of INCENP showed the strongest association signals for overall (per T allele OR 1.03, 95% CI 1.00-1.06, P = 0.0009) and ER-negative breast cancer risk (per A allele OR 1.06, 95% CI 1.02-1.10, P = 0.0002). Two genotyped SNPs in BIRC5 were associated with familial breast cancer risk (top SNP rs2071214: per G allele OR 1.12, 95% CI 1.04-1.21, P = 0.002). The data suggest that INCENP in the CPC pathway contributes to ER-negative breast cancer susceptibility in the European population. In spite of a modest contribution of CPC-inherited variants to the total burden of sporadic and familial breast cancer, their potential as novel targets for breast cancer treatment should be further investigated. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  15. A genome-wide signature of positive selection in ancient and recent invasive expansions of the honey bee Apis mellifera

    PubMed Central

    Zayed, Amro; Whitfield, Charles W.

    2008-01-01

    Apis mellifera originated in Africa and extended its range into Eurasia in two or more ancient expansions. In 1956, honey bees of African origin were introduced into South America, their descendents admixing with previously introduced European bees, giving rise to the highly invasive and economically devastating “Africanized” honey bee. Here we ask whether the honey bee's out-of-Africa expansions, both ancient and recent (invasive), were associated with a genome-wide signature of positive selection, detected by contrasting genetic differentiation estimates (FST) between coding and noncoding SNPs. In native populations, SNPs in protein-coding regions had significantly higher FST estimates than those in noncoding regions, indicating adaptive evolution in the genome driven by positive selection. This signal of selection was associated with the expansion of honey bees from Africa into Western and Northern Europe, perhaps reflecting adaptation to temperate environments. We estimate that positive selection acted on a minimum of 852–1,371 genes or ≈10% of the bee's coding genome. We also detected positive selection associated with the invasion of African-derived honey bees in the New World. We found that introgression of European-derived alleles into Africanized bees was significantly greater for coding than noncoding regions. Our findings demonstrate that Africanized bees exploited the genetic diversity present from preexisting introductions in an adaptive way. Finally, we found a significant negative correlation between FST estimates and the local GC content surrounding coding SNPs, suggesting that AT-rich genes play an important role in adaptive evolution in the honey bee. PMID:18299560

  16. A genome-wide signature of positive selection in ancient and recent invasive expansions of the honey bee Apis mellifera.

    PubMed

    Zayed, Amro; Whitfield, Charles W

    2008-03-04

    Apis mellifera originated in Africa and extended its range into Eurasia in two or more ancient expansions. In 1956, honey bees of African origin were introduced into South America, their descendents admixing with previously introduced European bees, giving rise to the highly invasive and economically devastating "Africanized" honey bee. Here we ask whether the honey bee's out-of-Africa expansions, both ancient and recent (invasive), were associated with a genome-wide signature of positive selection, detected by contrasting genetic differentiation estimates (F(ST)) between coding and noncoding SNPs. In native populations, SNPs in protein-coding regions had significantly higher F(ST) estimates than those in noncoding regions, indicating adaptive evolution in the genome driven by positive selection. This signal of selection was associated with the expansion of honey bees from Africa into Western and Northern Europe, perhaps reflecting adaptation to temperate environments. We estimate that positive selection acted on a minimum of 852-1,371 genes or approximately 10% of the bee's coding genome. We also detected positive selection associated with the invasion of African-derived honey bees in the New World. We found that introgression of European-derived alleles into Africanized bees was significantly greater for coding than noncoding regions. Our findings demonstrate that Africanized bees exploited the genetic diversity present from preexisting introductions in an adaptive way. Finally, we found a significant negative correlation between F(ST) estimates and the local GC content surrounding coding SNPs, suggesting that AT-rich genes play an important role in adaptive evolution in the honey bee.

  17. Whole-Gene Positive Selection, Elevated Synonymous Substitution Rates, Duplication, and Indel Evolution of the Chloroplast clpP1 Gene

    PubMed Central

    Erixon, Per; Oxelman, Bengt

    2008-01-01

    Background Synonymous DNA substitution rates in the plant chloroplast genome are generally relatively slow and lineage dependent. Non-synonymous rates are usually even slower due to purifying selection acting on the genes. Positive selection is expected to speed up non-synonymous substitution rates, whereas synonymous rates are expected to be unaffected. Until recently, positive selection has seldom been observed in chloroplast genes, and large-scale structural rearrangements leading to gene duplications are hitherto supposed to be rare. Methodology/Principle Findings We found high substitution rates in the exons of the plastid clpP1 gene in Oenothera (the Evening Primrose family) and three separate lineages in the tribe Sileneae (Caryophyllaceae, the Carnation family). Introns have been lost in some of the lineages, but where present, the intron sequences have substitution rates similar to those found in other introns of their genomes. The elevated substitution rates of clpP1 are associated with statistically significant whole-gene positive selection in three branches of the phylogeny. In two of the lineages we found multiple copies of the gene. Neighboring genes present in the duplicated fragments do not show signs of elevated substitution rates or positive selection. Although non-synonymous substitutions account for most of the increase in substitution rates, synonymous rates are also markedly elevated in some lineages. Whereas plant clpP1 genes experiencing negative (purifying) selection are characterized by having very conserved lengths, genes under positive selection often have large insertions of more or less repetitive amino acid sequence motifs. Conclusions/Significance We found positive selection of the clpP1 gene in various plant lineages to correlated with repeated duplication of the clpP1 gene and surrounding regions, repetitive amino acid sequences, and increase in synonymous substitution rates. The present study sheds light on the controversial issue of whether negative or positive selection is to be expected after gene duplications by providing evidence for the latter alternative. The observed increase in synonymous substitution rates in some of the lineages indicates that the detection of positive selection may be obscured under such circumstances. Future studies are required to explore the functional significance of the large inserted repeated amino acid motifs, as well as the possibility that synonymous substitution rates may be affected by positive selection. PMID:18167545

  18. Synonymous deoptimization of the foot-and-mouth disease virus P1 coding region causes attenuation in vivo while inducing a strong neutralizing antibody response

    USDA-ARS?s Scientific Manuscript database

    Codon bias deoptimization has been previously used to successfully attenuate human pathogens including polio, respiratory syncytial and influenza viruses. We have applied a similar technology to deoptimize the capsid coding region (P1 region) of the cDNA infectious clone of foot-and-mouth disease vi...

  19. Haplotype combination of the bovine CFL2 gene sequence variants and association with growth traits in Qinchuan cattle.

    PubMed

    Sun, Yujia; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Chen, Hong

    2015-06-01

    The aim of this study was to examine the association of cofilin2 (CFL2) gene polymorphisms with growth traits in Chinese Qinchuan cattle. Three single nucleotide polymorphisms (SNPs) were identified in the bovine CFL2 gene using DNA sequencing and (forced) PCR-RFLP methods. These polymorphisms included a missense mutation (NC_007319.5: g. C 2213 G) in exon 4, one synonymous mutation (NC_007319.5: g. T 1694 A) in exon 4, and a mutation (NC_007319.5: g. G 1500 A) in intron 2, respectively. In addition, we evaluated the haplotype frequency and linkage disequilibrium coefficient of three sequence variants in 488 individuals in QC cattle. All the three SNPs in QC cattle belonged to an intermediate level of genetic diversity (0.250.33). Association analysis indicated that SNP G 1500 A, T 1694 A and C 2213 G were significantly associated with growth traits in the QC population. The results of our study suggest that the CFL2 gene may be a strong candidate gene that affects growth traits in the QC cattle breeding program. Copyright © 2015 Elsevier B.V. All rights reserved.

  20. A novel method for in silico identification of regulatory SNPs in human genome.

    PubMed

    Li, Rong; Zhong, Dexing; Liu, Ruiling; Lv, Hongqiang; Zhang, Xinman; Liu, Jun; Han, Jiuqiang

    2017-02-21

    Regulatory single nucleotide polymorphisms (rSNPs), kind of functional noncoding genetic variants, can affect gene expression in a regulatory way, and they are thought to be associated with increased susceptibilities to complex diseases. Here a novel computational approach to identify potential rSNPs is presented. Different from most other rSNPs finding methods which based on hypothesis that SNPs causing large allele-specific changes in transcription factor binding affinities are more likely to play regulatory functions, we use a set of documented experimentally verified rSNPs and nonfunctional background SNPs to train classifiers, so the discriminating features are found. To characterize variants, an extensive range of characteristics, such as sequence context, DNA structure and evolutionary conservation etc. are analyzed. Support vector machine is adopted to build the classifier model together with an ensemble method to deal with unbalanced data. 10-fold cross-validation result shows that our method can achieve accuracy with sensitivity of ~78% and specificity of ~82%. Furthermore, our method performances better than some other algorithms based on aforementioned hypothesis in handling false positives. The original data and the source matlab codes involved are available at https://sourceforge.net/projects/rsnppredict/. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Genetic variations of VDR/NR1I1 encoding vitamin D receptor in a Japanese population.

    PubMed

    Ukaji, Maho; Saito, Yoshiro; Fukushima-Uesaka, Hiromi; Maekawa, Keiko; Katori, Noriko; Kaniwa, Nahoko; Yoshida, Teruhiko; Nokihara, Hiroshi; Sekine, Ikuo; Kunitoh, Hideo; Ohe, Yuichiro; Yamamoto, Noboru; Tamura, Tomohide; Saijo, Nagahiro; Sawada, Jun-ichi

    2007-12-01

    The vitamin D receptor (VDR) is a transcriptional factor responsive to 1alpha,25-dihydroxyvitamin D(3) and lithocholic acid, and induces expression of drug metabolizing enzymes CYP3A4, CYP2B6 and CYP2C9. In this study, the promoter regions, 14 exons (including 6 exon 1's) and their flanking introns of VDR were comprehensively screened for genetic variations in 107 Japanese subjects. Sixty-one genetic variations including 25 novel ones were found: 9 in the 5'-flanking region, 2 in the 5'-untranslated region (UTR), 7 in the coding exons (5 synonymous and 2 nonsynonymous variations), 12 in the 3'-UTR, 19 in the introns between the exon 1's, and 12 in introns 2 to 8. Of these, one novel nonsynonymous variation, 154A>G (Met52Val), was detected with an allele frequency of 0.005. The single nucleotide polymorphisms (SNPs) that increase VDR expression or activity, -29649G>A, 2T>C and 1592((*)308)C>A tagging linked variations in the 3'-UTR, were detected at 0.430, 0.636, and 0.318 allele frequencies, respectively. Another SNP, -26930A>G, with reduced VDR transcription was found at a 0.028 frequency. These findings would be useful for association studies on VDR variations in Japanese.

  2. GENIE: a software package for gene-gene interaction analysis in genetic association studies using multiple GPU or CPU cores.

    PubMed

    Chikkagoudar, Satish; Wang, Kai; Li, Mingyao

    2011-05-26

    Gene-gene interaction in genetic association studies is computationally intensive when a large number of SNPs are involved. Most of the latest Central Processing Units (CPUs) have multiple cores, whereas Graphics Processing Units (GPUs) also have hundreds of cores and have been recently used to implement faster scientific software. However, currently there are no genetic analysis software packages that allow users to fully utilize the computing power of these multi-core devices for genetic interaction analysis for binary traits. Here we present a novel software package GENIE, which utilizes the power of multiple GPU or CPU processor cores to parallelize the interaction analysis. GENIE reads an entire genetic association study dataset into memory and partitions the dataset into fragments with non-overlapping sets of SNPs. For each fragment, GENIE analyzes: 1) the interaction of SNPs within it in parallel, and 2) the interaction between the SNPs of the current fragment and other fragments in parallel. We tested GENIE on a large-scale candidate gene study on high-density lipoprotein cholesterol. Using an NVIDIA Tesla C1060 graphics card, the GPU mode of GENIE achieves a speedup of 27 times over its single-core CPU mode run. GENIE is open-source, economical, user-friendly, and scalable. Since the computing power and memory capacity of graphics cards are increasing rapidly while their cost is going down, we anticipate that GENIE will achieve greater speedups with faster GPU cards. Documentation, source code, and precompiled binaries can be downloaded from http://www.cceb.upenn.edu/~mli/software/GENIE/.

  3. GENIE: a software package for gene-gene interaction analysis in genetic association studies using multiple GPU or CPU cores

    PubMed Central

    2011-01-01

    Background Gene-gene interaction in genetic association studies is computationally intensive when a large number of SNPs are involved. Most of the latest Central Processing Units (CPUs) have multiple cores, whereas Graphics Processing Units (GPUs) also have hundreds of cores and have been recently used to implement faster scientific software. However, currently there are no genetic analysis software packages that allow users to fully utilize the computing power of these multi-core devices for genetic interaction analysis for binary traits. Findings Here we present a novel software package GENIE, which utilizes the power of multiple GPU or CPU processor cores to parallelize the interaction analysis. GENIE reads an entire genetic association study dataset into memory and partitions the dataset into fragments with non-overlapping sets of SNPs. For each fragment, GENIE analyzes: 1) the interaction of SNPs within it in parallel, and 2) the interaction between the SNPs of the current fragment and other fragments in parallel. We tested GENIE on a large-scale candidate gene study on high-density lipoprotein cholesterol. Using an NVIDIA Tesla C1060 graphics card, the GPU mode of GENIE achieves a speedup of 27 times over its single-core CPU mode run. Conclusions GENIE is open-source, economical, user-friendly, and scalable. Since the computing power and memory capacity of graphics cards are increasing rapidly while their cost is going down, we anticipate that GENIE will achieve greater speedups with faster GPU cards. Documentation, source code, and precompiled binaries can be downloaded from http://www.cceb.upenn.edu/~mli/software/GENIE/. PMID:21615923

  4. [Analysis of mitochondrial SNPs in addition to conventional STR-typing in a case of aggravated theft].

    PubMed

    Röper, Andrea; Reichert, Walter; Mattern, Rainer

    2007-01-01

    In the field of forensic DNA typing, the analysis of Short Tandem Repeats (STRs) can fail in cases of degraded DNA. The typing of coding region Single Nucleotide Polymorphisms (SNPs) of the mitochondrial genome provides an approach to acquire additional information. In the examined case of aggravated theft, both suspects could be excluded of having left the analyzed hair on the crime scene by SNP typing. This conclusion was not possible subsequent to STR typing. SNP typing of the trace on the torch light left on the crime scene increased the likelihood for suspect no. 2 to be the origin of this trace. This finding was already indicated by STR analysis. Suspect no. 1 was excluded for being the origin of this trace by SNP typing which was also indicated by STR analysis. A limiting factor for the analysis of SNPs is the maternal inheritance of mitochondrial DNA. Individualisation is not possible. In conclusion, it can be said that in the case of traces which cause problems with conventional STR typing the supplementary analysis of coding region SNPs from the mitochondrial genome is very reasonable and greatly contributes to the refinement of analysis methods in the field of forensic genetics.

  5. Association of Amine-Receptor DNA Sequence Variants with Associative Learning in the Honeybee.

    PubMed

    Lagisz, Malgorzata; Mercer, Alison R; de Mouzon, Charlotte; Santos, Luana L S; Nakagawa, Shinichi

    2016-03-01

    Octopamine- and dopamine-based neuromodulatory systems play a critical role in learning and learning-related behaviour in insects. To further our understanding of these systems and resulting phenotypes, we quantified DNA sequence variations at six loci coding octopamine-and dopamine-receptors and their association with aversive and appetitive learning traits in a population of honeybees. We identified 79 polymorphic sequence markers (mostly SNPs and a few insertions/deletions) located within or close to six candidate genes. Intriguingly, we found that levels of sequence variation in the protein-coding regions studied were low, indicating that sequence variation in the coding regions of receptor genes critical to learning and memory is strongly selected against. Non-coding and upstream regions of the same genes, however, were less conserved and sequence variations in these regions were weakly associated with between-individual differences in learning-related traits. While these associations do not directly imply a specific molecular mechanism, they suggest that the cross-talk between dopamine and octopamine signalling pathways may influence olfactory learning and memory in the honeybee.

  6. High throughput SNP discovery and genotyping in grapevine (Vitis vinifera L.) by combining a re-sequencing approach and SNPlex technology

    PubMed Central

    Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M

    2007-01-01

    Background Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, what represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provide a powerful tool for grapevine genetic analysis. PMID:18021442

  7. Racial/ethnic variation in the association of lipid-related genetic variants with blood lipids in the US adult population.

    PubMed

    Chang, Man-huei; Ned, Renée M; Hong, Yuling; Yesupriya, Ajay; Yang, Quanhe; Liu, Tiebin; Janssens, A Cecile J W; Dowling, Nicole F

    2011-10-01

    Genome-wide association studies (GWAS) have identified a number of single-nucleotide polymorphisms (SNPs) associated with serum lipid level in populations of European descent. The individual and the cumulative effect of these SNPs on blood lipids are largely unclear for the US population. Using data from the second phase (1991-1994) of the Third National Health and Nutrition Examination Survey (NHANES III), a nationally representative survey of the US population, we examined associations of 57 GWAS-identified or well-established lipid-related genetic loci with plasma concentrations of high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol, total cholesterol, triglycerides, total cholesterol/HDL-C ratio, and non-HDL-C. We used multivariable linear regression to examine single SNP associations and the cumulative effect of multiple SNPs (using a genetic risk score [GRS]) on blood lipid levels. Analyses were conducted in adults from each of the 3 major racial/ethnic groups in the United States: non-Hispanic whites (n=2296), non-Hispanic blacks (n=1699), and Mexican Americans (n=1713). Allele frequencies for all SNPs varied significantly by race/ethnicity, except rs3764261 in CETP. Individual SNPs had very small effects on lipid levels, effects that were generally consistent in direction across racial/ethnic groups. More GWAS-validated SNPs were replicated in non-Hispanic whites (<67%) than in non-Hispanic blacks (<44%) or Mexican Americans (<44%). GRSs were strongly associated with increased lipid levels in each racial/ethnic group. The combination of all SNPs into a weighted GRS explained no more than 11% of the total variance in blood lipid levels. Our findings show that the combined association of SNPs, based on a GRS, was strongly associated with increased blood lipid measures in all major race/ethnic groups in the United States, which may help in identifying subgroups with a high risk for an unfavorable lipid profile.

  8. Variation in genes encoding the neuroactive steroid synthetic enzymes 5α-reductase type 1 and 3α-reductase type 2 is associated with alcohol dependence.

    PubMed

    Milivojevic, Verica; Kranzler, Henry R; Gelernter, Joel; Burian, Linda; Covault, Jonathan

    2011-05-01

    Studies of alcohol effects in rodents and in vitro implicate endogenous neuroactive steroids as key mediators of alcohol effects at GABA(A) receptors. We used a case-control sample to test the association with alcohol dependence (AD) of single nucleotide polymorphisms in the genes encoding two key enzymes required for the generation of endogenous neuroactive steroids: 5α-reductase, type I (5α-R), and 3α-hydroxysteroid dehydrogenase, type 2 (3α-HSD), both of which are expressed in human brain. We focused on markers previously associated with a biological phenotype. For 5α-R, we examined the synonymous SRD5A1 exon 1 SNP rs248793, which has been associated with the ratio of dihydrotestosterone to testosterone. For 3α-HSD, we examined the nonsynonymous AKR1C3 SNP rs12529 (H5Q), which has been associated with bladder cancer. The SNPs were genotyped in a sample of 1,083 non-Hispanic Caucasians including 552 controls and 531 subjects with AD. The minor allele for both SNPs was more common among controls than subjects with AD: SRD5A1 rs248793 C-allele (χ(2)(1) = 7.6, p = 0.006) and AKR1C3 rs12529 G-allele (χ(2)(1) = 14.6, p = 0.0001). There was also an interaction of these alleles such that the "protective" effect of the minor allele at each marker for AD was conditional on the genotype of the second marker. We found evidence of an association with AD of polymorphisms in two genes encoding neuroactive steroid biosynthetic enzymes, providing indirect evidence that neuroactive steroids are important mediators of alcohol effects in humans. Copyright © 2011 by the Research Society on Alcoholism.

  9. Whole exome sequencing is an efficient, sensitive and specific method of mutation detection in osteogenesis imperfecta and Marfan syndrome

    PubMed Central

    McInerney-Leo, Aideen M; Marshall, Mhairi S; Gardiner, Brooke; Coucke, Paul J; Van Laer, Lut; Loeys, Bart L; Summers, Kim M; Symoens, Sofie; West, Jennifer A; West, Malcolm J; Paul Wordsworth, B; Zankl, Andreas; Leo, Paul J; Brown, Matthew A; Duncan, Emma L

    2013-01-01

    Osteogenesis imperfecta (OI) and Marfan syndrome (MFS) are common Mendelian disorders. Both conditions are usually diagnosed clinically, as genetic testing is expensive due to the size and number of potentially causative genes and mutations. However, genetic testing may benefit patients, at-risk family members and individuals with borderline phenotypes, as well as improving genetic counseling and allowing critical differential diagnoses. We assessed whether whole exome sequencing (WES) is a sensitive method for mutation detection in OI and MFS. WES was performed on genomic DNA from 13 participants with OI and 10 participants with MFS who had known mutations, with exome capture followed by massive parallel sequencing of multiplexed samples. Single nucleotide polymorphisms (SNPs) and small indels were called using Genome Analysis Toolkit (GATK) and annotated with ANNOVAR. CREST, exomeCopy and exomeDepth were used for large deletion detection. Results were compared with the previous data. Specificity was calculated by screening WES data from a control population of 487 individuals for mutations in COL1A1, COL1A2 and FBN1. The target capture of five exome capture platforms was compared. All 13 mutations in the OI cohort and 9/10 in the MFS cohort were detected (sensitivity=95.6%) including non-synonymous SNPs, small indels (<10 bp), and a large UTR5/exon 1 deletion. One mutation was not detected by GATK due to strand bias. Specificity was 99.5%. Capture platforms and analysis programs differed considerably in their ability to detect mutations. Consumable costs for WES were low. WES is an efficient, sensitive, specific and cost-effective method for mutation detection in patients with OI and MFS. Careful selection of platform and analysis programs is necessary to maximize success. PMID:24501682

  10. Comparative Genomic Analysis of Escherichia coli O157:H7 Isolated from Super-Shedder and Low-Shedder Cattle

    PubMed Central

    Munns, Krysty D.; Zaheer, Rahat; Xu, Yong; Stanford, Kim; Laing, Chad R.; Gannon, Victor P. J.; Selinger, L. Brent; McAllister, Tim A.

    2016-01-01

    Cattle are the primary reservoir of the foodborne pathogen Escherichia coli O157:H7, with the concentration and frequency of E. coli O157:H7 shedding varying substantially among individual hosts. The term ‘‘super-shedder” has been applied to cattle that shed ≥104 cfu E. coli O157:H7/g of feces. Super-shedders have been reported to be responsible for the majority of E. coli O157:H7 shed into the environment. The objective of this study was to determine if there are phenotypic and/or genotypic differences between E. coli O157:H7 isolates obtained from super-shedder compared to low-shedder cattle. From a total of 784 isolates, four were selected from low-shedder steers and six isolates from super-shedder steers (4.01–8.45 log cfu/g feces) for whole genome sequencing. Isolates were phage and clade typed, screened for substrate utilization, pH sensitivity, virulence gene profiles and Stx bacteriophage insertion (SBI) sites. A range of 89–2473 total single nucleotide polymorphisms (SNPs) were identified when sequenced strains were compared to E. coli O157:H7 strain Sakai. More non-synonymous SNP mutations were observed in low-shedder isolates. Pan-genomic and SNPs comparisons did not identify genetic segregation between super-shedder or low-shedder isolates. All super-shedder isolates and 3 of 4 of low-shedder isolates were typed as phage type 14a, SBI cluster 3 and SNP clade 2. Super-shedder isolates displayed increased utilization of galactitol, thymidine and 3-O-β-D-galactopyranosyl-D-arabinose when compared to low-shedder isolates, but no differences in SNPs were observed in genes encoding for proteins involved in the metabolism of these substrates. While genetic traits specific to super-shedder isolates were not identified in this study, differences in the level of gene expression or genes of unknown function may still contribute to some strains of E. coli O157:H7 reaching high densities within bovine feces. PMID:27018858

  11. Comparative Genomic Analysis of Escherichia coli O157:H7 Isolated from Super-Shedder and Low-Shedder Cattle.

    PubMed

    Munns, Krysty D; Zaheer, Rahat; Xu, Yong; Stanford, Kim; Laing, Chad R; Gannon, Victor P J; Selinger, L Brent; McAllister, Tim A

    2016-01-01

    Cattle are the primary reservoir of the foodborne pathogen Escherichia coli O157:H7, with the concentration and frequency of E. coli O157:H7 shedding varying substantially among individual hosts. The term ''super-shedder" has been applied to cattle that shed ≥10(4) cfu E. coli O157:H7/g of feces. Super-shedders have been reported to be responsible for the majority of E. coli O157:H7 shed into the environment. The objective of this study was to determine if there are phenotypic and/or genotypic differences between E. coli O157:H7 isolates obtained from super-shedder compared to low-shedder cattle. From a total of 784 isolates, four were selected from low-shedder steers and six isolates from super-shedder steers (4.01-8.45 log cfu/g feces) for whole genome sequencing. Isolates were phage and clade typed, screened for substrate utilization, pH sensitivity, virulence gene profiles and Stx bacteriophage insertion (SBI) sites. A range of 89-2473 total single nucleotide polymorphisms (SNPs) were identified when sequenced strains were compared to E. coli O157:H7 strain Sakai. More non-synonymous SNP mutations were observed in low-shedder isolates. Pan-genomic and SNPs comparisons did not identify genetic segregation between super-shedder or low-shedder isolates. All super-shedder isolates and 3 of 4 of low-shedder isolates were typed as phage type 14a, SBI cluster 3 and SNP clade 2. Super-shedder isolates displayed increased utilization of galactitol, thymidine and 3-O-β-D-galactopyranosyl-D-arabinose when compared to low-shedder isolates, but no differences in SNPs were observed in genes encoding for proteins involved in the metabolism of these substrates. While genetic traits specific to super-shedder isolates were not identified in this study, differences in the level of gene expression or genes of unknown function may still contribute to some strains of E. coli O157:H7 reaching high densities within bovine feces.

  12. Genome-wide association analyses of child genotype effects and parent-of-origin effects in specific language impairment.

    PubMed

    Nudel, R; Simpson, N H; Baird, G; O'Hare, A; Conti-Ramsden, G; Bolton, P F; Hennessy, E R; Ring, S M; Davey Smith, G; Francks, C; Paracchini, S; Monaco, A P; Fisher, S E; Newbury, D F

    2014-04-01

    Specific language impairment (SLI) is a neurodevelopmental disorder that affects linguistic abilities when development is otherwise normal. We report the results of a genome-wide association study of SLI which included parent-of-origin effects and child genotype effects and used 278 families of language-impaired children. The child genotype effects analysis did not identify significant associations. We found genome-wide significant paternal parent-of-origin effects on chromosome 14q12 (P = 3.74 × 10(-8)) and suggestive maternal parent-of-origin effects on chromosome 5p13 (P = 1.16 × 10(-7)). A subsequent targeted association of six single-nucleotide-polymorphisms (SNPs) on chromosome 5 in 313 language-impaired individuals and their mothers from the ALSPAC cohort replicated the maternal effects, albeit in the opposite direction (P = 0.001); as fathers' genotypes were not available in the ALSPAC study, the replication analysis did not include paternal parent-of-origin effects. The paternally-associated SNP on chromosome 14 yields a non-synonymous coding change within the NOP9 gene. This gene encodes an RNA-binding protein that has been reported to be significantly dysregulated in individuals with schizophrenia. The region of maternal association on chromosome 5 falls between the PTGER4 and DAB2 genes, in a region previously implicated in autism and ADHD. The top SNP in this association locus is a potential expression QTL of ARHGEF19 (also called WGEF) on chromosome 1. Members of this protein family have been implicated in intellectual disability. In summary, this study implicates parent-of-origin effects in language impairment, and adds an interesting new dimension to the emerging picture of shared genetic etiology across various neurodevelopmental disorders. © 2014 The Authors. Genes, Brain and Behavior published by International Behavioural and Neural Genetics Society and John Wiley & Sons Ltd.

  13. Cellular dissection of psoriasis for transcriptome analyses and the post-GWAS era

    PubMed Central

    2014-01-01

    Background Genome-scale studies of psoriasis have been used to identify genes of potential relevance to disease mechanisms. For many identified genes, however, the cell type mediating disease activity is uncertain, which has limited our ability to design gene functional studies based on genomic findings. Methods We identified differentially expressed genes (DEGs) with altered expression in psoriasis lesions (n = 216 patients), as well as candidate genes near susceptibility loci from psoriasis GWAS studies. These gene sets were characterized based upon their expression across 10 cell types present in psoriasis lesions. Susceptibility-associated variation at intergenic (non-coding) loci was evaluated to identify sites of allele-specific transcription factor binding. Results Half of DEGs showed highest expression in skin cells, although the dominant cell type differed between psoriasis-increased DEGs (keratinocytes, 35%) and psoriasis-decreased DEGs (fibroblasts, 33%). In contrast, psoriasis GWAS candidates tended to have highest expression in immune cells (71%), with a significant fraction showing maximal expression in neutrophils (24%, P < 0.001). By identifying candidate cell types for genes near susceptibility loci, we could identify and prioritize SNPs at which susceptibility variants are predicted to influence transcription factor binding. This led to the identification of potentially causal (non-coding) SNPs for which susceptibility variants influence binding of AP-1, NF-κB, IRF1, STAT3 and STAT4. Conclusions These findings underscore the role of innate immunity in psoriasis and highlight neutrophils as a cell type linked with pathogenetic mechanisms. Assignment of candidate cell types to genes emerging from GWAS studies provides a first step towards functional analysis, and we have proposed an approach for generating hypotheses to explain GWAS hits at intergenic loci. PMID:24885462

  14. Genetic Diversity and Natural Selection in 42 kDa Region of Plasmodium vivax Merozoite Surface Protein-1 from China-Myanmar Endemic Border.

    PubMed

    Zhou, Xia; Tambo, Ernest; Su, Jing; Fang, Qiang; Ruan, Wei; Chen, Jun-Hu; Yin, Ming-Bo; Zhou, Xiao-Nong

    2017-10-01

    Plasmodium vivax merozoite surface protein-1 (PvMSP1) gene codes for a major malaria vaccine candidate antigen. However, its polymorphic nature represents an obstacle to the design of a protective vaccine. In this study, we analyzed the genetic polymorphism and natural selection of the C-terminal 42 kDa fragment within PvMSP1 gene (Pv MSP142) from 77 P. vivax isolates, collected from imported cases of China-Myanmar border (CMB) areas in Yunnan province and the inland cases from Anhui, Yunnan, and Zhejiang province in China during 2009-2012. Totally, 41 haplotypes were identified and 30 of them were new haplotypes. The differences between the rates of non-synonymous and synonymous mutations suggest that PvMSP142 has evolved under natural selection, and a high selective pressure preferentially acted on regions identified of PvMSP133. Our results also demonstrated that PvMSP142 of P. vivax isolates collected on China-Myanmar border areas display higher genetic polymorphisms than those collected from inland of China. Such results have significant implications for understanding the dynamic of the P. vivax population and may be useful information towards China malaria elimination campaign strategies.

  15. No association of dynamin binding protein (DNMBP) gene SNPs and Alzheimer's disease.

    PubMed

    Minster, Ryan L; DeKosky, Steven T; Kamboh, M Ilyas

    2008-10-01

    A recent scan of single nucleotide polymorphisms (SNPs) on chromosome 10q found significant association of six correlated SNPs with late-onset Alzheimer's disease (AD) among Japanese. We examined the SNP with the highest statistical significance (rs3740058) in a large Caucasian American case-control cohort and the remaining five SNPs in a smaller subset of cases and controls. We observed no association of statistical significance in either the total sample or the APOE*4 non-carriers for any of the SNPs.

  16. Rapid evolution of avirulence genes in rice blast fungus Magnaporthe oryzae

    PubMed Central

    2014-01-01

    Background Rice blast fungus Magnaporthe oryzae is one of the most devastating pathogens in rice. Avirulence genes in this fungus share a gene-for-gene relationship with the resistance genes in its host rice. Although numerous studies have shown that rice blast R-genes are extremely diverse and evolve rapidly in their host populations, little is known about the evolutionary patterns of the Avr-genes in the pathogens. Results Here, six well-characterized Avr-genes and seven randomly selected non-Avr control genes were used to investigate the genetic variations in 62 rice blast strains from different parts of China. Frequent presence/absence polymorphisms, high levels of nucleotide variation (~10-fold higher than non-Avr genes), high non-synonymous to synonymous substitution ratios, and frequent shared non-synonymous substitution were observed in the Avr-genes of these diversified blast strains. In addition, most Avr-genes are closely associated with diverse repeated sequences, which may partially explain the frequent presence/absence polymorphisms in Avr-genes. Conclusion The frequent deletion and gain of Avr-genes and rapid non-synonymous variations might be the primary mechanisms underlying rapid adaptive evolution of pathogens toward virulence to their host plants, and these features can be used as the indicators for identifying additional Avr-genes. The high number of nucleotide polymorphisms among Avr-gene alleles could also be used to distinguish genetic groups among different strains. PMID:24725999

  17. Accurate quantification of within- and between-host HBV evolutionary rates requires explicit transmission chain modelling.

    PubMed

    Vrancken, Bram; Suchard, Marc A; Lemey, Philippe

    2017-07-01

    Analyses of virus evolution in known transmission chains have the potential to elucidate the impact of transmission dynamics on the viral evolutionary rate and its difference within and between hosts. Lin et al. (2015, Journal of Virology , 89/7: 3512-22) recently investigated the evolutionary history of hepatitis B virus in a transmission chain and postulated that the 'colonization-adaptation-transmission' model can explain the differential impact of transmission on synonymous and non-synonymous substitution rates. Here, we revisit this dataset using a full probabilistic Bayesian phylogenetic framework that adequately accounts for the non-independence of sequence data when estimating evolutionary parameters. Examination of the transmission chain data under a flexible coalescent prior reveals a general inconsistency between the estimated timings and clustering patterns and the known transmission history, highlighting the need to incorporate host transmission information in the analysis. Using an explicit genealogical transmission chain model, we find strong support for a transmission-associated decrease of the overall evolutionary rate. However, in contrast to the initially reported larger transmission effect on non-synonymous substitution rate, we find a similar decrease in both non-synonymous and synonymous substitution rates that cannot be adequately explained by the colonization-adaptation-transmission model. An alternative explanation may involve a transmission/establishment advantage of hepatitis B virus variants that have accumulated fewer within-host substitutions, perhaps by spending more time in the covalently closed circular DNA state between each round of viral replication. More generally, this study illustrates that ignoring phylogenetic relationships can lead to misleading evolutionary estimates.

  18. Mutation Analysis of COL1A1 and COL1A2 in Fetuses with Osteogenesis Imperfecta Type II/III.

    PubMed

    Wang, Wenbo; Wu, Qichang; Cao, Lin; Sun, Li; Xu, Yasong; Guo, Qiwei

    2015-01-27

    Aim: To analyze COL1A1/2 mutations in prenatal-onset OI for determine the proportion of mutations in type I collagen genes among prenatal onset OI and to provide additional data for genotype-phenotype analyses. Material and Methods: Ten cases of severe fetal short-limb dwarfism detected by antenatal ultrasonography were referred to our center. Before the termination of pregnancy, cordocentesis was performed for fetal karyotype and COL1A1/2 gene sequencing analysis. Postmortem radiographic examination was performed at all instances for definitive diagnosis. Results: COL1A1 and COL1A2 SNP and mutations were identified in all the cases. Among these, one synonymous SNP and four synonymous SNPs were recognized in COL1A1/2, respectively, seven cases have distinct heterozygous mutations and six new COL1A1/2 gene mutations were identified. Conclusion: There has been substantial progress in the identification of the molecular defects responsible for skeletal dysplasias. With the constant increase in the number of identified mutations in COL1A1 and COL1A2, genotype-phenotype correlation is becoming increasingly pertinent. © 2015 S. Karger AG, Basel.

  19. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

    PubMed

    Koning-Boucoiran, Carole F S; Esselink, G Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P C; Gitonga, Virginia W; Krens, Frans A; Voorrips, Roeland E; van de Weg, W Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J M

    2015-01-01

    In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.

  20. Leptin and Leptin-Related Gene Polymorphisms, Obesity, and Influenza A/H1N1 Vaccine–Induced Immune Responses in Older Individuals

    PubMed Central

    Ovsyannikova, Inna G.; White, Sarah J.; Larrabee, Beth R.; Grill, Diane E.; Jacobson, Robert M.; Poland, Gregory A.

    2014-01-01

    Obesity is a risk factor for complicated influenza A/H1N1 disease and poor vaccine immunogenicity. Leptin, an adipocyte-derived hormone/cytokine, has many immune regulatory functions and therefore could explain susceptibility to infections and poor vaccine outcomes. We recruited 159 healthy adults (5074 years old) who were immunized with inactivated TIV influenza vacci–ne that contained A/California/7/2009/H1N1 virus. We found a strong correlation between leptin concentration and BMI (r=0.55, p<0.0001), but no association with hemagglutination antibody inhibition (HAI), B-cell, or granzyme B responses. We found a slight correlation between leptin concentration and an immunosenescence marker (TREC: T-cell receptor excision circles) level (r=0.23, p=0.01). We found eight SNPs in the LEP/LEPR/GHRL genes that were associated with leptin levels and four SNPs in the PTPN1/LEPR/STAT3 genes associated with peripheral blood TREC levels (p<0.05). Heterozygosity of the synonymous variant rs2230604 in the PTPN1 gene was associated with a significantly lower (531 vs. 259, p = 0.005) TREC level, as compared to the homozygous major variant. We also found eight SNPs in the LEP/PPARG/CRP genes associated with variations in influenza-specific HAI and B-cell responses (p<0.05). Our results suggest that specific allelic variations in the leptin-related genes may influence adaptive immune responses to influenza vaccine. PMID:24360890

  1. Bovine GDF10 gene polymorphism analysis and its association with body measurement traits in Chinese indigenous cattle.

    PubMed

    Adoligbe, C; Zan, Linsen; Farougou, S; Wang, Hongbao; Ujjan, J A

    2012-04-01

    The objective of this research was to detect bovine GDF10 gene polymorphism and analyze its association with body measurement traits (BMT) of animals sampled from 6 different Chinese indigenous cattle populations. The populations included Xuelong (Xl), Luxi (Lx), Qinchuan (Qc), Jiaxian red (Jx), Xianang (Xn) and Nanyang (Ny). Blood samples were taken from a total of 417 female animals stratified into age categories of 12-36 months. Polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) was employed to find out GDF10 single polymorphism nucleotide (SNPs) and explore their possible association with BMT. Sequence analysis of GDF10 gene revealed 3 SNPs in total: 1 in exon1 (G142A) and 2 in exon3 (A11471G, and T12495C). G142A and T12495C SNPs are both synonymous mutation. They showed 2 genotypes namely respectively (GG, GA) and (PP and PB). A11471G SNP is a missense mutation leading to the change of Alanine to Threonine amino acid. It showed three genotypes namely AA, BB and AB. Analysis of association of polymorphism with body measurement traits at the three locus showed that there were significant effects on BMT in Qc, Jx and Ny cattle population. These results suggest that the GDF10 gene might have potential effects on body measurement traits in the above mentioned cattle populations and could be used for marker-assisted selection.

  2. Interactions of Cigarette Smoking with NAT2 Polymorphisms Impact Rheumatoid Arthritis Risk in African Americans

    PubMed Central

    Mikuls, Ted R.; LeVan, Tricia; Gould, Karen A.; Yu, Fang; Thiele, Geoffrey M.; Bynote, Kimberly K.; Conn, Doyt; Jonas, Beth L.; Callahan, Leigh F.; Smith, Edwin; Brasington, Richard; Moreland, Larry W.; Reynolds, Richard; Gaffo, Angelo; Bridges, S. Louis

    2011-01-01

    Objective To examine whether polymorphisms in genes coding for drug metabolizing enzymes (DMEs) impact rheumatoid arthritis (RA) risk due to cigarette smoking in African Americans. Methods Smoking status was evaluated in African American RA cases and non-RA controls categorized as heavy (≥ 10 pack-years) vs. other. Individuals were genotyped for a homozygous deletion polymorphism in glutathione S-transferase Mu-1 (GSTM1-null) in addition to tagging single nucleotide polymorphisms (SNPs) in N-acetyltransferase (NAT)1, NAT2, and epoxide hydrolase (EPXH1). Associations of genotypes with RA were examined using logistic regression and gene-smoking interactions were assessed. Results There were no significant associations of any DME genotype with RA. After adjustment for multiple comparisons, there were significant additive interactions between heavy smoking and NAT2 SNPs rs9987109 (Padd = 0.000003) and rs1208 (Padd = 0.00001); attributable proportions (APs) due to interaction ranged from 0.61 to 0.67. None of the multiplicative gene-smoking interactions examined remained significant after adjustment for multiple testing in overall disease risk. There was no evidence of significant gene-smoking interactions in analyses of GSTM1-null, NAT1, or EPXH1. DME gene-smoking interactions were similar when cases were limited to anti-citrullinated protein antibody (ACPA) positive individuals. Conclusion Among African Americans, RA risk imposed by heavy smoking appears to be mediated in part by genetic variation in NAT2. While further studies are needed to elucidate mechanisms underpinning these interactions, these SNPs appear to identify African American smokers at a much higher risk for RA with relative risks that are at least two-fold higher compared to non-smokers lacking these risk alleles. PMID:21989592

  3. Cell-type-specific enrichment of risk-associated regulatory elements at ovarian cancer susceptibility loci.

    PubMed

    Coetzee, Simon G; Shen, Howard C; Hazelett, Dennis J; Lawrenson, Kate; Kuchenbaecker, Karoline; Tyrer, Jonathan; Rhie, Suhn K; Levanon, Keren; Karst, Alison; Drapkin, Ronny; Ramus, Susan J; Couch, Fergus J; Offit, Kenneth; Chenevix-Trench, Georgia; Monteiro, Alvaro N A; Antoniou, Antonis; Freedman, Matthew; Coetzee, Gerhard A; Pharoah, Paul D P; Noushmehr, Houtan; Gayther, Simon A

    2015-07-01

    Understanding the regulatory landscape of the human genome is a central question in complex trait genetics. Most single-nucleotide polymorphisms (SNPs) associated with cancer risk lie in non-protein-coding regions, implicating regulatory DNA elements as functional targets of susceptibility variants. Here, we describe genome-wide annotation of regions of open chromatin and histone modification in fallopian tube and ovarian surface epithelial cells (FTSECs, OSECs), the debated cellular origins of high-grade serous ovarian cancers (HGSOCs) and in endometriosis epithelial cells (EECs), the likely precursor of clear cell ovarian carcinomas (CCOCs). The regulatory architecture of these cell types was compared with normal human mammary epithelial cells and LNCaP prostate cancer cells. We observed similar positional patterns of global enhancer signatures across the three different ovarian cancer precursor cell types, and evidence of tissue-specific regulatory signatures compared to non-gynecological cell types. We found significant enrichment for risk-associated SNPs intersecting regulatory biofeatures at 17 known HGSOC susceptibility loci in FTSECs (P = 3.8 × 10(-30)), OSECs (P = 2.4 × 10(-23)) and HMECs (P = 6.7 × 10(-15)) but not for EECs (P = 0.45) or LNCaP cells (P = 0.88). Hierarchical clustering of risk SNPs conditioned on the six different cell types indicates FTSECs and OSECs are highly related (96% of samples using multi-scale bootstrapping) suggesting both cell types may be precursors of HGSOC. These data represent the first description of regulatory catalogues of normal precursor cells for different ovarian cancer subtypes, and provide unique insights into the tissue specific regulatory variation with respect to the likely functional targets of germline genetic susceptibility variants for ovarian cancer. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  4. The importance of mRNA structure in determining the pathogenicity of synonymous and non-synonymous mutations in haemophilia

    PubMed Central

    Hamasaki-Katagiri, Nobuko; Lin, Brian C.; Simon, Jonathan; Hunt, Ryan C.; Schiller, Tal; Russek-Cohen, Estelle; Komar, Anton A.; Bar, Haim; Kimchi-Sarfaty, Chava

    2016-01-01

    Introduction Mutational analysis is commonly used to support the diagnosis and management of haemophilia. This has allowed for the generation of large mutation databases which provide unparalleled insight into genotype-phenotype relationships. Haemophilia is associated with inversions, deletions, insertions, nonsense and missense mutations. Both synonymous and non-synonymous mutations influence the base pairing of messenger RNA (mRNA), which can alter mRNA structure, cellular half-life and ribosome processivity/elongation. However, the role of mRNA structure in determining the pathogenicity of point mutations in haemophilia has not been evaluated. Aim To evaluate mRNA thermodynamic stability and associated RNA prediction software as a means to distinguish between neutral and disease-associated mutations in haemophilia. Methods Five mRNA structure prediction software programs were used to assess the thermodynamic stability of mRNA fragments carrying neutral vs. disease-associated and synonymous vs. non-synonymous point mutations in F8, F9 and a third X-linked gene, DMD (dystrophin). Results In F8 and DMD, disease-associated mutations tend to occur in more structurally stable mRNA regions, represented by lower MFE (minimum free energy) levels. In comparing multiple software packages for mRNA structure prediction, a 101–151 nucleotide fragment length appears to be a feasible range for structuring future studies. Conclusion mRNA thermodynamic stability is one predictive characteristic, which when combined with other RNA and protein features, may offer significant insight when screening sequencing data for novel disease-associated mutations. Our results also suggest potential utility in evaluating the mRNA thermodynamic stability profile of a gene when determining the viability of interchanging codons for biological and therapeutic applications. PMID:27933712

  5. SNP-associations and phenotype predictions from hundreds of microbial genomes without genome alignments.

    PubMed

    Hall, Barry G

    2014-01-01

    SNP-association studies are a starting point for identifying genes that may be responsible for specific phenotypes, such as disease traits. The vast bulk of tools for SNP-association studies are directed toward SNPs in the human genome, and I am unaware of any tools designed specifically for such studies in bacterial or viral genomes. The PPFS (Predict Phenotypes From SNPs) package described here is an add-on to kSNP , a program that can identify SNPs in a data set of hundreds of microbial genomes. PPFS identifies those SNPs that are non-randomly associated with a phenotype based on the χ² probability, then uses those diagnostic SNPs for two distinct, but related, purposes: (1) to predict the phenotypes of strains whose phenotypes are unknown, and (2) to identify those diagnostic SNPs that are most likely to be causally related to the phenotype. In the example illustrated here, from a set of 68 E. coli genomes, for 67 of which the pathogenicity phenotype was known, there were 418,500 SNPs. Using the phenotypes of 36 of those strains, PPFS identified 207 diagnostic SNPs. The diagnostic SNPs predicted the phenotypes of all of the genomes with 97% accuracy. It then identified 97 SNPs whose probability of being causally related to the pathogenic phenotype was >0.999. In a second example, from a set of 116 E. coli genome sequences, using the phenotypes of 65 strains PPFS identified 101 SNPs that predicted the source host (human or non-human) with 90% accuracy.

  6. Genome-wide analysis identifies novel loci associated with ovarian cancer outcomes: findings from the Ovarian Cancer Association Consortium

    PubMed Central

    Johnatty, Sharon E.; Tyrer, Jonathan P.; Kar, Siddhartha; Beesley, Jonathan; Lu, Yi; Gao, Bo; Fasching, Peter A.; Hein, Alexander; Ekici, Arif B.; Beckmann, Matthias W.; Lambrechts, Diether; Nieuwenhuysen, Els Van; Vergote, Ignace; Lambrechts, Sandrina; Rossing, Mary Anne; Doherty, Jennifer A.; Chang-Claude, Jenny; Modugno, Francesmary; Ness, Roberta B.; Moysich, Kirsten B.; Levine, Douglas A.; Kiemeney, Lambertus A.; Massuger, Leon F.A.G.; Gronwald, Jacek; Lubiński, Jan; Jakubowska, Anna; Cybulski, Cezary; Brinton, Louise; Lissowska, Jolanta; Wentzensen, Nicolas; Song, Honglin; Rhenius, Valerie; Campbell, Ian; Eccles, Diana; Sieh, Weiva; Whittemore, Alice S.; McGuire, Valerie; Rothstein, Joseph H.; Sutphen, Rebecca; Anton-Culver, Hoda; Ziogas, Argyrios; Gayther, Simon A.; Gentry-Maharaj, Aleksandra; Menon, Usha; Ramus, Susan J.; Pearce, Celeste L; Pike, Malcolm C; Stram, Daniel O.; Wu, Anna H.; Kupryjanczyk, Jolanta; Dansonka-Mieszkowska, Agnieszka; Rzepecka, Iwona K.; Spiewankiewicz, Beata; Goodman, Marc T.; Wilkens, Lynne R.; Carney, Michael E.; Thompson, Pamela J; Heitz, Florian; du Bois, Andreas; Schwaab, Ira; Harter, Philipp; Pisterer, Jacobus; Hillemanns, Peter; Karlan, Beth Y.; Walsh, Christine; Lester, Jenny; Orsulic, Sandra; Winham, Stacey J; Earp, Madalene; Larson, Melissa C.; Fogarty, Zachary C.; Høgdall, Estrid; Jensen, Allan; Kjaer, Susanne Kruger; Fridley, Brooke L.; Cunningham, Julie M.; Vierkant, Robert A.; Schildkraut, Joellen M.; Iversen, Edwin S.; Terry, Kathryn L.; Cramer, Daniel W.; Bandera, Elisa V.; Orlow, Irene; Pejovic, Tanja; Bean, Yukie; Høgdall, Claus; Lundvall, Lene; McNeish, Ian; Paul, James; Carty, Karen; Siddiqui, Nadeem; Glasspool, Rosalind; Sellers, Thomas; Kennedy, Catherine; Chiew, Yoke-Eng; Berchuck, Andrew; MacGregor, Stuart; deFazio, Anna; Pharoah, Paul D.P.; Goode, Ellen L.; deFazio, Anna; Webb, Penelope M.; Chenevix-Trench, Georgia

    2015-01-01

    Purpose Chemotherapy resistance remains a major challenge in the treatment of ovarian cancer. We hypothesize that germline polymorphisms might be associated with clinical outcome. Experimental Design We analyzed ~2.8 million genotyped and imputed SNPs from the iCOGS experiment for progression-free survival (PFS) and overall survival (OS) in 2,901 European epithelial ovarian cancer (EOC) patients who underwent firstline treatment of cytoreductive surgery and chemotherapy regardless of regimen, and in a subset of 1,098 patients treated with ≥4 cycles of paclitaxel and carboplatin at standard doses. We evaluated the top SNPs in 4,434 EOC patients including patients from The Cancer Genome Atlas. Additionally we conducted pathway analysis of all intragenic SNPs and tested their association with PFS and OS using gene set enrichment analysis. Results Five SNPs were significantly associated (p≤1.0x10−5) with poorer outcomes in at least one of the four analyses, three of which, rs4910232 (11p15.3), rs2549714 (16q23) and rs6674079 (1q22) were located in long non-coding RNAs (lncRNAs) RP11–179A10.1, RP11–314O13.1 and RP11–284F21.8 respectively (p≤7.1x10−6). ENCODE ChIP-seq data at 1q22 for normal ovary shows evidence of histone modification around RP11–284F21.8, and rs6674079 is perfectly correlated with another SNP within the super-enhancer MEF2D, expression levels of which were reportedly associated with prognosis in another solid tumor. YAP1- and WWTR1 (TAZ)-stimulated gene expression, and HDL-mediated lipid transport pathways were associated with PFS and OS, respectively, in the cohort who had standard chemotherapy (pGSEA≤6x10−3). Conclusion We have identified SNPs in three lncRNAs that might be important targets for novel EOC therapies. PMID:26152742

  7. RNA-Seq identifies SNP markers for growth traits in rainbow trout.

    PubMed

    Salem, Mohamed; Vallejo, Roger L; Leeds, Timothy D; Palti, Yniv; Liu, Sixin; Sabbagh, Annas; Rexroad, Caird E; Yao, Jianbo

    2012-01-01

    Fast growth is an important and highly desired trait, which affects the profitability of food animal production, with feed costs accounting for the largest proportion of production costs. Traditional phenotype-based selection is typically used to select for growth traits; however, genetic improvement is slow over generations. Single nucleotide polymorphisms (SNPs) explain 90% of the genetic differences between individuals; therefore, they are most suitable for genetic evaluation and strategies that employ molecular genetics for selective breeding. SNPs found within or near a coding sequence are of particular interest because they are more likely to alter the biological function of a protein. We aimed to use SNPs to identify markers and genes associated with genetic variation in growth. RNA-Seq whole-transcriptome analysis of pooled cDNA samples from a population of rainbow trout selected for improved growth versus unselected genetic cohorts (10 fish from 1 full-sib family each) identified SNP markers associated with growth-rate. The allelic imbalances (the ratio between the allele frequencies of the fast growing sample and that of the slow growing sample) were considered at scores >5.0 as an amplification and <0.2 as loss of heterozygosity. A subset of SNPs (n = 54) were validated and evaluated for association with growth traits in 778 individuals of a three-generation parent/offspring panel representing 40 families. Twenty-two SNP markers and one mitochondrial haplotype were significantly associated with growth traits. Polymorphism of 48 of the markers was confirmed in other commercially important aquaculture stocks. Many markers were clustered into genes of metabolic energy production pathways and are suitable candidates for genetic selection. The study demonstrates that RNA-Seq at low sequence coverage of divergent populations is a fast and effective means of identifying SNPs, with allelic imbalances between phenotypes. This technique is suitable for marker development in non-model species lacking complete and well-annotated genome reference sequences.

  8. The analysis of APOL1 genetic variation and haplotype diversity provided by 1000 Genomes project.

    PubMed

    Peng, Ting; Wang, Li; Li, Guisen

    2017-08-11

    The APOL1 gene variants has been shown to be associated with an increased risk of multiple kinds of diseases, particularly in African Americans, but not in Caucasians and Asians. In this study, we explored the single nucleotide polymorphism (SNP) and haplotype diversity of APOL1 gene in different races provided by 1000 Genomes project. Variants of APOL1 gene in 1000 Genome Project were obtained and SNPs located in the regulatory region or coding region were selected for genetic variation analysis. Total 2504 individuals from 26 populations were classified as four groups that included Africa, Europe, Asia and Admixed populations. Tag SNPs were selected to evaluate the haplotype diversities in the four populations by HaploStats software. APOL1 gene was surrounded by some of the most polymorphic genes in the human genome, variation of APOL1 gene was common, with up to 613 SNP (1000 Genome Project reported) and 99 of them (16.2%) with MAF ≥ 1%. There were 79 SNPs in the URR and 92 SNPs in 3'UTR. Total 12 SNPs in URR and 24 SNPs in 3'UTR were considered as common variants with MAF ≥ 1%. It is worth noting that URR-1 was presents lower frequencies in European populations, while other three haplotypes taken an opposite pattern; 3'UTR presents several high-frequency variation sites in a short segment, and the differences of its haplotypes among different population were significant (P < 0.01), UTR-1 and UTR-5 presented much higher frequency in African population, while UTR-2, UTR-3 and UTR-4 were much lower. APOL1 coding region showed that two SNP of G1 with higher frequency are actually pull down the haplotype H-1 frequency when considering all populations pooled together, and the diversity among the four populations be widen by the G1 two mutation (P 1  = 3.33E-4 vs P 2  = 3.61E-30). The distributions of APOL1 gene variants and haplotypes were significantly different among the different populations, in either regulatory or coding regions. It could provide clues for the future genetic study of APOL1 related diseases.

  9. 708 Common and 2010 rare DISC1 locus variants identified in 1542 subjects: analysis for association with psychiatric disorder and cognitive traits.

    PubMed

    Thomson, P A; Parla, J S; McRae, A F; Kramer, M; Ramakrishnan, K; Yao, J; Soares, D C; McCarthy, S; Morris, S W; Cardone, L; Cass, S; Ghiban, E; Hennah, W; Evans, K L; Rebolini, D; Millar, J K; Harris, S E; Starr, J M; MacIntyre, D J; McIntosh, A M; Watson, J D; Deary, I J; Visscher, P M; Blackwood, D H; McCombie, W R; Porteous, D J

    2014-06-01

    A balanced t(1;11) translocation that transects the Disrupted in schizophrenia 1 (DISC1) gene shows genome-wide significant linkage for schizophrenia and recurrent major depressive disorder (rMDD) in a single large Scottish family, but genome-wide and exome sequencing-based association studies have not supported a role for DISC1 in psychiatric illness. To explore DISC1 in more detail, we sequenced 528 kb of the DISC1 locus in 653 cases and 889 controls. We report 2718 validated single-nucleotide polymorphisms (SNPs) of which 2010 have a minor allele frequency of <1%. Only 38% of these variants are reported in the 1000 Genomes Project European subset. This suggests that many DISC1 SNPs remain undiscovered and are essentially private. Rare coding variants identified exclusively in patients were found in likely functional protein domains. Significant region-wide association was observed between rs16856199 and rMDD (P=0.026, unadjusted P=6.3 × 10(-5), OR=3.48). This was not replicated in additional recurrent major depression samples (replication P=0.11). Combined analysis of both the original and replication set supported the original association (P=0.0058, OR=1.46). Evidence for segregation of this variant with disease in families was limited to those of rMDD individuals referred from primary care. Burden analysis for coding and non-coding variants gave nominal associations with diagnosis and measures of mood and cognition. Together, these observations are likely to generalise to other candidate genes for major mental illness and may thus provide guidelines for the design of future studies.

  10. Characterization and phylogenetic analysis of the swine leukocyte antigen 3 gene from Korean native pigs.

    PubMed

    Chung, H Y; Choi, Y C; Park, H N

    2015-05-18

    We investigated the phylogenetic relationships between pig breeds, compared the genetic similarity between humans and pigs, and provided basic genetic information on Korean native pigs (KNPs), using genetic variants of the swine leukocyte antigen 3 (SLA-3) gene. Primers were based on sequences from GenBank (accession Nos. AF464010 and AF464009). Polymerase chain reaction analysis amplified approximately 1727 bp of segments, which contained 1086 bp of coding regions and 641 bp of the 3'- and 5'-untranslated regions. Bacterial artificial chromosome clones of miniature pigs were used for sequencing the SLA-3 genomic region, which was 3114 bp in total length, including the coding (1086 bp) and non-coding (2028 bp) regions. Sequence analysis detected 53 single nucleotide polymorphisms (SNPs), based on a minor allele frequency greater than 0.01, which is low compared with other pig breeds, and the results suggest that there is low genetic variability in KNPs. Comparative analysis revealed that humans possess approximately three times more genetic variation than do pigs. Approximately 71% of SNPs in exons 2 and 3 were detected in KNPs, and exon 5 in humans is a highly polymorphic region. Newly identified sequences of SLA-3 using KNPs were submitted to GenBank (accession No. DQ992512-18). Cluster analysis revealed that KNPs were grouped according to three major alleles: SLA-3*0502 (DQ992518), SLA-3*0302 (DQ992513 and DQ992516), and SLA-3*0303 (DQ992512, DQ992514, DQ992515, and DQ992517). Alignments revealed that humans have a relatively close genetic relationship with pigs and chimpanzees. The information provided by this study may be useful in KNP management.

  11. Allele-Skewed DNA Modification in the Brain: Relevance to a Schizophrenia GWAS

    PubMed Central

    Gagliano, Sarah A.; Ptak, Carolyn; Mak, Denise Y.F.; Shamsi, Mehrdad; Oh, Gabriel; Knight, Joanne; Boutros, Paul C.; Petronis, Arturas

    2016-01-01

    Numerous recent studies have suggested that phenotypic effects of DNA sequence variants can be mediated or modulated by their epigenetic marks, such as allele-skewed DNA modification (ASM). Using Affymetrix SNP microarrays, we performed a comprehensive search of ASM effects in human post-mortem brain and sperm samples (total n = 256) from individuals with major psychosis and control individuals. Depending on the phenotypic category of the brain samples, 1.4%–7.5% of interrogated SNPs exhibited ASM effects. Next, we investigated ASM in the context of genetic studies of schizophrenia and detected that brain ASM SNPs were significantly overrepresented among sub-threshold SNPs from a schizophrenia genome-wide association study (GWAS). Brain ASM SNPs showed a much stronger enrichment in a schizophrenia GWAS than in 17 large GWASs of non-psychiatric diseases and traits, arguing that ASM effects are at least partially tissue specific. Studies of germline and control brain ASM SNPs supported a causal association between ASM and schizophrenia. Finally, significantly higher proportions of ASM SNPs than of non-ASM SNPs were detected at loci exhibiting epigenetic signatures of enhancers and promoters, and they were overrepresented within transcription factor binding regions and DNase I hypersensitive sites. All of these findings collectively indicate that ASM SNPs should be prioritized in follow-up GWASs. PMID:27087318

  12. HapMap filter 1.0: a tool to preprocess the HapMap genotypic data for association studies.

    PubMed

    Zhang, Wei; Duan, Shiwei; Dolan, M Eileen

    2008-05-13

    The International HapMap Project provides a resource of genotypic data on single nucleotide polymorphisms (SNPs), which can be used in various association studies to identify the genetic determinants for phenotypic variations. Prior to the association studies, the HapMap dataset should be preprocessed in order to reduce the computation time and control the multiple testing problem. The less informative SNPs including those with very low genotyping rate and SNPs with rare minor allele frequencies to some extent in one or more population are removed. Some research designs only use SNPs in a subset of HapMap cell lines. Although the HapMap website and other association software packages have provided some basic tools for optimizing these datasets, a fast and user-friendly program to generate the output for filtered genotypic data would be beneficial for association studies. Here, we present a flexible, straight-forward bioinformatics program that can be useful in preparing the HapMap genotypic data for association studies by specifying cell lines and two common filtering criteria: minor allele frequencies and genotyping rate. The software was developed for Microsoft Windows and written in C++. The Windows executable and source code in Microsoft Visual C++ are available at Google Code (http://hapmap-filter-v1.googlecode.com/) or upon request. Their distribution is subject to GNU General Public License v3.

  13. Axon guidance pathways served as common targets for human speech/language evolution and related disorders.

    PubMed

    Lei, Huimeng; Yan, Zhangming; Sun, Xiaohong; Zhang, Yue; Wang, Jianhong; Ma, Caihong; Xu, Qunyuan; Wang, Rui; Jarvis, Erich D; Sun, Zhirong

    2017-11-01

    Human and several nonhuman species share the rare ability of modifying acoustic and/or syntactic features of sounds produced, i.e. vocal learning, which is the important neurobiological and behavioral substrate of human speech/language. This convergent trait was suggested to be associated with significant genomic convergence and best manifested at the ROBO-SLIT axon guidance pathway. Here we verified the significance of such genomic convergence and assessed its functional relevance to human speech/language using human genetic variation data. In normal human populations, we found the affected amino acid sites were well fixed and accompanied with significantly more associated protein-coding SNPs in the same genes than the rest genes. Diseased individuals with speech/language disorders have significant more low frequency protein coding SNPs but they preferentially occurred outside the affected genes. Such patients' SNPs were enriched in several functional categories including two axon guidance pathways (mediated by netrin and semaphorin) that interact with ROBO-SLITs. Four of the six patients have homozygous missense SNPs on PRAME gene family, one youngest gene family in human lineage, which possibly acts upon retinoic acid receptor signaling, similarly as FOXP2, to modulate axon guidance. Taken together, we suggest the axon guidance pathways (e.g. ROBO-SLIT, PRAME gene family) served as common targets for human speech/language evolution and related disorders. Copyright © 2017 Elsevier Inc. All rights reserved.

  14. Clinical code set engineering for reusing EHR data for research: A review.

    PubMed

    Williams, Richard; Kontopantelis, Evangelos; Buchan, Iain; Peek, Niels

    2017-06-01

    The construction of reliable, reusable clinical code sets is essential when re-using Electronic Health Record (EHR) data for research. Yet code set definitions are rarely transparent and their sharing is almost non-existent. There is a lack of methodological standards for the management (construction, sharing, revision and reuse) of clinical code sets which needs to be addressed to ensure the reliability and credibility of studies which use code sets. To review methodological literature on the management of sets of clinical codes used in research on clinical databases and to provide a list of best practice recommendations for future studies and software tools. We performed an exhaustive search for methodological papers about clinical code set engineering for re-using EHR data in research. This was supplemented with papers identified by snowball sampling. In addition, a list of e-phenotyping systems was constructed by merging references from several systematic reviews on this topic, and the processes adopted by those systems for code set management was reviewed. Thirty methodological papers were reviewed. Common approaches included: creating an initial list of synonyms for the condition of interest (n=20); making use of the hierarchical nature of coding terminologies during searching (n=23); reviewing sets with clinician input (n=20); and reusing and updating an existing code set (n=20). Several open source software tools (n=3) were discovered. There is a need for software tools that enable users to easily and quickly create, revise, extend, review and share code sets and we provide a list of recommendations for their design and implementation. Research re-using EHR data could be improved through the further development, more widespread use and routine reporting of the methods by which clinical codes were selected. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

  15. SNP rs2071095 in LincRNA H19 is associated with breast cancer risk.

    PubMed

    Cui, Ping; Zhao, Yanrui; Chu, Xinlei; He, Na; Zheng, Hong; Han, Jiali; Song, Fengju; Chen, Kexin

    2018-05-08

    An increasing number of long intergenic non-coding RNAs (lincRNAs) appear to play critical roles in cancer development and progression. To assess the association between SNPs that reside in regions of lincRNAs and breast cancer risk, we performed a large case-control study in China. We carried out a two-stage case-control study including 2881 breast cancer cases and 3220 controls. In stage I, we genotyped 17 independent (r 2  < 0.5) SNPs located in 6 tumor-related lincRNAs by using the TaqMan platform. In stage II, SNPs potentially associated with breast cancer risk were replicated in an independent population. Quantitative real-time PCR was used to measure H19 levels in tissues from 228 breast cancer patients with different genotypes. We identified 2 SNPs significantly associated with breast cancer risk in stage I (P < 0.05), but not significantly replicated in stage II. We combined the data from stage I and stage II, and found that, compared with the rs2071095 CC genotype, AA and CA + AA genotypes were associated with significantly decreased risk of breast cancer (adjusted OR 0.83, 95% CI 0.69-0.99; adjusted OR 0.88, 95% CI 0.80-0.98, respectively). Stratified analyses showed that rs2071095 was associated with breast cancer risk in estrogen receptor (ER)-positive patients (P = 0.002), but not in ER-negative ones (P = 0.332). Expression levels of H19 in breast cancer cases with AA genotype were significantly lower than those with CC genotype. We identified that rs2071095 may contribute to the susceptibility of breast cancer in Chinese women via affecting H19 expression. The mechanisms underlying the association remain to be investigated.

  16. Catalog of MicroRNA Seed Polymorphisms in Vertebrates

    PubMed Central

    Calin, George Adrian; Horvat, Simon; Jiang, Zhihua; Dovc, Peter; Kunej, Tanja

    2012-01-01

    MicroRNAs (miRNAs) are a class of non-coding RNA that plays an important role in posttranscriptional regulation of mRNA. Evidence has shown that miRNA gene variability might interfere with its function resulting in phenotypic variation and disease susceptibility. A major role in miRNA target recognition is ascribed to complementarity with the miRNA seed region that can be affected by polymorphisms. In the present study, we developed an online tool for the detection of miRNA polymorphisms (miRNA SNiPer) in vertebrates (http://www.integratomics-time.com/miRNA-SNiPer) and generated a catalog of miRNA seed region polymorphisms (miR-seed-SNPs) consisting of 149 SNPs in six species. Although a majority of detected polymorphisms were due to point mutations, two consecutive nucleotide substitutions (double nucleotide polymorphisms, DNPs) were also identified in nine miRNAs. We determined that miR-SNPs are frequently located within the quantitative trait loci (QTL), chromosome fragile sites, and cancer susceptibility loci, indicating their potential role in the genetic control of various complex traits. To test this further, we performed an association analysis between the mmu-miR-717 seed SNP rs30372501, which is polymorphic in a large number of standard inbred strains, and all phenotypic traits in these strains deposited in the Mouse Phenome Database. Analysis showed a significant association between the mmu-miR-717 seed SNP and a diverse array of traits including behavior, blood-clinical chemistry, body weight size and growth, and immune system suggesting that seed SNPs can indeed have major pleiotropic effects. The bioinformatics analyses, data and tools developed in the present study can serve researchers as a starting point in testing more targeted hypotheses and designing experiments using optimal species or strains for further mechanistic studies. PMID:22303453

  17. A Simple Test of Class-Level Genetic Association Can Reveal Novel Cardiometabolic Trait Loci.

    PubMed

    Qian, Jing; Nunez, Sara; Reed, Eric; Reilly, Muredach P; Foulkes, Andrea S

    2016-01-01

    Characterizing the genetic determinants of complex diseases can be further augmented by incorporating knowledge of underlying structure or classifications of the genome, such as newly developed mappings of protein-coding genes, epigenetic marks, enhancer elements and non-coding RNAs. We apply a simple class-level testing framework, termed Genetic Class Association Testing (GenCAT), to identify protein-coding gene association with 14 cardiometabolic (CMD) related traits across 6 publicly available genome wide association (GWA) meta-analysis data resources. GenCAT uses SNP-level meta-analysis test statistics across all SNPs within a class of elements, as well as the size of the class and its unique correlation structure, to determine if the class is statistically meaningful. The novelty of findings is evaluated through investigation of regional signals. A subset of findings are validated using recently updated, larger meta-analysis resources. A simulation study is presented to characterize overall performance with respect to power, control of family-wise error and computational efficiency. All analysis is performed using the GenCAT package, R version 3.2.1. We demonstrate that class-level testing complements the common first stage minP approach that involves individual SNP-level testing followed by post-hoc ascribing of statistically significant SNPs to genes and loci. GenCAT suggests 54 protein-coding genes at 41 distinct loci for the 13 CMD traits investigated in the discovery analysis, that are beyond the discoveries of minP alone. An additional application to biological pathways demonstrates flexibility in defining genetic classes. We conclude that it would be prudent to include class-level testing as standard practice in GWA analysis. GenCAT, for example, can be used as a simple, complementary and efficient strategy for class-level testing that leverages existing data resources, requires only summary level data in the form of test statistics, and adds significant value with respect to its potential for identifying multiple novel and clinically relevant trait associations.

  18. Analysis of nucleotide diversity among alleles of the major bacterial blight resistance gene Xa27 in cultivars of rice (Oryza sativa) and its wild relatives.

    PubMed

    Bimolata, Waikhom; Kumar, Anirudh; Sundaram, Raman Meenakshi; Laha, Gouri Shankar; Qureshi, Insaf Ahmed; Reddy, Gajjala Ashok; Ghazi, Irfan Ahmad

    2013-08-01

    Xa27 is one of the important R-genes, effective against bacterial blight disease of rice caused by Xanthomonas oryzae pv. oryzae (Xoo). Using natural population of Oryza, we analyzed the sequence variation in the functionally important domains of Xa27 across the Oryza species. DNA sequences of Xa27 alleles from 27 rice accessions revealed higher nucleotide diversity among the reported R-genes of rice. Sequence polymorphism analysis revealed synonymous and non-synonymous mutations in addition to a number of InDels in non-coding regions of the gene. High sequence variation was observed in the promoter region including the 5'UTR with 'π' value 0.00916 and 'θ w ' = 0.01785. Comparative analysis of the identified Xa27 alleles with that of IRBB27 and IR24 indicated the operation of both positive selection (Ka/Ks > 1) and neutral selection (Ka/Ks ≈ 0). The genetic distances of alleles of the gene from Oryza nivara were nearer to IRBB27 as compared to IR24. We also found the presence of conserved and null UPT (upregulated by transcriptional activator) box in the isolated alleles. Considerable amino acid polymorphism was localized in the trans-membrane domain for which the functional significance is yet to be elucidated. However, the absence of functional UPT box in all the alleles except IRBB27 suggests the maintenance of single resistant allele throughout the natural population.

  19. Genome-Wide Association Analysis of Blood Biomarkers in Chronic Obstructive Pulmonary Disease

    PubMed Central

    Kim, Deog Kyeom; Cho, Michael H.; Hersh, Craig P.; Lomas, David A.; Miller, Bruce E.; Kong, Xiangyang; Bakke, Per; Gulsvik, Amund; Agustí, Alvar; Wouters, Emiel; Celli, Bartolome; Coxson, Harvey; Vestbo, Jørgen; MacNee, William; Yates, Julie C.; Rennard, Stephen; Litonjua, Augusto; Qiu, Weiliang; Beaty, Terri H.; Crapo, James D.; Riley, John H.; Tal-Singer, Ruth

    2012-01-01

    Rationale: A genome-wide association study (GWAS) for circulating chronic obstructive pulmonary disease (COPD) biomarkers could identify genetic determinants of biomarker levels and COPD susceptibility. Objectives: To identify genetic variants of circulating protein biomarkers and novel genetic determinants of COPD. Methods: GWAS was performed for two pneumoproteins, Clara cell secretory protein (CC16) and surfactant protein D (SP-D), and five systemic inflammatory markers (C-reactive protein, fibrinogen, IL-6, IL-8, and tumor necrosis factor-α) in 1,951 subjects with COPD. For genome-wide significant single nucleotide polymorphisms (SNPs) (P < 1 × 10−8), association with COPD susceptibility was tested in 2,939 cases with COPD and 1,380 smoking control subjects. The association of candidate SNPs with mRNA expression in induced sputum was also elucidated. Measurements and Main Results: Genome-wide significant susceptibility loci affecting biomarker levels were found only for the two pneumoproteins. Two discrete loci affecting CC16, one region near the CC16 coding gene (SCGB1A1) on chromosome 11 and another locus approximately 25 Mb away from SCGB1A1, were identified, whereas multiple SNPs on chromosomes 6 and 16, in addition to SNPs near SFTPD, had genome-wide significant associations with SP-D levels. Several SNPs affecting circulating CC16 levels were significantly associated with sputum mRNA expression of SCGB1A1 (P = 0.009–0.03). Several SNPs highly associated with CC16 or SP-D levels were nominally associated with COPD in a collaborative GWAS (P = 0.001–0.049), although these COPD associations were not replicated in two additional cohorts. Conclusions: Distant genetic loci and biomarker-coding genes affect circulating levels of COPD-related pneumoproteins. A subset of these protein quantitative trait loci may influence their gene expression in the lung and/or COPD susceptibility. Clinical trial registered with www.clinicaltrials.gov (NCT 00292552). PMID:23144326

  20. Tumor taxonomy for the developmental lineage classification of neoplasms

    PubMed Central

    Berman, Jules J

    2004-01-01

    Background The new "Developmental lineage classification of neoplasms" was described in a prior publication. The classification is simple (the entire hierarchy is described with just 39 classifiers), comprehensive (providing a place for every tumor of man), and consistent with recent attempts to characterize tumors by cytogenetic and molecular features. A taxonomy is a list of the instances that populate a classification. The taxonomy of neoplasia attempts to list every known term for every known tumor of man. Methods The taxonomy provides each concept with a unique code and groups synonymous terms under the same concept. A Perl script validated successive drafts of the taxonomy ensuring that: 1) each term occurs only once in the taxonomy; 2) each term occurs in only one tumor class; 3) each concept code occurs in one and only one hierarchical position in the classification; and 4) the file containing the classification and taxonomy is a well-formed XML (eXtensible Markup Language) document. Results The taxonomy currently contains 122,632 different terms encompassing 5,376 neoplasm concepts. Each concept has, on average, 23 synonyms. The taxonomy populates "The developmental lineage classification of neoplasms," and is available as an XML file, currently 9+ Megabytes in length. A representation of the classification/taxonomy listing each term followed by its code, followed by its full ancestry, is available as a flat-file, 19+ Megabytes in length. The taxonomy is the largest nomenclature of neoplasms, with more than twice the number of neoplasm names found in other medical nomenclatures, including the 2004 version of the Unified Medical Language System, the Systematized Nomenclature of Medicine Clinical Terminology, the National Cancer Institute's Thesaurus, and the International Classification of Diseases Oncolology version. Conclusions This manuscript describes a comprehensive taxonomy of neoplasia that collects synonymous terms under a unique code number and assigns each tumor to a single class within the tumor hierarchy. The entire classification and taxonomy are available as open access files (in XML and flat-file formats) with this article. PMID:15571625

  1. Polymorphism at the defensin gene in the Anopheles gambiae complex: testing different selection hypotheses

    PubMed Central

    Simard, Frédéric; Licht, Monica; Besansky, Nora J.; Lehmann, Tovi

    2007-01-01

    Genetic variation in defensin, a gene encoding a major effector molecule of insects immune response was analyzed within and between populations of three members of the Anopheles gambiae complex. The species selected included the two anthropophilic species, An. gambiae and An. arabiensis and the most zoophilic species of the complex, An. quadriannulatus. The first species was represented by four populations spanning its extreme genetic and geographical ranges, whereas each of the other two species was represented by a single population. We found (i) reduced overall polymorphism in the mature peptide region and in the total coding region, together with specific reductions in rare and moderately frequent mutations (sites) in the coding region compared with non coding regions, (ii) markedly reduced rate of nonsynonymous diversity compared with synonymous variation in the mature peptide and virtually identical mature peptide across the three species, and (iii) increased divergence between species in the mature peptide together with reduced differentiation between populations of An. gambiae in the same DNA region. These patterns suggest a strong purifying selection on the mature peptide and probably the whole coding region. Because An. quadriannulatus is not exposed to human pathogens, identical mature peptide and similar pattern of polymorphism across species implies that human pathogens played no role as selective agents on this peptide. PMID:17161659

  2. Global variation in CYP2C8–CYP2C9 functional haplotypes

    PubMed Central

    Speed, William C; Kang, Soonmo Peter; Tuck, David P; Harris, Lyndsay N; Kidd, Kenneth K

    2009-01-01

    We have studied the global frequency distributions of 10 single nucleotide polymorphisms (SNPs) across 132 kb of CYP2C8 and CYP2C9 in ∼2500 individuals representing 45 populations. Five of the SNPs were in noncoding sequences; the other five involved the more common missense variants (four in CYP2C8, one in CYP2C9) that change amino acids in the gene products. One haplotype containing two CYP2C8 coding variants and one CYP2C9 coding variant reaches an average frequency of 10% in Europe; a set of haplotypes with a different CYP2C8 coding variant reaches 17% in Africa. In both cases these haplotypes are found in other regions of the world at <1%. This considerable geographic variation in haplotype frequencies impacts the interpretation of CYP2C8/CYP2C9 association studies, and has pharmacogenomic implications for drug interactions. PMID:19381162

  3. African ancestry allelic variation at the MYH9 gene contributes to increased susceptibility to non-diabetic end-stage kidney disease in Hispanic Americans

    PubMed Central

    Behar, Doron M.; Rosset, Saharon; Tzur, Shay; Selig, Sara; Yudkovsky, Guennady; Bercovici, Sivan; Kopp, Jeffrey B.; Winkler, Cheryl A.; Nelson, George W.; Wasser, Walter G.; Skorecki, Karl

    2010-01-01

    Recent studies identified MYH9 as a major susceptibility gene for common forms of non-diabetic end-stage kidney disease (ESKD). A set of African ancestry DNA sequence variants comprising the E-1 haplotype, was significantly associated with ESKD. In order to determine whether African ancestry variants are also associated with disease susceptibility in admixed populations with differing genomic backgrounds, we genotyped a total of 1425 African and Hispanic American subjects comprising dialysis patients with diabetic and non-diabetic ESKD and controls, using 42 single nucleotide polymorphisms (SNPs) within the MYH9 gene and 40 genome-wide and 38 chromosome 22 ancestry informative markers. Following ancestry correction, logistic regression demonstrated that three of the E-1 SNPs are also associated with non-diabetic ESKD in the new sample sets of both African and Hispanic Americans, with a stronger association in Hispanic Americans. We also identified MYH9 SNPs that are even more powerfully associated with the disease phenotype than the E-1 SNPs. These newly associated SNPs, could be divided into those comprising a haplotype termed S-1 whose association was significant under a recessive or additive inheritance mode (rs5750248, OR 4.21, P < 0.01, Hispanic Americans, recessive), and those comprising a haplotype termed F-1 whose association was significant under a dominant or additive inheritance mode (rs11912763, OR 4.59, P < 0.01, Hispanic Americans, dominant). These findings strengthen the contention that a sequence variant of MYH9, common in populations with varying degrees of African ancestry admixture, and in strong linkage disequilibrium with the associated SNPs and haplotypes reported herein, strongly predisposes to non-diabetic ESKD. PMID:20144966

  4. African ancestry allelic variation at the MYH9 gene contributes to increased susceptibility to non-diabetic end-stage kidney disease in Hispanic Americans.

    PubMed

    Behar, Doron M; Rosset, Saharon; Tzur, Shay; Selig, Sara; Yudkovsky, Guennady; Bercovici, Sivan; Kopp, Jeffrey B; Winkler, Cheryl A; Nelson, George W; Wasser, Walter G; Skorecki, Karl

    2010-05-01

    Recent studies identified MYH9 as a major susceptibility gene for common forms of non-diabetic end-stage kidney disease (ESKD). A set of African ancestry DNA sequence variants comprising the E-1 haplotype, was significantly associated with ESKD. In order to determine whether African ancestry variants are also associated with disease susceptibility in admixed populations with differing genomic backgrounds, we genotyped a total of 1425 African and Hispanic American subjects comprising dialysis patients with diabetic and non-diabetic ESKD and controls, using 42 single nucleotide polymorphisms (SNPs) within the MYH9 gene and 40 genome-wide and 38 chromosome 22 ancestry informative markers. Following ancestry correction, logistic regression demonstrated that three of the E-1 SNPs are also associated with non-diabetic ESKD in the new sample sets of both African and Hispanic Americans, with a stronger association in Hispanic Americans. We also identified MYH9 SNPs that are even more powerfully associated with the disease phenotype than the E-1 SNPs. These newly associated SNPs, could be divided into those comprising a haplotype termed S-1 whose association was significant under a recessive or additive inheritance mode (rs5750248, OR 4.21, P < 0.01, Hispanic Americans, recessive), and those comprising a haplotype termed F-1 whose association was significant under a dominant or additive inheritance mode (rs11912763, OR 4.59, P < 0.01, Hispanic Americans, dominant). These findings strengthen the contention that a sequence variant of MYH9, common in populations with varying degrees of African ancestry admixture, and in strong linkage disequilibrium with the associated SNPs and haplotypes reported herein, strongly predisposes to non-diabetic ESKD.

  5. Evolution of Synonymous Codon Usage in Neurospora tetrasperma and Neurospora discreta

    PubMed Central

    Whittle, C. A.; Sun, Y.; Johannesson, H.

    2011-01-01

    Neurospora comprises a primary model system for the study of fungal genetics and biology. In spite of this, little is known about genome evolution in Neurospora. For example, the evolution of synonymous codon usage is largely unknown in this genus. In the present investigation, we conducted a comprehensive analysis of synonymous codon usage and its relationship to gene expression and gene length (GL) in Neurospora tetrasperma and Neurospora discreta. For our analysis, we examined codon usage among 2,079 genes per organism and assessed gene expression using large-scale expressed sequenced tag (EST) data sets (279,323 and 453,559 ESTs for N. tetrasperma and N. discreta, respectively). Data on relative synonymous codon usage revealed 24 codons (and two putative codons) that are more frequently used in genes with high than with low expression and thus were defined as optimal codons. Although codon-usage bias was highly correlated with gene expression, it was independent of selectively neutral base composition (introns); thus demonstrating that translational selection drives synonymous codon usage in these genomes. We also report that GL (coding sequences [CDS]) was inversely associated with optimal codon usage at each gene expression level, with highly expressed short genes having the greatest frequency of optimal codons. Optimal codon frequency was moderately higher in N. tetrasperma than in N. discreta, which might be due to variation in selective pressures and/or mating systems. PMID:21402862

  6. Association of Germline CHEK2 Gene Variants with Risk and Prognosis of Non-Hodgkin Lymphoma

    PubMed Central

    Havranek, Ondrej; Kleiblova, Petra; Hojny, Jan; Lhota, Filip; Soucek, Pavel; Trneny, Marek; Kleibl, Zdenek

    2015-01-01

    The checkpoint kinase 2 gene (CHEK2) codes for the CHK2 protein, an important mediator of the DNA damage response pathway. The CHEK2 gene has been recognized as a multi-cancer susceptibility gene; however, its role in non-Hodgkin lymphoma (NHL) remains unclear. We performed mutation analysis of the entire CHEK2 coding sequence in 340 NHL patients using denaturing high-performance liquid chromatography (DHPLC) and multiplex ligation-dependent probe amplification (MLPA). Identified hereditary variants were genotyped in 445 non-cancer controls. The influence of CHEK2 variants on disease risk was statistically evaluated. Identified CHEK2 germline variants included four truncating mutations (found in five patients and no control; P = 0.02) and nine missense variants (found in 21 patients and 12 controls; P = 0.02). Carriers of non-synonymous variants had an increased risk of NHL development [odds ratio (OR) 2.86; 95% confidence interval (CI) 1.42–5.79] and an unfavorable prognosis [hazard ratio (HR) of progression-free survival (PFS) 2.1; 95% CI 1.12–4.05]. In contrast, the most frequent intronic variant c.319+43dupA (identified in 22% of patients and 31% of controls) was associated with a decreased NHL risk (OR = 0.62; 95% CI 0.45–0.86), but its positive prognostic effect was limited to NHL patients with diffuse large B-cell lymphoma (DLBCL) treated by conventional chemotherapy without rituximab (HR-PFS 0.4; 94% CI 0.17–0.74). Our results show that germ-line CHEK2 mutations affecting protein coding sequence confer a moderately-increased risk of NHL, they are associated with an unfavorable NHL prognosis, and they may represent a valuable predictive biomarker for patients with DLBCL. PMID:26506619

  7. Association of Germline CHEK2 Gene Variants with Risk and Prognosis of Non-Hodgkin Lymphoma.

    PubMed

    Havranek, Ondrej; Kleiblova, Petra; Hojny, Jan; Lhota, Filip; Soucek, Pavel; Trneny, Marek; Kleibl, Zdenek

    2015-01-01

    The checkpoint kinase 2 gene (CHEK2) codes for the CHK2 protein, an important mediator of the DNA damage response pathway. The CHEK2 gene has been recognized as a multi-cancer susceptibility gene; however, its role in non-Hodgkin lymphoma (NHL) remains unclear. We performed mutation analysis of the entire CHEK2 coding sequence in 340 NHL patients using denaturing high-performance liquid chromatography (DHPLC) and multiplex ligation-dependent probe amplification (MLPA). Identified hereditary variants were genotyped in 445 non-cancer controls. The influence of CHEK2 variants on disease risk was statistically evaluated. Identified CHEK2 germline variants included four truncating mutations (found in five patients and no control; P = 0.02) and nine missense variants (found in 21 patients and 12 controls; P = 0.02). Carriers of non-synonymous variants had an increased risk of NHL development [odds ratio (OR) 2.86; 95% confidence interval (CI) 1.42-5.79] and an unfavorable prognosis [hazard ratio (HR) of progression-free survival (PFS) 2.1; 95% CI 1.12-4.05]. In contrast, the most frequent intronic variant c.319+43dupA (identified in 22% of patients and 31% of controls) was associated with a decreased NHL risk (OR = 0.62; 95% CI 0.45-0.86), but its positive prognostic effect was limited to NHL patients with diffuse large B-cell lymphoma (DLBCL) treated by conventional chemotherapy without rituximab (HR-PFS 0.4; 94% CI 0.17-0.74). Our results show that germ-line CHEK2 mutations affecting protein coding sequence confer a moderately-increased risk of NHL, they are associated with an unfavorable NHL prognosis, and they may represent a valuable predictive biomarker for patients with DLBCL.

  8. [Polymorphisms of inhibin α gene exon 1 in buffalo (Bubalus bubalis), gayal (Bos frontalis) and yak (Bos grunniens)].

    PubMed

    Miao, Yong-Wang; Ha, Fu; Gao, Hua-Shan; Yuan, Feng; Li, Da-Lin; Yuan, Yue-Yun

    2012-08-01

    To elucidate the genetic characteristics of the bovine Inhibin α subunit (INHA) gene, the polymorphisms in exon 1 of INHA and its bilateral sequences were assayed using PCR with direct sequencing in buffalo, gayal and yak. A comparative analysis was conducted by pooled the results in this study with the published data of INHA on some mammals including some bovine species together. A synonymous substitution c.73C>A was identified in exon 1 of INHA for buffalo, which results in identical encoding product in river and swamp buffalo. In gayal, two non-synonymous but same property substitutions in exon 1 of INHA, viz. c.62 C>T and c.187 G>A, were detected, which lead to p. P21L, p. V63M changes in INHA, respectively. In yak, nucleotide substitution c.62C> T, c.129A>G were found in exon 1 of INHA, the former still causes p. P21L substitution and the latter is synonymous. For the sequence of the 5'-flanking region of INHA examined, no SNPs were found within the species, but a substitution, c. -6T>G, was found. The nucleotide in this site in gayal, yak and cattle was c. -6G, whereas in buffalo it was c. -6T. Meanwhile, a 6-bp deletion, namely c. 262+31_262+36delTCTGAC, was found in the intron of buffalo INHA gene. For this deletion, wild types (+/+) account for main part in river buffalo while mutant types (-/-) are predominant in swamp buffalo. This deletion was not found in gayal, yak and cattle, though these all have another deletion in the intron of INHA, c. 262+78_262+79delTG. The results of sequence alignment showed that the substitutions c. 43A and c. 67G in exon 1 of INHA are specific to buffalo, whereas the substitutions c. 173A and c. 255G are exclusive to gayal, yak and cattle, and c. 24C, c. 47G, c. 174T and c. 206T are specific to goat. Furthermore, there are few differences among gayal, yak and cattle, but there relatively great differences between buffalo, goat and other bovine species regarding the sequences of INHA exon 1.

  9. Association Analysis of the Ephrin-B2 Gene in African-Americans with End-Stage Renal Disease

    PubMed Central

    Hicks, Pamela J.; Staten, Jennifer L.; Palmer, Nicholette D.; Langefeld, Carl D.; Ziegler, Julie T.; Keene, Keith L.; Sale, Michele M.; Bowden, Donald W.; Freedman, Barry I.

    2008-01-01

    Background Genome scans in African-Americans with end-stage renal disease (ESRD) identified linkage on chromosome 13q33 in the region containing the ephrin-B2 ligand (EFNB2) genes. Interactions between the ephrin-B2 receptor and ephrin-B2 ligand play essential roles in renal angiogenesis, blood vessel maturation, and kidney disease. Methods The EFNB2 gene was evaluated as a positional candidate for non-diabetic and diabetic ESRD susceptibility in 1,071 unrelated African-American subjects; 316 with non-diabetic etiologies of ESRD, 394 with type 2 diabetes-associated ESRD and 361 healthy controls. Single nucleotide polymorphism (SNP) genotyping was performed on the Sequenom Mass Array System. Statistical analyses were computed using Dandelion version 1.26, Snpaddmix version 1.4 and Haploview version 3.32. Results Twenty-eight HapMap tag SNPs were genotyped spanning the 39 kilobases (kb) of the EFNB2 coding region, with average spacing of 1.43 kb. Analysis of 710 ESRD patient samples and 361 controls provided no evidence of single SNP associations in either diabetic or non-diabetic ESRD; although nominal evidence of association with all-cause ESRD was observed with a two SNP (p = 0.022) and three SNP (p = 0.023) haplotype, both containing SNPs rs7490924 and rs2391335 in intron 1. Conclusions Although an attractive positional candidate gene, polymorphisms in the EFNB2 gene do not appear to contribute in a substantial way to non-diabetic, diabetic or all-cause ESRD susceptibility in African-Americans. Additional genes within the chromosome 13q33 linkage interval are likely contributors to African-American non-diabetic ESRD. PMID:18580054

  10. Evaluating the transferability of 15 European-derived fasting plasma glucose SNPs in Mexican children and adolescents

    PubMed Central

    Langlois, Christine; Abadi, Arkan; Peralta-Romero, Jesus; Alyass, Akram; Suarez, Fernando; Gomez-Zamudio, Jaime; Burguete-Garcia, Ana I.; Yazdi, Fereshteh T.; Cruz, Miguel; Meyre, David

    2016-01-01

    Genome wide association studies (GWAS) have identified single-nucleotide polymorphisms (SNPs) that are associated with fasting plasma glucose (FPG) in adult European populations. The contribution of these SNPs to FPG in non-Europeans and children is unclear. We studied the association of 15 GWAS SNPs and a genotype score (GS) with FPG and 7 metabolic traits in 1,421 Mexican children and adolescents from Mexico City. Genotyping of the 15 SNPs was performed using TaqMan Open Array. We used multivariate linear regression models adjusted for age, sex, body mass index standard deviation score, and recruitment center. We identified significant associations between 3 SNPs (G6PC2 (rs560887), GCKR (rs1260326), MTNR1B (rs10830963)), the GS and FPG level. The FPG risk alleles of 11 out of the 15 SNPs (73.3%) displayed significant or non-significant beta values for FPG directionally consistent with those reported in adult European GWAS. The risk allele frequencies for 11 of 15 (73.3%) SNPs differed significantly in Mexican children and adolescents compared to European adults from the 1000G Project, but no significant enrichment in FPG risk alleles was observed in the Mexican population. Our data support a partial transferability of European GWAS FPG association signals in children and adolescents from the admixed Mexican population. PMID:27782183

  11. Genetics of Inflammatory Bowel Diseases

    PubMed Central

    McGovern, Dermot; Kugathasan, Subra; Cho, Judy H.

    2015-01-01

    In this Review, we provide an update on genome-wide association studies (GWAS) in inflammatory bowel disease (IBD). In addition, we summarize progress in defining the functional consequences of associated alleles for coding and non-coding genetic variation. In the small minority of loci where major association signals correspond to non-synonymous variation, we summarize studies defining their functional effects and implications for therapeutic targeting. Importantly, the large majority of GWAS-associated loci involve non-coding variation, many of which modulate levels of gene expression. Recent expression quantitative trait loci (eQTL) studies have established that expression of the large majority of human genes is regulated by non-coding genetic variation. Significant advances in defining the epigenetic landscape have demonstrated that IBD GWAS signals are highly enriched within cell-specific active enhancer marks. Studies in European ancestry populations have dominated the landscape of IBD genetics studies, but increasingly, studies in Asian and African-American populations are being reported. Common variation accounts for only a modest fraction of the predicted heritability and the role of rare genetic variation of higher effects (i.e. odds ratios markedly deviating from one) is increasingly being identified through sequencing efforts. These sequencing studies have been particularly productive in very-early onset, more severe cases. A major challenge in IBD genetics will be harnessing the vast array of genetic discovery for clinical utility, through emerging precision medicine initiatives. We discuss the rapidly evolving area of direct to consumer genetic testing, as well as the current utility of clinical exome sequencing, especially in very early onset, severe IBD cases. We summarize recent progress in the pharmacogenetics of IBD with respect of partitioning patient responses to anti-TNF and thiopurine therapies. Highly collaborative studies across research centers and across subspecialties and disciplines will be required to fully realize the promise of genetic discovery in IBD. PMID:26255561

  12. Insights into HLA-G Genetics Provided by Worldwide Haplotype Diversity

    PubMed Central

    Castelli, Erick C.; Ramalho, Jaqueline; Porto, Iane O. P.; Lima, Thálitta H. A.; Felício, Leandro P.; Sabbagh, Audrey; Donadi, Eduardo A.; Mendes-Junior, Celso T.

    2014-01-01

    Human leukocyte antigen G (HLA-G) belongs to the family of non-classical HLA class I genes, located within the major histocompatibility complex (MHC). HLA-G has been the target of most recent research regarding the function of class I non-classical genes. The main features that distinguish HLA-G from classical class I genes are (a) limited protein variability, (b) alternative splicing generating several membrane bound and soluble isoforms, (c) short cytoplasmic tail, (d) modulation of immune response (immune tolerance), and (e) restricted expression to certain tissues. In the present work, we describe the HLA-G gene structure and address the HLA-G variability and haplotype diversity among several populations around the world, considering each of its major segments [promoter, coding, and 3′ untranslated region (UTR)]. For this purpose, we developed a pipeline to reevaluate the 1000Genomes data and recover miscalled or missing genotypes and haplotypes. It became clear that the overall structure of the HLA-G molecule has been maintained during the evolutionary process and that most of the variation sites found in the HLA-G coding region are either coding synonymous or intronic mutations. In addition, only a few frequent and divergent extended haplotypes are found when the promoter, coding, and 3′UTRs are evaluated together. The divergence is particularly evident for the regulatory regions. The population comparisons confirmed that most of the HLA-G variability has originated before human dispersion from Africa and that the allele and haplotype frequencies have probably been shaped by strong selective pressures. PMID:25339953

  13. Problem-Solving Test: The Effect of Synonymous Codons on Gene Expression

    ERIC Educational Resources Information Center

    Szeberenyi, Jozsef

    2009-01-01

    Terms to be familiar with before you start to solve the test: the genetic code, codon, degenerate codons, protein synthesis, aminoacyl-tRNA, anticodon, antiparallel orientation, wobble, unambiguous codons, ribosomes, initiation, elongation and termination of translation, peptidyl transferase, translocation, degenerate oligonucleotides, green…

  14. Low-phosphate-selected Auxenochlorella protothecoides redirects phosphate to essential pathways while producing more biomass

    PubMed Central

    Park, Sang-Hyuck; Kyndt, John; Chougule, Kapeel; Park, Jeong-Jin

    2018-01-01

    Despite the capacity to accumulate ~70% w/w of lipids, commercially produced unicellular green alga A. protothecoides may become compromised due to the high cost of phosphate fertilizers. To address this limitation A. protothecoides was selected for adaptation to conditions of 100× and 5× lower phosphate and peptone, respectively, compared to ‘wild-type media’. The A. protothecoides showed initial signs of adaptation by 45–50 days, and steady state growth at ~100 days. The low phosphate (P)-adapted strain produced up to ~30% greater biomass, while total lipids (~10% w/w) remained about the same, compared to the wild-type strain. Metabolomic analyses indicated that the low P-adapted produced 3.3-fold more saturated palmitic acid (16:0) and 2.2-fold less linolenic acid (18:3), compared to the wild-type strain, resulting in an ~11% increase in caloric value, from 19.5kJ/g for the wild-type strain to 21.6kJ/g for the low P-adapted strain, due to the amounts and composition of certain saturated fatty acids, compared to the wild type strain. Biochemical changes in A. protothecoides adapted to lower phosphate conditions were assessed by comparative RNA-Seq analysis, which yielded 27,279 transcripts. Among them, 2,667 and 15 genes were significantly down- and up-regulated, at >999-fold and >3-fold (adjusted p-value <0.1), respectively. The expression of genes encoding proteins involved in cellular processes such as division, growth, and membrane biosynthesis, showed a trend toward down-regulation. At the genomic level, synonymous SNPs and Indels were observed primarily in coding regions, with the 40S ribosomal subunit gene harboring substantial SNPs. Overall, the adapted strain out-performed the wild-type strain by prioritizing the use of its limited phosphate supply for essential biological processes. The low P-adapted A. protothecoides is expected to be more economical to grow over the wild-type strain, based on overall greater productivity and caloric content, while importantly, also requiring 100-fold less phosphate. PMID:29920531

  15. Genetic variants in the mTOR pathway and interaction with body size and weight gain on breast cancer risk in African-American and European American women.

    PubMed

    Cheng, Ting-Yuan David; Shankar, Jyoti; Zirpoli, Gary; Roberts, Michelle R; Hong, Chi-Chen; Bandera, Elisa V; Ambrosone, Christine B; Yao, Song

    2016-08-01

    Positive energy imbalance and growth factors linked to obesity promote the phosphatidylinositol 3-kinase/AKT/mammalian target of rapamycin (mTOR) pathway. As the obesity-breast cancer associations differ between European American (EA) and African-American (AA) women, we investigated genetic variants in the mTOR pathway and breast cancer risk in these two racial groups. We examined 400 single-nucleotide polymorphisms (SNPs) in 31 mTOR pathway genes in the Women's Circle of Health Study with 1263 incident breast cancers (645 EA, 618 AA) and 1382 controls (641 EA, 741 AA). Multivariable logistic regression was performed separately within racial groups. Effect modification was assessed for measured body size and weight gain since age 20. In EA women, variants in FRAP1 rs12125777 (intron), PRR5L rs3740958 (synonymous coding), and CDKAL1 rs9368197 (intron) were associated with increased breast cancer risk, while variants in RPTOR rs9900506 (intron) were associated with decreased risk (nominal p-trend for functional and FRAP1 SNPs or p adjusted for correlated test [p ACT] < 0.05). For AA women, variants in RPTOR rs3817293 (intron), PIK3R1 rs7713645 (intron), and CDKAL1 rs9368197 were associated with decreased breast cancer risk. The significance for FRAP1 rs12125777 and RPTOR rs9900506 in EA women did not hold after correction for multiple comparisons. The risk associated with FRAP1 rs12125777 was higher among EAs who had body mass index ≥30 kg/m(2) (odds ratio = 7.69, 95 % CI 2.11-28.0; p-interaction = 0.007) and gained weight ≥35 lb since age 20 (odds ratio = 3.34, 95 % CI 1.42-7.85; p-interaction = 0.021), compared to their counterparts. The mTOR pathway may be involved in breast cancer carcinogenesis differently for EA and AA women.

  16. Haplotype combination of the bovine INSIG1 gene sequence variants and association with growth traits in Nanyang cattle.

    PubMed

    Sun, Jiajie; Gao, Yuan; Liu, Dong; Ma, Wei; Xue, Jing; Zhang, Chunlei; Lan, Xianyong; Lei, Chuzhao; Chen, Hong

    2012-06-01

    The insulin-induced gene 1 (INSIG1) gene encodes a protein that blocks proteolytic activation of sterol regulatory element binding proteins, which are transcription factors that activate genes that regulate cholesterol, fatty acid, and glucose metabolism. However, similar research for the bovine INSIG1 gene is lacking. Therefore, in this study, polymorphisms of the bovine INSIG1 gene were detected in 643 individuals from four cattle breeds by DNA pooling, forced PCR-RFLP, PCR-SSCP, and DNA sequencing methods. Only 10 novel SNPs were identified, which included four mutations in the coding region and the others in the introns. In Nanyang individuals, seven common haplotypes were identified based on four coding region SNPs. The haplotype GACT, with a frequency of 75.4%, was the most prevalent haplotypes and SNPs formed two linkage disequilibrium blocks with strong multi-allelic D' (D' = 1). Additionally, association analysis between mutations of the bovine INSIG1 gene and growth traits in Nanyang cattle at 6, 12, 18, and 24 months old was performed, and the results indicated that the polymorphisms were not significantly associated with body mass.

  17. Novel insertion mutation of ABCB1 gene in an ivermectin-sensitive Border Collie.

    PubMed

    Han, Jae-Ik; Son, Hyoung-Won; Park, Seung-Cheol; Na, Ki-Jeong

    2010-12-01

    P-glycoprotein (P-gp) is encoded by the ABCB1 gene and acts as an efflux pump for xenobiotics. In the Border Collie, a nonsense mutation caused by a 4-base pair deletion in the ABCB1 gene is associated with a premature stop to P-gp synthesis. In this study, we examined the full-length coding sequence of the ABCB1 gene in an ivermectin-sensitive Border Collie that lacked the aforementioned deletion mutation. The sequence was compared to the corresponding sequences of a wild-type Beagle and seven ivermectin-tolerant family members of the Border Collie. When compared to the wild-type Beagle sequence, that of the ivermectin-sensitive Border Collie was found to have one insertion mutation and eight single nucleotide polymorphisms (SNPs) in the coding sequence of the ABCB1 gene. While the eight SNPs were also found in the family members' sequences, the insertion mutation was found only in the ivermectin-sensitive dog. These results suggest the possibility that the SNPs are species-specific features of the ABCB1 gene in Border Collies, and that the insertion mutation may be related to ivermectin intolerance.

  18. Novel insertion mutation of ABCB1 gene in an ivermectin-sensitive Border Collie

    PubMed Central

    Han, Jae-Ik; Son, Hyoung-Won; Park, Seung-Cheol

    2010-01-01

    P-glycoprotein (P-gp) is encoded by the ABCB1 gene and acts as an efflux pump for xenobiotics. In the Border Collie, a nonsense mutation caused by a 4-base pair deletion in the ABCB1 gene is associated with a premature stop to P-gp synthesis. In this study, we examined the full-length coding sequence of the ABCB1 gene in an ivermectin-sensitive Border Collie that lacked the aforementioned deletion mutation. The sequence was compared to the corresponding sequences of a wild-type Beagle and seven ivermectin-tolerant family members of the Border Collie. When compared to the wild-type Beagle sequence, that of the ivermectin-sensitive Border Collie was found to have one insertion mutation and eight single nucleotide polymorphisms (SNPs) in the coding sequence of the ABCB1 gene. While the eight SNPs were also found in the family members' sequences, the insertion mutation was found only in the ivermectin-sensitive dog. These results suggest the possibility that the SNPs are species-specific features of the ABCB1 gene in Border Collies, and that the insertion mutation may be related to ivermectin intolerance. PMID:21113104

  19. Evidence of translation efficiency adaptation of the coding regions of the bacteriophage lambda.

    PubMed

    Goz, Eli; Mioduser, Oriah; Diament, Alon; Tuller, Tamir

    2017-08-01

    Deciphering the way gene expression regulatory aspects are encoded in viral genomes is a challenging mission with ramifications related to all biomedical disciplines. Here, we aimed to understand how the evolution shapes the bacteriophage lambda genes by performing a high resolution analysis of ribosomal profiling data and gene expression related synonymous/silent information encoded in bacteriophage coding regions.We demonstrated evidence of selection for distinct compositions of synonymous codons in early and late viral genes related to the adaptation of translation efficiency to different bacteriophage developmental stages. Specifically, we showed that evolution of viral coding regions is driven, among others, by selection for codons with higher decoding rates; during the initial/progressive stages of infection the decoding rates in early/late genes were found to be superior to those in late/early genes, respectively. Moreover, we argued that selection for translation efficiency could be partially explained by adaptation to Escherichia coli tRNA pool and the fact that it can change during the bacteriophage life cycle.An analysis of additional aspects related to the expression of viral genes, such as mRNA folding and more complex/longer regulatory signals in the coding regions, is also reported. The reported conclusions are likely to be relevant also to additional viruses. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  20. The Evolution of Vp1 Gene in Enterovirus C Species Sub-Group That Contains Types CVA-21, CVA-24, EV-C95, EV-C96 and EV-C99

    PubMed Central

    Smura, Teemu; Blomqvist, Soile; Vuorinen, Tytti; Ivanova, Olga; Samoilovich, Elena; Al-Hello, Haider; Savolainen-Kopra, Carita; Hovi, Tapani; Roivainen, Merja

    2014-01-01

    Genus Enterovirus (Family Picornaviridae,) consists of twelve species divided into genetically diverse types by their capsid protein VP1 coding sequences. Each enterovirus type can further be divided into intra-typic sub-clusters (genotypes). The aim of this study was to elucidate what leads to the emergence of novel enterovirus clades (types and genotypes). An evolutionary analysis was conducted for a sub-group of Enterovirus C species that contains types Coxsackievirus A21 (CVA-21), CVA-24, Enterovirus C95 (EV-C95), EV-C96 and EV-C99. VP1 gene datasets were collected and analysed to infer the phylogeny, rate of evolution, nucleotide and amino acid substitution patterns and signs of selection. In VP1 coding gene, high intra-typic sequence diversities and robust grouping into distinct genotypes within each type were detected. Within each type the majority of nucleotide substitutions were synonymous and the non-synonymous substitutions tended to cluster in distinct highly polymorphic sites. Signs of positive selection were detected in some of these highly polymorphic sites, while strong negative selection was indicated in most of the codons. Despite robust clustering to intra-typic genotypes, only few genotype-specific ‘signature’ amino acids were detected. In contrast, when different enterovirus types were compared, there was a clear tendency towards fixation of type-specific ‘signature’ amino acids. The results suggest that permanent fixation of type-specific amino acids is a hallmark associated with evolution of different enterovirus types, whereas neutral evolution and/or (frequency-dependent) positive selection in few highly polymorphic amino acid sites are the dominant forms of evolution when strains within an enterovirus type are compared. PMID:24695547

  1. The evolution of Vp1 gene in enterovirus C species sub-group that contains types CVA-21, CVA-24, EV-C95, EV-C96 and EV-C99.

    PubMed

    Smura, Teemu; Blomqvist, Soile; Vuorinen, Tytti; Ivanova, Olga; Samoilovich, Elena; Al-Hello, Haider; Savolainen-Kopra, Carita; Hovi, Tapani; Roivainen, Merja

    2014-01-01

    Genus Enterovirus (Family Picornaviridae,) consists of twelve species divided into genetically diverse types by their capsid protein VP1 coding sequences. Each enterovirus type can further be divided into intra-typic sub-clusters (genotypes). The aim of this study was to elucidate what leads to the emergence of novel enterovirus clades (types and genotypes). An evolutionary analysis was conducted for a sub-group of Enterovirus C species that contains types Coxsackievirus A21 (CVA-21), CVA-24, Enterovirus C95 (EV-C95), EV-C96 and EV-C99. VP1 gene datasets were collected and analysed to infer the phylogeny, rate of evolution, nucleotide and amino acid substitution patterns and signs of selection. In VP1 coding gene, high intra-typic sequence diversities and robust grouping into distinct genotypes within each type were detected. Within each type the majority of nucleotide substitutions were synonymous and the non-synonymous substitutions tended to cluster in distinct highly polymorphic sites. Signs of positive selection were detected in some of these highly polymorphic sites, while strong negative selection was indicated in most of the codons. Despite robust clustering to intra-typic genotypes, only few genotype-specific 'signature' amino acids were detected. In contrast, when different enterovirus types were compared, there was a clear tendency towards fixation of type-specific 'signature' amino acids. The results suggest that permanent fixation of type-specific amino acids is a hallmark associated with evolution of different enterovirus types, whereas neutral evolution and/or (frequency-dependent) positive selection in few highly polymorphic amino acid sites are the dominant forms of evolution when strains within an enterovirus type are compared.

  2. Monoamine oxidase A gene polymorphisms and enzyme activity associated with risk of gout in Taiwan aborigines.

    PubMed

    Tu, Hung-Pin; Ko, Albert Min-Shan; Wang, Shu-Jung; Lee, Chien-Hung; Lea, Rod A; Chiang, Shang-Lun; Chiang, Hung-Che; Wang, Tsu-Nai; Huang, Meng-Chuan; Ou, Tsan-Teng; Lin, Gau-Tyan; Ko, Ying-Chin

    2010-02-01

    Taiwanese aborigines have a high prevalence of hyperuricemia and gout. Uric acid levels and urate excretion have correlated with dopamine-induced glomerular filtration response. MAOs represent one of the major renal dopamine metabolic pathways. We aimed to identify the monoamine oxidase A (MAOA, Xp11.3) gene variants and MAO-A enzyme activity associated with gout risk. This study was to investigate the association between gout and the MAOA single-nucleotide polymorphisms (SNPs) rs5953210, rs2283725, and rs1137070 as well as between gout and the COMT SNPs rs4680 Val158Met for 374 gout cases and 604 controls. MAO-A activity was also measured. All three MAOA SNPs were significantly associated with gout. A synonymous MAOA SNP, rs1137070 Asp470Asp, located in exon 14, was associated with the risk of having gout (P = 4.0 x 10(-5), adjusted odds ratio 1.46, 95% confidence intervals [CI]: 1.11-1.91). We also showed that, when compared to individuals with the MAOA GAT haplotype, carriers of the AGC haplotype had a 1.67-fold (95% CI: 1.28-2.17) higher risk of gout. Moreover, we found that MAOA enzyme activity correlated positively with hyperuricemia and gout (P for trend = 2.00 x 10(-3) vs. normal control). We also found that MAOA enzyme activity by rs1137070 allele was associated with hyperuricemia and gout (P for trend = 1.53 x 10(-6) vs. wild-type allele). Thus, our results show that some MAOA alleles, which have a higher enzyme activity, predispose to the development of gout.

  3. Leptin and leptin-related gene polymorphisms, obesity, and influenza A/H1N1 vaccine-induced immune responses in older individuals.

    PubMed

    Ovsyannikova, Inna G; White, Sarah J; Larrabee, Beth R; Grill, Diane E; Jacobson, Robert M; Poland, Gregory A

    2014-02-07

    Obesity is a risk factor for complicated influenza A/H1N1 disease and poor vaccine immunogenicity. Leptin, an adipocyte-derived hormone/cytokine, has many immune regulatory functions and therefore could explain susceptibility to infections and poor vaccine outcomes. We recruited 159 healthy adults (50-74 years old) who were immunized with inactivated TIV influenza vaccine that contained A/California/7/2009/H1N1 virus. We found a strong correlation between leptin concentration and BMI (r=0.55, p<0.0001), but no association with hemagglutination antibody inhibition (HAI), B-cell, or granzyme B responses. We found a slight correlation between leptin concentration and an immunosenescence marker (TREC: T-cell receptor excision circles) level (r=0.23, p=0.01). We found eight SNPs in the LEP/LEPR/GHRL genes that were associated with leptin levels and four SNPs in the PTPN1/LEPR/STAT3 genes associated with peripheral blood TREC levels (p<0.05). Heterozygosity of the synonymous variant rs2230604 in the PTPN1 gene was associated with a significantly lower (531 vs. 259, p=0.005) TREC level, as compared to the homozygous major variant. We also found eight SNPs in the LEP/PPARG/CRP genes associated with variations in influenza-specific HAI and B-cell responses (p<0.05). Our results suggest that specific allelic variations in the leptin-related genes may influence adaptive immune responses to influenza vaccine. Copyright © 2013 Elsevier Ltd. All rights reserved.

  4. Structure and variation of three canine genes involved in serotonin binding and transport: the serotonin receptor 1A gene (htr1A), serotonin receptor 2A gene (htr2A), and serotonin transporter gene (slc6A4).

    PubMed

    van den Berg, L; Kwant, L; Hestand, M S; van Oost, B A; Leegwater, P A J

    2005-01-01

    Aggressive behavior is the most frequently encountered behavioral problem in dogs. Abnormalities in brain serotonin metabolism have been described in aggressive dogs. We studied canine serotonergic genes to investigate genetic factors underlying canine aggression. Here, we describe the characterization of three genes of the canine serotonergic system: the serotonin receptor 1A and 2A gene (htr1A and htr2A) and the serotonin transporter gene (slc6A4). We isolated canine bacterial artificial chromosome clones containing these genes and designed oligonucleotides for genomic sequencing of coding regions and intron-exon boundaries. Golden retrievers were analyzed for DNA sequence variations. We found two nonsynonymous single nucleotide polymorphisms (SNPs) in the coding sequence of htr1A; one SNP close to a splice site in htr2A; and two SNPs in slc6A4, one in the coding sequence and one close to a splice site. In addition, we identified a polymorphic microsatellite marker for each gene. Htr1A is a strong candidate for involvement in the domestication of the dog. We genotyped the htr1A SNPs in 41 dogs of seven breeds with diverse behavioral characteristics. At least three SNP haplotypes were found. Our results do not support involvement of the gene in domestication.

  5. Development and evaluation of high-density Axiom® CicerSNP Array for high-resolution genetic mapping and breeding applications in chickpea.

    PubMed

    Roorkiwal, Manish; Jain, Ankit; Kale, Sandip M; Doddamani, Dadakhalandar; Chitikineni, Annapurna; Thudi, Mahendar; Varshney, Rajeev K

    2018-04-01

    To accelerate genomics research and molecular breeding applications in chickpea, a high-throughput SNP genotyping platform 'Axiom ® CicerSNP Array' has been designed, developed and validated. Screening of whole-genome resequencing data from 429 chickpea lines identified 4.9 million SNPs, from which a subset of 70 463 high-quality nonredundant SNPs was selected using different stringent filter criteria. This was further narrowed down to 61 174 SNPs based on p-convert score ≥0.3, of which 50 590 SNPs could be tiled on array. Among these tiled SNPs, a total of 11 245 SNPs (22.23%) were from the coding regions of 3673 different genes. The developed Axiom ® CicerSNP Array was used for genotyping two recombinant inbred line populations, namely ICCRIL03 (ICC 4958 × ICC 1882) and ICCRIL04 (ICC 283 × ICC 8261). Genotyping data reflected high success and polymorphic rate, with 15 140 (29.93%; ICCRIL03) and 20 018 (39.57%; ICCRIL04) polymorphic SNPs. High-density genetic maps comprising 13 679 SNPs spanning 1033.67 cM and 7769 SNPs spanning 1076.35 cM were developed for ICCRIL03 and ICCRIL04 populations, respectively. QTL analysis using multilocation, multiseason phenotyping data on these RILs identified 70 (ICCRIL03) and 120 (ICCRIL04) main-effect QTLs on genetic map. Higher precision and potential of this array is expected to advance chickpea genetics and breeding applications. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  6. Elevated Rate of Fixation of Endogenous Retroviral Elements in Haplorhini TRIM5 and TRIM22 Genomic Sequences: Impact on Transcriptional Regulation

    PubMed Central

    Diehl, William E.; Johnson, Welkin E.; Hunter, Eric

    2013-01-01

    All genes in the TRIM6/TRIM34/TRIM5/TRIM22 locus are type I interferon inducible, with TRIM5 and TRIM22 possessing antiviral properties. Evolutionary studies involving the TRIM6/34/5/22 locus have predominantly focused on the coding sequence of the genes, finding that TRIM5 and TRIM22 have undergone high rates of both non-synonymous nucleotide replacements and in-frame insertions and deletions. We sought to understand if divergent evolutionary pressures on TRIM6/34/5/22 coding regions have selected for modifications in the non-coding regions of these genes and explore whether such non-coding changes may influence the biological function of these genes. The transcribed genomic regions, including the introns, of TRIM6, TRIM34, TRIM5, and TRIM22 from ten Haplorhini primates and one prosimian species were analyzed for transposable element content. In Haplorhini species, TRIM5 displayed an exaggerated interspecies variability, predominantly resulting from changes in the composition of transposable elements in the large first and fourth introns. Multiple lineage-specific endogenous retroviral long terminal repeats (LTRs) were identified in the first intron of TRIM5 and TRIM22. In the prosimian genome, we identified a duplication of TRIM5 with a concomitant loss of TRIM22. The transposable element content of the prosimian TRIM5 genes appears to largely represent the shared Haplorhini/prosimian ancestral state for this gene. Furthermore, we demonstrated that one such differentially fixed LTR provides for species-specific transcriptional regulation of TRIM22 in response to p53 activation. Our results identify a previously unrecognized source of species-specific variation in the antiviral TRIM genes, which can lead to alterations in their transcriptional regulation. These observations suggest that there has existed long-term pressure for exaptation of retroviral LTRs in the non-coding regions of these genes. This likely resulted from serial viral challenges and provided a mechanism for rapid alteration of transcriptional regulation. To our knowledge, this represents the first report of persistent evolutionary pressure for the capture of retroviral LTR insertions. PMID:23516500

  7. Genome-wide interaction study of smoking behavior and non-small cell lung cancer risk in Caucasian population.

    PubMed

    Li, Yafang; Xiao, Xiangjun; Han, Younghun; Gorlova, Olga; Qian, David; Leighl, Natasha; Johansen, Jakob S; Barnett, Matt; Chen, Chu; Goodman, Gary; Cox, Angela; Taylor, Fiona; Woll, Penella; Wichmann, H-Erich; Manz, Judith; Muley, Thomas; Risch, Angela; Rosenberger, Albert; Arnold, Susanne M; Haura, Eric B; Bolca, Ciprian; Holcatova, Ivana; Janout, Vladimir; Kontic, Milica; Lissowska, Jolanta; Mukeria, Anush; Ognjanovic, Simona; Orlowski, Tadeusz M; Scelo, Ghislaine; Swiatkowska, Beata; Zaridze, David; Bakke, Per; Skaug, Vidar; Zienolddiny, Shanbeh; Duell, Eric J; Butler, Lesley M; Houlston, Richard; Soler Artigas, María; Grankvist, Kjell; Johansson, Mikael; Shepherd, Frances A; Marcus, Michael W; Brunnström, Hans; Manjer, Jonas; Melander, Olle; Muller, David C; Overvad, Kim; Trichopoulou, Antonia; Tumino, Rosario; Liu, Geoffrey; Bojesen, Stig E; Wu, Xifeng; Marchand, Loic Le; Albanes, Demetrios; Bickeböller, Heike; Aldrich, Melinda C; Bush, William S; Tardon, Adonina; Rennert, Gad; Teare, M Dawn; Field, John K; Kiemeney, Lambertus A; Lazarus, Philip; Haugen, Aage; Lam, Stephen; Schabath, Matthew B; Andrew, Angeline S; Bertazzi, Pier Alberto; Pesatori, Angela C; Christiani, David C; Caporaso, Neil; Johansson, Mattias; McKay, James D; Brennan, Paul; Hung, Rayjean J; Amos, Christopher I

    2018-03-08

    Non-small cell lung cancer is the most common type of lung cancer. Both environmental and genetic risk factors contribute to lung carcinogenesis. We conducted a genome-wide interaction analysis between single nucleotide polymorphisms (SNPs) and smoking status (never- versus ever-smokers) in a European-descent population. We adopted a two-step analysis strategy in the discovery stage: we first conducted a case-only interaction analysis to assess the relationship between SNPs and smoking behavior using 13336 non-small cell lung cancer cases. Candidate SNPs with P-value <0.001 were further analyzed using a standard case-control interaction analysis including 13970 controls. The significant SNPs with P-value <3.5 × 10-5 (correcting for multiple tests) from the case-control analysis in the discovery stage were further validated using an independent replication dataset comprising 5377 controls and 3054 non-small cell lung cancer cases. We further stratified the analysis by histological subtypes. Two novel SNPs, rs6441286 and rs17723637, were identified for overall lung cancer risk. The interaction odds ratio and meta-analysis P-value for these two SNPs were 1.24 with 6.96 × 10-7 and 1.37 with 3.49 × 10-7, respectively. In addition, interaction of smoking with rs4751674 was identified in squamous cell lung carcinoma with an odds ratio of 0.58 and P-value of 8.12 × 10-7. This study is by far the largest genome-wide SNP-smoking interaction analysis reported for lung cancer. The three identified novel SNPs provide potential candidate biomarkers for lung cancer risk screening and intervention. The results from our study reinforce that gene-smoking interactions play important roles in the etiology of lung cancer and account for part of the missing heritability of this disease.

  8. Epidemiology of angina pectoris: role of natural language processing of the medical record

    PubMed Central

    Pakhomov, Serguei; Hemingway, Harry; Weston, Susan A.; Jacobsen, Steven J.; Rodeheffer, Richard; Roger, Véronique L.

    2007-01-01

    Background The diagnosis of angina is challenging as it relies on symptom descriptions. Natural language processing (NLP) of the electronic medical record (EMR) can provide access to such information contained in free text that may not be fully captured by conventional diagnostic coding. Objective To test the hypothesis that NLP of the EMR improves angina pectoris (AP) ascertainment over diagnostic codes. Methods Billing records of in- and out-patients were searched for ICD-9 codes for AP, chronic ischemic heart disease and chest pain. EMR clinical reports were searched electronically for 50 specific non-negated natural language synonyms to these ICD-9 codes. The two methods were compared to a standardized assessment of angina by Rose questionnaire for three diagnostic levels: unspecified chest pain, exertional chest pain, and Rose angina. Results Compared to the Rose questionnaire, the true positive rate of EMR-NLP for unspecified chest pain was 62% (95%CI:55–67) vs. 51% (95%CI:44–58) for diagnostic codes (p<0.001). For exertional chest pain, the EMR-NLP true positive rate was 71% (95%CI:61–80) vs. 62% (95%CI:52–73) for diagnostic codes (p=0.10). Both approaches had 88% (95%CI:65–100) true positive rate for Rose angina. The EMR-NLP method consistently identified more patients with exertional chest pain over 28-month follow-up. Conclusion EMR-NLP method improves the detection of unspecified and exertional chest pain cases compared to diagnostic codes. These findings have implications for epidemiological and clinical studies of angina pectoris. PMID:17383310

  9. Identification of KCNJ15 as a Susceptibility Gene in Asian Patients with Type 2 Diabetes Mellitus

    PubMed Central

    Okamoto, Koji; Iwasaki, Naoko; Nishimura, Chisa; Doi, Kent; Noiri, Eisei; Nakamura, Shinko; Takizawa, Miho; Ogata, Makiko; Fujimaki, Risa; Grarup, Niels; Pisinger, Charlotta; Borch-Johnsen, Knut; Lauritzen, Torsten; Sandbaek, Annelli; Hansen, Torben; Yasuda, Kazuki; Osawa, Haruhiko; Nanjo, Kishio; Kadowaki, Takashi; Kasuga, Masato; Pedersen, Oluf; Fujita, Toshiro; Kamatani, Naoyuki; Iwamoto, Yasuhiko; Tokunaga, Katsushi

    2010-01-01

    Recent advances in genome research have enabled the identification of new genomic variations that are associated with type 2 diabetes mellitus (T2DM). Via fine mapping of SNPs in a candidate region of chromosome 21q, the current study identifies potassium inwardly-rectifying channel, subfamily J, member 15 (KCNJ15) as a new T2DM susceptibility gene. KCNJ15 is expressed in the β cell of the pancreas, and a synonymous SNP, rs3746876, in exon 4 (C566T) of this gene, with T allele frequency among control subjects of 3.1%, showed a significant association with T2DM affecting lean individuals in three independent Japanese sample sets (p = 2.5 × 10−7, odds ratio [OR] = 2.54, 95% confidence interval [CI] = 1.76–3.67) and with unstratified T2DM (p = 6.7 × 10−6, OR = 1.76, 95% CI = 1.37–2.25). The diabetes risk allele frequency was, however, very low among Europeans in whom no association between this variant and T2DM could be shown. Functional analysis in human embryonic kidney 293 cells demonstrated that the risk allele of the synonymous SNP in exon 4 increased KCNJ15 expression via increased mRNA stability, which resulted in the higher expression of protein as compared to that of the nonrisk allele. We also showed that KCNJ15 is expressed in human pancreatic β cells. In conclusion, we demonstrated a significant association between a synonymous variant in KCNJ15 and T2DM in lean Japanese patients with T2DM, suggesting that KCNJ15 is a previously unreported susceptibility gene for T2DM among Asians. PMID:20085713

  10. In Vitro vs In Silico Detected SNPs for the Development of a Genotyping Array: What Can We Learn from a Non-Model Species?

    PubMed Central

    Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe

    2010-01-01

    Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome. PMID:20543950

  11. CGDSNPdb: a database resource for error-checked and imputed mouse SNPs.

    PubMed

    Hutchins, Lucie N; Ding, Yueming; Szatkiewicz, Jin P; Von Smith, Randy; Yang, Hyuna; de Villena, Fernando Pardo-Manuel; Churchill, Gary A; Graber, Joel H

    2010-07-06

    The Center for Genome Dynamics Single Nucleotide Polymorphism Database (CGDSNPdb) is an open-source value-added database with more than nine million mouse single nucleotide polymorphisms (SNPs), drawn from multiple sources, with genotypes assigned to multiple inbred strains of laboratory mice. All SNPs are checked for accuracy and annotated for properties specific to the SNP as well as those implied by changes to overlapping protein-coding genes. CGDSNPdb serves as the primary interface to two unique data sets, the 'imputed genotype resource' in which a Hidden Markov Model was used to assess local haplotypes and the most probable base assignment at several million genomic loci in tens of strains of mice, and the Affymetrix Mouse Diversity Genotyping Array, a high density microarray with over 600,000 SNPs and over 900,000 invariant genomic probes. CGDSNPdb is accessible online through either a web-based query tool or a MySQL public login. Database URL: http://cgd.jax.org/cgdsnpdb/

  12. Translating natural genetic variation to gene expression in a computational model of the Drosophila gap gene regulatory network

    PubMed Central

    Kozlov, Konstantin N.; Kulakovskiy, Ivan V.; Zubair, Asif; Marjoram, Paul; Lawrie, David S.; Nuzhdin, Sergey V.; Samsonova, Maria G.

    2017-01-01

    Annotating the genotype-phenotype relationship, and developing a proper quantitative description of the relationship, requires understanding the impact of natural genomic variation on gene expression. We apply a sequence-level model of gap gene expression in the early development of Drosophila to analyze single nucleotide polymorphisms (SNPs) in a panel of natural sequenced D. melanogaster lines. Using a thermodynamic modeling framework, we provide both analytical and computational descriptions of how single-nucleotide variants affect gene expression. The analysis reveals that the sequence variants increase (decrease) gene expression if located within binding sites of repressors (activators). We show that the sign of SNP influence (activation or repression) may change in time and space and elucidate the origin of this change in specific examples. The thermodynamic modeling approach predicts non-local and non-linear effects arising from SNPs, and combinations of SNPs, in individual fly genotypes. Simulation of individual fly genotypes using our model reveals that this non-linearity reduces to almost additive inputs from multiple SNPs. Further, we see signatures of the action of purifying selection in the gap gene regulatory regions. To infer the specific targets of purifying selection, we analyze the patterns of polymorphism in the data at two phenotypic levels: the strengths of binding and expression. We find that combinations of SNPs show evidence of being under selective pressure, while individual SNPs do not. The model predicts that SNPs appear to accumulate in the genotypes of the natural population in a way biased towards small increases in activating action on the expression pattern. Taken together, these results provide a systems-level view of how genetic variation translates to the level of gene regulatory networks via combinatorial SNP effects. PMID:28898266

  13. 7 CFR 987.102 - Lot number.

    Code of Federal Regulations, 2010 CFR

    2010-01-01

    ... 7 Agriculture 8 2010-01-01 2010-01-01 false Lot number. 987.102 Section 987.102 Agriculture... RIVERSIDE COUNTY, CALIFORNIA Administrative Rules Definitions § 987.102 Lot number. Lot number is synonymous with code and means a combination of letters or numbers, or both, acceptable to the Committee, showing...

  14. 7 CFR 987.102 - Lot number.

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... 7 Agriculture 8 2011-01-01 2011-01-01 false Lot number. 987.102 Section 987.102 Agriculture... RIVERSIDE COUNTY, CALIFORNIA Administrative Rules Definitions § 987.102 Lot number. Lot number is synonymous with code and means a combination of letters or numbers, or both, acceptable to the Committee, showing...

  15. 7 CFR 987.102 - Lot number.

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... 7 Agriculture 8 2013-01-01 2013-01-01 false Lot number. 987.102 Section 987.102 Agriculture... RIVERSIDE COUNTY, CALIFORNIA Administrative Rules Definitions § 987.102 Lot number. Lot number is synonymous with code and means a combination of letters or numbers, or both, acceptable to the Committee, showing...

  16. 7 CFR 987.102 - Lot number.

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... 7 Agriculture 8 2014-01-01 2014-01-01 false Lot number. 987.102 Section 987.102 Agriculture... RIVERSIDE COUNTY, CALIFORNIA Administrative Rules Definitions § 987.102 Lot number. Lot number is synonymous with code and means a combination of letters or numbers, or both, acceptable to the Committee, showing...

  17. 7 CFR 987.102 - Lot number.

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... 7 Agriculture 8 2012-01-01 2012-01-01 false Lot number. 987.102 Section 987.102 Agriculture... RIVERSIDE COUNTY, CALIFORNIA Administrative Rules Definitions § 987.102 Lot number. Lot number is synonymous with code and means a combination of letters or numbers, or both, acceptable to the Committee, showing...

  18. Positive selection on the killer whale mitogenome.

    PubMed

    Foote, Andrew D; Morin, Phillip A; Durban, John W; Pitman, Robert L; Wade, Paul; Willerslev, Eske; Gilbert, M Thomas P; da Fonseca, Rute R

    2011-02-23

    Mitochondria produce up to 95 per cent of the eukaryotic cell's energy. The coding genes of the mitochondrial DNA may therefore evolve under selection owing to metabolic requirements. The killer whale, Orcinus orca, is polymorphic, has a global distribution and occupies a range of ecological niches. It is therefore a suitable organism for testing this hypothesis. We compared a global dataset of the complete mitochondrial genomes of 139 individuals for amino acid changes that were associated with radical physico-chemical property changes and were influenced by positive selection. Two such selected non-synonymous amino acid changes were found; one in each of two ecotypes that inhabit the Antarctic pack ice. Both substitutions were associated with changes in local polarity, increased steric constraints and α-helical tendencies that could influence overall metabolic performance, suggesting a functional change.

  19. Genetic differentiation of Artyfechinostomum malayanum and A. sufrartyfex (Trematoda: Echinostomatidae) based on internal transcribed spacer sequences.

    PubMed

    Tantrawatpan, Chairat; Saijuntha, Weerachai; Sithithaworn, Paiboon; Andrews, Ross H; Petney, Trevor N

    2013-01-01

    Genetic differentiation between two synonymous echinostomes species, Artyfechinostomum malayanum and Artyfechinostomum sufrartyfex was determined by using the first and second internal transcribed spacers (ITS1 and ITS2), the non-coding region of rDNA as genetic makers. Of the 699 bp of combined ITS1 and ITS2 sequences examined, 18 variable nucleotide positions (2.58 %) were observed. Of these, 17 positions could be used as diagnostic position between these two sibling species, whereas the other one variation was intraspecific variation of A. malayanum. A clade of A. malayanum was closely aligned with A. sufrartyfex and clearly distance from the cluster of other echinostomes. Our results may sufficiently suggest that the current synonymy of these species is not valid.

  20. Genetic polymorphisms associated with breast cancer in malaysian cohort.

    PubMed

    Chahil, Jagdish Kaur; Munretnam, Khamsigan; Samsudin, Nurulhafizah; Lye, Say Hean; Hashim, Nikman Adli Nor; Ramzi, Nurul Hanis; Velapasamy, Sharmila; Wee, Ler Lian; Alex, Livy

    2015-04-01

    Genome-wide association studies have discovered multiple single nucleotide polymorphisms (SNPs) associated with the risk of common diseases. The objective of this study was to demonstrate the replication of previously published SNPs that showed statistical significance for breast cancer in the Malaysian population. In this case-control study, 80 subjects for each group were recruited from various hospitals in Malaysia. A total of 768 SNPs were genotyped and analyzed to distinguish risk and protective alleles. A total of three SNPs were found to be associated with increased risk of breast cancer while six SNPs showed protective effect. All nine were statistically significant SNPs (p ≤ 0.01), five SNPs from previous studies were successfully replicated in our study. Significant modifiable (diet) and non-modifiable (family history of breast cancer in first degree relative) risk factors were also observed. We identified nine SNPs from this study to be either conferring susceptibility or protection to breast cancer which may serve as potential markers in risk prediction.

  1. Functional variants of human papillomavirus type 16 demonstrate host genome integration and transcriptional alterations corresponding to their unique cancer epidemiology.

    PubMed

    Jackson, Robert; Rosa, Bruce A; Lameiras, Sonia; Cuninghame, Sean; Bernard, Josee; Floriano, Wely B; Lambert, Paul F; Nicolas, Alain; Zehbe, Ingeborg

    2016-11-02

    Human papillomaviruses (HPVs) are a worldwide burden as they are a widespread group of tumour viruses in humans. Having a tropism for mucosal tissues, high-risk HPVs are detected in nearly all cervical cancers. HPV16 is the most common high-risk type but not all women infected with high-risk HPV develop a malignant tumour. Likely relevant, HPV genomes are polymorphic and some HPV16 single nucleotide polymorphisms (SNPs) are under evolutionary constraint instigating variable oncogenicity and immunogenicity in the infected host. To investigate the tumourigenicity of two common HPV16 variants, we used our recently developed, three-dimensional organotypic model reminiscent of the natural HPV infectious cycle and conducted various "omics" and bioinformatics approaches. Based on epidemiological studies we chose to examine the HPV16 Asian-American (AA) and HPV16 European Prototype (EP) variants. They differ by three non-synonymous SNPs in the transforming and virus-encoded E6 oncogene where AAE6 is classified as a high- and EPE6 as a low-risk variant. Remarkably, the high-risk AAE6 variant genome integrated into the host DNA, while the low-risk EPE6 variant genome remained episomal as evidenced by highly sensitive Capt-HPV sequencing. RNA-seq experiments showed that the truncated form of AAE6, integrated in chromosome 5q32, produced a local gene over-expression and a large variety of viral-human fusion transcripts, including long distance spliced transcripts. In addition, differential enrichment of host cell pathways was observed between both HPV16 E6 variant-containing epithelia. Finally, in the high-risk variant, we detected a molecular signature of host chromosomal instability, a common property of cancer cells. We show how naturally occurring SNPs in the HPV16 E6 oncogene cause significant changes in the outcome of HPV infections and subsequent viral and host transcriptome alterations prone to drive carcinogenesis. Host genome instability is closely linked to viral integration into the host genome of HPV-infected cells, which is a key phenomenon for malignant cellular transformation and the reason for uncontrolled E6 oncogene expression. In particular, the finding of variant-specific integration potential represents a new paradigm in HPV variant biology.

  2. Investigating the Role of Mitochondrial Haplogroups in Genetic Predisposition to Meningococcal Disease

    PubMed Central

    Salas, Antonio; Fachal, Laura; Marcos-Alonso, Sonia; Vega, Ana; Martinón-Torres, Federico

    2009-01-01

    Background and Aims Meningococcal disease remains one of the most important infectious causes of death in industrialized countries. The highly diverse clinical presentation and prognosis of Neisseria meningitidis infections are the result of complex host genetics and environmental interactions. We investigated whether mitochondrial genetic background contributes to meningococcal disease (MD) susceptibility. Methodology/Principal Findings Prospective controlled study was performed through a national research network on MD that includes 41 Spanish hospitals. Cases were 307 paediatric patients with confirmed MD, representing the largest series of MD patients analysed to date. Two independent sets of ethnicity-matched control samples (CG1 [N = 917]), and CG2 [N = 616]) were used for comparison. Cases and controls underwent mtDNA haplotyping of a selected set of 25 mtDNA SNPs (mtSNPs), some of them defining major European branches of the mtDNA phylogeny. In addition, 34 ancestry informative markers (AIMs) were genotyped in cases and CG2 in order to monitor potential hidden population stratification. Samples of known African, Native American and European ancestry (N = 711) were used as classification sets for the determination of ancestral membership of our MD patients. A total of 39 individuals were eliminated from the main statistical analyses (including fourteen gypsies) on the basis of either non-Spanish self-reported ancestry or the results of AIMs indicating a European membership lower than 95%. Association analysis of the remaining 268 cases against CG1 suggested an overrepresentation of the synonym mtSNP G11719A variant (Pearson's chi-square test; adjusted P-value = 0.0188; OR [95% CI] = 1.63 [1.22–2.18]). When cases were compared with CG2, the positive association could not be replicated. No positive association has been observed between haplogroup (hg) status of cases and CG1/CG2 and hg status of cases and several clinical variants. Conclusions We did not find evidence of association between mtSNPs and mtDNA hgs with MD after carefully monitoring the confounding effect of population sub-structure. MtDNA variability is particularly stratified in human populations owing to its low effective population size in comparison with autosomal markers and therefore, special care should be taken in the interpretation of seeming signals of positive associations in mtDNA case-control association studies. PMID:20019817

  3. Donor single nucleotide polymorphism in the CCR9 gene affects the incidence of skin GVHD.

    PubMed

    Inamoto, Y; Murata, M; Katsumi, A; Kuwatsuka, Y; Tsujimura, A; Ishikawa, Y; Sugimoto, K; Onizuka, M; Terakura, S; Nishida, T; Kanie, T; Taji, H; Iida, H; Suzuki, R; Abe, A; Kiyoi, H; Matsushita, T; Miyamura, K; Kodera, Y; Naoe, T

    2010-02-01

    The interactions between chemokines and their receptors may have an important role in initiating GVHD after allogeneic hematopoietic SCT (allo-HSCT). CCL25 and CCR9 are unique because they are exclusively expressed in epithelial cells and in Peyer's patches of the small intestine. We focused on rs12721497 (G926A), one of the non-synonymous single nucleotide polymorphisms (SNPs) in the CCR9 gene, and analyzed the SNP of donors in 167 consecutive patients who received allo-HSCT from an HLA-identical sibling donor. Genotypes were tested for associations with acute and chronic GVHD in each organ and transplant outcome. Multivariate analyses showed that the genotype 926AG was significantly associated with the incidence of acute stage > or =2 skin GVHD (hazard ratio: 3.2; 95% confidence interval (95% CI): 1.1-9.1; P=0.032) and chronic skin GVHD (hazard ratio: 4.1; 95% CI: 1.1-15; P=0.036), but not with GVHD in other organs or with relapse, non-relapse mortality or OS. To clarify the functional differences between genotypes, each SNP in retroviral vectors was transfected into Jurkat cells. In chemotaxis assays, the 926G transfectant showed greater response to CCL25 than the 926A transfectant. In conclusion, more active homing of CCR9-926AG T cells to Peyer's patches may produce changes in Ag presentation and result in increased incidence of skin GVHD.

  4. Allelic variations and differential expressions detected at quantitative trait loci for salt stress tolerance in wheat.

    PubMed

    Oyiga, Benedict C; Sharma, Ram C; Baum, Michael; Ogbonnaya, Francis C; Léon, Jens; Ballvora, Agim

    2018-05-01

    The increasing salinization of agricultural lands is a threat to global wheat production. Understanding of the mechanistic basis of salt tolerance (ST) is essential for developing breeding and selection strategies that would allow for increased wheat production under saline conditions to meet the increasing global demand. We used a set that consists of 150 internationally derived winter and facultative wheat cultivars genotyped with a 90K SNP chip and phenotyped for ST across three growth stages and for ionic (leaf K + and Na +  contents) traits to dissect the genetic architecture regulating ST in wheat. Genome-wide association mapping revealed 187 Single Nucleotide Polymorphism (SNPs) (R 2  = 3.00-30.67%), representing 37 quantitative trait loci (QTL), significantly associated with the ST traits. Of these, four QTL on 1BS, 2AL, 2BS and 3AL were associated with ST across the three growth stages and with the ionic traits. Novel QTL were also detected on 1BS and 1DL. Candidate genes linked to these polymorphisms were uncovered, and expression analyses were performed and validated on them under saline and non-saline conditions using transcriptomics and qRT-PCR data. Expressed sequence comparisons in contrasting ST wheat genotypes identified several non-synonymous/missense mutation sites that are contributory to the ST trait variations, indicating the biological relevance of these polymorphisms that can be exploited in breeding for ST in wheat. © 2017 The Authors. Plant, Cell & Environment published by JohnWiley & Sons Ltd.

  5. Bactericidal activity of tracheal antimicrobial peptide against respiratory pathogens of cattle.

    PubMed

    Taha-Abdelaziz, Khaled; Perez-Casal, José; Schott, Courtney; Hsiao, Jason; Attah-Poku, Samuel; Slavić, Durđa; Caswell, Jeff L

    2013-04-15

    Tracheal antimicrobial peptide (TAP) is a β-defensin produced by mucosal epithelial cells of cattle. Although effective against several human pathogens, the activity of this bovine peptide against the bacterial pathogens that cause bovine respiratory disease have not been reported. This study compared the antibacterial effects of synthetic TAP against Mannheimia haemolytica, Histophilus somni, Pasteurella multocida, and Mycoplasma bovis. Bactericidal activity against M. bovis was not detected. In contrast, the Pasteurellaceae bacteria showed similar levels of susceptibility to that of Escherichia coli, with 0.125μg TAP inhibiting growth in a radial diffusion assay and minimum inhibitory concentrations of 1.56-6.25μg/ml in a bactericidal assay. Significant differences among isolates were not observed. Sequencing of exon 2 of the TAP gene from 23 cattle revealed a prevalent non-synonymous single nucleotide polymorphism (SNP) A137G, encoding either serine or asparagine at residue 20 of the mature peptide. The functional effect of this SNP was tested against M. haemolytica using synthetic peptides. The bactericidal effect of the asparagine-containing peptide was consistently higher than the serine-containing peptide. Bactericidal activities were similar for an acapsular mutant of M. haemolytica compared to the wild type. These findings indicate that the Pasteurellaceae bacteria that cause bovine respiratory disease are susceptible to killing by bovine TAP and appear not to have evolved resistance, whereas M. bovis appears to be resistant. A non-synonymous SNP was identified in the coding region of the TAP gene, and the corresponding peptides vary in their bactericidal activity against M. haemolytica. Copyright © 2013 Elsevier B.V. All rights reserved.

  6. Associations between RNA splicing regulatory variants of stemness-related genes and racial disparities in susceptibility to prostate cancer.

    PubMed

    Wang, Yanru; Freedman, Jennifer A; Liu, Hongliang; Moorman, Patricia G; Hyslop, Terry; George, Daniel J; Lee, Norman H; Patierno, Steven R; Wei, Qingyi

    2017-08-15

    Evidence suggests that cells with a stemness phenotype play a pivotal role in oncogenesis, and prostate cells exhibiting this phenotype have been identified. We used two genome-wide association study (GWAS) datasets of African descendants, from the Multiethnic/Minority Cohort Study of Diet and Cancer (MEC) and the Ghana Prostate Study, and two GWAS datasets of non-Hispanic whites, from the Prostate, Lung, Colorectal and Ovarian (PLCO) Cancer Screening Trial and the Breast and Prostate Cancer Cohort Consortium (BPC3), to analyze the associations between genetic variants of stemness-related genes and racial disparities in susceptibility to prostate cancer. We evaluated associations of single-nucleotide polymorphisms (SNPs) in 25 stemness-related genes with prostate cancer risk in 1,609 cases and 2,550 controls of non-Hispanic whites (4,934 SNPs) and 1,144 cases and 1,116 controls of African descendants (5,448 SNPs) with correction by false discovery rate ≤0.2. We identified 32 SNPs in five genes (TP63, ALDH1A1, WNT1, MET and EGFR) that were significantly associated with prostate cancer risk, of which six SNPs in three genes (TP63, ALDH1A1 and WNT1) and eight EGFR SNPs showed heterogeneity in susceptibility between these two racial groups. In addition, 13 SNPs in MET and one in ALDH1A1 were found only in African descendants. The in silico bioinformatics analyses revealed that EGFR rs2072454 and SNPs in linkage with the identified SNPs in MET and ALDH1A1 (r 2  > 0.6) were predicted to regulate RNA splicing. These variants may serve as novel biomarkers for racial disparities in prostate cancer risk. © 2017 UICC.

  7. Identification of novel mutations and sequence variants in the SOX2 and CHX10 genes in patients with anophthalmia/microphthalmia

    PubMed Central

    Zhou, Jie; Kherani, Femida; Bardakjian, Tanya M.; Katowitz, James; Hughes, Nkecha; Schimmenti, Lisa A.; Schneider, Adele

    2008-01-01

    Purpose Mutations in the SOX2 and CHX10 genes have been reported in patients with anophthalmia and/or microphthalmia. In this study, we evaluated 34 anophthalmic/microphthalmic patient DNA samples (two sets of siblings included) for mutations and sequence variants in SOX2 and CHX10. Methods Conformational sensitive gel electrophoresis (CSGE) was used for the initial SOX2 and CHX10 screening of 34 affected individuals (two sets of siblings), five unaffected family members, and 80 healthy controls. Patient samples containing heteroduplexes were selected for sequence analysis. Base pair changes in SOX2 and CHX10 were confirmed by sequencing bidirectionally in patient samples. Results Two novel heterozygous mutations and two sequence variants (one known) in SOX2 were identified in this cohort. Mutation c.310 G>T (p. Glu104X), found in one patient, was in the region encoding the high mobility group (HMG) DNA-binding domain and resulted in a change from glutamic acid to a stop codon. The second mutation, noted in two affected siblings, was a single nucleotide deletion c.549delC (p. Pro184ArgfsX19) in the region encoding the activation domain, resulting in a frameshift and premature termination of the coding sequence. The shortened protein products may result in the loss of function. In addition, a novel nucleotide substitution c.*557G>A was identified in the 3′-untranslated region in one patient. The relationship between the nucleotide change and the protein function is indeterminate. A known single nucleotide polymorphism (c. *469 C>A, SNP rs11915160) was also detected in 2 of the 34 patients. Screening of CHX10 identified two synonymous sequence variants, c.471 C>T (p.Ser157Ser, rs35435463) and c.579 G>A (p. Gln193Gln, novel SNP), and one non-synonymous sequence variant, c.871 G>A (p. Asp291Asn, novel SNP). The non-synonymous polymorphism was also present in healthy controls, suggesting non-causality. Conclusions These results support the role of SOX2 in ocular development. Loss of SOX2 function results in severe eye malformation. CHX10 was not implicated with microphthalmia/anophthalmia in our patient cohort. PMID:18385794

  8. [Association between aryl hydrocarbon receptor gene polymorphisms and chromosomal damage in coke-oven workers].

    PubMed

    Bin, Ping; Leng, Shuguang; Liang, Xuemiao; Cheng, Juan

    2007-11-01

    To investigate the association of single nucleotide polymorphisms (SNPs) or haplotypes of aryl hydrocarbon receptor (AHR) gene and chromosomal damage in peripheral blood lymphocytes among coke-oven workers. Eighty-nine coke-oven workers exposed to a high level of polycyclic aromatic hydrocarbons (PAHs) and sixty non-exposed workers were selected as the study subjects. Urinary 1-hydroxypyrene (1-OHPyr) levels were measured as the internal dose of PAHs exposure. The chromosomal damage in peripheral lymphocyte was measured by the cytokinesis-block micronucleus (CBMN) assay. Two SNPs in AHR gene, including rs6960165, rs2282885 were detected by PCR-RFLP. The AHR haplotypes were estimated by Bayesian statistical method with the software of PHASE Version 2.1. The associations between SNPs or haplotypes pairs and CBMN were assessed by analysis of covariance in the coke-oven workers and non-exposed workers. The level of 1-OHPyr among coke-oven workers was significantly higher than that among non-exposed workers (P < 0.01). The CBMN among coke-oven workers was significantly higher than that among non-exposed workers (P < 0.01). After adjusting the age and the level of 1-OHPyr, the different SNPs of AHR gene rs6960165 in coke-oven workers were related to the CBMN frequencies (P = 0.014), but no association between the different SNPs of AHR gene rs2282885 and the rates of CBMN was observed in coke-oven workers (P = 0.586), either in the controls (P = 0.308 and P = 0.415, respectively), the haplotypes in coke-oven workers were significantly related to the rates of CBMN (P = 0.007), while there was no significant association in non-exposed workers (P = 0.768). Our results suggested that SNPs rs6960165 or haplotypes of AHR were associated with the CBMN frequencies in coke-oven workers.

  9. Fine-mapping and initial characterization of QT interval loci in African Americans.

    PubMed

    Avery, Christy L; Sethupathy, Praveen; Buyske, Steven; He, Qianchuan; Lin, Dan-Yu; Arking, Dan E; Carty, Cara L; Duggan, David; Fesinmeyer, Megan D; Hindorff, Lucia A; Jeff, Janina M; Klein, Liviu; Patton, Kristen K; Peters, Ulrike; Shohet, Ralph V; Sotoodehnia, Nona; Young, Alicia M; Kooperberg, Charles; Haiman, Christopher A; Mohlke, Karen L; Whitsel, Eric A; North, Kari E

    2012-01-01

    The QT interval (QT) is heritable and its prolongation is a risk factor for ventricular tachyarrhythmias and sudden death. Most genetic studies of QT have examined European ancestral populations; however, the increased genetic diversity in African Americans provides opportunities to narrow association signals and identify population-specific variants. We therefore evaluated 6,670 SNPs spanning eleven previously identified QT loci in 8,644 African American participants from two Population Architecture using Genomics and Epidemiology (PAGE) studies: the Atherosclerosis Risk in Communities study and Women's Health Initiative Clinical Trial. Of the fifteen known independent QT variants at the eleven previously identified loci, six were significantly associated with QT in African American populations (P≤1.20×10(-4)): ATP1B1, PLN1, KCNQ1, NDRG4, and two NOS1AP independent signals. We also identified three population-specific signals significantly associated with QT in African Americans (P≤1.37×10(-5)): one at NOS1AP and two at ATP1B1. Linkage disequilibrium (LD) patterns in African Americans assisted in narrowing the region likely to contain the functional variants for several loci. For example, African American LD patterns showed that 0 SNPs were in LD with NOS1AP signal rs12143842, compared with European LD patterns that indicated 87 SNPs, which spanned 114.2 Kb, were in LD with rs12143842. Finally, bioinformatic-based characterization of the nine African American signals pointed to functional candidates located exclusively within non-coding regions, including predicted binding sites for transcription factors such as TBX5, which has been implicated in cardiac structure and conductance. In this detailed evaluation of QT loci, we identified several African Americans SNPs that better define the association with QT and successfully narrowed intervals surrounding established loci. These results demonstrate that the same loci influence variation in QT across multiple populations, that novel signals exist in African Americans, and that the SNPs identified as strong candidates for functional evaluation implicate gene regulatory dysfunction in QT prolongation.

  10. Fine-Mapping and Initial Characterization of QT Interval Loci in African Americans

    PubMed Central

    Avery, Christy L.; Sethupathy, Praveen; Buyske, Steven; He, Qianchuan; Lin, Dan-Yu; Arking, Dan E.; Carty, Cara L.; Duggan, David; Fesinmeyer, Megan D.; Hindorff, Lucia A.; Jeff, Janina M.; Klein, Liviu; Patton, Kristen K.; Peters, Ulrike; Shohet, Ralph V.; Sotoodehnia, Nona; Young, Alicia M.; Kooperberg, Charles; Haiman, Christopher A.; Mohlke, Karen L.; Whitsel, Eric A.; North, Kari E.

    2012-01-01

    The QT interval (QT) is heritable and its prolongation is a risk factor for ventricular tachyarrhythmias and sudden death. Most genetic studies of QT have examined European ancestral populations; however, the increased genetic diversity in African Americans provides opportunities to narrow association signals and identify population-specific variants. We therefore evaluated 6,670 SNPs spanning eleven previously identified QT loci in 8,644 African American participants from two Population Architecture using Genomics and Epidemiology (PAGE) studies: the Atherosclerosis Risk in Communities study and Women's Health Initiative Clinical Trial. Of the fifteen known independent QT variants at the eleven previously identified loci, six were significantly associated with QT in African American populations (P≤1.20×10−4): ATP1B1, PLN1, KCNQ1, NDRG4, and two NOS1AP independent signals. We also identified three population-specific signals significantly associated with QT in African Americans (P≤1.37×10−5): one at NOS1AP and two at ATP1B1. Linkage disequilibrium (LD) patterns in African Americans assisted in narrowing the region likely to contain the functional variants for several loci. For example, African American LD patterns showed that 0 SNPs were in LD with NOS1AP signal rs12143842, compared with European LD patterns that indicated 87 SNPs, which spanned 114.2 Kb, were in LD with rs12143842. Finally, bioinformatic-based characterization of the nine African American signals pointed to functional candidates located exclusively within non-coding regions, including predicted binding sites for transcription factors such as TBX5, which has been implicated in cardiac structure and conductance. In this detailed evaluation of QT loci, we identified several African Americans SNPs that better define the association with QT and successfully narrowed intervals surrounding established loci. These results demonstrate that the same loci influence variation in QT across multiple populations, that novel signals exist in African Americans, and that the SNPs identified as strong candidates for functional evaluation implicate gene regulatory dysfunction in QT prolongation. PMID:22912591

  11. Can Non-HLA Single Nucleotide Polymorphisms Help Stratify Risk in TrialNet Relatives at Risk for Type 1 Diabetes?

    PubMed

    Steck, Andrea K; Xu, Ping; Geyer, Susan; Redondo, Maria J; Antinozzi, Peter; Wentworth, John M; Sosenko, Jay; Onengut-Gumuscu, Suna; Chen, Wei-Min; Rich, Stephen S; Pugliese, Alberto

    2017-08-01

    Genome-wide association studies identified >50 type 1 diabetes (T1D) associated non-human leukocyte antigens (non-HLA) loci. The purpose of this study was to assess the contribution of non-HLA single nucleotide polymorphisms (SNPs) to risk of disease progression. The TrialNet Pathway to Prevention Study follows relatives of T1D patients for development of autoantibodies (Abs) and T1D. Using the Immunochip, we analyzed 53 diabetes-associated, non-HLA SNPs in 1016 Ab-positive, at-risk non-Hispanic white relatives. Effect of SNPs on the development of multiple Abs and T1D. Cox proportional analyses included all substantial non-HLA SNPs, HLA genotypes, relationship to proband, sex, age at initial screening, initial Ab type, and number. Factors involved in progression from single to multiple Abs included age at screening, relationship to proband, HLA genotypes, and rs3087243 (cytotoxic T lymphocyte antigen-4). Significant factors for diabetes progression included age at screening, Ab number, HLA genotypes, rs6476839 [GLIS family zinc finger 3 (GLIS3)], and rs3184504 [SH2B adaptor protein 3 (SH2B3)]. When glucose area under the curve (AUC) was included, factors involved in disease progression included glucose AUC, age at screening, Ab number, relationship to proband, HLA genotypes, rs6476839 (GLIS3), and rs7221109 (CCR7). In stratified analyses by age, glucose AUC, age at screening, sibling, HLA genotypes, rs6476839 (GLIS3), and rs4900384 (C14orf64) were significantly associated with progression to diabetes in participants <12 years old, whereas glucose AUC, sibling, rs3184504 (SH2B3), and rs4900384 (C14orf64) were significant in those ≥12. In conclusion, we identified five non-HLA SNPs associated with increased risk of progression from Ab positivity to disease that may improve risk stratification for prevention trials. Copyright © 2017 by the Endocrine Society

  12. GARLIC: a bioinformatic toolkit for aetiologically connecting diseases and cell type-specific regulatory maps

    PubMed Central

    Nikolić, Miloš; Papantonis, Argyris

    2017-01-01

    Abstract Genome-wide association studies (GWAS) have emerged as a powerful tool to uncover the genetic basis of human common diseases, which often show a complex, polygenic and multi-factorial aetiology. These studies have revealed that 70–90% of all single nucleotide polymorphisms (SNPs) associated with common complex diseases do not occur within genes (i.e. they are non-coding), making the discovery of disease-causative genetic variants and the elucidation of the underlying pathological mechanisms far from straightforward. Based on emerging evidences suggesting that disease-associated SNPs are frequently found within cell type-specific regulatory sequences, here we present GARLIC (GWAS-based Prediction Toolkit for Connecting Diseases and Cell Types), a user-friendly, multi-purpose software with an associated database and online viewer that, using global maps of cis-regulatory elements, can aetiologically connect human diseases with relevant cell types. Additionally, GARLIC can be used to retrieve potential disease-causative genetic variants overlapping regulatory sequences of interest. Overall, GARLIC can satisfy several important needs within the field of medical genetics, thus potentially assisting in the ultimate goal of uncovering the elusive and complex genetic basis of common human disorders. PMID:28007912

  13. Prioritisation of associations between protein domains and complex diseases using domain-domain interaction networks.

    PubMed

    Wang, W; Zhang, W; Jiang, R; Luan, Y

    2010-05-01

    It is of vital importance to find genetic variants that underlie human complex diseases and locate genes that are responsible for these diseases. Since proteins are typically composed of several structural domains, it is reasonable to assume that harmful genetic variants may alter structures of protein domains, affect functions of proteins and eventually cause disorders. With this understanding, the authors explore the possibility of recovering associations between protein domains and complex diseases. The authors define associations between protein domains and disease families on the basis of associations between non-synonymous single nucleotide polymorphisms (nsSNPs) and complex diseases, similarities between diseases, and relations between proteins and domains. Based on a domain-domain interaction network, the authors propose a 'guilt-by-proximity' principle to rank candidate domains according to their average distance to a set of seed domains in the domain-domain interaction network. The authors validate the method through large-scale cross-validation experiments on simulated linkage intervals, random controls and the whole genome. Results show that areas under receiver operating characteristic curves (AUC scores) can be as high as 77.90%, and the mean rank ratios can be as low as 21.82%. The authors further offer a freely accessible web interface for a genome-wide landscape of associations between domains and disease families.

  14. Phylodynamic Analysis of Clinical and Environmental Vibrio cholerae Isolates from Haiti Reveals Diversification Driven by Positive Selection

    PubMed Central

    Azarian, Taj; Ali, Afsar; Johnson, Judith A.; Mohr, David; Prosperi, Mattia; Veras, Nazle M.; Jubair, Mohammed; Strickland, Samantha L.; Rashid, Mohammad H.; Alam, Meer T.; Weppelmann, Thomas A.; Katz, Lee S.; Tarr, Cheryl L.; Colwell, Rita R.

    2014-01-01

    ABSTRACT Phylodynamic analysis of genome-wide single-nucleotide polymorphism (SNP) data is a powerful tool to investigate underlying evolutionary processes of bacterial epidemics. The method was applied to investigate a collection of 65 clinical and environmental isolates of Vibrio cholerae from Haiti collected between 2010 and 2012. Characterization of isolates recovered from environmental samples identified a total of four toxigenic V. cholerae O1 isolates, four non-O1/O139 isolates, and a novel nontoxigenic V. cholerae O1 isolate with the classical tcpA gene. Phylogenies of strains were inferred from genome-wide SNPs using coalescent-based demographic models within a Bayesian framework. A close phylogenetic relationship between clinical and environmental toxigenic V. cholerae O1 strains was observed. As cholera spread throughout Haiti between October 2010 and August 2012, the population size initially increased and then fluctuated over time. Selection analysis along internal branches of the phylogeny showed a steady accumulation of synonymous substitutions and a progressive increase of nonsynonymous substitutions over time, suggesting diversification likely was driven by positive selection. Short-term accumulation of nonsynonymous substitutions driven by selection may have significant implications for virulence, transmission dynamics, and even vaccine efficacy. PMID:25538191

  15. Polymorphisms in the K13-propeller gene in artemisinin-susceptible Plasmodium falciparum parasites from Bougoula-Hameau and Bandiagara, Mali.

    PubMed

    Ouattara, Amed; Kone, Aminatou; Adams, Matthew; Fofana, Bakary; Maiga, Amelia Walling; Hampton, Shay; Coulibaly, Drissa; Thera, Mahamadou A; Diallo, Nouhoum; Dara, Antoine; Sagara, Issaka; Gil, Jose Pedro; Bjorkman, Anders; Takala-Harrison, Shannon; Doumbo, Ogobara K; Plowe, Christopher V; Djimde, Abdoulaye A

    2015-06-01

    Artemisinin-resistant Plasmodium falciparum malaria has been documented in southeast Asia and may already be spreading in that region. Molecular markers are important tools for monitoring the spread of antimalarial drug resistance. Recently, single-nucleotide polymorphisms (SNPs) in the PF3D7_1343700 kelch propeller (K13-propeller) domain were shown to be associated with artemisinin resistance in vivo and in vitro. The prevalence and role of K13-propeller mutations are poorly known in sub-Saharan Africa. K13-propeller mutations were genotyped by direct sequencing of nested polymerase chain reaction (PCR) amplicons from dried blood spots of pre-treatment falciparum malaria infections collected before and after the use of artemisinin-based combination therapy (ACT) as first-line therapy in Mali. Although K13-propeller mutations previously associated with delayed parasite clearance in Cambodia were not identified, 26 K13-propeller mutations were identified in both recent samples and pre-ACT infections. Parasite clearance time was comparable between infections with non-synonymous K13-propeller mutations and infections with the reference allele. These findings suggest that K13-propeller mutations are present in artemisinin-sensitive parasites and that they preceded the wide use of ACTs in Mali. © The American Society of Tropical Medicine and Hygiene.

  16. Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes.

    PubMed

    Seligmann, Hervé; Warthi, Ganesh

    2017-01-01

    A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').

  17. The two single nucleotide polymorphisms in the H37/RBM5 tumour suppressor gene at 3p21.3 correlated with different subtypes of non-small cell lung cancers

    PubMed Central

    Oh, Juliana J.; Koegel, Ashley; Phan, Diana T.; Razfar, Ali; Slamon, Dennis J.

    2007-01-01

    Summary Allele loss and genetic alteration in chromosome 3p, particularly in 3p21.3 region, are the most frequent and the earliest genomic abnormalities found in lung cancer. Multiple 3p21.3 genes exhibit various degrees of tumour suppression activity suggesting that 3p21.3 genes may function as an integrated tumour suppressor region through their diverse biological activities. We have previously demonstrated growth inhibitory effects and tumour suppression mechanism of the H37/RBM5 gene which is one of the 19 genes residing in the 370kb minimal overlap region at 3p21.3. In the current study, in an attempt to find, if any, mutations in the H37 coding region in lung cancer cells, we compared nucleotide sequences of the entire H37 gene in tumour vs. adjacent normal tissues from 17 non-small cell lung cancer (NSCLC) patients. No mutations were detected, instead, we found the two silent single nucleotide polymorphisms (SNPs), C1138T and C2185T, within the coding region of the H37 gene. In addition, we found that specific allele types at these SNP positions are correlated with different histological subtypes of NSCLC; tumours containing heterozygous alleles (C+T) at these SNP positions are more likely to be associated with adenocarcinoma (AC) whereas homozygous alleles (either C or T) are associated with squamous cell carcinoma (SCC) (p=0.0098). We postulate that, these two silent polymorphisms may be in linkage disequilibrium (LD) with a disease causative allele in the 3p21.3 tumour suppressor region which is packed with a large number of important genes affecting lung cancer development. In addition, because of prevalent loss of heterozygosity (LOH) detected at 3p21.3 which precedes lung cancer initiation, these SNPs may be developed into a marker screening for the high risk individuals. PMID:17606309

  18. Sudden infant death syndrome (SIDS) and polymorphisms in Monoamine oxidase A gene (MAOA): a revisit.

    PubMed

    Groß, Maximilian; Bajanowski, Thomas; Vennemann, Mechtild; Poetsch, Micaela

    2014-01-01

    Literature describes multiple possible links between genetic variations in the neuroadrenergic system and the occurrence of sudden infant death syndrome. The X-chromosomal Monoamine oxidase A (MAOA) is one of the genes with regulatory activity in the noradrenergic and serotonergic neuronal systems and a polymorphism of the promoter which affects the activity of this gene has been proclaimed to contribute significantly to the prevalence of sudden infant death syndrome (SIDS) in three studies from 2009, 2012 and 2013. However, these studies described different significant correlations regarding gender or age of children. Since several studies, suggesting associations between genetic variations and SIDS, were disproved by follow-up analysis, this study was conducted to take a closer look at the MAOA gene and its polymorphisms. The functional MAOA promoter length polymorphism was investigated in 261 SIDS cases and 93 control subjects. Moreover, the allele distribution of 12 coding and non-coding single nucleotide polymorphisms (SNPs) of the MAOA gene was examined in 285 SIDS cases and 93 controls by a minisequencing technique. In contrast to prior studies with fewer individuals, no significant correlations between the occurrence of SIDS and the frequency of allele variants of the promoter polymorphism could be demonstrated, even including the results from the abovementioned previous studies. Regarding the SNPs, three statistically significant associations were observed which had not been described before. This study clearly disproves interactions between MAOA promoter polymorphisms and SIDS, even if variations in single nucleotide polymorphisms of MAOA should be subjected to further analysis to clarify their impact on SIDS.

  19. [Genetic diversity analysis of Andrographis paniculata in China based on SRAP and SNP].

    PubMed

    Chen, Rong; Wang, Xiao-Yun; Song, Yu-Ning; Zhu, Yun-feng; Wang, Peng-liang; Li, Min; Zhong, Guo-Yue

    2014-12-01

    In order to reveal genetic diversity of domestic Andrographis paniculata and its impact on quality, genetic backgrounds of 103 samples from 7 provinces in China were analyzed using SRAP marker and SNP marker. Genetic structures of the A. paniculata populations were estimated with Powermarker V 3.25 and Mega 6.0 software, and polymorphic SNPs were identified with CodonCode Aligner software. The results showed that the genetic distances of domestic A. paniculata germplasm ranged from 0. 01 to 0.09, and no polymorphic SNPs were discovered in coding sequence fragments of ent-copalyl diphosphate synthase. A. paniculata germplasm from various regions in China had poor genetic diversity. This phenomenon was closely related to strict self-fertilization and earlier introduction from the same origin. Therefore, genetic background had little impact on variable qualities of A. paniculata in domestic market. Mutation breeding, polyploid breeding and molecular breeding were proposed as promising strategies in germplasm innovation.

  20. A functional SNP catalog of overlapping miRNA-binding sites in genes implicated in prion disease and other neurodegenerative disorders.

    PubMed

    Saba, Reuben; Medina, Sarah J; Booth, Stephanie A

    2014-10-01

    The involvement of SNPs in miRNA target sites remains poorly investigated in neurodegenerative disease. In addition to associations with disease risk, such genetic variations can also provide novel insight into mechanistic pathways that may be responsible for disease etiology and/or pathobiology. To identify SNPs associated specifically with degenerating neurons, we restricted our analysis to genes that are dysregulated in CA1 hippocampal neurons of mice during early, preclinical phase of Prion disease. The 125 genes chosen are also implicated in other numerous degenerative and neurological diseases and disorders and are therefore likely to be of fundamental importance. We predicted those SNPs that could increase, decrease, or have neutral effects on miRNA binding. This group of genes was more likely to possess DNA variants than were genes chosen at random. Furthermore, many of the SNPs are common within the human population, and could contribute to the growing awareness that miRNAs and associated SNPs could account for detrimental neurological states. Interestingly, SNPs that overlapped miRNA-binding sites in the 3'-UTR of GABA-receptor subunit coding genes were particularly enriched. Moreover, we demonstrated that SNP rs9291296 would strengthen miR-26a-5p binding to a highly conserved site in the 3'-UTR of gamma-aminobutyric acid receptor subunit alpha-4. © 2014 WILEY PERIODICALS, INC.

  1. Identification of a single nucleotide polymorphism indicative of high risk in acute myocardial infarction

    PubMed Central

    Shalia, Kavita; Saranath, Dhananjaya; Rayar, Jaipreet; Shah, Vinod K.; Mashru, Manoj R.; Soneji, Surendra L.

    2017-01-01

    Background & objectives: Acute myocardial infarction (AMI) is a major health concern in India. The aim of the study was to identify single nucleotide polymorphisms (SNPs) associated with AMI in patients using dedicated chip and validating the identified SNPs on custom-designed chips using high-throughput microarray analysis. Methods: In pilot phase, 48 AMI patients and 48 healthy controls were screened for SNPs using human CVD55K BeadChip with 48,472 SNP probes on Illumina high-throughput microarray platform. The identified SNPs were validated by genotyping additional 160 patients and 179 controls using custom-made Illumina VeraCode GoldenGate Genotyping Assay. Analysis was carried out using PLINK software. Results: From the pilot phase, 98 SNPs present on 94 genes were identified with increased risk of AMI (odds ratio of 1.84-8.85, P=0.04861-0.003337). Five of these SNPs demonstrated association with AMI in the validation phase (P<0.05). Among these, one SNP rs9978223 on interferon gamma receptor 2 [IFNGR2, interferon (IFN)-gamma transducer 1] gene showed a significant association (P=0.00021) with AMI below Bonferroni corrected P value (P=0.00061). IFNGR2 is the second subunit of the receptor for IFN-gamma, an important cytokine in inflammatory reactions. Interpretation & conclusions: The study identified an SNP rs9978223 on IFNGR2 gene, associated with increased risk in AMI patient from India. PMID:29434065

  2. Alternative SNP detection platforms, HRM and biosensors, for varietal identification in Vitis vinifera L. using F3H and LDOX genes.

    PubMed

    Gomes, Sónia; Castro, Cláudia; Barrias, Sara; Pereira, Leonor; Jorge, Pedro; Fernandes, José R; Martins-Lopes, Paula

    2018-04-11

    The wine sector requires quick and reliable methods for Vitis vinifera L. varietal identification. The number of V. vinifera varieties is estimated in about 5,000 worldwide. Single Nucleotide Polymorphisms (SNPs) represent the most basic and abundant form of genetic sequence variation, being adequate for varietal discrimination. The aim of this work was to develop DNA-based assays suitable to detect SNP variation in V. vinifera, allowing varietal discrimination. Genotyping by sequencing allowed the detection of eleven SNPs on two genes of the anthocyanin pathway, the flavanone 3-hydroxylase (F3H, EC: 1.14.11.9), and the leucoanthocyanidin dioxygenase (LDOX, EC 1.14.11.19; synonym anthocyanidin synthase, ANS) in twenty V. vinifera varieties. Three High Resolution Melting (HRM) assays were designed based on the sequencing information, discriminating five of the 20 varieties: Alicante Bouschet, Donzelinho Tinto, Merlot, Moscatel Galego and Tinta Roriz. Sanger sequencing of the HRM assay products confirmed the HRM profiles. Three probes, with different lengths and sequences, were used as bio-recognition elements in an optical biosensor platform based on a long period grating (LPG) fiber optic sensor. The label free platform detected a difference of a single SNP using genomic DNA samples. The two different platforms were successfully applied for grapevine varietal identification.

  3. Mutation analysis of GLDC, AMT and GCSH in cataract captive-bred vervet monkeys (Chlorocebus aethiops).

    PubMed

    Chauke, Chesa G; Magwebu, Zandisiwe E; Sharma, Jyoti R; Arieff, Zainunisha; Seier, Jürgen V

    2016-08-01

    Non-ketotic hyperglycinaemia (NKH) is an autosomal recessive inborn error of glycine metabolism characterized by accumulation of glycine in body fluids and various neurological symptoms. This study describes the first screening of NKH in cataract captive-bred vervet monkeys (Chlorocebus aethiops). Glycine dehydrogenase (GLDC), aminomethyltransferase (AMT) and glycine cleavage system H protein (GCSH) were prioritized. Mutation analysis of the complete coding sequence of GLDC and AMT revealed six novel single-base substitutions, of which three were non-synonymous missense and three were silent nucleotide changes. Although deleterious effects of the three amino acid substitutions were not evaluated, one substitution of GLDC gene (S44R) could be disease-causing because of its drastic amino acid change, affecting amino acids conserved in different primate species. This study confirms the diagnosis of NKH for the first time in vervet monkeys with cataracts. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  4. Mapping of Mcs30, a new mammary carcinoma susceptibility quantitative trait locus (QTL30) on rat chromosome 12: identification of fry as a candidate Mcs gene.

    PubMed

    Ren, Xuefeng; Graham, Jessica C; Jing, Lichen; Mikheev, Andrei M; Gao, Yuan; Lew, Jenny Pan; Xie, Hong; Kim, Andrea S; Shang, Xiuling; Friedman, Cynthia; Vail, Graham; Fang, Ming Zhu; Bromberg, Yana; Zarbl, Helmut

    2013-01-01

    Rat strains differ dramatically in their susceptibility to mammary carcinogenesis. On the assumption that susceptibility genes are conserved across mammalian species and hence inform human carcinogenesis, numerous investigators have used genetic linkage studies in rats to identify genes responsible for differential susceptibility to carcinogenesis. Using a genetic backcross between the resistant Copenhagen (Cop) and susceptible Fischer 344 (F344) strains, we mapped a novel mammary carcinoma susceptibility (Mcs30) locus to the centromeric region on chromosome 12 (LOD score of ∼8.6 at the D12Rat59 marker). The Mcs30 locus comprises approximately 12 Mbp on the long arm of rat RNO12 whose synteny is conserved on human chromosome 13q12 to 13q13. After analyzing numerous genes comprising this locus, we identified Fry, the rat ortholog of the furry gene of Drosophila melanogaster, as a candidate Mcs gene. We cloned and determined the complete nucleotide sequence of the 13 kbp Fry mRNA. Sequence analysis indicated that the Fry gene was highly conserved across evolution, with 90% similarity of the predicted amino acid sequence among eutherian mammals. Comparison of the Fry sequence in the Cop and F344 strains identified two non-synonymous single nucleotide polymorphisms (SNPs), one of which creates a putative, de novo phosphorylation site. Further analysis showed that the expression of the Fry gene is reduced in a majority of rat mammary tumors. Our results also suggested that FRY activity was reduced in human breast carcinoma cell lines as a result of reduced levels or mutation. This study is the first to identify the Fry gene as a candidate Mcs gene. Our data suggest that the SNPs within the Fry gene contribute to the genetic susceptibility of the F344 rat strain to mammary carcinogenesis. These results provide the foundation for analyzing the role of the human FRY gene in cancer susceptibility and progression.

  5. ABCB4 missense mutations D243A, K435T, G535D, I490T, R545C, and S978P significantly impair the lipid floppase and likely predispose to secondary pathologies in the human population.

    PubMed

    Andress, Edward J; Nicolaou, Michael; McGeoghan, Farrell; Linton, Kenneth J

    2017-07-01

    Bile salts are natural detergents required to solubilise dietary fat and lipid soluble vitamins. They are synthesised in hepatocytes and secreted into the luminal space of the biliary tree by the bile salt export pump (BSEP), an ATP-binding cassette (ABC) transporter in the canalicular membrane. BSEP deficiency causes cytotoxic accumulation of bile salts in the hepatocyte that results in mild-to-severe forms of cholestasis. The resulting inflammation can also progress to hepatocellular cancer via a novel mechanism involving upregulation of proliferative signalling pathways. A second ABC transporter of the canalicular membrane is also critical for bile formation. ABCB4 flops phosphatidylcholine into the outer leaflet of the membrane to be extracted by bile salts in the canalicular space. These mixed micelles reduce the detergent action of the bile salts and protect the biliary tree from their cytotoxic activity. ABCB4 deficiency also causes cholestasis, and might be expected to cause cholangitis and predispose to liver cancer. Non-synonymous SNPs in ABCB4 have now been described in patients with liver cancer or with inflammatory liver diseases that are known to predispose to cancer, but data showing that the SNPs are sufficiently deleterious to be an etiological factor are lacking. Here, we report the first characterisation at the protein level of six ABCB4 variants (D243A, K435T, G535D, I490T, R545C, and S978P) previously found in patients with inflammatory liver diseases or liver cancer. All significantly impair the transporter with a range of phenotypes exhibited, including low abundance, intracellular retention, and reduced floppase activity, suggesting that ABCB4 deficiency is the root cause of the pathology in these cases.

  6. SLC39A2 and FSIP1 polymorphisms as potential modifiers of arsenic-related bladder cancer

    PubMed Central

    Andrew, Angeline S.; Nelson, Heather H.; Li, Zhongze; Punshon, Tracy; Schned, Alan; Marsit, Carmen J.; Morris, J. Steven; Moore, Jason H.; Tyler, Anna L.; Gilbert-Diamond, Diane; Guerinot, Mary-Lou; Kelsey, Karl T.

    2012-01-01

    Arsenic is a carcinogen that contaminates drinking water worldwide. Accumulating evidence suggests that both exposure and genetic factors may influence susceptibility to arsenic-induced malignancies. We sought to identify novel susceptibility loci for arsenic-related bladder cancer in a US population with low to moderate drinking water levels of arsenic. We first screened a subset of bladder cancer cases using a panel of approximately 10,000 non-synonymous single nucleotide polymorphisms (SNPs). Top ranking hits on the SNP array then were considered for further analysis in our population-based case–control study (n = 832 cases and 1,191 controls). SNPs in the fibrous sheath interacting protein 1 (FSIP1) gene (rs10152640) and the solute carrier family 39, member 2 (SLC39A2) in the ZIP gene family of metal transporters (rs2234636) were detected as potential hits in the initial scan and validated in the full case–control study. The adjusted odds ratio (OR) for the FSIP1 polymorphism was 2.57 [95% confidence interval (CI) 1.13, 5.85] for heterozygote variants (AG) and 12.20 (95% CI 2.51, 59.30) for homozygote variants (GG) compared to homozygote wild types (AA) in the high arsenic group (greater than the 90th percentile), and unrelated in the low arsenic group (equal to or below the 90th percentile) (P for interaction = 0.002). For the SLC39A2 polymorphism, the adjusted ORs were 2.96 (95% CI 1.23, 7.15) and 2.91 (95% CI 1.00, 8.52) for heterozygote (TC) and homozygote (CC) variants compared to homozygote wild types (TT), respectively, and close to one in the low arsenic group (P for interaction = 0.03). Our findings suggest novel variants that may influence risk of arsenic-associated bladder cancer and those who may be at greatest risk from this widespread exposure. PMID:21947419

  7. SLC39A2 and FSIP1 polymorphisms as potential modifiers of arsenic-related bladder cancer.

    PubMed

    Karagas, Margaret R; Andrew, Angeline S; Nelson, Heather H; Li, Zhongze; Punshon, Tracy; Schned, Alan; Marsit, Carmen J; Morris, J Steven; Moore, Jason H; Tyler, Anna L; Gilbert-Diamond, Diane; Guerinot, Mary-Lou; Kelsey, Karl T

    2012-03-01

    Arsenic is a carcinogen that contaminates drinking water worldwide. Accumulating evidence suggests that both exposure and genetic factors may influence susceptibility to arsenic-induced malignancies. We sought to identify novel susceptibility loci for arsenic-related bladder cancer in a US population with low to moderate drinking water levels of arsenic. We first screened a subset of bladder cancer cases using a panel of approximately 10,000 non-synonymous single nucleotide polymorphisms (SNPs). Top ranking hits on the SNP array then were considered for further analysis in our population-based case-control study (n = 832 cases and 1,191 controls). SNPs in the fibrous sheath interacting protein 1 (FSIP1) gene (rs10152640) and the solute carrier family 39, member 2 (SLC39A2) in the ZIP gene family of metal transporters (rs2234636) were detected as potential hits in the initial scan and validated in the full case-control study. The adjusted odds ratio (OR) for the FSIP1 polymorphism was 2.57 [95% confidence interval (CI) 1.13, 5.85] for heterozygote variants (AG) and 12.20 (95% CI 2.51, 59.30) for homozygote variants (GG) compared to homozygote wild types (AA) in the high arsenic group (greater than the 90th percentile), and unrelated in the low arsenic group (equal to or below the 90th percentile) (P for interaction = 0.002). For the SLC39A2 polymorphism, the adjusted ORs were 2.96 (95% CI 1.23, 7.15) and 2.91 (95% CI 1.00, 8.52) for heterozygote (TC) and homozygote (CC) variants compared to homozygote wild types (TT), respectively, and close to one in the low arsenic group (P for interaction = 0.03). Our findings suggest novel variants that may influence risk of arsenic-associated bladder cancer and those who may be at greatest risk from this widespread exposure.

  8. [Evaluation of Wondfo Rapid Diagnostic Kit for detecting Plasmodium ovale and analysis of influencing factors].

    PubMed

    Feng, Tang; Jian-Xia, Tang; Feng, Lu; Sui, Xu; Ya-Ping, Gu; De-Sheng, Tong; Guo-Ding, Zhu; Hai-Yong, Hua; Hua-Yun, Zhou; Jun, Cao

    2016-03-21

    To evaluate the Wondfo Rapid Diagnostic Kit (Pf-LDH/Pan -pLDH) for detecting Plasmodium ovale and analyze the influence of parasitaemia, concentration and polymorphism of pLDH on the performances. A total of 100 blood samples from P. ovale patients confirmed by PCR were detected with the Wondfo Rapid Diagnostic Kit according to the manufacturers'instructions. The parasitaemia was determined by the microscopic examination. The concentration of pLDH was measured by ELISA tests. The LDH gene of P. ovale was amplified by PCR and sequenced. The influence of these three factors on the positive rate was analyzed. The overall positive rate of Wondfo Rapid Diagnostic Kit was 70.0% (70/100). The positive rate was 27.3% for the samples with parasitaemia ≤ 500 parasites/μl and reached 75.0%-75.4% when parasitaemia > 500 parasites/μl. The positive rate was 6.7% for samples with a low pLDH concentration ( A values ≤ 0.100) and reached 95.1%-100% at a high pLDH concentration ( A values > 0.100). The results of sequence analysis indicated that all the samples could be divided into 2 types, P. o. curtisi and P. o. wallikeri . The gene homology of LDH between 2 types was 97%. There were 24 single nucleotide polymorphism (s) (SNPs) between 2 types, while only 3 SNPs were non-synonymous mutations. The homology of LDH amino acid sequences between 2 types was 99%; only 3 amino acids were different. The positive rates for P. o. curtisi and P. o. wallikeri were 73.1% (38/52) and 66.7% (32/48) respectively; there was no statistically significant difference ( P > 0.05). The Wondfo Rapid Diagnostic Kit (Pf-LDH/Pan-pLDH) performs better than most of the similar products for the detection of P. ovale , and the positive rates are closely related to the parasitaemia and concentration of pLDH, while no related to the polymorphism of pLDH gene.

  9. A pilot integrative genomics study of GABA and glutamate neurotransmitter systems in suicide, suicidal behavior, and major depressive disorder.

    PubMed

    Yin, Honglei; Pantazatos, Spiro P; Galfalvy, Hanga; Huang, Yung-Yu; Rosoklija, Gorazd B; Dwork, Andrew J; Burke, Ainsley; Arango, Victoria; Oquendo, Maria A; Mann, John J

    2016-04-01

    Gamma-amino butyric acid (GABA) and glutamate are the major inhibitory and excitatory neurotransmitters in the mammalian central nervous system, respectively, and have been associated with suicidal behavior and major depressive disorder (MDD). We examined the relationship between genotype, brain transcriptome, and MDD/suicide for 24 genes involved in GABAergic and glutamatergic signaling. In part 1 of the study, 119 candidate SNPs in 24 genes (4 transporters, 4 enzymes, and 16 receptors) were tested for associations with MDD and suicidal behavior in 276 live participants (86 nonfatal suicide attempters with MDD and 190 non-attempters of whom 70% had MDD) and 209 postmortem cases (121 suicide deaths of whom 62% had MDD and 88 sudden death from other causes of whom 11% had MDD) using logistic regression adjusting for sex and age. In part 2, RNA-seq was used to assay isoform-level expression in dorsolateral prefrontal cortex of 59 postmortem samples (21 with MDD and suicide, 9 MDD without suicide, and 29 sudden death non-suicides and no psychiatric illness) using robust regression adjusting for sex, age, and RIN score. In part 3, SNPs with subthreshold (uncorrected) significance levels below 0.05 for an association with suicidal behavior and/or MDD in part 1 were tested for eQTL effects in prefrontal cortex using the Brain eQTL Almanac (www.braineac.org). No SNPs or transcripts were significant after adjustment for multiple comparisons. However, a protein coding transcript (ENST00000414552) of the GABA A receptor, gamma 2 (GABRG2) had lower brain expression postmortem in suicide (P = 0.01) and evidence for association with suicide death (P = 0.03) in a SNP that may be an eQTL in prefrontal cortex (rs424740, P = 0.02). These preliminary results implicate GABRG2 in suicide and warrant further investigation and replication in larger samples. © 2016 Wiley Periodicals, Inc.

  10. DNA fingerprinting of Chinese melon provides evidentiary support of seed quality appraisal.

    PubMed

    Gao, Peng; Ma, Hongyan; Luan, Feishi; Song, Haibin

    2012-01-01

    Melon, Cucumis melo L. is an important vegetable crop worldwide. At present, there are phenomena of homonyms and synonyms present in the melon seed markets of China, which could cause variety authenticity issues influencing the process of melon breeding, production, marketing and other aspects. Molecular markers, especially microsatellites or simple sequence repeats (SSRs) are playing increasingly important roles for cultivar identification. The aim of this study was to construct a DNA fingerprinting database of major melon cultivars, which could provide a possibility for the establishment of a technical standard system for purity and authenticity identification of melon seeds. In this study, to develop the core set SSR markers, 470 polymorphic SSRs were selected as the candidate markers from 1219 SSRs using 20 representative melon varieties (lines). Eighteen SSR markers, evenly distributed across the genome and with the highest contents of polymorphism information (PIC) were identified as the core marker set for melon DNA fingerprinting analysis. Fingerprint codes for 471 melon varieties (lines) were established. There were 51 materials which were classified into17 groups based on sharing the same fingerprint code, while field traits survey results showed that these plants in the same group were synonyms because of the same or similar field characters. Furthermore, DNA fingerprinting quick response (QR) codes of 471 melon varieties (lines) were constructed. Due to its fast readability and large storage capacity, QR coding melon DNA fingerprinting is in favor of read convenience and commercial applications.

  11. DNA Fingerprinting of Chinese Melon Provides Evidentiary Support of Seed Quality Appraisal

    PubMed Central

    Gao, Peng; Ma, Hongyan; Luan, Feishi; Song, Haibin

    2012-01-01

    Melon, Cucumis melo L. is an important vegetable crop worldwide. At present, there are phenomena of homonyms and synonyms present in the melon seed markets of China, which could cause variety authenticity issues influencing the process of melon breeding, production, marketing and other aspects. Molecular markers, especially microsatellites or simple sequence repeats (SSRs) are playing increasingly important roles for cultivar identification. The aim of this study was to construct a DNA fingerprinting database of major melon cultivars, which could provide a possibility for the establishment of a technical standard system for purity and authenticity identification of melon seeds. In this study, to develop the core set SSR markers, 470 polymorphic SSRs were selected as the candidate markers from 1219 SSRs using 20 representative melon varieties (lines). Eighteen SSR markers, evenly distributed across the genome and with the highest contents of polymorphism information (PIC) were identified as the core marker set for melon DNA fingerprinting analysis. Fingerprint codes for 471 melon varieties (lines) were established. There were 51 materials which were classified into17 groups based on sharing the same fingerprint code, while field traits survey results showed that these plants in the same group were synonyms because of the same or similar field characters. Furthermore, DNA fingerprinting quick response (QR) codes of 471 melon varieties (lines) were constructed. Due to its fast readability and large storage capacity, QR coding melon DNA fingerprinting is in favor of read convenience and commercial applications. PMID:23285039

  12. New Insights into the Lake Chad Basin Population Structure Revealed by High-Throughput Genotyping of Mitochondrial DNA Coding SNPs

    PubMed Central

    Černý, Viktor; Carracedo, Ángel

    2011-01-01

    Background Located in the Sudan belt, the Chad Basin forms a remarkable ecosystem, where several unique agricultural and pastoral techniques have been developed. Both from an archaeological and a genetic point of view, this region has been interpreted to be the center of a bidirectional corridor connecting West and East Africa, as well as a meeting point for populations coming from North Africa through the Saharan desert. Methodology/Principal Findings Samples from twelve ethnic groups from the Chad Basin (n = 542) have been high-throughput genotyped for 230 coding region mitochondrial DNA (mtDNA) Single Nucleotide Polymorphisms (mtSNPs) using Matrix-Assisted Laser Desorption/Ionization Time-Of-Flight (MALDI-TOF) mass spectrometry. This set of mtSNPs allowed for much better phylogenetic resolution than previous studies of this geographic region, enabling new insights into its population history. Notable haplogroup (hg) heterogeneity has been observed in the Chad Basin mirroring the different demographic histories of these ethnic groups. As estimated using a Bayesian framework, nomadic populations showed negative growth which was not always correlated to their estimated effective population sizes. Nomads also showed lower diversity values than sedentary groups. Conclusions/Significance Compared to sedentary population, nomads showed signals of stronger genetic drift occurring in their ancestral populations. These populations, however, retained more haplotype diversity in their hypervariable segments I (HVS-I), but not their mtSNPs, suggesting a more ancestral ethnogenesis. Whereas the nomadic population showed a higher Mediterranean influence signaled mainly by sub-lineages of M1, R0, U6, and U5, the other populations showed a more consistent sub-Saharan pattern. Although lifestyle may have an influence on diversity patterns and hg composition, analysis of molecular variance has not identified these differences. The present study indicates that analysis of mtSNPs at high resolution could be a fast and extensive approach for screening variation in population studies where labor-intensive techniques such as entire genome sequencing remain unfeasible. PMID:21533064

  13. Accuracy of various human NAT2 SNP genotyping panels to infer rapid, intermediate and slow acetylator phenotypes

    PubMed Central

    Hein, David W; Doll, Mark A

    2012-01-01

    Aim Humans exhibit genetic polymorphism in NAT2 resulting in rapid, intermediate and slow acetylator phenotypes. Over 65 NAT2 variants possessing one or more SNPs in the 870-bp NAT2 coding region have been reported. The seven most frequent SNPs are rs1801279 (191G>A), rs1041983 (282C>T), rs1801280 (341T>C), rs1799929 (481C>T), rs1799930 (590G>A), rs1208 (803A>G) and rs1799931 (857G>A). The majority of studies investigate the NAT2 genotype assay for three SNPs: 481C>T, 590G>A and 857G>A. A tag-SNP (rs1495741) recently identified in a genome-wide association study has also been proposed as a biomarker for the NAT2 phenotype. Materials & methods Sulfamethazine N-acetyltransferase catalytic activities were measured in cryopreserved human hepatocytes from a convenience sample of individuals in the USA with an ethnic frequency similar to the 2010 US population census. These activities were segregated by the tag-SNP rs1495741 and each of the seven SNPs described above. We assessed the accuracy of the tag-SNP and various two-, three-, four- and seven-SNP genotyping panels for their ability to accurately infer NAT2 phenotype. Results The accuracy of the various NAT2 SNP genotype panels to infer NAT2 phenotype were as follows: seven-SNP: 98.4%; tag-SNP: 77.7%; two-SNP: 96.1%; three-SNP: 92.2%; and four-SNP: 98.4%. Conclusion A NAT2 four-SNP genotype panel of rs1801279 (191G>A), rs1801280 (341T>C), rs1799930 (590G>A) and rs1799931 (857G>A) infers NAT2 acetylator phenotype with high accuracy, and is recommended over the tag-, two-, three- and (for economy of scale) the seven-SNP genotyping panels, particularly in populations of non-European ancestry. PMID:22092036

  14. Common Polymorphisms in the Solute Carrier SLC30A10 are Associated With Blood Manganese and Neurological Function

    PubMed Central

    Kippler, Maria; Alhamdow, Ayman; Rahman, Syed Moshfiqur; Smith, Donald R.; Vahter, Marie; Lucchini, Roberto G.; Broberg, Karin

    2016-01-01

    Manganese (Mn) is an essential nutrient in humans, but excessive exposure to Mn may cause neurotoxicity. Despite homeostatic regulation, Mn concentrations in blood vary considerably among individuals. We evaluated if common single-nucleotide polymorphisms (SNPs) in SLC30A10, which likely encodes an Mn transporter, influence blood Mn concentrations and neurological function. We measured blood Mn concentrations by ICP-MS or atomic absorption spectroscopy and genotyped 2 SLC30A10 non-coding SNPs (rs2275707 and rs12064812) by TaqMan PCR in cohorts from Bangladesh (N = 406), the Argentinean Andes (N = 198), and Italy (N = 238). We also measured SLC30A10 expression in whole blood by TaqMan PCR in a sub-group (N = 101) from the Andean cohort, and neurological parameters (sway velocity and finger-tapping speed) in the Italian cohort. The rs2275707 variant allele was associated with increased Mn concentrations in the Andes (8%, P = .027) and Italy (10.6%, P = .012), but not as clear in Bangladesh (3.4%, P = .21; linear regression analysis adjusted for age, gender, and plasma ferritin). This allele was also associated with increased sway velocity (15%, P = .033; adjusted for age and sex) and reduced SLC30A10 expression (−24.6%, P = .029). In contrast, the rs12064812 variant homozygous genotype was associated with reduced Mn concentrations, particularly in the Italian cohort (−18.4%, P = .04), and increased finger-tapping speed (8.7%, P = .025). We show that common SNPs in SLC30A10 are associated with blood Mn concentrations in 3 unrelated cohorts and that their influence may be mediated by altered SLC30A10 expression. Moreover, the SNPs appeared to influence neurological functions independent of blood Mn concentrations, suggesting that SLC30A10 could regulate brain Mn levels. PMID:26628504

  15. New insights into mitogenomic phylogeny and copy number in eight indigenous sheep populations based on the ATP synthase and cytochrome c oxidase genes.

    PubMed

    Xiao, P; Niu, L L; Zhao, Q J; Chen, X Y; Wang, L J; Li, L; Zhang, H P; Guo, J Z; Xu, H Y; Zhong, T

    2017-11-16

    The origins and phylogeny of different sheep breeds has been widely studied using polymorphisms within the mitochondrial hypervariable region. However, little is known about the mitochondrial DNA (mtDNA) content and phylogeny based on mtDNA protein-coding genes. In this study, we assessed the phylogeny and copy number of the mtDNA in eight indigenous (population size, n=184) and three introduced (n=66) sheep breeds in China based on five mitochondrial coding genes (COX1, COX2, ATP8, ATP6 and COX3). The mean haplotype and nucleotide diversities were 0.944 and 0.00322, respectively. We identified a correlation between the lineages distribution and the genetic distance, whereby Valley-type Tibetan sheep had a closer genetic relationship with introduced breeds (Dorper, Poll Dorset and Suffolk) than with other indigenous breeds. Similarly, the Median-joining profile of haplotypes revealed the distribution of clusters according to genetic differences. Moreover, copy number analysis based on the five mitochondrial coding genes was affected by the genetic distance combining with genetic phylogeny; we also identified obvious non-synonymous mutations in ATP6 between the different levels of copy number expressions. These results imply that differences in mitogenomic compositions resulting from geographical separation lead to differences in mitochondrial function.

  16. Variants in intron 13 of the ELMO1 gene are associated with diabetic nephropathy in African Americans

    PubMed Central

    Leak, T. S.; Perlegas, P.S.; Smith, S.G.; Keene, K.L.; Hicks, P.J.; Langefeld, C.D.; Mychaleckyj, J.C.; Rich, S.S.; Kirk, J.K.; Freedman, B.I.; Bowden, D.W.; Sale, M.M.

    2009-01-01

    Variants in the engulfment and cell motility 1 (ELMO1) gene are associated with nephropathy due to type 2 diabetes mellitus (T2DM) in a Japanese cohort. We comprehensively evaluated this gene in African American (AA) T2DM patients with end-stage renal disease (ESRD). Three hundred nine HapMap tagging SNPs and 9 reportedly associated SNPs were genotyped in 577 AA T2DM-ESRD patients and 596 AA non-diabetic controls, plus 43 non-diabetic European American controls and 45 Yoruba Nigerian samples for admixture adjustment. Replication analyses were conducted in 558 AAs with T2DM-ESRD and 564 controls without diabetes. Extension analyses included 328 AA with T2DM lacking nephropathy and 326 with non-diabetic ESRD. The original and replication analyses confirmed association with four SNPs in intron 13 (permutation p-values for combined analyses = 0.001-0.003), one in intron 1 (P=0.004) and one in intron 5 (P=0.002) with T2DM-associated ESRD. In a subsequent combined analysis of all 1,135 T2DM-ESRD cases and 1,160 controls, an additional 7 intron 13 SNPs produced evidence of association (P = 3.5×10-5 – P=0.05). No associations were seen with these SNPs in those with T2DM lacking nephropathy or with ESRD due to non-diabetic causes. Variants in intron 13 of the ELMO1 gene appear to confer risk for diabetic nephropathy in AA. PMID:19183347

  17. Genetic variations and patient-reported quality of life among patients with lung cancer.

    PubMed

    Sloan, Jeff A; de Andrade, Mariza; Decker, Paul; Wampfler, Jason; Oswold, Curtis; Clark, Matthew; Yang, Ping

    2012-05-10

    Recent evidence has suggested a relationship between the baseline quality of life (QOL) self-reported by patients with cancer and genetic disposition. We report an analysis exploring relationships among baseline QOL assessments and candidate genetic variations in a large cohort of patients with lung cancer. QOL data were provided by 1,299 patients with non-small-cell lung cancer observed at the Mayo Clinic between 1997 and 2007. Overall QOL and subdomains were assessed by either Lung Cancer Symptom Scale or Linear Analog Self Assessment measures; scores were transformed to a scale of 0 to 10, with higher scores representing better status. Baseline QOL scores assessed within 1 year of diagnosis were dichotomized as clinically deficient (CD) or not. A total of 470 single nucleotide polymorphisms (SNPs) in 56 genes of three biologic pathways were assessed for association with QOL measures. Logistic regression with training/validation samples was used to test the association of SNPs with CD QOL. Six SNPs on four genes were replicated using our split schemes. Three SNPs in the MGMT gene (adjusted analysis, rs3858300; unadjusted analysis, rs10741191 and rs3852507) from DNA repair pathway were associated with overall QOL. Two SNPs (rs2287396 [GSTZ1] and rs9524885 [ABCC4]) from glutathione metabolic pathway were associated with fatigue in unadjusted analysis. In adjusted analysis, two SNPs (rs2756109 [ABCC2] and rs9524885 [ABCC4]) from glutathione metabolic pathway were associated with pain. We identified three SNPs in three glutathione metabolic pathway genes and three SNPs in two DNA repair pathway genes associated with QOL measures in patients with non-small-cell lung cancer.

  18. Identification of Conflicting Selective Effects on Highly Expressed Genes

    PubMed Central

    Higgs, Paul G.; Hao, Weilong; Golding, G. Brian

    2007-01-01

    Many different selective effects on DNA and proteins influence the frequency of codons and amino acids in coding sequences. Selection is often stronger on highly expressed genes. Hence, by comparing high- and low-expression genes it is possible to distinguish the factors that are selected by evolution. It has been proposed that highly expressed genes should (i) preferentially use codons matching abundant tRNAs (translational efficiency), (ii) preferentially use amino acids with low cost of synthesis, (iii) be under stronger selection to maintain the required amino acid content, and (iv) be selected for translational robustness. These effects act simultaneously and can be contradictory. We develop a model that combines these factors, and use Akaike’s Information Criterion for model selection. We consider pairs of paralogues that arose by whole-genome duplication in Saccharmyces cerevisiae. A codon-based model is used that includes asymmetric effects due to selection on highly expressed genes. The largest effect is translational efficiency, which is found to strongly influence synonymous, but not non-synonymous rates. Minimization of the cost of amino acid synthesis is implicated. However, when a more general measure of selection for amino acid usage is used, the cost minimization effect becomes redundant. Small effects that we attribute to selection for translational robustness can be identified as an improvement in the model fit on top of the effects of translational efficiency and amino acid usage. PMID:19430600

  19. Positive selection on the killer whale mitogenome

    PubMed Central

    Foote, Andrew D.; Morin, Phillip A.; Durban, John W.; Pitman, Robert L.; Wade, Paul; Willerslev, Eske; Gilbert, M. Thomas P.; da Fonseca, Rute R.

    2011-01-01

    Mitochondria produce up to 95 per cent of the eukaryotic cell's energy. The coding genes of the mitochondrial DNA may therefore evolve under selection owing to metabolic requirements. The killer whale, Orcinus orca, is polymorphic, has a global distribution and occupies a range of ecological niches. It is therefore a suitable organism for testing this hypothesis. We compared a global dataset of the complete mitochondrial genomes of 139 individuals for amino acid changes that were associated with radical physico-chemical property changes and were influenced by positive selection. Two such selected non-synonymous amino acid changes were found; one in each of two ecotypes that inhabit the Antarctic pack ice. Both substitutions were associated with changes in local polarity, increased steric constraints and α-helical tendencies that could influence overall metabolic performance, suggesting a functional change. PMID:20810427

  20. Non-additive genetic variation in growth, carcass and fertility traits of beef cattle.

    PubMed

    Bolormaa, Sunduimijid; Pryce, Jennie E; Zhang, Yuandan; Reverter, Antonio; Barendse, William; Hayes, Ben J; Goddard, Michael E

    2015-04-02

    A better understanding of non-additive variance could lead to increased knowledge on the genetic control and physiology of quantitative traits, and to improved prediction of the genetic value and phenotype of individuals. Genome-wide panels of single nucleotide polymorphisms (SNPs) have been mainly used to map additive effects for quantitative traits, but they can also be used to investigate non-additive effects. We estimated dominance and epistatic effects of SNPs on various traits in beef cattle and the variance explained by dominance, and quantified the increase in accuracy of phenotype prediction by including dominance deviations in its estimation. Genotype data (729 068 real or imputed SNPs) and phenotypes on up to 16 traits of 10 191 individuals from Bos taurus, Bos indicus and composite breeds were used. A genome-wide association study was performed by fitting the additive and dominance effects of single SNPs. The dominance variance was estimated by fitting a dominance relationship matrix constructed from the 729 068 SNPs. The accuracy of predicted phenotypic values was evaluated by best linear unbiased prediction using the additive and dominance relationship matrices. Epistatic interactions (additive × additive) were tested between each of the 28 SNPs that are known to have additive effects on multiple traits, and each of the other remaining 729 067 SNPs. The number of significant dominance effects was greater than expected by chance and most of them were in the direction that is presumed to increase fitness and in the opposite direction to inbreeding depression. Estimates of dominance variance explained by SNPs varied widely between traits, but had large standard errors. The median dominance variance across the 16 traits was equal to 5% of the phenotypic variance. Including a dominance deviation in the prediction did not significantly increase its accuracy for any of the phenotypes. The number of additive × additive epistatic effects that were statistically significant was greater than expected by chance. Significant dominance and epistatic effects occur for growth, carcass and fertility traits in beef cattle but they are difficult to estimate precisely and including them in phenotype prediction does not increase its accuracy.

  1. The Impact of Polymorphic Variations in the 5p15, 6p12, 6p21 and 15q25 Loci on the Risk and Prognosis of Portuguese Patients with Non-Small Cell Lung Cancer

    PubMed Central

    de Mello, Ramon Andrade; Ferreira, Mónica; Soares-Pires, Filipa; Costa, Sandra; Cunha, João; Oliveira, Pedro; Hespanhol, Venceslau; Reis, Rui Manuel

    2013-01-01

    Introduction Polymorphic variants in the 5p15, 6p12, 6p21, and 15q25 loci were demonstrated to potentially contribute to lung cancer carcinogenesis. Therefore, this study was performed to assess the role of those variants in non-small cell lung cancer (NSCLC) risk and prognosis in a Portuguese population. Materials and Methods Blood from patients with NSCLC was prospectively collected. To perform an association study, DNA from these patients and healthy controls were genotyped for a panel of 19 SNPs using a Sequenom® MassARRAY platform. Kaplan-Meier curves were used to assess the overall survival (OS) and progression-free survival (PFS). Results One hundred and forty-four patients with NSCLC were successfully consecutively genotyped for the 19 SNPs. One SNP was associated with NSCLC risk: rs9295740 G/A. Two SNPs were associated with non-squamous histology: rs3024994 (VEGF intron 2) T/C and rs401681 C/T. Three SNPs were associated with response rate: rs3025035 (VEGF intron 7) C/T, rs833061 (VEGF –460) C/T and rs9295740 G/A. One SNP demonstrated an influence on PFS: rs401681 C/T at 5p15, p = 0.021. Four SNPs demonstrated an influence on OS: rs2010963 (VEGF +405 G/C), p = 0.042; rs3025010 (VEGF intron 5 C/T), p = 0.047; rs401681 C/T at 5p15, p = 0.046; and rs31489 C/A at 5p15, p = 0.029. Conclusions Our study suggests that SNPs in the 6p12, 6p21, and 5p15 loci may serve as risk, predictive and prognostic NSCLC biomarkers. In the future, SNPs identified in the genomes of patients may improve NSCLC screening strategies and therapeutic management as well. PMID:24039754

  2. Lactobacillus cypricasei Lawson et al. 2001 is a later heterotypic synonym of Lactobacillus acidipiscis Tanasupawat et al. 2000.

    PubMed

    Naser, Sabri M; Vancanneyt, Marc; Hoste, Bart; Snauwaert, Cindy; Swings, Jean

    2006-07-01

    The applicability of a multilocus sequence analysis (MLSA)-based identification system for lactobacilli was evaluated. Two housekeeping genes that code for the phenylalanyl-tRNA synthase alpha-subunit (pheS) and RNA polymerase alpha-subunit (rpoA) were sequenced and analysed for members of the Lactobacillus salivarius species group. The type strains of Lactobacillus acidipiscis and Lactobacillus cypricasei were investigated further using a third gene that encodes the alpha-subunit of ATP synthase (atpA). The MLSA data revealed close relatedness between L. acidipiscis and L. cypricasei, with 99.8-100 % pheS, rpoA and atpA gene sequence similarities. Comparison of the 16S rRNA gene sequences of the type strains of the two species confirmed the close relatedness (99.8 % gene sequence similarity) between the two taxa. Similar phenotypes and high DNA-DNA binding values in the range of 84 to 97.5 % confirmed that L. acidipiscis and L. cypricasei are synonymous species. On the basis of the present study, it is proposed that Lactobacillus cypricasei is a later heterotypic synonym of Lactobacillus acidipiscis.

  3. Streptomyces phaeopurpureus Shinobu 1957 (Approved Lists 1980) and Streptomyces griseorubiginosus (Ryabova and Preobrazhenskaya 1957) Pridham et al. 1958 (Approved Lists 1980) are heterotypic subjective synonyms.

    PubMed

    Kämpfer, Peter; Rückert, Christian; Blom, Jochen; Goesmann, Alexander; Wink, Joachim; Kalinowski, Jörn; Glaeser, Stefanie P

    2017-08-01

    On the basis of whole genome comparisons of Streptomyces griseorubiginosus and Streptomyces phaeopurpureus it could by shown that these two species are subjective synonyms. The names of both species have been published in the Approved Lists of Bacterial Names and, in such a case, normally Rule 24b (1) of the Prokaryotic Code applies, which reads: 'If two names compete for priority and if both names date from 1 January 1980 on an Approved List, the priority shall be determined by the date of the original publication of the name before 1 January 1980'. Streptomyces griseorubiginosus and Streptomyces phaeopurpureus were both effectively published in 1957, and for both publications, the exact date cannot be obtained. In this case a further statement of Rule 24 applies, which reads: 'If the names or epithets are of the same date, the author who first unites the taxa has the right to choose one of them, and his choice must be followed.' Hence we propose that Streptomyces phaeopurpureus is a later heterotypic subjective synonym of Streptomyces griseorubiginosus.

  4. Emergent rules for codon choice elucidated by editing rare arginine codons in Escherichia coli

    PubMed Central

    Napolitano, Michael G.; Landon, Matthieu; Gregg, Christopher J.; Lajoie, Marc J.; Govindarajan, Lakshmi; Mosberg, Joshua A.; Kuznetsov, Gleb; Goodman, Daniel B.; Vargas-Rodriguez, Oscar; Isaacs, Farren J.; Söll, Dieter; Church, George M.

    2016-01-01

    The degeneracy of the genetic code allows nucleic acids to encode amino acid identity as well as noncoding information for gene regulation and genome maintenance. The rare arginine codons AGA and AGG (AGR) present a case study in codon choice, with AGRs encoding important transcriptional and translational properties distinct from the other synonymous alternatives (CGN). We created a strain of Escherichia coli with all 123 instances of AGR codons removed from all essential genes. We readily replaced 110 AGR codons with the synonymous CGU codons, but the remaining 13 “recalcitrant” AGRs required diversification to identify viable alternatives. Successful replacement codons tended to conserve local ribosomal binding site-like motifs and local mRNA secondary structure, sometimes at the expense of amino acid identity. Based on these observations, we empirically defined metrics for a multidimensional “safe replacement zone” (SRZ) within which alternative codons are more likely to be viable. To evaluate synonymous and nonsynonymous alternatives to essential AGRs further, we implemented a CRISPR/Cas9-based method to deplete a diversified population of a wild-type allele, allowing us to evaluate exhaustively the fitness impact of all 64 codon alternatives. Using this method, we confirmed the relevance of the SRZ by tracking codon fitness over time in 14 different genes, finding that codons that fall outside the SRZ are rapidly depleted from a growing population. Our unbiased and systematic strategy for identifying unpredicted design flaws in synthetic genomes and for elucidating rules governing codon choice will be crucial for designing genomes exhibiting radically altered genetic codes. PMID:27601680

  5. Bridging two scholarly islands enriches both: COI DNA barcodes for species identification versus human mitochondrial variation for the study of migrations and pathologies.

    PubMed

    Thaler, David S; Stoeckle, Mark Y

    2016-10-01

    DNA barcodes for species identification and the analysis of human mitochondrial variation have developed as independent fields even though both are based on sequences from animal mitochondria. This study finds questions within each field that can be addressed by reference to the other. DNA barcodes are based on a 648-bp segment of the mitochondrially encoded cytochrome oxidase I. From most species, this segment is the only sequence available. It is impossible to know whether it fairly represents overall mitochondrial variation. For modern humans, the entire mitochondrial genome is available from thousands of healthy individuals. SNPs in the human mitochondrial genome are evenly distributed across all protein-encoding regions arguing that COI DNA barcode is representative. Barcode variation among related species is largely based on synonymous codons. Data on human mitochondrial variation support the interpretation that most - possibly all - synonymous substitutions in mitochondria are selectively neutral. DNA barcodes confirm reports of a low variance in modern humans compared to nonhuman primates. In addition, DNA barcodes allow the comparison of modern human variance to many other extant animal species. Birds are a well-curated group in which DNA barcodes are coupled with census and geographic data. Putting modern human variation in the context of intraspecies variation among birds shows humans to be a single breeding population of average variance.

  6. Genetic Variants of BMP2 and Their Association with the Risk of Non-Syndromic Tooth Agenesis.

    PubMed

    Lu, Yun; Qian, Yajing; Zhang, Jinglu; Gong, Miao; Wang, Yuting; Gu, Ning; Ma, Lan; Xu, Min; Ma, Junqing; Zhang, Weibing; Pan, Yongchu; Wang, Lin

    2016-01-01

    Non-syndromic tooth agenesis (or non-syndromic congenitally missing tooth) is one of the most common congenital defects in humans affecting the craniofacial function and appearance. Single nucleotide polymorphisms (SNPs) have been associated with an individual's susceptibility to these anomalies. The aim of the present study was therefore to investigate the roles of the potentially functional SNPs of BMP2 in the occurrence of tooth agenesis. Overall, four potentially functional SNPs of BMP2 (rs15705, rs235768, rs235769 and rs3178250) were selected, and their associations with the susceptibility of tooth agenesis were evaluated in a case-control study of 335 non-syndromic tooth agenesis cases and 444 healthy controls. The SNPs rs15705 and rs3178250 were found to be associated with an individual's risk of tooth agenesis (P = 0.046 and P = 0.039, respectively). Both SNPs showed an increased risk of mandibular incisor agenesis (rs15705, AA/AC vs. CC = 1.58, 95% CI = [1.06-2.34], P = 0.024; rs3178250, TT/TC vs. CC = 1.60, 95% CI = [1.08-2.37], P = 0.020). Bioinformatics analysis indicated that these two SNPs located at the 3'-untranslated region (3'-UTR) of BMP2 might alter the binding ability of miR-1273d and miR-4639-5p, respectively, which was confirmed by luciferase activity assays in the 293A and COS7 cell lines (P < 0.001 in 293A and P < 0.01 in COS7 for miR-1273d; and P < 0.001 in both cells for miR-4639-5p). Furthermore, BMP2 mRNA expression decreased after transfecting either miR-1273d or miR-4639-5p into these two cell lines (P < 0.01 in 293A and P < 0.001 in COS7 for miR-1273d, and P < 0.01 in both cell lines for miR-4639-5p). Taken together, our findings indicate that rs15705 and rs317250 are associated with the susceptibility of non-syndromic tooth agenesis by possibly affecting miRNAs and mRNA interaction.

  7. PVRL1 as a Candidate Gene for Nonsyndromic Cleft Lip With or Without Cleft Palate: No Evidence for the Involvement of Common or Rare Variants in Southern Han Chinese Patients

    PubMed Central

    Cheng, Hong-Qiu; Huang, En-Min; Xu, Ming-Yan; Shu, Shen-You

    2012-01-01

    The poliovirus receptor related-1 (PVRL1) gene encodes nectin-1, a cell–cell adhesion molecule (OMIM #600644), and is mutated in the cleft lip with or without cleft palate/ectodermal dysplasia-1 syndrome (CLPED1, OMIM #225000). In addition, PVRL1 mutations have been associated with nonsyndromic cleft lip with or without a cleft palate (NSCL/P) in studies of multiethnic samples. To investigate the possible involvement of this gene in southern Han Chinese NSCL/P patients, we performed (i) a case–control association study, and (ii) a resequencing study. A set of 470 patients with NSCL/P and 693 controls were recruited, and a total of 45 tagging single-nucleotide polymorphisms (SNPs) were genotyped by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. In the resequencing study, the coding regions of the PVRL1 α isoform were direct sequenced in 45 trios from multiply affected families. One (rs7128327) of the 45 tested SNPs showed a trend toward statistical significance in the genotypic-level chi-square test (p=0.009567). However, this result did not withstand correction for multiple testing. Likewise, sliding window haplotype analyses consisting of two, three, or four SNPs failed to detect any positive association. Resequencing analysis also failed to identify any novel rare sequence variants. In conclusion, the present study provided no support for the hypothesis that common or rare variants in PVRL1 play a significant role in NSCL/P development in the southern Han Chinese population. This is the first study that has used tagging SNPs covering all the coding and noncoding regions to search for common NSCL/P-associated mutations of PVRL1. PMID:22455396

  8. Effect of OPRM1 and stressful life events on symptoms of major depression in African American adolescents.

    PubMed

    Swann, Gregory; Byck, Gayle R; Dick, Danielle M; Aliev, Fazil; Latendresse, Shawn J; Riley, Brien; Kertes, Darlene; Sun, Cuie; Salvatore, Jessica E; Bolland, John; Mustanski, Brian

    2014-06-01

    In a community sample of low-income African American adolescents, we tested the interactive effects of variation in the mu 1 opioid receptor (OPRM1) gene and the occurrence of stressful life events on symptoms of depression. Interactive effects of 24 OPRM1 simple nucleotide polymorphisms (SNP) and adolescent report of stressful life events on depression were tested using multilevel regressions. SNPs were dummy coded to test both additive and dominate forms of coding. Five OPRM1 SNPs showed significant evidence of interaction with stressful life events to alter depression risk (or symptoms) after adjusting for multiple testing and the correlated nature of the SNPs. Follow-up analyses showed significant differences based on OPRM1 genotype at both lower and higher frequencies of stressful life events, suggesting that participants with a copy of the minor allele on OPRM1 SNPs rs524731, rs9478503, rs3778157, rs10485057, and rs511420 have fewer symptoms in low stress conditions but more symptoms in high stress conditions compared to major allele homozygotes. The genetic variants associated with depression in African American adolescents may not translate to other ethnic groups. This study is also limited in that only one gene that functions within a complex biological system is addressed. This current study is the first to find an interaction between OPRM1 and life stress that is associated with depression. It also addressed an understudied population within the behavioral genetics literature. Further research should test additional genes involved in the opioid system and expand the current findings to more diverse samples. Copyright © 2014 Elsevier B.V. All rights reserved.

  9. Estimate of within population incremental selection through branch imbalance in lineage trees

    PubMed Central

    Liberman, Gilad; Benichou, Jennifer I.C.; Maman, Yaakov; Glanville, Jacob; Alter, Idan; Louzoun, Yoram

    2016-01-01

    Incremental selection within a population, defined as limited fitness changes following mutation, is an important aspect of many evolutionary processes. Strongly advantageous or deleterious mutations are detected using the synonymous to non-synonymous mutations ratio. However, there are currently no precise methods to estimate incremental selection. We here provide for the first time such a detailed method and show its precision in multiple cases of micro-evolution. The proposed method is a novel mixed lineage tree/sequence based method to detect within population selection as defined by the effect of mutations on the average number of offspring. Specifically, we propose to measure the log of the ratio between the number of leaves in lineage trees branches following synonymous and non-synonymous mutations. The method requires a high enough number of sequences, and a large enough number of independent mutations. It assumes that all mutations are independent events. It does not require of a baseline model and is practically not affected by sampling biases. We show the method's wide applicability by testing it on multiple cases of micro-evolution. We show that it can detect genes and inter-genic regions using the selection rate and detect selection pressures in viral proteins and in the immune response to pathogens. PMID:26586802

  10. CAPRRESI: Chimera Assembly by Plasmid Recovery and Restriction Enzyme Site Insertion.

    PubMed

    Santillán, Orlando; Ramírez-Romero, Miguel A; Dávila, Guillermo

    2017-06-25

    Here, we present chimera assembly by plasmid recovery and restriction enzyme site insertion (CAPRRESI). CAPRRESI benefits from many strengths of the original plasmid recovery method and introduces restriction enzyme digestion to ease DNA ligation reactions (required for chimera assembly). For this protocol, users clone wildtype genes into the same plasmid (pUC18 or pUC19). After the in silico selection of amino acid sequence regions where chimeras should be assembled, users obtain all the synonym DNA sequences that encode them. Ad hoc Perl scripts enable users to determine all synonym DNA sequences. After this step, another Perl script searches for restriction enzyme sites on all synonym DNA sequences. This in silico analysis is also performed using the ampicillin resistance gene (ampR) found on pUC18/19 plasmids. Users design oligonucleotides inside synonym regions to disrupt wildtype and ampR genes by PCR. After obtaining and purifying complementary DNA fragments, restriction enzyme digestion is accomplished. Chimera assembly is achieved by ligating appropriate complementary DNA fragments. pUC18/19 vectors are selected for CAPRRESI because they offer technical advantages, such as small size (2,686 base pairs), high copy number, advantageous sequencing reaction features, and commercial availability. The usage of restriction enzymes for chimera assembly eliminates the need for DNA polymerases yielding blunt-ended products. CAPRRESI is a fast and low-cost method for fusing protein-coding genes.

  11. Bolbase: a comprehensive genomics database for Brassica oleracea.

    PubMed

    Yu, Jingyin; Zhao, Meixia; Wang, Xiaowu; Tong, Chaobo; Huang, Shunmou; Tehrim, Sadia; Liu, Yumei; Hua, Wei; Liu, Shengyi

    2013-09-30

    Brassica oleracea is a morphologically diverse species in the family Brassicaceae and contains a group of nutrition-rich vegetable crops, including common heading cabbage, cauliflower, broccoli, kohlrabi, kale, Brussels sprouts. This diversity along with its phylogenetic membership in a group of three diploid and three tetraploid species, and the recent availability of genome sequences within Brassica provide an unprecedented opportunity to study intra- and inter-species divergence and evolution in this species and its close relatives. We have developed a comprehensive database, Bolbase, which provides access to the B. oleracea genome data and comparative genomics information. The whole genome of B. oleracea is available, including nine fully assembled chromosomes and 1,848 scaffolds, with 45,758 predicted genes, 13,382 transposable elements, and 3,581 non-coding RNAs. Comparative genomics information is available, including syntenic regions among B. oleracea, Brassica rapa and Arabidopsis thaliana, synonymous (Ks) and non-synonymous (Ka) substitution rates between orthologous gene pairs, gene families or clusters, and differences in quantity, category, and distribution of transposable elements on chromosomes. Bolbase provides useful search and data mining tools, including a keyword search, a local BLAST server, and a customized GBrowse tool, which can be used to extract annotations of genome components, identify similar sequences and visualize syntenic regions among species. Users can download all genomic data and explore comparative genomics in a highly visual setting. Bolbase is the first resource platform for the B. oleracea genome and for genomic comparisons with its relatives, and thus it will help the research community to better study the function and evolution of Brassica genomes as well as enhance molecular breeding research. This database will be updated regularly with new features, improvements to genome annotation, and new genomic sequences as they become available. Bolbase is freely available at http://ocri-genomics.org/bolbase.

  12. Fine-scale genetic mapping of a hybrid sterility factor between Drosophila simulans and D. mauritiana: the varied and elusive functions of "speciation genes".

    PubMed

    Araripe, Luciana O; Montenegro, Horácio; Lemos, Bernardo; Hartl, Daniel L

    2010-12-14

    Hybrid male sterility (HMS) is a usual outcome of hybridization between closely related animal species. It arises because interactions between alleles that are functional within one species may be disrupted in hybrids. The identification of genes leading to hybrid sterility is of great interest for understanding the evolutionary process of speciation. In the current work we used marked P-element insertions as dominant markers to efficiently locate one genetic factor causing a severe reduction in fertility in hybrid males of Drosophila simulans and D. mauritiana. Our mapping effort identified a region of 9 kb on chromosome 3, containing three complete and one partial coding sequences. Within this region, two annotated genes are suggested as candidates for the HMS factor, based on the comparative molecular characterization and public-source information. Gene Taf1 is partially contained in the region, but yet shows high polymorphism with four fixed non-synonymous substitutions between the two species. Its molecular functions involve sequence-specific DNA binding and transcription factor activity. Gene agt is a small, intronless gene, whose molecular function is annotated as methylated-DNA-protein-cysteine S-methyltransferase activity. High polymorphism and one fixed non-synonymous substitution suggest this is a fast evolving gene. The gene trees of both genes perfectly separate D. simulans and D. mauritiana into monophyletic groups. Analysis of gene expression using microarray revealed trends that were similar to those previously found in comparisons between whole-genome hybrids and parental species. The identification following confirmation of the HMS candidate gene will add another case study leading to understanding the evolutionary process of hybrid incompatibility.

  13. The association of six non-synonymous variants in three DNA repair genes with hepatocellular carcinoma risk: a meta-analysis.

    PubMed

    Shi, Yan-Hui; Wang, Bin; Xu, Bai-Ping; Jiang, Dan-Na; Zhao, Dong-Mei; Ji, Man-Ru; Zhou, Li; Li, Xue; Lu, Chang-Zhu

    2016-11-01

    Hepatocellular carcinoma is a complex polygenic disease. Despite the huge advances in genetic epidemiology, it still remains a challenge to unveil the genetic architecture of hepatocellular carcinoma. We, therefore, decided to meta-analytically assess the association of six non-synonymous coding variants from XRCC1, XRCC3 and XPD genes with hepatocellular carcinoma risk by pooling the results of 20 English articles. This meta-analysis was conducted according to the PRISMA statement, and data collection was independently completed in duplicate. In overall analyses, the minor alleles of four variants, Arg280His (odds ratio, 95% confidence interval, P: 1.37, 1.13-1.66, 0.001), Thr241Met (1.93, 1.17-3.20, 0.011), Asp312Asn (1.22, 1.08-1.38, 0.001) and Lys751Gln (1.42, 1.02-1.97, 0.038), were associated with the significant risk for hepatocellular carcinoma. There were low probabilities of publication bias for all variants. Subgroup analyses revealed significant association of XRCC1 gene Arg399Gln with hepatocellular carcinoma in Chinese especially from south China (odds ratio, 95% confidence interval, P: 1.57, 1.16-2.14, 0.004), in larger studies (1.48, 1.11-1.98, 0.007) and in studies with population-based controls (1.33, 1.06-1.68, 0.016). Taken together, our findings demonstrated that XPD gene Asp312Asn and XRCC1 gene Arg399Gln might be candidate susceptibility loci for hepatocellular carcinoma. Considering the ubiquity of genetic heterogeneity, further validation in a broad range of ethnic populations is warranted. © 2016 The Authors. Journal of Cellular and Molecular Medicine published by John Wiley & Sons Ltd and Foundation for Cellular and Molecular Medicine.

  14. Identification and allelic dissection uncover roles of lncRNAs in secondary growth of Populus tomentosa.

    PubMed

    Zhou, Daling; Du, Qingzhang; Chen, Jinhui; Wang, Qingshi; Zhang, Deqiang

    2017-10-01

    Long non-coding RNAs (lncRNAs) function in various biological processes. However, their roles in secondary growth of plants remain poorly understood. Here, 15,691 lncRNAs were identified from vascular cambium, developing xylem, and mature xylem of Populus tomentosa with high and low biomass using RNA-seq, including 1,994 lncRNAs that were differentially expressed (DE) among the six libraries. 3,569 cis-regulated and 3,297 trans-regulated protein-coding genes were predicted as potential target genes (PTGs) of the DE lncRNAs to participate in biological regulation. Then, 476 and 28 lncRNAs were identified as putative targets and endogenous target mimics (eTMs) of Populus known microRNAs (miRNAs), respectively. Genome re-sequencing of 435 individuals from a natural population of P. tomentosa found 34,015 single nucleotide polymorphisms (SNPs) within 178 lncRNA loci and 522 PTGs. Single-SNP associations analysis detected 2,993 associations with 10 growth and wood-property traits under additive and dominance model. Epistasis analysis identified 17,656 epistatic SNP pairs, providing evidence for potential regulatory interactions between lncRNAs and their PTGs. Furthermore, a reconstructed epistatic network, representing interactions of 8 lncRNAs and 15 PTGs, might enrich regulation roles of genes in the phenylpropanoid pathway. These findings may enhance our understanding of non-coding genes in plants. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  15. Genome-wide minor histocompatibility matching as related to the risk of graft-versus-host disease.

    PubMed

    Martin, Paul J; Levine, David M; Storer, Barry E; Warren, Edus H; Zheng, Xiuwen; Nelson, Sarah C; Smith, Anajane G; Mortensen, Bo K; Hansen, John A

    2017-02-09

    The risk of acute graft-versus-host disease (GVHD) is higher after allogeneic hematopoietic cell transplantation (HCT) from unrelated donors as compared with related donors. This difference has been explained by increased recipient mismatching for major histocompatibility antigens or minor histocompatibility antigens. In the current study, we used genome-wide arrays to enumerate single nucleotide polymorphisms (SNPs) that produce graft-versus-host (GVH) amino acid coding differences between recipients and donors. We then tested the hypothesis that higher degrees of genome-wide recipient GVH mismatching correlate with higher risks of GVHD after allogeneic HCT. In HLA-genotypically matched sibling recipients, the average recipient mismatching of coding SNPs was 9.35%. Each 1% increase in genome-wide recipient mismatching was associated with an estimated 20% increase in the hazard of grades III-IV GVHD (hazard ratio [HR], 1.20; 95% confidence interval [CI], 1.05-1.37; P = .007) and an estimated 22% increase in the hazard of stage 2-4 acute gut GVHD (HR, 1.22; 95% CI, 1.02-1.45; P = .03). In HLA-A, B, C, DRB1, DQA1, DQB1, DPA1, DPB1-phenotypically matched unrelated recipients, the average recipient mismatching of coding SNPs was 17.3%. The estimated risks of GVHD-related outcomes in HLA-phenotypically matched unrelated recipients were low, relative to the large difference in genome-wide mismatching between the 2 groups. In contrast, the risks of GVHD-related outcomes were higher in HLA-DP GVH-mismatched unrelated recipients than in HLA-matched sibling recipients. Taken together, these results suggest that the increased GVHD risk after unrelated HCT is predominantly an effect of HLA-mismatching. © 2017 by The American Society of Hematology.

  16. Regulatory single nucleotide polymorphisms (rSNPs) at the promoters 1A and 1B of the human APC gene.

    PubMed

    Matveeva, Marina Yu; Kashina, Elena V; Reshetnikov, Vasily V; Bryzgalov, Leonid O; Antontseva, Elena V; Bondar, Natalia P; Merkulova, Tatiana I

    2016-12-22

    Germline mutations in the coding sequence of the tumour suppressor APC gene give rise to familial adenomatous polyposis (which leads to colorectal cancer) and are associated with many other oncopathologies. The loss of APC function because of deletion of putative promoter 1A or 1B also results in the development of colorectal cancer. Since the regions of promoters 1A and 1B contain many single nucleotide polymorphisms (SNPs), the aim of this study was to perform functional analysis of some of these SNPs by means of an electrophoretic mobility shift assay (EMSA) and a luciferase reporter assay. First, it was shown that both putative promoters of APC (1A and 1B) drive transcription in an in vitro reporter experiment. From eleven randomly selected SNPs of promoter 1A and four SNPs of promoter 1B, nine and two respectively showed differential patterns of binding of nuclear proteins to oligonucleotide probes corresponding to alternative alleles. The luciferase reporter assay showed that among the six SNPs tested, the rs75612255 C allele and rs113017087 C allele in promoter 1A as well as the rs138386816 T allele and rs115658307 T allele in promoter 1B significantly increased luciferase activity in the human erythromyeloblastoid leukaemia cell line K562. In human colorectal cancer HCT-116 cells, none of the substitutions under study had any effect, with the exception of minor allele G of rs79896135 in promoter 1B. This allele significantly decreased the luciferase reporter's activity CONCLUSION: Our results indicate that many SNPs in APC promoters 1A and 1B are functionally relevant and that allele G of rs79896135 may be associated with the predisposition to colorectal cancer.

  17. Multifaceted Genomic Risk for Brain Function in Schizophrenia

    PubMed Central

    Chen, Jiayu; Calhoun, Vince D.; Pearlson, Godfrey D.; Ehrlich, Stefan; Turner, Jessica A.; Ho, Beng-Choon; Wassink, Thomas H.; Michael, Andrew M; Liu, Jingyu

    2012-01-01

    Recently, deriving candidate endophenotypes from brain imaging data has become a valuable approach to study genetic influences on schizophrenia (SZ), whose pathophysiology remains unclear. In this work we utilized a multivariate approach, parallel independent component analysis, to identify genomic risk components associated with brain function abnormalities in SZ. 5157 candidate single nucleotide polymorphisms (SNPs) were derived from genome-wide array based on their possible connections with SZ and further investigated for their associations with brain activations captured with functional magnetic resonance imaging (fMRI) during a sensorimotor task. Using data from 92 SZ patients and 116 healthy controls, we detected a significant correlation (r= 0.29; p= 2.41×10−5) between one fMRI component and one SNP component, both of which significantly differentiated patients from controls. The fMRI component mainly consisted of precentral and postcentral gyri, the major activated regions in the motor task. On average, higher activation in these regions was observed in participants with higher loadings of the linked SNP component, predominantly contributed to by 253 SNPs. 138 identified SNPs were from known coding regions of 100 unique genes. 31 identified SNPs did not differ between groups, but moderately correlated with some other group-discriminating SNPs, indicating interactions among alleles contributing towards elevated SZ susceptibility. The genes associated with the identified SNPs participated in four neurotransmitter pathways: GABA receptor signaling, dopamine receptor signaling, neuregulin signaling and glutamate receptor signaling. In summary, our work provides further evidence for the complexity of genomic risk to the functional brain abnormality in SZ and suggests a pathological role of interactions between SNPs, genes and multiple neurotransmitter pathways. PMID:22440650

  18. Machine learning shows association between genetic variability in PPARG and cerebral connectivity in preterm infants

    PubMed Central

    Krishnan, Michelle L.; Wang, Zi; Aljabar, Paul; Ball, Gareth; Mirza, Ghazala; Saxena, Alka; Counsell, Serena J.; Hajnal, Joseph V.; Montana, Giovanni

    2017-01-01

    Preterm infants show abnormal structural and functional brain development, and have a high risk of long-term neurocognitive problems. The molecular and cellular mechanisms involved are poorly understood, but novel methods now make it possible to address them by examining the relationship between common genetic variability and brain endophenotype. We addressed the hypothesis that variability in the Peroxisome Proliferator Activated Receptor (PPAR) pathway would be related to brain development. We employed machine learning in an unsupervised, unbiased, combined analysis of whole-brain diffusion tractography together with genomewide, single-nucleotide polymorphism (SNP)-based genotypes from a cohort of 272 preterm infants, using Sparse Reduced Rank Regression (sRRR) and correcting for ethnicity and age at birth and imaging. Empirical selection frequencies for SNPs associated with cerebral connectivity ranged from 0.663 to zero, with multiple highly selected SNPs mapping to genes for PPARG (six SNPs), ITGA6 (four SNPs), and FXR1 (two SNPs). SNPs in PPARG were significantly overrepresented (ranked 7–11 and 67 of 556,000 SNPs; P < 2.2 × 10−7), and were mostly in introns or regulatory regions with predicted effects including protein coding and nonsense-mediated decay. Edge-centric graph-theoretic analysis showed that highly selected white-matter tracts were consistent across the group and important for information transfer (P < 2.2 × 10−17); they most often connected to the insula (P < 6 × 10−17). These results suggest that the inhibited brain development seen in humans exposed to the stress of a premature extrauterine environment is modulated by genetic factors, and that PPARG signaling has a previously unrecognized role in cerebral development. PMID:29229843

  19. Resequencing of IRS2 reveals rare variants for obesity but not fasting glucose homeostasis in Hispanic children.

    PubMed

    Butte, Nancy F; Voruganti, V Saroja; Cole, Shelley A; Haack, Karin; Comuzzie, Anthony G; Muzny, Donna M; Wheeler, David A; Chang, Kyle; Hawes, Alicia; Gibbs, Richard A

    2011-09-22

    Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5' and 3' flanking regions of IRS2 (∼14.5 kb), were bidirectionally sequenced for single nucleotide polymorphism (SNP) discovery in 934 Hispanic children using 3730XL DNA Sequencers. Additionally, 15 SNPs derived from Illumina HumanOmni1-Quad BeadChips were analyzed. Measured genotype analysis tested associations between SNPs and obesity and diabetes-related traits. Bayesian quantitative trait nucleotide analysis was used to statistically infer the most likely functional polymorphisms. A total of 140 SNPs were identified with minor allele frequencies (MAF) ranging from 0.001 to 0.47. Forty-two of the 70 coding SNPs result in nonsynonymous amino acid substitutions relative to the consensus sequence; 28 SNPs were detected in the promoter, 12 in introns, 28 in the 3'-UTR, and 2 in the 5'-UTR. Two insertion/deletions (indels) were detected. Ten independent rare SNPs (MAF = 0.001-0.009) were associated with obesity-related traits (P = 0.01-0.00002). SNP 10510452_139 in the promoter region was shown to have a high posterior probability (P = 0.77-0.86) of influencing BMI, fat mass, and waist circumference in Hispanic children. SNP 10510452_139 contributed between 2 and 4% of the population variance in body weight and composition. None of the SNPs or indels were associated with diabetes-related traits or accounted for a previously identified quantitative trait locus on chromosome 13 for fasting serum glucose. Rare but not common IRS2 variants may play a role in the regulation of body weight but not an essential role in fasting glucose homeostasis in Hispanic children.

  20. Associations between Cytokine/Cytokine Receptor SNPs and Humoral Immunity to Measles, Mumps and Rubella in a Somali Population

    PubMed Central

    Dhiman, Neelam; Ovsyannikova, Inna G.; Vierkant, Robert A.; Pankratz, V. Shane; Jacobson, Robert M.; Poland, Gregory A.

    2008-01-01

    We genotyped a Somali population (n=85; age ≤ 30 years) for 617 cytokine and cytokine receptor SNPs using Illumina GoldenGate genotyping to determine associations with measles, mumps and rubella immunity. Overall, sixty-one significant associations (p≤0.01) were found between SNPs belonging to cytokine receptor genes regulating Th1 (IL12RB2, IL2RA and B) and Th2 (IL4R, IL10RB) immunity, and cytokine (IL1B, TNFA, IL6 and IFNB1) and cytokine receptor (IL1RA, IFNAR2, IL18R1, TNFRSF1A and B) genes regulating innate immunity, and variations in antibody levels to measles, mumps or rubella. SNPs within two major inflammatory cytokine genes, TNFA and IL6, demonstrated associations with measles-specific antibodies. Specifically, the minor allele variant of rs1799964 (TNFA -1211 C>T) was associated with primarily seronegative values (median EIA index values ≤0.87; p=0.002; q=0.23) in response to measles disease and/or vaccination. A heterozygous variant CT for rs2069849 (IL6 +4272C>T; Phe201Phe) was also associated with seronegative values and a lower median level of antibody response to measles disease and/or vaccination (p=0.004; q=0.36) or measles vaccination alone (p=0.008). Several SNPs within the coding and regulatory regions of cytokine and cytokine receptor genes demonstrated associations with mumps and rubella antibody levels, but were less informative as strong LD patterns and lower frequencies for minor alleles were observed among these SNPs. Our study identifies specific SNPs in innate immune response genes that may play a role in modulating antibody responses to measles vaccination and/or infection in Somali subjects. PMID:18715339

Top