synonymous single nucleotide: Topics by Science.gov

Sample records for synonymous single nucleotide

E6 and E7 Gene Polymorphisms in Human Papillomavirus Types-58 and 33 Identified in Southwest China

PubMed Central

Wen, Qiang; Wang, Tao; Mu, Xuemei; Chenzhang, Yuwei; Cao, Man

2017-01-01

Cancer of the cervix is associated with infection by certain types of human papillomavirus (HPV). The gene variants differ in immune responses and oncogenic potential. The E6 and E7 proteins encoded by high-risk HPV play a key role in cellular transformation. HPV-33 and HPV-58 types are highly prevalent among Chinese women. To study the gene intratypic variations, polymorphisms and positive selections of HPV-33 and HPV-58 E6/E7 in southwest China, HPV-33 (E6, E7: n = 216) and HPV-58 (E6, E7: n = 405) E6 and E7 genes were sequenced and compared to others submitted to GenBank. Phylogenetic trees were constructed by Maximum-likelihood and the Kimura 2-parameters methods by MEGA 6 (Molecular Evolutionary Genetics Analysis version 6.0). The diversity of secondary structure was analyzed by PSIPred software. The selection pressures acting on the E6/E7 genes were estimated by PAML 4.8 (Phylogenetic Analyses by Maximun Likelihood version4.8) software. The positive sites of HPV-33 and HPV-58 E6/E7 were contrasted by ClustalX 2.1. Among 216 HPV-33 E6 sequences, 8 single nucleotide mutations were observed with 6/8 non-synonymous and 2/8 synonymous mutations. The 216 HPV-33 E7 sequences showed 3 single nucleotide mutations that were non-synonymous. The 405 HPV-58 E6 sequences revealed 8 single nucleotide mutations with 4/8 non-synonymous and 4/8 synonymous mutations. Among 405 HPV-58 E7 sequences, 13 single nucleotide mutations were observed with 10/13 non-synonymous mutations and 3/13 synonymous mutations. The selective pressure analysis showed that all HPV-33 and 4/6 HPV-58 E6/E7 major non-synonymous mutations were sites of positive selection. All variations were observed in sites belonging to major histocompatibility complex and/or B-cell predicted epitopes. K93N and R145 (I/N) were observed in both HPV-33 and HPV-58 E6. PMID:28141822
Demonstration of Protein-Based Human Identification Using the Hair Shaft Proteome

PubMed Central

Leppert, Tami; Anex, Deon S.; Hilmer, Jonathan K.; Matsunami, Nori; Baird, Lisa; Stevens, Jeffery; Parsawar, Krishna; Durbin-Johnson, Blythe P.; Rocke, David M.; Nelson, Chad; Fairbanks, Daniel J.; Wilson, Andrew S.; Rice, Robert H.; Woodward, Scott R.; Bothner, Brian; Hart, Bradley R.; Leppert, Mark

2016-01-01

Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 single nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). This study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts. PMID:27603779
Demonstration of protein-based human identification using the hair shaft proteome [Protein-based human identification: A proof of concept using the hair shaft proteome

DOE PAGES

Parker, Glendon J.; Leppert, Tami; Anex, Deon S.; ...

2016-09-07

Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 singlemore » nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.« less
Demonstration of protein-based human identification using the hair shaft proteome [Protein-based human identification: A proof of concept using the hair shaft proteome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Parker, Glendon J.; Leppert, Tami; Anex, Deon S.

Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 singlemore » nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.« less
The effects of non-synonymous single nucleotide polymorphisms (nsSNPs) on protein-protein interactions.

PubMed

Yates, Christopher M; Sternberg, Michael J E

2013-11-01

Non-synonymous single nucleotide polymorphisms (nsSNPs) are single base changes leading to a change to the amino acid sequence of the encoded protein. Many of these variants are associated with disease, so nsSNPs have been well studied, with studies looking at the effects of nsSNPs on individual proteins, for example, on stability and enzyme active sites. In recent years, the impact of nsSNPs upon protein-protein interactions has also been investigated, giving a greater insight into the mechanisms by which nsSNPs can lead to disease. In this review, we summarize these studies, looking at the various mechanisms by which nsSNPs can affect protein-protein interactions. We focus on structural changes that can impair interaction, changes to disorder, gain of interaction, and post-translational modifications before looking at some examples of nsSNPs at human-pathogen protein-protein interfaces and the analysis of nsSNPs from a network perspective. © 2013.
Identification and Characterization of Novel Variations in Platelet G-Protein Coupled Receptor (GPCR) Genes in Patients Historically Diagnosed with Type 1 von Willebrand Disease.

PubMed

Stockley, Jacqueline; Nisar, Shaista P; Leo, Vincenzo C; Sabi, Essa; Cunningham, Margaret R; Eikenboom, Jeroen C; Lethagen, Stefan; Schneppenheim, Reinhard; Goodeve, Anne C; Watson, Steve P; Mundell, Stuart J; Daly, Martina E

2015-01-01

The clinical expression of type 1 von Willebrand disease may be modified by co-inheritance of other mild bleeding diatheses. We previously showed that mutations in the platelet P2Y12 ADP receptor gene (P2RY12) could contribute to the bleeding phenotype in patients with type 1 von Willebrand disease. Here we investigated whether variations in platelet G protein-coupled receptor genes other than P2RY12 also contributed to the bleeding phenotype. Platelet G protein-coupled receptor genes P2RY1, F2R, F2RL3, TBXA2R and PTGIR were sequenced in 146 index cases with type 1 von Willebrand disease and the potential effects of identified single nucleotide variations were assessed using in silico methods and heterologous expression analysis. Seven heterozygous single nucleotide variations were identified in 8 index cases. Two single nucleotide variations were detected in F2R; a novel c.-67G>C transversion which reduced F2R transcriptional activity and a rare c.1063C>T transition predicting a p.L355F substitution which did not interfere with PAR1 expression or signalling. Two synonymous single nucleotide variations were identified in F2RL3 (c.402C>G, p.A134 =; c.1029 G>C p.V343 =), both of which introduced less commonly used codons and were predicted to be deleterious, though neither of them affected PAR4 receptor expression. A third single nucleotide variation in F2RL3 (c.65 C>A; p.T22N) was co-inherited with a synonymous single nucleotide variation in TBXA2R (c.6680 C>T, p.S218 =). Expression and signalling of the p.T22N PAR4 variant was similar to wild-type, while the TBXA2R variation introduced a cryptic splice site that was predicted to cause premature termination of protein translation. The enrichment of single nucleotide variations in G protein-coupled receptor genes among type 1 von Willebrand disease patients supports the view of type 1 von Willebrand disease as a polygenic disorder.
regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.

PubMed

Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong

2017-09-01

While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.
Functional effects of single nucleotide polymorphisms in the coding region of human N-acetyltransferase 1

PubMed Central

Zhu, Yuanqi; Hein, David W.

2007-01-01

Genetic variants of human N-acetyltransferase 1 (NAT1) are associated with cancer and birth defects. N- and O-acetyltransferase catalytic activities, Michaelis-Menten kinetic constants (Km & Vmax), and steady state expression levels of NAT1-specific mRNA and protein were determined for the reference NAT1*4 and variant human NAT1 haplotypes possessing single nucleotide polymorphisms (SNPs) in the open reading frame. Although none of the SNPs caused a significant effect on steady state levels of NAT1-specific mRNA, C97T(R33stop), C190T(R64W), C559T (R187stop) and A752T(D251V) each reduced NAT1 protein level and/or N- and O-acetyltransferase catalytic activities to levels below detection. G560A(R187Q) substantially reduced NAT1 protein level and catalytic activities and increased substrate Km. The G445A(V149I), G459A(synonymous) and T640G(S214A) haplotype present in NAT1*11 significantly (p<0.05) increased NAT1 protein level and catalytic activity. Neither T21G(synonymous), T402C(synonymous), A613G(M205V), T777C(synonymous), G781A(E261K), or A787G(I263V) significantly affected Km, catalytic activity, mRNA or protein level. These results suggest heterogeneity among slow NAT1 acetylator phenotypes. PMID:17909564
Complete genome analysis of dengue virus type 3 isolated from the 2013 dengue outbreak in Yunnan, China.

PubMed

Wang, Xiaodan; Ma, Dehong; Huang, Xinwei; Li, Lihua; Li, Duo; Zhao, Yujiao; Qiu, Lijuan; Pan, Yue; Chen, Junying; Xi, Juemin; Shan, Xiyun; Sun, Qiangming

2017-06-15

In the past few decades, dengue has spread rapidly and is an emerging disease in China. An unexpected dengue outbreak occurred in Xishuangbanna, Yunnan, China, resulting in 1331 patients in 2013. In order to obtain the complete genome information and perform mutation and evolutionary analysis of causative agent related to this largest outbreak of dengue fever. The viruses were isolated by cell culture and evaluated by genome sequence analysis. Phylogenetic trees were then constructed by Neighbor-Joining methods (MEGA6.0), followed by analysis of nucleotide mutation and amino acid substitution. The analysis of the diversity of secondary structure for E and NS1 protein were also performed. Then selection pressures acting on the coding sequences were estimated by PAML software. The complete genome sequences of two isolated strains (YNSW1, YNSW2) were 10,710 and 10,702 nucleotides in length, respectively. Phylogenetic analysis revealed both strain were classified as genotype II of DENV-3. The results indicated that both isolated strains of Xishuangbanna in 2013 and Laos 2013 stains (KF816161.1, KF816158.1, LC147061.1, LC147059.1, KF816162.1) were most similar to Bangladesh (AY496873.2) in 2002. After comparing with the DENV-3SS (H87) 62 amino acid substitutions were identified in translated regions, and 38 amino acid substitutions were identified in translated regions compared with DENV-3 genotype II stains Bangladesh (AY496873.2). 27(YNSW1) or 28(YNSW2) single nucleotide changes were observed in structural protein sequences with 7(YNSW1) or 8(YNSW2) non-synonymous mutations compared with AY496873.2. Of them, 4 non-synonymous mutations were identified in E protein sequences with (2 in the β-sheet, 2 in the coil). Meanwhile, 117(YNSW1) or 115 (YNSW2) single nucleotide changes were observed in non-structural protein sequences with 31(YNSW1) or 30 (YNSW2) non-synonymous mutations. Particularly, 14 single nucleotide changes were observed in NS1 sequences with 4/14 non-synonymous substitutions (4 in the coil). Selection pressure analysis revealed no positive selection in the amino acid sites of the genes encoding for structural and non-structural proteins. This study may help understand the intrinsic geographical relatedness of dengue virus 3 and contributes further to research on their infectivity, pathogenicity and vaccine development. Copyright © 2017 Elsevier B.V. All rights reserved.
Statistical analysis of nucleotide sequences of the hemagglutinin gene of human influenza A viruses.

PubMed Central

Ina, Y; Gojobori, T

1994-01-01

To examine whether positive selection operates on the hemagglutinin 1 (HA1) gene of human influenza A viruses (H1 subtype), 21 nucleotide sequences of the HA1 gene were statistically analyzed. The nucleotide sequences were divided into antigenic and nonantigenic sites. The nucleotide diversities for antigenic and nonantigenic sites of the HA1 gene were computed at synonymous and nonsynonymous sites separately. For nonantigenic sites, the nucleotide diversities were larger at synonymous sites than at nonsynonymous sites. This is consistent with the neutral theory of molecular evolution. For antigenic sites, however, the nucleotide diversities at nonsynonymous sites were larger than those at synonymous sites. These results suggest that positive selection operates on antigenic sites of the HA1 gene of human influenza A viruses (H1 subtype). PMID:8078892
Synonymous Mutations at the Beginning of the Influenza A Virus Hemagglutinin Gene Impact Experimental Fitness.

PubMed

Canale, Aneth S; Venev, Sergey V; Whitfield, Troy W; Caffrey, Daniel R; Marasco, Wayne A; Schiffer, Celia A; Kowalik, Timothy F; Jensen, Jeffrey D; Finberg, Robert W; Zeldovich, Konstantin B; Wang, Jennifer P; Bolon, Daniel N A

2018-04-13

The fitness effects of synonymous mutations can provide insights into biological and evolutionary mechanisms. We analyzed the experimental fitness effects of all single-nucleotide mutations, including synonymous substitutions, at the beginning of the influenza A virus hemagglutinin (HA) gene. Many synonymous substitutions were deleterious both in bulk competition and for individually isolated clones. Investigating protein and RNA levels of a subset of individually expressed HA variants revealed that multiple biochemical properties contribute to the observed experimental fitness effects. Our results indicate that a structural element in the HA segment viral RNA may influence fitness. Examination of naturally evolved sequences in human hosts indicates a preference for the unfolded state of this structural element compared to that found in swine hosts. Our overall results reveal that synonymous mutations may have greater fitness consequences than indicated by simple models of sequence conservation, and we discuss the implications of this finding for commonly used evolutionary tests and analyses. Copyright © 2018. Published by Elsevier Ltd.
Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms.

PubMed

Zhang, Wei; Qi, Weihong; Albert, Thomas J; Motiwala, Alifiya S; Alland, David; Hyytia-Trees, Eija K; Ribot, Efrain M; Fields, Patricia I; Whittam, Thomas S; Swaminathan, Bala

2006-06-01

Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous protein-coding genes, limiting the use of nucleotide sequences to study the evolution and epidemiology of this bacterial pathogen. To systematically examine single nucleotide polymorphisms (SNPs) at a genome scale, we designed comparative genome sequencing microarrays and analyzed 1199 chromosomal genes (a total of 1,167,948 bp) and 92,721 bp of the large virulence plasmid (pO157) of eleven outbreak-associated STEC O157 strains. We discovered 906 SNPs in 523 chromosomal genes and observed a high level of DNA polymorphisms among the pO157 plasmids. Based on a uniform rate of synonymous substitution for Escherichia coli and Salmonella enterica (4.7x10(-9) per site per year), we estimate that the most recent common ancestor of the contemporary beta-glucuronidase-negative, non-sorbitolfermenting STEC O157 strains existed ca. 40 thousand years ago. The phylogeny of the STEC O157 strains based on the informative synonymous SNPs was compared to the maximum parsimony trees inferred from pulsed-field gel electrophoresis and multilocus variable numbers of tandem repeats analysis. The topological discrepancies indicate that, in contrast to the synonymous mutations, parts of STEC O157 genomes have evolved through different mechanisms with highly variable divergence rates. The SNP loci reported here will provide useful genetic markers for developing high-throughput methods for fine-resolution genotyping of STEC O157. Functional characterization of nucleotide polymorphisms should shed new insights on the evolution, epidemiology, and pathogenesis of STEC O157 and related pathogens.
Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms

PubMed Central

Zhang, Wei; Qi, Weihong; Albert, Thomas J.; Motiwala, Alifiya S.; Alland, David; Hyytia-Trees, Eija K.; Ribot, Efrain M.; Fields, Patricia I.; Whittam, Thomas S.; Swaminathan, Bala

2006-01-01

Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous protein-coding genes, limiting the use of nucleotide sequences to study the evolution and epidemiology of this bacterial pathogen. To systematically examine single nucleotide polymorphisms (SNPs) at a genome scale, we designed comparative genome sequencing microarrays and analyzed 1199 chromosomal genes (a total of 1,167,948 bp) and 92,721 bp of the large virulence plasmid (pO157) of eleven outbreak-associated STEC O157 strains. We discovered 906 SNPs in 523 chromosomal genes and observed a high level of DNA polymorphisms among the pO157 plasmids. Based on a uniform rate of synonymous substitution for Escherichia coli and Salmonella enterica (4.7 × 10−9 per site per year), we estimate that the most recent common ancestor of the contemporary β-glucuronidase-negative, non-sorbitolfermenting STEC O157 strains existed ca. 40 thousand years ago. The phylogeny of the STEC O157 strains based on the informative synonymous SNPs was compared to the maximum parsimony trees inferred from pulsed-field gel electrophoresis and multilocus variable numbers of tandem repeats analysis. The topological discrepancies indicate that, in contrast to the synonymous mutations, parts of STEC O157 genomes have evolved through different mechanisms with highly variable divergence rates. The SNP loci reported here will provide useful genetic markers for developing high-throughput methods for fine-resolution genotyping of STEC O157. Functional characterization of nucleotide polymorphisms should shed new insights on the evolution, epidemiology, and pathogenesis of STEC O157 and related pathogens. PMID:16606700
Identification of Four Novel Synonymous Substitutions in the X-Linked Genes Neuroligin 3 and Neuroligin 4X in Japanese Patients with Autistic Spectrum Disorder.

PubMed

Yanagi, Kumiko; Kaname, Tadashi; Wakui, Keiko; Hashimoto, Ohiko; Fukushima, Yoshimitsu; Naritomi, Kenji

2012-01-01

Mutations in the X-linked genes neuroligin 3 (NLGN3) and neuroligin 4X (NLGN4X) were first implicated in the pathogenesis of X-linked autism in Swedish families. However, reports of mutations in these genes in autism spectrum disorder (ASD) patients from various ethnic backgrounds present conflicting results regarding the etiology of ASD, possibly because of genetic heterogeneity and/or differences in their ethnic background. Additional mutation screening study on another ethnic background could help to clarify the relevance of the genes to ASD. We scanned the entire coding regions of NLGN3 and NLGN4X in 62 Japanese patients with ASD by polymerase chain reaction-high-resolution melting curve and direct sequencing analyses. Four synonymous substitutions, one in NLGN3 and three in NLGN4X, were identified in four of the 62 patients. These substitutions were not present in 278 control X-chromosomes from unrelated Japanese individuals and were not registered in the database of Single Nucleotide Polymorphisms build 132 or in the Japanese Single Nucleotide Polymorphisms database, indicating that they were novel and specific to ASD. Though further analysis is necessary to determine the physiological and clinical importance of such substitutions, the possibility of the relevance of both synonymous and nonsynonymous substitutions with the etiology of ASD should be considered.
Identification of Four Novel Synonymous Substitutions in the X-Linked Genes Neuroligin 3 and Neuroligin 4X in Japanese Patients with Autistic Spectrum Disorder

PubMed Central

Yanagi, Kumiko; Kaname, Tadashi; Wakui, Keiko; Hashimoto, Ohiko; Fukushima, Yoshimitsu; Naritomi, Kenji

2012-01-01

Mutations in the X-linked genes neuroligin 3 (NLGN3) and neuroligin 4X (NLGN4X) were first implicated in the pathogenesis of X-linked autism in Swedish families. However, reports of mutations in these genes in autism spectrum disorder (ASD) patients from various ethnic backgrounds present conflicting results regarding the etiology of ASD, possibly because of genetic heterogeneity and/or differences in their ethnic background. Additional mutation screening study on another ethnic background could help to clarify the relevance of the genes to ASD. We scanned the entire coding regions of NLGN3 and NLGN4X in 62 Japanese patients with ASD by polymerase chain reaction-high-resolution melting curve and direct sequencing analyses. Four synonymous substitutions, one in NLGN3 and three in NLGN4X, were identified in four of the 62 patients. These substitutions were not present in 278 control X-chromosomes from unrelated Japanese individuals and were not registered in the database of Single Nucleotide Polymorphisms build 132 or in the Japanese Single Nucleotide Polymorphisms database, indicating that they were novel and specific to ASD. Though further analysis is necessary to determine the physiological and clinical importance of such substitutions, the possibility of the relevance of both synonymous and nonsynonymous substitutions with the etiology of ASD should be considered. PMID:22934180
Sounds of silence: synonymous nucleotides as a key to biological regulation and complexity

PubMed Central

Shabalina, Svetlana A.; Spiridonov, Nikolay A.; Kashina, Anna

2013-01-01

Messenger RNA is a key component of an intricate regulatory network of its own. It accommodates numerous nucleotide signals that overlap protein coding sequences and are responsible for multiple levels of regulation and generation of biological complexity. A wealth of structural and regulatory information, which mRNA carries in addition to the encoded amino acid sequence, raises the question of how these signals and overlapping codes are delineated along non-synonymous and synonymous positions in protein coding regions, especially in eukaryotes. Silent or synonymous codon positions, which do not determine amino acid sequences of the encoded proteins, define mRNA secondary structure and stability and affect the rate of translation, folding and post-translational modifications of nascent polypeptides. The RNA level selection is acting on synonymous sites in both prokaryotes and eukaryotes and is more common than previously thought. Selection pressure on the coding gene regions follows three-nucleotide periodic pattern of nucleotide base-pairing in mRNA, which is imposed by the genetic code. Synonymous positions of the coding regions have a higher level of hybridization potential relative to non-synonymous positions, and are multifunctional in their regulatory and structural roles. Recent experimental evidence and analysis of mRNA structure and interspecies conservation suggest that there is an evolutionary tradeoff between selective pressure acting at the RNA and protein levels. Here we provide a comprehensive overview of the studies that define the role of silent positions in regulating RNA structure and processing that exert downstream effects on proteins and their functions. PMID:23293005
Genetic variability in E6, E7, and L1 genes of human papillomavirus genotype 52 from Southwest China.

PubMed

Zhang, Yiwen; Cao, Man; Wang, Mengting; Ding, Xianping; Jing, Yaling; Chen, Zuyi; Ma, Tengjiao; Chen, Honghan

2016-07-01

Human papillomavirus (HPV) is the major causative agent of cervical cancer, which accounts for the second highest cancer burden in women worldwide. HPV-52, the prevalent subtype in Asia, especially in southwest China, was analyzed in this study. To analyze polymorphisms, intratypic variants, and genetic variability in the E6-E7 (n=26) and L1 (n=53) genes of HPV-52, these genes were sequenced and the sequences were submitted to GenBank. Phylogenetic trees were constructed using the neighbor-joining and Kimura 2-parameters methods, followed by analysis of the diversity of secondary structure. Finally, we estimated the selection pressures acting on the E6-E7 and L1 genes. Fifty-one novel variants of HPV-52 L1, and two novel variants of HPV-52 E6-E7 were identified in this study. Thirty single nucleotide changes were observed in HPV-52 E6-E7 sequences with 19/30 non-synonymous mutations and 11/30 synonymous mutations (five in the alpha helix and five in the beta sheet). Fifty-five single nucleotide changes were observed in HPV-52 L1 sequences with 17/55 non-synonymous mutations (seven in the alpha helix and fourteen in the beta sheet) and 38/55 synonymous mutations. Selective pressure analysis predicted that most of these mutations reflect positive selection. Identifying new variants in HPV-52 may inform the rational design of new vaccines specifically for women in southwest China. Knowledge of genetic variation in HPV may be useful as an epidemiologic correlate of cervical cancer risk, or may even provide critical information for developing diagnostic probes. Copyright © 2016 Elsevier B.V. All rights reserved.
EST-derived SNP discovery and selective pressure analysis in Pacific white shrimp ( Litopenaeus vannamei)

NASA Astrophysics Data System (ADS)

Liu, Chengzhang; Wang, Xia; Xiang, Jianhai; Li, Fuhua

2012-09-01

Pacific white shrimp has become a major aquaculture and fishery species worldwide. Although a large scale EST resource has been publicly available since 2008, the data have not yet been widely used for SNP discovery or transcriptome-wide assessment of selective pressure. In this study, a set of 155 411 expressed sequence tags (ESTs) from the NCBI database were computationally analyzed and 17 225 single nucleotide polymorphisms (SNPs) were predicted, including 9 546 transitions, 5 124 transversions and 2 481 indels. Among the 7 298 SNP substitutions located in functionally annotated contigs, 58.4% (4 262) are non-synonymous SNPs capable of introducing amino acid mutations. Two hundred and fifty nonsynonymous SNPs in genes associated with economic traits have been identified as candidates for markers in selective breeding. Diversity estimates among the synonymous nucleotides were on average 3.49 times greater than those in non-synonymous, suggesting negative selection. Distribution of non-synonymous to synonymous substitutions (Ka/Ks) ratio ranges from 0 to 4.01, (average 0.42, median 0.26), suggesting that the majority of the affected genes are under purifying selection. Enrichment analysis identified multiple gene ontology categories under positive or negative selection. Categories involved in innate immune response and male gamete generation are rich in positively selected genes, which is similar to reports in Drosophila and primates. This work is the first transcriptome-wide assessment of selective pressure in a Penaeid shrimp species. The functionally annotated SNPs provide a valuable resource of potential molecular markers for selective breeding.
Energy efficiency trade-offs drive nucleotide usage in transcribed regions

PubMed Central

Chen, Wei-Hua; Lu, Guanting; Bork, Peer; Hu, Songnian; Lercher, Martin J.

2016-01-01

Efficient nutrient usage is a trait under universal selection. A substantial part of cellular resources is spent on making nucleotides. We thus expect preferential use of cheaper nucleotides especially in transcribed sequences, which are often amplified thousand-fold compared with genomic sequences. To test this hypothesis, we derive a mutation-selection-drift equilibrium model for nucleotide skews (strand-specific usage of ‘A' versus ‘T' and ‘G' versus ‘C'), which explains nucleotide skews across 1,550 prokaryotic genomes as a consequence of selection on efficient resource usage. Transcription-related selection generally favours the cheaper nucleotides ‘U' and ‘C' at synonymous sites. However, the information encoded in mRNA is further amplified through translation. Due to unexpected trade-offs in the codon table, cheaper nucleotides encode on average energetically more expensive amino acids. These trade-offs apply to both strand-specific nucleotide usage and GC content, causing a universal bias towards the more expensive nucleotides ‘A' and ‘G' at non-synonymous coding sites. PMID:27098217
Differential Single Nucleotide Polymorphism-Based Analysis of an Outbreak Caused by Salmonella enterica Serovar Manhattan Reveals Epidemiological Details Missed by Standard Pulsed-Field Gel Electrophoresis

PubMed Central

Scaltriti, Erika; Sassera, Davide; Comandatore, Francesco; Morganti, Marina; Mandalari, Carmen; Gaiarsa, Stefano; Bandi, Claudio; Zehender, Gianguglielmo; Bolzoni, Luca; Casadei, Gabriele

2015-01-01

We retrospectively analyzed a rare Salmonella enterica serovar Manhattan outbreak that occurred in Italy in 2009 to evaluate the potential of new genomic tools based on differential single nucleotide polymorphism (SNP) analysis in comparison with the gold standard genotyping method, pulsed-field gel electrophoresis. A total of 39 isolates were analyzed from patients (n = 15) and food, feed, animal, and environmental sources (n = 24), resulting in five different pulsed-field gel electrophoresis (PFGE) profiles. Isolates epidemiologically related to the outbreak clustered within the same pulsotype, SXB_BS.0003, without any further differentiation. Thirty-three isolates were considered for genomic analysis based on different sets of SNPs, core, synonymous, nonsynonymous, as well as SNPs in different codon positions, by Bayesian and maximum likelihood algorithms. Trees generated from core and nonsynonymous SNPs, as well as SNPs at the second and first plus second codon positions detailed four distinct groups of isolates within the outbreak pulsotype, discriminating outbreak-related isolates of human and food origins. Conversely, the trees derived from synonymous and third-codon-position SNPs clustered food and human isolates together, indicating that all outbreak-related isolates constituted a single clone, which was in line with the epidemiological evidence. Further experiments are in place to extend this approach within our regional enteropathogen surveillance system. PMID:25653407

Differential single nucleotide polymorphism-based analysis of an outbreak caused by Salmonella enterica serovar Manhattan reveals epidemiological details missed by standard pulsed-field gel electrophoresis.

PubMed

Scaltriti, Erika; Sassera, Davide; Comandatore, Francesco; Morganti, Marina; Mandalari, Carmen; Gaiarsa, Stefano; Bandi, Claudio; Zehender, Gianguglielmo; Bolzoni, Luca; Casadei, Gabriele; Pongolini, Stefano

2015-04-01

We retrospectively analyzed a rare Salmonella enterica serovar Manhattan outbreak that occurred in Italy in 2009 to evaluate the potential of new genomic tools based on differential single nucleotide polymorphism (SNP) analysis in comparison with the gold standard genotyping method, pulsed-field gel electrophoresis. A total of 39 isolates were analyzed from patients (n=15) and food, feed, animal, and environmental sources (n=24), resulting in five different pulsed-field gel electrophoresis (PFGE) profiles. Isolates epidemiologically related to the outbreak clustered within the same pulsotype, SXB_BS.0003, without any further differentiation. Thirty-three isolates were considered for genomic analysis based on different sets of SNPs, core, synonymous, nonsynonymous, as well as SNPs in different codon positions, by Bayesian and maximum likelihood algorithms. Trees generated from core and nonsynonymous SNPs, as well as SNPs at the second and first plus second codon positions detailed four distinct groups of isolates within the outbreak pulsotype, discriminating outbreak-related isolates of human and food origins. Conversely, the trees derived from synonymous and third-codon-position SNPs clustered food and human isolates together, indicating that all outbreak-related isolates constituted a single clone, which was in line with the epidemiological evidence. Further experiments are in place to extend this approach within our regional enteropathogen surveillance system. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Phylogeny and polymorphism in the long control regions E6, E7, and L1 of HPV Type 56 in women from southwest China

PubMed Central

Jing, Yaling; Wang, Tao; Chen, Zuyi; Ding, Xianping; Xu, Jianju; Mu, Xuemei; Cao, Man; Chen, Honghan

2018-01-01

Globally, human papillomavirus (HPV)-56 accounts for a small proportion of all high-risk HPV types; however, HPV-56 is detected at a higher rate in Asia, particularly in southwest China. The present study analyzed polymorphisms, intratypic variants, and genetic variability in the long control regions (LCR), E6, E7, and L1 of HPV-56 (n=75). The LCRs, E6, E7 and L1 were sequenced using a polymerase chain reaction and the sequences were submitted to GenBank. Maximum-likelihood trees were constructed using Kimura's two-parameter model, followed by secondary structure analysis and protein damaging prediction. Additionally, in order to assess the effect of variations in the LCR on putative binding sites for cellular proteins, MATCH server was used. Finally, the selection pressures of the E6-E7 and L1 genes were estimated. A total of 18 point substitutions, a 42-bp deletion and a 19-bp deletion of LCR were identified. Some of those mutations are embedded in the putative binding sites for transcription factors. 18 single nucleotide changes occurred in the E6-E7 sequence, 11/18 were non-synonymous substitutions and 7/18 were synonymous mutations. A total 24 single nucleotide changes were identified in the L1 sequence, 6/24 being non-synonymous mutations and 18/24 synonymous mutations. Selective pressure analysis predicted that the majority of mutations of HPV-56 E6, E7 and L1 were of positive selection. The phylogenetic tree demonstrated that the isolates distributed in two lineages. Data on the prevalence and genetic variation of HPV-56 types in southwest China may aid future studies on viral molecular mechanisms and contribute to future investigations of diagnostic probes and therapeutic vaccines. PMID:29568922
Genetic diversity of tyrosine hydroxylase (TH) and dopamine β-hydroxylase (DBH) genes in cattle breeds

PubMed Central

Lourenco-Jaramillo, Diana Lelidett; Sifuentes-Rincón, Ana María; Parra-Bracamonte, Gaspar Manuel; de la Rosa-Reyna, Xochitl Fabiola; Segura-Cabrera, Aldo; Arellano-Vera, Williams

2012-01-01

DNA from four cattle breeds was used to re-sequence all of the exons and 56% of the introns of the bovine tyrosine hydroxylase (TH) gene and 97% and 13% of the bovine dopamine β-hydroxylase (DBH) coding and non-coding sequences, respectively. Two novel single nucleotide polymorphisms (SNPs) and a microsatellite motif were found in the TH sequences. The DBH sequences contained 62 nucleotide changes, including eight non-synonymous SNPs (nsSNPs) that are of particular interest because they may alter protein function and therefore affect the phenotype. These DBH nsSNPs resulted in amino acid substitutions that were predicted to destabilize the protein structure. Six SNPs (one from TH and five from DBH non-synonymous SNPs) were genotyped in 140 animals; all of them were polymorphic and had a minor allele frequency of > 9%. There were significant differences in the intra- and inter-population haplotype distributions. The haplotype differences between Brahman cattle and the three B. t. taurus breeds (Charolais, Holstein and Lidia) were interesting from a behavioural point of view because of the differences in temperament between these breeds. PMID:22888292
LISTA, LISTA-HOP and LISTA-HON: a comprehensive compilation of protein encoding sequences and its associated homology databases from the yeast Saccharomyces.

PubMed Central

Dölz, R; Mossé, M O; Slonimski, P P; Bairoch, A; Linder, P

1994-01-01

We continued our effort to make a comprehensive database (LISTA) for the yeast Saccharomyces cerevisiae. In this database each sequence has been attributed a single genetic name. In the case of duplicated sequences a simple method has been applied to distinguish between sequences of one and the same gene from non-allelic sequences of duplicated genes. If necessary, synonyms are given in the case of allelic duplicated sequences. Thus sequences can be found either by the name or by synonyms given in LISTA. Each entry contains the genetic name, the mnemonic from the EMBL data bank, the codon bias, reference of the publication of the sequence, Chromosomal location as far as known, Swissprot and EMBL accession numbers. To obtain more information on the included sequences, each entry has been screened against non-redundant nucleotide and protein data bank collections resulting in LISTA-HON and LISTA-HOP. The LISTA data base can be linked to the associated data sets or to nucleotide and protein banks by the Sequence Retrieval System (SRS). PMID:7937046
Identification of novel mutations and sequence variants in the SOX2 and CHX10 genes in patients with anophthalmia/microphthalmia

PubMed Central

Zhou, Jie; Kherani, Femida; Bardakjian, Tanya M.; Katowitz, James; Hughes, Nkecha; Schimmenti, Lisa A.; Schneider, Adele

2008-01-01

Purpose Mutations in the SOX2 and CHX10 genes have been reported in patients with anophthalmia and/or microphthalmia. In this study, we evaluated 34 anophthalmic/microphthalmic patient DNA samples (two sets of siblings included) for mutations and sequence variants in SOX2 and CHX10. Methods Conformational sensitive gel electrophoresis (CSGE) was used for the initial SOX2 and CHX10 screening of 34 affected individuals (two sets of siblings), five unaffected family members, and 80 healthy controls. Patient samples containing heteroduplexes were selected for sequence analysis. Base pair changes in SOX2 and CHX10 were confirmed by sequencing bidirectionally in patient samples. Results Two novel heterozygous mutations and two sequence variants (one known) in SOX2 were identified in this cohort. Mutation c.310 G>T (p. Glu104X), found in one patient, was in the region encoding the high mobility group (HMG) DNA-binding domain and resulted in a change from glutamic acid to a stop codon. The second mutation, noted in two affected siblings, was a single nucleotide deletion c.549delC (p. Pro184ArgfsX19) in the region encoding the activation domain, resulting in a frameshift and premature termination of the coding sequence. The shortened protein products may result in the loss of function. In addition, a novel nucleotide substitution c.*557G>A was identified in the 3′-untranslated region in one patient. The relationship between the nucleotide change and the protein function is indeterminate. A known single nucleotide polymorphism (c. *469 C>A, SNP rs11915160) was also detected in 2 of the 34 patients. Screening of CHX10 identified two synonymous sequence variants, c.471 C>T (p.Ser157Ser, rs35435463) and c.579 G>A (p. Gln193Gln, novel SNP), and one non-synonymous sequence variant, c.871 G>A (p. Asp291Asn, novel SNP). The non-synonymous polymorphism was also present in healthy controls, suggesting non-causality. Conclusions These results support the role of SOX2 in ocular development. Loss of SOX2 function results in severe eye malformation. CHX10 was not implicated with microphthalmia/anophthalmia in our patient cohort. PMID:18385794
Single nucleotide variations: Biological impact and theoretical interpretation

PubMed Central

Katsonis, Panagiotis; Koire, Amanda; Wilson, Stephen Joseph; Hsu, Teng-Kuei; Lua, Rhonald C; Wilkins, Angela Dawn; Lichtarge, Olivier

2014-01-01

Genome-wide association studies (GWAS) and whole-exome sequencing (WES) generate massive amounts of genomic variant information, and a major challenge is to identify which variations drive disease or contribute to phenotypic traits. Because the majority of known disease-causing mutations are exonic non-synonymous single nucleotide variations (nsSNVs), most studies focus on whether these nsSNVs affect protein function. Computational studies show that the impact of nsSNVs on protein function reflects sequence homology and structural information and predict the impact through statistical methods, machine learning techniques, or models of protein evolution. Here, we review impact prediction methods and discuss their underlying principles, their advantages and limitations, and how they compare to and complement one another. Finally, we present current applications and future directions for these methods in biological research and medical genetics. PMID:25234433
SNPGenie: estimating evolutionary parameters to detect natural selection using pooled next-generation sequencing data.

PubMed

Nelson, Chase W; Moncla, Louise H; Hughes, Austin L

2015-11-15

New applications of next-generation sequencing technologies use pools of DNA from multiple individuals to estimate population genetic parameters. However, no publicly available tools exist to analyse single-nucleotide polymorphism (SNP) calling results directly for evolutionary parameters important in detecting natural selection, including nucleotide diversity and gene diversity. We have developed SNPGenie to fill this gap. The user submits a FASTA reference sequence(s), a Gene Transfer Format (.GTF) file with CDS information and a SNP report(s) in an increasing selection of formats. The program estimates nucleotide diversity, distance from the reference and gene diversity. Sites are flagged for multiple overlapping reading frames, and are categorized by polymorphism type: nonsynonymous, synonymous, or ambiguous. The results allow single nucleotide, single codon, sliding window, whole gene and whole genome/population analyses that aid in the detection of positive and purifying natural selection in the source population. SNPGenie version 1.2 is a Perl program with no additional dependencies. It is free, open-source, and available for download at https://github.com/hugheslab/snpgenie. nelsoncw@email.sc.edu or austin@biol.sc.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer.

PubMed

Wong, Wing Chung; Kim, Dewey; Carter, Hannah; Diekhans, Mark; Ryan, Michael C; Karchin, Rachel

2011-08-01

Thousands of cancer exomes are currently being sequenced, yielding millions of non-synonymous single nucleotide variants (SNVs) of possible relevance to disease etiology. Here, we provide a software toolkit to prioritize SNVs based on their predicted contribution to tumorigenesis. It includes a database of precomputed, predictive features covering all positions in the annotated human exome and can be used either stand-alone or as part of a larger variant discovery pipeline. MySQL database, source code and binaries freely available for academic/government use at http://wiki.chasmsoftware.org, Source in Python and C++. Requires 32 or 64-bit Linux system (tested on Fedora Core 8,10,11 and Ubuntu 10), 2.5*≤ Python <3.0*, MySQL server >5.0, 60 GB available hard disk space (50 MB for software and data files, 40 GB for MySQL database dump when uncompressed), 2 GB of RAM.
Artemisinin Resistance-Associated Polymorphisms at the K13-Propeller Locus Are Absent in Plasmodium falciparum Isolates from Haiti

PubMed Central

Carter, Tamar E.; Boulter, Alexis; Existe, Alexandre; Romain, Jean R.; St. Victor, Jean Yves; Mulligan, Connie J.; Okech, Bernard A.

2015-01-01

Antimalarial drugs are a key tool in malaria elimination programs. With the emergence of artemisinin resistance in southeast Asia, an effort to identify molecular markers for surveillance of resistant malaria parasites is underway. Non-synonymous mutations in the kelch propeller domain (K13-propeller) in Plasmodium falciparum have been associated with artemisinin resistance in samples from southeast Asia, but additional studies are needed to characterize this locus in other P. falciparum populations with different levels of artemisinin use. Here, we sequenced the K13-propeller locus in 82 samples from Haiti, where limited government oversight of non-governmental organizations may have resulted in low-level use of artemisinin-based combination therapies. We detected a single-nucleotide polymorphism (SNP) at nucleotide 1,359 in a single isolate. Our results contribute to our understanding of the global genomic diversity of the K13-propeller locus in P. falciparum populations. PMID:25646258
Inference of purifying and positive selection in three subspecies of chimpanzees (Pan troglodytes) from exome sequencing.

PubMed

Bataillon, Thomas; Duan, Jinjie; Hvilsom, Christina; Jin, Xin; Li, Yingrui; Skov, Laurits; Glemin, Sylvain; Munch, Kasper; Jiang, Tao; Qian, Yu; Hobolth, Asger; Wang, Jun; Mailund, Thomas; Siegismund, Hans R; Schierup, Mikkel H

2015-03-30

We study genome-wide nucleotide diversity in three subspecies of extant chimpanzees using exome capture. After strict filtering, Single Nucleotide Polymorphisms and indels were called and genotyped for greater than 50% of exons at a mean coverage of 35× per individual. Central chimpanzees (Pan troglodytes troglodytes) are the most polymorphic (nucleotide diversity, θw = 0.0023 per site) followed by Eastern (P. t. schweinfurthii) chimpanzees (θw = 0.0016) and Western (P. t. verus) chimpanzees (θw = 0.0008). A demographic scenario of divergence without gene flow fits the patterns of autosomal synonymous nucleotide diversity well except for a signal of recent gene flow from Western into Eastern chimpanzees. The striking contrast in X-linked versus autosomal polymorphism and divergence previously reported in Central chimpanzees is also found in Eastern and Western chimpanzees. We show that the direction of selection statistic exhibits a strong nonmonotonic relationship with the strength of purifying selection S, making it inappropriate for estimating S. We instead use counts in synonymous versus nonsynonymous frequency classes to infer the distribution of S coefficients acting on nonsynonymous mutations in each subspecies. The strength of purifying selection we infer is congruent with the differences in effective sizes of each subspecies: Central chimpanzees are undergoing the strongest purifying selection followed by Eastern and Western chimpanzees. Coding indels show stronger selection against indels changing the reading frame than observed in human populations. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Evaluation and identification of damaged single nucleotide polymorphisms in COL1A1 gene involved in osteoporosis

PubMed Central

Alsaif, Mohammed A.; Al Shammari, Sulaiman A.; Alhamdan, Adel A.

2012-01-01

Introduction Single-nucleotide polymorphisms (SNPs) are biomarkers for exploring the genetic basis of many complex human diseases. The prediction of SNPs is promising in modern genetic analysis but it is still a great challenge to identify the functional SNPs in a disease-related gene. The computational approach has overcome this challenge and an increase in the successful rate of genetic association studies and reduced cost of genotyping have been achieved. The objective of this study is to identify deleterious non-synonymous SNPs (nsSNPs) associated with the COL1A1 gene. Material and methods The SNPs were retrieved from the Single Nucleotide Polymorphism Database (dbSNP). Using I-Mutant, protein stability change was calculated. The potentially functional nsSNPs and their effect on proteins were predicted by PolyPhen and SIFT respectively. FASTSNP was used for estimation of risk score. Results Our analysis revealed 247 SNPs as non-synonymous, out of which 5 nsSNPs were found to be least stable by I-Mutant 2.0 with a DDG value of > –1.0. Four nsSNPs, namely rs17853657, rs17857117, rs57377812 and rs1059454, showed a highly deleterious tolerance index score of 0.00 with a change in their physicochemical properties by the SIFT server. Seven nsSNPs, namely rs1059454, rs8179178, rs17853657, rs17857117, rs72656340, rs72656344 and rs72656351, were found to be probably damaging with a PSIC score difference between 2.0 and 3.5 by the PolyPhen server. Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, were found to be highly polymorphic with a risk score of 3-4 with a possible effect of non-conservative change and splicing regulation by FASTSNP. Conclusions Three nsSNPs, namely rs1059454, rs17853657 and rs17857117, are potential functional polymorphisms that are likely to have a functional impact on the COL1A1 gene. PMID:24273577
Brief Note Low diversity of the major histocompatibility complex class II DRA gene in domestic goats (Capra hircus) in Southern China.

PubMed

Chen, L P; E, G X; Zhao, Y J; Na, R S; Zhao, Z Q; Zhang, J H; Ma, Y H; Sun, Y W; Zhong, T; Zhang, H P; Huang, Y F

2015-06-18

DRA encodes the alpha chain of the DR heterodimer, is closely linked to DRB and is considered almost monomorphic in major histocompatibility complex region. In this study, we identified the exon 2 of DRA to evaluate the immunogenetic diversity of Chinese south indigenous goat. Two single nucleotide polymorphisms in an untranslated region and one synonymous substitution in coding region were identified. These data suggest that high immunodiversity in native Chinese population.
Analysis of transitions at two-fold redundant sites in mammalian genomes. Transition redundant approach-to-equilibrium (TREx) distance metrics

PubMed Central

Li, Tang; Chamberlin, Stephen G; Caraco, M Daniel; Liberles, David A; Gaucher, Eric A; Benner, Steven A

2006-01-01

Background The exchange of nucleotides at synonymous sites in a gene encoding a protein is believed to have little impact on the fitness of a host organism. This should be especially true for synonymous transitions, where a pyrimidine nucleotide is replaced by another pyrimidine, or a purine is replaced by another purine. This suggests that transition redundant exchange (TREx) processes at the third position of conserved two-fold codon systems might offer the best approximation for a neutral molecular clock, serving to examine, within coding regions, theories that require neutrality, determine whether transition rate constants differ within genes in a single lineage, and correlate dates of events recorded in genomes with dates in the geological and paleontological records. To date, TREx analysis of the yeast genome has recognized correlated duplications that established a new metabolic strategies in fungi, and supported analyses of functional change in aromatases in pigs. TREx dating has limitations, however. Multiple transitions at synonymous sites may cause equilibration and loss of information. Further, to be useful to correlate events in the genomic record, different genes within a genome must suffer transitions at similar rates. Results A formalism to analyze divergence at two fold redundant codon systems is presented. This formalism exploits two-state approach-to-equilibrium kinetics from chemistry. This formalism captures, in a single equation, the possibility of multiple substitutions at individual sites, avoiding any need to "correct" for these. The formalism also connects specific rate constants for transitions to specific approximations in an underlying evolutionary model, including assumptions that transition rate constants are invariant at different sites, in different genes, in different lineages, and at different times. Therefore, the formalism supports analyses that evaluate these approximations. Transitions at synonymous sites within two-fold redundant coding systems were examined in the mouse, rat, and human genomes. The key metric (f2), the fraction of those sites that holds the same nucleotide, was measured for putative ortholog pairs. A transition redundant exchange (TREx) distance was calculated from f2 for these pairs. Pyrimidine-pyrimidine transitions at these sites occur approximately 14% faster than purine-purine transitions in various lineages. Transition rate constants were similar in different genes within the same lineages; within a set of orthologs, the f2 distribution is only modest overdispersed. No correlation between disparity and overdispersion is observed. In rodents, evidence was found for greater conservation of TREx sites in genes on the X chromosome, accounting for a small part of the overdispersion, however. Conclusion The TREx metric is useful to analyze the history of transition rate constants within these mammals over the past 100 million years. The TREx metric estimates the extent to which silent nucleotide substitutions accumulate in different genes, on different chromosomes, with different compositions, in different lineages, and at different times. PMID:16545144
Profiling deleterious non-synonymous SNPs of smoker's gene CYP1A1.

PubMed

Ramesh, A Sai; Khan, Imran; Farhan, Md; Thiagarajan, Padma

2013-01-01

CYP1A1 gene belongs to the cytochrome P450 family and is known better as smokers' gene due to its hyperactivation as a consequence of long term smoking. The expression of CYP1A1 induces polycyclic aromatic hydrocarbon production in the lungs, which when over expressed, is known to cause smoking related diseases, such as cardiovascular pathologies, cancer, and diabetes. Single nucleotide polymorphisms (SNPs) are the simplest form of genetic variations that occur at a higher frequency, and are denoted as synonymous and non-synonymous SNPs on the basis of their effects on the amino acids. This study adopts a systematic in silico approach to predict the deleterious SNPs that are associated with disease conditions. It is inferred that four SNPs are highly deleterious, among which the SNP with rs17861094 is commonly predicted to be harmful by all tools. Hydrophobic (isoleucine) to hydrophilic (serine) amino acid variation was observed in the candidate gene. Hence, this investigation aims to characterize a candidate gene from 159 SNPs of CYP1A1.
Investigating DNA-, RNA-, and protein-based features as a means to discriminate pathogenic synonymous variants.

PubMed

Livingstone, Mark; Folkman, Lukas; Yang, Yuedong; Zhang, Ping; Mort, Matthew; Cooper, David N; Liu, Yunlong; Stantic, Bela; Zhou, Yaoqi

2017-10-01

Synonymous single-nucleotide variants (SNVs), although they do not alter the encoded protein sequences, have been implicated in many genetic diseases. Experimental studies indicate that synonymous SNVs can lead to changes in the secondary and tertiary structures of DNA and RNA, thereby affecting translational efficiency, cotranslational protein folding as well as the binding of DNA-/RNA-binding proteins. However, the importance of these various features in disease phenotypes is not clearly understood. Here, we have built a support vector machine (SVM) model (termed DDIG-SN) as a means to discriminate disease-causing synonymous variants. The model was trained and evaluated on nearly 900 disease-causing variants. The method achieves robust performance with the area under the receiver operating characteristic curve of 0.84 and 0.85 for protein-stratified 10-fold cross-validation and independent testing, respectively. We were able to show that the disease-causing effects in the immediate proximity to exon-intron junctions (1-3 bp) are driven by the loss of splicing motif strength, whereas the gain of splicing motif strength is the primary cause in regions further away from the splice site (4-69 bp). The method is available as a part of the DDIG server at http://sparks-lab.org/ddig. © 2017 Wiley Periodicals, Inc.
Whole-Genome Sequencing of Theileria parva Strains Provides Insight into Parasite Migration and Diversification in the African Continent

PubMed Central

Hayashida, Kyoko; Abe, Takashi; Weir, William; Nakao, Ryo; Ito, Kimihito; Kajino, Kiichi; Suzuki, Yutaka; Jongejan, Frans; Geysen, Dirk; Sugimoto, Chihiro

2013-01-01

The disease caused by the apicomplexan protozoan parasite Theileria parva, known as East Coast fever or Corridor disease, is one of the most serious cattle diseases in Eastern, Central, and Southern Africa. We performed whole-genome sequencing of nine T. parva strains, including one of the vaccine strains (Kiambu 5), field isolates from Zambia, Uganda, Tanzania, or Rwanda, and two buffalo-derived strains. Comparison with the reference Muguga genome sequence revealed 34 814–121 545 single nucleotide polymorphisms (SNPs) that were more abundant in buffalo-derived strains. High-resolution phylogenetic trees were constructed with selected informative SNPs that allowed the investigation of possible complex recombination events among ancestors of the extant strains. We further analysed the dN/dS ratio (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) for 4011 coding genes to estimate potential selective pressure. Genes under possible positive selection were identified that may, in turn, assist in the identification of immunogenic proteins or vaccine candidates. This study elucidated the phylogeny of T. parva strains based on genome-wide SNPs analysis with prediction of possible past recombination events, providing insight into the migration, diversification, and evolution of this parasite species in the African continent. PMID:23404454
Whole-genome sequencing of Theileria parva strains provides insight into parasite migration and diversification in the African continent.

PubMed

Hayashida, Kyoko; Abe, Takashi; Weir, William; Nakao, Ryo; Ito, Kimihito; Kajino, Kiichi; Suzuki, Yutaka; Jongejan, Frans; Geysen, Dirk; Sugimoto, Chihiro

2013-06-01

The disease caused by the apicomplexan protozoan parasite Theileria parva, known as East Coast fever or Corridor disease, is one of the most serious cattle diseases in Eastern, Central, and Southern Africa. We performed whole-genome sequencing of nine T. parva strains, including one of the vaccine strains (Kiambu 5), field isolates from Zambia, Uganda, Tanzania, or Rwanda, and two buffalo-derived strains. Comparison with the reference Muguga genome sequence revealed 34 814-121 545 single nucleotide polymorphisms (SNPs) that were more abundant in buffalo-derived strains. High-resolution phylogenetic trees were constructed with selected informative SNPs that allowed the investigation of possible complex recombination events among ancestors of the extant strains. We further analysed the dN/dS ratio (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) for 4011 coding genes to estimate potential selective pressure. Genes under possible positive selection were identified that may, in turn, assist in the identification of immunogenic proteins or vaccine candidates. This study elucidated the phylogeny of T. parva strains based on genome-wide SNPs analysis with prediction of possible past recombination events, providing insight into the migration, diversification, and evolution of this parasite species in the African continent.
Age-related macular degeneration-associated silent polymorphisms in HtrA1 impair its ability to antagonize insulin-like growth factor 1.

PubMed

Jacobo, Sarah Melissa P; Deangelis, Margaret M; Kim, Ivana K; Kazlauskas, Andrius

2013-05-01

Synonymous single nucleotide polymorphisms (SNPs) within a transcript's coding region produce no change in the amino acid sequence of the protein product and are therefore intuitively assumed to have a neutral effect on protein function. We report that two common variants of high-temperature requirement A1 (HTRA1) that increase the inherited risk of neovascular age-related macular degeneration (NvAMD) harbor synonymous SNPs within exon 1 of HTRA1 that convert common codons for Ala34 and Gly36 to less frequently used codons. The frequent-to-rare codon conversion reduced the mRNA translation rate and appeared to compromise HtrA1's conformation and function. The protein product generated from the SNP-containing cDNA displayed enhanced susceptibility to proteolysis and a reduced affinity for an anti-HtrA1 antibody. The NvAMD-associated synonymous polymorphisms lie within HtrA1's putative insulin-like growth factor 1 (IGF-1) binding domain. They reduced HtrA1's abilities to associate with IGF-1 and to ameliorate IGF-1-stimulated signaling events and cellular responses. These observations highlight the relevance of synonymous codon usage to protein function and implicate homeostatic protein quality control mechanisms that may go awry in NvAMD.
A survey of genome-wide single nucleotide polymorphisms through genome resequencing in the Périgord black truffle (Tuber melanosporum Vittad.).

PubMed

Payen, Thibaut; Murat, Claude; Gigant, Anaïs; Morin, Emmanuelle; De Mita, Stéphane; Martin, Francis

2015-09-01

The Périgord black truffle (Tuber melanosporum Vittad.), considered a gastronomic delicacy worldwide, is an ectomycorrhizal filamentous fungus that is ecologically important in Mediterranean French, Italian and Spanish woodlands. In this study, we developed a novel resource of single nucleotide polymorphisms (SNPs) for T. melanosporum using Illumina high-throughput resequencing. The genome from six T. melanosporum geographical accessions was sequenced to a depth of approximately 20×. These geographical accessions were selected from different populations within the northern and southern regions of the geographical species distribution. Approximately 80% of the reads for each of the six resequenced geographical accessions mapped against the reference T. melanosporum genome assembly, estimating the core genome size of this organism to be approximately 110 Mbp. A total of 442 326 SNPs corresponding to 3540 SNPs/Mbps were identified as being included in all seven genomes. The SNPs occurred more frequently in repeated sequences (85%), although 4501 SNPs were also identified in the coding regions of 2587 genes. Using the ratio of nonsynonymous mutations per nonsynonymous site (pN) to synonymous mutations per synonymous site (pS) and Tajima's D index scanning the whole genome, we were able to identify genomic regions and genes potentially subjected to positive or purifying selection. The SNPs identified represent a valuable resource for future population genetics and genomics studies. © 2015 John Wiley & Sons Ltd.
SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage

PubMed Central

Pouyet, Fanny; Bailly-Bechet, Marc; Mouchiroud, Dominique; Guéguen, Laurent

2016-01-01

Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences. PMID:27401173

Neutral changes during divergent evolution of hemoglobins

NASA Technical Reports Server (NTRS)

Jukes, T. H.

1978-01-01

A comparison of the mRNAs for rabbit and human beta-hemoglobins shows that synonymous changes in codons have accumulated three times as rapidly as nucleotide replacements that produced changes in amino acids. This agrees with predictions based on the so-called neutral theory. In addition, seven codon changes that appear to be single-base changes (according to maximum parsimony) are actually two-base changes. This indicates that the construction of primordial sequences is of limited significance when based on inferences that assume minimum base changes for amino acid replacements.
CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer

PubMed Central

Carter, Hannah; Diekhans, Mark; Ryan, Michael C.; Karchin, Rachel

2011-01-01

Summary: Thousands of cancer exomes are currently being sequenced, yielding millions of non-synonymous single nucleotide variants (SNVs) of possible relevance to disease etiology. Here, we provide a software toolkit to prioritize SNVs based on their predicted contribution to tumorigenesis. It includes a database of precomputed, predictive features covering all positions in the annotated human exome and can be used either stand-alone or as part of a larger variant discovery pipeline. Availability and Implementation: MySQL database, source code and binaries freely available for academic/government use at http://wiki.chasmsoftware.org, Source in Python and C++. Requires 32 or 64-bit Linux system (tested on Fedora Core 8,10,11 and Ubuntu 10), 2.5*≤ Python <3.0*, MySQL server >5.0, 60 GB available hard disk space (50 MB for software and data files, 40 GB for MySQL database dump when uncompressed), 2 GB of RAM. Contact: karchin@jhu.edu Supplementary Information: Supplementary data are available at Bioinformatics online. PMID:21685053
Artemisinin resistance-associated polymorphisms at the K13-propeller locus are absent in Plasmodium falciparum isolates from Haiti.

PubMed

Carter, Tamar E; Boulter, Alexis; Existe, Alexandre; Romain, Jean R; St Victor, Jean Yves; Mulligan, Connie J; Okech, Bernard A

2015-03-01

Antimalarial drugs are a key tool in malaria elimination programs. With the emergence of artemisinin resistance in southeast Asia, an effort to identify molecular markers for surveillance of resistant malaria parasites is underway. Non-synonymous mutations in the kelch propeller domain (K13-propeller) in Plasmodium falciparum have been associated with artemisinin resistance in samples from southeast Asia, but additional studies are needed to characterize this locus in other P. falciparum populations with different levels of artemisinin use. Here, we sequenced the K13-propeller locus in 82 samples from Haiti, where limited government oversight of non-governmental organizations may have resulted in low-level use of artemisinin-based combination therapies. We detected a single-nucleotide polymorphism (SNP) at nucleotide 1,359 in a single isolate. Our results contribute to our understanding of the global genomic diversity of the K13-propeller locus in P. falciparum populations. © The American Society of Tropical Medicine and Hygiene.
Association of two synonymous splicing-associated CpG single nucleotide polymorphisms in calpain 10 and solute carrier family 2 member 2 with type 2 diabetes

PubMed Central

Karambataki, Maria; Malousi, Andigoni; Tzimagiorgis, Georgios; Haitoglou, Constantinos; Fragou, Aikaterini; Georgiou, Elisavet; Papadopoulou, Foteini; Krassas, Gerasimos E.; Kouidou, Sofia

2017-01-01

Coding synonymous single nucleotide polymorphisms (SNPs) have attracted little attention until recently. However, such SNPs located in epigenetic, CpG sites modifying exonic splicing enhancers (ESEs) can be informative with regards to the recently verified association of intragenic methylation and splicing. The present study describes the association of type 2 diabetes (T2D) with the exonic, synonymous, epigenetic SNPs, rs3749166 in calpain 10 (CAPN10) glucose transporter (GLUT4) translocator and rs5404 in solute carrier family 2, member 2 (SLC2A2), also termed GLUT2, which, according to prior bioinformatic analysis, strongly modify the splicing potential of glucose transport-associated genes. Previous association studies reveal that only rs5404 exhibits a strong negative T2D association, while data on the CAPN10 polymorphism are contradictory. In the present study DNA from blood samples of 99 Greek non-diabetic control subjects and 71 T2D patients was analyzed. In addition, relevant publicly available cases (40) resulting from examination of 110 Personal Genome Project data files were analyzed. The frequency of the rs3749166 A allele, was similar in the patients and non-diabetic control subjects. However, AG heterozygotes were more frequent among patients (73.24% for Greek patients and 54.55% for corresponding non-diabetic control subjects; P=0.0262; total cases, 52.99 and 75.00%, respectively; P=0.0039). The rs5404 T allele was only observed in CT heterozygotes (Greek non-diabetic control subjects, 39.39% and Greek patients, 22.54%; P=0.0205; total cases, 34.69 and 21.28%, respectively; P=0.0258). Notably, only one genotype, heterozygous AG/CC, was T2D-associated (Greek non-diabetic control subjects, 29.29% and Greek patients, 56.33%; P=0.004; total cases, 32.84 and 56.58%, respectively; P=0.0008). Furthermore, AG/CC was strongly associated with very high (≥8.5%) glycosylated plasma hemoglobin levels among patients (P=0.0002 for all cases). These results reveal the complex heterozygotic SNP association with T2D, and indicate possible synergies of these epigenetic, splicing-regulatory, synonymous SNPs, which modify the splicing potential of two alternative glucose transport-associated genes. PMID:28357066
Regions of extreme synonymous codon selection in mammalian genes

PubMed Central

Schattner, Peter; Diekhans, Mark

2006-01-01

Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911
Limits of variation, specific infectivity, and genome packaging of massively recoded poliovirus genomes.

PubMed

Song, Yutong; Gorbatsevych, Oleksandr; Liu, Ying; Mugavero, JoAnn; Shen, Sam H; Ward, Charles B; Asare, Emmanuel; Jiang, Ping; Paul, Aniko V; Mueller, Steffen; Wimmer, Eckard

2017-10-10

Computer design and chemical synthesis generated viable variants of poliovirus type 1 (PV1), whose ORF (6,189 nucleotides) carried up to 1,297 "Max" mutations (excess of overrepresented synonymous codon pairs) or up to 2,104 "SD" mutations (randomly scrambled synonymous codons). "Min" variants (excess of underrepresented synonymous codon pairs) are nonviable except for P2 Min , a variant temperature-sensitive at 33 and 39.5 °C. Compared with WT PV1, P2 Min displayed a vastly reduced specific infectivity (si) (WT, 1 PFU/118 particles vs. P2 Min , 1 PFU/35,000 particles), a phenotype that will be discussed broadly. Si of haploid PV presents cellular infectivity of a single genotype. We performed a comprehensive analysis of sequence and structures of the PV genome to determine if evolutionary conserved cis-acting packaging signal(s) were preserved after recoding. We showed that conserved synonymous sites and/or local secondary structures that might play a role in determining packaging specificity do not survive codon pair recoding. This makes it unlikely that numerous "cryptic, sequence-degenerate, dispersed RNA packaging signals mapping along the entire viral genome" [Patel N, et al. (2017) Nat Microbiol 2:17098] play the critical role in poliovirus packaging specificity. Considering all available evidence, we propose a two-step assembly strategy for +ssRNA viruses: step I, acquisition of packaging specificity, either ( a ) by specific recognition between capsid protein(s) and replication proteins (poliovirus), or ( b ) by the high affinity interaction of a single RNA packaging signal (PS) with capsid protein(s) (most +ssRNA viruses so far studied); step II, cocondensation of genome/capsid precursors in which an array of hairpin structures plays a role in virion formation.
Single nucleotide polymorphism analysis of Korean native chickens using next generation sequencing data.

PubMed

Seo, Dong-Won; Oh, Jae-Don; Jin, Shil; Song, Ki-Duk; Park, Hee-Bok; Heo, Kang-Nyeong; Shin, Younhee; Jung, Myunghee; Park, Junhyung; Jo, Cheorun; Lee, Hak-Kyo; Lee, Jun-Heon

2015-02-01

There are five native chicken lines in Korea, which are mainly classified by plumage colors (black, white, red, yellow, gray). These five lines are very important genetic resources in the Korean poultry industry. Based on a next generation sequencing technology, whole genome sequence and reference assemblies were performed using Gallus_gallus_4.0 (NCBI) with whole genome sequences from these lines to identify common and novel single nucleotide polymorphisms (SNPs). We obtained 36,660,731,136 ± 1,257,159,120 bp of raw sequence and average 26.6-fold of 25-29 billion reference assembly sequences representing 97.288 % coverage. Also, 4,006,068 ± 97,534 SNPs were observed from 29 autosomes and the Z chromosome and, of these, 752,309 SNPs are the common SNPs across lines. Among the identified SNPs, the number of novel- and known-location assigned SNPs was 1,047,951 ± 14,956 and 2,948,648 ± 81,414, respectively. The number of unassigned known SNPs was 1,181 ± 150 and unassigned novel SNPs was 8,238 ± 1,019. Synonymous SNPs, non-synonymous SNPs, and SNPs having character changes were 26,266 ± 1,456, 11,467 ± 604, 8,180 ± 458, respectively. Overall, 443,048 ± 26,389 SNPs in each bird were identified by comparing with dbSNP in NCBI. The presently obtained genome sequence and SNP information in Korean native chickens have wide applications for further genome studies such as genetic diversity studies to detect causative mutations for economic and disease related traits.
LISTA, a comprehensive compilation of nucleotide sequences encoding proteins from the yeast Saccharomyces.

PubMed Central

Linder, P; Dölz, R; Mossé, M O; Lazowska, J; Slonimski, P P

1993-01-01

The amount of nucleotide sequence data is increasing exponentially. We therefore made an effort to make a comprehensive database (LISTA) for the yeast Saccharomyces cerevisiae. Each sequence has been attributed a single genetic name and in the case of allelic duplicated sequences, synonyms are given, if necessary. For the nomenclature we have introduced a standard principle for naming gene sequences based on priority rules. We have also applied a simple method to distinguish duplicated sequences of one and the same gene from non-allelic sequences of duplicated genes. By using these principles we have sorted out a lot of confusion in the literature and databanks. Along with the genetic name, the mnemonic from the EMBL databank, the codon bias, reference of the publication of the sequence and the EMBL accession numbers are included in each entry. PMID:8332521
Refactoring the Genetic Code for Increased Evolvability

DOE Office of Scientific and Technical Information (OSTI.GOV)

Pines, Gur; Winkler, James D.; Pines, Assaf

ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Refactoring the Genetic Code for Increased Evolvability

DOE PAGES

Pines, Gur; Winkler, James D.; Pines, Assaf; ...

2017-11-14

ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Plastome-Wide Nucleotide Substitution Rates Reveal Accelerated Rates in Papilionoideae and Correlations with Genome Features Across Legume Subfamilies.

PubMed

Schwarz, Erika N; Ruhlman, Tracey A; Weng, Mao-Lun; Khiyami, Mohammad A; Sabir, Jamal S M; Hajarah, Nahid H; Alharbi, Njud S; Rabah, Samar O; Jansen, Robert K

2017-04-01

This study represents the most comprehensive plastome-wide comparison of nucleotide substitution rates across the three subfamilies of Fabaceae: Caesalpinioideae, Mimosoideae, and Papilionoideae. Caesalpinioid and mimosoid legumes have large, unrearranged plastomes compared with papilionoids, which exhibit varying levels of rearrangement including the loss of the inverted repeat (IR) in the IR-lacking clade (IRLC). Using 71 genes common to 39 legume taxa representing all the three subfamilies, we show that papilionoids consistently have higher nucleotide substitution rates than caesalpinioids and mimosoids, and rates in the IRLC papilionoids are generally higher than those in the IR-containing papilionoids. Unsurprisingly, this pattern was significantly correlated with growth habit as most papilionoids are herbaceous, whereas caesalpinioids and mimosoids are largely woody. Both nonsynonymous (dN) and synonymous (dS) substitution rates were also correlated with several biological features including plastome size and plastomic rearrangements such as the number of inversions and indels. In agreement with previous reports, we found that genes in the IR exhibit between three and fourfold reductions in the substitution rates relative to genes within the large single-copy or small single-copy regions. Furthermore, former IR genes in IR-lacking taxa exhibit accelerated rates compared with genes contained in the IR.
Rejection of reclassification of Lactobacillus kimchii and Lactobacillus bobalius as later subjective synonyms of Lactobacillus paralimentarius using comparative genomics.

PubMed

Yang, Seung-Jo; Kim, Byung-Yong; Chun, Jongsik

2017-11-01

Lactobacillus bobalius, Lactobacillus kimchii and Lactobacillus paralimentarius belong to the genus Lactobacillus and show close phylogenetic relationships. In a previous study, L. bobalius and L. kimchii were proposed to be reclassified as later heterotypic synonyms of L. paralimentarius using high 16S rRNA gene sequence similarities (≥99.5 %) and DNA-DNA hybridization values (≥82 %). We determined high quality whole genome assemblies of the type strains of L. bobalius and L. kimchii, which were then compared with that of L. paralimentarius. Average nucleotide identity values among three genomes ranged from 91.4 to 92.3 % which are clearly below 95~96 %, the generally recognized cutoff value for bacterial species boundaries. On the basis of comparative genomic evidence, L. bobalius, L. kimchii, and L. paralimentarius should stand as separate species in the genus Lactobacillus. We therefore suggest rejecting the previous proposal to combine these three species into a single species.
SeqReporter: automating next-generation sequencing result interpretation and reporting workflow in a clinical laboratory.

PubMed

Roy, Somak; Durso, Mary Beth; Wald, Abigail; Nikiforov, Yuri E; Nikiforova, Marina N

2014-01-01

A wide repertoire of bioinformatics applications exist for next-generation sequencing data analysis; however, certain requirements of the clinical molecular laboratory limit their use: i) comprehensive report generation, ii) compatibility with existing laboratory information systems and computer operating system, iii) knowledgebase development, iv) quality management, and v) data security. SeqReporter is a web-based application developed using ASP.NET framework version 4.0. The client-side was designed using HTML5, CSS3, and Javascript. The server-side processing (VB.NET) relied on interaction with a customized SQL server 2008 R2 database. Overall, 104 cases (1062 variant calls) were analyzed by SeqReporter. Each variant call was classified into one of five report levels: i) known clinical significance, ii) uncertain clinical significance, iii) pending pathologists' review, iv) synonymous and deep intronic, and v) platform and panel-specific sequence errors. SeqReporter correctly annotated and classified 99.9% (859 of 860) of sequence variants, including 68.7% synonymous single-nucleotide variants, 28.3% nonsynonymous single-nucleotide variants, 1.7% insertions, and 1.3% deletions. One variant of potential clinical significance was re-classified after pathologist review. Laboratory information system-compatible clinical reports were generated automatically. SeqReporter also facilitated quality management activities. SeqReporter is an example of a customized and well-designed informatics solution to optimize and automate the downstream analysis of clinical next-generation sequencing data. We propose it as a model that may envisage the development of a comprehensive clinical informatics solution. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
Large-scale mass spectrometric detection of variant peptides resulting from non-synonymous nucleotide differences

PubMed Central

Sheynkman, Gloria M.; Shortreed, Michael R.; Frey, Brian L.; Scalf, Mark; Smith, Lloyd M.

2013-01-01

Each individual carries thousands of non-synonymous single nucleotide variants (nsSNVs) in their genome, each corresponding to a single amino acid polymorphism (SAP) in the encoded proteins. It is important to be able to directly detect and quantify these variations at the protein level in order to study post-transcriptional regulation, differential allelic expression, and other important biological processes. However, such variant peptides are not generally detected in standard proteomic analyses, due to their absence from the generic databases that are employed for mass spectrometry searching. Here, we extend previous work that demonstrated the use of customized SAP databases constructed from sample-matched RNA-Seq data. We collected deep coverage RNA-Seq data from the Jurkat cell line, compiled the set of nsSNVs that are expressed, used this information to construct a customized SAP database, and searched it against deep coverage shotgun MS data obtained from the same sample. This approach enabled detection of 421 SAP peptides mapping to 395 nsSNVs. We compared these peptides to peptides identified from a large generic search database containing all known nsSNVs (dbSNP) and found that more than 70% of the SAP peptides from this dbSNP-derived search were not supported by the RNA-Seq data, and thus are likely false positives. Next, we increased the SAP coverage from the RNA-Seq derived database by utilizing multiple protease digestions, thereby increasing variant detection to 695 SAP peptides mapping to 504 nsSNV sites. These detected SAP peptides corresponded to moderate to high abundance transcripts (30+ transcripts per million, TPM). The SAP peptides included 192 allelic pairs; the relative expression levels of the two alleles were evaluated for 51 of those pairs, and found to be comparable in all cases. PMID:24175627
How immunogenetically different are domestic pigs from wild boars: a perspective from single-nucleotide polymorphisms of 19 immunity-related candidate genes.

PubMed

Chen, Shanyuan; Gomes, Rui; Costa, Vânia; Santos, Pedro; Charneca, Rui; Zhang, Ya-ping; Liu, Xue-hong; Wang, Shao-qing; Bento, Pedro; Nunes, Jose-Luis; Buzgó, József; Varga, Gyula; Anton, István; Zsolnai, Attila; Beja-Pereira, Albano

2013-10-01

The coexistence of wild boars and domestic pigs across Eurasia makes it feasible to conduct comparative genetic or genomic analyses for addressing how genetically different a domestic species is from its wild ancestor. To test whether there are differences in patterns of genetic variability between wild and domestic pigs at immunity-related genes and to detect outlier loci putatively under selection that may underlie differences in immune responses, here we analyzed 54 single-nucleotide polymorphisms (SNPs) of 19 immunity-related candidate genes on 11 autosomes in three pairs of wild boar and domestic pig populations from China, Iberian Peninsula, and Hungary. Our results showed no statistically significant differences in allele frequency and heterozygosity across SNPs between three pairs of wild and domestic populations. This observation was more likely due to the widespread and long-lasting gene flow between wild boars and domestic pigs across Eurasia. In addition, we detected eight coding SNPs from six genes as outliers being under selection consistently by three outlier tests (BayeScan2.1, FDIST2, and Arlequin3.5). Among four non-synonymous outlier SNPs, one from TLR4 gene was identified as being subject to positive (diversifying) selection and three each from CD36, IFNW1, and IL1B genes were suggested as under balancing selection. All of these four non-synonymous variants were predicted as being benign by PolyPhen-2. Our results were supported by other independent lines of evidence for positive selection or balancing selection acting on these four immune genes (CD36, IFNW1, IL1B, and TLR4). Our study showed an example applying a candidate gene approach to identify functionally important mutations (i.e., outlier loci) in wild and domestic pigs for subsequent functional experiments.
Identification and characterization of single nucleotide polymorphisms in 6 growth-correlated genes in porcine by denaturing high performance liquid chromatography.

PubMed

Liu, Dewu; Zhang, Yushan; Du, Yinjun; Yang, Guanfu; Zhang, Xiquan

2007-06-01

The growth-correlated genes that are part of the neuroendocrine growth axis play crucial roles in the regulation of growth and development of pig. The identification of genetic polymorphisms in these genes will enable the scientist to evaluate the biological relevance of such polymorphisms and to gain a better understanding of quantitative traits like growth. In the present study, seven pairs of primers were designed to obtain unknown sequences of growth-correlated genes, and other 25 pairs of primers were designed to identify single nucleotide polymorphisms (SNP) using the denaturing high-performance liquid chromatography (DHPLC) technology in four pig breeds (Duroc, Landrace, Lantang and Wuzhishan), significantly differing in growth and development characteristics. A total of 101 polymorphisms were discovered in 10,707 base pairs (bp) from six genes of the ghrelin (GHRL), leptin (LEP), insulin-like growth factor II (IGF-II), insulin-like growth factor binding protein 2 (IGFBP-2), insulin-like growth factor binding protein 3 (IGFBP-3), and somatostatin (SS). The observed average distances between the SNP in the 5'UTR, coding regions, introns and 3'UTR were 134, 521, 81 and 92 bp, respectively. Four SNPs were found in the coding regions of IGF-II, IGFBP-2 and LEP, respectively. Two synonymous mutations were obtained in IGF-II and LEP genes respectively, and two non-synonymous were found in IGFBP-2 and LEP genes, respectively. Seven other mutations were also observed. Thirty-two PCR-RFLP markers were found among 101 polymorphisms of the six genes. The SNP discovered in this study would provide suitable markers for association studies of candidate genes with growth related traits in pig.
Calibration of Multiple Poliovirus Molecular Clocks Covering an Extended Evolutionary Range▿ †

PubMed Central

Jorba, Jaume; Campagnoli, Ray; De, Lina; Kew, Olen

2008-01-01

We have calibrated five different molecular clocks for circulating poliovirus based upon the rates of fixation of total substitutions (Kt), synonymous substitutions (Ks), synonymous transitions (As), synonymous transversions (Bs), and nonsynonymous substitutions (Ka) into the P1/capsid region (2,643 nucleotides). Rates were determined over a 10-year period by analysis of sequences of 31 wild poliovirus type 1 isolates representing a well-defined phylogeny derived from a common imported ancestor. Similar rates were obtained by linear regression, the maximum likelihood/single-rate dated-tip method, and Bayesian inference. The very rapid Kt [(1.03 ± 0.10) × 10−2 substitutions/site/year] and Ks [(1.00 ± 0.08) × 10−2] clocks were driven primarily by the As clock [(0.96 ± 0.09) × 10−2], the Bs clock was ∼10-fold slower [(0.10 ± 0.03) × 10−2], and the more stochastic Ka clock was ∼30-fold slower [(0.03 ± 0.01) × 10−2]. Nonsynonymous substitutions at all P1/capsid sites, including the neutralizing antigenic sites, appeared to be constrained by purifying selection. Simulation of the evolution of third-codon positions suggested that saturation of synonymous transitions would be evident at 10 years and complete at ∼65 years of independent transmission. Saturation of synonymous transversions was predicted to be minimal at 20 years and incomplete at 100 years. The rapid evolution of the Kt, Ks, and As clocks can be used to estimate the dates of divergence of closely related viruses, whereas the slower Bs and Ka clocks may be used to explore deeper evolutionary relationships within and across poliovirus genotypes. PMID:18287242
Rapid evolution of avirulence genes in rice blast fungus Magnaporthe oryzae

PubMed Central

2014-01-01

Background Rice blast fungus Magnaporthe oryzae is one of the most devastating pathogens in rice. Avirulence genes in this fungus share a gene-for-gene relationship with the resistance genes in its host rice. Although numerous studies have shown that rice blast R-genes are extremely diverse and evolve rapidly in their host populations, little is known about the evolutionary patterns of the Avr-genes in the pathogens. Results Here, six well-characterized Avr-genes and seven randomly selected non-Avr control genes were used to investigate the genetic variations in 62 rice blast strains from different parts of China. Frequent presence/absence polymorphisms, high levels of nucleotide variation (~10-fold higher than non-Avr genes), high non-synonymous to synonymous substitution ratios, and frequent shared non-synonymous substitution were observed in the Avr-genes of these diversified blast strains. In addition, most Avr-genes are closely associated with diverse repeated sequences, which may partially explain the frequent presence/absence polymorphisms in Avr-genes. Conclusion The frequent deletion and gain of Avr-genes and rapid non-synonymous variations might be the primary mechanisms underlying rapid adaptive evolution of pathogens toward virulence to their host plants, and these features can be used as the indicators for identifying additional Avr-genes. The high number of nucleotide polymorphisms among Avr-gene alleles could also be used to distinguish genetic groups among different strains. PMID:24725999
Allelic variation of the Waxy gene in foxtail millet [Setaria italica (L.) P. Beauv.] by single nucleotide polymorphisms.

PubMed

Van, K; Onoda, S; Kim, M Y; Kim, K D; Lee, S-H

2008-03-01

The Waxy (Wx) gene product controls the formation of a straight chain polymer of amylose in the starch pathway. Dominance/recessiveness of the Wx allele is associated with amylose content, leading to non-waxy/waxy phenotypes. For a total of 113 foxtail millet accessions, agronomic traits and the molecular differences of the Wx gene were surveyed to evaluate genetic diversities. Molecular types were associated with phenotypes determined by four specific primer sets (non-waxy, Type I; low amylose, Type VI; waxy, Type IV or V). Additionally, the insertion of transposable element in waxy was confirmed by ex1/TSI2R, TSI2F/ex2, ex2int2/TSI7R and TSI7F/ex4r. Seventeen single nucleotide polymorphims (SNPs) were observed from non-coding regions, while three SNPs from coding regions were non-synonymous. Interestingly, the phenotype of No. 88 was still non-waxy, although seven nucleotides (AATTGGT) insertion at 2,993 bp led to 78 amino acids shorter. The rapid decline of r (2) in the sequenced region (exon 1-intron 1-exon 2) suggested a low level of linkage disequilibrium and limited haplotype structure. K (s) values and estimation of evolutionary events indicate early divergence of S. italica among cereal crops. This study suggested the Wx gene was one of the targets in the selection process during domestication.
Analysis of nucleotide diversity among alleles of the major bacterial blight resistance gene Xa27 in cultivars of rice (Oryza sativa) and its wild relatives.

PubMed

Bimolata, Waikhom; Kumar, Anirudh; Sundaram, Raman Meenakshi; Laha, Gouri Shankar; Qureshi, Insaf Ahmed; Reddy, Gajjala Ashok; Ghazi, Irfan Ahmad

2013-08-01

Xa27 is one of the important R-genes, effective against bacterial blight disease of rice caused by Xanthomonas oryzae pv. oryzae (Xoo). Using natural population of Oryza, we analyzed the sequence variation in the functionally important domains of Xa27 across the Oryza species. DNA sequences of Xa27 alleles from 27 rice accessions revealed higher nucleotide diversity among the reported R-genes of rice. Sequence polymorphism analysis revealed synonymous and non-synonymous mutations in addition to a number of InDels in non-coding regions of the gene. High sequence variation was observed in the promoter region including the 5'UTR with 'π' value 0.00916 and 'θ w ' = 0.01785. Comparative analysis of the identified Xa27 alleles with that of IRBB27 and IR24 indicated the operation of both positive selection (Ka/Ks > 1) and neutral selection (Ka/Ks ≈ 0). The genetic distances of alleles of the gene from Oryza nivara were nearer to IRBB27 as compared to IR24. We also found the presence of conserved and null UPT (upregulated by transcriptional activator) box in the isolated alleles. Considerable amino acid polymorphism was localized in the trans-membrane domain for which the functional significance is yet to be elucidated. However, the absence of functional UPT box in all the alleles except IRBB27 suggests the maintenance of single resistant allele throughout the natural population.

Molecular epidemiological and phylogenetic analyses of canine parvovirus in domestic dogs and cats in Beijing, 2010-2013.

PubMed

Wu, Jing; Gao, Xin-Tao; Hou, Shao-Hua; Guo, Xiao-Yu; Yang, Xue-Shong; Yuan, Wei-Feng; Xin, Ting; Zhu, Hong-Fei; Jia, Hong

2015-10-01

Fifty-five samples (15.62%) collected from dogs and cats were identified as canine parvovirus (CPV) infection in Beijing during 2010-2013. The nucleotide identities and aa similarities were 98.2-100% and 97.7-100%, respectively, when compared with the reference isolates. Also, several synonymous and non-synonymous mutations were also recorded for the first time. New CPV-2a was dominant, accounting for 90.90% of the samples. Two of the 16 samples collected from cats were identified as new CPV-2a (12.5%), showing nucleotide identities of 100% with those from dogs. Twelve samples (15.78%) collected from completely immunized dogs were found to be new CPV-2a, which means CPV-2 vaccines may not provide sufficient protection for the epidemic strains.
Correlations of nucleotide substitution rates and base composition of mammalian coding sequences with protein structure.

PubMed

Chiusano, M L; D'Onofrio, G; Alvarez-Valin, F; Jabbari, K; Colonna, G; Bernardi, G

1999-09-30

We investigated the relationships between the nucleotide substitution rates and the predicted secondary structures in the three states representation (alpha-helix, beta-sheet, and coil). The analysis was carried out on 34 alignments, each of which comprised sequences belonging to at least four different mammalian orders. The rates of synonymous substitution were found to be significantly different in regions predicted to be alpha-helix, beta-sheet, or coil. Likewise, the nonsynonymous rates also differ, although expectedly at a lower extent, in the three types of secondary structure, suggesting that different selective constraints associated with the different structures are affecting in a similar way the synonymous and nonsynonymous rates. Moreover, the base composition of the third codon positions is different in coding sequence regions corresponding to different secondary structures of proteins.
Molecular Population Genetics of Sex Determination Genes: The Transformer Gene of Drosophila Melanogaster

PubMed Central

Walthour, C. S.; Schaeffer, S. W.

1994-01-01

The transformer locus (tra) produces an RNA processing protein that alternatively splices the doublesex pre-mRNA in the sex determination hierarchy of Drosophila melanogaster. Comparisons of the tra coding region among Drosophila species have revealed an unusually high degree of divergence in synonymous and nonsynonymous sites. In this study, we tested the hypothesis that the tra gene will be polymorphic in synonymous and nonsynonymous sites within species by investigating nucleotide sequence variation in eleven tra alleles within D. melanogaster. Of the 1063 nucleotides examined, two synonymous sites were polymorphic and no amino acid variation was detected. Three statistical tests were used to detect departures from an equilibrium neutral model. Two tests failed to reject a neutral model of molecular evolution because of low statisitical power associated with low levels of genetic variation (Tajima/Fu and Li). The Hudson, Kreitman, and Aguade test rejected a neutral model when the tra region was compared to the 5'-flanking region of alcohol dehydrogenase (Adh). The lack of variability in the tra gene is consistent with a recent selective sweep of a beneficial allele in or near the tra locus. PMID:8013913
Molecular Characterization and Expression Analysis of Creatine Kinase Muscle (CK-M) Gene in Horse.

PubMed

Do, Kyong-Tak; Cho, Hyun-Woo; Badrinath, Narayanasamy; Park, Jeong-Woong; Choi, Jae-Young; Chung, Young-Hwa; Lee, Hak-Kyo; Song, Ki-Duk; Cho, Byung-Wook

2015-12-01

Since ancient days, domestic horses have been closely associated with human civilization. Today, horse racing is an important industry. Various genes involved in energy production and muscle contraction are differentially regulated during a race. Among them, creatine kinase (CK) is well known for its regulation of energy preservation in animal cells. CK is an iso-enzyme, encoded by different genes and expressed in skeletal muscle, heart, brain and leucocytes. We confirmed that the expression of CK-M significantly increased in the blood after a 30 minute exercise period, while no considerable change was observed in skeletal muscle. Analysis of various tissues showed an ubiquitous expression of the CK-M gene in the horse; CK-M mRNA expression was predominant in the skeletal muscle and the cardiac muscle compared to other tissues. An evolutionary study by synonymous and non-synonymous single nucleotide polymorphism ratio of CK-M gene revealed a positive selection that was conserved in the horse. More studies are warranted in order to develop the expression of CK-M gene as a biomarker in blood of thoroughbred horses.
Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences.

PubMed

Pang, Erli; Wu, Xiaomei; Lin, Kui

2016-06-01

Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
Theileria parva antigens recognized by CD8+ T cells show varying degrees of diversity in buffalo-derived infected cell lines.

PubMed

Sitt, Tatjana; Pelle, Roger; Chepkwony, Maurine; Morrison, W Ivan; Toye, Philip

2018-05-06

The extent of sequence diversity among the genes encoding 10 antigens (Tp1-10) known to be recognized by CD8+ T lymphocytes from cattle immune to Theileria parva was analysed. The sequences were derived from parasites in 23 buffalo-derived cell lines, three cattle-derived isolates and one cloned cell line obtained from a buffalo-derived stabilate. The results revealed substantial variation among the antigens through sequence diversity. The greatest nucleotide and amino acid diversity were observed in Tp1, Tp2 and Tp9. Tp5 and Tp7 showed the least amount of allelic diversity, and Tp5, Tp6 and Tp7 had the lowest levels of protein diversity. Tp6 was the most conserved protein; only a single non-synonymous substitution was found in all obtained sequences. The ratio of non-synonymous: synonymous substitutions varied from 0.84 (Tp1) to 0.04 (Tp6). Apart from Tp2 and Tp9, we observed no variation in the other defined CD8+ T cell epitopes (Tp4, 5, 7 and 8), indicating that epitope variation is not a universal feature of T. parva antigens. In addition to providing markers that can be used to examine the diversity in T. parva populations, the results highlight the potential for using conserved antigens to develop vaccines that provide broad protection against T. parva.
Polymorphisms in the microglial marker molecule CX3CR1 affect the blood volume of the human brain.

PubMed

Sakai, Mai; Takeuchi, Hikaru; Yu, Zhiqian; Kikuchi, Yoshie; Ono, Chiaki; Takahashi, Yuta; Ito, Fumiaki; Matsuoka, Hiroo; Tanabe, Osamu; Yasuda, Jun; Taki, Yasuyuki; Kawashima, Ryuta; Tomita, Hiroaki

2018-06-01

CX3CR1, a G-protein-coupled receptor, is involved in various inflammatory processes. Two non-synonymous single nucleotide polymorphisms, V249I (rs3732379) and T280M (rs3732378), are located in the sixth and seventh transmembrane domains of the CX3CR1 protein, respectively. Previous studies have indicated significant associations between T280M and leukocyte functional characteristics, including adhesion, signaling, and chemotaxis, while the function of V249I is unclear. In the brain, microglia are the only proven and widely accepted CX3CR1-expressing cells. This study aimed to specify whether there were specific brain regions on which these two single nucleotide polymorphisms exert their biological impacts through their functional effects on microglia. Associations between the single nucleotide polymorphisms and brain characteristics, including gray and white matter volumes, white matter integrity, resting arterial blood volume, and cerebral blood flow, were evaluated among 1300 healthy Japanese individuals. The major allele carriers (V249 and T280) were significantly associated with an increased total arterial blood volume of the whole brain, especially around the bilateral precuneus, left posterior cingulate cortex, and left posterior parietal cortex. There were no significant associations between the genotypes and other brain structural indicators. This finding suggests that the CX3CR1 variants may affect arterial structures in the brain, possibly via interactions between microglia and brain microvascular endothelial cells. © 2018 The Authors. Psychiatry and Clinical Neurosciences © 2018 Japanese Society of Psychiatry and Neurology.
Cross-host evolution of severe acute respiratory syndrome coronavirus in palm civet and human

PubMed Central

Song, Huai-Dong; Tu, Chang-Chun; Zhang, Guo-Wei; Wang, Sheng-Yue; Zheng, Kui; Lei, Lian-Cheng; Chen, Qiu-Xia; Gao, Yu-Wei; Zhou, Hui-Qiong; Xiang, Hua; Zheng, Hua-Jun; Chern, Shur-Wern Wang; Cheng, Feng; Pan, Chun-Ming; Xuan, Hua; Chen, Sai-Juan; Luo, Hui-Ming; Zhou, Duan-Hua; Liu, Yu-Fei; He, Jian-Feng; Qin, Peng-Zhe; Li, Ling-Hui; Ren, Yu-Qi; Liang, Wen-Jia; Yu, Ye-Dong; Anderson, Larry; Wang, Ming; Xu, Rui-Heng; Wu, Xin-Wei; Zheng, Huan-Ying; Chen, Jin-Ding; Liang, Guodong; Gao, Yang; Liao, Ming; Fang, Ling; Jiang, Li-Yun; Li, Hui; Chen, Fang; Di, Biao; He, Li-Juan; Lin, Jin-Yan; Tong, Suxiang; Kong, Xiangang; Du, Lin; Hao, Pei; Tang, Hua; Bernini, Andrea; Yu, Xiao-Jing; Spiga, Ottavia; Guo, Zong-Ming; Pan, Hai-Yan; He, Wei-Zhong; Manuguerra, Jean-Claude; Fontanet, Arnaud; Danchin, Antoine; Niccolai, Neri; Li, Yi-Xue; Wu, Chung-I; Zhao, Guo-Ping

2005-01-01

The genomic sequences of severe acute respiratory syndrome coronaviruses from human and palm civet of the 2003/2004 outbreak in the city of Guangzhou, China, were nearly identical. Phylogenetic analysis suggested an independent viral invasion from animal to human in this new episode. Combining all existing data but excluding singletons, we identified 202 single-nucleotide variations. Among them, 17 are polymorphic in palm civets only. The ratio of nonsynonymous/synonymous nucleotide substitution in palm civets collected 1 yr apart from different geographic locations is very high, suggesting a rapid evolving process of viral proteins in civet as well, much like their adaptation in the human host in the early 2002–2003 epidemic. Major genetic variations in some critical genes, particularly the Spike gene, seemed essential for the transition from animal-to-human transmission to human-to-human transmission, which eventually caused the first severe acute respiratory syndrome outbreak of 2002/2003. PMID:15695582
Detection of 549 new HLA alleles in potential stem cell donors from the United States, Poland and Germany.

PubMed

Hernández-Frederick, C J; Cereb, N; Giani, A S; Ruppel, J; Maraszek, A; Pingel, J; Sauter, J; Schmidt, A H; Yang, S Y

2016-01-01

We characterized 549 new human leukocyte antigen (HLA) class I and class II alleles found in newly registered stem cell donors as a result of high-throughput HLA typing. New alleles include 101 HLA-A, 132 HLA-B, 105 HLA-C, 2 HLA-DRB1, 89 HLA-DQB1 and 120 HLA-DPB1 alleles. Mainly, new alleles comprised single nucleotide variations when compared with homologous sequences. We identified nonsynonymous nucleotide mutations in 70.7% of all new alleles, synonymous variations in 26.4% and nonsense substitutions in 2.9% (null alleles). Some new alleles (55, 10.0%) were found multiple times, HLA-DPB1 alleles being the most frequent among these. Furthermore, as several new alleles were identified in individuals from ethnic minority groups, the relevance of recruiting donors belonging to such groups and the importance of ethnicity data collection in donor centers and registries is highlighted. © 2015 The Authors. HLA published by John Wiley & Sons Ltd.
Genome-wide re-sequencing of multidrug-resistant Mycobacterium leprae Airaku-3.

PubMed

Singh, P; Benjak, A; Carat, S; Kai, M; Busso, P; Avanzi, C; Paniz-Mondolfi, A; Peter, C; Harshman, K; Rougemont, J; Matsuoka, M; Cole, S T

2014-10-01

Genotyping and molecular characterization of drug resistance mechanisms in Mycobacterium leprae enables disease transmission and drug resistance trends to be monitored. In the present study, we performed genome-wide analysis of Airaku-3, a multidrug-resistant strain with an unknown mechanism of resistance to rifampicin. We identified 12 unique non-synonymous single-nucleotide polymorphisms (SNPs) including two in the transporter-encoding ctpC and ctpI genes. In addition, two SNPs were found that improve the resolution of SNP-based genotyping, particularly for Venezuelan and South East Asian strains of M. leprae. © 2014 The Authors Clinical Microbiology and Infection © 2014 European Society of Clinical Microbiology and Infectious Diseases.
A non-synonymous nucleotide substitution can account for one evolutionary route to sesquiterpene synthase activity in the TPS-b subgroup.

PubMed

Green, Sol; Baker, Edward N; Laing, William

2011-06-23

Plant sesquiterpene and hemiterpene synthases in the monoterpene synthase dominated TPS-b subgroup are thought to have evolved independently from a monoterpene synthase ancestor. A TPS-b sesquiterpene synthase from apple (MdAFS1), which predominantly produces α-farnesene, can also synthesize the monoterpene (E)-β-ocimene. The dual activity offered a functional link to an ancestral MdAFS1 enzyme and a rational basis for investigation of the evolution of TPS-b sesquiterpene enzymes. Protein modelling and mutagenesis analysis of the MdAFS1 active site identified a non-synonymous nucleotide substitution that could account for the requisite shift in substrate specificity necessary for the emergence of its sesquiterpene activity during the evolution of the TPS-b enzymes. Copyright © 2011 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.
Genetic variation in eleven phase I drug metabolism genes in an ethnically diverse population.

PubMed

Solus, Joseph F; Arietta, Brenda J; Harris, James R; Sexton, David P; Steward, John Q; McMunn, Chara; Ihrie, Patrick; Mehall, Janelle M; Edwards, Todd L; Dawson, Elliott P

2004-10-01

The extent of genetic variation found in drug metabolism genes and its contribution to interindividual variation in response to medication remains incompletely understood. To better determine the identity and frequency of variation in 11 phase I drug metabolism genes, the exons and flanking intronic regions of the cytochrome P450 (CYP) isoenzyme genes CYP1A1, CYP1A2, CYP2A6, CYP2B6, CYP2C8, CYP2C9, CYP2C19, CYP2D6, CYP2E1, CYP3A4 and CYP3A5 were amplified from genomic DNA and sequenced. A total of 60 kb of bi-directional sequence was generated from each of 93 human DNAs, which included Caucasian, African-American and Asian samples. There were 388 different polymorphisms identified. These included 269 non-coding, 45 synonymous and 74 non-synonymous polymorphisms. Of these, 54% were novel and included 176 non-coding, 14 synonymous and 21 non-synonymous polymorphisms. Of the novel variants observed, 85 were represented by single occurrences of the minor allele in the sample set. Much of the variation observed was from low-frequency alleles. Comparatively, these genes are variation-rich. Calculations measuring genetic diversity revealed that while the values for the individual genes are widely variable, the overall nucleotide diversity of 7.7 x 10(-4) and polymorphism parameter of 11.5 x 10(-4) are higher than those previously reported for other gene sets. Several independent measurements indicate that these genes are under selective pressure, particularly for polymorphisms corresponding to non-synonymous amino acid changes. There is relatively little difference in measurements of diversity among the ethnic groups, but there are large differences among the genes and gene subfamilies themselves. Of the three CYP subfamilies involved in phase I drug metabolism (1, 2, and 3), subfamily 2 displays the highest levels of genetic diversity.
Sequencing of the needle transcriptome from Norway spruce (Picea abies Karst L.) reveals lower substitution rates, but similar selective constraints in gymnosperms and angiosperms

PubMed Central

2012-01-01

Background A detailed knowledge about spatial and temporal gene expression is important for understanding both the function of genes and their evolution. For the vast majority of species, transcriptomes are still largely uncharacterized and even in those where substantial information is available it is often in the form of partially sequenced transcriptomes. With the development of next generation sequencing, a single experiment can now simultaneously identify the transcribed part of a species genome and estimate levels of gene expression. Results mRNA from actively growing needles of Norway spruce (Picea abies) was sequenced using next generation sequencing technology. In total, close to 70 million fragments with a length of 76 bp were sequenced resulting in 5 Gbp of raw data. A de novo assembly of these reads, together with publicly available expressed sequence tag (EST) data from Norway spruce, was used to create a reference transcriptome. Of the 38,419 PUTs (putative unique transcripts) longer than 150 bp in this reference assembly, 83.5% show similarity to ESTs from other spruce species and of the remaining PUTs, 3,704 show similarity to protein sequences from other plant species, leaving 4,167 PUTs with limited similarity to currently available plant proteins. By predicting coding frames and comparing not only the Norway spruce PUTs, but also PUTs from the close relatives Picea glauca and Picea sitchensis to both Pinus taeda and Taxus mairei, we obtained estimates of synonymous and non-synonymous divergence among conifer species. In addition, we detected close to 15,000 SNPs of high quality and estimated gene expression differences between samples collected under dark and light conditions. Conclusions Our study yielded a large number of single nucleotide polymorphisms as well as estimates of gene expression on transcriptome scale. In agreement with a recent study we find that the synonymous substitution rate per year (0.6 × 10−09 and 1.1 × 10−09) is an order of magnitude smaller than values reported for angiosperm herbs. However, if one takes generation time into account, most of this difference disappears. The estimates of the dN/dS ratio (non-synonymous over synonymous divergence) reported here are in general much lower than 1 and only a few genes showed a ratio larger than 1. PMID:23122049
A large-scale study of the random variability of a coding sequence: a study on the CFTR gene.

PubMed

Modiano, Guido; Bombieri, Cristina; Ciminelli, Bianca Maria; Belpinati, Francesca; Giorgi, Silvia; Georges, Marie des; Scotet, Virginie; Pompei, Fiorenza; Ciccacci, Cinzia; Guittard, Caroline; Audrézet, Marie Pierre; Begnini, Angela; Toepfer, Michael; Macek, Milan; Ferec, Claude; Claustres, Mireille; Pignatti, Pier Franco

2005-02-01

Coding single nucleotide substitutions (cSNSs) have been studied on hundreds of genes using small samples (n(g) approximately 100-150 genes). In the present investigation, a large random European population sample (average n(g) approximately 1500) was studied for a single gene, the CFTR (Cystic Fibrosis Transmembrane conductance Regulator). The nonsynonymous (NS) substitutions exhibited, in accordance with previous reports, a mean probability of being polymorphic (q > 0.005), much lower than that of the synonymous (S) substitutions, but they showed a similar rate of subpolymorphic (q < 0.005) variability. This indicates that, in autosomal genes that may have harmful recessive alleles (nonduplicated genes with important functions), genetic drift overwhelms selection in the subpolymorphic range of variability, making disadvantageous alleles behave as neutral. These results imply that the majority of the subpolymorphic nonsynonymous alleles of these genes are selectively negative or even pathogenic.
Effects of the BDNF Val66Met Polymorphism on Anxiety-Like Behavior Following Nicotine Withdrawal in Mice

PubMed Central

Lee, Bridgin G.; Anastasia, Agustin; Hempstead, Barbara L.; Lee, Francis S.

2015-01-01

Introduction: Nicotine withdrawal is characterized by both affective and cognitive symptoms. Identifying genetic polymorphisms that could affect the symptoms associated with nicotine withdrawal are important in predicting withdrawal sensitivity and identifying personalized cessation therapies. In the current study we used a mouse model of a non-synonymous single nucleotide polymorphism in the translated region of the brain-derived neurotrophic factor (BDNF) gene that substitutes a valine (Val) for a methionine (Met) amino acid (Val66Met) to examine the relationship between the Val66Met single nucleotide polymorphism and nicotine dependence. Methods: This study measured proBDNF and the BDNF prodomain levels following nicotine and nicotine withdrawal and examined a mouse model of a common polymorphism in this protein (BDNFMet/Met) in three behavioral paradigms: novelty-induced hypophagia, marble burying, and the open-field test. Results: Using the BDNF knock-in mouse containing the BDNF Val66Met polymorphism we found: (1) blunted anxiety-like behavior in BDNFMet/Met mice following withdrawal in three behavioral paradigms: novelty-induced hypophagia, marble burying, and the open-field test; (2) the anxiolytic effects of chronic nicotine are absent in BDNFMet/Met mice; and (3) an increase in BDNF prodomain in BDNFMet/Met mice following nicotine withdrawal. Conclusions: Our study is the first to examine the effect of the BDNF Val66Met polymorphism on the affective symptoms of withdrawal from nicotine in mice. In these mice, a single-nucleotide polymorphism in the translated region of the BDNF gene can result in a blunted withdrawal, as measured by decreased anxiety-like behavior. The significant increase in the BDNF prodomain in BDNFMet/Met mice following nicotine cessation suggests a possible role of this ligand in the circuitry remodeling after withdrawal. PMID:25744957
Sequence diversity and molecular evolutionary rates between buffalo and cattle.

PubMed

Moaeen-ud-Din, M; Bilal, G

2015-02-01

Identification of genes of importance regarding production traits in buffalo is impaired by a paucity of genomic resources. Choice to fill this gap is to exploit data available for cow. The cross-species application of comparative genomics tools is potential gear to investigate the buffalo genome. However, this is dependent on nucleotide sequences similarity. In this study, gene diversity between buffalo and cattle was determined using 86 gene orthologues. There was approximately 3% difference in all genes in terms of nucleotide diversity and 0.267 ± 0.134 in amino acids, indicating the possibility for successfully using cross-species strategies for genomic studies. There were significantly higher non-synonymous substitutions both in cattle and buffalo; however, there was similar difference in terms of dN- dS (4.414 versus 4.745) in buffalo and cattle, respectively. Higher rate of non-synonymous substitutions at similar level in buffalo and cattle indicated a similar positive selection pressure. Results for relative rate test were assessed with the chi-squared test. There was no significance difference on unique mutations between cattle and buffalo lineages at synonymous sites. However, there was a significance difference on unique mutations for non-synonymous sites, indicating ongoing mutagenic process that generates substitutional mutation at approximately the same rate at silent sites. Moreover, despite of common ancestry, our results indicate a different divergent time among genes of cattle and buffalo. This is the first demonstration that variable rates of molecular evolution may be present within the family Bovidae. © 2014 Blackwell Verlag GmbH.
Genome Analysis of the Domestic Dog (Korean Jindo) by Massively Parallel Sequencing

PubMed Central

Kim, Ryong Nam; Kim, Dae-Soo; Choi, Sang-Haeng; Yoon, Byoung-Ha; Kang, Aram; Nam, Seong-Hyeuk; Kim, Dong-Wook; Kim, Jong-Joo; Ha, Ji-Hong; Toyoda, Atsushi; Fujiyama, Asao; Kim, Aeri; Kim, Min-Young; Park, Kun-Hyang; Lee, Kang Seon; Park, Hong-Seog

2012-01-01

Although pioneering sequencing projects have shed light on the boxer and poodle genomes, a number of challenges need to be met before the sequencing and annotation of the dog genome can be considered complete. Here, we present the DNA sequence of the Jindo dog genome, sequenced to 45-fold average coverage using Illumina massively parallel sequencing technology. A comparison of the sequence to the reference boxer genome led to the identification of 4 675 437 single nucleotide polymorphisms (SNPs, including 3 346 058 novel SNPs), 71 642 indels and 8131 structural variations. Of these, 339 non-synonymous SNPs and 3 indels are located within coding sequences (CDS). In particular, 3 non-synonymous SNPs and a 26-bp deletion occur in the TCOF1 locus, implying that the difference observed in cranial facial morphology between Jindo and boxer dogs might be influenced by those variations. Through the annotation of the Jindo olfactory receptor gene family, we found 2 unique olfactory receptor genes and 236 olfactory receptor genes harbouring non-synonymous homozygous SNPs that are likely to affect smelling capability. In addition, we determined the DNA sequence of the Jindo dog mitochondrial genome and identified Jindo dog-specific mtDNA genotypes. This Jindo genome data upgrade our understanding of dog genomic architecture and will be a very valuable resource for investigating not only dog genetics and genomics but also human and dog disease genetics and comparative genomics. PMID:22474061
Identification of single nucleotide polymorphisms in the ASB15 gene and their associations with chicken growth and carcass traits.

PubMed

Wang, Y C; Jiang, R R; Kang, X T; Li, Z J; Han, R L; Geng, J; Fu, J X; Wang, J F; Wu, J P

2015-09-25

ASB15 is a member of the ankyrin repeat and suppressor of cytokine signaling box family, and is predominantly expressed in skeletal muscle. In the present study, an F2 resource population of Gushi chickens crossed with Anka broilers was used to investigate the genetic effects of the chicken ASB15 gene. Two single nucleotide polymorphisms (SNPs) (rs315759231 A>G and rs312619270 T>C) were identified in exon 7 of the ASB15 gene using forced chain reaction-restriction fragment length polymorphism and DNA sequencing. One was a missense SNP (rs315759231 A>G) and the other was a synonymous SNP (rs312619270 T>C). The rs315759231 A>G polymorphism was significantly associated with body weight at birth, 12-week body slanting length, semi-evisceration weight, evisceration weight, leg muscle weight, and carcass weight (P < 0.05). The rs312619270 T>C polymorphism was significantly associated with body weight at birth, 4, 8, and 12-week body weight, 8-week shank length, 12-week breast bone length, 8 and 12-week body slanting length, breast muscle weight, and carcass weight (P < 0.05). Our results suggest that the ASB15 gene profoundly affects chicken growth and carcass traits.
Development of 101 novel EST-derived single nucleotide polymorphism markers for Zhikong scallop ( Chlamys farreri)

NASA Astrophysics Data System (ADS)

Li, Jiqin; Bao, Zhenmin; Li, Ling; Wang, Xiaojian; Wang, Shi; Hu, Xiaoli

2013-09-01

Zhikong scallop ( Chlamys farreri) is an important maricultured species in China. Many researches on this species, such as population genetics and QTL fine-mapping, need a large number of molecular markers. In this study, based on the expressed sequence tags (EST), a total of 300 putative single nucleotide polymorphisms (SNPs) were selected and validated using high resolution melting (HRM) technology with unlabeled probe. Of them, 101 (33.7%) were found to be polymorphic in 48 individuals from 4 populations. Further evaluation with 48 individuals from Qingdao population showed that all the polymorphic loci had two alleles with the minor allele frequency ranged from 0.046 to 0.500. The observed and expected heterozygosities ranged from 0.000 to 0.925 and from 0.089 to 0.505, respectively. Fifteen loci deviated significantly from Hardy-Weinberg equilibrium and significant linkage disequilibrate was detected in one pair of markers. BLASTx gave significant hits for 72 of the 101 polymorphic SNP-containing ESTs. Thirty four polymorphic SNP loci were predicted to be non-synonymous substitutions as they caused either the change of codons (33 SNPs) or pretermination of translation (1 SNP). The markers developed can be used for the population studies and genetic improvement on Zhikong scallop.
VarMod: modelling the functional effects of non-synonymous variants

PubMed Central

Pappalardo, Morena; Wass, Mark N.

2014-01-01

Unravelling the genotype–phenotype relationship in humans remains a challenging task in genomics studies. Recent advances in sequencing technologies mean there are now thousands of sequenced human genomes, revealing millions of single nucleotide variants (SNVs). For non-synonymous SNVs present in proteins the difficulties of the problem lie in first identifying those nsSNVs that result in a functional change in the protein among the many non-functional variants and in turn linking this functional change to phenotype. Here we present VarMod (Variant Modeller) a method that utilises both protein sequence and structural features to predict nsSNVs that alter protein function. VarMod develops recent observations that functional nsSNVs are enriched at protein–protein interfaces and protein–ligand binding sites and uses these characteristics to make predictions. In benchmarking on a set of nearly 3000 nsSNVs VarMod performance is comparable to an existing state of the art method. The VarMod web server provides extensive resources to investigate the sequence and structural features associated with the predictions including visualisation of protein models and complexes via an interactive JSmol molecular viewer. VarMod is available for use at http://www.wasslab.org/varmod. PMID:24906884

Automated SNP detection from a large collection of white spruce expressed sequences: contributing factors and approaches for the categorization of SNPs

PubMed Central

Pavy, Nathalie; Parsons, Lee S; Paule, Charles; MacKay, John; Bousquet, Jean

2006-01-01

Background High-throughput genotyping technologies represent a highly efficient way to accelerate genetic mapping and enable association studies. As a first step toward this goal, we aimed to develop a resource of candidate Single Nucleotide Polymorphisms (SNP) in white spruce (Picea glauca [Moench] Voss), a softwood tree of major economic importance. Results A white spruce SNP resource encompassing 12,264 SNPs was constructed from a set of 6,459 contigs derived from Expressed Sequence Tags (EST) and by using the bayesian-based statistical software PolyBayes. Several parameters influencing the SNP prediction were analysed including the a priori expected polymorphism, the probability score (PSNP), and the contig depth and length. SNP detection in 3' and 5' reads from the same clones revealed a level of inconsistency between overlapping sequences as low as 1%. A subset of 245 predicted SNPs were verified through the independent resequencing of genomic DNA of a genotype also used to prepare cDNA libraries. The validation rate reached a maximum of 85% for SNPs predicted with either PSNP ≥ 0.95 or ≥ 0.99. A total of 9,310 SNPs were detected by using PSNP ≥ 0.95 as a criterion. The SNPs were distributed among 3,590 contigs encompassing an array of broad functional categories, with an overall frequency of 1 SNP per 700 nucleotide sites. Experimental and statistical approaches were used to evaluate the proportion of paralogous SNPs, with estimates in the range of 8 to 12%. The 3,789 coding SNPs identified through coding region annotation and ORF prediction, were distributed into 39% nonsynonymous and 61% synonymous substitutions. Overall, there were 0.9 SNP per 1,000 nonsynonymous sites and 5.2 SNPs per 1,000 synonymous sites, for a genome-wide nonsynonymous to synonymous substitution rate ratio (Ka/Ks) of 0.17. Conclusion We integrated the SNP data in the ForestTreeDB database along with functional annotations to provide a tool facilitating the choice of candidate genes for mapping purposes or association studies. PMID:16824208
Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.

PubMed

Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y

2013-02-27

We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending on U (51.9%) or A (22.7%) than on C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.
Bison PRNP genotyping and potential association with Brucella spp. seroprevalence

USGS Publications Warehouse

Seabury, C.M.; Halbert, N.D.; Gogan, P.J.P.; Templeton, J.W.; Derr, J.N.

2005-01-01

The implication that host cellular prion protein (PrPC) may function as a cell surface receptor and/or portal protein for Brucella abortus in mice prompted an evaluation of nucleotide and amino acid variation within exon 3 of the prion protein gene (PRNP) for six US bison populations. A non-synonymous single nucleotide polymorphism (T50C), resulting in the predicted amino acid replacement M17T (Met ??? Thr), was identified in each population. To date, no variation (T50: Met) has been detected at the corresponding exon 3 nucleotide and/or amino acid position for domestic cattle. Notably, 80% (20 of 25) of the Yellowstone National Park bison possessing the C/C genotype were Brucella spp. seropositive, representing a significant (P = 0.021) association between seropositivity and the C/C genotypic class. Moreover, significant differences in the distribution of PRNP exon 3 alleles and genotypes were detected between Yellowstone National Park bison and three bison populations that were either founded from seronegative stock or previously subjected to test-and-slaughter management to eradicate brucellosis. Unlike domestic cattle, no indel polymorphisms were detected within the corresponding regions of the putative bison PRNP promoter, intron 1, octapeptide repeat region or 3???-untranslated region for any population examined. This study provides the first evidence of a potential association between nucleotide variation within PRNP exon 3 and the presence of Brucella spp. antibodies in bison, implicating PrPC in the natural resistance of bison to brucellosis infection. ?? 2005 International Society for Animal Genetics.
Genomic analysis of codon usage shows influence of mutation pressure, natural selection, and host features on Marburg virus evolution.

PubMed

Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang

2015-08-26

The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Molecular Evolution of a Type 1 Wild-Vaccine Poliovirus Recombinant during Widespread Circulation in China

PubMed Central

Liu, Hong-Mei; Zheng, Du-Ping; Zhang, Li-Bi; Oberste, M. Steven; Pallansch, Mark A.; Kew, Olen M.

2000-01-01

Type 1 wild-vaccine recombinant polioviruses were isolated from poliomyelitis patients in China from 1991 to 1993. We compared the sequences of 34 recombinant isolates over the 1,353-nucleotide (nt) genomic interval (nt 2480 to 3832) encoding the major capsid protein, VP1, and the protease, 2A. All recombinants had a 367-nt block of sequence (nt 3271 to 3637) derived from the Sabin 1 oral poliovirus vaccine strain spanning the 3′-terminal sequences of VP1 (115 nt) and the 5′ half of 2A (252 nt). The remaining VP1 sequences were closely (up to 99.5%) related to those of a major genotype of wild type 1 poliovirus endemic to China up to 1994. In contrast, the non-vaccine-derived sequences at the 3′ half of 2A were more distantly related (<90% nucleotide sequence match) to those of other contemporary wild polioviruses from China. The vaccine-derived sequences of the earliest (April 1991) isolates completely matched those of Sabin 1. Later isolates diverged from the early isolates primarily by accumulation of synonymous base substitutions (at a rate of ∼3.7 × 10−2 substitutions per synonymous site per year) over the entire VP1-2A interval. Distinct evolutionary lineages were found in different Chinese provinces. From the combined epidemiologic and evolutionary analyses, we propose that the recombinant virus arose during mixed infection of a single individual in northern China in early 1991 and that its progeny spread by multiple independent chains of transmission into some of the most populous areas of China within a year of the initiating infection. PMID:11070012
Effect prediction of identified SNPs linked to fruit quality and chilling injury in peach [Prunus persica (L.) Batsch].

PubMed

Martínez-García, Pedro J; Fresnedo-Ramírez, Jonathan; Parfitt, Dan E; Gradziel, Thomas M; Crisosto, Carlos H

2013-01-01

Single nucleotide polymorphisms (SNPs) are a fundamental source of genomic variation. Large SNP panels have been developed for Prunus species. Fruit quality traits are essential peach breeding program objectives since they determine consumer acceptance, fruit consumption, industry trends and cultivar adoption. For many cultivars, these traits are negatively impacted by cold storage, used to extend fruit market life. The major symptoms of chilling injury are lack of flavor, off flavor, mealiness, flesh browning, and flesh bleeding. A set of 1,109 SNPs was mapped previously and 67 were linked with these complex traits. The prediction of the effects associated with these SNPs on downstream products from the 'peach v1.0' genome sequence was carried out. A total of 2,163 effects were detected, 282 effects (non-synonymous, synonymous or stop codon gained) were located in exonic regions (13.04 %) and 294 placed in intronic regions (13.59 %). An extended list of genes and proteins that could be related to these traits was developed. Two SNP markers that explain a high percentage of the observed phenotypic variance, UCD_SNP_1084 and UCD_SNP_46, are associated with zinc finger (C3HC4-type RING finger) family protein and AOX1A (alternative oxidase 1a) protein groups, respectively. In addition, phenotypic variation suggests that the observed polymorphism for SNP UCD_SNP_1084 [A/G] mutation could be a candidate quantitative trait nucleotide affecting quantitative trait loci for mealiness. The interaction and expression of affected proteins could explain the variation observed in each individual and facilitate understanding of gene regulatory networks for fruit quality traits in peach.
Genetic Code Optimization for Cotranslational Protein Folding: Codon Directional Asymmetry Correlates with Antiparallel Betasheets, tRNA Synthetase Classes.

PubMed

Seligmann, Hervé; Warthi, Ganesh

2017-01-01

A new codon property, codon directional asymmetry in nucleotide content (CDA), reveals a biologically meaningful genetic code dimension: palindromic codons (first and last nucleotides identical, codon structure XZX) are symmetric (CDA = 0), codons with structures ZXX/XXZ are 5'/3' asymmetric (CDA = - 1/1; CDA = - 0.5/0.5 if Z and X are both purines or both pyrimidines, assigning negative/positive (-/+) signs is an arbitrary convention). Negative/positive CDAs associate with (a) Fujimoto's tetrahedral codon stereo-table; (b) tRNA synthetase class I/II (aminoacylate the 2'/3' hydroxyl group of the tRNA's last ribose, respectively); and (c) high/low antiparallel (not parallel) betasheet conformation parameters. Preliminary results suggest CDA-whole organism associations (body temperature, developmental stability, lifespan). Presumably, CDA impacts spatial kinetics of codon-anticodon interactions, affecting cotranslational protein folding. Some synonymous codons have opposite CDA sign (alanine, leucine, serine, and valine), putatively explaining how synonymous mutations sometimes affect protein function. Correlations between CDA and tRNA synthetase classes are weaker than between CDA and antiparallel betasheet conformation parameters. This effect is stronger for mitochondrial genetic codes, and potentially drives mitochondrial codon-amino acid reassignments. CDA reveals information ruling nucleotide-protein relations embedded in reversed (not reverse-complement) sequences (5'-ZXX-3'/5'-XXZ-3').
Shifts in the evolutionary rate and intensity of purifying selection between two Brassica genomes revealed by analyses of orthologous transposons and relics of a whole genome triplication.

PubMed

Zhao, Meixia; Du, Jianchang; Lin, Feng; Tong, Chaobo; Yu, Jingyin; Huang, Shunmou; Wang, Xiaowu; Liu, Shengyi; Ma, Jianxin

2013-10-01

Recent sequencing of the Brassica rapa and Brassica oleracea genomes revealed extremely contrasting genomic features such as the abundance and distribution of transposable elements between the two genomes. However, whether and how these structural differentiations may have influenced the evolutionary rates of the two genomes since their split from a common ancestor are unknown. Here, we investigated and compared the rates of nucleotide substitution between two long terminal repeats (LTRs) of individual orthologous LTR-retrotransposons, the rates of synonymous and non-synonymous substitution among triplicated genes retained in both genomes from a shared whole genome triplication event, and the rates of genetic recombination estimated/deduced by the comparison of physical and genetic distances along chromosomes and ratios of solo LTRs to intact elements. Overall, LTR sequences and genic sequences showed more rapid nucleotide substitution in B. rapa than in B. oleracea. Synonymous substitution of triplicated genes retained from a shared whole genome triplication was detected at higher rates in B. rapa than in B. oleracea. Interestingly, non-synonymous substitution was observed at lower rates in the former than in the latter, indicating shifted densities of purifying selection between the two genomes. In addition to evolutionary asymmetry, orthologous genes differentially regulated and/or disrupted by transposable elements between the two genomes were also characterized. Our analyses suggest that local genomic and epigenomic features, such as recombination rates and chromatin dynamics reshaped by independent proliferation of transposable elements and elimination between the two genomes, are perhaps partially the causes and partially the outcomes of the observed inter-specific asymmetric evolution. © 2013 Purdue University The Plant Journal © 2013 John Wiley & Sons Ltd.
In-host microevolution of Aspergillus fumigatus: A phenotypic and genotypic analysis.

PubMed

Ballard, Eloise; Melchers, Willem J G; Zoll, Jan; Brown, Alistair J P; Verweij, Paul E; Warris, Adilia

2018-04-01

In order to survive, Aspergillus fumigatus must adapt to specific niche environments. Adaptation to the human host includes modifications facilitating persistent colonisation and the development of azole resistance. The aim of this study is to advance understanding of the genetic and physiological adaptation of A. fumigatus in patients during infection and treatment. Thirteen A. fumigatus strains were isolated from a single chronic granulomatous disease patient suffering from persistent and recurrent invasive aspergillosis over a period of 2 years. All strains had identical microsatellite genotypes and were considered isogenic. Whole genome comparisons identified 248 non-synonymous single nucleotide polymorphisms. These non-synonymous mutations have potential to play a role in in-host adaptation. The first 2 strains isolated were azole susceptible, whereas later isolates were itraconazole, voriconazole and/or posaconazole resistant. Growth assays in the presence and absence of various antifungal stressors highlighted minor changes in growth rate and stress resistance, with exception of one isolate showing a significant growth defect. Poor conidiation was observed in later isolates. In certain drug resistant isolates conidiation was restored in the presence of itraconazole. Differences in virulence were observed as demonstrated in a Galleria mellonella infection model. We conclude that the microevolution of A. fumigatus in this patient has driven the emergence of both Cyp51A-independent and Cyp51A-dependent, azole resistance mechanisms, and additional phenotypes that are likely to have promoted fungal persistence. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.
IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants

PubMed Central

Knecht, Carolin; Mort, Matthew; Junge, Olaf; Cooper, David N.; Krawczak, Michael

2017-01-01

Abstract The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/. PMID:28180317
Identical substitutions in magnesium chelatase paralogs result in chlorophyll deficient soybean mutants

USDA-ARS?s Scientific Manuscript database

The soybean (Glycine max (L.) Merr.) chlorophyll deficient line MinnGold is a spontaneous mutant characterized by yellow foliage. Map-based cloning and transgenic complementation revealed that the mutant phenotype is caused by a non-synonymous nucleotide substitution in the third exon of a Mg-chelat...
Integration of Structural Dynamics and Molecular Evolution via Protein Interaction Networks: A New Era in Genomic Medicine

PubMed Central

Kumar, Avishek; Butler, Brandon M.; Kumar, Sudhir; Ozkan, S. Banu

2016-01-01

Summary Sequencing technologies are revealing many new non-synonymous single nucleotide variants (nsSNVs) in each personal exome. To assess their functional impacts, comparative genomics is frequently employed to predict if they are benign or not. However, evolutionary analysis alone is insufficient, because it misdiagnoses many disease-associated nsSNVs, such as those at positions involved in protein interfaces, and because evolutionary predictions do not provide mechanistic insights into functional change or loss. Structural analyses can aid in overcoming both of these problems by incorporating conformational dynamics and allostery in nSNV diagnosis. Finally, protein-protein interaction networks using systems-level methodologies shed light onto disease etiology and pathogenesis. Bridging these network approaches with structurally resolved protein interactions and dynamics will advance genomic medicine. PMID:26684487
SNP detection in Na/K ATP-ase gene α1 subunit of bisexual and parthenogenetic Artemia strains by RFLP screening.

PubMed

Manaffar, R; Zare, S; Agh, N; Abdolahzadeh, N; Soltanian, S; Sorgeloos, P; Bossier, P; Van Stappen, G

2011-01-01

In order to find a marker for differentiating between a bisexual and a parthenogenetic Artemia strain, Exon-7 of the Na/K ATPase α(1) subunit gene was screened by RFLP technique. The results revealed a constant synonymous SNP (single nucleotide polymorphism) in digestion by the Tru1I enzyme that was consistent with these two types of Artemia. This SNP was identified as an accurate molecular marker for discrimination between bisexual and parthenogenetic Artemia. According to the Nei's genetic distance (1973), the lowest genetic distance was found between individuals from Artemia urmiana Günther 1890 and parthenogenetic populations, making the described marker the first marker to easily distinguish between these two cooccurring species. © 2010 Blackwell Publishing Ltd.
A genomic scale map of genetic diversity in Trypanosoma cruzi

PubMed Central

2012-01-01

Background Trypanosoma cruzi, the causal agent of Chagas Disease, affects more than 16 million people in Latin America. The clinical outcome of the disease results from a complex interplay between environmental factors and the genetic background of both the human host and the parasite. However, knowledge of the genetic diversity of the parasite, is currently limited to a number of highly studied loci. The availability of a number of genomes from different evolutionary lineages of T. cruzi provides an unprecedented opportunity to look at the genetic diversity of the parasite at a genomic scale. Results Using a bioinformatic strategy, we have clustered T. cruzi sequence data available in the public domain and obtained multiple sequence alignments in which one or two alleles from the reference CL-Brener were included. These data covers 4 major evolutionary lineages (DTUs): TcI, TcII, TcIII, and the hybrid TcVI. Using these set of alignments we have identified 288,957 high quality single nucleotide polymorphisms and 1,480 indels. In a reduced re-sequencing study we were able to validate ~ 97% of high-quality SNPs identified in 47 loci. Analysis of how these changes affect encoded protein products showed a 0.77 ratio of synonymous to non-synonymous changes in the T. cruzi genome. We observed 113 changes that introduce or remove a stop codon, some causing significant functional changes, and a number of tri-allelic and tetra-allelic SNPs that could be exploited in strain typing assays. Based on an analysis of the observed nucleotide diversity we show that the T. cruzi genome contains a core set of genes that are under apparent purifying selection. Interestingly, orthologs of known druggable targets show statistically significant lower nucleotide diversity values. Conclusions This study provides the first look at the genetic diversity of T. cruzi at a genomic scale. The analysis covers an estimated ~ 60% of the genetic diversity present in the population, providing an essential resource for future studies on the development of new drugs and diagnostics, for Chagas Disease. These data is available through the TcSNP database (http://snps.tcruzi.org). PMID:23270511
Standardization of PCR-RFLP analysis of nsSNP rs1468384 of NPC1L1 gene

PubMed Central

Balgir, Praveen P.; Khanna, Divya; Kaur, Gurlovleen

2008-01-01

Niemann-Pick C1-like 1 (NPC1L1) protein, a newly identified sterol influx transporter, located at the apical membrane of the enterocyte, which may actively facilitate the uptake of cholesterol by promoting the passage of sterols across the brush border membrane of the enterocyte. It effects intestinal cholesterol absorption and intracellular transport and as such is an integral part of complex process of cholesterol homeostasis. The study of population data for the distribution of these single nucleotide polymorphisms (SNP) of NPC1L1 has lead to the identification of six non-synonymous single nucleotide polymorphisms (nsSNP). The in vitro analysis using the software MuPro and StructureSNP shows that nsSNP M510I (rs1468384), which involves A→G base pair change leads to decrease in the stability of the protein. A reproducible and a cost-effective PCR-RFLP based assay was developed to screen for the SNP among population data. This SNP has been studied in Caucasian, Asian, and African American populations. Till date, no data is available on Indian population. The distribution of M510I NPC1L1 genotype was estimated in the North Western Indian Population as a test case. The allele distribution in Indian Population differs significantly from that of other populations. The methodology thus proved to be robust enough to bring out these differences. PMID:20300301
Transcriptome-wide single nucleotide polymorphisms (SNPs) for abalone (Haliotis midae): validation and application using GoldenGate medium-throughput genotyping assays.

PubMed

Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay

2013-09-23

Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.
Screening for susceptibility genes in hereditary non-polyposis colorectal cancer.

PubMed

Yu, Li; Yin, Bo; Qu, Kaiying; Li, Jingjing; Jin, Qiao; Liu, Ling; Liu, Chunlan; Zhu, Yuxing; Wang, Qi; Peng, Xiaowei; Zhou, Jianda; Cao, Peiguo; Cao, Ke

2018-06-01

In the present study, hereditary non-polyposis colorectal cancer (HNPCC) susceptibility genes were screened for using whole exome sequencing in 3 HNPCC patients from 1 family and using single nucleotide polymorphism (SNP) genotyping assays in 96 other colorectal cancer and control samples. Peripheral blood was obtained from 3 HNPCC patients from 1 family; the proband and the proband's brother and cousin. High-throughput sequencing was performed using whole exome capture technology. Sequences were aligned against the HAPMAP, dbSNP130 and 1,000 Genome Project databases. Reported common variations and synonymous mutations were filtered out. Non-synonymous single nucleotide variants in the 3 HNPCC patients were integrated and the candidate genes were identified. Finally, SNP genotyping was performed for the genes in 96 peripheral blood samples. In total, 60.4 Gb of data was retrieved from the 3 HNPCC patients using whole exome capture technology. Subsequently, according to certain screening criteria, 15 candidate genes were identified. Among the 96 samples that had been SNP genotyped, 92 were successfully genotyped for 15 gene loci, while genotyping for HTRA1 failed in 4 sporadic colorectal cancer patient samples. In 12 control subjects and 81 sporadic colorectal cancer patients, genotypes at 13 loci were wild-type, namely DDX20, ZFYVE26, PIK3R3, SLC26A8, ZEB2, TP53INP1, SLC11A1, LRBA, CEBPZ, ETAA1, SEMA3G, IFRD2 and FAT1 . The CEP290 genotype was mutant in 1 sporadic colorectal cancer patient and was wild-type in all other subjects. A total of 5 of the 12 control subjects and 30 of the 81 sporadic colorectal cancer patients had a mutant HTRA1 genotype. In all 3 HNPCC patients, the same mutant genotypes were identified at all 15 gene loci. Overall, 13 potential susceptibility genes for HNPCC were identified, namely DDX20, ZFYVE26, PIK3R3, SLC26A8, ZEB2, TP53INP1, SLC11A1, LRBA, CEBPZ, ETAA1, SEMA3G, IFRD2 and FAT1 .
Non-Synonymous Single-Nucleotide Polymorphisms and Physical Activity Interactions on Adiposity Parameters in Malaysian Adolescents.

PubMed

Zaharan, Nur Lisa; Muhamad, Nor Hanisah; Jalaludin, Muhammad Yazid; Su, Tin Tin; Mohamed, Zahurin; Mohamed, M N A; A Majid, Hazreen

2018-01-01

Several non-synonymous single-nucleotide polymorphisms (nsSNPs) have been shown to be associated with obesity. Little is known about their associations and interactions with physical activity (PA) in relation to adiposity parameters among adolescents in Malaysia. We examined whether (a) PA and (b) selected nsSNPs are associated with adiposity parameters and whether PA interacts with these nsSNPs on these outcomes in adolescents from the Malaysian Health and Adolescents Longitudinal Research Team study ( n = 1,151). Body mass indices, waist-hip ratio, and percentage body fat (% BF) were obtained. PA was assessed using Physical Activity Questionnaire for Older Children (PAQ-C). Five nsSNPs were included: beta-3 adrenergic receptor (ADRB3) rs4994, FABP2 rs1799883, GHRL rs696217, MC3R rs3827103, and vitamin D receptor rs2228570, individually and as combined genetic risk score (GRS). Associations and interactions between nsSNPs and PAQ-C scores were examined using generalized linear model. PAQ-C scores were associated with % BF (β = -0.44 [95% confidence interval -0.72, -0.16], p = 0.002). The CC genotype of ADRB3 rs4994 (β = -0.16 [-0.28, -0.05], corrected p = 0.01) and AA genotype of MC3R rs3827103 (β = -0.06 [-0.12, -0.00], p = 0.02) were significantly associated with % BF compared to TT and GG genotypes, respectively. Significant interactions with PA were found between ADRB3 rs4994 (β = -0.05 [-0.10, -0.01], p = 0.02) and combined GRS (β = -0.03 [-0.04, -0.01], p = 0.01) for % BF. Higher PA score was associated with reduced % BF in Malaysian adolescents. Of the nsSNPs, ADRB3 rs4994 and MC3R rs3827103 were associated with % BF. Significant interactions with PA were found for ADRB3 rs4994 and combined GRS on % BF but not on measurements of weight or circumferences. Targeting body fat represent prospects for molecular studies and lifestyle intervention in this population.
A second generation human haplotype map of over 3.1 million SNPs.

PubMed

Frazer, Kelly A; Ballinger, Dennis G; Cox, David R; Hinds, David A; Stuve, Laura L; Gibbs, Richard A; Belmont, John W; Boudreau, Andrew; Hardenbol, Paul; Leal, Suzanne M; Pasternak, Shiran; Wheeler, David A; Willis, Thomas D; Yu, Fuli; Yang, Huanming; Zeng, Changqing; Gao, Yang; Hu, Haoran; Hu, Weitao; Li, Chaohua; Lin, Wei; Liu, Siqi; Pan, Hao; Tang, Xiaoli; Wang, Jian; Wang, Wei; Yu, Jun; Zhang, Bo; Zhang, Qingrun; Zhao, Hongbin; Zhao, Hui; Zhou, Jun; Gabriel, Stacey B; Barry, Rachel; Blumenstiel, Brendan; Camargo, Amy; Defelice, Matthew; Faggart, Maura; Goyette, Mary; Gupta, Supriya; Moore, Jamie; Nguyen, Huy; Onofrio, Robert C; Parkin, Melissa; Roy, Jessica; Stahl, Erich; Winchester, Ellen; Ziaugra, Liuda; Altshuler, David; Shen, Yan; Yao, Zhijian; Huang, Wei; Chu, Xun; He, Yungang; Jin, Li; Liu, Yangfan; Shen, Yayun; Sun, Weiwei; Wang, Haifeng; Wang, Yi; Wang, Ying; Xiong, Xiaoyan; Xu, Liang; Waye, Mary M Y; Tsui, Stephen K W; Xue, Hong; Wong, J Tze-Fei; Galver, Luana M; Fan, Jian-Bing; Gunderson, Kevin; Murray, Sarah S; Oliphant, Arnold R; Chee, Mark S; Montpetit, Alexandre; Chagnon, Fanny; Ferretti, Vincent; Leboeuf, Martin; Olivier, Jean-François; Phillips, Michael S; Roumy, Stéphanie; Sallée, Clémentine; Verner, Andrei; Hudson, Thomas J; Kwok, Pui-Yan; Cai, Dongmei; Koboldt, Daniel C; Miller, Raymond D; Pawlikowska, Ludmila; Taillon-Miller, Patricia; Xiao, Ming; Tsui, Lap-Chee; Mak, William; Song, You Qiang; Tam, Paul K H; Nakamura, Yusuke; Kawaguchi, Takahisa; Kitamoto, Takuya; Morizono, Takashi; Nagashima, Atsushi; Ohnishi, Yozo; Sekine, Akihiro; Tanaka, Toshihiro; Tsunoda, Tatsuhiko; Deloukas, Panos; Bird, Christine P; Delgado, Marcos; Dermitzakis, Emmanouil T; Gwilliam, Rhian; Hunt, Sarah; Morrison, Jonathan; Powell, Don; Stranger, Barbara E; Whittaker, Pamela; Bentley, David R; Daly, Mark J; de Bakker, Paul I W; Barrett, Jeff; Chretien, Yves R; Maller, Julian; McCarroll, Steve; Patterson, Nick; Pe'er, Itsik; Price, Alkes; Purcell, Shaun; Richter, Daniel J; Sabeti, Pardis; Saxena, Richa; Schaffner, Stephen F; Sham, Pak C; Varilly, Patrick; Altshuler, David; Stein, Lincoln D; Krishnan, Lalitha; Smith, Albert Vernon; Tello-Ruiz, Marcela K; Thorisson, Gudmundur A; Chakravarti, Aravinda; Chen, Peter E; Cutler, David J; Kashuk, Carl S; Lin, Shin; Abecasis, Gonçalo R; Guan, Weihua; Li, Yun; Munro, Heather M; Qin, Zhaohui Steve; Thomas, Daryl J; McVean, Gilean; Auton, Adam; Bottolo, Leonardo; Cardin, Niall; Eyheramendy, Susana; Freeman, Colin; Marchini, Jonathan; Myers, Simon; Spencer, Chris; Stephens, Matthew; Donnelly, Peter; Cardon, Lon R; Clarke, Geraldine; Evans, David M; Morris, Andrew P; Weir, Bruce S; Tsunoda, Tatsuhiko; Mullikin, James C; Sherry, Stephen T; Feolo, Michael; Skol, Andrew; Zhang, Houcan; Zeng, Changqing; Zhao, Hui; Matsuda, Ichiro; Fukushima, Yoshimitsu; Macer, Darryl R; Suda, Eiko; Rotimi, Charles N; Adebamowo, Clement A; Ajayi, Ike; Aniagwu, Toyin; Marshall, Patricia A; Nkwodimmah, Chibuzor; Royal, Charmaine D M; Leppert, Mark F; Dixon, Missy; Peiffer, Andy; Qiu, Renzong; Kent, Alastair; Kato, Kazuto; Niikawa, Norio; Adewole, Isaac F; Knoppers, Bartha M; Foster, Morris W; Clayton, Ellen Wright; Watkin, Jessica; Gibbs, Richard A; Belmont, John W; Muzny, Donna; Nazareth, Lynne; Sodergren, Erica; Weinstock, George M; Wheeler, David A; Yakub, Imtaz; Gabriel, Stacey B; Onofrio, Robert C; Richter, Daniel J; Ziaugra, Liuda; Birren, Bruce W; Daly, Mark J; Altshuler, David; Wilson, Richard K; Fulton, Lucinda L; Rogers, Jane; Burton, John; Carter, Nigel P; Clee, Christopher M; Griffiths, Mark; Jones, Matthew C; McLay, Kirsten; Plumb, Robert W; Ross, Mark T; Sims, Sarah K; Willey, David L; Chen, Zhu; Han, Hua; Kang, Le; Godbout, Martin; Wallenburg, John C; L'Archevêque, Paul; Bellemare, Guy; Saeki, Koji; Wang, Hongguang; An, Daochang; Fu, Hongbo; Li, Qing; Wang, Zhen; Wang, Renwu; Holden, Arthur L; Brooks, Lisa D; McEwen, Jean E; Guyer, Mark S; Wang, Vivian Ota; Peterson, Jane L; Shi, Michael; Spiegel, Jack; Sung, Lawrence M; Zacharia, Lynn F; Collins, Francis S; Kennedy, Karen; Jamieson, Ruth; Stewart, John

2007-10-18

We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.
VarMod: modelling the functional effects of non-synonymous variants.

PubMed

Pappalardo, Morena; Wass, Mark N

2014-07-01

Unravelling the genotype-phenotype relationship in humans remains a challenging task in genomics studies. Recent advances in sequencing technologies mean there are now thousands of sequenced human genomes, revealing millions of single nucleotide variants (SNVs). For non-synonymous SNVs present in proteins the difficulties of the problem lie in first identifying those nsSNVs that result in a functional change in the protein among the many non-functional variants and in turn linking this functional change to phenotype. Here we present VarMod (Variant Modeller) a method that utilises both protein sequence and structural features to predict nsSNVs that alter protein function. VarMod develops recent observations that functional nsSNVs are enriched at protein-protein interfaces and protein-ligand binding sites and uses these characteristics to make predictions. In benchmarking on a set of nearly 3000 nsSNVs VarMod performance is comparable to an existing state of the art method. The VarMod web server provides extensive resources to investigate the sequence and structural features associated with the predictions including visualisation of protein models and complexes via an interactive JSmol molecular viewer. VarMod is available for use at http://www.wasslab.org/varmod. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Alpha S1-casein polymorphisms in camel (Camelus dromedarius) and descriptions of biological active peptides and allergenic epitopes.

PubMed

Erhardt, Georg; Shuiep, El Tahir Salih; Lisson, Maria; Weimann, Christina; Wang, Zhaoxin; El Zubeir, Ibtisam El Yas Mohamed; Pauciullo, Alfredo

2016-06-01

Milk samples of 193 camels (Camelus dromedarius) from different regions of Sudan were screened for casein variability by isoelectric focusing. Kappa-casein and beta-casein were monomorphic, whereas three protein patterns named αs1-casein A, C, and D were identified. The major allele A revealed frequencies of 0.79 (Lahaoi), 0.75 (Shanbali), 0.90 (Arabi Khali), and 0.88 (Arabi Gharbawi) in the different ecotypes. CSN1S1*C shows a single G > T nucleotide substitution in the exon 5, leading to a non-synonymous amino acid exchange (p.Glu30 > Asp30) in comparison to CSN1S1*A and D. At cDNA level, no further single nucleotide polymorphisms could be identified in CSN1S1* A, C, and D, whereas the variants CSN1S1*A and CSN1S1*C are characterized by missing of exon 18 compared to the already described CSN1S1*B, as consequence of DNA insertion of 11 bp at intron 17 which alter the pre-mRNA spliceosome machinery. A polymerase chain-restriction fragment length polymorphism method (PCR-RFLP) was established to type for G > T nucleotide substitution at genomic DNA level. The occurrence and differences of IgE-binding epitopes and bioactive peptides between αs1-casein A, C, and D after digestion were analyzed in silico. The amino acid substitutions and deletion affected the arising peptide pattern and thus modifications between IgE-binding epitopes and bioactive peptides of the variants were found. The allergenic potential of these different peptides will be investigated by microarray immunoassay using sera from milk-sensitized individuals, as it was already demonstrated for bovine αs1-casein variants.
Measles virus genetic evolution throughout an imported epidemic outbreak in a highly vaccinated population.

PubMed

Muñoz-Alía, Miguel Ángel; Fernández-Muñoz, Rafael; Casasnovas, José María; Porras-Mansilla, Rebeca; Serrano-Pardo, Ángela; Pagán, Israel; Ordobás, María; Ramírez, Rosa; Celma, María Luisa

2015-01-22

Measles virus circulates endemically in African and Asian large urban populations, causing outbreaks worldwide in populations with up-to-95% immune protection. We studied the natural genetic variability of genotype B3.1 in a population with 95% vaccine coverage throughout an imported six month measles outbreak. From first pass viral isolates of 47 patients we performed direct sequencing of genomic cDNA. Whilst no variation from index case sequence occurred in the Nucleocapsid gene hyper-variable carboxy end, in the Hemagglutinin gene, main target for neutralizing antibodies, we observed gradual nucleotide divergence from index case along the outbreak (0% to 0.380%, average 0.138%) with the emergence of transient and persistent non-synonymous and synonymous mutations. Little or no variation was observed between the index and last outbreak cases in Phosphoprotein, Nucleocapsid, Matrix and Fusion genes. Most of the H non-synonymous mutations were mapped on the protein surface near antigenic and receptors binding sites. We estimated a MV-Hemagglutinin nucleotide substitution rate of 7.28 × 10-6 substitutions/site/day by a Bayesian phylogenetic analysis. The dN/dS analysis did not suggest significant immune or other selective pressures on the H gene during the outbreak. These results emphasize the usefulness of MV-H sequence analysis in measles epidemiological surveillance and elimination programs, and in detection of potentially emergence of measles virus neutralization-resistant mutants. Copyright © 2014 Elsevier B.V. All rights reserved.
Effects of the BDNF Val66Met Polymorphism on Anxiety-Like Behavior Following Nicotine Withdrawal in Mice.

PubMed

Lee, Bridgin G; Anastasia, Agustin; Hempstead, Barbara L; Lee, Francis S; Blendy, Julie A

2015-12-01

Nicotine withdrawal is characterized by both affective and cognitive symptoms. Identifying genetic polymorphisms that could affect the symptoms associated with nicotine withdrawal are important in predicting withdrawal sensitivity and identifying personalized cessation therapies. In the current study we used a mouse model of a non-synonymous single nucleotide polymorphism in the translated region of the brain-derived neurotrophic factor (BDNF) gene that substitutes a valine (Val) for a methionine (Met) amino acid (Val66Met) to examine the relationship between the Val66Met single nucleotide polymorphism and nicotine dependence. This study measured proBDNF and the BDNF prodomain levels following nicotine and nicotine withdrawal and examined a mouse model of a common polymorphism in this protein (BDNF(Met/Met)) in three behavioral paradigms: novelty-induced hypophagia, marble burying, and the open-field test. Using the BDNF knock-in mouse containing the BDNF Val66Met polymorphism we found: (1) blunted anxiety-like behavior in BDNF(Met/Met) mice following withdrawal in three behavioral paradigms: novelty-induced hypophagia, marble burying, and the open-field test; (2) the anxiolytic effects of chronic nicotine are absent in BDNF(Met/Met) mice; and (3) an increase in BDNF prodomain in BDNF(Met/Met) mice following nicotine withdrawal. Our study is the first to examine the effect of the BDNF Val66Met polymorphism on the affective symptoms of withdrawal from nicotine in mice. In these mice, a single-nucleotide polymorphism in the translated region of the BDNF gene can result in a blunted withdrawal, as measured by decreased anxiety-like behavior. The significant increase in the BDNF prodomain in BDNF(Met/Met) mice following nicotine cessation suggests a possible role of this ligand in the circuitry remodeling after withdrawal. © The Author 2015. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Genome-Wide SNP Genotyping to Infer the Effects on Gene Functions in Tomato

PubMed Central

Hirakawa, Hideki; Shirasawa, Kenta; Ohyama, Akio; Fukuoka, Hiroyuki; Aoki, Koh; Rothan, Christophe; Sato, Shusei; Isobe, Sachiko; Tabata, Satoshi

2013-01-01

The genotype data of 7054 single nucleotide polymorphism (SNP) loci in 40 tomato lines, including inbred lines, F1 hybrids, and wild relatives, were collected using Illumina's Infinium and GoldenGate assay platforms, the latter of which was utilized in our previous study. The dendrogram based on the genotype data corresponded well to the breeding types of tomato and wild relatives. The SNPs were classified into six categories according to their positions in the genes predicted on the tomato genome sequence. The genes with SNPs were annotated by homology searches against the nucleotide and protein databases, as well as by domain searches, and they were classified into the functional categories defined by the NCBI's eukaryotic orthologous groups (KOG). To infer the SNPs' effects on the gene functions, the three-dimensional structures of the 843 proteins that were encoded by the genes with SNPs causing missense mutations were constructed by homology modelling, and 200 of these proteins were considered to carry non-synonymous amino acid substitutions in the predicted functional sites. The SNP information obtained in this study is available at the Kazusa Tomato Genomics Database (http://plant1.kazusa.or.jp/tomato/). PMID:23482505
Integration of structural dynamics and molecular evolution via protein interaction networks: a new era in genomic medicine.

PubMed

Kumar, Avishek; Butler, Brandon M; Kumar, Sudhir; Ozkan, S Banu

2015-12-01

Sequencing technologies are revealing many new non-synonymous single nucleotide variants (nsSNVs) in each personal exome. To assess their functional impacts, comparative genomics is frequently employed to predict if they are benign or not. However, evolutionary analysis alone is insufficient, because it misdiagnoses many disease-associated nsSNVs, such as those at positions involved in protein interfaces, and because evolutionary predictions do not provide mechanistic insights into functional change or loss. Structural analyses can aid in overcoming both of these problems by incorporating conformational dynamics and allostery in nSNV diagnosis. Finally, protein-protein interaction networks using systems-level methodologies shed light onto disease etiology and pathogenesis. Bridging these network approaches with structurally resolved protein interactions and dynamics will advance genomic medicine. Copyright © 2015 Elsevier Ltd. All rights reserved.
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.

PubMed

Karniychuk, Uladzimir U

2016-09-02

Understanding the codon usage pattern of a pathogen and relationship between pathogen and host's codon usage patterns has fundamental and applied interests. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged, and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and human host have a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translation selection. Computation of codon adaptation indices allowed to estimate expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Slow but not low: genomic comparisons reveal slower evolutionary rate and higher dN/dS in conifers compared to angiosperms.

PubMed

Buschiazzo, Emmanuel; Ritland, Carol; Bohlmann, Jörg; Ritland, Kermit

2012-01-20

Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set. Based upon pairwise comparisons of 3,723 spruce and pine orthologues, we found an average synonymous genetic distance (dS) of 0.191, and an average dN/dS ratio of 0.314. Using a fossil-established divergence time of 140 million years between spruce and pine, we extrapolated a nucleotide substitution rate of 0.68 × 10(-9) synonymous substitutions per site per year. When compared to angiosperms, this indicates a dramatically slower rate of nucleotide substitution rates in conifers: on average 15-fold. Coincidentally, we found a three-fold higher dN/dS for the spruce-pine lineage compared to the poplar-Arabidopsis lineage. This joint occurrence of a slower evolutionary rate in conifers with higher dN/dS, and possibly positive selection, showcases the uniqueness of conifer genome evolution. Our results are in line with documented reduced nucleotide diversity, conservative genome evolution and low rates of diversification in conifers on the one hand and numerous examples of local adaptation in conifers on the other hand. We propose that reduced levels of nucleotide mutation in large and long-lived conifer trees, coupled with large effective population size, were the main factors leading to slow substitution rates but retention of beneficial mutations.
LISTA, LISTA-HOP and LISTA-HON: a comprehensive compilation of protein encoding sequences and its associated homology databases from the yeast Saccharomyces.

PubMed Central

Dölz, R; Mossé, M O; Slonimski, P P; Bairoch, A; Linder, P

1996-01-01

We continued our effort to make a comprehensive database (LISTA) for the yeast Saccharomyces cerevisiae. As in previous editions the genetic names are consistently associated to each sequence with a known and confirmed ORF. If necessary, synonyms are given in the case of allelic duplicated sequences. Although the first publication of a sequence gives-according to our rules-the genetic name of a gene, in some instances more commonly used names are given to avoid nomenclature problems and the use of ancient designations which are no longer used. In these cases the old designation is given as synonym. Thus sequences can be found either by the name or by synonyms given in LISTA. Each entry contains the genetic name, the mnemonic from the EMBL data bank, the codon bias, reference of the publication of the sequence, Chromosomal location as far as known, SWISSPROT and EMBL accession numbers. New entries will also contain the name from the systematic sequencing efforts. Since the release of LISTA4.1 we update the database continuously. To obtain more information on the included sequences, each entry has been screened against non-redundant nucleotide and protein data bank collections resulting in LISTA-HON and LISTA-HOP. This release includes reports from full Smith and Watermann peptide-level searches against a non-redundant protein sequence database. The LISTA data base can be linked to the associated data sets or to nucleotide and protein banks by the Sequence Retrieval System (SRS). The database is available by FTP and on World Wide Web. PMID:8594599
Molecular detection and analysis of a novel metalloprotease gene of entomopathogenic Serratia marcescens strains in infected Galleria mellonella.

PubMed

Tambong, J T; Xu, R; Sadiku, A; Chen, Q; Badiss, A; Yu, Q

2014-04-01

Serratia marcescens strains isolated from entomopathogenic nematodes (Rhabditis sp.) were examined for their pathogenicity and establishment in wax moth (Galleria mellonella) larvae. All the Serratia strains were potently pathogenic to G. mellonella larvae, leading to death within 48 h. The strains were shown to possess a metalloprotease gene encoding for a novel serralysin-like protein. Rapid establishment of the bacteria in infected larvae was confirmed by specific polymerase chain reaction (PCR) detection of a DNA fragment encoding for this protein. Detection of the viable Serratia strains in infected larvae was validated using the SYBR Green reverse transcriptase real-time PCR assay targeting the metalloprotease gene. Nucleotide sequences of the metalloprotease gene obtained in our study showed 72 single nucleotide polymorphisms (SNP) and 3 insertions compared with the metalloprotease gene of S. marcescens E-15. The metalloprotease gene had 60 synonymous and 8 nonsynonymous substitutions relative to the closest GenBank entry, S. marcescens E-15. A comparison of the amino acid composition of the new serralysin-like protein with that of the serralysin protein of S. marcescens E-15 revealed differences at 11 positions and a new aspartic acid residue. Analysis of the effect of protein variation suggests that a new aspartic acid residue resulting from nonsynonymous nucleotide mutations in the protein structure could have the most significant effect on its biological function. The new metalloprotease gene and (or) its product could have applications in plant agricultural biotechnology.
Single Nucleotide Variations of the Human GR Gene Manifested as Pathologic Mutations or Polymorphisms.

PubMed

Kino, Tomoshige

2018-05-11

The human genome contains numerous single nucleotide variations (SNVs), and the human GR gene harbors ∼450 of these genetic changes. Among them, extremely rare non-synonymous variants known as pathologic GR gene mutations develop a characteristic pathologic condition, familial/sporadic generalized glucocorticoid resistance syndrome, by replacing the amino acids critical for GR protein structure and functions, whereas others known as pathologic polymorphisms develop mild manifestations recognized mainly at population bases by changing the GR activities slightly. Recent progress on the structural analysis to the GR protein and subsequent computer-based structural simulation revealed details of the molecular defects caused by such pathologic GR gene mutations, including their impact on the receptor interaction to ligands, nuclear receptor coactivators (NCoAs) or DNA glucocorticoid response elements (GREs). Indeed, those found in the GR ligand-binding domain significantly damage protein structure of the ligand-binding pocket and/or the activation function-2 transactivation domain and change their molecular interaction to glucocorticoids or the LxxLL signature motif of NCoAs. Two mutations found in GR DBD also affect interaction of the mutant receptors to GRE DNA by affecting the critical amino acid for the interaction or changing local hydrophobic circumstance. In this review, we discuss recent findings on the structural simulation of the pathologic GR mutants in connection to their functional and clinical impacts along with brief explanation to recent research achievement on the GR polymorphisms.
Donor single nucleotide polymorphism in the CCR9 gene affects the incidence of skin GVHD.

PubMed

Inamoto, Y; Murata, M; Katsumi, A; Kuwatsuka, Y; Tsujimura, A; Ishikawa, Y; Sugimoto, K; Onizuka, M; Terakura, S; Nishida, T; Kanie, T; Taji, H; Iida, H; Suzuki, R; Abe, A; Kiyoi, H; Matsushita, T; Miyamura, K; Kodera, Y; Naoe, T

2010-02-01

The interactions between chemokines and their receptors may have an important role in initiating GVHD after allogeneic hematopoietic SCT (allo-HSCT). CCL25 and CCR9 are unique because they are exclusively expressed in epithelial cells and in Peyer's patches of the small intestine. We focused on rs12721497 (G926A), one of the non-synonymous single nucleotide polymorphisms (SNPs) in the CCR9 gene, and analyzed the SNP of donors in 167 consecutive patients who received allo-HSCT from an HLA-identical sibling donor. Genotypes were tested for associations with acute and chronic GVHD in each organ and transplant outcome. Multivariate analyses showed that the genotype 926AG was significantly associated with the incidence of acute stage > or =2 skin GVHD (hazard ratio: 3.2; 95% confidence interval (95% CI): 1.1-9.1; P=0.032) and chronic skin GVHD (hazard ratio: 4.1; 95% CI: 1.1-15; P=0.036), but not with GVHD in other organs or with relapse, non-relapse mortality or OS. To clarify the functional differences between genotypes, each SNP in retroviral vectors was transfected into Jurkat cells. In chemotaxis assays, the 926G transfectant showed greater response to CCL25 than the 926A transfectant. In conclusion, more active homing of CCR9-926AG T cells to Peyer's patches may produce changes in Ag presentation and result in increased incidence of skin GVHD.
Non-synonymous FGD3 Variant as Positional Candidate for Disproportional Tall Stature Accounting for a Carcass Weight QTL (CW-3) and Skeletal Dysplasia in Japanese Black Cattle

PubMed Central

Takasuga, Akiko; Sato, Kunio; Nakamura, Ryouichi; Saito, Yosuke; Sasaki, Shinji; Tsuji, Takehito; Suzuki, Akio; Kobayashi, Hiroshi; Matsuhashi, Tamako; Setoguchi, Koji; Okabe, Hiroshi; Ootsubo, Toshitake; Tabuchi, Ichiro; Fujita, Tatsuo; Watanabe, Naoto; Hirano, Takashi; Nishimura, Shota; Watanabe, Toshio; Hayakawa, Makio; Sugimoto, Yoshikazu; Kojima, Takatoshi

2015-01-01

Recessive skeletal dysplasia, characterized by joint- and/or hip bone-enlargement, was mapped within the critical region for a major quantitative trait locus (QTL) influencing carcass weight; previously named CW-3 in Japanese Black cattle. The risk allele was on the same chromosome as the Q allele that increases carcass weight. Phenotypic characterization revealed that the risk allele causes disproportional tall stature and bone size that increases carcass weight in heterozygous individuals but causes disproportionately narrow chest width in homozygotes. A non-synonymous variant of FGD3 was identified as a positional candidate quantitative trait nucleotide (QTN) and the corresponding mutant protein showed reduced activity as a guanine nucleotide exchange factor for Cdc42. FGD3 is expressed in the growth plate cartilage of femurs from bovine and mouse. Thus, loss of FDG3 activity may lead to subsequent loss of Cdc42 function. This would be consistent with the columnar disorganization of proliferating chondrocytes in chondrocyte-specific inactivated Cdc42 mutant mice. This is the first report showing association of FGD3 with skeletal dysplasia. PMID:26306008
Evolution of DMY, a newly emergent male sex-determination gene of medaka fish.

PubMed

Zhang, Jianzhi

2004-04-01

The Japanese medaka fish Oryzias latipes has an XX/XY sex-determination system. The Y-linked sex-determination gene DMY is a duplicate of the autosomal gene DMRT1, which encodes a DM-domain-containing transcriptional factor. DMY appears to have originated recently within Oryzias, allowing a detailed evolutionary study of the initial steps that led to the new gene and new sex-determination system. Here I analyze the publicly available DMRT1 and DMY gene sequences of Oryzias species and report the following findings. First, the synonymous substitution rate in DMY is 1.73 times that in DMRT1, consistent with the male-driven evolution hypothesis. Second, the ratio of the rate of nonsynonymous nucleotide substitution (d(N)) to that of synonymous substitution (d(S)) is significantly higher in DMY than in DMRT1. Third, in DMRT1, the d(N)/d(S) ratio for the DM domain is lower than that for non-DM regions, as expected from the functional importance of the DM domain. But in DMY, the opposite is observed and the DM domain is likely under positive Darwinian selection. Fourth, only one characteristic amino acid distinguishes all DMY sequences from all DMRT1 sequences, suggesting that a single amino acid change may be largely responsible for the establishment of DMY as the male sex-determination gene in medaka fish.
DoGSD: the dog and wolf genome SNP database.

PubMed

Bai, Bing; Zhao, Wen-Ming; Tang, Bi-Xia; Wang, Yan-Qing; Wang, Lu; Zhang, Zhang; Yang, He-Chuan; Liu, Yan-Hu; Zhu, Jun-Wei; Irwin, David M; Wang, Guo-Dong; Zhang, Ya-Ping

2015-01-01

The rapid advancement of next-generation sequencing technology has generated a deluge of genomic data from domesticated dogs and their wild ancestor, grey wolves, which have simultaneously broadened our understanding of domestication and diseases that are shared by humans and dogs. To address the scarcity of single nucleotide polymorphism (SNP) data provided by authorized databases and to make SNP data more easily/friendly usable and available, we propose DoGSD (http://dogsd.big.ac.cn), the first canidae-specific database which focuses on whole genome SNP data from domesticated dogs and grey wolves. The DoGSD is a web-based, open-access resource comprising ∼ 19 million high-quality whole-genome SNPs. In addition to the dbSNP data set (build 139), DoGSD incorporates a comprehensive collection of SNPs from two newly sequenced samples (1 wolf and 1 dog) and collected SNPs from three latest dog/wolf genetic studies (7 wolves and 68 dogs), which were taken together for analysis with the population genetic statistics, Fst. In addition, DoGSD integrates some closely related information including SNP annotation, summary lists of SNPs located in genes, synonymous and non-synonymous SNPs, sampling location and breed information. All these features make DoGSD a useful resource for in-depth analysis in dog-/wolf-related studies. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Study of five novel non-synonymous polymorphisms in human brain-expressed genes in a Colombian sample.

PubMed

Ojeda, Diego A; Forero, Diego A

2014-10-01

Non-synonymous single nucleotide polymorphisms (nsSNPs) in brain-expressed genes represent interesting candidates for genetic research in neuropsychiatric disorders. To study novel nsSNPs in brain-expressed genes in a sample of Colombian subjects. We applied an approach based on in silico mining of available genomic data to identify and select novel nsSNPs in brain-expressed genes. We developed novel genotyping assays, based in allele-specific PCR methods, for these nsSNPs and genotyped them in 171 Colombian subjects. Five common nsSNPs (rs6855837; p.Leu395Ile, rs2305160; p.Thr394Ala, rs10503929; p.Met289Thr, rs2270641; p.Thr4Pro and rs3822659; p.Ser735Ala) were studied, located in the CLOCK, NPAS2, NRG1, SLC18A1 and WWC1 genes. We reported allele and genotype frequencies in a sample of South American healthy subjects. There is previous experimental evidence, arising from genome-wide expression and association studies, for the involvement of these genes in several neuropsychiatric disorders and endophenotypes, such as schizophrenia, mood disorders or memory performance. Frequencies for these nsSNPSs in the Colombian samples varied in comparison to different HapMap populations. Future study of these nsSNPs in brain-expressed genes, a synaptogenomics approach, will be important for a better understanding of neuropsychiatric diseases and endophenotypes in different populations.
Identification of an alternative knockdown resistance (kdr)-like mutation, M918L, and a novel mutation, V1010A, in the Thrips tabaci voltage-gated sodium channel gene.

PubMed

Wu, Meixiang; Gotoh, Hiroki; Waters, Timothy; Walsh, Douglas B; Lavine, Laura Corley

2014-06-01

Knockdown resistance (kdr) has been identified as a main mechanism against pyrethroid insecticides in many arthropod pests including in the onion thrips, Thrips tabaci. To characterize and identify pyrethroid-resistance in onion thrips in Washington state, we conducted insecticide bioassays and sequenced a region of the voltage gated sodium channel gene from several different T. tabaci populations. Field collected Thrips tabaci were found to have large variations in resistance to the pyrethroid insecticide lambda-cyhalothrin. We identified two single nucleotide substitutions in our analysis of a partial sequence of the T. tabaci voltage-gated sodium channel gene. One mutation resulted in the non-synonymous substitution of methionine with leucine (M918L), which is well known to be responsible for super knockdown resistance in some pest species. Another non-synonymous substitution, a valine (GTT) to alanine (GCT) replacement at amino acid 1010 (V1010A) was identified in our study and was associated with lambda-cyhalothrin resistance. We have characterized a known kdr mutation and identified a novel mutation in the voltage-gated sodium channel gene of Thrips tabaci associated with resistance to lambda-cyhalothrin. This gene region and these mutations are expected to be useful in the development of a diagnostic test to detect kdr resistance in many onion thrips populations. © 2013 Society of Chemical Industry.
De novo gene mutations highlight patterns of genetic and neural complexity in schizophrenia

PubMed Central

Xu, Bin; Ionita-Laza, Iuliana; Roos, J. Louw; Boone, Braden; Woodrick, Scarlet; Sun, Yan; Levy, Shawn; Gogos, Joseph A.; Karayiorgou, Maria

2013-01-01

To evaluate evidence for de novo etiologies in schizophrenia, we sequenced at high coverage the exomes of families recruited from two populations with distinct demographic structure and history. We sequenced a total of 795 exomes from 231 parent-proband trios enriched for sporadic schizophrenia cases, as well as 34 unaffected trios. We observed in cases an excess of non-synonymous single nucleotide variants as well as a higher prevalence of gene-disruptive de novo mutations. We found four genes (LAMA2, DPYD, TRRAP and VPS39) affected by recurrent de novo events within or across the two populations, a finding unlikely to have occurred by chance. We show that de novo mutations affect genes with diverse functions and developmental profiles but we also find a substantial contribution of mutations in genes with higher expression in early fetal life. Our results help define the pattern of genomic and neural architecture of schizophrenia. PMID:23042115
Emergence of canine distemper virus strains with two amino acid substitutions in the haemagglutinin protein, detected from vaccinated carnivores in North-Eastern China in 2012-2013.

PubMed

Zhao, Jianjun; Zhang, Hailing; Bai, Xue; Martella, Vito; Hu, Bo; Sun, Yangang; Zhu, Chunsheng; Zhang, Lei; Liu, Hao; Xu, Shujuan; Shao, Xiqun; Wu, Wei; Yan, Xijun

2014-04-01

A total of 16 strains of canine distemper virus (CDV) were detected from vaccinated minks, foxes, and raccoon dogs in four provinces in North-Eastern China between the end of 2011 and 2013. Upon sequence analysis of the haemagglutinin gene and comparison with wild-type CDV from different species in the same geographical areas, two non-synonymous single nucleotide polymorphisms were identified in 10 CDV strains, which led to amino acid changes at positions 542 (isoleucine to asparagine) and 549 (tyrosine to histidine) of the haemagglutinin protein coding sequence. The change at residue 542 generated a potentially novel N-glycosylation site. Masking of antigenic epitopes by sugar moieties might represent a mechanism for evasion of virus neutralising antibodies and reduced protection by vaccination. Copyright © 2014 Elsevier Ltd. All rights reserved.
Polymorphisms of 20 regulatory proteins between Mycobacterium tuberculosis and Mycobacterium bovis.

PubMed

Bigi, María M; Blanco, Federico Carlos; Araújo, Flabio R; Thacker, Tyler C; Zumárraga, Martín J; Cataldi, Angel A; Soria, Marcelo A; Bigi, Fabiana

2016-08-01

Mycobacterium tuberculosis and Mycobacterium bovis are responsible for tuberculosis in humans and animals, respectively. Both species are closely related and belong to the Mycobacterium tuberculosis complex (MTC). M. tuberculosis is the most ancient species from which M. bovis and other members of the MTC evolved. The genome of M. bovis is over >99.95% identical to that of M. tuberculosis but with seven deletions ranging in size from 1 to 12.7 kb. In addition, 1200 single nucleotide mutations in coding regions distinguish M. bovis from M. tuberculosis. In the present study, we assessed 75 M. tuberculosis genomes and 23 M. bovis genomes to identify non-synonymous mutations in 202 coding sequences of regulatory genes between both species. We identified species-specific variants in 20 regulatory proteins and confirmed differential expression of hypoxia-related genes between M. bovis and M. tuberculosis. © 2016 The Societies and John Wiley & Sons Australia, Ltd.
Control of total GFP expression by alterations to the 3′ region nucleotide sequence

PubMed Central

2013-01-01

Background Previously, we distinguished the Escherichia coli type II cytoplasmic membrane translocation pathways of Tat, Yid, and Sec for unfolded and folded soluble target proteins. The translocation of folded protein to the periplasm for soluble expression via the Tat pathway was controlled by an N-terminal hydrophilic leader sequence. In this study, we investigated the effect of the hydrophilic C-terminal end and its nucleotide sequence on total and soluble protein expression. Results The native hydrophilic C-terminal end of GFP was obtained by deleting the C-terminal peptide LeuGlu-6×His, derived from pET22b(+). The corresponding clones induced total and soluble GFP expression that was either slightly increased or dramatically reduced, apparently through reconstruction of the nucleotide sequence around the stop codon in the 3′ region. In the expression-induced clones, the hydrophilic C-terminus showed increased Tat pathway specificity for soluble expression. However, in the expression-reduced clone, after analyzing the role of the 5′ poly(A) coding sequence with a substituted synonymous codon, we proved that the longer 5′ poly(A) coding sequence interacted with the reconstructed 3′ region nucleotide sequence to create a new mRNA tertiary structure between the 5′ and 3′ regions, which resulted in reduced total GFP expression. Further, to recover the reduced expression by changing the 3′ nucleotide sequence, after replacing selected C-terminal 5′ codons and the stop codon in the ORF with synonymous codons, total GFP expression in most of the clones was recovered to the undeleted control level. The insertion of trinucleotides after the stop codon in the 3′-UTR recovered or reduced total GFP expression. RT-PCR revealed that the level of total protein expression was controlled by changes in translational or transcriptional regulation, which were induced or reduced by the substitution or insertion of 3′ region nucleotides. Conclusions We found that the hydrophilic C-terminal end of GFP increased Tat pathway specificity and that the 3′ nucleotide sequence played an important role in total protein expression through translational and transcriptional regulation. These findings may be useful for efficiently producing recombinant proteins as well as for potentially controlling the expression level of specific genes in the body for therapeutic purposes. PMID:23834827

Codon Usage Selection Can Bias Estimation of the Fraction of Adaptive Amino Acid Fixations.

PubMed

Matsumoto, Tomotaka; John, Anoop; Baeza-Centurion, Pablo; Li, Boyang; Akashi, Hiroshi

2016-06-01

A growing number of molecular evolutionary studies are estimating the proportion of adaptive amino acid substitutions (α) from comparisons of ratios of polymorphic and fixed DNA mutations. Here, we examine how violations of two of the model assumptions, neutral evolution of synonymous mutations and stationary base composition, affect α estimation. We simulated the evolution of coding sequences assuming weak selection on synonymous codon usage bias and neutral protein evolution, α = 0. We show that weak selection on synonymous mutations can give polymorphism/divergence ratios that yield α-hat (estimated α) considerably larger than its true value. Nonstationary evolution (changes in population size, selection, or mutation) can exacerbate such biases or, in some scenarios, give biases in the opposite direction, α-hat < α. These results demonstrate that two factors that appear to be prevalent among taxa, weak selection on synonymous mutations and non-steady-state nucleotide composition, should be considered when estimating α. Estimates of the proportion of adaptive amino acid fixations from large-scale analyses of Drosophila melanogaster polymorphism and divergence data are positively correlated with codon usage bias. Such patterns are consistent with α-hat inflation from weak selection on synonymous mutations and/or mutational changes within the examined gene trees. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Dietary nitrogen alters codon bias and genome composition in parasitic microorganisms.

PubMed

Seward, Emily A; Kelly, Steven

2016-11-15

Genomes are composed of long strings of nucleotide monomers (A, C, G and T) that are either scavenged from the organism's environment or built from metabolic precursors. The biosynthesis of each nucleotide differs in atomic requirements with different nucleotides requiring different quantities of nitrogen atoms. However, the impact of the relative availability of dietary nitrogen on genome composition and codon bias is poorly understood. Here we show that differential nitrogen availability, due to differences in environment and dietary inputs, is a major determinant of genome nucleotide composition and synonymous codon use in both bacterial and eukaryotic microorganisms. Specifically, low nitrogen availability species use nucleotides that require fewer nitrogen atoms to encode the same genes compared to high nitrogen availability species. Furthermore, we provide a novel selection-mutation framework for the evaluation of the impact of metabolism on gene sequence evolution and show that it is possible to predict the metabolic inputs of related organisms from an analysis of the raw nucleotide sequence of their genes. Taken together, these results reveal a previously hidden relationship between cellular metabolism and genome evolution and provide new insight into how genome sequence evolution can be influenced by adaptation to different diets and environments.
SNPdbe: constructing an nsSNP functional impacts database.

PubMed

Schaefer, Christian; Meier, Alice; Rost, Burkhard; Bromberg, Yana

2012-02-15

Many existing databases annotate experimentally characterized single nucleotide polymorphisms (SNPs). Each non-synonymous SNP (nsSNP) changes one amino acid in the gene product (single amino acid substitution;SAAS). This change can either affect protein function or be neutral in that respect. Most polymorphisms lack experimental annotation of their functional impact. Here, we introduce SNPdbe-SNP database of effects, with predictions of computationally annotated functional impacts of SNPs. Database entries represent nsSNPs in dbSNP and 1000 Genomes collection, as well as variants from UniProt and PMD. SAASs come from >2600 organisms; 'human' being the most prevalent. The impact of each SAAS on protein function is predicted using the SNAP and SIFT algorithms and augmented with experimentally derived function/structure information and disease associations from PMD, OMIM and UniProt. SNPdbe is consistently updated and easily augmented with new sources of information. The database is available as an MySQL dump and via a web front end that allows searches with any combination of organism names, sequences and mutation IDs. http://www.rostlab.org/services/snpdbe.
Identification and validation of single nucleotide polymorphisms in growth- and maturation-related candidate genes in sole (Solea solea L.).

PubMed

Diopere, Eveline; Hellemans, Bart; Volckaert, Filip A M; Maes, Gregory E

2013-03-01

Genomic methodologies applied in evolutionary and fisheries research have been of great benefit to understand the marine ecosystem and the management of natural resources. Although single nucleotide polymorphisms (SNPs) are attractive for the study of local adaptation, spatial stock management and traceability, and investigating the effects of fisheries-induced selection, they have rarely been exploited in non-model organisms. This is partly due to difficulties in finding and validating SNPs in species with limited or no genomic resources. Complementary to random genome-scan approaches, a targeted candidate gene approach has the potential to unveil pre-selected functional diversity and provides more in depth information on the action of selection at specific genes. For example genes can be under selective pressure due to climate change and sustained periods of heavy fishing pressure. In this study, we applied a candidate gene approach in sole (Solea solea L.), an important member of the demersal ecosystem. As consumption flatfish it is heavy exploited and has experienced associated life-history changes over the last 60years. To discover novel genetic polymorphisms in or around genes linked to important life history traits in sole, we screened a total of 76 candidate genes related to growth and maturation using a targeted resequencing approach. We identified in total 86 putative SNPs in 22 genes and validated 29 SNPs using a multiplex single-base extension genotyping assay. We found 22 informative SNPs, of which two represent non-synonymous mutations, potentially of functional relevance. These novel markers should be rapidly and broadly applicable in analyses of natural sole populations, as a measure of the evolutionary signature of overfishing and for initiatives on marker assisted selection. Copyright © 2012 Elsevier B.V. All rights reserved.
The importance of mRNA structure in determining the pathogenicity of synonymous and non-synonymous mutations in haemophilia

PubMed Central

Hamasaki-Katagiri, Nobuko; Lin, Brian C.; Simon, Jonathan; Hunt, Ryan C.; Schiller, Tal; Russek-Cohen, Estelle; Komar, Anton A.; Bar, Haim; Kimchi-Sarfaty, Chava

2016-01-01

Introduction Mutational analysis is commonly used to support the diagnosis and management of haemophilia. This has allowed for the generation of large mutation databases which provide unparalleled insight into genotype-phenotype relationships. Haemophilia is associated with inversions, deletions, insertions, nonsense and missense mutations. Both synonymous and non-synonymous mutations influence the base pairing of messenger RNA (mRNA), which can alter mRNA structure, cellular half-life and ribosome processivity/elongation. However, the role of mRNA structure in determining the pathogenicity of point mutations in haemophilia has not been evaluated. Aim To evaluate mRNA thermodynamic stability and associated RNA prediction software as a means to distinguish between neutral and disease-associated mutations in haemophilia. Methods Five mRNA structure prediction software programs were used to assess the thermodynamic stability of mRNA fragments carrying neutral vs. disease-associated and synonymous vs. non-synonymous point mutations in F8, F9 and a third X-linked gene, DMD (dystrophin). Results In F8 and DMD, disease-associated mutations tend to occur in more structurally stable mRNA regions, represented by lower MFE (minimum free energy) levels. In comparing multiple software packages for mRNA structure prediction, a 101–151 nucleotide fragment length appears to be a feasible range for structuring future studies. Conclusion mRNA thermodynamic stability is one predictive characteristic, which when combined with other RNA and protein features, may offer significant insight when screening sequencing data for novel disease-associated mutations. Our results also suggest potential utility in evaluating the mRNA thermodynamic stability profile of a gene when determining the viability of interchanging codons for biological and therapeutic applications. PMID:27933712
In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation

PubMed Central

Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F.; Sampson, Juliana K.; Khalid, Haniya; Sheth, Nihar U.; Batalo, Michael; Serrano, Myrna G.; Roberts, Catherine H.; Hess, Michael L.; Buck, Gregory A.; Neale, Michael C.; Manjili, Masoud H.; Toor, Amir Ahmed

2014-01-01

Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor–recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential. PMID:25414699
In silico Derivation of HLA-Specific Alloreactivity Potential from Whole Exome Sequencing of Stem-Cell Transplant Donors and Recipients: Understanding the Quantitative Immunobiology of Allogeneic Transplantation.

PubMed

Jameson-Lee, Max; Koparde, Vishal; Griffith, Phil; Scalora, Allison F; Sampson, Juliana K; Khalid, Haniya; Sheth, Nihar U; Batalo, Michael; Serrano, Myrna G; Roberts, Catherine H; Hess, Michael L; Buck, Gregory A; Neale, Michael C; Manjili, Masoud H; Toor, Amir Ahmed

2014-01-01

Donor T-cell mediated graft versus host (GVH) effects may result from the aggregate alloreactivity to minor histocompatibility antigens (mHA) presented by the human leukocyte antigen (HLA) molecules in each donor-recipient pair undergoing stem-cell transplantation (SCT). Whole exome sequencing has previously demonstrated a large number of non-synonymous single nucleotide polymorphisms (SNP) present in HLA-matched recipients of SCT donors (GVH direction). The nucleotide sequence flanking each of these SNPs was obtained and the amino acid sequence determined. All the possible nonameric peptides incorporating the variant amino acid resulting from these SNPs were interrogated in silico for their likelihood to be presented by the HLA class I molecules using the Immune Epitope Database stabilized matrix method (SMM) and NetMHCpan algorithms. The SMM algorithm predicted that a median of 18,396 peptides weakly bound HLA class I molecules in individual SCT recipients, and 2,254 peptides displayed strong binding. A similar library of presented peptides was identified when the data were interrogated using the NetMHCpan algorithm. The bioinformatic algorithm presented here demonstrates that there may be a high level of mHA variation in HLA-matched individuals, constituting a HLA-specific alloreactivity potential.
Sequence variations and protein expression levels of the two immune evasion proteins Gpm1 and Pra1 influence virulence of clinical Candida albicans isolates.

PubMed

Luo, Shanshan; Hipler, Uta-Christina; Münzberg, Christin; Skerka, Christine; Zipfel, Peter F

2015-01-01

Candida albicans, the important human fungal pathogen uses multiple evasion strategies to control, modulate and inhibit host complement and innate immune attack. Clinical C. albicans strains vary in pathogenicity and in serum resistance, in this work we analyzed sequence polymorphisms and variations in the expression levels of two central fungal complement evasion proteins, Gpm1 (phosphoglycerate mutase 1) and Pra1 (pH-regulated antigen 1) in thirteen clinical C. albicans isolates. Four nucleotide (nt) exchanges, all representing synonymous exchanges, were identified within the 747-nt long GPM1 gene. For the 900-nt long PRA1 gene, sixteen nucleotide exchanges were identified, which represented synonymous, as well as non-synonymous exchanges. All thirteen clinical isolates had a homozygous exchange (A to G) at position 73 of the PRA1 gene. Surface levels of Gpm1 varied by 8.2, and Pra1 levels by 3.3 fold in thirteen tested isolates and these differences influenced fungal immune fitness. The high Gpm1/Pra1 expressing candida strains bound the three human immune regulators more efficiently, than the low expression strains. The difference was 44% for Factor H binding, 51% for C4BP binding and 23% for plasminogen binding. This higher Gpm1/Pra1 expressing strains result in enhanced survival upon challenge with complement active, Factor H depleted human serum (difference 40%). In addition adhesion to and infection of human endothelial cells was increased (difference 60%), and C3b surface deposition was less effective (difference 27%). Thus, variable expression levels of central immune evasion protein influences immune fitness of the human fungal pathogen C. albicans and thus contribute to fungal virulence.
The effect of ecosystem biodiversity on virus genetic diversity depends on virus species: A study of chiltepin-infecting begomoviruses in Mexico.

PubMed

Rodelo-Urrego, Manuel; García-Arenal, Fernando; Pagán, Israel

2015-01-01

Current declines in biodiversity put at risk ecosystem services that are fundamental for human welfare. Increasing evidence indicates that one such service is the ability to reduce virus emergence. It has been proposed that the reduction of virus emergence occurs at two levels: through a reduction of virus prevalence/transmission and, as a result of these epidemiological changes, through a limitation of virus genetic diversity. Although the former mechanism has been studied in a few host-virus interactions, very little is known about the association between ecosystem biodiversity and virus genetic diversity. To address this subject, we estimated genetic diversity, synonymous and non-synonymous nucleotide substitution rates, selection pressures, and frequency of recombinants and re-assortants in populations of Pepper golden mosaic virus (PepGMV) and Pepper huasteco yellow vein virus (PHYVV) that infect chiltepin plants in Mexico. We then analyzed how these parameters varied according to the level of habitat anthropization, which is the major cause of biodiversity loss. Our results indicated that genetic diversity of PepGMV (but not of PHYVV) populations increased with the loss of biodiversity at higher levels of habitat anthropization. This was mostly the consequence of higher rates of synonymous nucleotide substitutions, rather than of adaptive selection. The frequency of recombinants and re-assortants was higher in PepGMV populations infecting wild chiltepin than in those infecting cultivated ones, suggesting that genetic exchange is not the main mechanism for generating genetic diversity in PepGMV populations. These findings provide evidence that biodiversity may modulate the genetic diversity of plant viruses, but it may differentially affect even two closely related viruses. Our analyses may contribute to understanding the factors involved in virus emergence.
Comparative sequence analysis of domain I of Plasmodium falciparum apical membrane antigen 1 from Saudi Arabia and worldwide isolates.

PubMed

Al-Qahtani, Ahmed A; Abdel-Muhsin, Abdel-Muhsin A; Dajem, Saad M Bin; AlSheikh, Adel Ali H; Bohol, Marie Fe F; Al-Ahdal, Mohammed N; Putaporntip, Chaturong; Jongwutiwes, Somchai

2016-04-01

The apical membrane antigen 1 of Plasmodium falciparum (PfAMA1) plays a crucial role in erythrocyte invasion and is a target of protective antibodies. Although domain I of PfAMA1 has been considered a promising vaccine component, extensive sequence diversity in this domain could compromise an effective vaccine design. To explore the extent of sequence diversity in domain I of PfAMA1, P. falciparum-infected blood samples from Saudi Arabia collected between 2007 and 2009 were analyzed and compared with those from worldwide parasite populations. Forty-six haplotypes and a novel codon change (M190V) were found among Saudi Arabian isolates. The haplotype diversity (0.948±0.004) and nucleotide diversity (0.0191±0.0008) were comparable to those from African hyperendemic countries. Positive selection in domain I of PfAMA1 among Saudi Arabian parasite population was observed because nonsynonymous nucleotide substitutions per nonsynonymous site (dN) significantly exceeded synonymous nucleotide substitutions per synonymous site (dS) and Tajima's D and its related statistics significantly deviated from neutrality in the positive direction. Despite a relatively low prevalence of malaria in Saudi Arabia, a minimum of 17 recombination events occurred in domain I. Genetic differentiation was significant between P. falciparum in Saudi Arabia and parasites from other geographic origins. Several shared or closely related haplotypes were found among parasites from different geographic areas, suggesting that vaccine derived from multiple shared epitopes could be effective across endemic countries. Copyright © 2016 Elsevier B.V. All rights reserved.
Molecular characterization of the 17D-204 yellow fever vaccine.

PubMed

Salmona, Maud; Gazaignes, Sandrine; Mercier-Delarue, Severine; Garnier, Fabienne; Korimbocus, Jehanara; Colin de Verdière, Nathalie; LeGoff, Jerome; Roques, Pierre; Simon, François

2015-10-05

The worldwide use of yellow fever (YF) live attenuated vaccines came recently under close scrutiny as rare but serious adverse events have been reported. The population identified at major risk for these safety issues were extreme ages and immunocompromised subjects. Study NCT01426243 conducted by the French National Agency for AIDS research is an ongoing interventional study to evaluate the safety of the vaccine and the specific immune responses in HIV-infected patients following 17D-204 vaccination. As a preliminary study, we characterized the molecular diversity from E gene of the single 17D-204 vaccine batch used in this clinical study. Eight vials of lyophilized 17D-204 vaccine (Stamaril, Sanofi-Pasteur, Lyon, France) of the E5499 batch were reconstituted for viral quantification, cloning and sequencing of C/prM/E region. The average rate of virions per vial was 8.68 ± 0.07 log₁₀ genome equivalents with a low coefficient of variation (0.81%). 246 sequences of the C/prM/E region (29-33 per vials) were generated and analyzed for the eight vials, 25 (10%) being defective and excluded from analyses. 95% of sequences had at least one nucleotide mutation. The mutations were observed on 662 variant sites distributed through all over the 1995 nucleotides sequence and were mainly non-synonymous (66%). Genome variability between vaccine vials was highly homogeneous with a nucleotide distance ranging from 0.29% to 0.41%. Average p-distances observed for each vial were also homogeneous, ranging from 0.15% to 0.31%. This study showed a homogenous YF virus RNA quantity in vaccine vials within a single lot and a low clonal diversity inter and intra vaccine vials. These results are consistent with a recent study showing that the main mechanism of attenuation resulted in the loss of diversity in the YF virus quasi-species. Copyright © 2015 Elsevier Ltd. All rights reserved.
The Complete Nucleotide Sequence of the Mitochondrial Genome of Bactrocera minax (Diptera: Tephritidae)

PubMed Central

Zhang, Bin; Nardi, Francesco; Hull-Sanders, Helen; Wan, Xuanwu; Liu, Yinghong

2014-01-01

The complete 16,043 bp mitochondrial genome (mitogenome) of Bactrocera minax (Diptera: Tephritidae) has been sequenced. The genome encodes 37 genes usually found in insect mitogenomes. The mitogenome information for B. minax was compared to the homologous sequences of Bactrocera oleae, Bactrocera tryoni, Bactrocera philippinensis, Bactrocera carambolae, Bactrocera papayae, Bactrocera dorsalis, Bactrocera correcta, Bactrocera cucurbitae and Ceratitis capitata. The analysis indicated the structure and organization are typical of, and similar to, the nine closely related species mentioned above, although it contains the lowest genome-wide A+T content (67.3%). Four short intergenic spacers with a high degree of conservation among the nine tephritid species mentioned above and B. minax were observed, which also have clear counterparts in the control regions (CRs). Correlation analysis among these ten tephritid species revealed close positive correlation between the A+T content of zero-fold degenerate sites (P0FD), the ratio of nucleotide substitution frequency at P0FD sites to all degenerate sites (zero-fold degenerate sites, two-fold degenerate sites and four-fold degenerate sites) and amino acid sequence distance (ASD) were found. Further, significant positive correlation was observed between the A+T content of four-fold degenerate sites (P4FD) and the ratio of nucleotide substitution frequency at P4FD sites to all degenerate sites; however, we found significant negative correlation between ASD and the A+T content of P4FD, and the ratio of nucleotide substitution frequency at P4FD sites to all degenerate sites. A higher nucleotide substitution frequency at non-synonymous sites compared to synonymous sites was observed in nad4, the first time that has been observed in an insect mitogenome. A poly(T) stretch at the 5′ end of the CR followed by a [TA(A)]n-like stretch was also found. In addition, a highly conserved G+A-rich sequence block was observed in front of the poly(T) stretch among the ten tephritid species and two tandem repeats were present in the CR. PMID:24964138
GESPA: classifying nsSNPs to predict disease association.

PubMed

Khurana, Jay K; Reeder, Jay E; Shrimpton, Antony E; Thakar, Juilee

2015-07-25

Non-synonymous single nucleotide polymorphisms (nsSNPs) are the most common DNA sequence variation associated with disease in humans. Thus determining the clinical significance of each nsSNP is of great importance. Potential detrimental nsSNPs may be identified by genetic association studies or by functional analysis in the laboratory, both of which are expensive and time consuming. Existing computational methods lack accuracy and features to facilitate nsSNP classification for clinical use. We developed the GESPA (GEnomic Single nucleotide Polymorphism Analyzer) program to predict the pathogenicity and disease phenotype of nsSNPs. GESPA is a user-friendly software package for classifying disease association of nsSNPs. It allows flexibility in acceptable input formats and predicts the pathogenicity of a given nsSNP by assessing the conservation of amino acids in orthologs and paralogs and supplementing this information with data from medical literature. The development and testing of GESPA was performed using the humsavar, ClinVar and humvar datasets. Additionally, GESPA also predicts the disease phenotype associated with a nsSNP with high accuracy, a feature unavailable in existing software. GESPA's overall accuracy exceeds existing computational methods for predicting nsSNP pathogenicity. The usability of GESPA is enhanced by fast SQL-based cloud storage and retrieval of data. GESPA is a novel bioinformatics tool to determine the pathogenicity and phenotypes of nsSNPs. We anticipate that GESPA will become a useful clinical framework for predicting the disease association of nsSNPs. The program, executable jar file, source code, GPL 3.0 license, user guide, and test data with instructions are available at http://sourceforge.net/projects/gespa.
Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds.

PubMed

Stafuzza, Nedenia Bonvino; Zerlotini, Adhemar; Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto

2017-01-01

Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.
Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds

PubMed Central

Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J.; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto

2017-01-01

Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs. PMID:28323836
Slow but not low: genomic comparisons reveal slower evolutionary rate and higher dN/dS in conifers compared to angiosperms

PubMed Central

2012-01-01

Background Comparative genomics can inform us about the processes of mutation and selection across diverse taxa. Among seed plants, gymnosperms have been lacking in genomic comparisons. Recent EST and full-length cDNA collections for two conifers, Sitka spruce (Picea sitchensis) and loblolly pine (Pinus taeda), together with full genome sequences for two angiosperms, Arabidopsis thaliana and poplar (Populus trichocarpa), offer an opportunity to infer the evolutionary processes underlying thousands of orthologous protein-coding genes in gymnosperms compared with an angiosperm orthologue set. Results Based upon pairwise comparisons of 3,723 spruce and pine orthologues, we found an average synonymous genetic distance (dS) of 0.191, and an average dN/dS ratio of 0.314. Using a fossil-established divergence time of 140 million years between spruce and pine, we extrapolated a nucleotide substitution rate of 0.68 × 10-9 synonymous substitutions per site per year. When compared to angiosperms, this indicates a dramatically slower rate of nucleotide substitution rates in conifers: on average 15-fold. Coincidentally, we found a three-fold higher dN/dS for the spruce-pine lineage compared to the poplar-Arabidopsis lineage. This joint occurrence of a slower evolutionary rate in conifers with higher dN/dS, and possibly positive selection, showcases the uniqueness of conifer genome evolution. Conclusions Our results are in line with documented reduced nucleotide diversity, conservative genome evolution and low rates of diversification in conifers on the one hand and numerous examples of local adaptation in conifers on the other hand. We propose that reduced levels of nucleotide mutation in large and long-lived conifer trees, coupled with large effective population size, were the main factors leading to slow substitution rates but retention of beneficial mutations. PMID:22264329
Polymorphisms of clip domain serine proteinase and serine proteinase homolog in the swimming crab Portunus trituberculatus and their association with Vibrio alginolyticus

NASA Astrophysics Data System (ADS)

Liu, Meng; Liu, Yuan; Hui, Min; Song, Chengwen; Cui, Zhaoxia

2017-03-01

Clip domain serine proteases (cSPs) and their homologs (SPHs) play an important role in various biological processes that are essential components of extracellular signaling cascades, especially in the innate immune responses of invertebrates. Here, polymorphisms of PtcSP and PtSPH from the swimming crab Portunus trituberculatus were investigated to explore their association with resistance/susceptibility to Vibrio alginolyticus. Polymorphic loci were identified using Clustal X, and characterized with SPSS 16.0 software, and then the significance of genotype and allele frequencies between resistant and susceptible stocks was determined by a χ 2 test. A total of 109 and 77 single nucleotide polymorphisms (SNPs) were identified in the genomic fragments of PtcSP and PtSPH, respectively. Notably, nearly half of PtSPH polymorphisms were found in the non-coding exon 1. Fourteen SNPs investigated were significantly associated with susceptibility/resistance to V. alginolyticus ( P <0.05). Among them, eight SNPs were observed in introns, and one synonymous, four non-synonymous SNPs and one ins-del were found in coding exons. In addition, five simple sequence repeats (SSRs) were detected in intron 3 of PtcSP. Although there was no statistically significant difference of allele frequencies, the SSRs showed different polymorphic alleles on the basis of the repeat number between resistant and susceptible stocks. After further validation, polymorphisms investigated here might be applied to select potential molecular markers of P. trituberculatus with resistance to V. alginolyticus.
Rare missense mutations in P2RY11 in narcolepsy with cataplexy.

PubMed

Degn, Matilda; Dauvilliers, Yves; Dreisig, Karin; Lopez, Régis; Pfister, Corinne; Pradervand, Sylvain; Rahbek Kornum, Birgitte; Tafti, Mehdi

2017-06-01

The sleep disorder narcolepsy with cataplexy is characterized by a highly specific loss of hypocretin (orexin) neurons, leading to the hypothesis that the condition is caused by an immune or autoimmune mechanism. All genetic variants associated with narcolepsy are immune-related. Among these are single nucleotide polymorphisms in the P2RY11-EIF3G locus. It is unknown how these genetic variants affect narcolepsy pathogenesis and whether the effect is directly related to P2Y11 signalling or EIF3G function. Exome sequencing in 18 families with at least two affected narcolepsy with cataplexy subjects revealed non-synonymous mutations in the second exon of P2RY11 in two families, and P2RY11 re-sequencing in 250 non-familial cases and 135 healthy control subjects revealed further six different non-synonymous mutations in the second exon of P2RY11 in seven patients. No mutations were found in healthy controls. Six of the eight narcolepsy-associated P2Y11 mutations resulted in significant functional deficits in P2Y11 signalling through both Ca2+ and cAMP signalling pathways. In conclusion, our data show that decreased P2Y11 signalling plays an important role in the development of narcolepsy with cataplexy. © The Author (2017). Published by Oxford University Press on behalf of the Guarantors of Brain. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Identification of bovine NPC1 gene cSNPs and their effects on body size traits of Qinchuan cattle.

PubMed

Dang, Yonglong; Li, Mingxun; Yang, Mingjuan; Cao, Xiukai; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Lin, Qing; Chen, Hong

2014-05-01

NPC1 gene is an important gene closely related to the Niemann-Pick type C (NPC). Mutations in the NPC1 gene tend to cause Niemann-Pick type C, a lysosomal storage disorder. Previous studies have shown that NPC1 protein plays an important role in subcellular lipid transport, homeostasis, platelet function and formation, which are basic metabolic activities in the process of development. In this study, to explore the association between the NPC1 gene variation and body size traits in Qinchuan cattle, we detected four novel coding single nucleotide polymorphisms (cSNPs) in the bovine NPC1 gene, including one missense mutation (SNP1) and three synonymous mutations (SNP2, SNP3 and SNP4). Population genetic analyses of 518 individuals and association correlations between cSNPs and bovine body size traits were conducted in this research. A missense mutation at SNP1 locus was found to be significantly related to the heart girth, hip width and body weight (P<0.01 or P<0.05, 3.5-year-old). Two synonymous mutations at SNP2 and SNP3 loci also showed significant effects on hip width (P<0.05, 3.5-year-old). One synonymous mutation at SNP4 locus showed significant effect on body weight (P<0.05, 2.0-year-old). Combined haplotypes H2H6 and H6H6 showed significant effects on body size traits such as heart girth, hip width, and body weight (3.5-year-old, P<0.01 or P<0.05). This study provides evidence that the NPC1 gene might be involved in the regulation of bovine growth and body development, and may be considered as a candidate gene for marker assisted selection (MAS) in beef cattle breeding industry. Copyright © 2014. Published by Elsevier B.V.
Second generation DNA sequencing of the mitogenome of the Chinstrap penguin and comparative genomics of Antarctic penguins.

PubMed

Subramanian, Sankar; Lingala, Syamala Gowri; Swaminathan, Siva; Huynen, Leon; Lambert, David

2014-08-01

The complete mitochondrial genome of the Chinstrap penguin (Pygoscelis antarcticus) was sequenced and compared with other penguin mitogenomes. The genome is 15,972 bp in length with the number and order of protein coding genes and RNAs being very similar to that of other known penguin mitogenomes. Comparative nucleotide analysis showed the Chinstrap mitogenome shares 94% homology with the mitogenome of its sister species, Pygoscelis adelie (Adélie penguin). Divergence at nonsynonymous nucleotide positions was found to be up to 23 times less than that observed in synonymous positions of protein coding genes, suggesting high selection constraints. The complete mitogenome data will be useful for genetic and evolutionary studies of penguins.

Polymorphisms of the bovine DKK2 and their associations with body measurement traits and meat quality traits in Qinchuan cattle.

PubMed

Zhan, Xiaoli; Gao, Jianbin; Huangfu, Yifan; Fu, Changzhen; Zan, Linsen

2013-12-01

The objective of this research were to detect bovine Dickkopf 2 (DKK2) gene polymorphism and analyze their associations with body measurement traits (BMT) and meat quality traits (MQT) of animals. Blood samples were taken from a total of 541 Qinchuan cattle aged from 18 to 24 months. Polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) was employed to find out DKK2 single-polymorphism nucleotide (SNPs) and to explore their possible association with BMT and MQT. Sequence analysis of DKK2 gene revealed 2 SNPs (C29 T and A169C) in 5' untranslated region (5'UTR) of exon 1.C29T and A164T SNPs are both synonymous mutation, which showed 2 genotypes namely (CC, CT) and (AA and AC), respectively. Association analysis of polymorphism with body measurement and meat quality traits at the two locus showed that there were significant effects on CT, BL, RL, PBW, BFT, LMA, and IFC. These results suggest that the DKK2 gene might have potential effects on BMT and MQT in Qinchuan cattle population and could be used for marker-assisted selection.
Molecular Characterization and Comparative Sequence Analysis of Defense-Related Gene, Oryza rufipogon Receptor-Like Protein Kinase 1

PubMed Central

Law, Yee-Song; Gudimella, Ranganath; Song, Beng-Kah; Ratnam, Wickneswari; Harikrishna, Jennifer Ann

2012-01-01

Many of the plant leucine rich repeat receptor-like kinases (LRR-RLKs) have been found to regulate signaling during plant defense processes. In this study, we selected and sequenced an LRR-RLK gene, designated as Oryza rufipogon receptor-like protein kinase 1 (OrufRPK1), located within yield QTL yld1.1 from the wild rice Oryza rufipogon (accession IRGC105491). A 2055 bp coding region and two exons were identified. Southern blotting determined OrufRPK1 to be a single copy gene. Sequence comparison with cultivated rice orthologs (OsI219RPK1, OsI9311RPK1 and OsJNipponRPK1, respectively derived from O. sativa ssp. indica cv. MR219, O. sativa ssp. indica cv. 9311 and O. sativa ssp. japonica cv. Nipponbare) revealed the presence of 12 single nucleotide polymorphisms (SNPs) with five non-synonymous substitutions, and 23 insertion/deletion sites. The biological role of the OrufRPK1 as a defense related LRR-RLK is proposed on the basis of cDNA sequence characterization, domain subfamily classification, structural prediction of extra cellular domains, cluster analysis and comparative gene expression. PMID:22942769
Hybridization properties of long nucleic acid probes for detection of variable target sequences, and development of a hybridization prediction algorithm

PubMed Central

Öhrmalm, Christina; Jobs, Magnus; Eriksson, Ronnie; Golbob, Sultan; Elfaitouri, Amal; Benachenhou, Farid; Strømme, Maria; Blomberg, Jonas

2010-01-01

One of the main problems in nucleic acid-based techniques for detection of infectious agents, such as influenza viruses, is that of nucleic acid sequence variation. DNA probes, 70-nt long, some including the nucleotide analog deoxyribose-Inosine (dInosine), were analyzed for hybridization tolerance to different amounts and distributions of mismatching bases, e.g. synonymous mutations, in target DNA. Microsphere-linked 70-mer probes were hybridized in 3M TMAC buffer to biotinylated single-stranded (ss) DNA for subsequent analysis in a Luminex® system. When mismatches interrupted contiguous matching stretches of 6 nt or longer, it had a strong impact on hybridization. Contiguous matching stretches are more important than the same number of matching nucleotides separated by mismatches into several regions. dInosine, but not 5-nitroindole, substitutions at mismatching positions stabilized hybridization remarkably well, comparable to N (4-fold) wobbles in the same positions. In contrast to shorter probes, 70-nt probes with judiciously placed dInosine substitutions and/or wobble positions were remarkably mismatch tolerant, with preserved specificity. An algorithm, NucZip, was constructed to model the nucleation and zipping phases of hybridization, integrating both local and distant binding contributions. It predicted hybridization more exactly than previous algorithms, and has the potential to guide the design of variation-tolerant yet specific probes. PMID:20864443
Interplay Between Capsule Expression and Uracil Metabolism in Streptococcus pneumoniae D39

PubMed Central

Carvalho, Sandra M.; Kloosterman, Tomas G.; Manzoor, Irfan; Caldas, José; Vinga, Susana; Martinussen, Jan; Saraiva, Lígia M.; Kuipers, Oscar P.; Neves, Ana R.

2018-01-01

Pyrimidine nucleotides play an important role in the biosynthesis of activated nucleotide sugars (NDP-sugars). NDP-sugars are the precursors of structural polysaccharides in bacteria, including capsule, which is a major virulence factor of the human pathogen S. pneumoniae. In this work, we identified a spontaneous non-reversible mutant of strain D39 that displayed a non-producing capsule phenotype. Whole-genome sequencing analysis of this mutant revealed several non-synonymous single base modifications, including in genes of the de novo synthesis of pyrimidines and in the −10 box of capsule operon promoter (Pcps). By directed mutagenesis we showed that the point mutation in Pcps was solely responsible for the drastic decrease in capsule expression. We also demonstrated that D39 subjected to uracil deprivation shows increased biomass and decreased Pcps activity and capsule amounts. Importantly, Pcps expression is further decreased by mutating the first gene of the de novo synthesis of pyrimidines, carA. In contrast, the absence of uracil from the culture medium showed no effect on the spontaneous mutant strain. Co-cultivation of the wild-type and the mutant strain indicated a competitive advantage of the spontaneous mutant (non-producing capsule) in medium devoid of uracil. We propose a model in that uracil may act as a signal for the production of different capsule amounts in S. pneumoniae. PMID:29599757
Interplay Between Capsule Expression and Uracil Metabolism in Streptococcus pneumoniae D39.

PubMed

Carvalho, Sandra M; Kloosterman, Tomas G; Manzoor, Irfan; Caldas, José; Vinga, Susana; Martinussen, Jan; Saraiva, Lígia M; Kuipers, Oscar P; Neves, Ana R

2018-01-01

Pyrimidine nucleotides play an important role in the biosynthesis of activated nucleotide sugars (NDP-sugars). NDP-sugars are the precursors of structural polysaccharides in bacteria, including capsule, which is a major virulence factor of the human pathogen S. pneumoniae . In this work, we identified a spontaneous non-reversible mutant of strain D39 that displayed a non-producing capsule phenotype. Whole-genome sequencing analysis of this mutant revealed several non-synonymous single base modifications, including in genes of the de novo synthesis of pyrimidines and in the -10 box of capsule operon promoter (P cps ). By directed mutagenesis we showed that the point mutation in P cps was solely responsible for the drastic decrease in capsule expression. We also demonstrated that D39 subjected to uracil deprivation shows increased biomass and decreased P cps activity and capsule amounts. Importantly, P cps expression is further decreased by mutating the first gene of the de novo synthesis of pyrimidines, carA . In contrast, the absence of uracil from the culture medium showed no effect on the spontaneous mutant strain. Co-cultivation of the wild-type and the mutant strain indicated a competitive advantage of the spontaneous mutant (non-producing capsule) in medium devoid of uracil. We propose a model in that uracil may act as a signal for the production of different capsule amounts in S. pneumoniae .
A mutation of the p63 gene in non‐syndromic cleft lip

PubMed Central

Leoyklang, P; Siriwan, P; Shotelersuk, V

2006-01-01

Mutations in the p63 gene (TP63) underlie several monogenic malformation syndromes manifesting cleft lip with or without cleft palate (CL/P). We investigated whether p63 mutations also result in non‐syndromic CL/P. Specifically, we performed mutation analysis of the 16 exons of the p63 gene for 100 Thai patients with non‐syndromic CL/P. In total, 21 variant sites were identified. All were single nucleotide changes, with six in coding regions, including three novel non‐synonymous changes: S90L, R313G, and D564H. The R313G was concluded to be pathogenic on the basis of its amino acid change, evolutionary conservation, its occurrence in a functionally important domain, its predicted damaging function, its de novo occurrence, and its absence in 500 control individuals. Our data strongly suggest, for the first time, a causative role of a heterozygous mutation in the p63 gene in non‐syndromic CL/P, highlighting the wide phenotypic spectrum of p63 gene mutations. PMID:16740912
Domain- and nucleotide-specific Rev response element regulation of feline immunodeficiency virus production

PubMed Central

Na, Hong; Huisman, Willem; Ellestad, Kristofor K.; Phillips, Tom R.; Power, Christopher

2010-01-01

Computational analysis of feline immunodeficiency virus (FIV) RNA sequences indicated that common FIV strains contain a rev response element (RRE) defined by a long unbranched hairpin with 6 stem-loop sub-domains, termed stem-loop A (SLA). To examine the role of the RNA secondary structure of the RRE, mutational analyses were performed in both an infectious FIV molecular clone and a FIV CAT-RRE reporter system. These studies disclosed that the stems within SLA (SA1, 2, 3, 4, and 5) of the RRE were critical but SA6 was not essential for FIV replication and CAT expression. These studies also revealed that the secondary structure rather than an antisense protein (ASP) mediates virus expression and replication in vitro. In addition, a single synonymous mutation within the FIV-RRE, SA3/45, reduced viral reverse transcriptase activity and p24 expression after transfection but in addition also showed a marked reduction in viral expression and production following infection. PMID:20570310
Molecular characterization of Mycobacterium tuberculosis isolates from elephants of Nepal.

PubMed

Paudel, Sarad; Mikota, Susan K; Nakajima, Chie; Gairhe, Kamal P; Maharjan, Bhagwan; Thapa, Jeewan; Poudel, Ajay; Shimozuru, Michito; Suzuki, Yasuhiko; Tsubota, Toshio

2014-05-01

Mycobacterium tuberculosis was cultured from the lung tissues of 3 captive elephants in Nepal that died with extensive lung lesions. Spoligotyping, TbD1 detection and multi-locus variable number of tandem repeat analysis (MLVA) results suggested 3 isolates belonged to a specific lineage of Indo-Oceanic clade, EAI5 SIT 138. One of the elephant isolates had a new synonymous single nucleotide polymorphism (SNP) T231C in the gyrA sequence, and the same SNP was also found in human isolates in Nepal. MLVA results and transfer history of the elephants suggested that 2 of them might be infected with M. tuberculosis from the same source. These findings indicated the source of M. tuberculosis infection of those elephants were local residents, presumably their handlers. Further investigation including detailed genotyping of elephant and human isolates is needed to clarify the infection route and eventually prevent the transmission of tuberculosis to susceptible hosts. Copyright © 2014 Elsevier Ltd. All rights reserved.
Stochastic processes constrain the within and between host evolution of influenza virus.

PubMed

McCrone, John T; Woods, Robert J; Martin, Emily T; Malosh, Ryan E; Monto, Arnold S; Lauring, Adam S

2018-05-03

The evolutionary dynamics of influenza virus ultimately derive from processes that take place within and between infected individuals. Here we define influenza virus dynamics in human hosts through sequencing of 249 specimens from 200 individuals collected over 6290 person-seasons of observation. Because these viruses were collected from individuals in a prospective community-based cohort, they are broadly representative of natural infections with seasonal viruses. Consistent with a neutral model of evolution, sequence data from 49 serially sampled individuals illustrated the dynamic turnover of synonymous and nonsynonymous single nucleotide variants and provided little evidence for positive selection of antigenic variants. We also identified 43 genetically-validated transmission pairs in this cohort. Maximum likelihood optimization of multiple transmission models estimated an effective transmission bottleneck of 1-2 genomes. Our data suggest that positive selection is inefficient at the level of the individual host and that stochastic processes dominate the host-level evolution of influenza viruses. © 2018, McCrone et al.
Mutation analysis of GLDC, AMT and GCSH in cataract captive-bred vervet monkeys (Chlorocebus aethiops).

PubMed

Chauke, Chesa G; Magwebu, Zandisiwe E; Sharma, Jyoti R; Arieff, Zainunisha; Seier, Jürgen V

2016-08-01

Non-ketotic hyperglycinaemia (NKH) is an autosomal recessive inborn error of glycine metabolism characterized by accumulation of glycine in body fluids and various neurological symptoms. This study describes the first screening of NKH in cataract captive-bred vervet monkeys (Chlorocebus aethiops). Glycine dehydrogenase (GLDC), aminomethyltransferase (AMT) and glycine cleavage system H protein (GCSH) were prioritized. Mutation analysis of the complete coding sequence of GLDC and AMT revealed six novel single-base substitutions, of which three were non-synonymous missense and three were silent nucleotide changes. Although deleterious effects of the three amino acid substitutions were not evaluated, one substitution of GLDC gene (S44R) could be disease-causing because of its drastic amino acid change, affecting amino acids conserved in different primate species. This study confirms the diagnosis of NKH for the first time in vervet monkeys with cataracts. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
A rare genetic disorder causing persistent severe neonatal hypoglycaemia the diagnostic workup.

PubMed

Francescato, Gaia; Salvatoni, Alessandro; Persani, Luca; Agosti, Massimo

2012-07-19

We report a case of familial glucocorticoid deficiency (FGD), a rare genetic autosomal-recessive disorder with typical hyperpigmentation of the skin and mucous membranes, severe hypoglycaemia, occasionally leading to seizures and coma, feeding difficulties, failure to thrive and infections. A newborn child was admitted, on his second day of life, to our neonatal intensive care unit because of seizures and respiratory insufficiency. Hyperpigmentation was not evident due to his Senegalese origin. The clinical presentation led us to consider a wide range of diagnostic hypothesis. Laboratory findings brought us to the diagnosis of FGD that was confirmed by molecular analysis showing an MC2R:p.Y254C mutation previously reported as causative of type 1 FGD and two novel heterozygous non-synonymous single-nucleotide polymorphisms in exon 2 and 3 of melanocortin 2 receptor accessory protein-α, whose role in the disease is currently unknown. The importance of an early collection and storage of blood samples during hypoglycaemic event is emphasised.
Interleukin 1 beta gene and risk of schizophrenia: detailed case-control and family-based studies and an updated meta-analysis.

PubMed

Shibuya, Masako; Watanabe, Yuichiro; Nunokawa, Ayako; Egawa, Jun; Kaneko, Naoshi; Igeta, Hirofumi; Someya, Toshiyuki

2014-01-01

Interleukin-1 beta (IL-1β) has been implicated in the pathophysiology of schizophrenia. To assess whether the IL1B gene confers increased susceptibility to schizophrenia, we conducted case-control and family-based studies and an updated meta-analysis. We tested the association between IL1B and schizophrenia in 1229 case-control and 112 trio samples using 12 markers, including common tagging single nucleotide variations (SNVs) and a rare non-synonymous variation detected by resequencing the coding regions. We also performed a meta-analysis of rs16944 using a total of 8724 case-control and 201 trio samples from 16 independent populations. We found no significant associations between any of the 12 SNVs examined and schizophrenia in either case-control or trio samples. Moreover, our meta-analysis results showed no significant association between the common SNV, rs16944, and schizophrenia. The present study does not support a role for IL1B in schizophrenia susceptibility.
Population Genomic Analysis Reveals a Rich Speciation and Demographic History of Orang-utans (Pongo pygmaeus and Pongo abelii)

PubMed Central

Ma, Xin; Kelley, Joanna L.; Eilertson, Kirsten; Musharoff, Shaila; Degenhardt, Jeremiah D.; Martins, André L.; Vinar, Tomas; Kosiol, Carolin; Siepel, Adam; Gutenkunst, Ryan N.; Bustamante, Carlos D.

2013-01-01

To gain insights into evolutionary forces that have shaped the history of Bornean and Sumatran populations of orang-utans, we compare patterns of variation across more than 11 million single nucleotide polymorphisms found by previous mitochondrial and autosomal genome sequencing of 10 wild-caught orang-utans. Our analysis of the mitochondrial data yields a far more ancient split time between the two populations (∼3.4 million years ago) than estimates based on autosomal data (0.4 million years ago), suggesting a complex speciation process with moderate levels of primarily male migration. We find that the distribution of selection coefficients consistent with the observed frequency spectrum of autosomal non-synonymous polymorphisms in orang-utans is similar to the distribution in humans. Our analysis indicates that 35% of genes have evolved under detectable negative selection. Overall, our findings suggest that purifying natural selection, genetic drift, and a complex demographic history are the dominant drivers of genome evolution for the two orang-utan populations. PMID:24194868
Population genomic analysis reveals a rich speciation and demographic history of orang-utans (Pongo pygmaeus and Pongo abelii).

PubMed

Ma, Xin; Kelley, Joanna L; Eilertson, Kirsten; Musharoff, Shaila; Degenhardt, Jeremiah D; Martins, André L; Vinar, Tomas; Kosiol, Carolin; Siepel, Adam; Gutenkunst, Ryan N; Bustamante, Carlos D

2013-01-01

To gain insights into evolutionary forces that have shaped the history of Bornean and Sumatran populations of orang-utans, we compare patterns of variation across more than 11 million single nucleotide polymorphisms found by previous mitochondrial and autosomal genome sequencing of 10 wild-caught orang-utans. Our analysis of the mitochondrial data yields a far more ancient split time between the two populations (~3.4 million years ago) than estimates based on autosomal data (0.4 million years ago), suggesting a complex speciation process with moderate levels of primarily male migration. We find that the distribution of selection coefficients consistent with the observed frequency spectrum of autosomal non-synonymous polymorphisms in orang-utans is similar to the distribution in humans. Our analysis indicates that 35% of genes have evolved under detectable negative selection. Overall, our findings suggest that purifying natural selection, genetic drift, and a complex demographic history are the dominant drivers of genome evolution for the two orang-utan populations.
Low incidence of DNA sequence variation in human induced pluripotent stem cells generated by non-integrating plasmid expression

PubMed Central

Cheng, Linzhao; Hansen, Nancy F.; Zhao, Ling; Du, Yutao; Zou, Chunlin; Donovan, Frank X.; Chou, Bin-Kuan; Zhou, Guangyu; Li, Shijie; Dowey, Sarah N.; Ye, Zhaohui; Chandrasekharappa, Settara C.; Yang, Huanming; Mullikin, James C.; Liu, P. Paul

2012-01-01

Summary The utility of induced pluripotent stem cells (iPSCs) as models to study diseases and as sources for cell therapy depends on the integrity of their genomes. Despite recent publications of DNA sequence variations in the iPSCs, the true scope of such changes for the entire genome is not clear. Here we report the whole-genome sequencing of three human iPSC lines derived from two cell types of an adult donor by episomal vectors. The vector sequence was undetectable in the deeply sequenced iPSC lines. We identified 1058–1808 heterozygous single nucleotide variants (SNVs), but no copy number variants, in each iPSC line. Six to twelve of these SNVs were within coding regions in each iPSC line, but ~50% of them are synonymous changes and the remaining are not selectively enriched for known genes associated with cancers. Our data thus suggest that episome-mediated reprogramming is not inherently mutagenic during integration-free iPSC induction. PMID:22385660
Impact of genomic polymorphisms on the repertoire of human MHC class I-associated peptides

PubMed Central

Granados, Diana Paola; Sriranganadane, Dev; Daouda, Tariq; Zieger, Antoine; Laumont, Céline M.; Caron-Lizotte, Olivier; Boucher, Geneviève; Hardy, Marie-Pierre; Gendron, Patrick; Côté, Caroline; Lemieux, Sébastien; Thibault, Pierre; Perreault, Claude

2014-01-01

For decades, the global impact of genomic polymorphisms on the repertoire of peptides presented by major histocompatibility complex (MHC) has remained a matter of speculation. Here we present a novel approach that enables high-throughput discovery of polymorphic MHC class I-associated peptides (MIPs), which play a major role in allorecognition. On the basis of comprehensive analyses of the genomic landscape of MIPs eluted from B lymphoblasts of two MHC-identical siblings, we show that 0.5% of non-synonymous single nucleotide variations are represented in the MIP repertoire. The 34 polymorphic MIPs found in our subjects are encoded by bi-allelic loci with dominant and recessive alleles. Our analyses show that, at the population level, 12% of the MIP-coding exome is polymorphic. Our method provides fundamental insights into the relationship between the genomic self and the immune self and accelerates the discovery of polymorphic MIPs (also known as minor histocompatibility antigens). PMID:24714562
Triticum mosaic virus exhibits limited population variation yet shows evidence of parallel evolution after replicated serial passage in wheat.

PubMed

Bartels, Melissa; French, Roy; Graybosch, Robert A; Tatineni, Satyanarayana

2016-05-01

An infectious cDNA clone of Triticum mosaic virus (TriMV) (genus Poacevirus; family Potyviridae) was used to establish three independent lineages in wheat to examine intra-host population diversity levels within protein 1 (P1) and coat protein (CP) cistrons over time. Genetic variation was assessed at passages 9, 18 and 24 by single-strand conformation polymorphism, followed by nucleotide sequencing. The founding P1 region genotype was retained at high frequencies in most lineage/passage populations, while the founding CP genotype disappeared after passage 18 in two lineages. We found that rare TriMV genotypes were present only transiently and lineages followed independent evolutionary trajectories, suggesting that genetic drift dominates TriMV evolution. These results further suggest that experimental populations of TriMV exhibit lower mutant frequencies than that of Wheat streak mosaic virus (genus Tritimovirus; family Potyviridae) in wheat. Nevertheless, there was evidence for parallel evolution at a synonymous site in the TriMV CP cistron. Published by Elsevier Inc.
Polymorphisms of the artemisinin resistant marker (K13) in Plasmodium falciparum parasite populations of Grande Comore Island 10 years after artemisinin combination therapy.

PubMed

Huang, Bo; Deng, Changsheng; Yang, Tao; Xue, Linlu; Wang, Qi; Huang, Shiguang; Su, Xin-zhuan; Liu, Yajun; Zheng, Shaoqin; Guan, Yezhi; Xu, Qin; Zhou, Jiuyao; Yuan, Jie; Bacar, Afane; Abdallah, Kamal Said; Attoumane, Rachad; Mliva, Ahamada M S A; Zhong, Yanchun; Lu, Fangli; Song, Jianping

2015-12-15

Plasmodium falciparum malaria is a significant public health problem in Comoros, and artemisinin combination therapy (ACT) remains the first choice for treating acute uncomplicated P. falciparum. The emergence and spread of artemisinin-resistant P. falciparum in Southeast Asia, associated with mutations in K13-propeller gene, poses a potential threat to ACT efficacy. Detection of mutations in the P. falciparum K13-propeller gene may provide the first-hand information on changes in parasite susceptibility to artemisinin. The objective of this study is to determinate the prevalence of mutant K13-propeller gene among the P. falciparum isolates collected from Grande Comore Island, Union of Comoros, where ACT has been in use since 2004. A total of 207 P. falciparum clinical isolates were collected from the island during March 2006 and October 2007 (n = 118) and March 2013 and December 2014 (n = 89). All isolates were analysed for single nucleotide polymorphisms (SNPs) and haplotypes in the K13-propeller gene using nested PCR and DNA sequencing. Only three 2006-2007 samples carried SNPs in the K13-propeller gene, one having a synonymous (G538G) and the other having two non-synonymous (S477Y and D584E) substitutions leading to two mutated haplotypes (2.2%, 2/95). Three synonymous mutations (R471R, Y500Y, and G538G) (5.9%, 5/85) and 7 non-synonymous substitutions (21.2%, 18/85) with nine mutated haplotypes (18.8%, 16/85) were found in isolates from 2013 to 2014. However, none of the polymorphisms associated with artemisinin-resistance in Southeast Asia was detected from any of the parasites examined. This study showed increased K13-propeller gene diversity among P. falciparum populations on the Island over the course of 8 years (2006-2014). Nevertheless, none of the polymorphisms known to be associated with artemisinin resistance in Asia was detected in the parasite populations examined. Our data suggest that P. falciparum populations in Grande Comore are still effectively susceptible to artemisinin. Our results provide insights into P. falciparum populations regarding mutations in the gene associated with artemisinin resistance and will be useful for developing and updating anti-malarial guidance in Comoros.
Reclassification of Xuhuaishuia manganoxidans Wang et al. 2015 as a later heterotypic synonym of Brevirhabdus pacifica Wu et al. 2015 and emendation of the species description.

PubMed

Liu, Yang; Lai, Qiliang; Xu, Xue-Wei; Wu, Yue-Hong; Cheng, Hong; Zhang, Xiao-Hua; Wang, Long; Shao, Zongze

2017-08-01

A polyphasic taxonomic study was undertaken to clarify the exact position of type strain DY6-4T of Xuhuaishuia manganoxidans. A combination of physiological properties of X. manganoxidans DY6-4T was consistent with those of type strain 22DY15T of Brevirhabdus pacifica. The 16S rRNA gene sequence analyses indicated that X. manganoxidans DY6-4T and B. pacifica 22DY15T shared 100 % similarity and formed a monophyletic group. The close relationship between the two strains was underpinned by the results of chemotaxonomic characteristics, including the fatty acids, quinone and polar lipids. The digital DNA-DNA hybridization and average nucleotide identity values between the two strains were 99.90 and 99.98 %, respectively. Based on these results, we propose that Xuhuaishuia manganoxidans is a later heterotypic synonym of Brevirhabdus pacifica.
Functional effect of grapevine 1-deoxy-D-xylulose 5-phosphate synthase substitution K284N on Muscat flavour formation

PubMed Central

Battilana, Juri; Emanuelli, Francesco; Gambino, Giorgio; Gribaudo, Ivana; Gasperi, Flavia; Boss, Paul K.; Grando, Maria Stella

2011-01-01

Grape berries of Muscat cultivars (Vitis vinifera L.) contain high levels of monoterpenols and exhibit a distinct aroma related to this composition of volatiles. A structural gene of the plastidial methyl-erythritol-phosphate (MEP) pathway, 1-deoxy-D-xylulose 5-phosphate synthase (VvDXS), was recently suggested as a candidate gene for this trait, having been co-localized with a major quantitative trait locus for linalool, nerol, and geraniol concentrations in berries. In addition, a structured association study discovered a putative causal single nucleotide polymorphism (SNP) responsible for the substitution of a lysine with an asparagine at position 284 of the VvDXS protein, and this SNP was significantly associated with Muscat-flavoured varieties. The significance of this nucleotide difference was investigated by comparing the monoterpene profiles with the expression of VvDXS alleles throughout berry development in Moscato Bianco, a cultivar heterozygous for the SNP mutation. Although correlation was detected between the VvDXS transcript profile and the accumulation of free monoterpenol odorants, the modulation of VvDXS expression during berry development appears to be independent of nucleotide variation in the coding sequence. In order to assess how the non-synonymous mutation may enhance Muscat flavour, an in vitro characterization of enzyme isoforms was performed followed by in vivo overexpression of each VvDXS allele in tobacco. The results showed that the amino acid non-neutral substitution influences the enzyme kinetics by increasing the catalytic efficiency and also dramatically affects monoterpene levels in transgenic lines. These findings confirm a functional effect of the VvDXS gene polymorphism and may pave the way for metabolic engineering of terpenoid contents in grapevine. PMID:21868399

T/T homozygosity of the tenascin-C gene polymorphism rs2104772 negatively influences exercise-induced angiogenesis.

PubMed

Valdivieso, Paola; Toigo, Marco; Hoppeler, Hans; Flück, Martin

2017-01-01

Mechanical stress, including blood pressure related factors, up-regulate expression of the pro-angiogenic extracellular matrix protein tenascin-C in skeletal muscle. We hypothesized that increased capillarization of skeletal muscle with the repeated augmentation in perfusion during endurance training is associated with blood vessel-related expression of tenascin-C and would be affected by the single-nucleotide polymorphism (SNP) rs2104772, which characterizes the non-synonymous exchange of thymidine (T)-to-adenosine (A) in the amino acid codon 1677 of tenascin-C. Sixty-one healthy, untrained, male white participants of Swiss descent performed thirty 30-min bouts of endurance exercise on consecutive weekdays using a cycling ergometer. Genotype and training interactions were called significant at Bonferroni-corrected p-value of 5% (repeated measures ANOVA). Endurance training increased capillary-to-fiber-ratio (+11%), capillary density (+7%), and mitochondrial volume density (+30%) in m. vastus lateralis. Tenascin-C protein expression in this muscle was confined to arterioles and venules (80% of cases) and increased after training in A-allele carriers. Prior to training, volume densities of subsarcolemmal and myofibrillar mitochondria in m. vastus lateralis muscle were 49% and 18%, respectively, higher in A/A homozygotes relative to T-nucleotide carriers (A/T and T/T). Training specifically increased capillary-to-fiber ratio in A-nucleotide carriers but not in T/T homozygotes. Genotype specific regulation of angiogenesis was reflected by the expression response of 8 angiogenesis-associated transcripts after exercise, and confirmed by training-induced alterations of the shear stress related factors, vimentin and VEGF A. Our findings provide evidence for a negative influence of T/T homozygosity in rs2104772 on capillary remodeling with endurance exercise.
T/T homozygosity of the tenascin-C gene polymorphism rs2104772 negatively influences exercise-induced angiogenesis

PubMed Central

Toigo, Marco; Hoppeler, Hans

2017-01-01

Background Mechanical stress, including blood pressure related factors, up-regulate expression of the pro-angiogenic extracellular matrix protein tenascin-C in skeletal muscle. We hypothesized that increased capillarization of skeletal muscle with the repeated augmentation in perfusion during endurance training is associated with blood vessel-related expression of tenascin-C and would be affected by the single-nucleotide polymorphism (SNP) rs2104772, which characterizes the non-synonymous exchange of thymidine (T)-to-adenosine (A) in the amino acid codon 1677 of tenascin-C. Methods Sixty-one healthy, untrained, male white participants of Swiss descent performed thirty 30-min bouts of endurance exercise on consecutive weekdays using a cycling ergometer. Genotype and training interactions were called significant at Bonferroni-corrected p-value of 5% (repeated measures ANOVA). Results Endurance training increased capillary-to-fiber-ratio (+11%), capillary density (+7%), and mitochondrial volume density (+30%) in m. vastus lateralis. Tenascin-C protein expression in this muscle was confined to arterioles and venules (80% of cases) and increased after training in A-allele carriers. Prior to training, volume densities of subsarcolemmal and myofibrillar mitochondria in m. vastus lateralis muscle were 49% and 18%, respectively, higher in A/A homozygotes relative to T-nucleotide carriers (A/T and T/T). Training specifically increased capillary-to-fiber ratio in A-nucleotide carriers but not in T/T homozygotes. Genotype specific regulation of angiogenesis was reflected by the expression response of 8 angiogenesis-associated transcripts after exercise, and confirmed by training-induced alterations of the shear stress related factors, vimentin and VEGF A. Conclusion Our findings provide evidence for a negative influence of T/T homozygosity in rs2104772 on capillary remodeling with endurance exercise. PMID:28384286
Purifying Selection on Exonic Splice Enhancers in Intronless Genes

PubMed Central

Savisaar, Rosina; Hurst, Laurence D.

2016-01-01

Exonic splice enhancers (ESEs) are short nucleotide motifs, enriched near exon ends, that enhance the recognition of the splice site and thus promote splicing. Are intronless genes under selection to avoid these motifs so as not to attract the splicing machinery to an mRNA that should not be spliced, thereby preventing the production of an aberrant transcript? Consistent with this possibility, we find that ESEs in putative recent retrocopies are at a higher density and evolving faster than those in other intronless genes, suggesting that they are being lost. Moreover, intronless genes are less dense in putative ESEs than intron-containing ones. However, this latter difference is likely due to the skewed base composition of intronless sequences, a skew that is in line with the general GC richness of few exon genes. Indeed, after controlling for such biases, we find that both intronless and intron-containing genes are denser in ESEs than expected by chance. Importantly, nucleotide-controlled analysis of evolutionary rates at synonymous sites in ESEs indicates that the ESEs in intronless genes are under purifying selection in both human and mouse. We conclude that on the loss of introns, some but not all, ESE motifs are lost, the remainder having functions beyond a role in splice promotion. These results have implications for the design of intronless transgenes and for understanding the causes of selection on synonymous sites. PMID:26802218
Identification of Novel Single Nucleotide Polymorphisms Associated with Acute Respiratory Distress Syndrome by Exome-Seq

PubMed Central

Shortt, Katherine; Chaudhary, Suman; Grigoryev, Dmitry; Heruth, Daniel P.; Venkitachalam, Lakshmi; Zhang, Li Q.; Ye, Shui Q.

2014-01-01

Acute respiratory distress syndrome (ARDS) is a lung condition characterized by impaired gas exchange with systemic release of inflammatory mediators, causing pulmonary inflammation, vascular leak and hypoxemia. Existing biomarkers have limited effectiveness as diagnostic and therapeutic targets. To identify disease-associating variants in ARDS patients, whole-exome sequencing was performed on 96 ARDS patients, detecting 1,382,399 SNPs. By comparing these exome data to those of the 1000 Genomes Project, we identified a number of single nucleotide polymorphisms (SNP) which are potentially associated with ARDS. 50,190SNPs were found in all case subgroups and controls, of which89 SNPs were associated with susceptibility. We validated three SNPs (rs78142040, rs9605146 and rs3848719) in additional ARDS patients to substantiate their associations with susceptibility, severity and outcome of ARDS. rs78142040 (C>T) occurs within a histone mark (intron 6) of the Arylsulfatase D gene. rs9605146 (G>A) causes a deleterious coding change (proline to leucine) in the XK, Kell blood group complex subunit-related family, member 3 gene. rs3848719 (G>A) is a synonymous SNP in the Zinc-Finger/Leucine-Zipper Co-Transducer NIF1 gene. rs78142040, rs9605146, and rs3848719 are associated significantly with susceptibility to ARDS. rs3848719 is associated with APACHE II score quartile. rs78142040 is associated with 60-day mortality in the overall ARDS patient population. Exome-seq is a powerful tool to identify potential new biomarkers for ARDS. We selectively validated three SNPs which have not been previously associated with ARDS and represent potential new genetic biomarkers for ARDS. Additional validation in larger patient populations and further exploration of underlying molecular mechanisms are warranted. PMID:25372662
A non-synonymous SNP within the isopentenyl transferase 2 locus is associated with kernel weight in Chinese maize inbreds (Zea mays L.).

PubMed

Weng, Jianfeng; Li, Bo; Liu, Changlin; Yang, Xiaoyan; Wang, Hongwei; Hao, Zhuanfang; Li, Mingshun; Zhang, Degui; Ci, Xiaoke; Li, Xinhai; Zhang, Shihuang

2013-07-05

Kernel weight, controlled by quantitative trait loci (QTL), is an important component of grain yield in maize. Cytokinins (CKs) participate in determining grain morphology and final grain yield in crops. ZmIPT2, which is expressed mainly in the basal transfer cell layer, endosperm, and embryo during maize kernel development, encodes an isopentenyl transferase (IPT) that is involved in CK biosynthesis. The coding region of ZmIPT2 was sequenced across a panel of 175 maize inbred lines that are currently used in Chinese maize breeding programs. Only 16 single nucleotide polymorphisms (SNPs) and seven haplotypes were detected among these inbred lines. Nucleotide diversity (π) within the ZmIPT2 window and coding region were 0.347 and 0.0047, respectively, and they were significantly lower than the mean nucleotide diversity value of 0.372 for maize Chromosome 2 (P < 0.01). Association mapping revealed that a single nucleotide change from cytosine (C) to thymine (T) in the ZmIPT2 coding region, which converted a proline residue into a serine residue, was significantly associated with hundred kernel weight (HKW) in three environments (P <0.05), and explained 4.76% of the total phenotypic variation. In vitro characterization suggests that the dimethylallyl diphospate (DMAPP) IPT activity of ZmIPT2-T is higher than that of ZmIPT2-C, as the amounts of adenosine triphosphate (ATP), adenosine diphosphate (ADP), and adenosine monophosphate (AMP) consumed by ZmIPT2-T were 5.48-, 2.70-, and 1.87-fold, respectively, greater than those consumed by ZmIPT2-C. The effects of artificial selection on the ZmIPT2 coding region were evaluated using Tajima's D tests across six subgroups of Chinese maize germplasm, with the most frequent favorable allele identified in subgroup PB (Partner B). These results showed that ZmIPT2, which is associated with kernel weight, was subjected to artificial selection during the maize breeding process. ZmIPT2-T had higher IPT activity than ZmIPT2-C, and this favorable allele for kernel weight could be used in molecular marker-assisted selection for improvement of grain yield components in Chinese maize breeding programs.
Genetic diversity and natural selection of Plasmodium knowlesi merozoite surface protein 1 paralog gene in Malaysia.

PubMed

Ahmed, Md Atique; Fauzi, Muh; Han, Eun-Taek

2018-03-14

Human infections due to the monkey malaria parasite Plasmodium knowlesi is on the rise in most Southeast Asian countries specifically Malaysia. The C-terminal 19 kDa domain of PvMSP1P is a potential vaccine candidate, however, no study has been conducted in the orthologous gene of P. knowlesi. This study investigates level of polymorphisms, haplotypes and natural selection of full-length pkmsp1p in clinical samples from Malaysia. A total of 36 full-length pkmsp1p sequences along with the reference H-strain and 40 C-terminal pkmsp1p sequences from clinical isolates of Malaysia were downloaded from published genomes. Genetic diversity, polymorphism, haplotype and natural selection were determined using DnaSP 5.10 and MEGA 5.0 software. Genealogical relationships were determined using haplotype network tree in NETWORK software v5.0. Population genetic differentiation index (F ST ) and population structure of parasite was determined using Arlequin v3.5 and STRUCTURE v2.3.4 software. Comparison of 36 full-length pkmsp1p sequences along with the H-strain identified 339 SNPs (175 non-synonymous and 164 synonymous substitutions). The nucleotide diversity across the full-length gene was low compared to its ortholog pvmsp1p. The nucleotide diversity was higher toward the N-terminal domains (pkmsp1p-83 and 30) compared to the C-terminal domains (pkmsp1p-38, 33 and 19). Phylogenetic analysis of full-length genes identified 2 distinct clusters of P. knowlesi from Malaysian Borneo. The 40 pkmsp1p-19 sequences showed low polymorphisms with 16 polymorphisms leading to 18 haplotypes. In total there were 10 synonymous and 6 non-synonymous substitutions and 12 cysteine residues were intact within the two EGF domains. Evidence of strong purifying selection was observed within the full-length sequences as well in all the domains. Shared haplotypes of 40 pkmsp1p-19 were identified within Malaysian Borneo haplotypes. This study is the first to report on the genetic diversity and natural selection of pkmsp1p. A low level of genetic diversity and strong evidence of negative selection was detected and observed in all the domains of pkmsp1p of P. knowlesi indicating functional constrains. Shared haplotypes were identified within pkmsp1p-19 highlighting further evaluation using larger number of clinical samples from Malaysia.
Bactericidal activity of tracheal antimicrobial peptide against respiratory pathogens of cattle.

PubMed

Taha-Abdelaziz, Khaled; Perez-Casal, José; Schott, Courtney; Hsiao, Jason; Attah-Poku, Samuel; Slavić, Durđa; Caswell, Jeff L

2013-04-15

Tracheal antimicrobial peptide (TAP) is a β-defensin produced by mucosal epithelial cells of cattle. Although effective against several human pathogens, the activity of this bovine peptide against the bacterial pathogens that cause bovine respiratory disease have not been reported. This study compared the antibacterial effects of synthetic TAP against Mannheimia haemolytica, Histophilus somni, Pasteurella multocida, and Mycoplasma bovis. Bactericidal activity against M. bovis was not detected. In contrast, the Pasteurellaceae bacteria showed similar levels of susceptibility to that of Escherichia coli, with 0.125μg TAP inhibiting growth in a radial diffusion assay and minimum inhibitory concentrations of 1.56-6.25μg/ml in a bactericidal assay. Significant differences among isolates were not observed. Sequencing of exon 2 of the TAP gene from 23 cattle revealed a prevalent non-synonymous single nucleotide polymorphism (SNP) A137G, encoding either serine or asparagine at residue 20 of the mature peptide. The functional effect of this SNP was tested against M. haemolytica using synthetic peptides. The bactericidal effect of the asparagine-containing peptide was consistently higher than the serine-containing peptide. Bactericidal activities were similar for an acapsular mutant of M. haemolytica compared to the wild type. These findings indicate that the Pasteurellaceae bacteria that cause bovine respiratory disease are susceptible to killing by bovine TAP and appear not to have evolved resistance, whereas M. bovis appears to be resistant. A non-synonymous SNP was identified in the coding region of the TAP gene, and the corresponding peptides vary in their bactericidal activity against M. haemolytica. Copyright © 2013 Elsevier B.V. All rights reserved.
SNP Discovery in the Transcriptome of White Pacific Shrimp Litopenaeus vannamei by Next Generation Sequencing

PubMed Central

Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

2014-01-01

The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047
Pharmacogenetics of human 3'-phosphoadenosine 5'-phosphosulfate synthetase 1 (PAPSS1): gene resequencing, sequence variation, and functional genomics.

PubMed

Xu, Zhen-Hua; Thomae, Bianca A; Eckloff, Bruce W; Wieben, Eric D; Weinshilboum, Richard M

2003-06-01

3'-Phosphoadenosine 5'-phosphosulfate (PAPS) is the high-energy "sulfate donor" for reactions catalyzed by sulfotransferase (SULT) enzymes. The strict requirement of SULTs for PAPS suggests that PAPS synthesis might influence the rate of sulfate conjugation. In humans, PAPS is synthesized from ATP and SO(4)(2-) by two isoforms of PAPS synthetase (PAPSS): PAPSS1 and PAPSS2. As a step toward pharmacogenetic studies, we have resequenced the entire coding sequence of the human PAPSS1 gene, including exon-intron splice junctions, using DNA samples from 60 Caucasian-American and 58 African-American subjects. Twenty-one genetic polymorphisms were observed-1 insertion-deletion event and 20 single nucleotide polymorphisms (SNPs)-including two non-synonymous coding SNPs (cSNPs) that altered the following amino acids: Arg333Cys and Glu531Gln. Twelve pairs of these polymorphisms were tightly linked, and a total of twelve unequivocal haplotypes could be identified-two that were common to both ethnic groups and ten that were ethnic-specific. The Arg333Cys polymorphism, with an allele frequency of 2.5%, was observed only in DNA samples from Caucasian subjects. The Glu531Gln polymorphism was rare, with only a single copy of that allele in a DNA sample from an African-American subject. Transient expression in mammalian cells showed that neither of the non-synonymous cSNPs resulted in a change in the basal level of enzyme activity measured under optimal assay conditions. However, the Glu531Gln polymorphism altered the substrate kinetic properties of the enzyme. The Gln531 variant allozyme had a 5-fold higher K(m) value for SO(4)(2-) than did the wild-type allozyme and displayed monophasic kinetics for Na(2)SO(4). The wild-type allozyme (Glu531) showed biphasic kinetics for that substrate. These observations represent a step toward testing the hypothesis that genetic variation in PAPS synthesis catalyzed by PAPSS1 might alter in vivo sulfate conjugation.
In Silico Analysis of Single Nucleotide Polymorphism (SNPs) in Human β-Globin Gene

PubMed Central

Alanazi, Mohammed; Abduljaleel, Zainularifeen; Khan, Wajahatullah; Warsy, Arjumand S.; Elrobh, Mohamed; Khan, Zahid; Amri, Abdullah Al; Bazzi, Mohammad D.

2011-01-01

Single amino acid substitutions in the globin chain are the most common forms of genetic variations that produce hemoglobinopathies- the most widespread inherited disorders worldwide. Several hemoglobinopathies result from homozygosity or compound heterozygosity to beta-globin (HBB) gene mutations, such as that producing sickle cell hemoglobin (HbS), HbC, HbD and HbE. Several of these mutations are deleterious and result in moderate to severe hemolytic anemia, with associated complications, requiring lifelong care and management. Even though many hemoglobinopathies result from single amino acid changes producing similar structural abnormalities, there are functional differences in the generated variants. Using in silico methods, we examined the genetic variations that can alter the expression and function of the HBB gene. Using a sequence homology-based Sorting Intolerant from Tolerant (SIFT) server we have searched for the SNPs, which showed that 200 (80%) non-synonymous polymorphism were found to be deleterious. The structure-based method via PolyPhen server indicated that 135 (40%) non-synonymous polymorphism may modify protein function and structure. The Pupa Suite software showed that the SNPs will have a phenotypic consequence on the structure and function of the altered protein. Structure analysis was performed on the key mutations that occur in the native protein coded by the HBB gene that causes hemoglobinopathies such as: HbC (E→K), HbD (E→Q), HbE (E→K) and HbS (E→V). Atomic Non-Local Environment Assessment (ANOLEA), Yet Another Scientific Artificial Reality Application (YASARA), CHARMM-GUI webserver for macromolecular dynamics and mechanics, and Normal Mode Analysis, Deformation and Refinement (NOMAD-Ref) of Gromacs server were used to perform molecular dynamics simulations and energy minimization calculations on β-Chain residue of the HBB gene before and after mutation. Furthermore, in the native and altered protein models, amino acid residues were determined and secondary structures were observed for solvent accessibility to confirm the protein stability. The functional study in this investigation may be a good model for additional future studies. PMID:22028795
Non-synonymous single nucleotide polymorphisms in the watermelon eIF4E gene are closely associated with resistance to zucchini yellow mosaic virus.

PubMed

Ling, Kai-Shu; Harris, Karen R; Meyer, Jenelle D F; Levi, Amnon; Guner, Nihat; Wehner, Todd C; Bendahmane, Abdelhafid; Havey, Michael J

2009-12-01

Zucchini yellow mosaic virus (ZYMV) is one of the most economically important potyviruses infecting cucurbit crops worldwide. Using a candidate gene approach, we cloned and sequenced eIF4E and eIF(iso)4E gene segments in watermelon. Analysis of the nucleotide sequences between the ZYMV-resistant watermelon plant introduction PI 595203 (Citrullus lanatus var. lanatus) and the ZYMV-susceptible watermelon cultivar 'New Hampshire Midget' ('NHM') showed the presence of single nucleotide polymorphisms (SNPs). Initial analysis of the identified SNPs in association studies indicated that SNPs in the eIF4E, but not eIF(iso)4E, were closely associated to the phenotype of ZYMV-resistance in 70 F(2) and 114 BC(1R) progenies. Subsequently, we focused our efforts in obtaining the entire genomic sequence of watermelon eIF4E. Three SNPs were identified between PI 595203 and NHM. One of the SNPs (A241C) was in exon 1 and the other two SNPs (C309A and T554G) were in the first intron of the gene. SNP241 which resulted in an amino acid substitution (proline to threonine) was shown to be located in the critical cap recognition and binding area, similar to that of several plant species resistance to potyviruses. Analysis of a cleaved amplified polymorphism sequence (CAPS) marker derived from this SNP in F(2) and BC(1R) populations demonstrated a cosegregation between the CAPS-2 marker and their ZYMV resistance or susceptibility phenotype. When we investigated whether such SNP mutation in the eIF4E was also conserved in several other PIs of C. lanatus var. citroides, we identified a different SNP (A171G) resulting in another amino acid substitution (D71G) from four ZYMV-resistant C. lanatus var. citroides (PI 244018, PI 482261, PI 482299, and PI 482322). Additional CAPS markers were also identified. Availability of all these CAPS markers will enable marker-aided breeding of watermelon for ZYMV resistance.
Quantum Point Contact Single-Nucleotide Conductance for DNA and RNA Sequence Identification.

PubMed

Afsari, Sepideh; Korshoj, Lee E; Abel, Gary R; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant

2017-11-28

Several nanoscale electronic methods have been proposed for high-throughput single-molecule nucleic acid sequence identification. While many studies display a large ensemble of measurements as "electronic fingerprints" with some promise for distinguishing the DNA and RNA nucleobases (adenine, guanine, cytosine, thymine, and uracil), important metrics such as accuracy and confidence of base calling fall well below the current genomic methods. Issues such as unreliable metal-molecule junction formation, variation of nucleotide conformations, insufficient differences between the molecular orbitals responsible for single-nucleotide conduction, and lack of rigorous base calling algorithms lead to overlapping nanoelectronic measurements and poor nucleotide discrimination, especially at low coverage on single molecules. Here, we demonstrate a technique for reproducible conductance measurements on conformation-constrained single nucleotides and an advanced algorithmic approach for distinguishing the nucleobases. Our quantum point contact single-nucleotide conductance sequencing (QPICS) method uses combed and electrostatically bound single DNA and RNA nucleotides on a self-assembled monolayer of cysteamine molecules. We demonstrate that by varying the applied bias and pH conditions, molecular conductance can be switched ON and OFF, leading to reversible nucleotide perturbation for electronic recognition (NPER). We utilize NPER as a method to achieve >99.7% accuracy for DNA and RNA base calling at low molecular coverage (∼12×) using unbiased single measurements on DNA/RNA nucleotides, which represents a significant advance compared to existing sequencing methods. These results demonstrate the potential for utilizing simple surface modifications and existing biochemical moieties in individual nucleobases for a reliable, direct, single-molecule, nanoelectronic DNA and RNA nucleotide identification method for sequencing.
Clonal population structure of Legionella pneumophila inferred from allelic profiling.

PubMed

Edwards, Martin T; Fry, Norman K; Harrison, Timothy G

2008-03-01

The population structure of Legionella pneumophila was investigated by analysing nucleotide sequences from six loci (flaA, pilE, asd, mip, mompS and proA) of 335 globally distributed isolates from clinical and environmental sources over a 29-year period (1977-2006). Data were obtained from unrelated isolates from Europe (n=270), Japan (n=31), Canada (n=7), the USA (n=24) and Australia (n=1). The country of origin of two strains was unknown. Analysis of these isolates indicated significant linkage disequilibrium between the six loci. Application of six sequence-based recombination detection tests did not reveal evidence of recombination, but estimates of rates of recombination and mutation made by a seventh test suggested that recombination could have occurred at a rate similar to, but probably lower than, that of mutation. Genealogies inferred under models with and without recombination were congruent with each other, providing no definitive evidence regarding recombination, and were in agreement with sequence clusters identified by graph methods. Further evidence supporting the distinct nature of two of the three subspecies of L. pneumophila, subsp. fraseri and subsp. pascullei, was also found. The ratios of non-synonymous to synonymous nucleotide polymorphisms for each of the allele sets were examined and revealed that the putative virulence loci mompS and pilE are under diversifying pressure, while the allelic regions of three other loci linked to virulence (flaA, proA and mip) do not appear to be.
Estimating the parameters of background selection and selective sweeps in Drosophila in the presence of gene conversion

PubMed Central

Campos, José Luis; Charlesworth, Brian

2017-01-01

We used whole-genome resequencing data from a population of Drosophila melanogaster to investigate the causes of the negative correlation between the within-population synonymous nucleotide site diversity (πS) of a gene and its degree of divergence from related species at nonsynonymous nucleotide sites (KA). By using the estimated distributions of mutational effects on fitness at nonsynonymous and UTR sites, we predicted the effects of background selection at sites within a gene on πS and found that these could account for only part of the observed correlation between πS and KA. We developed a model of the effects of selective sweeps that included gene conversion as well as crossing over. We used this model to estimate the average strength of selection on positively selected mutations in coding sequences and in UTRs, as well as the proportions of new mutations that are selectively advantageous. Genes with high levels of selective constraint on nonsynonymous sites were found to have lower strengths of positive selection and lower proportions of advantageous mutations than genes with low levels of constraint. Overall, background selection and selective sweeps within a typical gene reduce its synonymous diversity to ∼75% of its value in the absence of selection, with larger reductions for genes with high KA. Gene conversion has a major effect on the estimates of the parameters of positive selection, such that the estimated strength of selection on favorable mutations is greatly reduced if it is ignored. PMID:28559322
The population genomics of rhesus macaques (Macaca mulatta) based on whole-genome sequences

PubMed Central

Xue, Cheng; Raveendran, Muthuswamy; Harris, R. Alan; Fawcett, Gloria L.; Liu, Xiaoming; White, Simon; Dahdouli, Mahmoud; Rio Deiros, David; Below, Jennifer E.; Salerno, William; Cox, Laura; Fan, Guoping; Ferguson, Betsy; Horvath, Julie; Johnson, Zach; Kanthaswamy, Sree; Kubisch, H. Michael; Liu, Dahai; Platt, Michael; Smith, David G.; Sun, Binghua; Vallender, Eric J.; Wang, Feng; Wiseman, Roger W.; Chen, Rui; Muzny, Donna M.; Gibbs, Richard A.; Yu, Fuli; Rogers, Jeffrey

2016-01-01

Rhesus macaques (Macaca mulatta) are the most widely used nonhuman primate in biomedical research, have the largest natural geographic distribution of any nonhuman primate, and have been the focus of much evolutionary and behavioral investigation. Consequently, rhesus macaques are one of the most thoroughly studied nonhuman primate species. However, little is known about genome-wide genetic variation in this species. A detailed understanding of extant genomic variation among rhesus macaques has implications for the use of this species as a model for studies of human health and disease, as well as for evolutionary population genomics. Whole-genome sequencing analysis of 133 rhesus macaques revealed more than 43.7 million single-nucleotide variants, including thousands predicted to alter protein sequences, transcript splicing, and transcription factor binding sites. Rhesus macaques exhibit 2.5-fold higher overall nucleotide diversity and slightly elevated putative functional variation compared with humans. This functional variation in macaques provides opportunities for analyses of coding and noncoding variation, and its cellular consequences. Despite modestly higher levels of nonsynonymous variation in the macaques, the estimated distribution of fitness effects and the ratio of nonsynonymous to synonymous variants suggest that purifying selection has had stronger effects in rhesus macaques than in humans. Demographic reconstructions indicate this species has experienced a consistently large but fluctuating population size. Overall, the results presented here provide new insights into the population genomics of nonhuman primates and expand genomic information directly relevant to primate models of human disease. PMID:27934697
Whole-exome sequencing of 228 patients with sporadic Parkinson's disease.

PubMed

Sandor, Cynthia; Honti, Frantisek; Haerty, Wilfried; Szewczyk-Krolikowski, Konrad; Tomlinson, Paul; Evetts, Sam; Millin, Stephanie; Keane, Thomas; McCarthy, Shane A; Durbin, Richard; Talbot, Kevin; Hu, Michele; Webber, Caleb; Ponting, Chris P; Wade-Martins, Richard

2017-01-24

Parkinson's disease (PD) is the most common neurodegenerative movement disorder, affecting 1% of the population over 65 years characterized clinically by both motor and non-motor symptoms accompanied by the preferential loss of dopamine neurons in the substantia nigra pars compacta. Here, we sequenced the exomes of 244 Parkinson's patients selected from the Oxford Parkinson's Disease Centre Discovery Cohort and, after quality control, 228 exomes were available for analyses. The PD patient exomes were compared to 884 control exomes selected from the UK10K datasets. No single non-synonymous (NS) single nucleotide variant (SNV) nor any gene carrying a higher burden of NS SNVs was significantly associated with PD status after multiple-testing correction. However, significant enrichments of genes whose proteins have roles in the extracellular matrix were amongst the top 300 genes with the most significantly associated NS SNVs, while regions associated with PD by a recent Genome Wide Association (GWA) study were enriched in genes containing PD-associated NS SNVs. By examining genes within GWA regions possessing rare PD-associated SNVs, we identified RAD51B. The protein-product of RAD51B interacts with that of its paralogue RAD51, which is associated with congenital mirror movements phenotypes, a phenotype also comorbid with PD.
Association of a novel SNP in exon 10 of the IGF2 gene with growth traits in Egyptian water buffalo (Bubalus bubalis).

PubMed

Abo-Al-Ela, Haitham G; El-Magd, Mohammed Abu; El-Nahas, Abeer F; Mansour, Ali A

2014-08-01

Insulin-like growth factor 2 (IGF2) plays an important role in muscle growth and it might be used as a marker for the growth traits selection strategies in farm animals. The objectives of this study were to detect polymorphisms in exon 10 of IGF2 and to determine associations between these polymorphisms and growth traits in Egyptian water buffalo. PCR-single-strand conformation polymorphism (SSCP) and DNA sequencing methods were used to detect any prospective polymorphism. A novel single nucleotide polymorphism (SNP), C287A, was detected. It was a non-synonymous mutation and led to replacement of glutamine (Q) amino acid (aa) by histidine (H) aa. Three different SSCP patterns were observed: AA, AC, and CC, with frequencies of 0.540, 0.325, and 0.135, respectively. Association analyses revealed that the AA individuals had a higher average daily gain (ADG) than other individuals (CC and AC) from birth to 9 months of age. We conclude that the AA genotype in C287A SNP in the exon 10 of the IGF2 gene is associated with the ADG during the age from birth to 9 months and could be used as a potential genetic marker for selection of growth traits in Egyptian buffalo.
Alternative SNP detection platforms, HRM and biosensors, for varietal identification in Vitis vinifera L. using F3H and LDOX genes.

PubMed

Gomes, Sónia; Castro, Cláudia; Barrias, Sara; Pereira, Leonor; Jorge, Pedro; Fernandes, José R; Martins-Lopes, Paula

2018-04-11

The wine sector requires quick and reliable methods for Vitis vinifera L. varietal identification. The number of V. vinifera varieties is estimated in about 5,000 worldwide. Single Nucleotide Polymorphisms (SNPs) represent the most basic and abundant form of genetic sequence variation, being adequate for varietal discrimination. The aim of this work was to develop DNA-based assays suitable to detect SNP variation in V. vinifera, allowing varietal discrimination. Genotyping by sequencing allowed the detection of eleven SNPs on two genes of the anthocyanin pathway, the flavanone 3-hydroxylase (F3H, EC: 1.14.11.9), and the leucoanthocyanidin dioxygenase (LDOX, EC 1.14.11.19; synonym anthocyanidin synthase, ANS) in twenty V. vinifera varieties. Three High Resolution Melting (HRM) assays were designed based on the sequencing information, discriminating five of the 20 varieties: Alicante Bouschet, Donzelinho Tinto, Merlot, Moscatel Galego and Tinta Roriz. Sanger sequencing of the HRM assay products confirmed the HRM profiles. Three probes, with different lengths and sequences, were used as bio-recognition elements in an optical biosensor platform based on a long period grating (LPG) fiber optic sensor. The label free platform detected a difference of a single SNP using genomic DNA samples. The two different platforms were successfully applied for grapevine varietal identification.
Polymorphisms in the Tlr4 and Tlr5 Gene Are Significantly Associated with Inflammatory Bowel Disease in German Shepherd Dogs

PubMed Central

Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin

2010-01-01

Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease. PMID:21203467
Polymorphisms in the TLR4 and TLR5 gene are significantly associated with inflammatory bowel disease in German shepherd dogs.

PubMed

Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin

2010-12-23

Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease.

Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning

PubMed Central

Zhao, Nan; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry

2014-01-01

Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of large-scale protein-protein interaction networks, and can be useful for functional annotation of disease-associated SNPs. SNIP-IN tool is freely accessible as a web-server at http://korkinlab.org/snpintool/. PMID:24784581
Single Color Multiplexed ddPCR Copy Number Measurements and Single Nucleotide Variant Genotyping.

PubMed

Wood-Bouwens, Christina M; Ji, Hanlee P

2018-01-01

Droplet digital PCR (ddPCR) allows for accurate quantification of genetic events such as copy number variation and single nucleotide variants. Probe-based assays represent the current "gold-standard" for detection and quantification of these genetic events. Here, we introduce a cost-effective single color ddPCR assay that allows for single genome resolution quantification of copy number and single nucleotide variation.
Large-Scale Genomic Analysis of Codon Usage in Dengue Virus and Evaluation of Its Phylogenetic Dependence

PubMed Central

Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro

2014-01-01

The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631
[Molecular evolution of the tick-borne encephalitis and Powassan viruses].

PubMed

Subbotina, E L; Loktev, V B

2012-01-01

The problem of emerging viruses, their genetic diversity and viral evolution in nature are attracting more attention. The phylogenetic analysis and evaluationary rate estimation were made for pathogenic flaviviruses such as tick-borne encephalitis virus (TBEV) and Powassan (PV) circulated in natural foci in Russia. 47 nucleotide sequences of encoded protein E of the TBEV and 17 sequences of NS5 genome region of the PV have been used. It was found that the rate of accumulation of nucleotide substitutions for E genome region of TBEV was approximately 1.4 x 10(-4) and 5.4 x 10(-5) substitutions per site per year for NS5 genome region of PV. The ratio of non-synonymous nucleotide substitutions to synonymous substitution (dN/dS) for viral sequences were estimated of 0.049 for TBEV and 0.098 for PV. Maximum value dN/dS was 0.201-0.220 for sub-cluster of Russian and Canadian strains of PV and the minimum - 0.024 for cluster of Russian and Chinese strains of Far Eastern genotype TBEV. Evaluation of time intervals of evolutionary events associated with these viruses showed that European subtype TBEV are diverged from all-TBEV ancestor within approximately 2750 years and the Siberian and Far Eastern subtypes are emerged about 2250 years ago. The PV was introduced into natural foci of the Primorsky Krai of Russia only about 70 years ago and PV is a very close to Canadian strains of PV. Evolutionary picture for PV in North America is similar to evolution of Siberian and Far Eastern subtypes TBEV in Asia. The divergence time for main genetic groups of TBEV and PV are correlated with historical periods of warming and cooling. These allow to propose a hypothesis that climate changes were essential to the evolution of the flaviviruses in the past millenniums.
Synonym extraction and abbreviation expansion with ensembles of semantic spaces.

PubMed

Henriksson, Aron; Moen, Hans; Skeppstedt, Maria; Daudaravičius, Vidas; Duneld, Martin

2014-02-05

Terminologies that account for variation in language use by linking synonyms and abbreviations to their corresponding concept are important enablers of high-quality information extraction from medical texts. Due to the use of specialized sub-languages in the medical domain, manual construction of semantic resources that accurately reflect language use is both costly and challenging, often resulting in low coverage. Although models of distributional semantics applied to large corpora provide a potential means of supporting development of such resources, their ability to isolate synonymy from other semantic relations is limited. Their application in the clinical domain has also only recently begun to be explored. Combining distributional models and applying them to different types of corpora may lead to enhanced performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs. A combination of two distributional models - Random Indexing and Random Permutation - employed in conjunction with a single corpus outperforms using either of the models in isolation. Furthermore, combining semantic spaces induced from different types of corpora - a corpus of clinical text and a corpus of medical journal articles - further improves results, outperforming a combination of semantic spaces induced from a single source, as well as a single semantic space induced from the conjoint corpus. A combination strategy that simply sums the cosine similarity scores of candidate terms is generally the most profitable out of the ones explored. Finally, applying simple post-processing filtering rules yields substantial performance gains on the tasks of extracting abbreviation-expansion pairs, but not synonyms. The best results, measured as recall in a list of ten candidate terms, for the three tasks are: 0.39 for abbreviations to long forms, 0.33 for long forms to abbreviations, and 0.47 for synonyms. This study demonstrates that ensembles of semantic spaces can yield improved performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs. This notion, which merits further exploration, allows different distributional models - with different model parameters - and different types of corpora to be combined, potentially allowing enhanced performance to be obtained on a wide range of natural language processing tasks.
Synonym extraction and abbreviation expansion with ensembles of semantic spaces

PubMed Central

2014-01-01

Background Terminologies that account for variation in language use by linking synonyms and abbreviations to their corresponding concept are important enablers of high-quality information extraction from medical texts. Due to the use of specialized sub-languages in the medical domain, manual construction of semantic resources that accurately reflect language use is both costly and challenging, often resulting in low coverage. Although models of distributional semantics applied to large corpora provide a potential means of supporting development of such resources, their ability to isolate synonymy from other semantic relations is limited. Their application in the clinical domain has also only recently begun to be explored. Combining distributional models and applying them to different types of corpora may lead to enhanced performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs. Results A combination of two distributional models – Random Indexing and Random Permutation – employed in conjunction with a single corpus outperforms using either of the models in isolation. Furthermore, combining semantic spaces induced from different types of corpora – a corpus of clinical text and a corpus of medical journal articles – further improves results, outperforming a combination of semantic spaces induced from a single source, as well as a single semantic space induced from the conjoint corpus. A combination strategy that simply sums the cosine similarity scores of candidate terms is generally the most profitable out of the ones explored. Finally, applying simple post-processing filtering rules yields substantial performance gains on the tasks of extracting abbreviation-expansion pairs, but not synonyms. The best results, measured as recall in a list of ten candidate terms, for the three tasks are: 0.39 for abbreviations to long forms, 0.33 for long forms to abbreviations, and 0.47 for synonyms. Conclusions This study demonstrates that ensembles of semantic spaces can yield improved performance on the tasks of automatically extracting synonyms and abbreviation-expansion pairs. This notion, which merits further exploration, allows different distributional models – with different model parameters – and different types of corpora to be combined, potentially allowing enhanced performance to be obtained on a wide range of natural language processing tasks. PMID:24499679
BeAtMuSiC: Prediction of changes in protein-protein binding affinity on mutations.

PubMed

Dehouck, Yves; Kwasigroch, Jean Marc; Rooman, Marianne; Gilis, Dimitri

2013-07-01

The ability of proteins to establish highly selective interactions with a variety of (macro)molecular partners is a crucial prerequisite to the realization of their biological functions. The availability of computational tools to evaluate the impact of mutations on protein-protein binding can therefore be valuable in a wide range of industrial and biomedical applications, and help rationalize the consequences of non-synonymous single-nucleotide polymorphisms. BeAtMuSiC (http://babylone.ulb.ac.be/beatmusic) is a coarse-grained predictor of the changes in binding free energy induced by point mutations. It relies on a set of statistical potentials derived from known protein structures, and combines the effect of the mutation on the strength of the interactions at the interface, and on the overall stability of the complex. The BeAtMuSiC server requires as input the structure of the protein-protein complex, and gives the possibility to assess rapidly all possible mutations in a protein chain or at the interface, with predictive performances that are in line with the best current methodologies.
In silico prediction of a disease-associated STIL mutant and its affect on the recruitment of centromere protein J (CENPJ).

PubMed

Kumar, Ambuj; Rajendran, Vidya; Sethumadhavan, Rao; Purohit, Rituraj

2012-01-01

Human STIL (SCL/TAL1 interrupting locus) protein maintains centriole stability and spindle pole localisation. It helps in recruitment of CENPJ (Centromere protein J)/CPAP (centrosomal P4.1-associated protein) and other centrosomal proteins. Mutations in STIL protein are reported in several disorders, especially in deregulation of cell cycle cascades. In this work, we examined the non-synonymous single nucleotide polymorphisms (nsSNPs) reported in STIL protein for their disease association. Different SNP prediction tools were used to predict disease-associated nsSNPs. Our evaluation technique predicted rs147744459 (R242C) as a highly deleterious disease-associated nsSNP and its interaction behaviour with CENPJ protein. Molecular modelling, docking and molecular dynamics simulation were conducted to examine the structural consequences of the predicted disease-associated mutation. By molecular dynamic simulation we observed structural consequences of R242C mutation which affects interaction of STIL and CENPJ functional domains. The result obtained in this study will provide a biophysical insight into future investigations of pathological nsSNPs using a computational platform.
Transmembrane domain dependent inhibitory function of FcγRIIB.

PubMed

Wang, Junyi; Li, Zongyu; Xu, Liling; Yang, Hengwen; Liu, Wanli

2018-03-01

FcγRIIB, the only inhibitory IgG Fc receptor, functions to suppress the hyper-activation of immune cells. Numerous studies have illustrated its inhibitory function through the ITIM motif in the cytoplasmic tail of FcγRIIB. However, later studies revealed that in addition to the ITIM, the transmembrane (TM) domain of FcγRIIB is also indispensable for its inhibitory function. Indeed, recent epidemiological studies revealed that a non-synonymous single nucleotide polymorphism (rs1050501) within the TM domain of FcγRIIB, responsible for the I232T substitution, is associated with the susceptibility to systemic lupus erythematosus (SLE). In this review, we will summarize these epidemiological and functional studies of FcγRIIB-I232T in the past few years, and will further discuss the mechanisms accounting for the functional loss of FcγRIIB-I232T. Our review will help the reader gain a deeper understanding of the importance of the TM domain in mediating the inhibitory function of FcγRIIB and may provide insights to a new therapeutic target for the associated diseases.
Transcriptome analysis of adiposity in domestic ducks by transcriptomic comparison with their wild counterparts.

PubMed

Chen, L; Luo, J; Li, J X; Li, J J; Wang, D Q; Tian, Y; Lu, L Z

2015-06-01

Excessive adiposity is a major problem in the duck industry, but its molecular mechanisms remain unknown. Genetic comparisons between domestic and wild animals have contributed to the exploration of genetic mechanisms responsible for many phenotypic traits. Significant differences in body fat mass have been detected between domestic and wild ducks. In this study, we used the Peking duck and Anas platyrhynchos as the domestic breed and wild counterpart respectively and performed a transcriptomic comparison of abdominal fat between the two breeds to comprehensively analyze the transcriptome basis of adiposity in ducks. We obtained approximately 350 million clean reads; assembled 61 250 transcripts, including 23 699 novel ones; and identified alternative 5' splice sites, alternative 3' splice sites, skipped exons and retained intron as the main alternative splicing events. A differential expression analysis between the two breeds showed that 753 genes exhibited differential expression. In Peking ducks, some lipid metabolism-related genes (IGF2, FABP5, BMP7, etc.) and oncogenes (RRM2, AURKA, CYR61, etc.) were upregulated, whereas genes related to tumor suppression and immunity (TNFRSF19, TNFAIP6, IGSF21, NCF1, etc.) were downregulated, suggesting adiposity might closely associate with tumorigenesis in ducks. Furthermore, 280 576 single-nucleotide variations were found differentiated between the two breeds, including 8641 non-synonymous ones, and some of the non-synonymous ones were found enriched in genes involved in lipid-associated and immune-associated pathways, suggesting abdominal fat of the duck undertakes both a metabolic function and immune-related function. These datasets enlarge our genetic information of ducks and provide valuable resources for analyzing mechanisms underlying adiposity in ducks. © 2015 Stichting International Foundation for Animal Genetics.
Is α‐T catenin (VR22) an Alzheimer's disease risk gene?

PubMed Central

Bertram, Lars; Mullin, Kristina; Parkinson, Michele; Hsiao, Monica; Moscarillo, Thomas J; Wagner, Steven L; Becker, K David; Velicelebi, Gonul; Blacker, Deborah; Tanzi, Rudolph E

2007-01-01

Background Recently, conflicting reports have been published on the potential role of genetic variants in the α‐T catenin gene (VR22; CTNNA3) on the risk for Alzheimer's disease. In these papers, evidence for association is mostly observed in multiplex families with Alzheimer's disease, whereas case–control samples of sporadic Alzheimer's disease are predominantly negative. Methods After sequencing VR22 in multiplex families with Alzheimer's disease linked to chromosome 10q21, we identified a novel non‐synonymous (Ser596Asn; rs4548513) single nucleotide polymorphism (SNP). This and four non‐coding SNPs were assessed in two independent samples of families with Alzheimer's disease, one with 1439 subjects from 437 multiplex families with Alzheimer's disease and the other with 489 subjects from 217 discordant sibships. Results A weak association with the Ser596Asn SNP in the multiplex sample, predominantly in families with late‐onset Alzheimer's disease (p = 0.02), was observed. However, this association does not seem to contribute substantially to the chromosome 10 Alzheimer's disease linkage signal that we and others have reported previously. No evidence was found of association with any of the four additional SNPs tested in the multiplex families with Alzheimer's disease. Finally, the Ser596Asn change was not associated with the risk for Alzheimer's disease in the independent discordant sibship sample. Conclusions This is the first study to report evidence of an association between a potentially functional, non‐synonymous SNP in VR22 and the risk for Alzheimer's disease. As the underlying effects are probably small, and are only seen in families with multiple affected members, the population‐wide significance of this finding remains to be determined. PMID:17209133
A non-synonymous SNP in the NOS2 associated with septic shock in patients with sepsis in Chinese populations.

PubMed

Wang, Zhifu; Feng, Kai; Yue, Maoxing; Lu, Xiaoguang; Zheng, Qihan; Zhang, Hongxing; Zhai, Yun; Li, Peiyao; Yu, Lixia; Cai, Mi; Zhang, Xiumei; Kang, Xin; Shi, Weihai; Xia, Xia; Chen, Xi; Cao, Pengbo; Li, Yuanfeng; Chen, Huipeng; Ling, Yan; Li, Yuxia; He, Fuchu; Zhou, Gangqiao

2013-03-01

Sepsis represents a systemic inflammatory response to infection and its sequelae include severe sepsis, septic shock, multiple organ dysfunction syndrome (MODS) and death. Studies in mice and humans indicate that the inducible nitric oxide synthase (iNOS, NOS2) plays an important role in the development of sepsis and its sequelae. It was reported that several single nucleotide polymorphisms (SNPs) within NOS2 could influence the production or activity of NOS2. In this study, we assessed whether SNPs within NOS2 gene were associated with severity of sepsis in Chinese populations. A case-control study was conducted, which included 299 and 280 unrelated patients with sepsis recruited from Liaoning and Jiangsu provinces in China, respectively. Six SNPs within NOS2 were genotyped using Sequenom MassARRAY system. The associations between the SNPs and risk of sepsis complications were estimated by a binary logistic regression model adjusted for confounding factors. Functional assay was performed to assess the biological significance. The GA + AA genotype of a non-synonymous SNP in the exon 16 of NOS2 (rs2297518: G>A) was significantly associated with increased susceptibility to septic shock compared with GG genotype in Liaoning population (OR = 3.29, 95% CI = 1.40-7.72, P = 0.0047). This association was confirmed in the Jiangsu population (OR = 3.49, 95% CI = 1.57-7.79, P = 0.0019). Furthermore, the functional assay performed in the immortalized lymphocyte cell lines indicated that the at-risk GA genotype had a tendency of higher NOS2 activity than the GG genotype (P = 0.32). Our findings suggest that the NOS2 rs2297518 may play a role in mediating the susceptibility to septic shock in patients with sepsis in Chinese populations.
Paraoxonase promoter and intronic variants modify risk of sporadic amyotrophic lateral sclerosis

PubMed Central

Cronin, Simon; Greenway, Matthew J; Prehn, Jochen H M; Hardiman, Orla

2007-01-01

Background The paraoxonases, PON1–3, play a major protective role both against environmental toxins and as part of the antioxidant defence system. Recently, non‐synonymous coding single nucleotide polymorphisms (SNPs), known to lower serum PON activity, have been associated with sporadic ALS (SALS) in a Polish population. A separate trio based study described a detrimental allele at the PON3 intronic variant INS2+3651 (rs10487132). Association between PON gene cluster variants and SALS requires external validation in an independent dataset. Aims To examine the association of the promoter SNPs PON1−162G>A and PON1−108T>C; the non‐synonymous functional SNPs PON1Q192R and L55M and PON2C311S and A148G; and the intronic marker PON3INS2+3651A>G, with SALS in a genetically homogenous population. Methods 221 Irish patients with SALS and 202 unrelated control subjects were genotyped using KASPar chemistries. Statistical analyses and haplotype estimations were conducted using Haploview and Unphased software. Multiple permutation testing, as implemented in Unphased, was applied to haplotype p values to correct for multiple hypotheses. Results Two of the seven SNPs were associated with SALS in the Irish population: PON155M (OR 1.52, p = 0.006) and PON3INS2+3651 G (OR 1.36, p = 0.03). Two locus haplotype analysis showed association only when both of these risk alleles were present (OR 1.7, p = 0.005), suggesting a potential effect modification. Low functioning promoter variants were observed to influence this effect when compared with wild‐type. Conclusions These data provide additional evidence that genetic variation across the paroxanase loci may be common susceptibility factors for SALS. PMID:17702780
Evolutionary Diagnosis of non-synonymous variants involved in differential drug response

PubMed Central

2015-01-01

Background Many pharmaceutical drugs are known to be ineffective or have negative side effects in a substantial proportion of patients. Genomic advances are revealing that some non-synonymous single nucleotide variants (nsSNVs) may cause differences in drug efficacy and side effects. Therefore, it is desirable to evaluate nsSNVs of interest in their ability to modulate the drug response. Results We found that the available data on the link between drug response and nsSNV is rather modest. There were only 31 distinct drug response-altering (DR-altering) and 43 distinct drug response-neutral (DR-neutral) nsSNVs in the whole Pharmacogenomics Knowledge Base (PharmGKB). However, even with this modest dataset, it was clear that existing bioinformatics tools have difficulties in correctly predicting the known DR-altering and DR-neutral nsSNVs. They exhibited an overall accuracy of less than 50%, which was not better than random diagnosis. We found that the underlying problem is the markedly different evolutionary properties between positions harboring nsSNVs linked to drug responses and those observed for inherited diseases. To solve this problem, we developed a new diagnosis method, Drug-EvoD, which was trained on the evolutionary properties of nsSNVs associated with drug responses in a sparse learning framework. Drug-EvoD achieves a TPR of 84% and a TNR of 53%, with a balanced accuracy of 69%, which improves upon other methods significantly. Conclusions The new tool will enable researchers to computationally identify nsSNVs that may affect drug responses. However, much larger training and testing datasets are needed to develop more reliable and accurate tools. PMID:25952014
Polymorphisms of EpCAM gene and prognosis for non-small-cell lung cancer in Han Chinese

PubMed Central

Yang, Yuefan; Fei, Fei; Song, Yang; Li, Xiaofei; Zhang, Zhipei; Fei, Zhou; Su, Haichuan; Wan, Shaogui

2014-01-01

The epithelial cell adhesion molecule (EpCAM) is overexpressed in a wide variety of human cancers and is associated with patient prognosis, including those with lung cancer. However, the association of single nucleotide polymorphisms (SNPs) in the EpCAM gene with the prognosis for non-small-cell lung cancer (NSCLC) patients has never been investigated. We evaluated the association between two SNPs, rs1126497 and rs1421, in the EpCAM gene and clinical outcomes in a Chinese cohort of 506 NSCLC patients. The SNPs were genotyped using the Sequenom iPLEX genotyping system. Multivariate Cox proportional hazards model and Kaplan–Meier curves were used to assess the association of EpCAM gene genotypes with the prognosis of NSCLC. We found that the non-synonymous SNP rs1126497 was significantly associated with survival. Compared with the CC genotype, the CT+TT genotype was a risk factor for both death (hazard ratio, 1.40; 95% confidence interval [CI], 1.02–1.94; P = 0.040) and recurrence (hazard ratio, 1.34; 95% CI, 1.02–1.77; P = 0.039). However, the SNP rs1421 did not show any significant effect on patient prognosis. Instead, the AG+GG genotype in rs1421 was significantly associated with early T stages (T1/T2) when compared with the AA genotype (odds ratio for late stage = 0.65; 95% CI, 0.44–0.96, P = 0.029). Further stratified analysis showed notable modulating effects of clinical characteristics on the associations between variant genotypes of rs1126497 and NSCLC outcomes. In conclusion, our study indicated that the non-synonymous SNP rs1126497 may be a potential prognostic marker for NSCLC patients. PMID:24304228
Relevance of spontaneous fabT mutations to a streptococcal toxic shock syndrome to non-streptococcal toxic shock syndrome transition in the novel-type Streptococcus pyogenes isolates that lost a salRK.

PubMed

Tatsuno, Ichiro; Okada, Ryo; Matsumoto, Masakado; Hata, Nanako; Matsui, Hideyuki; Zhang, Yan; Isaka, Masanori; Hasegawa, Tadao

2016-05-01

Streptococcus pyogenes is a causative agent of streptococcal toxic shock syndrome (STSS). Mutations in covR/S or rgg, negative regulators, can reportedly modulate the severity of infection in this pathogen. Recently, we showed that the regions encoding the SalR-SalK, a two-component regulatory system, were deleted in some emm 1-type isolates (named as 'novel-type'). In this study, the two novel 'STSS' isolates 10-85stss and 11-171stss were more virulent than the two novel 'non-STSS' isolates 11O-2non and 11T-3non when examined using a mouse model of invasive infection. Genome-sequencing experiments using the three strains 10-85stss , 11-171stss , and 11O-2non detected only one single nucleotide polymorphism that causes a non-synonymous mutation in fabT encoding a transcriptional regulator in strain 11O-2non . Loss of fabT reduced the high level of virulence observed in the STSS isolates to that in the non-STSS isolates, and introduction of an intact fabT compensated the lower virulence of 11O-2non , suggesting that the mutation in fabT, but not in covR/S or rgg, is involved in the differential virulence among the novel-type clinical isolates. This type of non-synonymous fabT mutation was also identified in 12 non-STSS isolates (including 11O-2non and 11T-3non ), and most of those 12 isolates showed impaired FabT function. © 2016 APMIS. Published by John Wiley & Sons Ltd.
Is really endogenous ghrelin a hunger signal in chickens? Association of GHSR SNPs with increase appetite, growth traits, expression and serum level of GHRL, and GH.

PubMed

El-Magd, Mohammed Abu; Saleh, Ayman A; Abdel-Hamid, Tamer M; Saleh, Rasha M; Afifi, Mohammed A

2016-10-01

Chicken growth hormone secretagogue receptor (GHSR) is a receptor for ghrelin (GHRL), a peptide hormone produced by chicken proventriculus, which stimulates growth hormone (GH) release and food intake. The purpose of this study was to search for single nucleotide polymorphisms (SNPs) in exon 2 of GHSR gene and to analyze their effect on the appetite, growth traits and expression levels of GHSR, GHRL, and GH genes as well as serum levels of GH and GHRL in Mandara chicken. Two adjacent SNPs, A239G and G244A, were detected in exon 2 of GHSR gene. G244A SNP was non-synonymous mutation and led to replacement of lysine amino acid (aa) by arginine aa, while A239G SNP was synonymous mutation. The combined genotypes of A239G and G244A SNPs produced three haplotypes; GG/GG, GG/AG, AG/AG, which associated significantly (P<0.05) with growth traits (body weight, average daily gain, shank length, keel length, chest circumference) at age from >4 to 16w. Chickens with the homozygous GG/GG haplotype showed higher growth performance than other chickens. The two SNPs were also correlated with mRNA levels of GHSR and GH (in pituitary gland), and GHRL (in proventriculus and hypothalamus) as well as with serum level of GH and GHRL. Also, chickens with GG/GG haplotype showed higher mRNA and serum levels. This is the first study to demonstrate that SNPs in GHSR can increase appetite, growth traits, expression and level of GHRL, suggesting a hunger signal role for endogenous GHRL. Copyright © 2016 Elsevier Inc. All rights reserved.
Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus).

PubMed

Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

2015-07-27

Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1-8) were identified and genotyped via direct sequencing covering most of the coding region and 3'UTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3'UTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs.
Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus)

PubMed Central

Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

2015-01-01

Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1–8) were identified and genotyped via direct sequencing covering most of the coding region and 3ʹUTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3ʹUTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs. PMID:26225956
Variation in Genes Encoding the Neuroactive Steroid Synthetic Enzymes 5α-Reductase Type 1 and 3α-Reductase Type 2 is Associated with Alcohol Dependence

PubMed Central

Milivojevic, Verica; Kranzler, Henry R.; Gelernter, Joel; Burian, Linda; Covault, Jonathan

2010-01-01

Background Studies of alcohol effects in rodents and in vitro implicate endogenous neuroactive steroids as key mediators of alcohol effects at GABAA receptors. We used a case-control sample to test the association with alcohol dependence (AD) of single nucleotide polymorphisms (SNPs) in the genes encoding two key enzymes required for the generation of endogenous neuroactive steroids: 5α–reductase, type I (5α-R) and 3α-hydroxysteroid dehydrogenase, type 2 (3α-HSD), both of which are expressed in human brain. Methods We focused on markers previously associated with a biological phenotype. For 5α-R, we examined the synonymous SRD5A1 exon 1 SNP rs248793, which has been associated with the ratio of dihydrotestosterone to testosterone. For 3α-HSD, we examined the non-synonymous AKR1C3 SNP rs12529 (H5Q), which has been associated with bladder cancer. The SNPs were genotyped in a sample of 1,083 non-Hispanic Caucasians including 552 controls and 531 subjects with AD. Results The minor allele for both SNPs was more common among controls than subjects with AD: SRD5A1 rs248793 C-allele (χ2(1)=7.6, p=0.006) and AKR1C3 rs12529 G-allele (χ2(1)=14.6, p=0.0001). There was also an interaction of these alleles such that the “protective” effect of the minor allele at each marker for AD was conditional on the genotype of the second marker. Conclusions We found evidence of an association with AD of polymorphisms in two genes encoding neuroactive steroid biosynthetic enzymes, providing indirect evidence that neuroactive steroids are important mediators of alcohol effects in humans. PMID:21323680

Polymorphism of BMP4 gene in Indian goat breeds differing in prolificacy.

PubMed

Sharma, Rekha; Ahlawat, Sonika; Maitra, A; Roy, Manoranjan; Mandakmale, S; Tantia, M S

2013-12-10

Bone morphogenetic proteins (BMPs) are members of the TGF-β (transforming growth factor-beta) superfamily, of which BMP4 is the most important due to its crucial role in follicular growth and differentiation, cumulus expansion and ovulation. Reproduction is a crucial trait in goat breeding and based on the important role of BMP4 gene in reproduction it was considered as a possible candidate gene for the prolificacy of goats. The objective of the present study was to detect polymorphism in intronic, exonic and 3' un-translated regions of BMP4 gene in Indian goats. Nine different goat breeds (Barbari, Beetal, Black Bengal, Malabari, Jakhrana (Twinning>40%), Osmanabadi, Sangamneri (Twinning 20-30%), Sirohi and Ganjam (Twinning<10%)) differing in prolificacy and geographic distribution were employed for polymorphism scanning. Cattle sequence (AC_000167.1) was used to design primers for the amplification of a targeted region followed by direct DNA sequencing to identify the genetic variations. Single nucleotide polymorphisms (SNPs) were not detected in exon 3, the intronic region and the 3' flanking region. A SNP (G1534A) was identified in exon 2. It was a non-synonymous mutation resulting in an arginine to lysine change in a corresponding protein sequence. G to A transition at the 1534 locus revealed two genotypes GG and GA in the nine investigated goat breeds. The GG genotype was predominant with a genotype frequency of 0.98. The GA genotype was present in the Black Bengal as well as Jakhrana breed with a genotype frequency of 0.02. A microsatellite was identified in the 3' flanking region, only 20 nucleotides downstream from the termination site of the coding region, as a short sequence with more than nineteen continuous and repeated CA dinucleotides. Since the gene is highly evolutionarily conserved, identification of a non-synonymous SNP (G1534A) in the coding region gains further importance. To our knowledge, this is the first report of a mutation in the coding region of the caprine BMP4 gene. But whether the reproduction trait of goat is associated with the BMP4 polymorphism, needs to be further defined by association studies in more populations so as to delineate an effect on it. © 2013 Elsevier B.V. All rights reserved.
No variation and low synonymous substitution rates in coral mtDNA despite high nuclear variation

PubMed Central

Hellberg, Michael E

2006-01-01

Background The mitochondrial DNA (mtDNA) of most animals evolves more rapidly than nuclear DNA, and often shows higher levels of intraspecific polymorphism and population subdivision. The mtDNA of anthozoans (corals, sea fans, and their kin), by contrast, appears to evolve slowly. Slow mtDNA evolution has been reported for several anthozoans, however this slow pace has been difficult to put in phylogenetic context without parallel surveys of nuclear variation or calibrated rates of synonymous substitution that could permit quantitative rate comparisons across taxa. Here, I survey variation in the coding region of a mitochondrial gene from a coral species (Balanophyllia elegans) known to possess high levels of nuclear gene variation, and estimate synonymous rates of mtDNA substitution by comparison to another coral (Tubastrea coccinea). Results The mtDNA surveyed (630 bp of cytochrome oxidase subunit I) was invariant among individuals sampled from 18 populations spanning 3000 km of the range of B. elegans, despite high levels of variation and population subdivision for allozymes over these same populations. The synonymous substitution rate between B. elegans and T. coccinea (0.05%/site/106 years) is similar to that in most plants, but 50–100 times lower than rates typical for most animals. In addition, while substitutions to mtDNA in most animals exhibit a strong bias toward transitions, mtDNA from these corals does not. Conclusion Slow rates of mitochondrial nucleotide substitution result in low levels of intraspecific mtDNA variation in corals, even when nuclear loci vary. Slow mtDNA evolution appears to be the basal condition among eukaryotes. mtDNA substitution rates switch from slow to fast abruptly and unidirectionally. This switch may stem from the loss of just one or a few mitochondrion-specific DNA repair or replication genes. PMID:16542456
Genetic variability in G2 and F2 region between biological clones of human respiratory syncytial virus with or without host immune selection pressure

PubMed Central

Moraes, Claudia Trigo Pedroso; Oliveira, Danielle Bruna Leal; Campos, Angelica Cristine Almeida; Bosso, Patricia Alves; Lima, Hildener Nogueira; Stewien, Klaus Eberhard; Gilio, Alfredo Elias; Vieira, Sandra Elisabete; Botosso, Viviane Fongaro; Durigon, Edison Luiz

2015-01-01

Human respiratory syncytial virus (HRSV) is an important respiratory pathogens among children between zero-five years old. Host immunity and viral genetic variability are important factors that can make vaccine production difficult. In this work, differences between biological clones of HRSV were detected in clinical samples in the absence and presence of serum collected from children in the convalescent phase of the illness and from their biological mothers. Viral clones were selected by plaque assay in the absence and presence of serum and nucleotide sequences of the G2 and F2 genes of HRSV biological clones were compared. One non-synonymous mutation was found in the F gene (Ile5Asn) in one clone of an HRSV-B sample and one non-synonymous mutation was found in the G gene (Ser291Pro) in four clones of the same HRSV-B sample. Only one of these clones was obtained after treatment with the child's serum. In addition, some synonymous mutations were determined in two clones of the HRSV-A samples. In conclusion, it is possible that minor sequences could be selected by host antibodies contributing to the HRSV evolutionary process, hampering the development of an effective vaccine, since we verify the same codon alteration in absence and presence of human sera in individual clones of BR-85 sample. PMID:25742274
Genetic variability in G2 and F2 region between biological clones of human respiratory syncytial virus with or without host immune selection pressure.

PubMed

Moraes, Claudia Trigo Pedroso; Oliveira, Danielle Bruna Leal; Campos, Angelica Cristine Almeida; Bosso, Patricia Alves; Lima, Hildener Nogueira; Stewien, Klaus Eberhard; Gilio, Alfredo Elias; Vieira, Sandra Elisabete; Botosso, Viviane Fongaro; Durigon, Edison Luiz

2015-02-01

Human respiratory syncytial virus (HRSV) is an important respiratory pathogens among children between zero-five years old. Host immunity and viral genetic variability are important factors that can make vaccine production difficult. In this work, differences between biological clones of HRSV were detected in clinical samples in the absence and presence of serum collected from children in the convalescent phase of the illness and from their biological mothers. Viral clones were selected by plaque assay in the absence and presence of serum and nucleotide sequences of the G2 and F2 genes of HRSV biological clones were compared. One non-synonymous mutation was found in the F gene (Ile5Asn) in one clone of an HRSV-B sample and one non-synonymous mutation was found in the G gene (Ser291Pro) in four clones of the same HRSV-B sample. Only one of these clones was obtained after treatment with the child's serum. In addition, some synonymous mutations were determined in two clones of the HRSV-A samples. In conclusion, it is possible that minor sequences could be selected by host antibodies contributing to the HRSV evolutionary process, hampering the development of an effective vaccine, since we verify the same codon alteration in absence and presence of human sera in individual clones of BR-85 sample.
Emended description of Salinivibrio proteolyticus, including Salinivibrio costicola subsp. vallismortis and five new isolates.

PubMed

López-Hermoso, Clara; de la Haba, Rafael R; Sánchez-Porro, Cristina; Ventosa, Antonio

2018-05-01

We carried out a comparative taxonomic study of Salinivibrio proteolyticus and Salinivibrio costicola subsp. vallismortis, as well as of five halophilic strains (IB574, IB872, PR5, PR919 and PR932), isolated from salterns in Spain and Puerto Rico that were closely related to these bacteria. Multilocus sequence analysis of concatenated gyrB, recA, rpoA and rpoD housekeeping genes showed that they constituted a single cluster separate from the other species and subspecies of Salinivibrio. Experimental and in silico DNA-DNA hybridization studies indicated that they are members of the same species, with relatedness of 100-74 % and 97.8-70.0 %, respectively. The average nucleotide identity (ANI) determined for these strains was 99.7-95.6 % for ANIb and 99.7-95.7 % for OrthoANI. However, the ANI values for S. costicolasubsp.vallismortis DSM 8285 T with respect to S. costicolasubsp.costicola DSM 11403 T and S. costicolasubsp.alcaliphilus DSM 16359 T were 78.7 and 78.9 % (ANIb) and 79.4 and 79.4 % (OrthoANI), respectively. The phylogenomic tree based on 1072 concatenated orthologous single-copy core genes confirmed that S. proteolyticus, S. costicolasubsp.vallismortis and the five new isolates constitute a coherent single phylogroup, separated from the other species and subspecies of Salinivibrio. All these data indicate that S. costicolasubsp.vallismortis is a heterotypic synonym of S. proteolyticus and we propose an emended description of this species.
Origin of a function by tandem gene duplication limits the evolutionary capability of its sister copy.

PubMed

Hasselmann, Martin; Lechner, Sarah; Schulte, Christina; Beye, Martin

2010-07-27

The most remarkable outcome of a gene duplication event is the evolution of a novel function. Little information exists on how the rise of a novel function affects the evolution of its paralogous sister gene copy, however. We studied the evolution of the feminizer (fem) gene from which the gene complementary sex determiner (csd) recently derived by tandem duplication within the honey bee (Apis) lineage. Previous studies showed that fem retained its sex determination function, whereas the rise of csd established a new primary signal of sex determination. We observed a specific reduction of nonsynonymous to synonymous substitution ratios in Apis to non-Apis fem. We found a contrasting pattern at two other genetically linked genes, suggesting that hitchhiking effects to csd, the locus under balancing selection, is not the cause of this evolutionary pattern. We also excluded higher synonymous substitution rates by relative rate testing. These results imply that stronger purifying selection is operating at the fem gene in the presence of csd. We propose that csd's new function interferes with the function of Fem protein, resulting in molecular constraints and limited evolvability of fem in the Apis lineage. Elevated silent nucleotide polymorphism in fem relative to the genome-wide average suggests that genetic linkage to the csd gene maintained more nucleotide variation in today's population. Our findings provide evidence that csd functionally and genetically interferes with fem, suggesting that a newly evolved gene and its functions can limit the evolutionary capability of other genes in the genome.
Rabbit haemorrhagic disease virus 2 (RHDV2) outbreak in Azores: Disclosure of common genetic markers and phylogenetic segregation within the European strains.

PubMed

Duarte, Margarida; Carvalho, Carina; Bernardo, Susana; Barros, Sílvia Vanessa; Benevides, Sandra; Flor, Lídia; Monteiro, Madalena; Marques, Isabel; Henriques, Margarida; Barros, Sílvia C; Fagulha, Teresa; Ramos, Fernanda; Luís, Tiago; Fevereiro, Miguel

2015-10-01

Rabbit haemorrhagic disease virus 2 (RHDV2) is widespread in several countries of Western Europe, but it has not been introduced to other continents. However, between late 2014 and early 2015, the presence of RHDV2 was confirmed outside of the European continent, in the Azores, initially in the islands of Graciosa, Flores, S. Jorge and Terceira. In this study we report the subsequent detection of RHDV2 in wild rabbits from the islands of Faial, St. Maria and S. Miguel, and display the necropsy and microscopic examination data obtained, which showed lesions similar to those induced by classical strains of RHDV, with severe affection of lungs and liver. We also disclose the result of a genetic investigation carried out with RHDV2 positive samples from wild rabbits found dead in the seven islands. Partial vp60 sequences were amplified from 27 tissue samples. Nucleotide analysis showed that the Azorean strains are closely related to each other, sharing a high genetic identity (>99.15%). None of the obtained sequences were identical to any RHDV2 sequence publically known, hampering a clue for the source of the outbreaks. However, Bayesian and maximum likelihood phylogenetic analyses disclosed that Azorean strains are more closely related to a few strains from Southern Portugal than with any others presently known. In the analysed region comprising the terminal 942 nucleotides of the vp60 gene, four new single nucleotide polymorphisms (SNP) were identified. Based on the present data, these four SNPs, which are unique in the strains from Azores, may constitute putative molecular geographic markers for Azorean RHDV2 strains, if they persist in the future. One of these variations is a non-synonymous substitution that involves the replacement of one amino acid in a hypervariable region of the capsid protein. Copyright © 2015 Elsevier B.V. All rights reserved.
Nucleotide Excision Repair Gene Polymorphisms, Meat Intake and Colon Cancer Risk

PubMed Central

Steck, Susan E.; Butler, Lesley M.; Keku, Temitope; Antwi, Samuel; Galanko, Joseph; Sandler, Robert S.; Hu, Jennifer J.

2014-01-01

Purpose Much of the DNA damage from colon cancer-related carcinogens, including heterocyclic amines (HCA) and polycyclic aromatic hydrocarbons (PAH) from red meat cooked at high temperature, are repaired by the nucleotide excision repair (NER) pathway. Thus, we examined whether NER non-synonymous single nucleotide polymorphisms (nsSNPs) modified the association between red meat intake and colon cancer risk. Methods The study consists of 244 African-American and 311 white colon cancer cases and population-based controls (331 African Americans and 544 whites) recruited from 33 counties in North Carolina from 1996 to 2000. Information collected by food frequency questionnaire on meat intake and preparation methods were used to estimate HCA and benzo(a)pyrene (BaP, a PAH) intake. We tested 7 nsSNPs in 5 NER genes: XPC A499V and K939Q, XPD D312N and K751Q, XPF R415Q, XPG D1104H, and RAD23B A249V. Adjusted odds ratios (OR) and 95% confidence intervals (CI) were calculated using unconditional logistic regression. Results Among African Americans, we observed a statistically significant positive association between colon cancer risk and XPC 499 AV+VV genotype (OR=1.7, 95% CI: 1.1, 2.7, AA as referent), and an inverse association with XPC 939 QQ (OR=0.3, 95%CI: 0.2, 0.8, KK as referent). These associations were not observed among whites. For both races combined, there was interaction between the XPC 939 genotype, well-done red meat intake and colon cancer risk (OR=1.5, 95% CI=1.0, 2.2 for high well-done red meat and KK genotype as compared to low well-done red meat and KK genotype, pinteraction =0.05). Conclusions Our data suggest that NER nsSNPs are associated with colon cancer risk and may modify the association between well-done red meat intake and colon cancer risk. PMID:24607854
Identification of the Q969R gain-of-function polymorphism in the gene encoding porcine NLRP3 and its distribution in pigs of Asian and European origin.

PubMed

Tohno, Masanori; Shinkai, Hiroki; Toki, Daisuke; Okumura, Naohiko; Tajima, Kiyoshi; Uenishi, Hirohide

2016-10-01

The nucleotide-binding domain, leucine-rich-containing family, pyrin-domain containing-3 (NLRP3) inflammasome comprises the major components caspase-1, apoptosis-associated speck-like protein containing a caspase recruitment domain (ASC), and NLRP3. NLRP3 plays important roles in maintaining immune homeostasis mediated by intestinal microorganisms and in the immunostimulatory properties of vaccine adjuvants used to induce an immune response. In the present study, we first cloned a complementary DNA (cDNA) encoding porcine ASC because its genomic sequence was not completely determined. The availability of the ASC cDNA enabled us to reconstitute porcine NLRP3 inflammasomes using an in vitro system that led to the identification of the immune functions of porcine NLRP3 and ASC based on the production of interleukin-1β (IL-1β). Further, we identified six synonymous and six nonsynonymous single-nucleotide polymorphisms (SNPs) in the coding sequence of NLRP3 of six breeds of pigs, including major commercial breeds. Among the nonsynonymous SNPs, the Q969R polymorphism is associated with an increased release of IL-1β compared with other porcine NLRP3 variants, indicating that this polymorphism represents a gain-of-function mutation. This allele was detected in 100 % of the analyzed Chinese Jinhua and Japanese wild boars, suggesting that the allele is maintained in the major commercial native European breeds Landrace, Large White, and Berkshire. These findings represent an important contribution to our knowledge of the diversity of NLRP3 nucleotide sequences among various pig populations. Moreover, efforts to exploit the gain of function induced by the Q969R polymorphism promise to improve pig breeding and husbandry by conferring enhanced resistance to pathogens as well as contributing to vaccine efficacy.
Relationship between mRNA secondary structure and sequence variability in Chloroplast genes: possible life history implications.

PubMed

Krishnan, Neeraja M; Seligmann, Hervé; Rao, Basuthkar J

2008-01-28

Synonymous sites are freer to vary because of redundancy in genetic code. Messenger RNA secondary structure restricts this freedom, as revealed by previous findings in mitochondrial genes that mutations at third codon position nucleotides in helices are more selected against than those in loops. This motivated us to explore the constraints imposed by mRNA secondary structure on evolutionary variability at all codon positions in general, in chloroplast systems. We found that the evolutionary variability and intrinsic secondary structure stability of these sequences share an inverse relationship. Simulations of most likely single nucleotide evolution in Psilotum nudum and Nephroselmis olivacea mRNAs, indicate that helix-forming propensities of mutated mRNAs are greater than those of the natural mRNAs for short sequences and vice-versa for long sequences. Moreover, helix-forming propensity estimated by the percentage of total mRNA in helices increases gradually with mRNA length, saturating beyond 1000 nucleotides. Protection levels of functionally important sites vary across plants and proteins: r-strategists minimize mutation costs in large genes; K-strategists do the opposite. Mrna length presumably predisposes shorter mRNAs to evolve under different constraints than longer mRNAs. The positive correlation between secondary structure protection and functional importance of sites suggests that some sites might be conserved due to packing-protection constraints at the nucleic acid level in addition to protein level constraints. Consequently, nucleic acid secondary structure a priori biases mutations. The converse (exposure of conserved sites) apparently occurs in a smaller number of cases, indicating a different evolutionary adaptive strategy in these plants. The differences between the protection levels of functionally important sites for r- and K-strategists reflect their respective molecular adaptive strategies. These converge with increasing domestication levels of K-strategists, perhaps because domestication increases reproductive output.
High-resolution melting analysis of the single nucleotide polymorphism hot-spot region in the rpoB gene as an indicator of reduced susceptibility to rifaximin in Clostridium difficile.

PubMed

Pecavar, Verena; Blaschitz, Marion; Hufnagl, Peter; Zeinzinger, Josef; Fiedler, Anita; Allerberger, Franz; Maass, Matthias; Indra, Alexander

2012-06-01

Clostridium difficile, a Gram-positive, spore-forming, anaerobic bacterium, is the main causative agent of hospital-acquired diarrhoea worldwide. In addition to metronidazole and vancomycin, rifaximin, a rifamycin derivative, is a promising antibiotic for the treatment of recurring C. difficile infections (CDI). However, exposure of C. difficile to this antibiotic has led to the development of rifaximin-resistance due to point mutations in the β-subunit of the RNA polymerase (rpoB) gene. In the present study, 348 C. difficile strains with known PCR-ribotypes were investigated for respective single nucleotide polymorphisms (SNPs) within the proposed rpoB hot-spot region by using high-resolution melting (HRM) analysis. This method allows the detection of SNPs by comparing the altered melting behaviour of dsDNA with that of wild-type DNA. Discrimination between wild-type and mutant strains was enhanced by creating heteroduplexes by mixing sample DNA with wild-type DNA, leading to characteristic melting curve shapes from samples containing SNPs in the respective rpoB section. In the present study, we were able to identify 16 different rpoB sequence-types (ST) by sequencing analysis of a 325 bp fragment. The 16 PCR STs displayed a total of 24 different SNPs. Fifteen of these 24 SNPs were located within the proposed 151 bp SNP hot-spot region, resulting in 11 different HRM curve profiles (CP). Eleven SNPs (seven of which were within the proposed hot-spot region) led to amino acid substitutions associated with reduced susceptibility to rifaximin and 13 SNPs (eight of which were within the hot-spot region) were synonymous. This investigation clearly demonstrates that HRM analysis of the proposed SNP hot-spot region in the rpoB gene of C. difficile is a fast and cost-effective method for the identification of C. difficile samples with reduced susceptibility to rifaximin and even allows simultaneous SNP subtyping of the respective C. difficile isolates.
Transcriptome sequencing of Eucalyptus camaldulensis seedlings subjected to water stress reveals functional single nucleotide polymorphisms and genes under selection

PubMed Central

2012-01-01

Background Water stress limits plant survival and production in many parts of the world. Identification of genes and alleles responding to water stress conditions is important in breeding plants better adapted to drought. Currently there are no studies examining the transcriptome wide gene and allelic expression patterns under water stress conditions. We used RNA sequencing (RNA-seq) to identify the candidate genes and alleles and to explore the evolutionary signatures of selection. Results We studied the effect of water stress on gene expression in Eucalyptus camaldulensis seedlings derived from three natural populations. We used reference-guided transcriptome mapping to study gene expression. Several genes showed differential expression between control and stress conditions. Gene ontology (GO) enrichment tests revealed up-regulation of 140 stress-related gene categories and down-regulation of 35 metabolic and cell wall organisation gene categories. More than 190,000 single nucleotide polymorphisms (SNPs) were detected and 2737 of these showed differential allelic expression. Allelic expression of 52% of these variants was correlated with differential gene expression. Signatures of selection patterns were studied by estimating the proportion of nonsynonymous to synonymous substitution rates (Ka/Ks). The average Ka/Ks ratio among the 13,719 genes was 0.39 indicating that most of the genes are under purifying selection. Among the positively selected genes (Ka/Ks > 1.5) apoptosis and cell death categories were enriched. Of the 287 positively selected genes, ninety genes showed differential expression and 27 SNPs from 17 positively selected genes showed differential allelic expression between treatments. Conclusions Correlation of allelic expression of several SNPs with total gene expression indicates that these variants may be the cis-acting variants or in linkage disequilibrium with such variants. Enrichment of apoptosis and cell death gene categories among the positively selected genes reveals the past selection pressures experienced by the populations used in this study. PMID:22853646
Association between Single Nucleotide Polymorphisms of the Major Histocompatibility Complex Class II Gene and Newcastle Disease Virus Titre and Body Weight in Leung Hang Khao Chickens

PubMed Central

Molee, A.; Kongroi, K.; Kuadsantia, P.; Poompramun, C.; Likitdecharote, B.

2016-01-01

The aim of the present study was to investigate the effect of single nucleotide polymorphisms in the major histocompatibility complex (MHC) class II gene on resistance to Newcastle disease virus and body weight of the Thai indigenous chicken, Leung Hang Khao (Gallus gallus domesticus). Blood samples were collected for single nucleotide polymorphism analysis from 485 chickens. Polymerase chain reaction sequencing was used to classify single nucleotide polymorphisms of class II MHC. Body weights were measured at the ages of 3, 4, 5, and 7 months. Titres of Newcastle disease virus at 2 weeks to 7 months were determined and the correlation between body weight and titre was analysed. The association between single nucleotide polymorphisms and body weight and titre were analysed by a generalized linear model. Seven single nucleotide polymorphisms were identified: C125T, A126T, C209G, C242T, A243T, C244T, and A254T. Significant correlations between log titre and body weight were found at 2 and 4 weeks. Associations between single nucleotide polymorphisms and titre were found for C209G and A254T, and between all single nucleotide polymorphisms (except A243T) and body weight. The results showed that class II MHC is associated with both titre of Newcastle disease virus and body weight in Leung Hang Khao chickens. This is of concern because improved growth traits are the main goal of breeding selection. Moreover, the results suggested that MHC has a pleiotropic effect on the titre and growth performance. This mechanism should be investigated in a future study. PMID:26732325
Nucleotide sequences of immunoglobulin eta genes of chimpanzee and orangutan: DNA molecular clock and hominoid evolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sakoyama, Y.; Hong, K.J.; Byun, S.M.

To determine the phylogenetic relationships among hominoids and the dates of their divergence, the complete nucleotide sequences of the constant region of the immunoglobulin eta-chain (C/sub eta1/) genes from chimpanzee and orangutan have been determined. These sequences were compared with the human eta-chain constant-region sequence. A molecular clock (silent molecular clock), measured by the degree of sequence divergence at the synonymous (silent) positions of protein-encoding regions, was introduced for the present study. From the comparison of nucleotide sequences of ..cap alpha../sub 1/-antitrypsin and ..beta..- and delta-globulin genes between humans and Old World monkeys, the silent molecular clock was calibrated: themore » mean evolutionary rate of silent substitution was determined to be 1.56 x 10/sup -9/ substitutions per site per year. Using the silent molecular clock, the mean divergence dates of chimpanzee and orangutan from the human lineage were estimated as 6.4 +/- 2.6 million years and 17.3 +/- 4.5 million years, respectively. It was also shown that the evolutionary rate of primate genes is considerably slower than those of other mammalian genes.« less
High levels of MHC class II allelic diversity in lake trout from Lake Superior

USGS Publications Warehouse

Dorschner, M.O.; Duris, T.; Bronte, C.R.; Burnham-Curtis, M. K.; Phillips, R.B.

2000-01-01

Sequence variation in a 216 bp portion of the major histocompatibility complex (MHC) II B1 domain was examined in 74 individual lake trout (Salvelinus namaycush) from different locations in Lake Superior. Forty-three alleles were obtained which encoded 71-72 amino acids of the mature protein. These sequences were compared with previous data obtained from five Pacific salmon species and Atlantic salmon using the same primers. Although all of the lake trout alleles clustered together in the neighbor-joining analysis of amino acid sequences, one amino acid allelic lineage was shared with Atlantic salmon (Salmo salar), a species in another genus which probably diverged from Salvelinus more than 10-20 million years ago. As shown previously in other salmonids, the level of nonsynonymous nucleotide substitution (d(N)) exceeded the level of synonymous substitution (d(S)). The level of nucleotide diversity at the MHC class II B1 locus was considerably higher in lake trout than in the Pacific salmon (genus Oncorhynchus). These results are consistent with the hypothesis that lake trout colonized Lake Superior from more than one refuge following the Wisconsin glaciation. Recent population bottlenecks may have reduced nucleotide diversity in Pacific salmon populations.
Nucleotide diversity at two phytochrome loci along a latitudinal cline in Pinus sylvestris.

PubMed

García-Gil, M R; Mikkonen, M; Savolainen, O

2003-05-01

Forest tree species provide many examples of well-studied adaptive differentiation, where the search for the underlying genes might be possible. In earlier studies and in our common conditions in a greenhouse, northern populations set bud earlier than southern ones. A difference in latitude of origin of one degree corresponded to a change of 1.4 days in number of days to terminal bud set of seedlings. Earlier physiological and ecological genetics work in conifers and other plants have suggested that such variation could be governed by phytochromes. Nucleotide variation was examined at two phytochrome loci (PHYP and PHYO, homologues of the Arabidopsis thaliana PHYB and PHYA, respectively) in three populations: northern Finland, southern Finland and northern Spain. In our samples of 12-15 sequences (2980 and 1156 base pairs at the two loci) we found very low nonsynonymous variation; pi was 0.0003 and 0.0002 at PHYP and PHYO loci, respectively. There was no functional differentiation between populations at the photosensory domains of either locus. The overall silent variation was also low, only 0.0024 for the PHYP locus. The low estimates of silent variation are consistent with the estimated low synonymous substitution rates between Pinus sylvestris and Picea abies at the PHYO locus. Despite the low level of nucleotide variation, haplotypic diversity was relatively high (0.42 and 0.41 for fragments of 1156 nucleotides) at the two loci.
Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

PubMed Central

Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

1994-01-01

To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378
Functional analysis of regulatory single-nucleotide polymorphisms.

PubMed

Pampín, Sandra; Rodríguez-Rey, José C

2007-04-01

The identification of regulatory polymorphisms has become a key problem in human genetics. In the past few years there has been a conceptual change in the way in which regulatory single-nucleotide polymorphisms are studied. We revise the new approaches and discuss how gene expression studies can contribute to a better knowledge of the genetics of common diseases. New techniques for the association of single-nucleotide polymorphisms with changes in gene expression have been recently developed. This, together with a more comprehensive use of the old in-vitro methods, has produced a great amount of genetic information. When added to current databases, it will help to design better tools for the detection of regulatory single-nucleotide polymorphisms. The identification of functional regulatory single-nucleotide polymorphisms cannot be done by the simple inspection of DNA sequence. In-vivo techniques, based on primer-extension, and the more recently developed 'haploChIP' allow the association of gene variants to changes in gene expression. Gene expression analysis by conventional in-vitro techniques is the only way to identify the functional consequences of regulatory single-nucleotide polymorphisms. The amount of information produced in the last few years will help to refine the tools for the future analysis of regulatory gene variants.
Whole-genome sequencing reveals that Shewanella haliotis Kim et al. 2007 can be considered a later heterotypic synonym of Shewanella algae Simidu et al. 1990.

PubMed

Szeinbaum, Nadia; Kellum, Cailin E; Glass, Jennifer B; Janda, J Michael; DiChristina, Thomas J

2018-04-01

Previously, experimental DNA-DNA hybridization (DDH) between Shewanellahaliotis JCM 14758 T and Shewanellaalgae JCM 21037 T had suggested that the two strains could be considered different species, despite minimal phenotypic differences. The recent isolation of Shewanella sp. MN-01, with 99 % 16S rRNA gene identity to S. algae and S. haliotis, revealed a potential taxonomic problem between these two species. In this study, we reassessed the nomenclature of S. haliotis and S. algae using available whole-genome sequences. The whole-genome sequence of S. haliotis JCM 14758 T and ten S. algae strains showed ≥97.7 % average nucleotide identity and >78.9 % digital DDH, clearly above the recommended species thresholds. According to the rules of priority and in view of the results obtained, S. haliotis is to be considered a later heterotypic synonym of S. algae. Because the whole-genome sequence of Shewanella sp. strain MN-01 shares >99 % ANI with S. algae JCM 14758 T , it can be confidently identified as S. algae.
Overdispersion of the Molecular Clock: Temporal Variation of Gene-Specific Substitution Rates in Drosophila

PubMed Central

Hartl, Daniel L.

2008-01-01

Simple models of molecular evolution assume that sequences evolve by a Poisson process in which nucleotide or amino acid substitutions occur as rare independent events. In these models, the expected ratio of the variance to the mean of substitution counts equals 1, and substitution processes with a ratio greater than 1 are called overdispersed. Comparing the genomes of 10 closely related species of Drosophila, we extend earlier evidence for overdispersion in amino acid replacements as well as in four-fold synonymous substitutions. The observed deviation from the Poisson expectation can be described as a linear function of the rate at which substitutions occur on a phylogeny, which implies that deviations from the Poisson expectation arise from gene-specific temporal variation in substitution rates. Amino acid sequences show greater temporal variation in substitution rates than do four-fold synonymous sequences. Our findings provide a general phenomenological framework for understanding overdispersion in the molecular clock. Also, the presence of substantial variation in gene-specific substitution rates has broad implications for work in phylogeny reconstruction and evolutionary rate estimation. PMID:18480070

Evolutionary genomics of the cold-adapted diatom Fragilariopsis cylindrus

DOE PAGES

Mock, Thomas; Otillar, Robert P.; Strauss, Jan; ...

2017-01-26

The Southern Ocean houses a diverse and productive community of organisms. Unicellular eukaryotic diatoms are the main primary producers in this environment, where photosynthesis is limited by low concentrations of dissolved iron and large seasonal fluctuations in light, temperature and the extent of sea ice. How diatoms have adapted to this extreme environment is largely unknown. Here we present insights into the genome evolution of a cold-Adapted diatom from the Southern Ocean, Fragilariopsis cylindrus, based on a comparison with temperate diatoms. We find that approximately 24.7 per cent of the diploid F. cylindrus genome consists of genetic loci with allelesmore » that are highly divergent (15.1 megabases of the total genome size of 61.1 megabases). These divergent alleles were differentially expressed across environmental conditions, including darkness, low iron, freezing, elevated temperature and increased CO 2. Alleles with the largest ratio of non-synonymous to synonymous nucleotide substitutions also show the most pronounced condition-dependent expression, suggesting a correlation between diversifying selection and allelic differentiation. Divergent alleles may be involved in adaptation to environmental fluctuations in the Southern Ocean.« less
Evolutionary genomics of the cold-adapted diatom Fragilariopsis cylindrus

DOE Office of Scientific and Technical Information (OSTI.GOV)

Mock, Thomas; Otillar, Robert P.; Strauss, Jan

The Southern Ocean houses a diverse and productive community of organisms. Unicellular eukaryotic diatoms are the main primary producers in this environment, where photosynthesis is limited by low concentrations of dissolved iron and large seasonal fluctuations in light, temperature and the extent of sea ice. How diatoms have adapted to this extreme environment is largely unknown. Here we present insights into the genome evolution of a cold-Adapted diatom from the Southern Ocean, Fragilariopsis cylindrus, based on a comparison with temperate diatoms. We find that approximately 24.7 per cent of the diploid F. cylindrus genome consists of genetic loci with allelesmore » that are highly divergent (15.1 megabases of the total genome size of 61.1 megabases). These divergent alleles were differentially expressed across environmental conditions, including darkness, low iron, freezing, elevated temperature and increased CO 2. Alleles with the largest ratio of non-synonymous to synonymous nucleotide substitutions also show the most pronounced condition-dependent expression, suggesting a correlation between diversifying selection and allelic differentiation. Divergent alleles may be involved in adaptation to environmental fluctuations in the Southern Ocean.« less
Comparative evolutionary genomics of Corynebacterium with special reference to codon and amino acid usage diversities.

PubMed

Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab

2018-02-01

The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.
When silence is noise: infantile-onset Barth syndrome caused by a synonymous substitution affecting TAZ gene transcription.

PubMed

Ferri, L; Dionisi-Vici, C; Taurisano, R; Vaz, F M; Guerrini, R; Morrone, A

2016-11-01

Barth syndrome (BTHS) is an X-linked inborn error of metabolism which affects males. The main manifestations are cardiomyopathy, myopathy, hypotonia, growth delay, intermittent neutropenia and 3-methylglutaconic aciduria. Diagnosis is confirmed by mutational analysis of the TAZ gene and biochemical dosage of the monolysocardiolipin/tetralinoleoyl cardiolipin (MLCL:L4-CL) ratio. We report a 6-year-old boy who presented with severe hypoglycemia, lactic acidosis and severe dilated cardiomyopathy soon after birth. The MLCL:L4-CL ratio confirmed BTHS (3.90 on patient's fibroblast, normal: 0-0.3). Subsequent sequencing of the TAZ gene revealed only the new synonymous variant NM_000116.3 (TAZ):c.348C>T p.(Gly116Gly), which did not appear to affect the protein sequence. In silico prediction analysis suggested the new c.348C>T nucleotide change could alter the TAZ mRNA splicing processing. We analyzed TAZ mRNAs in the patient's fibroblasts and found an abnormal skipping of 24 bases (NM_000116.3:c.346_371), with the consequent ablation of 8 amino acid residues in the tafazzin protein (NP_000107.1:p.Lys117_Gly124del). Molecular analysis of at risk female family members identified the patient's sister and mother as heterozygous carriers. Apparently harmless synonymous variants in the TAZ gene can damage gene expression. Such findings widen our knowledge of molecular heterogeneity in BTHS. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Information Entropy of Influenza A Segment 7

NASA Astrophysics Data System (ADS)

Thompson, William A.; Fan, Shaohua; Weltman, Joel K.

2008-12-01

Information entropy (H) is a measure of uncertainty at each position within in a sequence of nucleotides.H was used to characterize a set of influenza A segment 7 nucleotide sequences. Nucleotide locations of high entropy were identified near the 5’ start of all of the sequences and the sequences were assigned to subsets according to synonymous nucleotide variants at those positions: either uracil at position six (U6), cytosine at position six (C6), adenine (A12) at position 12, guanine at position 12 (G12), adenine at position 15 (A15) or cytosine (C15) at position 15. H values were found to be correlated/corresponding (Kendall tau) along the lengths of the nucleotide segments of the subset pairs at each position. However, the H values of each subset of sequences were statistically distinguishable from those of the other member of the pair (Kolmogorov-Smirnov test). The joint probability of uncorrelated distributions of U6 and C6 sequences to viral subtypes and to viral host species was 34 times greater than for the A12:G12 subset pair and 214 times greater than for the A15:C15 pair. This result indicates that the high entropy position six of segment 7 is either a reporter or a sentinel location. The fact that not one of the H5N1 sequences in the dataset was a member of the C6 subset, but all 125 H5N1 sequences are members of the U6 subset suggests a non-random sentinel function.
Insight into pattern of codon biasness and nucleotide base usage in serotonin receptor gene family from different mammalian species.

PubMed

Dass, J Febin Prabhu; Sudandiradoss, C

2012-07-15

5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
Long-term excretion of vaccine-derived poliovirus by a healthy child.

PubMed

Martín, Javier; Odoom, Kofi; Tuite, Gráinne; Dunn, Glynis; Hopewell, Nicola; Cooper, Gill; Fitzharris, Catherine; Butler, Karina; Hall, William W; Minor, Philip D

2004-12-01

A child was found to be excreting type 1 vaccine-derived poliovirus (VDPV) with a 1.1% sequence drift from Sabin type 1 vaccine strain in the VP1 coding region 6 months after he was immunized with oral live polio vaccine. Seventeen type 1 poliovirus isolates were recovered from stools taken from this child during the following 4 months. Contrary to expectation, the child was not deficient in humoral immunity and showed high levels of serum neutralization against poliovirus. Selected virus isolates were characterized in terms of their antigenic properties, virulence in transgenic mice, sensitivity for growth at high temperatures, and differences in nucleotide sequence from the Sabin type 1 strain. The VDPV isolates showed mutations at key nucleotide positions that correlated with the observed reversion to biological properties typical of wild polioviruses. A number of capsid mutations mapped at known antigenic sites leading to changes in the viral antigenic structure. Estimates of sequence evolution based on the accumulation of nucleotide changes in the VP1 coding region detected a "defective" molecular clock running at an apparent faster speed of 2.05% nucleotide changes per year versus 1% shown in previous studies. Remarkably, when compared to several type 1 VDPV strains of different origins, isolates from this child showed a much higher proportion of nonsynonymous versus synonymous nucleotide changes in the capsid coding region. This anomaly could explain the high VP1 sequence drift found and the ability of these virus strains to replicate in the gut for a longer period than expected.
Comparative genome-wide analysis and evolutionary history of haemoglobin-processing and haem detoxification enzymes in malarial parasites.

PubMed

Ponsuwanna, Patrath; Kochakarn, Theerarat; Bunditvorapoom, Duangkamon; Kümpornsin, Krittikorn; Otto, Thomas D; Ridenour, Chase; Chotivanich, Kesinee; Wilairat, Prapon; White, Nicholas J; Miotto, Olivo; Chookajorn, Thanat

2016-01-29

Malaria parasites have evolved a series of intricate mechanisms to survive and propagate within host red blood cells. Intra-erythrocytic parasitism requires these organisms to digest haemoglobin and detoxify iron-bound haem. These tasks are executed by haemoglobin-specific proteases and haem biocrystallization factors that are components of a large multi-subunit complex. Since haemoglobin processing machineries are functionally and genetically linked to the modes of action and resistance mechanisms of several anti-malarial drugs, an understanding of their evolutionary history is important for drug development and drug resistance prevention. Maximum likelihood trees of genetic repertoires encoding haemoglobin processing machineries within Plasmodium species, and with the representatives of Apicomplexan species with various host tropisms, were created. Genetic variants were mapped onto existing three-dimensional structures. Genome-wide single nucleotide polymorphism data were used to analyse the selective pressure and the effect of these mutations at the structural level. Recent expansions in the falcipain and plasmepsin repertoires are unique to human malaria parasites especially in the Plasmodium falciparum and P. reichenowi lineage. Expansion of haemoglobin-specific plasmepsins occurred after the separation event of Plasmodium species, but the other members of the plasmepsin family were evolutionarily conserved with one copy for each sub-group in every Apicomplexan species. Haemoglobin-specific falcipains are separated from invasion-related falcipain, and their expansions within one specific locus arose independently in both P. falciparum and P. vivax lineages. Gene conversion between P. falciparum falcipain 2A and 2B was observed in artemisinin-resistant strains. Comparison between the numbers of non-synonymous and synonymous mutations suggests a strong selective pressure at falcipain and plasmepsin genes. The locations of amino acid changes from non-synonymous mutations mapped onto protein structures revealed clusters of amino acid residues in close proximity or near the active sites of proteases. A high degree of polymorphism at the haemoglobin processing genes implicates an imposition of selective pressure. The identification in recent years of functional redundancy of haemoglobin-specific proteases makes them less appealing as potential drug targets, but their expansions, especially in the human malaria parasite lineages, unequivocally point toward their functional significance during the independent and repetitive adaptation events in malaria parasite evolutionary history.
The Evolution of Vp1 Gene in Enterovirus C Species Sub-Group That Contains Types CVA-21, CVA-24, EV-C95, EV-C96 and EV-C99

PubMed Central

Smura, Teemu; Blomqvist, Soile; Vuorinen, Tytti; Ivanova, Olga; Samoilovich, Elena; Al-Hello, Haider; Savolainen-Kopra, Carita; Hovi, Tapani; Roivainen, Merja

2014-01-01

Genus Enterovirus (Family Picornaviridae,) consists of twelve species divided into genetically diverse types by their capsid protein VP1 coding sequences. Each enterovirus type can further be divided into intra-typic sub-clusters (genotypes). The aim of this study was to elucidate what leads to the emergence of novel enterovirus clades (types and genotypes). An evolutionary analysis was conducted for a sub-group of Enterovirus C species that contains types Coxsackievirus A21 (CVA-21), CVA-24, Enterovirus C95 (EV-C95), EV-C96 and EV-C99. VP1 gene datasets were collected and analysed to infer the phylogeny, rate of evolution, nucleotide and amino acid substitution patterns and signs of selection. In VP1 coding gene, high intra-typic sequence diversities and robust grouping into distinct genotypes within each type were detected. Within each type the majority of nucleotide substitutions were synonymous and the non-synonymous substitutions tended to cluster in distinct highly polymorphic sites. Signs of positive selection were detected in some of these highly polymorphic sites, while strong negative selection was indicated in most of the codons. Despite robust clustering to intra-typic genotypes, only few genotype-specific ‘signature’ amino acids were detected. In contrast, when different enterovirus types were compared, there was a clear tendency towards fixation of type-specific ‘signature’ amino acids. The results suggest that permanent fixation of type-specific amino acids is a hallmark associated with evolution of different enterovirus types, whereas neutral evolution and/or (frequency-dependent) positive selection in few highly polymorphic amino acid sites are the dominant forms of evolution when strains within an enterovirus type are compared. PMID:24695547
The evolution of Vp1 gene in enterovirus C species sub-group that contains types CVA-21, CVA-24, EV-C95, EV-C96 and EV-C99.

PubMed

Smura, Teemu; Blomqvist, Soile; Vuorinen, Tytti; Ivanova, Olga; Samoilovich, Elena; Al-Hello, Haider; Savolainen-Kopra, Carita; Hovi, Tapani; Roivainen, Merja

2014-01-01

Genus Enterovirus (Family Picornaviridae,) consists of twelve species divided into genetically diverse types by their capsid protein VP1 coding sequences. Each enterovirus type can further be divided into intra-typic sub-clusters (genotypes). The aim of this study was to elucidate what leads to the emergence of novel enterovirus clades (types and genotypes). An evolutionary analysis was conducted for a sub-group of Enterovirus C species that contains types Coxsackievirus A21 (CVA-21), CVA-24, Enterovirus C95 (EV-C95), EV-C96 and EV-C99. VP1 gene datasets were collected and analysed to infer the phylogeny, rate of evolution, nucleotide and amino acid substitution patterns and signs of selection. In VP1 coding gene, high intra-typic sequence diversities and robust grouping into distinct genotypes within each type were detected. Within each type the majority of nucleotide substitutions were synonymous and the non-synonymous substitutions tended to cluster in distinct highly polymorphic sites. Signs of positive selection were detected in some of these highly polymorphic sites, while strong negative selection was indicated in most of the codons. Despite robust clustering to intra-typic genotypes, only few genotype-specific 'signature' amino acids were detected. In contrast, when different enterovirus types were compared, there was a clear tendency towards fixation of type-specific 'signature' amino acids. The results suggest that permanent fixation of type-specific amino acids is a hallmark associated with evolution of different enterovirus types, whereas neutral evolution and/or (frequency-dependent) positive selection in few highly polymorphic amino acid sites are the dominant forms of evolution when strains within an enterovirus type are compared.
Digital gene expression analysis of the zebra finch genome

PubMed Central

2010-01-01

Background In order to understand patterns of adaptation and molecular evolution it is important to quantify both variation in gene expression and nucleotide sequence divergence. Gene expression profiling in non-model organisms has recently been facilitated by the advent of massively parallel sequencing technology. Here we investigate tissue specific gene expression patterns in the zebra finch (Taeniopygia guttata) with special emphasis on the genes of the major histocompatibility complex (MHC). Results Almost 2 million 454-sequencing reads from cDNA of six different tissues were assembled and analysed. A total of 11,793 zebra finch transcripts were represented in this EST data, indicating a transcriptome coverage of about 65%. There was a positive correlation between the tissue specificity of gene expression and non-synonymous to synonymous nucleotide substitution ratio of genes, suggesting that genes with a specialised function are evolving at a higher rate (or with less constraint) than genes with a more general function. In line with this, there was also a negative correlation between overall expression levels and expression specificity of contigs. We found evidence for expression of 10 different genes related to the MHC. MHC genes showed relatively tissue specific expression levels and were in general primarily expressed in spleen. Several MHC genes, including MHC class I also showed expression in brain. Furthermore, for all genes with highest levels of expression in spleen there was an overrepresentation of several gene ontology terms related to immune function. Conclusions Our study highlights the usefulness of next-generation sequence data for quantifying gene expression in the genome as a whole as well as in specific candidate genes. Overall, the data show predicted patterns of gene expression profiles and molecular evolution in the zebra finch genome. Expression of MHC genes in particular, corresponds well with expression patterns in other vertebrates. PMID:20359325
Development and characterization of a novel human Waldenström Macroglobulinemia cell line (RPCI-WM1; Roswell Park Cancer Institute-Waldenström Macroglobulinemia 1)

PubMed Central

Chitta, Kasyapa S.; Paulus, Aneel; Ailawadhi, Sikander; Foster, Barbara A.; Moser, Michael T.; Starostik, Petr; Masood, Aisha; Sher, Taimur; Miller, Kena C.; Iancu, Dan M.; Conroy, Jeffrey; Nowak, Norma J.; Sait, Sheila N.; Personett, David A.; Coleman, Morton; Furman, Richard R.; Martin, Peter; Ansell, Stephen M.; Lee, Kelvin; Chanan-Khan, Asher A.

2015-01-01

Understanding the biology of Waldenström Macroglobulinemia is hindered by a lack of preclinical models. We report a novel cell line, RPCI-WM1, from a patient treated for WM. The cell line secreted human IgM (hIgM) with k-light chain restriction identical to the primary tumor. The cell line has a modal chromosomal number of 46 and harbors chromosomal changes such as deletion of 6q21, monoallelic deletion of 9p21 (CDKN2A), 13q14 (RB1) and 18q21 (BCL-2) with a consistent amplification of 14q32 (IgH) identical to its founding tumor sample. Clonal relationship was confirmed by identical CDR3 length and single nucleotide polymorphisms as well as a matching IgH sequence of the cell line and founding tumor. Both also harbor a heterozygous, non-synonymous mutation at amino acid 265 in MYD88 gene (L265P). The cell line expresses most of the cell surface markers present on the parent cells. Over all, RPCI-WM1 represents a valuable model to study WM. PMID:22812491
Development and characterization of a novel human Waldenström macroglobulinemia cell line: RPCI-WM1, Roswell Park Cancer Institute - Waldenström Macroglobulinemia 1.

PubMed

Chitta, Kasyapa S; Paulus, Aneel; Ailawadhi, Sikander; Foster, Barbara A; Moser, Michael T; Starostik, Petr; Masood, Aisha; Sher, Taimur; Miller, Kena C; Iancu, Dan M; Conroy, Jeffrey; Nowak, Norma J; Sait, Sheila N; Personett, David A; Coleman, Morton; Furman, Richard R; Martin, Peter; Ansell, Stephen M; Lee, Kelvin; Chanan-Khan, Asher A

2013-02-01

Understanding the biology of Waldenström macroglobulinemia is hindered by a lack of preclinical models. We report a novel cell line, RPCI-WM1, from a patient treated for WM. The cell line secretes human immunoglobulin M (h-IgM) with κ-light chain restriction identical to the primary tumor. The cell line has a modal chromosomal number of 46 and harbors chromosomal changes such as deletion of 6q21, monoallelic deletion of 9p21 (CDKN2A), 13q14 (RB1) and 18q21 (BCL-2), with a consistent amplification of 14q32 (immunoglobulin heavy chain; IgH) identical to its founding tumor sample. The clonal relationship is confirmed by identical CDR3 length and single nucleotide polymorphisms as well as a matching IgH sequence of the cell line and founding tumor. Both also harbor a heterozygous, non-synonymous mutation at amino acid 265 in the MYD88 gene (L265P). The cell line expresses most of the cell surface markers present on the parent cells. Overall, RPCI-WM1 represents a valuable model to study Waldenström macroglobulinemia.
Association studies on the bovine lipoprotein lipase gene polymorphism with growth and carcass quality traits in Qinchuan cattle.

PubMed

Gui, Linsheng; Jia, Cuiling; Zhang, Yaran; Zhao, Chunping; Zan, Linsen

2016-04-01

Lipoprotein lipase (LPL) is considered as an essential enzyme in lipid deposition and tissue metabolism. It has been proposed to be a lead candidate gene for genetic markers of lipid deposition and energy balance. In this paper, polymorphisms in the LPL gene were investigated in 554 Chinese Qinchuan cattle by PCR-RFLP and DNA sequencing. Seven single nucleotide polymorphisms (SNPs) were identified, which included one mutation (g.91C > T) in the 5'untranslated region (UTR), four synonymous mutations (g.17015A > G, g.18362G > A, g.18377T > C and g.19873T > C) and two mutations (g.25225A > G and g.25316T > G) in the 3'UTR. The frequencies of SNP g.18377T > C and g.25316T > G were skewed from Hardy-Weinberg equilibrium in all the samples (chi-square test, P < 0.05). An association analysis showed that five loci (except for g.91C > T and g.18377T > C) were significantly correlated with some growth and carcass quality traits. These results demonstrate that LPL might be a potential candidate gene for marker-assisted selection (MAS). Copyright © 2016. Published by Elsevier Ltd.
Polymorphisms in the Myostatin-1 gene and their association with growth traits in Ancherythroculter nigrocauda

NASA Astrophysics Data System (ADS)

Sun, Yanhong; Li, Qing; Wang, Guiying; Zhu, Dongmei; Chen, Jian; Li, Pei; Tong, Jingou

2017-05-01

Myostatin ( MSTN) is a member of the transforming growth factor-β gene superfamily that negatively regulates skeletal muscle development and growth. In the present study, partial genomic fragments of Myostatin-1 ( MSTN-1) in two commercial hatchery populations of Ancherythroculter nigrocauda, an economically important freshwater fish, were screened for single nucleotide polymorphisms (SNPs) and then genotyped by direct sequencing of PCR products. Five SNPs were identified in intron 1 and exon 2, including a non-synonymous mutation causing an amino acid change (Val to Ile) at position 180. Association analyses based on 300 individuals revealed that the g.1129T>C SNP locus was significantly associated with total length (TL), body length (BL), body height (BH) and body weight (BW) in 6- and 18-month-old populations, while the g.1289G>A locus was significantly associated with BH and BW in the 6-month-old population. Haplotype analyses revealed that fish with the genotype combinations TC/TC or TC/GA showed better growth performance. Our results suggest that g.1129T>C and g.1289G>A have positive effects on growth traits and may be candidate gene markers for marker-assisted selection in A. nigrocauda.
Spectrum of mutations in leiomyosarcomas identified by clinical targeted next-generation sequencing.

PubMed

Lee, Paul J; Yoo, Naomi S; Hagemann, Ian S; Pfeifer, John D; Cottrell, Catherine E; Abel, Haley J; Duncavage, Eric J

2017-02-01

Recurrent genomic mutations in uterine and non-uterine leiomyosarcomas have not been well established. Using a next generation sequencing (NGS) panel of common cancer-associated genes, 25 leiomyosarcomas arising from multiple sites were examined to explore genetic alterations, including single nucleotide variants (SNV), small insertions/deletions (indels), and copy number alterations (CNA). Sequencing showed 86 non-synonymous, coding region somatic variants within 151 gene targets in 21 cases, with a mean of 4.1 variants per case; 4 cases had no putative mutations in the panel of genes assayed. The most frequently altered genes were TP53 (36%), ATM and ATRX (16%), and EGFR and RB1 (12%). CNA were identified in 85% of cases, with the most frequent copy number losses observed in chromosomes 10 and 13 including PTEN and RB1; the most frequent gains were seen in chromosomes 7 and 17. Our data show that deletions in canonical cancer-related genes are common in leiomyosarcomas. Further, the spectrum of gene mutations observed shows that defects in DNA repair and chromosomal maintenance are central to the biology of leiomyosarcomas, and that activating mutations observed in other common cancer types are rare in leiomyosarcomas. Copyright © 2017 Elsevier Inc. All rights reserved.
LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.

PubMed

Karchin, Rachel; Diekhans, Mark; Kelly, Libusha; Thomas, Daryl J; Pieper, Ursula; Eswar, Narayanan; Haussler, David; Sali, Andrej

2005-06-15

The NCBI dbSNP database lists over 9 million single nucleotide polymorphisms (SNPs) in the human genome, but currently contains limited annotation information. SNPs that result in amino acid residue changes (nsSNPs) are of critical importance in variation between individuals, including disease and drug sensitivity. We have developed LS-SNP, a genomic scale software pipeline to annotate nsSNPs. LS-SNP comprehensively maps nsSNPs onto protein sequences, functional pathways and comparative protein structure models, and predicts positions where nsSNPs destabilize proteins, interfere with the formation of domain-domain interfaces, have an effect on protein-ligand binding or severely impact human health. It currently annotates 28,043 validated SNPs that produce amino acid residue substitutions in human proteins from the SwissProt/TrEMBL database. Annotations can be viewed via a web interface either in the context of a genomic region or by selecting sets of SNPs, genes, proteins or pathways. These results are useful for identifying candidate functional SNPs within a gene, haplotype or pathway and in probing molecular mechanisms responsible for functional impacts of nsSNPs. http://www.salilab.org/LS-SNP CONTACT: rachelk@salilab.org http://salilab.org/LS-SNP/supp-info.pdf.
Genome-wide association studies to identify rice salt-tolerance markers.

PubMed

Patishtan, Juan; Hartley, Tom N; Fonseca de Carvalho, Raquel; Maathuis, Frans J M

2018-05-01

Salinity is an ever increasing menace that affects agriculture worldwide. Crops such as rice are salt sensitive, but its degree of susceptibility varies widely between cultivars pointing to extensive genetic diversity that can be exploited to identify genes and proteins that are relevant in the response of rice to salt stress. We used a diversity panel of 306 rice accessions and collected phenotypic data after short (6 h), medium (7 d) and long (30 d) salinity treatment (50 mm NaCl). A genome-wide association study (GWAS) was subsequently performed, which identified around 1200 candidate genes from many functional categories, but this was treatment period dependent. Further analysis showed the presence of cation transporters and transcription factors with a known role in salinity tolerance and those that hitherto were not known to be involved in salt stress. Localization analysis of single nucleotide polymorphisms (SNPs) showed the presence of several hundred non-synonymous SNPs (nsSNPs) in coding regions and earmarked specific genomic regions with increased numbers of nsSNPs. It points to components of the ubiquitination pathway as important sources of genetic diversity that could underpin phenotypic variation in stress tolerance. © 2017 John Wiley & Sons Ltd.
Significance of 5,10-methylenetetrahydrofolate reductase gene variants in acute lymphoblastic leukemia in Indian population: an experimental, computational and meta-analysis.

PubMed

Bellampalli, Ravishankara; Phani, Nagaraja M; Bhat, Kamalakshi G; Prasad, Krishna; Bhaskaranand, Nalini; Guruprasad, Kanive P; Rai, Padmalatha S; Satyamoorthy, Kapaettu

2015-05-01

Acute lymphoblastic leukemia (ALL) arises due to several genetic alterations in progenitor cells, and methotrexate is frequently used as part of the treatment regimen. Although there is evidence for an effect of 5,10-methylenetetrahydrofolate reductase gene (MTHFR) C677T and A1298C variations on drug response in ALL, its risk association for ALL is still unresolved. In a case-control study of 203 patients with ALL and 246 controls and meta-analysis in the Indian population, we showed an insignificant association of MTHFR C677T and A1298C genotypes with childhood and adult ALL. Comprehensive in silico characterization of non-synonymous single nucleotide polymorphisms (nsSNPs) and SNPs of the 3' untranslated region (UTR) revealed nine nsSNPs as deleterious, and three SNPs in the 3'UTR could possibly alter the binding of miRNAs. The study revealed that several overlooked SNPs may contribute to the risk of ALL susceptibility and further studies of these SNPs with functional characterization in a large sample size are required to understand the significant role of MTHFR in ALL development.
Detecting associated single-nucleotide polymorphisms on the X chromosome in case control genome-wide association studies.

PubMed

Chen, Zhongxue; Ng, Hon Keung Tony; Li, Jing; Liu, Qingzhong; Huang, Hanwen

2017-04-01

In the past decade, hundreds of genome-wide association studies have been conducted to detect the significant single-nucleotide polymorphisms that are associated with certain diseases. However, most of the data from the X chromosome were not analyzed and only a few significant associated single-nucleotide polymorphisms from the X chromosome have been identified from genome-wide association studies. This is mainly due to the lack of powerful statistical tests. In this paper, we propose a novel statistical approach that combines the information of single-nucleotide polymorphisms on the X chromosome from both males and females in an efficient way. The proposed approach avoids the need of making strong assumptions about the underlying genetic models. Our proposed statistical test is a robust method that only makes the assumption that the risk allele is the same for both females and males if the single-nucleotide polymorphism is associated with the disease for both genders. Through simulation study and a real data application, we show that the proposed procedure is robust and have excellent performance compared to existing methods. We expect that many more associated single-nucleotide polymorphisms on the X chromosome will be identified if the proposed approach is applied to current available genome-wide association studies data.

Genome-wide divergence and linkage disequilibrium analyses for Capsicum baccatum revealed by genome-anchored single nucleotide polymorphisms

USDA-ARS?s Scientific Manuscript database

Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to show the distribution of these 2 important incompatible cultivated pepper species. Estimated mean nucleotide...
Variation analysis of the severe acute respiratory syndrome coronavirus putative non-structural protein 2 gene and construction of three-dimensional model.

PubMed

Lu, Jia-hai; Zhang, Ding-mei; Wang, Guo-ling; Guo, Zhong-min; Zhang, Chuan-hai; Tan, Bing-yan; Ouyang, Li-ping; Lin, Li; Liu, Yi-min; Chen, Wei-qing; Ling, Wen-hua; Yu, Xin-bing; Zhong, Nan-shan

2005-05-05

The rapid transmission and high mortality rate made severe acute respiratory syndrome (SARS) a global threat for which no efficacious therapy is available now. Without sufficient knowledge about the SARS coronavirus (SARS-CoV), it is impossible to define the candidate for the anti-SARS targets. The putative non-structural protein 2 (nsp2) (3CL(pro), following the nomenclature by Gao et al, also known as nsp5 in Snidjer et al) of SARS-CoV plays an important role in viral transcription and replication, and is an attractive target for anti-SARS drug development, so we carried on this study to have an insight into putative polymerase nsp2 of SARS-CoV Guangdong (GD) strain. The SARS-CoV strain was isolated from a SARS patient in Guangdong, China, and cultured in Vero E6 cells. The nsp2 gene was amplified by reverse transcription-polymerase chain reaction (RT-PCR) and cloned into eukaryotic expression vector pCI-neo (pCI-neo/nsp2). Then the recombinant eukaryotic expression vector pCI-neo/nsp2 was transfected into COS-7 cells using lipofectin reagent to express the nsp2 protein. The expressive protein of SARS-CoV nsp2 was analyzed by 7% sodium dodecylsulfate polyacrylamide gel electrophoresis (SDS-PAGE). The nucleotide sequence and protein sequence of GD nsp2 were compared with that of other SARS-CoV strains by nucleotide-nucleotide basic local alignment search tool (BLASTN) and protein-protein basic local alignment search tool (BLASTP) to investigate its variance trend during the transmission. The secondary structure of GD strain and that of other strains were predicted by Garnier-Osguthorpe-Robson (GOR) Secondary Structure Prediction. Three-dimensional-PSSM Protein Fold Recognition (Threading) Server was employed to construct the three-dimensional model of the nsp2 protein. The putative polymerase nsp2 gene of GD strain was amplified by RT-PCR. The eukaryotic expression vector (pCI-neo/nsp2) was constructed and expressed the protein in COS-7 cells successfully. The result of sequencing and sequence comparison with other SARS-CoV strains showed that nsp2 gene was relatively conservative during the transmission and total five base sites mutated in about 100 strains investigated, three of which in the early and middle phases caused synonymous mutation, and another two base sites variation in the late phase resulted in the amino acid substitutions and secondary structure changes. The three-dimensional structure of the nsp2 protein was successfully constructed. The results suggest that polymerase nsp2 is relatively stable during the phase of epidemic. The amino acid and secondary structure change may be important for viral infection. The fact that majority of single nucleotide variations (SNVs) are predicted to cause synonymous, as well as the result of low mutation rate of nsp2 gene in the epidemic variations, indicates that the nsp2 is conservative and could be a target for anti-SARS drugs. The three-dimensional structure result indicates that the nsp2 protein of GD strain is high homologous with 3CL(pro) of SARS-CoV urbani strain, 3CL(pro) of transmissible gastroenteritis virus and 3CL(pro) of human coronavirus 229E strain, which further suggests that nsp2 protein of GD strain possesses the activity of 3CL(pro).
Genetic variability of the equine casein genes.

PubMed

Brinkmann, J; Jagannathan, V; Drögemüller, C; Rieder, S; Leeb, T; Thaller, G; Tetens, J

2016-07-01

The casein genes are known to be highly variable in typical dairy species, such as cattle and goat, but the knowledge about equine casein genes is limited. Nevertheless, mare milk production and consumption is gaining importance because of its high nutritive value, use in naturopathy, and hypoallergenic properties with respect to cow milk protein allergies. In the current study, the open reading frames of the 4 casein genes CSN1S1 (αS1-casein), CSN2 (β-casein), CSN1S2 (αS2-casein), and CSN3 (κ-casein) were resequenced in 253 horses of 14 breeds. The analysis revealed 21 nonsynonymous nucleotide exchanges, as well as 11 synonymous nucleotide exchanges, leading to a total of 31 putative protein isoforms predicted at the DNA level, 26 of which considered novel. Although the majority of the alleles need to be confirmed at the transcript and protein level, a preliminary nomenclature was established for the equine casein alleles. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Multilocus amplicon sequencing of Pseudomonas aeruginosa cystic fibrosis airways isolates collected prior to and after early antipseudomonal chemotherapy.

PubMed

Fischer, Sebastian; Greipel, Leonie; Klockgether, Jens; Dorda, Marie; Wiehlmann, Lutz; Cramer, Nina; Tümmler, Burkhard

2017-05-01

Early antimicrobial chemotherapy can prevent or at least delay chronic cystic fibrosis (CF) airways infections with Pseudomonas aeruginosa. During a 10-year study period P. aeruginosa was detected for the first time in 54 CF patients regularly seen at the CF centre Hannover. Amplicon sequencing of 34 loci of the P. aeruginosa core genome was performed in baseline and post-treatment isolates of the 15 CF patients who had remained P. aeruginosa - positive after the first round of antipseudomonal chemotherapy. Deep sequencing uncovered coexisting alternative nucleotides at in total 33 of 55,284 examined genome positions including six non-synonymous polymorphisms in the lasR gene, a key regulator of quorum sensing. After early treatment 42 of 50 novel nucleotide substitutions had emerged in exopolysaccharide biosynthesis, efflux pump and porin genes. Early treatment selects pathoadaptive mutations in P. aeruginosa that are typical for chronic infections of CF lungs. Copyright © 2016 European Cystic Fibrosis Society. Published by Elsevier B.V. All rights reserved.
Genetic polymorphisms in ESR1 and ESR2 genes, and risk of hypospadias in a multiethnic study population.

PubMed

Choudhry, Shweta; Baskin, Laurence S; Lammer, Edward J; Witte, John S; Dasgupta, Sudeshna; Ma, Chen; Surampalli, Abhilasha; Shen, Joel; Shaw, Gary M; Carmichael, Suzan L

2015-05-01

Estrogenic endocrine disruptors acting via estrogen receptors α (ESR1) and β (ESR2) have been implicated in the etiology of hypospadias, a common congenital malformation of the male external genitalia. We determined the association of single nucleotide polymorphisms in ESR1 and ESR2 genes with hypospadias in a racially/ethnically diverse study population of California births. We investigated the relationship between hypospadias and 108 ESR1 and 36 ESR2 single nucleotide polymorphisms in 647 cases and 877 population based nonmalformed controls among infants born in selected California counties from 1990 to 2003. Subgroup analyses were performed by race/ethnicity (nonHispanic white and Hispanic subjects) and by hypospadias severity (mild to moderate and severe). Odds ratios for 33 of the 108 ESR1 single nucleotide polymorphisms had p values less than 0.05 (p = 0.05 to 0.007) for risk of hypospadias. However, none of the 36 ESR2 single nucleotide polymorphisms was significantly associated. In stratified analyses the association results were consistent by disease severity but different sets of single nucleotide polymorphisms were significantly associated with hypospadias in nonHispanic white and Hispanic subjects. Due to high linkage disequilibrium across the single nucleotide polymorphisms, haplotype analyses were conducted and identified 6 haplotype blocks in ESR1 gene that had haplotypes significantly associated with an increased risk of hypospadias (OR 1.3 to 1.8, p = 0.04 to 0.00001). Similar to single nucleotide polymorphism analysis, different ESR1 haplotypes were associated with risk of hypospadias in nonHispanic white and Hispanic subjects. No significant haplotype association was observed for ESR2. The data provide evidence that ESR1 single nucleotide polymorphisms and haplotypes influence the risk of hypospadias in white and Hispanic subjects, and warrant further examination in other study populations. Copyright © 2015 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Influence of ghrelin gene polymorphisms on hypertension and atherosclerotic disease.

PubMed

Berthold, Heiner K; Giannakidou, Eleni; Krone, Wilhelm; Trégouët, David-Alexandre; Gouni-Berthold, Ioanna

2010-02-01

Ghrelin is involved in several metabolic and cardiovascular processes. Recent evidence suggests its involvement in blood pressure regulation and hypertension. The aim of the study was to determine associations of single-nucleotide polymorphisms (SNPs) and haplotypes of the ghrelin gene (GHRL) with hypertension and atherosclerotic disease. Six GHRL SNPs (rs27647, rs26802, rs34911341, rs696217, rs4684677 and a -473G/A (with no assigned rsID)) were investigated in a sample of 1143 hypertensive subjects and 1489 controls of Caucasian origin. Both single-locus and haplotype association analyses were performed. In single-locus analyses, only the non-synonymous rs34911341 was associated with hypertension (odds ratio (OR)=1.95 (95% confidence interval (CI): 1.26-3.02), P=0.003). Six common haplotypes with frequency >1% were inferred from the studied GHRL SNPs, and their frequency distribution was significantly different between hypertensive subjects and controls (chi(2)=12.96 with 5 d.f. (degree of freedom), P=0.024). The effect of rs26802 was found to be significantly (P=0.017) modulated by other GHRL SNPs, as its C allele conferred either an increased risk (OR=1.30 (1.08-1.57), P=0.005) or a decreased risk (OR=0.50 (0.23-1.06), P=0.07) of hypertension according to the two different haplotypes on which it can be found. No association of GHRL SNPs or haplotypes with atherosclerotic disease was observed. In conclusion, we observed statistical evidence for association between GHRL SNPs and risk of hypertension.
Compositions and methods for detecting single nucleotide polymorphisms

DOEpatents

Yeh, Hsin-Chih; Werner, James; Martinez, Jennifer S.

2016-11-22

Described herein are nucleic acid based probes and methods for discriminating and detecting single nucleotide variants in nucleic acid molecules (e.g., DNA). The methods include use of a pair of probes can be used to detect and identify polymorphisms, for example single nucleotide polymorphism in DNA. The pair of probes emit a different fluorescent wavelength of light depending on the association and alignment of the probes when hybridized to a target nucleic acid molecule. Each pair of probes is capable of discriminating at least two different nucleic acid molecules that differ by at least a single nucleotide difference. The methods can probes can be used, for example, for detection of DNA polymorphisms that are indicative of a particular disease or condition.
A novel synonymous variant in the AVP gene associated with adFNDI causes partial RNA missplicing.

PubMed

Kvistgaard, Helene; Christensen, Jane H; Johansson, Jan-Ove; Gregersen, Niels; Rittig, Charlotte; Rittig, Soeren; Corydon, Thomas Juhl

2018-06-27

Objective: Autosomal dominant familial neurohypophyseal diabetes insipidus (adFNDI) is characterized by severe polyuria and polydipsia and is caused by variations in the gene encoding the AVP prohormone. The study aimed to ascertain a correct diagnosis, to identify the underlying genetic cause of adFNDI in a Swedish kindred, and to test the hypothesis that the identified synonymous exonic variant in the AVP gene (c.324G>A), causes missplicing, and endoplasmic reticulum (ER) retention of the prohormone. Three affected family members were admitted for fluid deprivation test and dDAVP challenge test. Direct sequencing of the AVP gene was performed in affected subjects, and genotyping of the identified variant was performed in family members. The variant was examined by expression of AVP minigenes containing the entire coding regions as well as intron 2 of AVP. Clinical tests revealed significant phenotypical variation with both complete and partial adFNDI phenotype. DNA analysis revealed a synonymous c.324G>A substitution in one allele of the AVP gene in affected family members only. Cellular studies revealed both normally spliced and misspliced pre-mRNA in cells transfected with the AVP c.324G>A minigene. Confocal laser scanning microscopy showed collective localization of the variant prohormone to ER and vesicular structures at the tip of cellular processes. We have identified a synonymous variant affecting the second nucleotide of exon 3 in the AVP gene (c.324G>A) in a kindred in which adFNDI segregates. Notably, we showed that this variant causes partial missplicing of pre-mRNA resulting in accumulation of variant prohormone in ER. Our study suggests that even a small amount of aberrant mRNA might be sufficient to disturb cellular function resulting in adFNDI.
. ©2018S. Karger AG, Basel.
[Polymorphisms of inhibin α gene exon 1 in buffalo (Bubalus bubalis), gayal (Bos frontalis) and yak (Bos grunniens)].

PubMed

Miao, Yong-Wang; Ha, Fu; Gao, Hua-Shan; Yuan, Feng; Li, Da-Lin; Yuan, Yue-Yun

2012-08-01

To elucidate the genetic characteristics of the bovine Inhibin α subunit (INHA) gene, the polymorphisms in exon 1 of INHA and its bilateral sequences were assayed using PCR with direct sequencing in buffalo, gayal and yak. A comparative analysis was conducted by pooled the results in this study with the published data of INHA on some mammals including some bovine species together. A synonymous substitution c.73C>A was identified in exon 1 of INHA for buffalo, which results in identical encoding product in river and swamp buffalo. In gayal, two non-synonymous but same property substitutions in exon 1 of INHA, viz. c.62 C>T and c.187 G>A, were detected, which lead to p. P21L, p. V63M changes in INHA, respectively. In yak, nucleotide substitution c.62C> T, c.129A>G were found in exon 1 of INHA, the former still causes p. P21L substitution and the latter is synonymous. For the sequence of the 5'-flanking region of INHA examined, no SNPs were found within the species, but a substitution, c. -6T>G, was found. The nucleotide in this site in gayal, yak and cattle was c. -6G, whereas in buffalo it was c. -6T. Meanwhile, a 6-bp deletion, namely c. 262+31_262+36delTCTGAC, was found in the intron of buffalo INHA gene. For this deletion, wild types (+/+) account for main part in river buffalo while mutant types (-/-) are predominant in swamp buffalo. This deletion was not found in gayal, yak and cattle, though these all have another deletion in the intron of INHA, c. 262+78_262+79delTG. The results of sequence alignment showed that the substitutions c. 43A and c. 67G in exon 1 of INHA are specific to buffalo, whereas the substitutions c. 173A and c. 255G are exclusive to gayal, yak and cattle, and c. 24C, c. 47G, c. 174T and c. 206T are specific to goat. Furthermore, there are few differences among gayal, yak and cattle, but there relatively great differences between buffalo, goat and other bovine species regarding the sequences of INHA exon 1.
Lack of Association Between Toll-like Receptor 2 Polymorphisms (R753Q and A-16934T) and Atopic Dermatitis in Children from Thrace Region of Turkey

PubMed Central

Can, Ceren; Yazıcıoğlu, Mehtap; Gürkan, Hakan; Tozkır, Hilmi; Görgülü, Adnan; Süt, Necdet Hilmi

2017-01-01

Background: Atopic dermatitis is the most common chronic inflammatory skin disease. A complex interaction of both genetic and environmental factors is thought to contribute to the disease. Aims: To evaluate whether single nucleotide polymorphisms in the TLR2 gene c.2258C>T (R753Q) (rs5743708) and TLR2 c.-148+1614T>A (A-16934T) (rs4696480) (NM_0032643) are associated with atopic dermatitis in Turkish children. Study Design: Case-control study. Methods: The study was conducted on 70 Turkish children with atopic dermatitis aged 0.5-18 years. The clinical severity of atopic dermatitis was evaluated by the severity scoring of atopic dermatitis index. Serum total IgE levels, specific IgE antibodies to inhalant and food allergens were measured in both atopic dermatitis patients and controls, skin prick tests were done on 70 children with atopic dermatitis. Genotyping for TLR2 (R753Q and A-16934T) single nucleotide polymorphisms was performed in both atopic dermatitis patients and controls. Results: Cytosine-cytosine and cytosin-thymine genotype frequencies of the TLR2 R753Q single nucleotide polymorphism in the atopic dermatitis group were determined as being 98.6% and 1.4%, cytosine allele frequency for TLR2 R753Q single nucleotide polymorphism was determined as 99.29% and the thymine allele frequency was 0.71%, thymine-thymine, thymine-adenine, and adenine-adenine genotype frequencies of the TLR2 A-16934T single nucleotide polymorphism were 24.3%, 44.3%, and 31.4%. The thymine allele frequency for the TLR2 A-16934T single nucleotide polymorphism in the atopic dermatitis group was 46.43%, and the adenine allele frequency was 53.57%, respectively. There was not statistically significant difference between the groups for all investigated polymorphisms (p>0.05). For all single nucleotide polymorphisms studied, allelic distribution was analogous among atopic dermatitis patients and controls, and no significant statistical difference was observed. No homozygous carriers of the TLR2 R753Q single nucleotide polymorphism were found in the atopic dermatitis and control groups. Conclusion: The TLR2 (R753Q and A-16934T) single nucleotide polymorphisms are not associated with atopic dermatitis in a group of Turkish patients. PMID:28443596
Lack of Association Between Toll-like Receptor 2 Polymorphisms (R753Q and A-16934T) and Atopic Dermatitis in Children from Thrace Region of Turkey.

PubMed

Can, Ceren; Yazıcıoğlu, Mehtap; Gürkan, Hakan; Tozkır, Hilmi; Görgülü, Adnan; Süt, Necdet Hilmi

2017-05-05

Atopic dermatitis is the most common chronic inflammatory skin disease. A complex interaction of both genetic and environmental factors is thought to contribute to the disease. To evaluate whether single nucleotide polymorphisms in the TLR2 gene c.2258C>T (R753Q) (rs5743708) and TLR2 c.-148+1614T>A (A-16934T) (rs4696480) (NM_0032643) are associated with atopic dermatitis in Turkish children. Case-control study. The study was conducted on 70 Turkish children with atopic dermatitis aged 0.5-18 years. The clinical severity of atopic dermatitis was evaluated by the severity scoring of atopic dermatitis index. Serum total IgE levels, specific IgE antibodies to inhalant and food allergens were measured in both atopic dermatitis patients and controls, skin prick tests were done on 70 children with atopic dermatitis. Genotyping for TLR2 (R753Q and A-16934T) single nucleotide polymorphisms was performed in both atopic dermatitis patients and controls. Cytosine-cytosine and cytosin-thymine genotype frequencies of the TLR2 R753Q single nucleotide polymorphism in the atopic dermatitis group were determined as being 98.6% and 1.4%, cytosine allele frequency for TLR2 R753Q single nucleotide polymorphism was determined as 99.29% and the thymine allele frequency was 0.71%, thymine-thymine, thymine-adenine, and adenine-adenine genotype frequencies of the TLR2 A-16934T single nucleotide polymorphism were 24.3%, 44.3%, and 31.4%. The thymine allele frequency for the TLR2 A-16934T single nucleotide polymorphism in the atopic dermatitis group was 46.43%, and the adenine allele frequency was 53.57%, respectively. There was not statistically significant difference between the groups for all investigated polymorphisms (p>0.05). For all single nucleotide polymorphisms studied, allelic distribution was analogous among atopic dermatitis patients and controls, and no significant statistical difference was observed. No homozygous carriers of the TLR2 R753Q single nucleotide polymorphism were found in the atopic dermatitis and control groups. The TLR2 (R753Q and A-16934T) single nucleotide polymorphisms are not associated with atopic dermatitis in a group of Turkish patients.
Natural selection of K13 mutants of Plasmodium falciparum in response to artemisinin combination therapies in Thailand.

PubMed

Putaporntip, C; Kuamsab, N; Kosuwin, R; Tantiwattanasub, W; Vejakama, P; Sueblinvong, T; Seethamchai, S; Jongwutiwes, S; Hughes, A L

2016-03-01

Resistance of Plasmodium falciparum to artemisinin combination therapy (ACT) in Southeast Asia can have a devastating impact on chemotherapy and control measures. In this study, the evolution of artemisinin-resistant P. falciparum in Thailand was assessed by exploring mutations in the K13 locus believed to confer drug resistance phenotype. P. falciparum-infected blood samples were obtained from patients in eight provinces of Thailand over two decades (1991-2014; n = 904). Analysis of the K13 gene was performed by either sequencing the complete coding region (n = 259) or mutation-specific PCR-restriction fragment length polymorphism method (n = 645). K13 mutations related to artesunate resistance were detected in isolates from Trat province bordering Cambodia in 1991, about 4 years preceding widespread deployment of ACT in Thailand and increased in frequency over time. Nonsynonymous nucleotide diversity exceeded synonymous nucleotide diversity in the propeller region of the K13 gene, supporting the hypothesis that this diversity was driven by natural selection. No single mutant appeared to be favoured in every population, and propeller-region mutants were rarely observed in linkage with each other in the same haplotype. On the other hand, there was a highly significant association between the occurrence of a propeller mutant and the insertion of two or three asparagines after residue 139 of K13. Whether this insertion plays a compensatory role for deleterious effects of propeller mutants on the function of the K13 protein requires further investigation. However, modification of duration of ACT from 2-day to 3-day regimens in 2008 throughout the country does not halt the increase in frequency of mutants conferring artemisinin resistance phenotype. Copyright © 2015 European Society of Clinical Microbiology and Infectious Diseases. All rights reserved.
A syndrome of congenital microcephaly, intellectual disability and dysmorphism with a homozygous mutation in FRMD4A.

PubMed

Fine, Dina; Flusser, Hagit; Markus, Barak; Shorer, Zamir; Gradstein, Libe; Khateeb, Shareef; Langer, Yshia; Narkis, Ginat; Birk, Ruth; Galil, Aharon; Shelef, Ilan; Birk, Ohad S

2015-12-01

A consanguineous Bedouin Israeli kindred presented with a novel autosomal recessive intellectual disability syndrome of congenital microcephaly, low anterior hairline, bitemporal narrowing, low-set protruding ears, strabismus and tented thick eyebrows with sparse hair in their medial segment. Brain imaging demonstrated various degrees of agenesis of corpus callosum and hypoplasia of the vermis and cerebellum. Genome-wide linkage analysis followed by fine mapping defined a 7.67 Mb disease-associated locus (LOD score 4.99 at θ=0 for marker D10S1653). Sequencing of the 48 genes within the locus identified a single non-synonymous homozygous duplication frameshift mutation of 13 nucleotides (c.2134_2146dup13) within the coding region of FRMD4A, that was common to all affected individuals and not found in 180 non-related Bedouin controls. Three of 50 remotely related healthy controls of the same tribe were heterozygous for the mutation. FRMD4A, member of the FERM superfamily, is involved in cell structure, transport and signaling. It regulates cell polarity by playing an important role in the activation of ARF6, mediating the interaction between Par3 and the ARF6 guanine nucleotide exchange factor. ARF6 is known to modulate cell polarity in neurons, and regulates dendritic branching in hippocampal neurons and neurite outgrowth. The FRMD4 domain that is essential for determining cell polarity through interaction with Par3 is truncated by the c.2134_2146dup13 mutation. FRMD4A polymorphisms were recently suggested to be a risk factor for Alzheimer's disease. We now show a homozygous frameshift mutation of the same gene in a severe neurologic syndrome with unique dysmorphism.
Multilocus sequence analysis of Thermoanaerobacter isolates reveals recombining, but differentiated, populations from geothermal springs of the Uzon Caldera, Kamchatka, Russia

PubMed Central

Wagner, Isaac D.; Varghese, Litty B.; Hemme, Christopher L.; Wiegel, Juergen

2013-01-01

Thermal environments have island-like characteristics and provide a unique opportunity to study population structure and diversity patterns of microbial taxa inhabiting these sites. Strains having ≥98% 16S rRNA gene sequence similarity to the obligately anaerobic Firmicutes Thermoanaerobacter uzonensis were isolated from seven geothermal springs, separated by up to 1600 m, within the Uzon Caldera (Kamchatka, Russian Far East). The intraspecies variation and spatial patterns of diversity for this taxon were assessed by multilocus sequence analysis (MLSA) of 106 strains. Analysis of eight protein-coding loci (gyrB, lepA, leuS, pyrG, recA, recG, rplB, and rpoB) revealed that all loci were polymorphic and that nucleotide substitutions were mostly synonymous. There were 148 variable nucleotide sites across 8003 bp concatenates of the protein-coding loci. While pairwise FST values indicated a small but significant level of genetic differentiation between most subpopulations, there was a negligible relationship between genetic divergence and spatial separation. Strains with the same allelic profile were only isolated from the same hot spring, occasionally from consecutive years, and single locus variant (SLV) sequence types were usually derived from the same spring. While recombination occurred, there was an “epidemic” population structure in which a particular T. uzonensis sequence type rose in frequency relative to the rest of the population. These results demonstrate spatial diversity patterns for an anaerobic bacterial species in a relative small geographic location and reinforce the view that terrestrial geothermal springs are excellent places to look for biogeographic diversity patterns regardless of the involved distances. PMID:23801987
Extraordinary Genetic Diversity in a Wood Decay Mushroom.

PubMed

Baranova, Maria A; Logacheva, Maria D; Penin, Aleksey A; Seplyarskiy, Vladimir B; Safonova, Yana Y; Naumenko, Sergey A; Klepikova, Anna V; Gerasimov, Evgeny S; Bazykin, Georgii A; James, Timothy Y; Kondrashov, Alexey S

2015-10-01

Populations of different species vary in the amounts of genetic diversity they possess. Nucleotide diversity π, the fraction of nucleotides that are different between two randomly chosen genotypes, has been known to range in eukaryotes between 0.0001 in Lynx lynx and 0.16 in Caenorhabditis brenneri. Here, we report the results of a comparative analysis of 24 haploid genotypes (12 from the United States and 12 from European Russia) of a split-gill fungus Schizophyllum commune. The diversity at synonymous sites is 0.20 in the American population of S. commune and 0.13 in the Russian population. This exceptionally high level of nucleotide diversity also leads to extreme amino acid diversity of protein-coding genes. Using whole-genome resequencing of 2 parental and 17 offspring haploid genotypes, we estimate that the mutation rate in S. commune is high, at 2.0 × 10(-8) (95% CI: 1.1 × 10(-8) to 4.1 × 10(-8)) per nucleotide per generation. Therefore, the high diversity of S. commune is primarily determined by its elevated mutation rate, although high effective population size likely also plays a role. Small genome size, ease of cultivation and completion of the life cycle in the laboratory, free-living haploid life stages and exceptionally high variability of S. commune make it a promising model organism for population, quantitative, and evolutionary genetics. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Improved prediction of biochemical recurrence after radical prostatectomy by genetic polymorphisms.

PubMed

Morote, Juan; Del Amo, Jokin; Borque, Angel; Ars, Elisabet; Hernández, Carlos; Herranz, Felipe; Arruza, Antonio; Llarena, Roberto; Planas, Jacques; Viso, María J; Palou, Joan; Raventós, Carles X; Tejedor, Diego; Artieda, Marta; Simón, Laureano; Martínez, Antonio; Rioja, Luis A

2010-08-01

Single nucleotide polymorphisms are inherited genetic variations that can predispose or protect individuals against clinical events. We hypothesized that single nucleotide polymorphism profiling may improve the prediction of biochemical recurrence after radical prostatectomy. We performed a retrospective, multi-institutional study of 703 patients treated with radical prostatectomy for clinically localized prostate cancer who had at least 5 years of followup after surgery. All patients were genotyped for 83 prostate cancer related single nucleotide polymorphisms using a low density oligonucleotide microarray. Baseline clinicopathological variables and single nucleotide polymorphisms were analyzed to predict biochemical recurrence within 5 years using stepwise logistic regression. Discrimination was measured by ROC curve AUC, specificity, sensitivity, predictive values, net reclassification improvement and integrated discrimination index. The overall biochemical recurrence rate was 35%. The model with the best fit combined 8 covariates, including the 5 clinicopathological variables prostate specific antigen, Gleason score, pathological stage, lymph node involvement and margin status, and 3 single nucleotide polymorphisms at the KLK2, SULT1A1 and TLR4 genes. Model predictive power was defined by 80% positive predictive value, 74% negative predictive value and an AUC of 0.78. The model based on clinicopathological variables plus single nucleotide polymorphisms showed significant improvement over the model without single nucleotide polymorphisms, as indicated by 23.3% net reclassification improvement (p = 0.003), integrated discrimination index (p <0.001) and likelihood ratio test (p <0.001). Internal validation proved model robustness (bootstrap corrected AUC 0.78, range 0.74 to 0.82). The calibration plot showed close agreement between biochemical recurrence observed and predicted probabilities. Predicting biochemical recurrence after radical prostatectomy based on clinicopathological data can be significantly improved by including patient genetic information. Copyright (c) 2010 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
A large outbreak of acute gastroenteritis in Shippensburg, Pennsylvania, 1972 revisited: evidence for common source exposure to a recombinant GII.Pg/GII.3 norovirus.

PubMed

Johnson, J A; Parra, G I; Levenson, E A; Green, K Y

2017-06-01

Historical outbreaks can be an important source of information in the understanding of norovirus evolution and epidemiology. Here, we revisit an outbreak of undiagnosed gastroenteritis that occurred in Shippensburg, Pennsylvania in 1972. Nearly 5000 people fell ill over the course of 10 days. Symptoms included diarrhea, vomiting, stomach cramps, and fever, lasting for a median of 24 h. Using current techniques, including next-generation sequencing of full-length viral genomic amplicons, we identified an unusual norovirus recombinant (GII.Pg/GII.3) in nine of 15 available stool samples from the outbreak. This particular recombinant virus has not been reported in recent decades, although GII.3 and GII.Pg genotypes have been detected individually in current epidemic strains. The consensus nucleotide sequences were nearly identical among the four viral genomes analysed, although each strain had three to seven positions in the genome with heterogenous non-synonymous nucleotide subpopulations. Two of these resulting amino acid polymorphisms were conserved in frequency among all four cases, consistent with common source exposure and successful transmission of a mixed viral population. Continued investigation of variant nucleotide populations and recombination events among ancestral norovirus strains such as the Shippensburg virus may provide unique insight into the origin of contemporary strains.
Are mutagenic non D-loop direct repeat motifs in mitochondrial DNA under a negative selection pressure?

PubMed Central

Lakshmanan, Lakshmi Narayanan; Gruber, Jan; Halliwell, Barry; Gunawan, Rudiyanto

2015-01-01

Non D-loop direct repeats (DRs) in mitochondrial DNA (mtDNA) have been commonly implicated in the mutagenesis of mtDNA deletions associated with neuromuscular disease and ageing. Further, these DRs have been hypothesized to put a constraint on the lifespan of mammals and are under a negative selection pressure. Using a compendium of 294 mammalian mtDNA, we re-examined the relationship between species lifespan and the mutagenicity of such DRs. Contradicting the prevailing hypotheses, we found no significant evidence that long-lived mammals possess fewer mutagenic DRs than short-lived mammals. By comparing DR counts in human mtDNA with those in selectively randomized sequences, we also showed that the number of DRs in human mtDNA is primarily determined by global mtDNA properties, such as the bias in synonymous codon usage (SCU) and nucleotide composition. We found that SCU bias in mtDNA positively correlates with DR counts, where repeated usage of a subset of codons leads to more frequent DR occurrences. While bias in SCU and nucleotide composition has been attributed to nucleotide mutational bias, mammalian mtDNA still exhibit higher SCU bias and DR counts than expected from such mutational bias, suggesting a lack of negative selection against non D-loop DRs. PMID:25855815
[Meta-analysis on relationship between single nucleotide polymorphism of rs2231142 in ABCG2 gene and gout in East Asian population].

PubMed

Wu, Lei; He, Yao; Zhang, Di

2015-11-01

To systematically evaluate the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout in East Asian population. The literature retrieval was conducted by using English databases (Medline, EMbase), Chinese databases (CNKI, Vip, Wanfang, SinaMed) and others to collect the published papers on the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout by the end of December 2014. Meta-analysis was performed with software Stata 12.0. Nine studies were included. There were significant associations between increased risk of gout and single nucleotide polymorphism of rs2231142, the combined OR was 2.04 (95%CI: 1.82-2.28) for A allele and C allele, 1.97 (95%CI: 1.57-2.48) for CA and CC, 3.71 (95%CI: 3.07-4.47) for AA and CC. Sex and region specific subgroup analysis showed less heterogeneity. There is significant association between gout and single nucleotide polymorphism of rs2231142 in East Asian population, and A allele is a high risk gene for gout.
CNTNAP2 Is Significantly Associated With Speech Sound Disorder in the Chinese Han Population.

PubMed

Zhao, Yun-Jing; Wang, Yue-Ping; Yang, Wen-Zhu; Sun, Hong-Wei; Ma, Hong-Wei; Zhao, Ya-Ru

2015-11-01

Speech sound disorder is the most common communication disorder. Some investigations support the possibility that the CNTNAP2 gene might be involved in the pathogenesis of speech-related diseases. To investigate single-nucleotide polymorphisms in the CNTNAP2 gene, 300 unrelated speech sound disorder patients and 200 normal controls were included in the study. Five single-nucleotide polymorphisms were amplified and directly sequenced. Significant differences were found in the genotype (P = .0003) and allele (P = .0056) frequencies of rs2538976 between patients and controls. The excess frequency of the A allele in the patient group remained significant after Bonferroni correction (P = .0280). A significant haplotype association with rs2710102T/+rs17236239A/+2538976A/+2710117A (P = 4.10e-006) was identified. A neighboring single-nucleotide polymorphism, rs10608123, was found in complete linkage disequilibrium with rs2538976, and the genotypes exactly corresponded to each other. The authors propose that these CNTNAP2 variants increase the susceptibility to speech sound disorder. The single-nucleotide polymorphisms rs10608123 and rs2538976 may merge into one single-nucleotide polymorphism. © The Author(s) 2015.

Cacao single-nucleotide polymorphism (SNP) markers: A discovery strategy to identify SNPs for genotyping, genetic mapping and genome wide association studies (GWAS)

USDA-ARS?s Scientific Manuscript database

Single-nucleotide polymorphisms (SNPs) are the most common genetic markers in Theobroma cacao, occurring approximately once in every 200 nucleotides. SNPs, like microsatellites, are co-dominant and PCR-based, but they have several advantages over microsatellites. They are unambiguous, so that a SN...
Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

DOEpatents

McCutchen-Maloney, Sandra L.

2002-01-01

DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.
The association of single-nucleotide polymorphisms in the oxytocin receptor and G protein-coupled receptor kinase 6 (GRK6) genes with oxytocin dosing requirements and labor outcomes.

PubMed

Grotegut, Chad A; Ngan, Emily; Garrett, Melanie E; Miranda, Marie Lynn; Ashley-Koch, Allison E; Swamy, Geeta K

2017-09-01

Oxytocin is a potent uterotonic agent that is widely used for induction and augmentation of labor. Oxytocin has a narrow therapeutic index and the optimal dosing for any individual woman varies widely. The objective of this study was to determine whether genetic variation in the oxytocin receptor (OXTR) or in the gene encoding G protein-coupled receptor kinase 6 (GRK6), which regulates desensitization of the oxytocin receptor, could explain variation in oxytocin dosing and labor outcomes among women being induced near term. Pregnant women with a singleton gestation residing in Durham County, NC, were prospectively enrolled as part of the Healthy Pregnancy, Healthy Baby cohort study. Those women undergoing an induction of labor at 36 weeks or greater were genotyped for 18 haplotype-tagging single-nucleotide polymorphisms in OXTR and 7 haplotype-tagging single-nucleotide polymorphisms in GRK6 using TaqMan assays. Linear regression was used to examine the relationship between maternal genotype and maximal oxytocin infusion rate, total oxytocin dose received, and duration of labor. Logistic regression was used to test for the association of maternal genotype with mode of delivery. For each outcome, backward selection techniques were utilized to control for important confounding variables and additive genetic models were used. Race/ethnicity was included in all models because of differences in allele frequencies across populations, and Bonferroni correction for multiple testing was used. DNA was available from 482 women undergoing induction of labor at 36 weeks or greater. Eighteen haplotype-tagging single-nucleotide polymorphisms within OXTR and 7 haplotype-tagging single-nucleotide polymorphisms within GRK6 were examined. Five single-nucleotide polymorphisms in OXTR showed nominal significance with maximal infusion rate of oxytocin, and two single-nucleotide polymorphisms in OXTR were associated with total oxytocin dose received. One single-nucleotide polymorphism in OXTR and two single-nucleotide polymorphisms in GRK6 were associated with duration of labor, one of which met the multiple testing threshold (P = .0014, rs2731664 [GRK6], mean duration of labor, 17.7 hours vs 20.2 hours vs 23.5 hours for AA, AC, and CC genotypes, respectively). Three single-nucleotide polymorphisms, two in OXTR and one in GRK6, showed nominal significance with mode of delivery. Genetic variation in OXTR and GRK6 is associated with the amount of oxytocin required as well as the duration of labor and risk for cesarean delivery among women undergoing induction of labor near term. With further research, pharmacogenomic approaches may potentially be utilized to develop personalized treatment to improve safety and efficacy outcomes among women undergoing induction of labor. Copyright © 2017 Elsevier Inc. All rights reserved.
Switches in Genomic GC Content Drive Shifts of Optimal Codons under Sustained Selection on Synonymous Sites

PubMed Central

Sun, Yu; Tamarit, Daniel

2017-01-01

Abstract The major codon preference model suggests that codons read by tRNAs in high concentrations are preferentially utilized in highly expressed genes. However, the identity of the optimal codons differs between species although the forces driving such changes are poorly understood. We suggest that these questions can be tackled by placing codon usage studies in a phylogenetic framework and that bacterial genomes with extreme nucleotide composition biases provide informative model systems. Switches in the background substitution biases from GC to AT have occurred in Gardnerella vaginalis (GC = 32%), and from AT to GC in Lactobacillus delbrueckii (GC = 62%) and Lactobacillus fermentum (GC = 63%). We show that despite the large effects on codon usage patterns by these switches, all three species evolve under selection on synonymous sites. In G. vaginalis, the dramatic codon frequency changes coincide with shifts of optimal codons. In contrast, the optimal codons have not shifted in the two Lactobacillus genomes despite an increased fraction of GC-ending codons. We suggest that all three species are in different phases of an on-going shift of optimal codons, and attribute the difference to a stronger background substitution bias and/or longer time since the switch in G. vaginalis. We show that comparative and correlative methods for optimal codon identification yield conflicting results for genomes in flux and discuss possible reasons for the mispredictions. We conclude that switches in the direction of the background substitution biases can drive major shifts in codon preference patterns even under sustained selection on synonymous codon sites. PMID:27540085
Bovine exome sequence analysis and targeted SNP genotyping of recessive fertility defects BH1, HH2, and HH3 reveal a putative causative mutation in SMC2 for HH3.

PubMed

McClure, Matthew C; Bickhart, Derek; Null, Dan; Vanraden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B; Van Tassell, Curtis P; Sonstegard, Tad S

2014-01-01

The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array.
Bovine Exome Sequence Analysis and Targeted SNP Genotyping of Recessive Fertility Defects BH1, HH2, and HH3 Reveal a Putative Causative Mutation in SMC2 for HH3

PubMed Central

McClure, Matthew C.; Bickhart, Derek; Null, Dan; VanRaden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B.; Van Tassell, Curtis P.; Sonstegard, Tad S.

2014-01-01

The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array. PMID:24667746
Mitochondrial pathogenic mutations are population-specific.

PubMed

Breen, Michael S; Kondrashov, Fyodor A

2010-12-31

Surveying deleterious variation in human populations is crucial for our understanding, diagnosis and potential treatment of human genetic pathologies. A number of recent genome-wide analyses focused on the prevalence of segregating deleterious alleles in the nuclear genome. However, such studies have not been conducted for the mitochondrial genome. We present a systematic survey of polymorphisms in the human mitochondrial genome, including those predicted to be deleterious and those that correspond to known pathogenic mutations. Analyzing 4458 completely sequenced mitochondrial genomes we characterize the genetic diversity of different types of single nucleotide polymorphisms (SNPs) in African (L haplotypes) and non-African (M and N haplotypes) populations. We find that the overall level of polymorphism is higher in the mitochondrial compared to the nuclear genome, although the mitochondrial genome appears to be under stronger selection as indicated by proportionally fewer nonsynonymous than synonymous substitutions. The African mitochondrial genomes show higher heterozygosity, a greater number of polymorphic sites and higher frequencies of polymorphisms for synonymous, benign and damaging polymorphism than non-African genomes. However, African genomes carry significantly fewer SNPs that have been previously characterized as pathogenic compared to non-African genomes. Finding SNPs classified as pathogenic to be the only category of polymorphisms that are more abundant in non-African genomes is best explained by a systematic ascertainment bias that favours the discovery of pathogenic polymorphisms segregating in non-African populations. This further suggests that, contrary to the common disease-common variant hypothesis, pathogenic mutations are largely population-specific and different SNPs may be associated with the same disease in different populations. Therefore, to obtain a comprehensive picture of the deleterious variability in the human population, as well as to improve the diagnostics of individuals carrying African mitochondrial haplotypes, it is necessary to survey different populations independently. This article was reviewed by Dr Mikhail Gelfand, Dr Vasily Ramensky (nominated by Dr Eugene Koonin) and Dr David Rand (nominated by Dr Laurence Hurst).
Fine Mapping and Functional Analysis of the Multiple Sclerosis Risk Gene CD6

PubMed Central

Swaminathan, Bhairavi; Cuapio, Angélica; Alloza, Iraide; Matesanz, Fuencisla; Alcina, Antonio; García-Barcina, Maria; Fedetz, Maria; Fernández, Óscar; Lucas, Miguel; Órpez, Teresa; Pinto-Medel, Mª Jesus; Otaegui, David; Olascoaga, Javier; Urcelay, Elena; Ortiz, Miguel A.; Arroyo, Rafael; Oksenberg, Jorge R.; Antigüedad, Alfredo; Tolosa, Eva; Vandenbroeck, Koen

2013-01-01

CD6 has recently been identified and validated as risk gene for multiple sclerosis (MS), based on the association of a single nucleotide polymorphism (SNP), rs17824933, located in intron 1. CD6 is a cell surface scavenger receptor involved in T-cell activation and proliferation, as well as in thymocyte differentiation. In this study, we performed a haptag SNP screen of the CD6 gene locus using a total of thirteen tagging SNPs, of which three were non-synonymous SNPs, and replicated the recently reported GWAS SNP rs650258 in a Spanish-Basque collection of 814 controls and 823 cases. Validation of the six most strongly associated SNPs was performed in an independent collection of 2265 MS patients and 2600 healthy controls. We identified association of haplotypes composed of two non-synonymous SNPs [rs11230563 (R225W) and rs2074225 (A257V)] in the 2nd SRCR domain with susceptibility to MS (P max(T) permutation = 1×10−4). The effect of these haplotypes on CD6 surface expression and cytokine secretion was also tested. The analysis showed significantly different CD6 expression patterns in the distinct cell subsets, i.e. – CD4+ naïve cells, P = 0.0001; CD8+ naïve cells, P<0.0001; CD4+ and CD8+ central memory cells, P = 0.01 and 0.05, respectively; and natural killer T (NKT) cells, P = 0.02; with the protective haplotype (RA) showing higher expression of CD6. However, no significant changes were observed in natural killer (NK) cells, effector memory and terminally differentiated effector memory T cells. Our findings reveal that this new MS-associated CD6 risk haplotype significantly modifies expression of CD6 on CD4+ and CD8+ T cells. PMID:23638056
Contrasting association of a non-synonymous leptin receptor gene polymorphism with Wegener's granulomatosis and Churg-Strauss syndrome.

PubMed

Wieczorek, Stefan; Holle, Julia U; Bremer, Jan P; Wibisono, David; Moosig, Frank; Fricke, Harald; Assmann, Gunter; Harper, Lorraine; Arning, Larissa; Gross, Wolfgang L; Epplen, Joerg T

2010-05-01

There is evidence that the leptin/ghrelin system is involved in T-cell regulation and plays a role in (auto)immune disorders such as SLE, RA and ANCA-associated vasculitides (AAVs). Here, we evaluate the genetic background of this system in WG. We screened variations in the genes encoding leptin, ghrelin and their receptors, the leptin receptor (LEPR) and the growth hormone secretagogue receptor (GHSR). Three single nucleotide polymorphisms (SNPs) in each gene region were analysed in 460 German WG cases and 878 ethnically matched healthy controls. A three-SNP haplotype of GHSR was significantly associated with WG [P = 0.0067; corrected P-value (P(c)) = 0.026; odds ratio (OR) = 1.30; 95% CI 1.08, 1.57], as was one non-synonymous SNP in LEPR (Lys656Asn, P = 0.0034; P(c) = 0.013; OR = 0.72; 95% CI 0.58, 0.90). These four SNPs were re-analysed in independent cohorts of 226 German WG cases and 519 controls. While the GHSR association was not confirmed, allele frequencies of the LEPR SNP were virtually identical to those from the initial cohorts. Analysis of this SNP in the combined WG and control panels revealed a significant association of the LEPR 656Lys allele with WG (P = 0.00032; P(c) = 0.0013; OR = 0.72; 95% CI 0.60, 0.86). Remarkably, the Lys656Asn SNP showed contrasting allele distribution in two cohorts of 108 and 88 German cases diagnosed with Churg-Strauss syndrome (CSS, combined P = 0.0067; OR = 1.41; 95% CI 1.10, 1.81), whereas identical allele frequencies were revealed when comparing British WG and microscopic polyangiitis cases. While GHSR has to be further evaluated, these data provide profound evidence for an association of the LEPR Lys656Asn SNP with AAV, resulting in opposing effects in WG and CSS.
Long-Term Evolution of the Hypervariable Region of Hepatitis C Virus in a Common-Source-Infected Cohort

PubMed Central

McAllister, Jane; Casino, Carmela; Davidson, Fiona; Power, Joan; Lawlor, Emer; Yap, Peng Lee; Simmonds, Peter; Smith, Donald B.

1998-01-01

The long-term evolution of the hepatitis C virus hypervariable region (HVR) and flanking regions of the E1 and E2 envelope proteins have been studied in a cohort of women infected from a common source of anti-D immunoglobulin. Whereas virus sequences in the infectious source were relatively homogeneous, distinct HVR variants were observed in each anti-D recipient, indicating that this region can evolve in multiple directions from the same point. Where HVR variants with dissimilar sequences were present in a single individual, the frequency of synonymous substitution in the flanking regions suggested that the lineages diverged more than a decade previously. Even where a single major HVR variant was present in an infected individual, this lineage was usually several years old. Multiple lineages can therefore coexist during long periods of chronic infection without replacement. The characteristics of amino acid substitution in the HVR were not consistent with the random accumulation of mutations and imply that amino acid replacement in the HVR was strongly constrained. Another variable region of E2 centered on codon 60 shows similar constraints, while HVR2 was relatively unconstrained. Several of these features are difficult to explain if a neutralizing immune response against the HVR is the only selective force operating on E2. The impact of PCR artifacts such as nucleotide misincorporation and the shuffling of dissimilar templates is discussed. PMID:9573256
Feline hypersomatotropism and acromegaly tumorigenesis: a potential role for the AIP gene.

PubMed

Scudder, C J; Niessen, S J; Catchpole, B; Fowkes, R C; Church, D B; Forcada, Y

2017-04-01

Acromegaly in humans is usually sporadic, however up to 20% of familial isolated pituitary adenomas are caused by germline sequence variants of the aryl-hydrocarbon-receptor interacting protein (AIP) gene. Feline acromegaly has similarities to human acromegalic families with AIP mutations. The aim of this study was to sequence the feline AIP gene, identify sequence variants and compare the AIP gene sequence between feline acromegalic and control cats, and in acromegalic siblings. The feline AIP gene was amplified through PCR using whole blood genomic DNA from 10 acromegalic and 10 control cats, and 3 sibling pairs affected by acromegaly. PCR products were sequenced and compared with the published predicted feline AIP gene. A single nonsynonymous SNP was identified in exon 1 (AIP:c.9T > G) of two acromegalic cats and none of the control cats, as well as both members of one sibling pair. The region of this SNP is considered essential for the interaction of the AIP protein with its receptor. This sequence variant has not previously been reported in humans. Two additional synonymous sequence variants were identified (AIP:c.481C > T and AIP:c.826C > T). This is the first molecular study to investigate a potential genetic cause of feline acromegaly and identified a nonsynonymous AIP single nucleotide polymorphism in 20% of the acromegalic cat population evaluated, as well as in one of the sibling pairs evaluated. Copyright © 2016 Elsevier Inc. All rights reserved.
Bovine GDF10 gene polymorphism analysis and its association with body measurement traits in Chinese indigenous cattle.

PubMed

Adoligbe, C; Zan, Linsen; Farougou, S; Wang, Hongbao; Ujjan, J A

2012-04-01

The objective of this research was to detect bovine GDF10 gene polymorphism and analyze its association with body measurement traits (BMT) of animals sampled from 6 different Chinese indigenous cattle populations. The populations included Xuelong (Xl), Luxi (Lx), Qinchuan (Qc), Jiaxian red (Jx), Xianang (Xn) and Nanyang (Ny). Blood samples were taken from a total of 417 female animals stratified into age categories of 12-36 months. Polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) was employed to find out GDF10 single polymorphism nucleotide (SNPs) and explore their possible association with BMT. Sequence analysis of GDF10 gene revealed 3 SNPs in total: 1 in exon1 (G142A) and 2 in exon3 (A11471G, and T12495C). G142A and T12495C SNPs are both synonymous mutation. They showed 2 genotypes namely respectively (GG, GA) and (PP and PB). A11471G SNP is a missense mutation leading to the change of Alanine to Threonine amino acid. It showed three genotypes namely AA, BB and AB. Analysis of association of polymorphism with body measurement traits at the three locus showed that there were significant effects on BMT in Qc, Jx and Ny cattle population. These results suggest that the GDF10 gene might have potential effects on body measurement traits in the above mentioned cattle populations and could be used for marker-assisted selection.
Australian bat lyssavirus: a recently discovered new rhabdovirus.

PubMed

Warrilow, D

2005-01-01

Australian bat lyssavirus (ABLV), first identified in 1996, has been associated with two human fatalities. ABLV is genetically and serologically distinct from, but is closely related to, classical rabies. It has a bullet-shaped morphology by electron microscopy. There are two strains of ABLV known: one circulates in frugivorous bats, sub-order Megachiroptera, and the other circulates in the smaller, mainly insectivorous bats, sub-order Microchiroptera. Each strain has been associated with one human fatality. Surveillance indicates infected bats are widespread at a low frequency on the Australian mainland. It is unclear how long ABLV has been present in Australia, although molecular clock studies suggest the two strains separated 950 or 1,700 years ago based on synonymous or non-synonymous nucleotide changes, respectively. Recent serological surveys suggest a closely related virus may exist in the Philippines. Due to demonstrated cross-protection in mice, rabies vaccine is used to prevent infection. Rabies post-exposure prophylaxis (PEP) protocols have been adopted for when a human is scratched or bitten by a suspect bat. A long-term commitment to public health programs that test bats that have been involved in scratch or bite incidents, followed by PEP if appropriate, will be necessary to minimise further human infection.
A PYY Q62P variant linked to human obesity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ahituv, Nadav; Kavaslar, Nihan; Schackwitz, Wendy

2005-06-27

Members of the pancreatic polypeptide family and the irreceptors have been implicated in the control of food intake in rodents and humans. To investigate whether nucleotide changes in these candidate genes result in abnormal weight in humans, we sequenced the coding exons and splice sites of seven family members (NPY, PYY, PPY, NPY1R, NPY2R, NPY4R, and NPY5R) in a large cohort of extremely obese (n=379) and lean (n=378) individuals. In total we found eleven rare non-synonymous variants, four of which exhibited familial segregation, NPY1R L53P and PPY P63L with leanness and NPY2R D42G and PYY Q62P with obesity. Functional analysismore » of the obese variants revealed NPY2R D42G to have reduced cell surface expression, while previous cell culture based studies indicated variant PYY Q62P to have altered receptor binding selectivity and we show that it fails to reduce food intake through mouse peptide injection experiments. These results support that rare non-synonymous variants within these genes can alter susceptibility to human body mass index extremes.« less
Complete mitochondrial genome sequence from an endangered Indian snake, Python molurus molurus (Serpentes, Pythonidae).

PubMed

Dubey, Bhawna; Meganathan, P R; Haque, Ikramul

2012-07-01

This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising of 37 genes including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G% on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes, with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.
Genetic characterization of the dihydrofolate reductase gene of Pneumocystis jirovecii isolates from Portugal.

PubMed

Costa, Marina C; Esteves, Francisco; Antunes, Francisco; Matos, Olga

2006-12-01

The aim of the present study was to evaluate the genetic variation of Pneumocystis jirovecii dihydrofolate reductase (DHFR) gene in an immunocompromised Portuguese population and to investigate the possible association between DHFR genotypes and P. jirovecii pneumonia (PcP) prophylaxis with co-trimoxazole. One hundred and thirty-eight P. jirovecii isolates were submitted to DHFR genetic characterization by PCR and sequencing. In the studied population, 72.7% of the patients presented sequences identical to the wild-type sequence of the P. jirovecii DHFR gene and 27.3% presented point substitutions. A total of nine substitution sites were identified; four synonymous substitutions at nucleotide positions 201, 272, 312 and 381 were detected in 31 patients. Five non-synonymous substitutions were observed, leading to the DHFR mutations Leu-13-->Ser, Asn-23-->Ser, Ser-31-->Phe, Met-52-->Leu and Ala-67-->Val. With the exception of the polymorphism at position 312 and the mutation at codon 52, all polymorphisms were reported in this study for the first time. Our results suggest that DHFR gene polymorphisms are frequent in the Portuguese immunocompromised population but do not seem to be associated with PcP prophylaxis failure (P = 0.748 and P = 0.730).
Divergence of RNA polymerase α subunits in angiosperm plastid genomes is mediated by genomic rearrangement.

PubMed

Blazier, J Chris; Ruhlman, Tracey A; Weng, Mao-Lun; Rehman, Sumaiyah K; Sabir, Jamal S M; Jansen, Robert K

2016-04-18

Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA.
Complex codon usage pattern and compositional features of retroviruses.

PubMed

RoyChoudhury, Sourav; Mukherjee, Debaprasad

2013-01-01

Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Next-generation sequencing traces human induced pluripotent stem cell lines clonally generated from heterogeneous cancer tissue.

PubMed

Ishikawa, Tetsuya

2017-05-26

To investigate genotype variation among induced pluripotent stem cell (iPSC) lines that were clonally generated from heterogeneous colon cancer tissues using next-generation sequencing. Human iPSC lines were clonally established by selecting independent single colonies expanded from heterogeneous primary cells of S-shaped colon cancer tissues by retroviral gene transfer ( OCT3/4 , SOX2 , and KLF4 ). The ten iPSC lines, their starting cancer tissues, and the matched adjacent non-cancerous tissues were analyzed using next-generation sequencing and bioinformatics analysis using the human reference genome hg19. Non-synonymous single-nucleotide variants (SNVs) (missense, nonsense, and read-through) were identified within the target region of 612 genes related to cancer and the human kinome. All SNVs were annotated using dbSNP135, CCDS, RefSeq, GENCODE, and 1000 Genomes. The SNVs of the iPSC lines were compared with the genotypes of the cancerous and non-cancerous tissues. The putative genotypes were validated using allelic depth and genotype quality. For final confirmation, mutated genotypes were manually curated using the Integrative Genomics Viewer. In eight of the ten iPSC lines, one or two non-synonymous SNVs in EIF2AK2 , TTN , ULK4 , TSSK1B , FLT4 , STK19 , STK31 , TRRAP , WNK1 , PLK1 or PIK3R5 were identified as novel SNVs and were not identical to the genotypes found in the cancer and non-cancerous tissues. This result suggests that the SNVs were de novo or pre-existing mutations that originated from minor populations, such as multifocal pre-cancer (stem) cells or pre-metastatic cancer cells from multiple, different clonal evolutions, present within the heterogeneous cancer tissue. The genotypes of all ten iPSC lines were different from the mutated ERBB2 and MKNK2 genotypes of the cancer tissues and were identical to those of the non-cancerous tissues and that found in the human reference genome hg19. Furthermore, two of the ten iPSC lines did not have any confirmed mutated genotypes, despite being derived from cancerous tissue. These results suggest that the traceability and preference of the starting single cells being derived from pre-cancer (stem) cells, stroma cells such as cancer-associated fibroblasts, and immune cells that co-existed in the tissues along with the mature cancer cells. The genotypes of iPSC lines derived from heterogeneous cancer tissues can provide information on the type of starting cell that the iPSC line was generated from.
Infectious mononucleosis-linked HLA class I single nucleotide polymorphism is associated with multiple sclerosis.

PubMed

Jafari, Naghmeh; Broer, Linda; Hoppenbrouwers, Ilse A; van Duijn, Cornelia M; Hintzen, Rogier Q

2010-11-01

Multiple sclerosis is a presumed autoimmune disease associated with genetic and environmental risk factors such as infectious mononucleosis. Recent research has shown infectious mononucleosis to be associated with a specific HLA class I polymorphism. Our aim was to test if the infectious mononucleosis-linked HLA class I single nucleotide polymorphism (rs6457110) is also associated with multiple sclerosis. Genotyping of the HLA-A single nucleotide polymorphism rs6457110 using TaqMan was performed in 591 multiple sclerosis cases and 600 controls. The association of multiple sclerosis with the HLA-A single nucleotide polymorphism was tested using logistic regression adjusted for age, sex and HLA-DRB1*1501. HLA-A minor allele (A) is associated with multiple sclerosis (OR = 0.68; p = 4.08 × 10( -5)). After stratification for HLA-DRB1*1501 risk allele (T) carrier we showed a significant OR of 0.70 (p = 0.003) for HLA-A. HLA class I single nucleotide polymorphism rs6457110 is associated with infectious mononucleosis and multiple sclerosis, independent of the major class II allele, supporting the hypothesis that shared genetics may contribute to the association between infectious mononucleosis and multiple sclerosis.

Electrical detection and quantification of single and mixed DNA nucleotides in suspension

NASA Astrophysics Data System (ADS)

Ahmad, Mahmoud Al; Panicker, Neena G.; Rizvi, Tahir A.; Mustafa, Farah

2016-09-01

High speed sequential identification of the building blocks of DNA, (deoxyribonucleotides or nucleotides for short) without labeling or processing in long reads of DNA is the need of the hour. This can be accomplished through exploiting their unique electrical properties. In this study, the four different types of nucleotides that constitute a DNA molecule were suspended in a buffer followed by performing several types of electrical measurements. These electrical parameters were then used to quantify the suspended DNA nucleotides. Thus, we present a purely electrical counting scheme based on the semiconductor theory that allows one to determine the number of nucleotides in a solution by measuring their capacitance-voltage dependency. The nucleotide count was observed to be similar to the multiplication of the corresponding dopant concentration and debye volume after de-embedding the buffer contribution. The presented approach allows for a fast and label-free quantification of single and mixed nucleotides in a solution.
Effects of rs6234/rs6235 and rs6232/rs6234/rs6235 PCSK1 single-nucleotide polymorphism clusters on proprotein convertase 1/3 biosynthesis and activity.

PubMed

Mbikay, Majambu; Sirois, Francine; Nkongolo, Kabwe K; Basak, Ajoy; Chrétien, Michel

2011-12-01

Proprotein convertase 1/3 (PC1/3) is one of the endoproteases initiating the proteolytic activation of prohormones and proneuropeptides in the secretory pathway. It is produced as a zymogen that is subsequently modified by activity-determining cleavages at the amino and the carboxyl termini. In human, it is encoded by the PCSK1 locus on chromosome 5. Spontaneous inactivating mutations in its gene have been linked to obesity. Minor alleles of the common non-synonymous single-nucleotide polymorphisms (SNPs) rs6232 (T>C, N221D), rs6234 (G>C, Q665E) and rs6235 (C>G, S690T) have been associated with increased risk of obesity. We have shown that the variations associated with these SNPs are linked on minor PCSK1 alleles. In this study, we examined the impact of amino acid substitutions specified by the minor PCSK1 alleles on PC1/3 biosynthesis and prohormone processing activity in cultured cells. The common and variant isoforms of PC1/3 were expressed in transfected rat pituitary GH4C1 cells with or without proopiomelanocortin (POMC) as a substrate. Secreted PC1/3- or POMC-related proteins and peptides were analyzed by immunoblotting and immunoprecipitation. When expressed in GH4C1 cells, the triple-variant PC1/3 underwent significantly more proteolytic processing at the amino and carboxyl termini than the common and double-variant isoforms. However, there was no detectable difference among these isoforms in their ability to process POMC in the transfected cells. Since truncation of PC1/3 in its C-terminal region reportedly renders the enzyme unstable, we speculate that the accentuated processing of the triple variant in this region may, in vivo, create a subtle deficit of PC1/3 enzymatic activity in endocrine and neuroendocrine cells, causing impaired processing of prohormones and proneuropeptides to their bioactive forms. Copyright © 2011 Elsevier Inc. All rights reserved.
Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

PubMed

Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

2010-07-16

Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.
Targeted deep sequencing identifies rare loss-of-function variants in IFNGR1 for risk of atopic dermatitis complicated by eczema herpeticum.

PubMed

Gao, Li; Bin, Lianghua; Rafaels, Nicholas M; Huang, Lili; Potee, Joseph; Ruczinski, Ingo; Beaty, Terri H; Paller, Amy S; Schneider, Lynda C; Gallo, Rich; Hanifin, Jon M; Beck, Lisa A; Geha, Raif S; Mathias, Rasika A; Barnes, Kathleen C; Leung, Donald Y M

2015-12-01

A subset of atopic dermatitis is associated with increased susceptibility to eczema herpeticum (ADEH+). We previously reported that common single nucleotide polymorphisms (SNPs) in the IFN-γ (IFNG) and IFN-γ receptor 1 (IFNGR1) genes were associated with the ADEH+ phenotype. We sought to interrogate the role of rare variants in interferon pathway genes for the risk of ADEH+. We performed targeted sequencing of interferon pathway genes (IFNG, IFNGR1, IFNAR1, and IL12RB1) in 228 European American patients with AD selected according to their eczema herpeticum status, and severity was measured by using the Eczema Area and Severity Index. Replication genotyping was performed in independent samples of 219 European American and 333 African American subjects. Functional investigation of loss-of-function variants was conducted by using site-directed mutagenesis. We identified 494 single nucleotide variants encompassing 105 kb of sequence, including 145 common, 349 (70.6%) rare (minor allele frequency <5%), and 86 (17.4%) novel variants, of which 2.8% were coding synonymous, 93.3% were noncoding (64.6% intronic), and 3.8% were missense. We identified 6 rare IFNGR1 missense variants, including 3 damaging variants (Val14Met [V14M], Val61Ile, and Tyr397Cys [Y397C]) conferring a higher risk for ADEH+ (P = .031). Variants V14M and Y397C were confirmed to be deleterious, leading to partial IFNGR1 deficiency. Seven common IFNGR1 SNPs, along with common protective haplotypes (2-7 SNPs), conferred a reduced risk of ADEH+ (P = .015-.002 and P = .0015-.0004, respectively), and both SNP and haplotype associations were replicated in an independent African American sample (P = .004-.0001 and P = .001-.0001, respectively). Our results provide evidence that both genetic variants in the gene encoding IFNGR1 are implicated in susceptibility to the ADEH+ phenotype. Copyright © 2015 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Targeted Deep Sequencing Identifies Rare ‘loss-of-function’ Variants in IFNGR1 for Risk of Atopic Dermatitis Complicated by Eczema Herpeticum

PubMed Central

Gao, Li; Rafaels, Nicholas M; Huang, Lili; Potee, Joseph; Ruczinski, Ingo; Beaty, Terri H.; Paller, Amy S.; Schneider, Lynda C.; Gallo, Rich; Hanifin, Jon M.; Beck, Lisa A.; Geha, Raif S.; Mathias, Rasika A.; Leung, Donald Y. M.

2015-01-01

Background A subset of atopic dermatitis (AD) is associated with increased susceptibility to eczema herpeticum (ADEH+). We previously reported that common single nucleotide polymorphisms (SNPs) in interferon-gamma (IFNG) and receptor 1 (IFNGR1) were associated with ADEH+ phenotype. Objective To interrogate the role of rare variants in IFN-pathway genes for risk of ADEH+. Methods We performed targeted sequencing of interferon-pathway genes (IFNG, IFNGR1, IFNAR1 and IL12RB1) in 228 European American (EA) AD patients selected according to their EH status and severity measured by Eczema Area and Severity Index (EASI). Replication genotyping was performed in independent samples of 219 EA and 333 African Americans (AA). Functional investigation of ‘loss-of-function’ variants was conducted using site-directed mutagenesis. Results We identified 494 single nucleotide variants (SNVs) encompassing 105kb of sequence, including 145 common, 349 (70.6%) rare (minor allele frequency (MAF) <5%) and 86 (17.4%) novel variants, of which 2.8% were coding-synonymous, 93.3% were non-coding (64.6% intronic), and 3.8% were missense. We identified six rare IFNGR1 missense including three damaging variants (Val14Met (V14M), Val61Ile and Tyr397Cys (Y397C)) conferring a higher risk for ADEH+ (P=0.031). Variants V14M and Y397C were confirmed to be deleterious leading to partial IFNGR1 deficiency. Seven common IFNGR1 SNPs, along with common protective haplotypes (2 to 7-SNPs) conferred a reduced risk of ADEH+ (P=0.015-0.002, P=0.0015-0.0004, respectively), and both SNP and haplotype associations were replicated in an independent AA sample (P=0.004-0.0001 and P=0.001-0.0001, respectively). Conclusion Our results provide evidence that both genetic variants in the gene encoding IFNGR1 are implicated in susceptibility to the ADEH+ phenotype. CAPSULE SUMMARY We provided the first evidence that rare functional IFNGR1 mutations contribute to a defective systemic IFN-γ immune response that accounts for the propensity of AD patients to disseminated viral skin infections. PMID:26343451
PGen: large-scale genomic variations analysis workflow and browser in SoyKB.

PubMed

Liu, Yang; Khan, Saad M; Wang, Juexin; Rynge, Mats; Zhang, Yuanxun; Zeng, Shuai; Chen, Shiyuan; Maldonado Dos Santos, Joao V; Valliyodan, Babu; Calyam, Prasad P; Merchant, Nirav; Nguyen, Henry T; Xu, Dong; Joshi, Trupti

2016-10-06

With the advances in next-generation sequencing (NGS) technology and significant reductions in sequencing costs, it is now possible to sequence large collections of germplasm in crops for detecting genome-scale genetic variations and to apply the knowledge towards improvements in traits. To efficiently facilitate large-scale NGS resequencing data analysis of genomic variations, we have developed "PGen", an integrated and optimized workflow using the Extreme Science and Engineering Discovery Environment (XSEDE) high-performance computing (HPC) virtual system, iPlant cloud data storage resources and Pegasus workflow management system (Pegasus-WMS). The workflow allows users to identify single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), perform SNP annotations and conduct copy number variation analyses on multiple resequencing datasets in a user-friendly and seamless way. We have developed both a Linux version in GitHub ( https://github.com/pegasus-isi/PGen-GenomicVariations-Workflow ) and a web-based implementation of the PGen workflow integrated within the Soybean Knowledge Base (SoyKB), ( http://soykb.org/Pegasus/index.php ). Using PGen, we identified 10,218,140 single-nucleotide polymorphisms (SNPs) and 1,398,982 indels from analysis of 106 soybean lines sequenced at 15X coverage. 297,245 non-synonymous SNPs and 3330 copy number variation (CNV) regions were identified from this analysis. SNPs identified using PGen from additional soybean resequencing projects adding to 500+ soybean germplasm lines in total have been integrated. These SNPs are being utilized for trait improvement using genotype to phenotype prediction approaches developed in-house. In order to browse and access NGS data easily, we have also developed an NGS resequencing data browser ( http://soykb.org/NGS_Resequence/NGS_index.php ) within SoyKB to provide easy access to SNP and downstream analysis results for soybean researchers. PGen workflow has been optimized for the most efficient analysis of soybean data using thorough testing and validation. This research serves as an example of best practices for development of genomics data analysis workflows by integrating remote HPC resources and efficient data management with ease of use for biological users. PGen workflow can also be easily customized for analysis of data in other species.
Genome-scale investigation of phenotypically distinct but nearly clonal Trichoderma strains

PubMed Central

Weld, Richard J.; Cox, Murray P.; Bradshaw, Rosie E.; McLean, Kirstin L.; Stewart, Alison; Steyaert, Johanna M.

2016-01-01

Biological control agents (BCA) are beneficial organisms that are applied to protect plants from pests. Many fungi of the genus Trichoderma are successful BCAs but the underlying mechanisms are not yet fully understood. Trichoderma cf. atroviride strain LU132 is a remarkably effective BCA compared to T. cf. atroviride strain LU140 but these strains were found to be highly similar at the DNA sequence level. This unusual combination of phenotypic variability and high DNA sequence similarity between separately isolated strains prompted us to undertake a genome comparison study in order to identify DNA polymorphisms. We further investigated if the polymorphisms had functional effects on the phenotypes. The two strains were clearly identified as individuals, exhibiting different growth rates, conidiation and metabolism. Superior pathogen control demonstrated by LU132 depended on its faster growth, which is a prerequisite for successful distribution and competition. Genome sequencing identified only one non-synonymous single nucleotide polymorphism (SNP) between the strains. Based on this SNP, we successfully designed and validated an RFLP protocol that can be used to differentiate LU132 from LU140 and other Trichoderma strains. This SNP changed the amino acid sequence of SERF, encoded by the previously undescribed single copy gene “small EDRK-rich factor” (serf). A deletion of serf in the two strains did not lead to identical phenotypes, suggesting that, in addition to the single functional SNP between the nearly clonal Trichoderma cf. atroviride strains, other non-genomic factors contribute to their phenotypic variation. This finding is significant as it shows that genomics is an extremely useful but not exhaustive tool for the study of biocontrol complexity and for strain typing. PMID:27190719
Non-synonymous variations in cancer and their effects on the human proteome: workflow for NGS data biocuration and proteome-wide analysis of TCGA data.

PubMed

Cole, Charles; Krampis, Konstantinos; Karagiannis, Konstantinos; Almeida, Jonas S; Faison, William J; Motwani, Mona; Wan, Quan; Golikov, Anton; Pan, Yang; Simonyan, Vahan; Mazumder, Raja

2014-01-27

Next-generation sequencing (NGS) technologies have resulted in petabytes of scattered data, decentralized in archives, databases and sometimes in isolated hard-disks which are inaccessible for browsing and analysis. It is expected that curated secondary databases will help organize some of this Big Data thereby allowing users better navigate, search and compute on it. To address the above challenge, we have implemented a NGS biocuration workflow and are analyzing short read sequences and associated metadata from cancer patients to better understand the human variome. Curation of variation and other related information from control (normal tissue) and case (tumor) samples will provide comprehensive background information that can be used in genomic medicine research and application studies. Our approach includes a CloudBioLinux Virtual Machine which is used upstream of an integrated High-performance Integrated Virtual Environment (HIVE) that encapsulates Curated Short Read archive (CSR) and a proteome-wide variation effect analysis tool (SNVDis). As a proof-of-concept, we have curated and analyzed control and case breast cancer datasets from the NCI cancer genomics program - The Cancer Genome Atlas (TCGA). Our efforts include reviewing and recording in CSR available clinical information on patients, mapping of the reads to the reference followed by identification of non-synonymous Single Nucleotide Variations (nsSNVs) and integrating the data with tools that allow analysis of effect nsSNVs on the human proteome. Furthermore, we have also developed a novel phylogenetic analysis algorithm that uses SNV positions and can be used to classify the patient population. The workflow described here lays the foundation for analysis of short read sequence data to identify rare and novel SNVs that are not present in dbSNP and therefore provides a more comprehensive understanding of the human variome. Variation results for single genes as well as the entire study are available from the CSR website (http://hive.biochemistry.gwu.edu/dna.cgi?cmd=csr). Availability of thousands of sequenced samples from patients provides a rich repository of sequence information that can be utilized to identify individual level SNVs and their effect on the human proteome beyond what the dbSNP database provides.
The impact of single nucleotide polymorphism in monomeric alpha-amylase inhibitor genes from wild emmer wheat, primarily from Israel and Golan

PubMed Central

2010-01-01

Background Various enzyme inhibitors act on key insect gut digestive hydrolases, including alpha-amylases and proteinases. Alpha-amylase inhibitors have been widely investigated for their possible use in strengthening a plant's defense against insects that are highly dependent on starch as an energy source. We attempted to unravel the diversity of monomeric alpha-amylase inhibitor genes of Israeli and Golan Heights' wild emmer wheat with different ecological factors (e.g., geography, water, and temperature). Population methods that analyze the nature and frequency of allele diversity within a species and the codon analysis method (comparing patterns of synonymous and non-synonymous changes in protein coding sequences) were used to detect natural selection. Results Three hundred and forty-eight sequences encoding monomeric alpha-amylase inhibitors (WMAI) were obtained from 14 populations of wild emmer wheat. The frequency of SNPs in WMAI genes was 1 out of 16.3 bases, where 28 SNPs were detected in the coding sequence. The results of purifying and the positive selection hypothesis (p < 0.05) showed that the sequences of WMAI were contributed by both natural selection and co-evolution, which ensured conservation of protein function and inhibition against diverse insect amylases. The majority of amino acid substitutions occurred at the C-terminal (positive selection domain), which ensured the stability of WMAI. SNPs in this gene could be classified into several categories associated with water, temperature, and geographic factors, respectively. Conclusions Great diversity at the WMAI locus, both between and within populations, was detected in the populations of wild emmer wheat. It was revealed that WMAI were naturally selected for across populations by a ratio of dN/dS as expected. Ecological factors, singly or in combination, explained a significant proportion of the variations in the SNPs. A sharp genetic divergence over very short geographic distances compared to a small genetic divergence between large geographic distances also suggested that the SNPs were subjected to natural selection, and ecological factors had an important evolutionary role in polymorphisms at this locus. According to population and codon analysis, these results suggested that monomeric alpha-amylase inhibitors are adaptively selected under different environmental conditions. PMID:20534122
Non-synonymous variations in cancer and their effects on the human proteome: workflow for NGS data biocuration and proteome-wide analysis of TCGA data

PubMed Central

2014-01-01

Background Next-generation sequencing (NGS) technologies have resulted in petabytes of scattered data, decentralized in archives, databases and sometimes in isolated hard-disks which are inaccessible for browsing and analysis. It is expected that curated secondary databases will help organize some of this Big Data thereby allowing users better navigate, search and compute on it. Results To address the above challenge, we have implemented a NGS biocuration workflow and are analyzing short read sequences and associated metadata from cancer patients to better understand the human variome. Curation of variation and other related information from control (normal tissue) and case (tumor) samples will provide comprehensive background information that can be used in genomic medicine research and application studies. Our approach includes a CloudBioLinux Virtual Machine which is used upstream of an integrated High-performance Integrated Virtual Environment (HIVE) that encapsulates Curated Short Read archive (CSR) and a proteome-wide variation effect analysis tool (SNVDis). As a proof-of-concept, we have curated and analyzed control and case breast cancer datasets from the NCI cancer genomics program - The Cancer Genome Atlas (TCGA). Our efforts include reviewing and recording in CSR available clinical information on patients, mapping of the reads to the reference followed by identification of non-synonymous Single Nucleotide Variations (nsSNVs) and integrating the data with tools that allow analysis of effect nsSNVs on the human proteome. Furthermore, we have also developed a novel phylogenetic analysis algorithm that uses SNV positions and can be used to classify the patient population. The workflow described here lays the foundation for analysis of short read sequence data to identify rare and novel SNVs that are not present in dbSNP and therefore provides a more comprehensive understanding of the human variome. Variation results for single genes as well as the entire study are available from the CSR website (http://hive.biochemistry.gwu.edu/dna.cgi?cmd=csr). Conclusions Availability of thousands of sequenced samples from patients provides a rich repository of sequence information that can be utilized to identify individual level SNVs and their effect on the human proteome beyond what the dbSNP database provides. PMID:24467687
Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions.

PubMed

Nishizawa, M; Nishizawa, K

2000-10-01

The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Amino acid and nucleotide recurrence in aligned sequences: synonymous substitution patterns in association with global and local base compositions

PubMed Central

Nishizawa, Manami; Nishizawa, Kazuhisa

2000-01-01

The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273
Phylogeny of the Genus Flavivirus

PubMed Central

Kuno, Goro; Chang, Gwong-Jen J.; Tsuchiya, K. Richard; Karabatsos, Nick; Cropp, C. Bruce

1998-01-01

We undertook a comprehensive phylogenetic study to establish the genetic relationship among the viruses of the genus Flavivirus and to compare the classification based on molecular phylogeny with the existing serologic method. By using a combination of quantitative definitions (bootstrap support level and the pairwise nucleotide sequence identity), the viruses could be classified into clusters, clades, and species. Our phylogenetic study revealed for the first time that from the putative ancestor two branches, non-vector and vector-borne virus clusters, evolved and from the latter cluster emerged tick-borne and mosquito-borne virus clusters. Provided that the theory of arthropod association being an acquired trait was correct, pairwise nucleotide sequence identity among these three clusters provided supporting data for a possibility that the non-vector cluster evolved first, followed by the separation of tick-borne and mosquito-borne virus clusters in that order. Clades established in our study correlated significantly with existing antigenic complexes. We also resolved many of the past taxonomic problems by establishing phylogenetic relationships of the antigenically unclassified viruses with the well-established viruses and by identifying synonymous viruses. PMID:9420202
Phylogeny of the genus Flavivirus.

PubMed

Kuno, G; Chang, G J; Tsuchiya, K R; Karabatsos, N; Cropp, C B

1998-01-01

We undertook a comprehensive phylogenetic study to establish the genetic relationship among the viruses of the genus Flavivirus and to compare the classification based on molecular phylogeny with the existing serologic method. By using a combination of quantitative definitions (bootstrap support level and the pairwise nucleotide sequence identity), the viruses could be classified into clusters, clades, and species. Our phylogenetic study revealed for the first time that from the putative ancestor two branches, non-vector and vector-borne virus clusters, evolved and from the latter cluster emerged tick-borne and mosquito-borne virus clusters. Provided that the theory of arthropod association being an acquired trait was correct, pairwise nucleotide sequence identity among these three clusters provided supporting data for a possibility that the non-vector cluster evolved first, followed by the separation of tick-borne and mosquito-borne virus clusters in that order. Clades established in our study correlated significantly with existing antigenic complexes. We also resolved many of the past taxonomic problems by establishing phylogenetic relationships of the antigenically unclassified viruses with the well-established viruses and by identifying synonymous viruses.
Analysis of evolutionary rate of HIV-1 subtype B using blood donor samples in Japan.

PubMed

Shinohara, Naoya; Matsumoto, Chieko; Matsubayashi, Keiji; Nagai, Tadashi; Satake, Masahiro

2018-06-01

There are few reports on HIV-1 intra-host evolutionary rate in asymptomatic treatment-naïve patients. Here, the HIV-1 intra-host evolutionary rate was estimated based on HIV-1 RNA sequences from plasma samples of blood donors in Japan. Blood donors were assumed to have received no treatment for and have no symptoms of HIV-1 infection because they were healthy, and declared no risky behaviors of HIV-1 infection on a self-reported questionnaire or interview followed by donation. HIV-1 RNA was obtained from 85 plasma samples from 36 blood donors who donated blood multiple times and were HIV-1-positive. The C2V3C3 region which encodes for a part of the envelope protein, and the V3 loop in the C2V3C3 region were analyzed by RT-PCR and direct sequencing, and the sequences were compared. The nucleotide substitution rate was calculated by linear regression. All HIV-1 samples analyzed were classified as subtype B. The mean nucleotide substitution rate in C2V3C3 was calculated to be 6.2 × 10 -3 -1.8 × 10 -2 /site/year (V3: 4.5 × 10 -3 -2.3 × 10 -2 /site/year). The mean non-synonymous substitution rate in C2V3C3 was calculated to be 5.2 × 10 -3 -1.7 × 10 -2 /site/year (V3: 4.5 × 10 -3 -2.1 × 10 -2 /site/year). The mean synonymous substitution rate in C2V3C3 was calculated to be 1.1 × 10 -4 -2.3 × 10 -3 /site/year (V3: 2.9 × 10 -3 /site/year). Among HIV-1 subtype B RNA-positive blood donors in Japan, the nucleotide substitution rate in C2V3C3 was estimated to be higher than that of reported cases using HIV-1 samples mainly obtained from AIDS patients. Compared to AIDS patients, immune responses against HIV-1 are probably more effective in HIV-1 RNA-positive blood donors. Consequently, immune pressure presumably promotes mutation of the virus genome.
Genome-wide detection and characterization of positive selection in human populations.

PubMed

Sabeti, Pardis C; Varilly, Patrick; Fry, Ben; Lohmueller, Jason; Hostetter, Elizabeth; Cotsapas, Chris; Xie, Xiaohui; Byrne, Elizabeth H; McCarroll, Steven A; Gaudet, Rachelle; Schaffner, Stephen F; Lander, Eric S; Frazer, Kelly A; Ballinger, Dennis G; Cox, David R; Hinds, David A; Stuve, Laura L; Gibbs, Richard A; Belmont, John W; Boudreau, Andrew; Hardenbol, Paul; Leal, Suzanne M; Pasternak, Shiran; Wheeler, David A; Willis, Thomas D; Yu, Fuli; Yang, Huanming; Zeng, Changqing; Gao, Yang; Hu, Haoran; Hu, Weitao; Li, Chaohua; Lin, Wei; Liu, Siqi; Pan, Hao; Tang, Xiaoli; Wang, Jian; Wang, Wei; Yu, Jun; Zhang, Bo; Zhang, Qingrun; Zhao, Hongbin; Zhao, Hui; Zhou, Jun; Gabriel, Stacey B; Barry, Rachel; Blumenstiel, Brendan; Camargo, Amy; Defelice, Matthew; Faggart, Maura; Goyette, Mary; Gupta, Supriya; Moore, Jamie; Nguyen, Huy; Onofrio, Robert C; Parkin, Melissa; Roy, Jessica; Stahl, Erich; Winchester, Ellen; Ziaugra, Liuda; Altshuler, David; Shen, Yan; Yao, Zhijian; Huang, Wei; Chu, Xun; He, Yungang; Jin, Li; Liu, Yangfan; Shen, Yayun; Sun, Weiwei; Wang, Haifeng; Wang, Yi; Wang, Ying; Xiong, Xiaoyan; Xu, Liang; Waye, Mary M Y; Tsui, Stephen K W; Xue, Hong; Wong, J Tze-Fei; Galver, Luana M; Fan, Jian-Bing; Gunderson, Kevin; Murray, Sarah S; Oliphant, Arnold R; Chee, Mark S; Montpetit, Alexandre; Chagnon, Fanny; Ferretti, Vincent; Leboeuf, Martin; Olivier, Jean-François; Phillips, Michael S; Roumy, Stéphanie; Sallée, Clémentine; Verner, Andrei; Hudson, Thomas J; Kwok, Pui-Yan; Cai, Dongmei; Koboldt, Daniel C; Miller, Raymond D; Pawlikowska, Ludmila; Taillon-Miller, Patricia; Xiao, Ming; Tsui, Lap-Chee; Mak, William; Song, You Qiang; Tam, Paul K H; Nakamura, Yusuke; Kawaguchi, Takahisa; Kitamoto, Takuya; Morizono, Takashi; Nagashima, Atsushi; Ohnishi, Yozo; Sekine, Akihiro; Tanaka, Toshihiro; Tsunoda, Tatsuhiko; Deloukas, Panos; Bird, Christine P; Delgado, Marcos; Dermitzakis, Emmanouil T; Gwilliam, Rhian; Hunt, Sarah; Morrison, Jonathan; Powell, Don; Stranger, Barbara E; Whittaker, Pamela; Bentley, David R; Daly, Mark J; de Bakker, Paul I W; Barrett, Jeff; Chretien, Yves R; Maller, Julian; McCarroll, Steve; Patterson, Nick; Pe'er, Itsik; Price, Alkes; Purcell, Shaun; Richter, Daniel J; Sabeti, Pardis; Saxena, Richa; Schaffner, Stephen F; Sham, Pak C; Varilly, Patrick; Altshuler, David; Stein, Lincoln D; Krishnan, Lalitha; Smith, Albert Vernon; Tello-Ruiz, Marcela K; Thorisson, Gudmundur A; Chakravarti, Aravinda; Chen, Peter E; Cutler, David J; Kashuk, Carl S; Lin, Shin; Abecasis, Gonçalo R; Guan, Weihua; Li, Yun; Munro, Heather M; Qin, Zhaohui Steve; Thomas, Daryl J; McVean, Gilean; Auton, Adam; Bottolo, Leonardo; Cardin, Niall; Eyheramendy, Susana; Freeman, Colin; Marchini, Jonathan; Myers, Simon; Spencer, Chris; Stephens, Matthew; Donnelly, Peter; Cardon, Lon R; Clarke, Geraldine; Evans, David M; Morris, Andrew P; Weir, Bruce S; Tsunoda, Tatsuhiko; Johnson, Todd A; Mullikin, James C; Sherry, Stephen T; Feolo, Michael; Skol, Andrew; Zhang, Houcan; Zeng, Changqing; Zhao, Hui; Matsuda, Ichiro; Fukushima, Yoshimitsu; Macer, Darryl R; Suda, Eiko; Rotimi, Charles N; Adebamowo, Clement A; Ajayi, Ike; Aniagwu, Toyin; Marshall, Patricia A; Nkwodimmah, Chibuzor; Royal, Charmaine D M; Leppert, Mark F; Dixon, Missy; Peiffer, Andy; Qiu, Renzong; Kent, Alastair; Kato, Kazuto; Niikawa, Norio; Adewole, Isaac F; Knoppers, Bartha M; Foster, Morris W; Clayton, Ellen Wright; Watkin, Jessica; Gibbs, Richard A; Belmont, John W; Muzny, Donna; Nazareth, Lynne; Sodergren, Erica; Weinstock, George M; Wheeler, David A; Yakub, Imtaz; Gabriel, Stacey B; Onofrio, Robert C; Richter, Daniel J; Ziaugra, Liuda; Birren, Bruce W; Daly, Mark J; Altshuler, David; Wilson, Richard K; Fulton, Lucinda L; Rogers, Jane; Burton, John; Carter, Nigel P; Clee, Christopher M; Griffiths, Mark; Jones, Matthew C; McLay, Kirsten; Plumb, Robert W; Ross, Mark T; Sims, Sarah K; Willey, David L; Chen, Zhu; Han, Hua; Kang, Le; Godbout, Martin; Wallenburg, John C; L'Archevêque, Paul; Bellemare, Guy; Saeki, Koji; Wang, Hongguang; An, Daochang; Fu, Hongbo; Li, Qing; Wang, Zhen; Wang, Renwu; Holden, Arthur L; Brooks, Lisa D; McEwen, Jean E; Guyer, Mark S; Wang, Vivian Ota; Peterson, Jane L; Shi, Michael; Spiegel, Jack; Sung, Lawrence M; Zacharia, Lynn F; Collins, Francis S; Kennedy, Karen; Jamieson, Ruth; Stewart, John

2007-10-18

With the advent of dense maps of human genetic variation, it is now possible to detect positive natural selection across the human genome. Here we report an analysis of over 3 million polymorphisms from the International HapMap Project Phase 2 (HapMap2). We used 'long-range haplotype' methods, which were developed to identify alleles segregating in a population that have undergone recent selection, and we also developed new methods that are based on cross-population comparisons to discover alleles that have swept to near-fixation within a population. The analysis reveals more than 300 strong candidate regions. Focusing on the strongest 22 regions, we develop a heuristic for scrutinizing these regions to identify candidate targets of selection. In a complementary analysis, we identify 26 non-synonymous, coding, single nucleotide polymorphisms showing regional evidence of positive selection. Examination of these candidates highlights three cases in which two genes in a common biological process have apparently undergone positive selection in the same population:LARGE and DMD, both related to infection by the Lassa virus, in West Africa;SLC24A5 and SLC45A2, both involved in skin pigmentation, in Europe; and EDAR and EDA2R, both involved in development of hair follicles, in Asia.
Prioritisation of associations between protein domains and complex diseases using domain-domain interaction networks.

PubMed

Wang, W; Zhang, W; Jiang, R; Luan, Y

2010-05-01

It is of vital importance to find genetic variants that underlie human complex diseases and locate genes that are responsible for these diseases. Since proteins are typically composed of several structural domains, it is reasonable to assume that harmful genetic variants may alter structures of protein domains, affect functions of proteins and eventually cause disorders. With this understanding, the authors explore the possibility of recovering associations between protein domains and complex diseases. The authors define associations between protein domains and disease families on the basis of associations between non-synonymous single nucleotide polymorphisms (nsSNPs) and complex diseases, similarities between diseases, and relations between proteins and domains. Based on a domain-domain interaction network, the authors propose a 'guilt-by-proximity' principle to rank candidate domains according to their average distance to a set of seed domains in the domain-domain interaction network. The authors validate the method through large-scale cross-validation experiments on simulated linkage intervals, random controls and the whole genome. Results show that areas under receiver operating characteristic curves (AUC scores) can be as high as 77.90%, and the mean rank ratios can be as low as 21.82%. The authors further offer a freely accessible web interface for a genome-wide landscape of associations between domains and disease families.
Genetic polymorphisms of ATP-binding cassette (ABC) proteins, overall survival and drug toxicity in patients with Acute Myeloid Leukemia

PubMed Central

Hampras, Shalaka S; Sucheston, Lara; Weiss, Joli; Baer, Maria R; Zirpoli, Gary; Singh, Prashant K; Wetzler, Meir; Chennamaneni, Raj; Blanco, Javier G; Ford, LaurieAnn; Moysich, Kirsten B

2010-01-01

The overall survival of patients with acute myeloid leukemia (AML) remains poor due to both intrinsic and acquired chemotherapy resistance. Over expression of ATP binding cassette (ABC) proteins in AML cells has been suggested as a putative mechanism of drug resistance. Genetic variation among individuals affecting the expression or function of these proteins may contribute to inter-individual variation in treatment outcomes. DNA from pre-treatment bone marrow or blood samples from 261 patients age 20-85 years, who received cytarabine and anthracycline-based therapy at Roswell Park Cancer Institute between 1994 and 2006, was genotyped for eight non-synonymous single nucleotide polymorphisms in the ABCB1, ABCC1 and ABCG2 drug transporter genes. Heterozygous (AG) or homozygous (AA) variant genotypes for rs2231137 (G34A) in the ABCG2 (BRCP) gene, compared to the wild type (GG) genotype were associated with both significantly improved survival (HR=0.44, 95%CI=0.25-0.79), and increased odds for toxicity (OR=8.41, 95%CI= 1.10-64.28). Thus genetic polymorphisms in the ABCG2 (BRCP) gene may contribute to differential survival outcomes and toxicities in AML patients via a mechanism of decreased drug efflux in both, AML cells and normal progenitors. PMID:21311724
Comparative analysis of myostatin gene and promoter sequences of Qinchuan and Red Angus cattle.

PubMed

He, Y L; Wu, Y H; Quan, F S; Liu, Y G; Zhang, Y

2013-09-04

To better understand the function of the myostatin gene and its promoter region in bovine, we amplified and sequenced the myostatin gene and promoter from the blood of Qinchuan and Red Angus cattle by using polymerase chain reaction. The sequences of Qinchuan and Red Angus cattle were compared with those of other cattle breeds available in GenBank. Exon splice sites were confirmed by mRNA sequencing. Compared to the published sequence (GenBank accession No. AF320998), 69 single nucleotide polymorphisms (SNPs) were identified in the Qinchuan myostatin gene, only one of which was an insertion mutation in Qinchuan cattle. There was a 16-bp insertion in the first 705-bp intron in 3 Qinchuan cattle. A total of 7 SNPs were identified in exon 3, in which the mutation occurred in the third base of the codon and was synonymous. On comparing the Qinchuan myostatin gene sequence to that of Red Angus cattle, a total of 50 SNPs were identified in the first and third exons. In addition, there were 18 SNPs identified in the Qinchuan cattle promoter region compared with those of other cattle compared to the Red Angus cattle myostatin promoter region. breeds (GenBank accession No. AF348479), but only 14 SNPs when compared to the Red Angus cattle myostatin promoter region.
Two novel polymorphisms of bovine SIRT2 gene are associated with higher body weight in Nanyang cattle.

PubMed

Sun, Xiaomei; Li, Mingxun; Hao, Dan; Hua, Liushuai; Lan, Xianyong; Lei, Chuzhao; Hu, Shenrong; Qi, Xinglei; Chen, Hong

2015-03-01

Identification of polymorphisms associated with economic traits is important for successful marker-assisted selection in cattle breeding. The family of mammalian sirtuin regulates many biological functions, such as life span extension and energy metabolism. SIRT2, a most abundant sirtuin in adipocytes, acts as a crucial regulator of adipogenic differentiation and plays a key role in controlling adipose tissue function and mass. Here we investigated single nucleotide polymorphisms (SNPs) of bovine SIRT2 in 1226 cattle from five breeds and further evaluated the effects of identified SNPs on economically important traits of Nanyang cattle. Our results revealed four novel SNPs in bovine SIRT2, one was located in intronic region and the other three were synonymous mutations. Linkage disequilibrium and haplotype analyses based on the identified SNPs showed obvious difference between crossbred breed and the other four beef breeds. Association analyses demonstrated that SNPs g.17333C > T and g.17578A > G have a significantly effect on 18-months-old body weight of Nanyang population. Animals with combined genotype TTGG at the above two loci exhibited especially higher body weight. Our data for the first time demonstrated that polymorphisms in bovine SIRT2 are associated with economic traits of Nanyang cattle, which will be helpful for future cattle selection practices.

Variations in endothelin receptor B subtype 2 (EDNRB2) coding sequences and mRNA expression levels in 4 Muscovy duck plumage colour phenotypes.

PubMed

Wu, N; Qin, H; Wang, M; Bian, Y; Dong, B; Sun, G; Zhao, W; Chang, G; Xu, Q; Chen, G

2017-04-01

1. Endothelin receptor B subtype 2 (EDNRB2) is a paralog of EDNRB, which encodes a 7-transmembrane G-protein coupled receptor. Previous studies reported that EDNRB was essential for melanoblast migration in mammals and ducks. 2. Muscovy ducks have different plumage colour phenotypes. Variations in EDNRB2 coding sequences (CDSs) and mRNA expression levels were investigated in 4 different Muscovy duck plumage colour phenotypes, including black, black mutant, silver and white head. 3. The EDNRB2 gene from Muscovy duck was cloned; it had a length of 6435 bp and encoded 437 amino acids. The coding region was screened and potential single nucleotide polymorphisms were identified. Eight mutations were obtained, including one missense variant (c.64C > T) and 7 synonymous substitutions. The substitutions were associated with plumage colour phenotypes. 4. The EDNRB2 mRNA expression levels were compared between feather pulp from black birds and black mutant birds. The results indicated that EDNRB2 transcripts in feather pulp were significantly higher in black feathers than in white feathers. 5. The results determined the variation of EDNRB2 CDS and mRNA expression in Muscovy ducks of various plumage colours.
Phylodynamic Analysis of Clinical and Environmental Vibrio cholerae Isolates from Haiti Reveals Diversification Driven by Positive Selection

PubMed Central

Azarian, Taj; Ali, Afsar; Johnson, Judith A.; Mohr, David; Prosperi, Mattia; Veras, Nazle M.; Jubair, Mohammed; Strickland, Samantha L.; Rashid, Mohammad H.; Alam, Meer T.; Weppelmann, Thomas A.; Katz, Lee S.; Tarr, Cheryl L.; Colwell, Rita R.

2014-01-01

ABSTRACT Phylodynamic analysis of genome-wide single-nucleotide polymorphism (SNP) data is a powerful tool to investigate underlying evolutionary processes of bacterial epidemics. The method was applied to investigate a collection of 65 clinical and environmental isolates of Vibrio cholerae from Haiti collected between 2010 and 2012. Characterization of isolates recovered from environmental samples identified a total of four toxigenic V. cholerae O1 isolates, four non-O1/O139 isolates, and a novel nontoxigenic V. cholerae O1 isolate with the classical tcpA gene. Phylogenies of strains were inferred from genome-wide SNPs using coalescent-based demographic models within a Bayesian framework. A close phylogenetic relationship between clinical and environmental toxigenic V. cholerae O1 strains was observed. As cholera spread throughout Haiti between October 2010 and August 2012, the population size initially increased and then fluctuated over time. Selection analysis along internal branches of the phylogeny showed a steady accumulation of synonymous substitutions and a progressive increase of nonsynonymous substitutions over time, suggesting diversification likely was driven by positive selection. Short-term accumulation of nonsynonymous substitutions driven by selection may have significant implications for virulence, transmission dynamics, and even vaccine efficacy. PMID:25538191
De novo characterisation of the greenlip abalone transcriptome (Haliotis laevigata) with a focus on the heat shock protein 70 (HSP70) family.

PubMed

Shiel, Brett P; Hall, Nathan E; Cooke, Ira R; Robinson, Nicholas A; Strugnell, Jan M

2015-02-01

Abalone (Haliotis) are economically important molluscs for fisheries and aquaculture industries worldwide. Despite this, genomic resources for abalone and molluscs are still limited. Here we present a description and functional annotation of the greenlip abalone (Haliotis laevigata) transcriptome. We present a focused analysis on the heat shock protein 70 (HSP70) family of genes with putative functions affecting temperature stress and immunity. A total of ~38 million paired end Illumina reads were obtained, resulting in a Trinity assembly of 222,172 contigs with minimum length of 200 base pairs and maximum length of 33 kilobases. The 20,702 contigs were annotated with gene descriptions by BLAST. We created a program to maximise the number of functionally annotated genes, and over 10,000 contigs were assigned Gene ontologies (GO terms). By using CateGOrizer, immunity related GO terms for stressors such as heat, hypoxia, oxidative stress and wounding received the highest counts. Twenty-six contigs with homology to the HSP70 family of genes were identified. Ninety-one putative single-nucleotide polymorphisms were observed in the abalone HSP70 contigs. Eleven of these were considered non-synonymous. The annotated transcriptome described in this study will be a useful basis for future work investigating the genetic response of abalone to stress.
Polymorphisms in the K13-propeller gene in artemisinin-susceptible Plasmodium falciparum parasites from Bougoula-Hameau and Bandiagara, Mali.

PubMed

Ouattara, Amed; Kone, Aminatou; Adams, Matthew; Fofana, Bakary; Maiga, Amelia Walling; Hampton, Shay; Coulibaly, Drissa; Thera, Mahamadou A; Diallo, Nouhoum; Dara, Antoine; Sagara, Issaka; Gil, Jose Pedro; Bjorkman, Anders; Takala-Harrison, Shannon; Doumbo, Ogobara K; Plowe, Christopher V; Djimde, Abdoulaye A

2015-06-01

Artemisinin-resistant Plasmodium falciparum malaria has been documented in southeast Asia and may already be spreading in that region. Molecular markers are important tools for monitoring the spread of antimalarial drug resistance. Recently, single-nucleotide polymorphisms (SNPs) in the PF3D7_1343700 kelch propeller (K13-propeller) domain were shown to be associated with artemisinin resistance in vivo and in vitro. The prevalence and role of K13-propeller mutations are poorly known in sub-Saharan Africa. K13-propeller mutations were genotyped by direct sequencing of nested polymerase chain reaction (PCR) amplicons from dried blood spots of pre-treatment falciparum malaria infections collected before and after the use of artemisinin-based combination therapy (ACT) as first-line therapy in Mali. Although K13-propeller mutations previously associated with delayed parasite clearance in Cambodia were not identified, 26 K13-propeller mutations were identified in both recent samples and pre-ACT infections. Parasite clearance time was comparable between infections with non-synonymous K13-propeller mutations and infections with the reference allele. These findings suggest that K13-propeller mutations are present in artemisinin-sensitive parasites and that they preceded the wide use of ACTs in Mali. © The American Society of Tropical Medicine and Hygiene.
Molecular genetic analysis of consanguineous Pakistani families with autosomal recessive hypohidrotic ectodermal dysplasia.

PubMed

Bibi, Nosheen; Ahmad, Saeed; Ahmad, Wasim; Naeem, Muhammad

2011-02-01

Hypohidrotic ectodermal dysplasia is an inherited disorder characterized by defective development of teeth, hairs and sweat glands. X-linked hypohidrotic ectodermal dysplasia is caused by mutations in the EDA gene, and autosomal forms of hypohidrotic ectodermal dysplasia are caused by mutations in either the EDAR or the EDARADD genes. To study the molecular genetic cause of autosomal recessive hypohidrotic ectodermal dysplasia in three consanguineous Pakistani families (A, B and C), genotyping of 13 individuals was carried out by using polymorphic microsatellite markers that are closely linked to the EDAR gene on chromosome 2q11-q13 and the EDARADD gene on chromosome 1q42.2-q43. The results revealed linkage in the three families to the EDAR locus. Sequence analysis of the coding exons and splice junctions of the EDAR gene revealed two mutations: a novel non-sense mutation (p.E124X) in the probands of families A and B and a missense mutation (p.G382S) in the proband of family C. In addition, two synonymous single-nucleotide polymorphisms were also identified. The finding of mutations in Pakistani families extends the body of evidence that supports the importance of EDAR for the development of hypohidrotic ectodermal dysplasia. © 2010 The Authors. Australasian Journal of Dermatology © 2010 The Australasian College of Dermatologists.
Significance of genetic variants in DLC1 and their association with hepatocellular carcinoma

PubMed Central

XIE, CHENG-RONG; SUN, HONG-GUANG; SUN, YU; ZHAO, WEN-XIU; ZHANG, SHENG; WANG, XIAO-MIN; YIN, ZHEN-YU

2015-01-01

DLC1 has been shown to be downregulated or absent in hepatocellular carcinoma (HCC) and is associated with tumorigenesis and development. However, only a small number of studies have focused on genetic variations of DLC1. The present study performed exon sequencing for the DLC1 gene in HCC tissue samples from 105 patients to identify functional genetic variation of DLC1 and its association with HCC susceptibility, clinicopathological features and prognosis. A novel missense mutation and four non-synonymous single nucleotide polymorphisms (SNPs; rs3816748, rs11203495, rs3816747 and rs532841) were identified. A significant correlation of rs3816747 polymorphisms with HCC susceptibility was identified. Compared to individuals with the GG genotype of rs3816747, those with the GA (odds ratio (OR)=0.486; P=0.037) or GA+AA genotype (OR=0.51; P=0.039) were associated with a significantly decreased HCC risk. Furthermore, patients with the GC+CC genotype of rs3816748, the TC+CC genotype of rs11203495 or the GA+AA genotype of rs3816747 had small-sized tumors compared with those carrying the wild-type genotype. No significant association of DLC1 SNPs with the patients' prognosis was found. These results indicated that genetic variations in the DLC1 gene may confer a risk for HCC. PMID:26095787
Missense Mutation in Fam83H Gene in Iranian Patients with Amelogenesis Imperfecta.

PubMed

Pourhashemi, S Jalal; Ghandehari Motlagh, Mehdi; Meighani, Ghasem; Ebrahimi Takaloo, Azadeh; Mansouri, Mahsa; Mohandes, Fatemeh; Mirzaii, Maryam; Khoshzaban, Ahad; Moshtaghi, Faranak; Abedkhojasteh, Hoda; Heidari, Mansour

2014-12-01

Amelogenesis Imperfecta (AI) is a disorder of tooth development where there is an abnormal formation of enamel or the external layer of teeth. The aim of this study was to screen mutations in the four most important candidate genes, ENAM, KLK4, MMP20 and FAM83H responsible for amelogenesis imperfect. Geneomic DNA was isolated from five Iranian families with 22 members affected with enamel malformations. The PCR amplifications were typically carried out for amplification the coding regions for AI patients and unaffected family members. The PCR products were subjected to direct sequencing. The pedigree analysis was performed using Cyrillic software. One family had four affected members with autosomal dominant hypocalcified amelogenesis imperfecta (ADHPCAI); pedigree analysis revealed four consanguineous families with 18 patients with autosomal recessive hypoplastic amelogenesis imperfecta (ARHPAI). One non-synonymous single-nucleotide substitution, c.1150T>A, p. Ser 342Thr was identified in the FAM83H, which resulted in ADHCAI. Furthermore, different polymorphisms or unclassified variants were detected in MMP20, ENAM and KLK4. Our results are consistent with other studies and provide further evidence for pathogenic mutations of FAM83H gene. These findings suggest different loci and genes could be implicated in the pathogenesis of AI.
Metabolic Interactions of Purine Derivatives with Human ABC Transporter ABCG2: Genetic Testing to Assess Gout Risk.

PubMed

Ishikawa, Toshihisa; Aw, Wanping; Kaneko, Kiyoko

2013-11-04

In mammals, excess purine nucleosides are removed from the body by breakdown in the liver and excretion from the kidneys. Uric acid is the end product of purine metabolism in humans. Two-thirds of uric acid in the human body is normally excreted through the kidney, whereas one-third undergoes uricolysis (decomposition of uric acid) in the gut. Elevated serum uric acid levels result in gout and could be a risk factor for cardiovascular disease and diabetes. Recent studies have shown that human ATP-binding cassette transporter ABCG2 plays a role of renal excretion of uric acid. Two non-synonymous single nucleotide polymorphisms (SNPs), i.e., 421C>A (major) and 376C>T (minor), in the ABCG2 gene result in impaired transport activity, owing to ubiquitination-mediated proteosomal degradation and truncation of ABCG2, respectively. These genetic polymorphisms are associated with hyperuricemia and gout. Allele frequencies of those SNPs are significantly higher in Asian populations than they are in African and Caucasian populations. A rapid and isothermal genotyping method has been developed to detect the SNP 421C>A, where one drop of peripheral blood is sufficient for the detection. Development of simple genotyping methods would serve to improve prevention and early therapeutic intervention for high-risk individuals in personalized healthcare.
Phylogenetic and population-based approaches to mitogenome variation do not support association with male infertility.

PubMed

Gómez-Carballa, Alberto; Pardo-Seco, Jacobo; Martinón-Torres, Federico; Salas, Antonio

2017-03-01

Infertility has a complex multifactorial etiology and a high prevalence worldwide. Several studies have pointed to variation in the mitochondrial DNA (mtDNA) molecule as a factor responsible for the different disease phenotypes related to infertility. We analyzed 53 mitogenomes of infertile males from Galicia (northwest Spain), and these haplotypes were meta-analyzed phylogenetically with 43 previously reported from Portugal. Taking advantage of the large amount of information available, we additionally carried out association tests between patient mtDNA single-nucleotide polymorphisms (mtSNPs) and haplogroups against Iberian matched controls retrieved from The 1000 Genomes Project and the literature. Phylogenetic and association analyses did not reveal evidence of association between mtSNPs/haplogroups and infertility. Ratios and patterns in patients of nonsynonymous/synonymous changes, and variation at homoplasmic, heteroplasmic and private variants, fall within expected values for healthy individuals. Moreover, the haplogroup background of patients was variable and fits well with patterns typically observed in healthy western Europeans. We did not find evidence of association of mtSNPs or haplogroups pointing to a role for mtDNA in male infertility. A thorough review of the literature on mtDNA variation and infertility revealed contradictory findings and methodological and theoretical problems that overall undermine previous positive findings.
Genetic diversity of the Mycobacterium tuberculosis Beijing family based on multiple genotyping profiles.

PubMed

Liu, Y; Wang, S; Lu, H; Chen, W; Wang, W

2016-06-01

Among the most prevalent Mycobacterium tuberculosis (Mtb) strains worldwide is the Beijing genotype, which has caused large outbreaks of tuberculosis (TB). Characteristics facilitating the dissemination of Beijing family strains remain unknown, but they are presumed to have been acquired through evolution of the lineage. To explore the genetic diversity of the Beijing family Mtb and explore the discriminatory ability of mycobacterial interspersed repetitive units-variable number of tandem repeats (MIRU-VNTR) loci in several regions of East Asia, a cross-sectional study was conducted with a total of 163 Beijing strains collected from registered TB patients between 1 June 2009 and 31 November 2010 in Funing County, China. The isolated strains were analysed by 15-MIRU-VNTR loci typing and compared with published MIRU-VNTR profiles of Beijing strains. Synonymous single nucleotide polymorphisms at 10 chromosomal positions were also analysed. The combination of SNP and MIRU-VNTR typing may be used to assess Mtb genotypes in areas dominated by Beijing strains. The modern subfamily in Shanghai overlapped with strains from other countries, whereas the ancient subfamily was genetically differentiated across several countries. Modern subfamilies, especially ST10, were prevalent. Qub11b and four other loci (MIRU 26, Mtub21, Qub26, Mtub04) could be used to discriminate Beijing strains.
Human germline and pan-cancer variomes and their distinct functional profiles

PubMed Central

Pan, Yang; Karagiannis, Konstantinos; Zhang, Haichen; Dingerdissen, Hayley; Shamsaddini, Amirhossein; Wan, Quan; Simonyan, Vahan; Mazumder, Raja

2014-01-01

Identification of non-synonymous single nucleotide variations (nsSNVs) has exponentially increased due to advances in Next-Generation Sequencing technologies. The functional impacts of these variations have been difficult to ascertain because the corresponding knowledge about sequence functional sites is quite fragmented. It is clear that mapping of variations to sequence functional features can help us better understand the pathophysiological role of variations. In this study, we investigated the effect of nsSNVs on more than 17 common types of post-translational modification (PTM) sites, active sites and binding sites. Out of 1 705 285 distinct nsSNVs on 259 216 functional sites we identified 38 549 variations that significantly affect 10 major functional sites. Furthermore, we found distinct patterns of site disruptions due to germline and somatic nsSNVs. Pan-cancer analysis across 12 different cancer types led to the identification of 51 genes with 106 nsSNV affected functional sites found in 3 or more cancer types. 13 of the 51 genes overlap with previously identified Significantly Mutated Genes (Nature. 2013 Oct 17;502(7471)). 62 mutations in these 13 genes affecting functional sites such as DNA, ATP binding and various PTM sites occur across several cancers and can be prioritized for additional validation and investigations. PMID:25232094
Whole genome sequencing of 35 individuals provides insights into the genetic architecture of Korean population.

PubMed

Zhang, Wenqian; Meehan, Joe; Su, Zhenqiang; Ng, Hui Wen; Shu, Mao; Luo, Heng; Ge, Weigong; Perkins, Roger; Tong, Weida; Hong, Huixiao

2014-01-01

Due to a significant decline in the costs associated with next-generation sequencing, it has become possible to decipher the genetic architecture of a population by sequencing a large number of individuals to a deep coverage. The Korean Personal Genomes Project (KPGP) recently sequenced 35 Korean genomes at high coverage using the Illumina Hiseq platform and made the deep sequencing data publicly available, providing the scientific community opportunities to decipher the genetic architecture of the Korean population. In this study, we used two single nucleotide variant (SNV) calling pipelines: mapping the raw reads obtained from whole genome sequencing of 35 Korean individuals in KPGP using BWA and SOAP2 followed by SNV calling using SAMtools and SOAPsnp, respectively. The consensus SNVs obtained from the two SNV pipelines were used to represent the SNVs of the Korean population. We compared these SNVs to those from 17 other populations provided by the HapMap consortium and the 1000 Genomes Project (1KGP) and identified SNVs that were only present in the Korean population. We studied the mutation spectrum and analyzed the genes of non-synonymous SNVs only detected in the Korean population. We detected a total of 8,555,726 SNVs in the 35 Korean individuals and identified 1,213,613 SNVs detected in at least one Korean individual (SNV-1) and 12,640 in all of 35 Korean individuals (SNV-35) but not in 17 other populations. In contrast with the SNVs common to other populations in HapMap and 1KGP, the Korean only SNVs had high percentages of non-silent variants, emphasizing the unique roles of these Korean only SNVs in the Korean population. Specifically, we identified 8,361 non-synonymous Korean only SNVs, of which 58 SNVs existed in all 35 Korean individuals. The 5,754 genes of non-synonymous Korean only SNVs were highly enriched in some metabolic pathways. We found adhesion is the top disease term associated with SNV-1 and Nelson syndrome is the only disease term associated with SNV-35. We found that a significant number of Korean only SNVs are in genes that are associated with the drug term of adenosine. We identified the SNVs that were found in the Korean population but not seen in other populations, and explored the corresponding genes and pathways as well as the associated disease terms and drug terms. The results expand our knowledge of the genetic architecture of the Korean population, which will benefit the implementation of personalized medicine for the Korean population.
Effect of two non-synonymous ecto-5'-nucleotidase variants on the genetic architecture of inosine 5'-monophosphate (IMP) and its degradation products in Japanese Black beef.

PubMed

Uemoto, Yoshinobu; Ohtake, Tsuyoshi; Sasago, Nanae; Takeda, Masayuki; Abe, Tsuyoshi; Sakuma, Hironori; Kojima, Takatoshi; Sasaki, Shinji

2017-11-13

Umami is a Japanese term for the fifth basic taste and is an important sensory property of beef palatability. Inosine 5'-monophosphate (IMP) contributes to umami taste in beef. Thus, the overall change in concentration of IMP and its degradation products can potentially affect the beef palatability. In this study, we investigated the genetic architecture of IMP and its degradation products in Japanese Black beef. First, we performed genome-wide association study (GWAS), candidate gene analysis, and functional analysis to detect the causal variants that affect IMP, inosine, and hypoxanthine. Second, we evaluated the allele frequencies in the different breeds, the contribution of genetic variance, and the effect on other economical traits using the detected variants. A total of 574 Japanese Black cattle were genotyped using the Illumina BovineSNP50 BeadChip and were then used for GWAS. The results of GWAS showed that the genome-wide significant single nucleotide polymorphisms (SNPs) on BTA9 were detected for IMP, inosine, and hypoxanthine. The ecto-5'-nucleotidase (NT5E) gene, which encodes the enzyme NT5E for the extracellular degradation of IMP to inosine, was located near the significant region on BTA9. The results of candidate gene analysis and functional analysis showed that two non-synonymous SNPs (c.1318C > T and c.1475 T > A) in NT5E affected the amount of IMP and its degradation products in beef by regulating the enzymatic activity of NT5E. The Q haplotype showed a positive effect on IMP and a negative effect on the enzymatic activity of NT5E in IMP degradation. The two SNPs were under perfect linkage disequilibrium in five different breeds, and different haplotype frequencies were seen among breeds. The two SNPs contribute to about half of the total genetic variance in IMP, and the results of genetic relationship between IMP and its degradation products showed that NT5E affected the overall concentration balance of IMP and its degradation products. In addition, the SNPs in NT5E did not have an unfavorable effect on the other economical traits. Based on all the above findings taken together, two non-synonymous SNPs in NT5E would be useful for improving IMP and its degradation products by marker-assisted selection in Japanese Black cattle.
Coevolution between Nuclear-Encoded DNA Replication, Recombination, and Repair Genes and Plastid Genome Complexity

PubMed Central

Zhang, Jin; Ruhlman, Tracey A.; Sabir, Jamal S. M.; Blazier, John Chris; Weng, Mao-Lun; Park, Seongjun; Jansen, Robert K.

2016-01-01

Disruption of DNA replication, recombination, and repair (DNA-RRR) systems has been hypothesized to cause highly elevated nucleotide substitution rates and genome rearrangements in the plastids of angiosperms, but this theory remains untested. To investigate nuclear–plastid genome (plastome) coevolution in Geraniaceae, four different measures of plastome complexity (rearrangements, repeats, nucleotide insertions/deletions, and substitution rates) were evaluated along with substitution rates of 12 nuclear-encoded, plastid-targeted DNA-RRR genes from 27 Geraniales species. Significant correlations were detected for nonsynonymous (dN) but not synonymous (dS) substitution rates for three DNA-RRR genes (uvrB/C, why1, and gyrA) supporting a role for these genes in accelerated plastid genome evolution in Geraniaceae. Furthermore, correlation between dN of uvrB/C and plastome complexity suggests the presence of nucleotide excision repair system in plastids. Significant correlations were also detected between plastome complexity and 13 of the 90 nuclear-encoded organelle-targeted genes investigated. Comparisons revealed significant acceleration of dN in plastid-targeted genes of Geraniales relative to Brassicales suggesting this correlation may be an artifact of elevated rates in this gene set in Geraniaceae. Correlation between dN of plastid-targeted DNA-RRR genes and plastome complexity supports the hypothesis that the aberrant patterns in angiosperm plastome evolution could be caused by dysfunction in DNA-RRR systems. PMID:26893456
Genetic diversity and potential vectors and reservoirs of Cucurbit aphid-borne yellows virus in southeastern Spain.

PubMed

Kassem, Mona A; Juarez, Miguel; Gómez, Pedro; Mengual, Carmen M; Sempere, Raquel N; Plaza, María; Elena, Santiago F; Moreno, Aranzazu; Fereres, Alberto; Aranda, Miguel A

2013-11-01

The genetic variability of a Cucurbit aphid-borne yellows virus (CABYV) (genus Polerovirus, family Luteoviridae) population was evaluated by determining the nucleotide sequences of two genomic regions of CABYV isolates collected in open-field melon and squash crops during three consecutive years in Murcia (southeastern Spain). A phylogenetic analysis showed the existence of two major clades. The sequences did not cluster according to host, year, or locality of collection, and nucleotide similarities among isolates were 97 to 100 and 94 to 97% within and between clades, respectively. The ratio of nonsynonymous to synonymous nucleotide substitutions reflected that all open reading frames have been under purifying selection. Estimates of the population's genetic diversity were of the same magnitude as those previously reported for other plant virus populations sampled at larger spatial and temporal scales, suggesting either the presence of CABYV in the surveyed area long before it was first described, multiple introductions, or a particularly rapid diversification. We also determined the full-length sequences of three isolates, identifying the occurrence and location of recombination events along the CABYV genome. Furthermore, our field surveys indicated that Aphis gossypii was the major vector species of CABYV and the most abundant aphid species colonizing melon fields in the Murcia (Spain) region. Our surveys also suggested the importance of the weed species Ecballium elaterium as an alternative host and potential virus reservoir.
PCR/LDR/capillary electrophoresis for detection of single-nucleotide differences between fetal and maternal DNA in maternal plasma.

PubMed

Yi, Ping; Chen, Zhuqin; Zhao, Yan; Guo, Jianxin; Fu, Huabin; Zhou, Yuanguo; Yu, Lili; Li, Li

2009-03-01

The discovery of fetal DNA in maternal plasma has opened up an approach for noninvasive diagnosis. We have now assessed the possibility of detecting single-nucleotide differences between fetal and maternal DNA in maternal plasma by polymerase chain reaction (PCR)/ligase detection reaction((LDR)/capillary electrophoresis. PCR/LDR/capillary electrophoresis was applied to detect the genotype of c.454-397T>gene (ESR1) from experimental DNA models of maternal plasma at different sensitivity levels and 13 maternal plasma samples.alphaC in estrogen receptor. (1) Our results demonstrated that the technique could discriminate low abundance single-nucleotide mutation with a mutant/normal allele ratio up to 1:10 000. (2) Examination of ESR1 c.454-397T>C genotypes by using the method of restriction fragment length analysis was performed in 25 pregnant women, of whom 13 pregnant women had homozygous genotypes. The c.454-397T>C genotypes of paternally inherited fetal DNA in maternal plasma of these 13 women were detected by PCR/LDR/capillary electrophoresis, which were accordant with the results of umbilical cord blood. PCR/LDR/capillary electrophoresis has very high sensitivity to distinguish low abundance single nucleotide differences and can discriminate point mutations and single-nucleotide polymorphisms(SNPs) of paternally inherited fetal DNA in maternal plasma.
Phylogenomic Analyses and Reclassification of Species within the Genus Tsukamurella: Insights to Species Definition in the Post-genomic Era.

PubMed

Teng, Jade L L; Tang, Ying; Huang, Yi; Guo, Feng-Biao; Wei, Wen; Chen, Jonathan H K; Wong, Samson S Y; Lau, Susanna K P; Woo, Patrick C Y

2016-01-01

Owing to the highly similar phenotypic profiles, protein spectra and 16S rRNA gene sequences observed between three pairs of Tsukamurella species (Tsukamurella pulmonis/Tsukamurella spongiae, Tsukamurella tyrosinosolvens/Tsukamurella carboxy-divorans, and Tsukamurella pseudospumae/Tsukamurella sunchonensis), we hypothesize that and the six Tsukamurella species may have been misclassified and that there may only be three Tsukamurella species. In this study, we characterized the type strains of these six Tsukamurella species by tradition DNA-DNA hybridization (DDH) and "digital DDH" after genome sequencing to determine their exact taxonomic positions. Traditional DDH showed 81.2 ± 0.6% to 99.7 ± 1.0% DNA-DNA relatedness between the two Tsukamurella species in each of the three pairs, which was above the threshold for same species designation. "Digital DDH" based on Genome-To-Genome Distance Calculator and Average Nucleotide Identity for the three pairs also showed similarity results in the range of 82.3-92.9 and 98.1-99.1%, respectively, in line with results of traditional DDH. Based on these evidence and according to Rules 23a and 42 of the Bacteriological Code, we propose that T. spongiae Olson et al. 2007, should be reclassified as a later heterotypic synonym of T. pulmonis Yassin et al. 1996, T. carboxydivorans Park et al. 2009, as a later heterotypic synonym of T. tyrosinosolvens Yassin et al. 1997, and T. sunchonensis Seong et al. 2008 as a later heterotypic synonym of T. pseudospumae Nam et al. 2004. With the advancement of genome sequencing technologies, classification of bacterial species can be readily achieved by "digital DDH" than traditional DDH.
Aspergillus and Penicillium identification using DNA sequences: Barcode or MLST?

USDA-ARS?s Scientific Manuscript database

Current methods in DNA technology can detect single nucleotide polymorphisms with measurable accuracy using several different approaches appropriate for different uses. If there are even single nucleotide differences that are invariant markers of the species, we can accomplish identification through...
Rationally designed, heterologous S. cerevisiae transcripts expose novel expression determinants

PubMed Central

Ben-Yehezkel, Tuval; Atar, Shimshi; Zur, Hadas; Diament, Alon; Goz, Eli; Marx, Tzipy; Cohen, Rafael; Dana, Alexandra; Feldman, Anna; Shapiro, Ehud; Tuller, Tamir

2015-01-01

Deducing generic causal relations between RNA transcript features and protein expression profiles from endogenous gene expression data remains a major unsolved problem in biology. The analysis of gene expression from heterologous genes contributes significantly to solving this problem, but has been heavily biased toward the study of the effect of 5′ transcript regions and to prokaryotes. Here, we employ a synthetic biology driven approach that systematically differentiates the effect of different regions of the transcript on gene expression up to 240 nucleotides into the ORF. This enabled us to discover new causal effects between features in previously unexplored regions of transcripts, and gene expression in natural regimes. We rationally designed, constructed, and analyzed 383 gene variants of the viral HRSVgp04 gene ORF, with multiple synonymous mutations at key positions along the transcript in the eukaryote S. cerevisiae. Our results show that a few silent mutations at the 5′UTR can have a dramatic effect of up to 15 fold change on protein levels, and that even synonymous mutations in positions more than 120 nucleotides downstream from the ORF 5′end can modulate protein levels up to 160%–300%. We demonstrate that the correlation between protein levels and folding energy increases with the significance of the level of selection of the latter in endogenous genes, reinforcing the notion that selection for folding strength in different parts of the ORF is related to translation regulation. Our measured protein abundance correlates notably(correlation up to r = 0.62 (p=0.0013)) with mean relative codon decoding times, based on ribosomal densities (Ribo-Seq) in endogenous genes, supporting the conjecture that translation elongation and adaptation to the tRNA pool can modify protein levels in a causal/direct manner. This report provides an improved understanding of transcript evolution, design principles of gene expression regulation, and suggests simple rules for engineering synthetic gene expression in eukaryotes. PMID:26176266
Rationally designed, heterologous S. cerevisiae transcripts expose novel expression determinants.

PubMed

Ben-Yehezkel, Tuval; Atar, Shimshi; Zur, Hadas; Diament, Alon; Goz, Eli; Marx, Tzipy; Cohen, Rafael; Dana, Alexandra; Feldman, Anna; Shapiro, Ehud; Tuller, Tamir

2015-01-01

Deducing generic causal relations between RNA transcript features and protein expression profiles from endogenous gene expression data remains a major unsolved problem in biology. The analysis of gene expression from heterologous genes contributes significantly to solving this problem, but has been heavily biased toward the study of the effect of 5' transcript regions and to prokaryotes. Here, we employ a synthetic biology driven approach that systematically differentiates the effect of different regions of the transcript on gene expression up to 240 nucleotides into the ORF. This enabled us to discover new causal effects between features in previously unexplored regions of transcripts, and gene expression in natural regimes. We rationally designed, constructed, and analyzed 383 gene variants of the viral HRSVgp04 gene ORF, with multiple synonymous mutations at key positions along the transcript in the eukaryote S. cerevisiae. Our results show that a few silent mutations at the 5'UTR can have a dramatic effect of up to 15 fold change on protein levels, and that even synonymous mutations in positions more than 120 nucleotides downstream from the ORF 5'end can modulate protein levels up to 160%-300%. We demonstrate that the correlation between protein levels and folding energy increases with the significance of the level of selection of the latter in endogenous genes, reinforcing the notion that selection for folding strength in different parts of the ORF is related to translation regulation. Our measured protein abundance correlates notably(correlation up to r = 0.62 (p=0.0013)) with mean relative codon decoding times, based on ribosomal densities (Ribo-Seq) in endogenous genes, supporting the conjecture that translation elongation and adaptation to the tRNA pool can modify protein levels in a causal/direct manner. This report provides an improved understanding of transcript evolution, design principles of gene expression regulation, and suggests simple rules for engineering synthetic gene expression in eukaryotes.

The complete mitochondrial genome of dhole Cuon alpinus: phylogenetic analysis and dating evolutionary divergence within Canidae.

PubMed

Zhang, Honghai; Chen, Lei

2011-03-01

The dhole (Cuon alpinus) is the only existent species in the genus Cuon (Carnivora: Canidae). In the present study, the complete mitochondrial genome of the dhole was sequenced. The total length is 16672 base pairs which is the shortest in Canidae. Sequence analysis revealed that most mitochondrial genomic functional regions were highly consistent among canid animals except the CSB domain of the control region. The difference in length among the Canidae mitochondrial genome sequences is mainly due to the number of short segments of tandem repeated in the CSB domain. Phylogenetic analysis was progressed based on the concatenated data set of 14 mitochondrial genes of 8 canid animals by using maximum parsimony (MP), maximum likelihood (ML) and Bayesian (BI) inference methods. The genera Vulpes and Nyctereutes formed a sister group and split first within Canidae, followed by that in the Cuon. The divergence in the genus Canis was the latest. The divarication of domestic dogs after that of the Canis lupus laniger is completely supported by all the three topologies. Pairwise sequence divergence data of different mitochondrial genes among canid animals were also determined. Except for the synonymous substitutions in protein-coding genes, the control region exhibits the highest sequence divergences. The synonymous rates are approximately two to six times higher than those of the non-synonymous sites except for a slightly higher rate in the non-synonymous substitution between Cuon alpinus and Vulpes vulpes. 16S rRNA genes have a slightly faster sequence divergence than 12S rRNA and tRNA genes. Based on nucleotide substitutions of tRNA genes and rRNA genes, the times since divergence between dhole and other canid animals, and between domestic dogs and three subspecies of wolves were evaluated. The result indicates that Vulpes and Nyctereutes have a close phylogenetic relationship and the divergence of Nyctereutes is a little earlier. The Tibetan wolf may be an archaic pedigree within wolf subspecies. The genetic distance between wolves and domestic dogs is less than that among different subspecies of wolves. The domestication of dogs was about 1.56-1.92 million years ago or even earlier.
Bitterness of the Non-nutritive Sweetener Acesulfame Potassium Varies With Polymorphisms in TAS2R9 and TAS2R31

PubMed Central

2013-01-01

Demand for nonnutritive sweeteners continues to increase due to their ability to provide desirable sweetness with minimal calories. Acesulfame potassium and saccharin are well-studied nonnutritive sweeteners commonly found in food products. Some individuals report aversive sensations from these sweeteners, such as bitter and metallic side tastes. Recent advances in molecular genetics have provided insight into the cause of perceptual differences across people. For example, common alleles for the genes TAS2R9 and TAS2R38 explain variable response to the bitter drugs ofloxacin in vitro and propylthiouracil in vivo. Here, we wanted to determine whether differences in the bitterness of acesulfame potassium could be predicted by common polymorphisms (genetic variants) in bitter taste receptor genes (TAS2Rs). We genotyped participants (n = 108) for putatively functional single nucleotide polymorphisms in 5 TAS2Rs and asked them to rate the bitterness of 25 mM acesulfame potassium on a general labeled magnitude scale. Consistent with prior reports, we found 2 single nucleotide polymorphisms in TAS2R31 were associated with acesulfame potassium bitterness. However, TAS2R9 alleles also predicted additional variation in acesulfame potassium bitterness. Conversely, single nucleotide polymorphisms in TAS2R4, TAS2R38, and near TAS2R16 were not significant predictors. Using 1 single nucleotide polymorphism each from TAS2R9 and TAS2R31, we modeled the simultaneous influence of these single nucleotide polymorphisms on acesulfame potassium bitterness; together, these 2 single nucleotide polymorphisms explained 13.4% of the variance in perceived bitterness. These data suggest multiple polymorphisms within TAS2Rs contribute to the ability to perceive the bitterness from acesulfame potassium. PMID:23599216
Nucleotide cleaving agents and method

DOEpatents

Que, Jr., Lawrence; Hanson, Richard S.; Schnaith, Leah M. T.

2000-01-01

The present invention provides a unique series of nucleotide cleaving agents and a method for cleaving a nucleotide sequence, whether single-stranded or double-stranded DNA or RNA, using and a cationic metal complex having at least one polydentate ligand to cleave the nucleotide sequence phosphate backbone to yield a hydroxyl end and a phosphate end.
An Engineered Kinetic Amplification Mechanism for Single Nucleotide Variant Discrimination by DNA Hybridization Probes.

PubMed

Chen, Sherry Xi; Seelig, Georg

2016-04-20

Even a single-nucleotide difference between the sequences of two otherwise identical biological nucleic acids can have dramatic functional consequences. Here, we use model-guided reaction pathway engineering to quantitatively improve the performance of selective hybridization probes in recognizing single nucleotide variants (SNVs). Specifically, we build a detection system that combines discrimination by competition with DNA strand displacement-based catalytic amplification. We show, both mathematically and experimentally, that the single nucleotide selectivity of such a system in binding to single-stranded DNA and RNA is quadratically better than discrimination due to competitive hybridization alone. As an additional benefit the integrated circuit inherits the property of amplification and provides at least 10-fold better sensitivity than standard hybridization probes. Moreover, we demonstrate how the detection mechanism can be tuned such that the detection reaction is agnostic to the position of the SNV within the target sequence. in contrast, prior strand displacement-based probes designed for kinetic discrimination are highly sensitive to position effects. We apply our system to reliably discriminate between different members of the let-7 microRNA family that differ in only a single base position. Our results demonstrate the power of systematic reaction network design to quantitatively improve biotechnology.
Association of single nucleotide polymorphism in CD28(C/T-I3 + 17) and CD40 (C/T-1) genes with the Graves' disease.

PubMed

Mustafa, Saima; Fatima, Hira; Fatima, Sadia; Khosa, Tafheem; Akbar, Atif; Shaikh, Rehan Sadiq; Iqbal, Furhan

2018-01-01

To find out a correlation between the single nucleotide polymorphisms in cluster of differentiation 28 and cluster of differentiation 40 genes with Graves' disease, if any. This case-control study was conducted at the Multan Institute of Nuclear Medicine and Radiotherapy, Multan, Pakistan, and comprised blood samples of Graves' disease patients and controls. Various risk factors were also correlated either with the genotype at each single-nucleotide polymorphism or with various combinations of genotypes studied during present investigation. Of the 160 samples, there were 80(50%) each from patients and controls. Risk factor analysis revealed that gender (p=0.008), marital status (p<0.001), education (p<0.001), smoking (p<0.001), tri-iodothyronine (P <0.001), thyroxin (p<0.001) and thyroid-stimulating hormone (p<0.000) levels in blood were associated with Graves' disease. Both single-nucleotide polymorphisms in both genes were not associated with Graves' disease, either individually or in any combined form.
Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

PubMed Central

Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

2016-01-01

DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962
Divergence of RNA polymerase α subunits in angiosperm plastid genomes is mediated by genomic rearrangement

PubMed Central

Blazier, J. Chris; Ruhlman, Tracey A.; Weng, Mao-Lun; Rehman, Sumaiyah K.; Sabir, Jamal S. M.; Jansen, Robert K.

2016-01-01

Genes for the plastid-encoded RNA polymerase (PEP) persist in the plastid genomes of all photosynthetic angiosperms. However, three unrelated lineages (Annonaceae, Passifloraceae and Geraniaceae) have been identified with unusually divergent open reading frames (ORFs) in the conserved region of rpoA, the gene encoding the PEP α subunit. We used sequence-based approaches to evaluate whether these genes retain function. Both gene sequences and complete plastid genome sequences were assembled and analyzed from each of the three angiosperm families. Multiple lines of evidence indicated that the rpoA sequences are likely functional despite retaining as low as 30% nucleotide sequence identity with rpoA genes from outgroups in the same angiosperm order. The ratio of non-synonymous to synonymous substitutions indicated that these genes are under purifying selection, and bioinformatic prediction of conserved domains indicated that functional domains are preserved. One of the lineages (Pelargonium, Geraniaceae) contains species with multiple rpoA-like ORFs that show evidence of ongoing inter-paralog gene conversion. The plastid genomes containing these divergent rpoA genes have experienced extensive structural rearrangement, including large expansions of the inverted repeat. We propose that illegitimate recombination, not positive selection, has driven the divergence of rpoA. PMID:27087667
Hierarchically Aligning 10 Legume Genomes Establishes a Family-Level Genomics Platform1[OPEN

PubMed Central

Sun, Pengchuan; Li, Yuxian; Liu, Yinzhe; Yu, Jigao; Ma, Xuelian; Sun, Sangrong; Yang, Nanshan; Xia, Ruiyan; Lei, Tianyu; Liu, Xiaojian; Jiao, Beibei; Xing, Yue; Ge, Weina; Wang, Li; Song, Xiaoming; Yuan, Min; Guo, Di; Zhang, Lan; Zhang, Jiaqi; Chen, Wei; Pan, Yuxin; Liu, Tao; Jin, Ling; Sun, Jinshuai; Yu, Jiaxiang; Duan, Xueqian; Shen, Shaoqi; Qin, Jun; Zhang, Meng-chen; Paterson, Andrew H.

2017-01-01

Mainly due to their economic importance, genomes of 10 legumes, including soybean (Glycine max), wild peanut (Arachis duranensis and Arachis ipaensis), and barrel medic (Medicago truncatula), have been sequenced. However, a family-level comparative genomics analysis has been unavailable. With grape (Vitis vinifera) and selected legume genomes as outgroups, we managed to perform a hierarchical and event-related alignment of these genomes and deconvoluted layers of homologous regions produced by ancestral polyploidizations or speciations. Consequently, we illustrated genomic fractionation characterized by widespread gene losses after the polyploidizations. Notably, high similarity in gene retention between recently duplicated chromosomes in soybean supported the likely autopolyploidy nature of its tetraploid ancestor. Moreover, although most gene losses were nearly random, largely but not fully described by geometric distribution, we showed that polyploidization contributed divergently to the copy number variation of important gene families. Besides, we showed significantly divergent evolutionary levels among legumes and, by performing synonymous nucleotide substitutions at synonymous sites correction, redated major evolutionary events during their expansion. This effort laid a solid foundation for further genomics exploration in the legume research community and beyond. We describe only a tiny fraction of legume comparative genomics analysis that we performed; more information was stored in the newly constructed Legume Comparative Genomics Research Platform (www.legumegrp.org). PMID:28325848
Food searching behaviour of a Lepidoptera pest species is modulated by the foraging gene polymorphism.

PubMed

Chardonnet, Floriane; Capdevielle-Dulac, Claire; Chouquet, Bastien; Joly, Nicolas; Harry, Myriam; Le Ru, Bruno; Silvain, Jean-François; Kaiser, Laure

2014-10-01

The extent of damage to crop plants from pest insects depends on the foraging behaviour of the insect's feeding stage. Little is known, however, about the genetic and molecular bases of foraging behaviour in phytophagous pest insects. The foraging gene (for), a candidate gene encoding a PKG-I, has an evolutionarily conserved function in feeding strategies. Until now, for had never been studied in Lepidoptera, which includes major pest species. The cereal stem borer Sesamia nonagrioides is therefore a relevant species within this order with which to study conservation of and polymorphism in the for gene, and its role in foraging - a behavioural trait that is directly associated with plant injuries. Full sequencing of for cDNA in S. nonagrioides revealed a high degree of conservation with other insect taxa. Activation of PKG by a cGMP analogue increased larval foraging activity, measured by how frequently larvae moved between food patches in an actimeter. We found one non-synonymous allelic variation in a natural population that defined two allelic variants. These variants presented significantly different levels of foraging activity, and the behaviour was positively correlated to gene expression levels. Our results show that for gene function is conserved in this species of Lepidoptera, and describe an original case of a single nucleotide polymorphism associated with foraging behaviour variation in a pest insect. By illustrating how variation in this single gene can predict phenotype, this work opens new perspectives into the evolutionary context of insect adaptation to plants, as well as pest management. © 2014. Published by The Company of Biologists Ltd.
New insights into the genetics of glioblastoma multiforme by familial exome sequencing

PubMed Central

Backes, Christina; Harz, Christian; Fischer, Ulrike; Schmitt, Jana; Ludwig, Nicole; Petersen, Britt-Sabina; Mueller, Sabine C.; Kim, Yoo-Jin; Wolf, Nadine M.; Katus, Hugo A.; Meder, Benjamin; Furtwängler, Rhoikos; Franke, Andre; Bohle, Rainer; Henn, Wolfram; Graf, Norbert; Keller, Andreas; Meese, Eckart

2015-01-01

Glioblastoma multiforme (GBM) is the most aggressive and malignant subtype of human brain tumors. While a family clustering of GBM has long been acknowledged, relevant hereditary factors still remained elusive. Exome sequencing of families offers the option to discover respective genetic factors. We sequenced blood samples of one of the rare affected families: while both parents were healthy, both children were diagnosed with GBM. We report 85 homozygous non-synonymous single nucleotide variations (SNVs) in both siblings that were heterozygous in the parents. Beyond known key players for GBM such as ERBB2, PMS2, or CHI3L1, we identified over 50 genes that have not been associated to GBM so far. We also discovered three accumulative effects potentially adding to the tumorigenesis in the siblings: a clustering of multiple variants in single genes (e.g. PTPRB, CROCC), the aggregation of affected genes on specific molecular pathways (e.g. Focal adhesion or ECM receptor interaction) and genomic proximity (e.g. chr22.q12.2, chr1.p36.33). We found a striking accumulation of SNVs in specific genes for the daughter, who developed not only a GBM at the age of 12 years but was subsequently diagnosed with a pilocytic astrocytoma, a common acute lymphatic leukemia and a diffuse pontine glioma. The reported variants underline the relevance of genetic predisposition and cancer development in this family and demonstrate that GBM has a complex and heterogeneous genetic background. Sequencing of other affected families will help to further narrow down the driving genetic causes for this disease. PMID:25537509
Detecting Single-Nucleotides by Tunneling Current Measurements at Sub-MHz Temporal Resolution.

PubMed

Morikawa, Takanori; Yokota, Kazumichi; Tanimoto, Sachie; Tsutsui, Makusu; Taniguchi, Masateru

2017-04-18

Label-free detection of single-nucleotides was performed by fast tunneling current measurements in a polar solvent at 1 MHz sampling rate using SiO₂-protected Au nanoprobes. Short current spikes were observed, suggestive of trapping/detrapping of individual nucleotides between the nanoelectrodes. The fall and rise features of the electrical signatures indicated signal retardation by capacitance effects with a time constant of about 10 microseconds. The high temporal resolution revealed current fluctuations, reflecting the molecular conformation degrees of freedom in the electrode gap. The method presented in this work may enable direct characterizations of dynamic changes in single-molecule conformations in an electrode gap in liquid.
[Single nucleotide polymorphism and its application in allogeneic hematopoietic stem cell transplantation--review].

PubMed

Li, Su-Xia

2004-12-01

Single nucleotide polymorphism (SNP) is the third genetic marker after restriction fragment length polymorphism (RFLP) and short tandem repeat. It represents the most density genetic variability in the human genome and has been widely used in gene location, cloning, and research of heredity variation, as well as parenthood identification in forensic medicine. As steady heredity polymorphism, single nucleotide polymorphism is becoming the focus of attention in monitoring chimerism and minimal residual disease in the patients after allogeneic hematopoietic stem cell transplantation. The article reviews SNP heredity characterization, analysis techniques and its applications in allogeneic stem cell transplantation and other fields.
Morphological Variation of the Scorpionfly Panorpa obtusa Cheng (Mecoptera: Panorpidae) with a New Synonym

PubMed Central

Ma, Na; Hu, Guilin; Zhang, Junxia; Hua, Baozhen

2014-01-01

Background The overabundance of synonyms is an unavoidable by-product of taxonomic practice in insects. How to reduce or even eliminate synonymy has long been a great challenge for insect taxonomists. The scorpionflies Panorpa obtusa Cheng, 1949 and Panorpa leei Cheng, 1949 (Insecta: Mecoptera: Panorpidae) were originally described from Taibaishan in the Qinling Mountains with identical collection data and both are based on a single gender, the former on a male and the latter on two females. However, whether P. leei is conspecific with P. obtusa or a good species remains an unsolved problem. Results On the basis of intensive morphological comparison of 93 males and 53 females of scorpionflies collected from the type locality using light and scanning electron microscopy, we found P. obtusa has considerable morphological variation (especially the wing markings and genitalia in both male and female), and Panorpa leei is totally comprised of one of the morphs of P. obtusa. Conclusions In combination with identical type localities and overlapping morphological variation, P. leei Cheng is proposed as a junior subjective synonym of P. obtusa Cheng. To avoid synonyms, taxonomists should pay more attention to individual variation and base decisions on a series of specimens to describe new species. PMID:25250880
A new single-nucleotide polymorphism database for rainbow trout generated through whole genome re-sequencing

USDA-ARS?s Scientific Manuscript database

Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array

USDA-ARS?s Scientific Manuscript database

High-density single nucleotide polymorphism (SNP) genotyping chips are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships among individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array includ...
Novel high-speed droplet-allele specific-polymerase chain reaction: application in the rapid genotyping of single nucleotide polymorphisms.

PubMed

Taira, Chiaki; Matsuda, Kazuyuki; Yamaguchi, Akemi; Sueki, Akane; Koeda, Hiroshi; Takagi, Fumio; Kobayashi, Yukihiro; Sugano, Mitsutoshi; Honda, Takayuki

2013-09-23

Single nucleotide alterations such as single nucleotide polymorphisms (SNP) and single nucleotide mutations are associated with responses to drugs and predisposition to several diseases, and they contribute to the pathogenesis of malignancies. We developed a rapid genotyping assay based on the allele-specific polymerase chain reaction (AS-PCR) with our droplet-PCR machine (droplet-AS-PCR). Using 8 SNP loci, we evaluated the specificity and sensitivity of droplet-AS-PCR. Buccal cells were pretreated with proteinase K and subjected directly to the droplet-AS-PCR without DNA extraction. The genotypes determined using the droplet-AS-PCR were then compared with those obtained by direct sequencing. Specific PCR amplifications for the 8 SNP loci were detected, and the detection limit of the droplet-AS-PCR was found to be 0.1-5.0% by dilution experiments. Droplet-AS-PCR provided specific amplification when using buccal cells, and all the genotypes determined within 9 min were consistent with those obtained by direct sequencing. Our novel droplet-AS-PCR assay enabled high-speed amplification retaining specificity and sensitivity and provided ultra-rapid genotyping. Crude samples such as buccal cells were available for the droplet-AS-PCR assay, resulting in the reduction of the total analysis time. Droplet-AS-PCR may therefore be useful for genotyping or the detection of single nucleotide alterations. Copyright © 2013 Elsevier B.V. All rights reserved.
Identification of a novel truncating PALB2 mutation and analysis of its contribution to early-onset breast cancer in French-Canadian women.

PubMed

Foulkes, William D; Ghadirian, Parviz; Akbari, Mohammed Reza; Hamel, Nancy; Giroux, Sylvie; Sabbaghian, Nelly; Darnel, Andrew; Royer, Robert; Poll, Aletta; Fafard, Eve; Robidoux, André; Martin, Ginette; Bismar, Tarek A; Tischkowitz, Marc; Rousseau, Francois; Narod, Steven A

2007-01-01

PALB2 has recently been identified as a breast cancer susceptibility gene. PALB2 mutations are rare causes of hereditary breast cancer but may be important in countries such as Finland where a founder mutation is present. We sought to estimate the contribution of PALB2 mutations to the burden of breast cancer in French Canadians from Quebec. We screened all coding exons of PALB2 in a sample of 50 French-Canadian women diagnosed with either early-onset breast cancer or familial breast cancer at a single Montreal hospital. The genetic variants identified in this sample were then studied in 356 additional women with breast cancer diagnosed before age 50 and in 6,448 newborn controls. We identified a single protein-truncating mutation in PALB2 (c.2323 C>T, resulting in Q775X) in 1 of the 50 high-risk women. This variant was present in 2 of 356 breast cancer cases and in none of 6,440 newborn French-Canadian controls (P = 0.003). We also identified two novel new non-synonymous single nucleotide polymorphisms in exon 4 of PALB2 (c.5038 A>G [I76V] and c.5156 G>T [G115V]). G115V was found in 1 of 356 cases and in 15 of 6,442 controls (P = 0.6). The I76V variant was not identified in either the extended case series or the controls. We have identified a novel truncating mutation in PALB2. The mutation was found in approximately 0.5% of unselected French-Canadian women with early-onset breast cancer and appears to have a single origin. Although mutations are infrequent, PALB2 can be added to the list of breast cancer susceptibility genes for which founder mutations have been identified in the French-Canadian population.
Identification of a novel truncating PALB2 mutation and analysis of its contribution to early-onset breast cancer in French-Canadian women

PubMed Central

Foulkes, William D; Ghadirian, Parviz; Akbari, Mohammed Reza; Hamel, Nancy; Giroux, Sylvie; Sabbaghian, Nelly; Darnel, Andrew; Royer, Robert; Poll, Aletta; Fafard, Eve; Robidoux, André; Martin, Ginette; Bismar, Tarek A; Tischkowitz, Marc; Rousseau, Francois; Narod, Steven A

2007-01-01

Background PALB2 has recently been identified as a breast cancer susceptibility gene. PALB2 mutations are rare causes of hereditary breast cancer but may be important in countries such as Finland where a founder mutation is present. We sought to estimate the contribution of PALB2 mutations to the burden of breast cancer in French Canadians from Quebec. Methods We screened all coding exons of PALB2 in a sample of 50 French-Canadian women diagnosed with either early-onset breast cancer or familial breast cancer at a single Montreal hospital. The genetic variants identified in this sample were then studied in 356 additional women with breast cancer diagnosed before age 50 and in 6,448 newborn controls. Results We identified a single protein-truncating mutation in PALB2 (c.2323 C>T, resulting in Q775X) in 1 of the 50 high-risk women. This variant was present in 2 of 356 breast cancer cases and in none of 6,440 newborn French-Canadian controls (P = 0.003). We also identified two novel new non-synonymous single nucleotide polymorphisms in exon 4 of PALB2 (c.5038 A>G [I76V] and c.5156 G>T [G115V]). G115V was found in 1 of 356 cases and in 15 of 6,442 controls (P = 0.6). The I76V variant was not identified in either the extended case series or the controls. Conclusion We have identified a novel truncating mutation in PALB2. The mutation was found in approximately 0.5% of unselected French-Canadian women with early-onset breast cancer and appears to have a single origin. Although mutations are infrequent, PALB2 can be added to the list of breast cancer susceptibility genes for which founder mutations have been identified in the French-Canadian population. PMID:18053174
Identification of Critical Residues for the Tight Binding of Both Correct and Incorrect Nucleotides to Human DNA Polymerase λ

PubMed Central

Brown, Jessica A.; Pack, Lindsey R.; Sherrer, Shanen M.; Kshetry, Ajay K.; Newmister, Sean A.; Fowler, Jason D.; Taylor, John-Stephen; Suo, Zucai

2010-01-01

DNA polymerase λ (Pol λ) is a novel X-family DNA polymerase that shares 34% sequence identity with DNA polymerase β (Pol β). Pre-steady state kinetic studies have shown that the Pol λ•DNA complex binds both correct and incorrect nucleotides 130-fold tighter on average than the Pol β•DNA complex, although, the base substitution fidelity of both polymerases is 10−4 to 10−5. To better understand Pol λ’s tight nucleotide binding affinity, we created single- and double-substitution mutants of Pol λ to disrupt interactions between active site residues and an incoming nucleotide or a template base. Single-turnover kinetic assays showed that Pol λ binds to an incoming nucleotide via cooperative interactions with active site residues (R386, R420, K422, Y505, F506, A510, and R514). Disrupting protein interactions with an incoming correct or incorrect nucleotide impacted binding with each of the common structural moieties in the following order: triphosphate ≫ base > ribose. In addition, the loss of Watson-Crick hydrogen bonding between the nucleotide and template base led to a moderate increase in the Kd. The fidelity of Pol λ was maintained predominantly by a single residue, R517, which has minor groove interactions with the DNA template. PMID:20851705
Mitochondrial DNA variations in ova and blastocyst: implications in assisted reproduction.

PubMed

Shamsi, Monis Bilal; Govindaraj, Periyasamy; Chawla, Latika; Malhotra, Neena; Singh, Neeta; Mittal, Suneeta; Talwar, Pankaj; Thangaraj, Kumarasamy; Dada, Rima

2013-03-01

Mitochondrial DNA (mtDNA) of oocyte is critical for its function, embryo quality and development. Analysis of complete mtDNA of 49 oocytes and 18 blastocysts from 67 females opting for IVF revealed 437 nucleotide variations. 40.29% samples had either disease associated or non-synonymous novel or pathogenic mutation in evolutionarily conserved regions. Samples with disease associated mtDNA mutations had low fertilization rate and poor embryo quality, however no difference in implantation or clinical pregnancy rate was observed. Screening mtDNA from oocyte/blastocyst is a simple, clinically reliable method for diagnostic evaluation of female infertility and may reduce risk of mtDNA disease transmission. Copyright © 2013 Elsevier B.V. and Mitochondria Research Society. All rights reserved.

Genetic characterization of the UCS and Kex1 loci of Pneumocystis jirovecii.

PubMed

Esteves, F; Tavares, A; Costa, M C; Gaspar, J; Antunes, F; Matos, O

2009-02-01

Nucleotide variation in the Pneumocystis jirovecii upstream conserved sequence (UCS) and kexin-like serine protease (Kex1) loci was studied in pulmonary specimens from Portuguese HIV-positive patients. DNA was extracted and used for specific molecular sequence analysis. The number of UCS tandem repeats detected in 13 successfully sequenced isolates ranged from three (9 isolates, 69%) to four (4 isolates, 31%). A novel tandem repeat pattern and two novel polymorphisms were detected in the UCS region. For the Kex1 gene, the wild-type (24 isolates, 86%) was the most frequent sequence detected among the 28 sequenced isolates. Nevertheless, a nonsynonymous (1 isolate, 3%) and three synonymous (3 isolates, 11%) polymorphisms were detected and are described here for the first time.
Genome-wide association study of fertility traits in dairy cattle using high-density single nucleotide polymorphism marker panels

USDA-ARS?s Scientific Manuscript database

Unfavorable genetic correlations between production and fertility traits are well documented. Genetic selection for fertility traits is slow, however, due to low heritabilities. Identification of single nucleotide polymorphisms (SNP) involved in reproduction could improve reliability of genomic esti...
Discovery, Validation and Characterization of 1039 Cattle Single Nucleotide Polymorphisms

USDA-ARS?s Scientific Manuscript database

We identified approximately 13000 putative single nucleotide polymorphisms (SNPs) by comparison of repeat-masked BAC-end sequences from the cattle RPCI-42 BAC library with whole-genome shotgun contigs of cattle genome assembly Btau 1.0. Genotyping of a subset of these SNPs was performed on a panel ...
High-throughput single nucleotide polymorphism genotyping for breeding applications in rice using the BeadXpress platform

USDA-ARS?s Scientific Manuscript database

Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...
Developing Single Nucleotide Polymorphism (SNP) markers from transcriptome sequences for the identification of longan (Dimocarpus longan) germplasm

USDA-ARS?s Scientific Manuscript database

Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...
Informativeness of single nucleotide polymorphisms and relationships among onion populations from important world production regions

USDA-ARS?s Scientific Manuscript database

Single nucleotide polymorphisms (SNPs) were genotyped using a high-density array and DNAs from individual plants from important onion populations from major production regions world-wide and the likely progenitor of onion, Allium vavilovii. Genotypes at 1226 SNPs were used to estimate genetic relati...
Relationships among calpastatin single nucleotide polymorphisms, calpastatin expression and tenderness in pork longissimus

USDA-ARS?s Scientific Manuscript database

Genome scans in the pig have identified a region on chromosome 2 (SSC2) associated with tenderness. Calpastatin is a likely positional candidate gene in this region because of its inhibitory role in the calpain system that is involved in postmortem tenderization. Novel single nucleotide polymorphism...
Lineage and genogroup-defining single nucleotide polymorphisms of Escherichia coli 0157:H7

USDA-ARS?s Scientific Manuscript database

Escherichia coli O157:H7 is a zoonotic human pathogen for which cattle are an important reservoir host. Using both previously published and new sequencing data, a 48-locus single nucleotide polymorphism (SNP) based typing panel was developed that redundantly identified eleven genogroups that span ...
A new single-nucleotide polymorphisms database for rainbow trout generated through whole genome resequencing of selected samples

USDA-ARS?s Scientific Manuscript database

Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
A novel MALDI–TOF based methodology for genotyping single nucleotide polymorphisms

PubMed Central

Blondal, Thorarinn; Waage, Benedikt G.; Smarason, Sigurdur V.; Jonsson, Frosti; Fjalldal, Sigridur B.; Stefansson, Kari; Gulcher, Jeffery; Smith, Albert V.

2003-01-01

A new MALDI–TOF based detection assay was developed for analysis of single nucleotide polymorphisms (SNPs). It is a significant modification on the classic three-step minisequencing method, which includes a polymerase chain reaction (PCR), removal of excess nucleotides and primers, followed by primer extension in the presence of dideoxynucleotides using modified thermostable DNA polymerase. The key feature of this novel assay is reliance upon deoxynucleotide mixes, lacking one of the nucleotides at the polymorphic position. During primer extension in the presence of depleted nucleotide mixes, standard thermostable DNA polymerases dissociate from the template at positions requiring a depleted nucleotide; this principal was harnessed to create a genotyping assay. The assay design requires a primer- extension primer having its 3′-end one nucleotide upstream from the interrogated site. The assay further utilizes the same DNA polymerase in both PCR and the primer extension step. This not only simplifies the assay but also greatly reduces the cost per genotype compared to minisequencing methodology. We demonstrate accurate genotyping using this methodology for two SNPs run in both singleplex and duplex reactions. We term this assay nucleotide depletion genotyping (NUDGE). Nucleotide depletion genotyping could be extended to other genotyping assays based on primer extension such as detection by gel or capillary electrophoresis. PMID:14654708
Biological nanopore MspA for DNA sequencing

NASA Astrophysics Data System (ADS)

Manrao, Elizabeth A.

Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.
Five new synonyms in Epimedium (Berberidaceae) from China.

PubMed

Zhang, Yanjun; Dang, Haishan; Li, Shengyu; Li, Jianqiang; Wang, Ying

2015-01-01

Five new synonyms in Chinese Epimedium are designated in the present paper. Epimediumchlorandrum is treated as a synonym of Epimediumacuminatum; Epimediumrhizomatosum as a synonym of Epimediummembranaceum; Epimediumbrachyrrhizum as a synonym of Epimediumleptorrhizum; Epimediumdewuense as a synonym of Epimediumdolichostemon; and Epimediumsagittatumvar.oblongifoliolatum as a synonym of Epimediumborealiguizhouense.
Variation of Cats under Domestication: Genetic Assignment of Domestic Cats to Breeds and Worldwide Random Bred Populations

PubMed Central

Kurushima, J. D.; Lipinski, M. J.; Gandolfi, B.; Froenicke, L.; Grahn, J. C.; Grahn, R. A.; Lyons, L. A.

2012-01-01

Summary Both cat breeders and the lay public have interests in the origins of their pets, not only in the genetic identity of the purebred individuals, but also the historical origins of common household cats. The cat fancy is a relatively new institution with over 85% of its 40–50 breeds arising only in the past 75 years, primarily through selection on single-gene aesthetic traits. The short, yet intense cat breed history poses a significant challenge to the development of a genetic marker-based breed identification strategy. Using different breed assignment strategies and methods, 477 cats representing 29 fancy breeds were analysed with 38 short tandem repeats, 148 intergenic and five phenotypic single nucleotide polymorphisms. Results suggest the frequentist method of Paetkau (accuracy single nucleotide polymorphisms = 0.78, short tandem repeats = 0.88) surpasses the Bayesian method of Rannala and Mountain (single nucleotide polymorphisms = 0.56, short tandem repeats = 0.83) for accurate assignment of individuals to the correct breed. Additionally, a post-assignment verification step with the five phenotypic single nucleotide polymorphisms accurately identified between 0.31 and 0.58 of the mis-assigned individuals raising the sensitivity of assignment with the frequentist method to 0.89 and 0.92 single nucleotide polymorphisms and short tandem repeats respectively. This study provides a novel multi-step assignment strategy and suggests that, despite their short breed history and breed family groupings, a majority of cats can be assigned to their proper breed or population of origin, i.e. race. PMID:23171373
Imputation of single nucleotide polymorhpism genotypes of Hereford cattle: reference panel size, family relationship and population structure

USDA-ARS?s Scientific Manuscript database

The objective of this study is to investigate single nucleotide polymorphism (SNP) genotypes imputation of Hereford cattle. Purebred Herefords were from two sources, Line 1 Hereford (N=240) and representatives of Industry Herefords (N=311). Using different reference panels of 62 and 494 males with 1...
A resource of single-nucleotide polymorphisms for rainbow trout generated by restriction-site associated DNA sequencing of doubled haploids

USDA-ARS?s Scientific Manuscript database

Salmonid genomes are considered to be in a pseudo-tetraploid state as a result of an evolutionarily recent genome duplication event. This situation complicates single nucleotide polymorphism (SNP) discovery in rainbow trout as many putative SNPs are actually paralogous sequence variants (PSVs) and ...
Single nucleotide polymorphisms in candidate genes associated with fertilizing ability of sperm and subsequent embryonic development in cattle

USDA-ARS?s Scientific Manuscript database

Fertilization and development of the preimplantation embryo is under genetic control. The goal of the current study was to test 434 single nucleotide polymorphisms (SNPs) for association with genetic variation in fertilization and early embryonic development. The approach was to produce embryos from...
Prospects for inferring pairwise relationships with single nucleotide polymorphisms

Treesearch

Jeffery C. Glaubitz; O. Eugene, Jr. Rhodes; J. Andrew DeWoody

2003-01-01

An extraordinarily large number of single nucleotide polymorphisms (SNPs) are now available in humans as well as in other model organisms. Technological advancements may soon make it feasible to assay hundreds of SNPs in virtually any organism of interest. One potential application of SNPs is the determination of pairwise genetic relationships in populations without...
Short communication: Relationship of call rate and accuracy of single nucleotide polymorphism genotypes in dairy cattle

USDA-ARS?s Scientific Manuscript database

Call rate has been used as a measure of quality on both a single nucleotide polymorphism (SNP) and animal basis since SNP genotypes were first used in genomic evaluation of dairy cattle. The genotyping laboratories perform initial quality control screening and genotypes that fail are usually exclude...
Single nucleotide polymorphisms generated by genotyping by sequencing to characterize genome-wide diversity, linkage disequilibrium, and selective sweeps in cultivated watermelon

USDA-ARS?s Scientific Manuscript database

Large datasets containing single nucleotide polymorphisms (SNPs) are used to analyze genome-wide diversity in a robust collection of cultivars from representative accessions, across the world. The extent of linkage disequilibrium (LD) within a population determines the number of markers required fo...
Lack of Association Between Polymorphisms in Dopa Decarboxylase and Dopamine Receptor-1 Genes With Childhood Autism in Chinese Han Population.

PubMed

Yu, Hong; Liu, Jun; Yang, Aiping; Yang, Guohui; Yang, Wenjun; Lei, Heyue; Quan, Jianjun; Zhang, Zengyu

2016-04-01

Genetic factors play an important role in childhood autism. This study is to determine the association of single-nucleotide polymorphisms in dopa decarboxylase (DDC) and dopamine receptor-1 (DRD1) genes with childhood autism, in a Chinese Han population. A total of 211 autistic children and 250 age- and gender-matched healthy controls were recruited. The severity of disease was determined by Children Autism Rating Scale scores. TaqMan Probe by real-time polymerase chain reaction was used to determine genotypes and allele frequencies of single-nucleotide polymorphism rs6592961 in DDC and rs251937 in DRD1. Case-control and case-only studies were respectively performed, to determine the contribution of both single-nucleotide polymorphisms to the predisposition of disease and its severity. Our results showed that there was no significant association of the genotypes and allele frequencies of both single-nucleotide polymorphisms concerning childhood autism and its severity. More studies with larger samples are needed to corroborate their predicting roles. © The Author(s) 2015.

Single-molecule comparison of DNA Pol I activity with native and analog nucleotides

NASA Astrophysics Data System (ADS)

Gul, Osman; Olsen, Tivoli; Choi, Yongki; Corso, Brad; Weiss, Gregory; Collins, Philip

2014-03-01

DNA polymerases are critical enzymes for DNA replication, and because of their complex catalytic cycle they are excellent targets for investigation by single-molecule experimental techniques. Recently, we studied the Klenow fragment (KF) of DNA polymerase I using a label-free, electronic technique involving single KF molecules attached to carbon nanotube transistors. The electronic technique allowed long-duration monitoring of a single KF molecule while processing thousands of template strands. Processivity of up to 42 nucleotide bases was directly observed, and statistical analysis of the recordings determined key kinetic parameters for the enzyme's open and closed conformations. Subsequently, we have used the same technique to compare the incorporation of canonical nucleotides like dATP to analogs like 1-thio-2'-dATP. The analog had almost no affect on duration of the closed conformation, during which the nucleotide is incorporated. On the other hand, the analog increased the rate-limiting duration of the open conformation by almost 40%. We propose that the thiolated analog interferes with KF's recognition and binding, two key steps that determine its ensemble turnover rate.
Polymorphism of the prion protein gene (PRNP) in two Chinese indigenous cattle breeds.

PubMed

Qin, L H; Zhao, Y M; Bao, Y H; Bai, W L; Chong, J; Zhang, G L; Zhang, J B; Zhao, Z H

2011-08-01

Prion protein (PRNP) gene has been located at position q17 of chromosome 13 in cattle. The polymorphisms of PRNP gene might be associated with BSE susceptibility. In the present work, we investigated the polymorphisms of PRNP gene, including SNP in exon 3, 23-bp indel in promoter region, 12-bp indel in intron 1 in 2 Chinese indigenous cattle breeds of northeast China. Eighty-six animals from Yanbian (34) and Chinese Red Steppes (52) were genotyped at PRNP locus by analyzing genomic DNA. A total of 4 single nucleotide polymorphism (SNP) sites were revealed in the PRNP gene exon 3 of the 2 cattle breeds investigated. Three of these SNPs were non-synonymous mutations that resulted in the amino acid exchanges (K119N, S154N, and M177V), and one is silent nucleotide substitutions (A234G). The two amino acid mutations of S154N and M177V were detected only in Yanbian with a very low frequency (0.0147), and they appears to be absent in Chinese Red Steppes. The average gene heterozygosity (He), effective allele numbers (Ne), Shannon's information index (I) and polymorphism information content (PIC) were 0.3088, 1.5013, 0.3814 and 0.2000 in Yanbian, respectively, being relatively higher than that of Chinese Red Steppes (0.2885, 1.4985, 0.3462 and 0.1873, respectively). In 23-bp indel and 12-bp indel loci, three different genotypes were identified in both Yanbian and Chinese Red Steppes breeds. Based 23- and 12-bp indels, four haplotypes was constructed in the 2 Chinese cattle breeds, of which the 23-bp (-)/12-bp (-) was main haplotypes accounting for more than 50% of the total in both Yanbian and Chinese Red Steppes breeds. These results might be useful in understanding the genetic characteristics of PRNP gene in Chinese indigenous cattle breeds.
Mapping of Mcs30, a new mammary carcinoma susceptibility quantitative trait locus (QTL30) on rat chromosome 12: identification of fry as a candidate Mcs gene.

PubMed

Ren, Xuefeng; Graham, Jessica C; Jing, Lichen; Mikheev, Andrei M; Gao, Yuan; Lew, Jenny Pan; Xie, Hong; Kim, Andrea S; Shang, Xiuling; Friedman, Cynthia; Vail, Graham; Fang, Ming Zhu; Bromberg, Yana; Zarbl, Helmut

2013-01-01

Rat strains differ dramatically in their susceptibility to mammary carcinogenesis. On the assumption that susceptibility genes are conserved across mammalian species and hence inform human carcinogenesis, numerous investigators have used genetic linkage studies in rats to identify genes responsible for differential susceptibility to carcinogenesis. Using a genetic backcross between the resistant Copenhagen (Cop) and susceptible Fischer 344 (F344) strains, we mapped a novel mammary carcinoma susceptibility (Mcs30) locus to the centromeric region on chromosome 12 (LOD score of ∼8.6 at the D12Rat59 marker). The Mcs30 locus comprises approximately 12 Mbp on the long arm of rat RNO12 whose synteny is conserved on human chromosome 13q12 to 13q13. After analyzing numerous genes comprising this locus, we identified Fry, the rat ortholog of the furry gene of Drosophila melanogaster, as a candidate Mcs gene. We cloned and determined the complete nucleotide sequence of the 13 kbp Fry mRNA. Sequence analysis indicated that the Fry gene was highly conserved across evolution, with 90% similarity of the predicted amino acid sequence among eutherian mammals. Comparison of the Fry sequence in the Cop and F344 strains identified two non-synonymous single nucleotide polymorphisms (SNPs), one of which creates a putative, de novo phosphorylation site. Further analysis showed that the expression of the Fry gene is reduced in a majority of rat mammary tumors. Our results also suggested that FRY activity was reduced in human breast carcinoma cell lines as a result of reduced levels or mutation. This study is the first to identify the Fry gene as a candidate Mcs gene. Our data suggest that the SNPs within the Fry gene contribute to the genetic susceptibility of the F344 rat strain to mammary carcinogenesis. These results provide the foundation for analyzing the role of the human FRY gene in cancer susceptibility and progression.
Coevolution between Nuclear-Encoded DNA Replication, Recombination, and Repair Genes and Plastid Genome Complexity.

PubMed

Zhang, Jin; Ruhlman, Tracey A; Sabir, Jamal S M; Blazier, John Chris; Weng, Mao-Lun; Park, Seongjun; Jansen, Robert K

2016-02-17

Disruption of DNA replication, recombination, and repair (DNA-RRR) systems has been hypothesized to cause highly elevated nucleotide substitution rates and genome rearrangements in the plastids of angiosperms, but this theory remains untested. To investigate nuclear-plastid genome (plastome) coevolution in Geraniaceae, four different measures of plastome complexity (rearrangements, repeats, nucleotide insertions/deletions, and substitution rates) were evaluated along with substitution rates of 12 nuclear-encoded, plastid-targeted DNA-RRR genes from 27 Geraniales species. Significant correlations were detected for nonsynonymous (dN) but not synonymous (dS) substitution rates for three DNA-RRR genes (uvrB/C, why1, and gyrA) supporting a role for these genes in accelerated plastid genome evolution in Geraniaceae. Furthermore, correlation between dN of uvrB/C and plastome complexity suggests the presence of nucleotide excision repair system in plastids. Significant correlations were also detected between plastome complexity and 13 of the 90 nuclear-encoded organelle-targeted genes investigated. Comparisons revealed significant acceleration of dN in plastid-targeted genes of Geraniales relative to Brassicales suggesting this correlation may be an artifact of elevated rates in this gene set in Geraniaceae. Correlation between dN of plastid-targeted DNA-RRR genes and plastome complexity supports the hypothesis that the aberrant patterns in angiosperm plastome evolution could be caused by dysfunction in DNA-RRR systems. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Transcript-specific, single-nucleotide polymorphism discovery and linkage analysis in hexaploid bread wheat (Triticum aestivum L.).

PubMed

Allen, Alexandra M; Barker, Gary L A; Berry, Simon T; Coghill, Jane A; Gwilliam, Rhian; Kirby, Susan; Robinson, Phil; Brenchley, Rachel C; D'Amore, Rosalinda; McKenzie, Neil; Waite, Darren; Hall, Anthony; Bevan, Michael; Hall, Neil; Edwards, Keith J

2011-12-01

Food security is a global concern and substantial yield increases in cereal crops are required to feed the growing world population. Wheat is one of the three most important crops for human and livestock feed. However, the complexity of the genome coupled with a decline in genetic diversity within modern elite cultivars has hindered the application of marker-assisted selection (MAS) in breeding programmes. A crucial step in the successful application of MAS in breeding programmes is the development of cheap and easy to use molecular markers, such as single-nucleotide polymorphisms. To mine selected elite wheat germplasm for intervarietal single-nucleotide polymorphisms, we have used expressed sequence tags derived from public sequencing programmes and next-generation sequencing of normalized wheat complementary DNA libraries, in combination with a novel sequence alignment and assembly approach. Here, we describe the development and validation of a panel of 1114 single-nucleotide polymorphisms in hexaploid bread wheat using competitive allele-specific polymerase chain reaction genotyping technology. We report the genotyping results of these markers on 23 wheat varieties, selected to represent a broad cross-section of wheat germplasm including a number of elite UK varieties. Finally, we show that, using relatively simple technology, it is possible to rapidly generate a linkage map containing several hundred single-nucleotide polymorphism markers in the doubled haploid mapping population of Avalon × Cadenza. © 2011 The Authors. Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.
TPH-2 Polymorphisms Interact with Early Life Stress to Influence Response to Treatment with Antidepressant Drugs.

PubMed

Xu, Zhi; Reynolds, Gavin P; Yuan, Yonggui; Shi, Yanyan; Pu, Mengjia; Zhang, Zhijun

2016-11-01

Variation in genes implicated in monoamine neurotransmission may interact with environmental factors to influence antidepressant response. We aimed to determine how a range of single nucleotide polymorphisms in monoaminergic genes influence this response to treatment and how they interact with childhood trauma and recent life stress in a Chinese sample. An initial study of monoaminergic coding region single nucleotide polymorphisms identified significant associations of TPH2 and HTR1B single nucleotide polymorphisms with treatment response that showed interactions with childhood and recent life stress, respectively (Xu et al., 2012). A total of 47 further single nucleotide polymorphisms in 17 candidate monoaminergic genes were genotyped in 281 Chinese Han patients with major depressive disorder. Response to 6 weeks' antidepressant treatment was determined by change in the 17-item Hamilton Depression Rating Scale score, and previous stressful events were evaluated by the Life Events Scale and Childhood Trauma Questionnaire-Short Form. Three TPH2 single nucleotide polymorphisms (rs11178998, rs7963717, and rs2171363) were significantly associated with antidepressant response in this Chinese sample, as was a haplotype in TPH2 (rs2171363 and rs1487278). One of these, rs2171363, showed a significant interaction with childhood adversity in its association with antidepressant response. These findings provide further evidence that variation in TPH2 is associated with antidepressant response and may also interact with childhood trauma to influence outcome of antidepressant treatment. © The Author 2016. Published by Oxford University Press on behalf of CINP.
TPH-2 Polymorphisms Interact with Early Life Stress to Influence Response to Treatment with Antidepressant Drugs

PubMed Central

Reynolds, Gavin P.; Yuan, Yonggui; Shi, Yanyan; Pu, Mengjia; Zhang, Zhijun

2016-01-01

Background: Variation in genes implicated in monoamine neurotransmission may interact with environmental factors to influence antidepressant response. We aimed to determine how a range of single nucleotide polymorphisms in monoaminergic genes influence this response to treatment and how they interact with childhood trauma and recent life stress in a Chinese sample. An initial study of monoaminergic coding region single nucleotide polymorphisms identified significant associations of TPH2 and HTR1B single nucleotide polymorphisms with treatment response that showed interactions with childhood and recent life stress, respectively (Xu et al., 2012). Methods: A total of 47 further single nucleotide polymorphisms in 17 candidate monoaminergic genes were genotyped in 281 Chinese Han patients with major depressive disorder. Response to 6 weeks’ antidepressant treatment was determined by change in the 17-item Hamilton Depression Rating Scale score, and previous stressful events were evaluated by the Life Events Scale and Childhood Trauma Questionnaire-Short Form. Results: Three TPH2 single nucleotide polymorphisms (rs11178998, rs7963717, and rs2171363) were significantly associated with antidepressant response in this Chinese sample, as was a haplotype in TPH2 (rs2171363 and rs1487278). One of these, rs2171363, showed a significant interaction with childhood adversity in its association with antidepressant response. Conclusions: These findings provide further evidence that variation in TPH2 is associated with antidepressant response and may also interact with childhood trauma to influence outcome of antidepressant treatment. PMID:27521242
Identification of IDUA and WNT16 Phosphorylation-Related Non-Synonymous Polymorphisms for Bone Mineral Density in Meta-Analyses of Genome-Wide Association Studies

PubMed Central

Niu, Tianhua; Liu, Ning; Yu, Xun; Zhao, Ming; Choi, Hyung Jin; Leo, Paul J.; Brown, Matthew A.; Zhang, Lei; Pei, Yu-Fang; Shen, Hui; He, Hao; Fu, Xiaoying; Lu, Shan; Chen, Xiang-Ding; Tan, Li-Jun; Yang, Tie-Lin; Guo, Yan; Cho, Nam H.; Shen, Jie; Guo, Yan-Fang; Nicholson, Geoffrey C.; Prince, Richard L.; Eisman, John A.; Jones, Graeme; Sambrook, Philip N.; Tian, Qing; Zhu, Xue-Zhen; Papasian, Christopher J.; Duncan, Emma L.; Uitterlinden, André G.; Shin, Chan Soo; Xiang, Shuanglin; Deng, Hong-Wen

2016-01-01

Protein phosphorylation regulates a wide variety of cellular processes. Thus, we hypothesize that single nucleotide polymorphisms (SNPs) that may modulate protein phosphorylation could affect osteoporosis risk. Based on a previous conventional genome-wide association (GWA) study, we conducted a three-stage meta-analysis targeting phosphorylation-related SNPs (phosSNPs) for femoral neck (FN)-, total hip (HIP)-, and Lumbar Spine (LS)-BMD phenotypes. In stage 1, 9,593 phosSNPs were meta-analyzed in 11,140 individuals of various ancestries. Genome-wide significance (GWS) and suggestive significance were defined by α = 5.21×10−6 (0.05/9,593) and 1.00×10−4, respectively. In stage 2, 9 stage 1-discovered phosSNPs (based on α = 1.00×10−4) were in silico meta-analyzed in Dutch, Korean, and Australian cohorts. In stage 3, four phosSNPs that replicated in stage 2 (based on α = 5.56×10−3, 0.05/9) were de novo genotyped in two independent cohorts. IDUA rs3755955 and rs6831280, and WNT16 rs2707466 were associated with BMD phenotypes in each respective stage, and in 3 stages combined, achieving GWS for both FN-BMD (P-value = 8.36×10−10, 5.26×10−10, and 3.01×10−10, respectively) and HIP-BMD (P-value = 3.26×10−6, 1.97×10−6, and 1.63×10−12, respectively). Although in vitro studies demonstrated no differences in expressions of wild-type and mutant forms of IDUA and WNT16B proteins, in silico analysis predicts that WNT16 rs2707466 directly abolishes a phosphorylation site, which could cause a deleterious effect on WNT16 protein, and that IDUA phosSNPs rs3755955 and rs6831280 could exert indirect effects on nearby phosphorylation sites. Further studies will be required to determine the detailed and specific molecular effects of these BMD-associated non-synonymous variants. PMID:26256109
Multiple origins of resistance-conferring mutations in Plasmodium vivax dihydrofolate reductase

PubMed Central

Hawkins, Vivian N; Auliff, Alyson; Prajapati, Surendra Kumar; Rungsihirunrat, Kanchana; Hapuarachchi, Hapuarachchige C; Maestre, Amanda; O'Neil, Michael T; Cheng, Qin; Joshi, Hema; Na-Bangchang, Kesara; Sibley, Carol Hopkins

2008-01-01

Background In order to maximize the useful therapeutic life of antimalarial drugs, it is crucial to understand the mechanisms by which parasites resistant to antimalarial drugs are selected and spread in natural populations. Recent work has demonstrated that pyrimethamine-resistance conferring mutations in Plasmodium falciparum dihydrofolate reductase (dhfr) have arisen rarely de novo, but spread widely in Asia and Africa. The origin and spread of mutations in Plasmodium vivax dhfr were assessed by constructing haplotypes based on sequencing dhfr and its flanking regions. Methods The P. vivax dhfr coding region, 792 bp upstream and 683 bp downstream were amplified and sequenced from 137 contemporary patient isolates from Colombia, India, Indonesia, Papua New Guinea, Sri Lanka, Thailand, and Vanuatu. A repeat motif located 2.6 kb upstream of dhfr was also sequenced from 75 of 137 patient isolates, and mutational relationships among the haplotypes were visualized using the programme Network. Results Synonymous and non-synonymous single nucleotide polymorphisms (SNPs) within the dhfr coding region were identified, as was the well-documented in-frame insertion/deletion (indel). SNPs were also identified upstream and downstream of dhfr, with an indel and a highly polymorphic repeat region identified upstream of dhfr. The regions flanking dhfr were highly variable. The double mutant (58R/117N) dhfr allele has evolved from several origins, because the 58R is encoded by at least 3 different codons. The triple (58R/61M/117T) and quadruple (57L/61M/117T/173F, 57I/58R/61M/117T and 57L/58R/61M/117T) mutant alleles had at least three independent origins in Thailand, Indonesia, and Papua New Guinea/Vanuatu. Conclusion It was found that the P. vivax dhfr coding region and its flanking intergenic regions are highly polymorphic and that mutations in P. vivax dhfr that confer antifolate resistance have arisen several times in the Asian region. This contrasts sharply with the selective sweep of rare antifolate resistant alleles observed in the P. falciparum populations in Asia and Africa. The finding of multiple origins of resistance-conferring mutations has important implications for drug policy. PMID:18442404
Multiple origins of resistance-conferring mutations in Plasmodium vivax dihydrofolate reductase.

PubMed

Hawkins, Vivian N; Auliff, Alyson; Prajapati, Surendra Kumar; Rungsihirunrat, Kanchana; Hapuarachchi, Hapuarachchige C; Maestre, Amanda; O'Neil, Michael T; Cheng, Qin; Joshi, Hema; Na-Bangchang, Kesara; Sibley, Carol Hopkins

2008-04-28

In order to maximize the useful therapeutic life of antimalarial drugs, it is crucial to understand the mechanisms by which parasites resistant to antimalarial drugs are selected and spread in natural populations. Recent work has demonstrated that pyrimethamine-resistance conferring mutations in Plasmodium falciparum dihydrofolate reductase (dhfr) have arisen rarely de novo, but spread widely in Asia and Africa. The origin and spread of mutations in Plasmodium vivax dhfr were assessed by constructing haplotypes based on sequencing dhfr and its flanking regions. The P. vivax dhfr coding region, 792 bp upstream and 683 bp downstream were amplified and sequenced from 137 contemporary patient isolates from Colombia, India, Indonesia, Papua New Guinea, Sri Lanka, Thailand, and Vanuatu. A repeat motif located 2.6 kb upstream of dhfr was also sequenced from 75 of 137 patient isolates, and mutational relationships among the haplotypes were visualized using the programme Network. Synonymous and non-synonymous single nucleotide polymorphisms (SNPs) within the dhfr coding region were identified, as was the well-documented in-frame insertion/deletion (indel). SNPs were also identified upstream and downstream of dhfr, with an indel and a highly polymorphic repeat region identified upstream of dhfr. The regions flanking dhfr were highly variable. The double mutant (58R/117N) dhfr allele has evolved from several origins, because the 58R is encoded by at least 3 different codons. The triple (58R/61M/117T) and quadruple (57L/61M/117T/173F, 57I/58R/61M/117T and 57L/58R/61M/117T) mutant alleles had at least three independent origins in Thailand, Indonesia, and Papua New Guinea/Vanuatu. It was found that the P. vivax dhfr coding region and its flanking intergenic regions are highly polymorphic and that mutations in P. vivax dhfr that confer antifolate resistance have arisen several times in the Asian region. This contrasts sharply with the selective sweep of rare antifolate resistant alleles observed in the P. falciparum populations in Asia and Africa. The finding of multiple origins of resistance-conferring mutations has important implications for drug policy.
Differential contribution of genomic regions to marked genetic variation and prediction of quantitative traits in broiler chickens.

PubMed

Abdollahi-Arpanahi, Rostam; Morota, Gota; Valente, Bruno D; Kranis, Andreas; Rosa, Guilherme J M; Gianola, Daniel

2016-02-03

Genome-wide association studies in humans have found enrichment of trait-associated single nucleotide polymorphisms (SNPs) in coding regions of the genome and depletion of these in intergenic regions. However, a recent release of the ENCyclopedia of DNA elements showed that ~80 % of the human genome has a biochemical function. Similar studies on the chicken genome are lacking, thus assessing the relative contribution of its genic and non-genic regions to variation is relevant for biological studies and genetic improvement of chicken populations. A dataset including 1351 birds that were genotyped with the 600K Affymetrix platform was used. We partitioned SNPs according to genome annotation data into six classes to characterize the relative contribution of genic and non-genic regions to genetic variation as well as their predictive power using all available quality-filtered SNPs. Target traits were body weight, ultrasound measurement of breast muscle and hen house egg production in broiler chickens. Six genomic regions were considered: intergenic regions, introns, missense, synonymous, 5' and 3' untranslated regions, and regions that are located 5 kb upstream and downstream of coding genes. Genomic relationship matrices were constructed for each genomic region and fitted in the models, separately or simultaneously. Kernel-based ridge regression was used to estimate variance components and assess predictive ability. Contribution of each class of genomic regions to dominance variance was also considered. Variance component estimates indicated that all genomic regions contributed to marked additive genetic variation and that the class of synonymous regions tended to have the greatest contribution. The marked dominance genetic variation explained by each class of genomic regions was similar and negligible (~0.05). In terms of prediction mean-square error, the whole-genome approach showed the best predictive ability. All genic and non-genic regions contributed to phenotypic variation for the three traits studied. Overall, the contribution of additive genetic variance to the total genetic variance was much greater than that of dominance variance. Our results show that all genomic regions are important for the prediction of the targeted traits, and the whole-genome approach was reaffirmed as the best tool for genome-enabled prediction of quantitative traits.
SNiPlay: a web-based tool for detection, management and analysis of SNPs. Application to grapevine diversity projects.

PubMed

Dereeper, Alexis; Nicolas, Stéphane; Le Cunff, Loïc; Bacilieri, Roberto; Doligez, Agnès; Peros, Jean-Pierre; Ruiz, Manuel; This, Patrice

2011-05-05

High-throughput re-sequencing, new genotyping technologies and the availability of reference genomes allow the extensive characterization of Single Nucleotide Polymorphisms (SNPs) and insertion/deletion events (indels) in many plant species. The rapidly increasing amount of re-sequencing and genotyping data generated by large-scale genetic diversity projects requires the development of integrated bioinformatics tools able to efficiently manage, analyze, and combine these genetic data with genome structure and external data. In this context, we developed SNiPlay, a flexible, user-friendly and integrative web-based tool dedicated to polymorphism discovery and analysis. It integrates:1) a pipeline, freely accessible through the internet, combining existing softwares with new tools to detect SNPs and to compute different types of statistical indices and graphical layouts for SNP data. From standard sequence alignments, genotyping data or Sanger sequencing traces given as input, SNiPlay detects SNPs and indels events and outputs submission files for the design of Illumina's SNP chips. Subsequently, it sends sequences and genotyping data into a series of modules in charge of various processes: physical mapping to a reference genome, annotation (genomic position, intron/exon location, synonymous/non-synonymous substitutions), SNP frequency determination in user-defined groups, haplotype reconstruction and network, linkage disequilibrium evaluation, and diversity analysis (Pi, Watterson's Theta, Tajima's D).Furthermore, the pipeline allows the use of external data (such as phenotype, geographic origin, taxa, stratification) to define groups and compare statistical indices.2) a database storing polymorphisms, genotyping data and grapevine sequences released by public and private projects. It allows the user to retrieve SNPs using various filters (such as genomic position, missing data, polymorphism type, allele frequency), to compare SNP patterns between populations, and to export genotyping data or sequences in various formats. Our experiments on grapevine genetic projects showed that SNiPlay allows geneticists to rapidly obtain advanced results in several key research areas of plant genetic diversity. Both the management and treatment of large amounts of SNP data are rendered considerably easier for end-users through automation and integration. Current developments are taking into account new advances in high-throughput technologies.SNiPlay is available at: http://sniplay.cirad.fr/.
Plasmodium vivax rhomboid-like protease 1 gene diversity in Thailand.

PubMed

Mataradchakul, Touchchapol; Uthaipibull, Chairat; Nosten, Francois; Vega-Rodriguez, Joel; Jacobs-Lorena, Marcelo; Lek-Uthai, Usa

2017-10-01

Plasmodium vivax infection remains a major public health problem, especially along the Thailand border regions. We examined the genetic diversity of this parasite by analyzing single-nucleotide polymorphisms (SNPs) of the P. vivax rhomboid-like protease 1 gene (Pvrom1) in parasites collected from western (Tak province, Thai-Myanmar border) and eastern (Chanthaburi province, Thai-Cambodia border) regions. Data were collected by a cross-sectional survey, consisting of 47 and 45 P. vivax-infected filter paper-spotted blood samples from the western and eastern regions of Thailand, respectively during September 2013 to May 2014. Extracted DNA was examined for presence of P. vivax using Plasmodium species-specific nested PCR. Pvrom1 gene was PCR amplified, sequenced and the SNP diversity was analyzed using F-STAT, DnaSP, MEGA and LIAN programs. Comparison of sequences of the 92 Pvrom1 831-base open reading frames with that of a reference sequence (GenBank acc. no. XM001615211) revealed 17 samples with a total of 8 polymorphic sites, consisting of singleton (exon 3, nt 645) and parsimony informative (exon 1, nt 22 and 39; exon 3, nt 336, 537 and 656; and exon 4, nt 719 and 748) sites, which resulted in six different deduced Pvrom1 variants. Non-synonymous to synonymous substitutions ratio estimated by the DnaSP program was 1.65 indicating positive selection, but the Z-tests of selection showed no significant deviations from neutrality for Pvrom1 samples from western region of Thailand. In addition McDonald Kreitman test (MK) showed not significant, and Fst values are not different between the two regions and the regions combined. Interestingly, only Pvrom1 exon 2 was the most conserved sequences among the four exons. The relatively high degree of Pvrom1 polymorphism suggests that the protein is important for parasite survival in face of changes in both insect vector and human populations. These polymorphisms could serve as a sensitive marker for studying plasmodial genetic diversity. The significance of Pvrom1 conserved exon 2 sequence remains to be investigated. Copyright © 2017 Mahidol University. Published by Elsevier Inc. All rights reserved.
Five new synonyms in Epimedium (Berberidaceae) from China

PubMed Central

Zhang, Yanjun; Dang, Haishan; Li, Shengyu; Li, Jianqiang; Wang, Ying

2015-01-01

Abstract Five new synonyms in Chinese Epimedium are designated in the present paper. Epimedium chlorandrum is treated as a synonym of Epimedium acuminatum; Epimedium rhizomatosum as a synonym of Epimedium membranaceum; Epimedium brachyrrhizum as a synonym of Epimedium leptorrhizum; Epimedium dewuense as a synonym of Epimedium dolichostemon; and Epimedium sagittatum var. oblongifoliolatum as a synonym of Epimedium borealiguizhouense. PMID:25987882
A candidate gene approach to study nematode resistance traits in naturally infected sheep.

PubMed

Wilkie, Hazel; Riggio, Valentina; Matika, Oswald; Nicol, Louise; Watt, Kathryn A; Sinclair, Rona; Sparks, Alexandra M; Nussey, Daniel H; Pemberton, Josephine M; Houston, Ross D; Hopkins, John

2017-08-30

Sheep naturally acquire a degree of resistant immunity to parasitic worm infection through repeated exposure. However, the immune response and clinical outcome vary greatly between animals. Genetic polymorphisms in genes integral to differential T helper cell polarization may contribute to variation in host response and disease outcome. A total of twelve single nucleotide polymorphisms (SNPs) were sequenced in IL23R, RORC2 and TBX21 from genomic DNA of Scottish Blackface lambs. Of the twelve SNPs, six were non-synonymous (missense), four were within the 3' UTRs and two were intronic. The association between nine of these SNPs and the traits of body weight, faecal egg count (FEC) and relative T. circumcincta L3-specific IgA antibody levels was assessed in a population of domestic Scottish Blackface ewe lambs and a population of free-living Soay ewe lambs both naturally infected with a mixture of nematodes. There were no significant associations identified between any of the SNPs and phenotypes recorded in either of the populations after adjustment for multiple testing (Bonferroni corrected P value≤0.002). In the Blackface lambs, there was a nominally significant association (P=0.007) between IL23R p.V324M and weight at 20 weeks. This association may be worthy of further investigation in a larger sample of sheep. Copyright © 2017. Published by Elsevier B.V.
Impact of SNPs on Protein Phosphorylation Status in Rice (Oryza sativa L.).

PubMed

Lin, Shoukai; Chen, Lijuan; Tao, Huan; Huang, Jian; Xu, Chaoqun; Li, Lin; Ma, Shiwei; Tian, Tian; Liu, Wei; Xue, Lichun; Ai, Yufang; He, Huaqin

2016-11-11

Single nucleotide polymorphisms (SNPs) are widely used in functional genomics and genetics research work. The high-quality sequence of rice genome has provided a genome-wide SNP and proteome resource. However, the impact of SNPs on protein phosphorylation status in rice is not fully understood. In this paper, we firstly updated rice SNP resource based on the new rice genome Ver. 7.0, then systematically analyzed the potential impact of Non-synonymous SNPs (nsSNPs) on the protein phosphorylation status. There were 3,897,312 SNPs in Ver. 7.0 rice genome, among which 9.9% was nsSNPs. Whilst, a total 2,508,261 phosphorylated sites were predicted in rice proteome. Interestingly, we observed that 150,197 (39.1%) nsSNPs could influence protein phosphorylation status, among which 52.2% might induce changes of protein kinase (PK) types for adjacent phosphorylation sites. We constructed a database, SNP_rice, to deposit the updated rice SNP resource and phosSNPs information. It was freely available to academic researchers at http://bioinformatics.fafu.edu.cn. As a case study, we detected five nsSNPs that potentially influenced heterotrimeric G proteins phosphorylation status in rice, indicating that genetic polymorphisms showed impact on the signal transduction by influencing the phosphorylation status of heterotrimeric G proteins. The results in this work could be a useful resource for future experimental identification and provide interesting information for better rice breeding.
Molecular basis for resistance to ACCase-inhibiting fluazifop in Eleusine indica from Malaysia.

PubMed

Cha, Thye San; Najihah, Mohamed Ghazani; Sahid, Ismail Bin; Chuah, Tse Seng

2014-05-01

Eleusine indica (goosegrass) populations resistant to fluazifop, an acetyl-CoA carboxylase (ACCase: EC6.4.1.2)-inhibiting herbicide, were found in several states in Malaysia. Dose-response assay indicated a resistance factor of 87.5, 62.5 and 150 for biotypes P2, P3 and P4, respectively. DNA sequencing and allele-specific PCR revealed that both biotypes P2 and P3 exhibit a single non-synonymous point mutation from TGG to TGC that leads to a well known Trp-2027-Cys mutation. Interestingly, the highly resistant biotype, P4, did not contain any of the known mutation except the newly discovered target point Asn-2097-Asp, which resulted from a nucleotide change in the codon AAT to GAT. ACCase gene expression was found differentially regulated in the susceptible biotype (P1) and highly resistant biotype P4 from 24 to 72h after treatment (HAT) when being treated with the recommended field rate (198gha(-1)) of fluazifop. However, the small and erratic differences of ACCase gene expression between biotype P1 and P4 does not support the 150-fold resistance in biotype P4. Therefore, the involvement of the target point Asn-2097-Asp and other non-target-site-based resistance mechanisms in the biotype P4 could not be ruled out. Copyright © 2014 Elsevier Inc. All rights reserved.
Association between FTO polymorphism in exon 3 with carcass and meat quality traits in crossbred ducks.

PubMed

Gan, W; Song, Q; Zhang, N N; Xiong, X P; Wang, D M C; Li, L

2015-06-18

The fat mass and obesity-associated gene (FTO) is an excellent candidate gene that affects energy metabolism. Single nucleotide polymorphisms (SNPs) in FTO are associated with carcass and meat quality traits in pigs, cattle, and rabbits. The aim of this study was to investigate the association between novel SNPs in the FTO coding region and carcass and meat quality traits in 95 crossbred ducks, using DNA sequencing. We found two transitions G/A (SNP 387 and 473) within exon 3. SNP 387 was a synonymous mutation, whereas SNP 473 was a missense mutation. Association analysis suggested that SNP g.387G>A was significantly associated with all of the carcass traits measured, the intramuscular fat content (IMF), cooking yield (CY), pH values 45 min after slaughter (pH45m), drip losses from the breast muscle, and the leg muscle (P < 0.05). For SNP g.473G>A, the genotype AA exhibited greater leg muscle weight than the genotypes GG or AG (P < 0.05). The D value suggested that the two SNPs exhibited strong linkage disequilibrium. Three haplotypes (G1G2, G1A2, and A1A2) were significantly associated with IMF, CY, the a* value, and all of the carcass traits measured (P < 0.05). The results suggest that FTO is a candidate locus that affects carcass and meat quality traits in ducks.
Computational screening of disease-associated mutations in OCA2 gene.

PubMed

Kamaraj, Balu; Purohit, Rituraj

2014-01-01

Oculocutaneous albinism type 2 (OCA2), caused by mutations of OCA2 gene, is an autosomal recessive disorder characterized by reduced biosynthesis of melanin pigment in the skin, hair, and eyes. The OCA2 gene encodes instructions for making a protein called the P protein. This protein plays a crucial role in melanosome biogenesis, and controls the eumelanin content in melanocytes in part via the processing and trafficking of tyrosinase which is the rate-limiting enzyme in melanin synthesis. In this study we analyzed the pathogenic effect of 95 non-synonymous single nucleotide polymorphisms reported in OCA2 gene using computational methods. We found R305W mutation as most deleterious and disease associated using SIFT, PolyPhen, PANTHER, PhD-SNP, Pmut, and MutPred tools. To understand the atomic arrangement in 3D space, the native and mutant (R305W) structures were modeled. Molecular dynamics simulation was conducted to observe the structural significance of computationally prioritized disease-associated mutation (R305W). Root-mean-square deviation, root-mean-square fluctuation, radius of gyration, solvent accessibility surface area, hydrogen bond (NH bond), trace of covariance matrix, eigenvector projection analysis, and density analysis results showed prominent loss of stability and rise in mutant flexibility values in 3D space. This study presents a well designed computational methodology to examine the albinism-associated SNPs.
Coding variants in NOD-like receptors: An association study on risk and survival of colorectal cancer.

PubMed

Huhn, Stefanie; da Silva Filho, Miguel I; Sanmuganantham, Tharmila; Pichulik, Tica; Catalano, Calogerina; Pardini, Barbara; Naccarati, Alessio; Polakova-Vymetálkova, Veronika; Jiraskova, Katerina; Vodickova, Ludmila; Vodicka, Pavel; Löffler, Markus W; Courth, Lioba; Wehkamp, Jan; Din, Farhat V N; Timofeeva, Maria; Farrington, Susan M; Jansen, Lina; Hemminki, Kari; Chang-Claude, Jenny; Brenner, Hermann; Hoffmeister, Michael; Dunlop, Malcolm G; Weber, Alexander N R; Försti, Asta

2018-01-01

Nod-like receptors (NLRs) are important innate pattern recognition receptors and regulators of inflammation or play a role during development. We systematically analysed 41 non-synonymous single nucleotide polymorphisms (SNPs) in 21 NLR genes in a Czech discovery cohort of sporadic colorectal cancer (CRC) (1237 cases, 787 controls) for their association with CRC risk and survival. Five SNPs were found to be associated with CRC risk and eight with survival at 5% significance level. In a replication analysis using data of two large genome-wide association studies (GWASs) from Germany (DACHS: 1798 cases and 1810 controls) and Scotland (2210 cases and 9350 controls) the associations found in the Czech discovery set were not confirmed. However, expression analysis in human gut-related tissues and immune cells revealed that the NLRs associated with CRC risk or survival in the discovery set were expressed in primary human colon or rectum cells, CRC tissue and/or cell lines, providing preliminary evidence for a potential involvement of NLRs in general in CRC development and/or progression. Most interesting was the finding that the enigmatic development-related NLRP5 (also known as MATER) was not expressed in normal colon tissue but in colon cancer tissue and cell lines. Future studies may show whether regulatory variants instead of coding variants might affect the expression of NLRs and contribute to CRC risk and survival.

Characterization of Heterobasidion occidentale transcriptomes reveals candidate genes and DNA polymorphisms for virulence variations.

PubMed

Liu, Jun-Jun; Shamoun, Simon Francis; Leal, Isabel; Kowbel, Robert; Sumampong, Grace; Zamany, Arezoo

2018-05-01

Characterization of genes involved in differentiation of pathogen species and isolates with variations of virulence traits provides valuable information to control tree diseases for meeting the challenges of sustainable forest health and phytosanitary trade issues. Lack of genetic knowledge and genomic resources hinders novel gene discovery, molecular mechanism studies and development of diagnostic tools in the management of forest pathogens. Here, we report on transcriptome profiling of Heterobasidion occidentale isolates with contrasting virulence levels. Comparative transcriptomic analysis identified orthologous groups exclusive to H. occidentale and its isolates, revealing biological processes involved in the differentiation of isolates. Further bioinformatics analyses identified an H. occidentale secretome, CYPome and other candidate effectors, from which genes with species- and isolate-specific expression were characterized. A large proportion of differentially expressed genes were revealed to have putative activities as cell wall modification enzymes and transcription factors, suggesting their potential roles in virulence and fungal pathogenesis. Next, large numbers of simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) were detected, including more than 14 000 interisolate non-synonymous SNPs. These polymorphic loci and species/isolate-specific genes may contribute to virulence variations and provide ideal DNA markers for development of diagnostic tools and investigation of genetic diversity. © 2018 The Authors. Microbial Biotechnology published by John Wiley & Sons Ltd and Society for Applied Microbiology.
Next generation semiconductor based-sequencing of a nutrigenetics target gene (GPR120) and association with growth rate in Italian Large White pigs.

PubMed

Fontanesi, Luca; Bertolini, Francesca; Scotti, Emilio; Schiavo, Giuseppina; Colombo, Michela; Trevisi, Paolo; Ribani, Anisa; Buttazzoni, Luca; Russo, Vincenzo; Dall'Olio, Stefania

2015-01-01

The GPR120 gene (also known as FFAR4 or O3FAR1) encodes for a functional omega-3 fatty acid receptor/sensor that mediates potent insulin sensitizing effects by repressing macrophage-induced tissue inflammation. For its functional role, GPR120 could be considered a potential target gene in animal nutrigenetics. In this work we resequenced the porcine GPR120 gene by high throughput Ion Torrent semiconductor sequencing of amplified fragments obtained from 8 DNA pools derived, on the whole, from 153 pigs of different breeds/populations (two Italian Large White pools, Italian Duroc, Italian Landrace, Casertana, Pietrain, Meishan, and wild boars). Three single nucleotide polymorphisms (SNPs), two synonymous substitutions and one in the putative 3'-untranslated region (g.114765469C > T), were identified and their allele frequencies were estimated by sequencing reads count. The g.114765469C > T SNP was also genotyped by PCR-RFLP confirming estimated frequency in Italian Large White pools. Then, this SNP was analyzed in two Italian Large White cohorts using a selective genotyping approach based on extreme and divergent pigs for back fat thickness (BFT) estimated breeding value (EBV) and average daily gain (ADG) EBV. Significant differences of allele and genotype frequencies distribution was observed between the extreme ADG-EBV groups (P < 0.001) whereas this marker was not associated with BFT-EBV.
Effect of polymorphisms in the CSN3 (κ-casein) gene on milk production traits in Chinese Holstein Cattle.

PubMed

Alim, M A; Dong, T; Xie, Y; Wu, X P; Zhang, Yi; Zhang, Shengli; Sun, D X

2014-11-01

This study was designed to evaluate significant associations between single nucleotide polymorphisms (SNPs) and milk composition and milk production traits in Chinese Holstein cows. Six SNPs were identified in the κ-casein gene using pooled DNA sequencing. The identified SNPs were genotyped by Matrix-assisted laser desorption/ionization time of flight mass spectrometry (MALDI-TOF MS) methods from 507 individuals. Out of six, we identified three non-synonymous SNPs (g.10888T>C, g.10924C>A and g.10944A>G) that changed in the protein product. SIFT (Sorting_Intolerant_From_Tolerant) prediction score (0.01) demonstrated that protein changed Isoleucine > Threonine (g.10888T>C) will affect the phenotypes. Significant associations between identified SNPs and three yield traits (milk, protein and fat) and two composition traits (fat and protein percentages) were found whereas it did not reach significance for fat percentage in haplotypes association. Importantly, the significant SNPs in our results showed a large proportion of the phenotypic variation of milk protein yield and concentration. Our results suggest that CSN3 is an important candidate gene that influences milk production traits, and identified polymorphisms and haplotypes could be used as a genetic marker in programs of marker-assisted selection for the genetic improvement of milk production traits in dairy cattle.
Major histocompatibility complex variation in the endangered Przewalski's horse.

PubMed Central

Hedrick, P W; Parker, K M; Miller, E L; Miller, P S

1999-01-01

The major histocompatibility complex (MHC) is a fundamental part of the vertebrate immune system, and the high variability in many MHC genes is thought to play an essential role in recognition of parasites. The Przewalski's horse is extinct in the wild and all the living individuals descend from 13 founders, most of whom were captured around the turn of the century. One of the primary genetic concerns in endangered species is whether they have ample adaptive variation to respond to novel selective factors. In examining 14 Przewalski's horses that are broadly representative of the living animals, we found six different class II DRB major histocompatibility sequences. The sequences showed extensive nonsynonymous variation, concentrated in the putative antigen-binding sites, and little synonymous variation. Individuals had from two to four sequences as determined by single-stranded conformation polymorphism (SSCP) analysis. On the basis of the SSCP data, phylogenetic analysis of the nucleotide sequences, and segregation in a family group, we conclude that four of these sequences are from one gene (although one sequence codes for a nonfunctional allele because it contains a stop codon) and two other sequences are from another gene. The position of the stop codon is at the same amino-acid position as in a closely related sequence from the domestic horse. Because other organisms have extensive variation at homologous loci, the Przewalski's horse may have quite low variation in this important adaptive region. PMID:10430594
Epidemiologic Consequences of Microvariation in Mycobacterium tuberculosis

PubMed Central

Mathema, Barun; Kurepina, Natalia; Yang, Guibin; Shashkina, Elena; Manca, Claudia; Mehaffy, Carolina; Bielefeldt-Ohmann, Helle; Ahuja, Shama; Fallows, Dorothy A.; Izzo, Angelo; Bifani, Pablo; Dobos, Karen; Kaplan, Gilla

2012-01-01

Background. Evidence from genotype-phenotype studies suggests that genetic diversity in pathogens have clinically relevant manifestations that can impact outcome of infection and epidemiologic success. We studied 5 closely related Mycobacterium tuberculosis strains that collectively caused extensive disease (n = 862), particularly among US-born tuberculosis patients. Methods. Representative isolates were selected using population-based genotyping data from New York City and New Jersey. Growth and cytokine/chemokine response were measured in infected human monocytes. Survival was determined in aerosol-infected guinea pigs. Results. Multiple genotyping methods and phylogenetically informative synonymous single nucleotide polymorphisms showed that all strains were related by descent. In axenic culture, all strains grew similarly. However, infection of monocytes revealed 2 growth phenotypes, slower (doubling ∼55 hours) and faster (∼25 hours). The faster growing strains elicited more tumor necrosis factor α and interleukin 1β than the slower growing strains, even after heat killing, and caused accelerated death of infected guinea pigs (∼9 weeks vs 24 weeks) associated with increased lung inflammation/pathology. Epidemiologically, the faster growing strains were associated with human immunodeficiency virus and more limited in spread, possibly related to their inherent ability to induce a strong protective innate immune response in immune competent hosts. Conclusions. Natural variation, with detectable phenotypic changes, among closely related clinical isolates of M. tuberculosis may alter epidemiologic patterns in human populations. PMID:22315279
Reconstruction of the ancestral plastid genome in Geraniaceae reveals a correlation between genome rearrangements, repeats, and nucleotide substitution rates.

PubMed

Weng, Mao-Lun; Blazier, John C; Govindu, Madhumita; Jansen, Robert K

2014-03-01

Geraniaceae plastid genomes are highly rearranged, and each of the four genera already sequenced in the family has a distinct genome organization. This study reports plastid genome sequences of six additional species, Francoa sonchifolia, Melianthus villosus, and Viviania marifolia from Geraniales, and Pelargonium alternans, California macrophylla, and Hypseocharis bilobata from Geraniaceae. These genome sequences, combined with previously published species, provide sufficient taxon sampling to reconstruct the ancestral plastid genome organization of Geraniaceae and the rearrangements unique to each genus. The ancestral plastid genome of Geraniaceae has a 4 kb inversion and a reduced, Pelargonium-like small single copy region. Our ancestral genome reconstruction suggests that a few minor rearrangements occurred in the stem branch of Geraniaceae followed by independent rearrangements in each genus. The genomic comparison demonstrates that a series of inverted repeat boundary shifts and inversions played a major role in shaping genome organization in the family. The distribution of repeats is strongly associated with breakpoints in the rearranged genomes, and the proportion and the number of large repeats (>20 bp and >60 bp) are significantly correlated with the degree of genome rearrangements. Increases in the degree of plastid genome rearrangements are correlated with the acceleration in nonsynonymous substitution rates (dN) but not with synonymous substitution rates (dS). Possible mechanisms that might contribute to this correlation, including DNA repair system and selection, are discussed.
Linkage and association studies identify a novel locus for Alzheimer disease at 7q36 in a Dutch population-based sample.

PubMed

Rademakers, Rosa; Cruts, Marc; Sleegers, Kristel; Dermaut, Bart; Theuns, Jessie; Aulchenko, Yurii; Weckx, Stefan; De Pooter, Tim; Van den Broeck, Marleen; Corsmit, Ellen; De Rijk, Peter; Del-Favero, Jurgen; van Swieten, John; van Duijn, Cornelia M; Van Broeckhoven, Christine

2005-10-01

We obtained conclusive linkage of Alzheimer disease (AD) with a candidate region of 19.7 cM at 7q36 in an extended multiplex family, family 1270, ascertained in a population-based study of early-onset AD in the northern Netherlands. Single-nucleotide polymorphism and haplotype association analyses of a Dutch patient-control sample further supported the linkage at 7q36. In addition, we identified a shared haplotype at 7q36 between family 1270 and three of six multiplex AD-affected families from the same geographical region, which is indicative of a founder effect and defines a priority region of 9.3 cM. Mutation analysis of coding exons of 29 candidate genes identified one linked synonymous mutation, g.38030G-->C in exon 10, that affected codon 626 of the PAX transactivation domain interacting protein gene (PAXIP1). It remains to be determined whether PAXIP1 has a functional role in the expression of AD in family 1270 or whether another mutation at this locus explains the observed linkage and sharing. Together, our linkage data from the informative family 1270 and the association data in the population-based early-onset AD patient-control sample strongly support the identification of a novel AD locus at 7q36 and re-emphasize the genetic heterogeneity of AD.
Advances in computational approaches for prioritizing driver mutations and significantly mutated genes in cancer genomes.

PubMed

Cheng, Feixiong; Zhao, Junfei; Zhao, Zhongming

2016-07-01

Cancer is often driven by the accumulation of genetic alterations, including single nucleotide variants, small insertions or deletions, gene fusions, copy-number variations, and large chromosomal rearrangements. Recent advances in next-generation sequencing technologies have helped investigators generate massive amounts of cancer genomic data and catalog somatic mutations in both common and rare cancer types. So far, the somatic mutation landscapes and signatures of >10 major cancer types have been reported; however, pinpointing driver mutations and cancer genes from millions of available cancer somatic mutations remains a monumental challenge. To tackle this important task, many methods and computational tools have been developed during the past several years and, thus, a review of its advances is urgently needed. Here, we first summarize the main features of these methods and tools for whole-exome, whole-genome and whole-transcriptome sequencing data. Then, we discuss major challenges like tumor intra-heterogeneity, tumor sample saturation and functionality of synonymous mutations in cancer, all of which may result in false-positive discoveries. Finally, we highlight new directions in studying regulatory roles of noncoding somatic mutations and quantitatively measuring circulating tumor DNA in cancer. This review may help investigators find an appropriate tool for detecting potential driver or actionable mutations in rapidly emerging precision cancer medicine. © The Author 2015. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Genetic variations of VDR/NR1I1 encoding vitamin D receptor in a Japanese population.

PubMed

Ukaji, Maho; Saito, Yoshiro; Fukushima-Uesaka, Hiromi; Maekawa, Keiko; Katori, Noriko; Kaniwa, Nahoko; Yoshida, Teruhiko; Nokihara, Hiroshi; Sekine, Ikuo; Kunitoh, Hideo; Ohe, Yuichiro; Yamamoto, Noboru; Tamura, Tomohide; Saijo, Nagahiro; Sawada, Jun-ichi

2007-12-01

The vitamin D receptor (VDR) is a transcriptional factor responsive to 1alpha,25-dihydroxyvitamin D(3) and lithocholic acid, and induces expression of drug metabolizing enzymes CYP3A4, CYP2B6 and CYP2C9. In this study, the promoter regions, 14 exons (including 6 exon 1's) and their flanking introns of VDR were comprehensively screened for genetic variations in 107 Japanese subjects. Sixty-one genetic variations including 25 novel ones were found: 9 in the 5'-flanking region, 2 in the 5'-untranslated region (UTR), 7 in the coding exons (5 synonymous and 2 nonsynonymous variations), 12 in the 3'-UTR, 19 in the introns between the exon 1's, and 12 in introns 2 to 8. Of these, one novel nonsynonymous variation, 154A>G (Met52Val), was detected with an allele frequency of 0.005. The single nucleotide polymorphisms (SNPs) that increase VDR expression or activity, -29649G>A, 2T>C and 1592((*)308)C>A tagging linked variations in the 3'-UTR, were detected at 0.430, 0.636, and 0.318 allele frequencies, respectively. Another SNP, -26930A>G, with reduced VDR transcription was found at a 0.028 frequency. These findings would be useful for association studies on VDR variations in Japanese.
Choline dehydrogenase polymorphism rs12676 is a functional variation and is associated with changes in human sperm cell function.

PubMed

Johnson, Amy R; Lao, Sai; Wang, Tongwen; Galanko, Joseph A; Zeisel, Steven H

2012-01-01

Approximately 15% of couples are affected by infertility and up to half of these cases arise from male factor infertility. Unidentified genetic aberrations such as chromosomal deletions, translocations and single nucleotide polymorphisms (SNPs) may be the underlying cause of many cases of idiopathic male infertility. Deletion of the choline dehydrogenase (Chdh) gene in mice results in decreased male fertility due to diminished sperm motility; sperm from Chdh(-/-) males have decreased ATP concentrations likely stemming from abnormal sperm mitochondrial morphology and function in these cells. Several SNPs have been identified in the human CHDH gene that may result in altered CHDH enzymatic activity. rs12676 (G233T), a non-synonymous SNP located in the CHDH coding region, is associated with increased susceptibility to dietary choline deficiency and risk of breast cancer. We now report evidence that this SNP is also associated with altered sperm motility patterns and dysmorphic mitochondrial structure in sperm. Sperm produced by men who are GT or TT for rs12676 have 40% and 73% lower ATP concentrations, respectively, in their sperm. rs12676 is associated with decreased CHDH protein in sperm and hepatocytes. A second SNP located in the coding region of IL17BR, rs1025689, is linked to altered sperm motility characteristics and changes in choline metabolite concentrations in sperm.
Choline Dehydrogenase Polymorphism rs12676 Is a Functional Variation and Is Associated with Changes in Human Sperm Cell Function

PubMed Central

Johnson, Amy R.; Lao, Sai; Wang, Tongwen; Galanko, Joseph A.; Zeisel, Steven H.

2012-01-01

Approximately 15% of couples are affected by infertility and up to half of these cases arise from male factor infertility. Unidentified genetic aberrations such as chromosomal deletions, translocations and single nucleotide polymorphisms (SNPs) may be the underlying cause of many cases of idiopathic male infertility. Deletion of the choline dehydrogenase (Chdh) gene in mice results in decreased male fertility due to diminished sperm motility; sperm from Chdh−/− males have decreased ATP concentrations likely stemming from abnormal sperm mitochondrial morphology and function in these cells. Several SNPs have been identified in the human CHDH gene that may result in altered CHDH enzymatic activity. rs12676 (G233T), a non-synonymous SNP located in the CHDH coding region, is associated with increased susceptibility to dietary choline deficiency and risk of breast cancer. We now report evidence that this SNP is also associated with altered sperm motility patterns and dysmorphic mitochondrial structure in sperm. Sperm produced by men who are GT or TT for rs12676 have 40% and 73% lower ATP concentrations, respectively, in their sperm. rs12676 is associated with decreased CHDH protein in sperm and hepatocytes. A second SNP located in the coding region of IL17BR, rs1025689, is linked to altered sperm motility characteristics and changes in choline metabolite concentrations in sperm. PMID:22558321
Single Nucleotide Polymorphisms Predict Symptom Severity of Autism Spectrum Disorder

ERIC Educational Resources Information Center

Jiao, Yun; Chen, Rong; Ke, Xiaoyan; Cheng, Lu; Chu, Kangkang; Lu, Zuhong; Herskovits, Edward H.

2012-01-01

Autism is widely believed to be a heterogeneous disorder; diagnosis is currently based solely on clinical criteria, although genetic, as well as environmental, influences are thought to be prominent factors in the etiology of most forms of autism. Our goal is to determine whether a predictive model based on single-nucleotide polymorphisms (SNPs)…
Single nucleotide polymorphisms in uracil-processing genes, intake of one-carbon nutrients and breast cancer risk

USDA-ARS?s Scientific Manuscript database

Background/Objectives: The misincorporation of uracil into DNA leads to genomic instability. In a previous study, some of us identified four common single nucleotide polymorphisms (SNPs) in uracil-processing genes (rs2029166 and rs7296239 in SMUG1, rs34259 in UNG and rs4775748 in DUT) that were asso...
Single nucleotide polymorphisms in common bean: their discovery and genotyping using a multiplex detection system

USDA-ARS?s Scientific Manuscript database

Single-nucleotide Polymorphism (SNP) markers are by far the most common form of DNA polymorphism in a genome. The objectives of this study were to discover SNPs in common bean comparing sequences from coding and non-coding regions obtained from Genbank and genomic DNA and to compare sequencing resu...
Single nucleotide polymorphisms in specific candidate genes are associated with phenotypic differences in days open for first lactation in Holstein cows

USDA-ARS?s Scientific Manuscript database

Previously, a candidate gene approach identified 51 single nucleotide polymorphisms (SNP) associated with genetic merit for reproductive traits and 26 associated with genetic merit for production in dairy bulls. We evaluated association of the 77 SNPs with days open (DO) for first lactation in a pop...
An integrated genetic linkage map of watermelon and genetic diversity based on single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers

USDA-ARS?s Scientific Manuscript database

Watermelon (Citrullus lanatus var. lanatus) is an important vegetable fruit throughout the world. A high number of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers should provide large coverage of the watermelon genome and high phylogenetic resolution of germplasm acces...
Identification and characterization of single nucleotide polymorphisms (SNPs) in Culex theileri (Diptera: Culicidae).

PubMed

Demirci, Berna; Lee, Yoosook; Lanzaro, Gregory C; Alten, Bulent

2012-05-01

Culex theileri Theobald (Diptera: Culicidae) is one of the most common mosquito species in northeastern Turkey and serves as a vector for various zoonotic diseases including West Nile virus. Although there have been some studies on the ecology of Cx. theileri, very little genetic data has been made available. We successfully sequenced 11 gene fragments from Cx. theileri specimens collected from the northeastern part of Turkey. On average, we found a Single nucleotide polymorphism every 45 bp. Transitions outnumbered transversions, at a ratio of 2:1. This is the first report of genetic polymorphisms in Cx. theileri and Single nucleotide polymorphism discovered from this study can be used to investigate population structure and gene-environmental interactions.
Distinctive features of single nucleotide alterations in induced pluripotent stem cells with different types of DNA repair deficiency disorders

PubMed Central

Okamura, Kohji; Sakaguchi, Hironari; Sakamoto-Abutani, Rie; Nakanishi, Mahito; Nishimura, Ken; Yamazaki-Inoue, Mayu; Ohtaka, Manami; Periasamy, Vaiyapuri Subbarayan; Alshatwi, Ali Abdullah; Higuchi, Akon; Hanaoka, Kazunori; Nakabayashi, Kazuhiko; Takada, Shuji; Hata, Kenichiro; Toyoda, Masashi; Umezawa, Akihiro

2016-01-01

Disease-specific induced pluripotent stem cells (iPSCs) have been used as a model to analyze pathogenesis of disease. In this study, we generated iPSCs derived from a fibroblastic cell line of xeroderma pigmentosum (XP) group A (XPA-iPSCs), a rare autosomal recessive hereditary disease in which patients develop skin cancer in the areas of skin exposed to sunlight. XPA-iPSCs exhibited hypersensitivity to ultraviolet exposure and accumulation of single-nucleotide substitutions when compared with ataxia telangiectasia-derived iPSCs that were established in a previous study. However, XPA-iPSCs did not show any chromosomal instability in vitro, i.e. intact chromosomes were maintained. The results were mutually compensating for examining two major sources of mutations, nucleotide excision repair deficiency and double-strand break repair deficiency. Like XP patients, XPA-iPSCs accumulated single-nucleotide substitutions that are associated with malignant melanoma, a manifestation of XP. These results indicate that XPA-iPSCs may serve a monitoring tool (analogous to the Ames test but using mammalian cells) to measure single-nucleotide alterations, and may be a good model to clarify pathogenesis of XP. In addition, XPA-iPSCs may allow us to facilitate development of drugs that delay genetic alteration and decrease hypersensitivity to ultraviolet for therapeutic applications. PMID:27197874
Sequence polymorphism in an insect RNA virus field population: A snapshot from a single point in space and time reveals stochastic differences among and within individual hosts

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stenger, Drake C., E-mail: drake.stenger@ars.usda.

Population structure of Homalodisca coagulata Virus-1 (HoCV-1) among and within field-collected insects sampled from a single point in space and time was examined. Polymorphism in complete consensus sequences among single-insect isolates was dominated by synonymous substitutions. The mutant spectrum of the C2 helicase region within each single-insect isolate was unique and dominated by nonsynonymous singletons. Bootstrapping was used to correct the within-isolate nonsynonymous:synonymous arithmetic ratio (N:S) for RT-PCR error, yielding an N:S value ~one log-unit greater than that of consensus sequences. Probability of all possible single-base substitutions for the C2 region predicted N:S values within 95% confidence limits of themore » corrected within-isolate N:S when the only constraint imposed was viral polymerase error bias for transitions over transversions. These results indicate that bottlenecks coupled with strong negative/purifying selection drive consensus sequences toward neutral sequence space, and that most polymorphism within single-insect isolates is composed of newly-minted mutations sampled prior to selection. -- Highlights: •Sampling protocol minimized differential selection/history among isolates. •Polymorphism among consensus sequences dominated by negative/purifying selection. •Within-isolate N:S ratio corrected for RT-PCR error by bootstrapping. •Within-isolate mutant spectrum dominated by new mutations yet to undergo selection.« less
A single determinant dominates the rate of yeast protein evolution.

PubMed

Drummond, D Allan; Raval, Alpan; Wilke, Claus O

2006-02-01

A gene's rate of sequence evolution is among the most fundamental evolutionary quantities in common use, but what determines evolutionary rates has remained unclear. Here, we carry out the first combined analysis of seven predictors (gene expression level, dispensability, protein abundance, codon adaptation index, gene length, number of protein-protein interactions, and the gene's centrality in the interaction network) previously reported to have independent influences on protein evolutionary rates. Strikingly, our analysis reveals a single dominant variable linked to the number of translation events which explains 40-fold more variation in evolutionary rate than any other, suggesting that protein evolutionary rate has a single major determinant among the seven predictors. The dominant variable explains nearly half the variation in the rate of synonymous and protein evolution. We show that the two most commonly used methods to disentangle the determinants of evolutionary rate, partial correlation analysis and ordinary multivariate regression, produce misleading or spurious results when applied to noisy biological data. We overcome these difficulties by employing principal component regression, a multivariate regression of evolutionary rate against the principal components of the predictor variables. Our results support the hypothesis that translational selection governs the rate of synonymous and protein sequence evolution in yeast.

High-throughput discovery of rare human nucleotide polymorphisms by Ecotilling

PubMed Central

Till, Bradley J.; Zerr, Troy; Bowers, Elisabeth; Greene, Elizabeth A.; Comai, Luca; Henikoff, Steven

2006-01-01

Human individuals differ from one another at only ∼0.1% of nucleotide positions, but these single nucleotide differences account for most heritable phenotypic variation. Large-scale efforts to discover and genotype human variation have been limited to common polymorphisms. However, these efforts overlook rare nucleotide changes that may contribute to phenotypic diversity and genetic disorders, including cancer. Thus, there is an increasing need for high-throughput methods to robustly detect rare nucleotide differences. Toward this end, we have adapted the mismatch discovery method known as Ecotilling for the discovery of human single nucleotide polymorphisms. To increase throughput and reduce costs, we developed a universal primer strategy and implemented algorithms for automated band detection. Ecotilling was validated by screening 90 human DNA samples for nucleotide changes in 5 gene targets and by comparing results to public resequencing data. To increase throughput for discovery of rare alleles, we pooled samples 8-fold and found Ecotilling to be efficient relative to resequencing, with a false negative rate of 5% and a false discovery rate of 4%. We identified 28 new rare alleles, including some that are predicted to damage protein function. The detection of rare damaging mutations has implications for models of human disease. PMID:16893952
Theory of single-molecule controlled rotation experiments, predictions, tests, and comparison with stalling experiments in F1-ATPase.

PubMed

Volkán-Kacsó, Sándor; Marcus, Rudolph A

2016-10-25

A recently proposed chemomechanical group transfer theory of rotary biomolecular motors is applied to treat single-molecule controlled rotation experiments. In these experiments, single-molecule fluorescence is used to measure the binding and release rate constants of nucleotides by monitoring the occupancy of binding sites. It is shown how missed events of nucleotide binding and release in these experiments can be corrected using theory, with F 1 -ATP synthase as an example. The missed events are significant when the reverse rate is very fast. Using the theory the actual rate constants in the controlled rotation experiments and the corrections are predicted from independent data, including other single-molecule rotation and ensemble biochemical experiments. The effective torsional elastic constant is found to depend on the binding/releasing nucleotide, and it is smaller for ADP than for ATP. There is a good agreement, with no adjustable parameters, between the theoretical and experimental results of controlled rotation experiments and stalling experiments, for the range of angles where the data overlap. This agreement is perhaps all the more surprising because it occurs even though the binding and release of fluorescent nucleotides is monitored at single-site occupancy concentrations, whereas the stalling and free rotation experiments have multiple-site occupancy.
Genetic differentiation of Artyfechinostomum malayanum and A. sufrartyfex (Trematoda: Echinostomatidae) based on internal transcribed spacer sequences.

PubMed

Tantrawatpan, Chairat; Saijuntha, Weerachai; Sithithaworn, Paiboon; Andrews, Ross H; Petney, Trevor N

2013-01-01

Genetic differentiation between two synonymous echinostomes species, Artyfechinostomum malayanum and Artyfechinostomum sufrartyfex was determined by using the first and second internal transcribed spacers (ITS1 and ITS2), the non-coding region of rDNA as genetic makers. Of the 699 bp of combined ITS1 and ITS2 sequences examined, 18 variable nucleotide positions (2.58 %) were observed. Of these, 17 positions could be used as diagnostic position between these two sibling species, whereas the other one variation was intraspecific variation of A. malayanum. A clade of A. malayanum was closely aligned with A. sufrartyfex and clearly distance from the cluster of other echinostomes. Our results may sufficiently suggest that the current synonymy of these species is not valid.
Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

PubMed Central

2010-01-01

Background Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079
Hierarchically Aligning 10 Legume Genomes Establishes a Family-Level Genomics Platform.

PubMed

Wang, Jinpeng; Sun, Pengchuan; Li, Yuxian; Liu, Yinzhe; Yu, Jigao; Ma, Xuelian; Sun, Sangrong; Yang, Nanshan; Xia, Ruiyan; Lei, Tianyu; Liu, Xiaojian; Jiao, Beibei; Xing, Yue; Ge, Weina; Wang, Li; Wang, Zhenyi; Song, Xiaoming; Yuan, Min; Guo, Di; Zhang, Lan; Zhang, Jiaqi; Jin, Dianchuan; Chen, Wei; Pan, Yuxin; Liu, Tao; Jin, Ling; Sun, Jinshuai; Yu, Jiaxiang; Cheng, Rui; Duan, Xueqian; Shen, Shaoqi; Qin, Jun; Zhang, Meng-Chen; Paterson, Andrew H; Wang, Xiyin

2017-05-01

Mainly due to their economic importance, genomes of 10 legumes, including soybean ( Glycine max ), wild peanut ( Arachis duranensis and Arachis ipaensis ), and barrel medic ( Medicago truncatula ), have been sequenced. However, a family-level comparative genomics analysis has been unavailable. With grape ( Vitis vinifera ) and selected legume genomes as outgroups, we managed to perform a hierarchical and event-related alignment of these genomes and deconvoluted layers of homologous regions produced by ancestral polyploidizations or speciations. Consequently, we illustrated genomic fractionation characterized by widespread gene losses after the polyploidizations. Notably, high similarity in gene retention between recently duplicated chromosomes in soybean supported the likely autopolyploidy nature of its tetraploid ancestor. Moreover, although most gene losses were nearly random, largely but not fully described by geometric distribution, we showed that polyploidization contributed divergently to the copy number variation of important gene families. Besides, we showed significantly divergent evolutionary levels among legumes and, by performing synonymous nucleotide substitutions at synonymous sites correction, redated major evolutionary events during their expansion. This effort laid a solid foundation for further genomics exploration in the legume research community and beyond. We describe only a tiny fraction of legume comparative genomics analysis that we performed; more information was stored in the newly constructed Legume Comparative Genomics Research Platform (www.legumegrp.org). © 2017 American Society of Plant Biologists. All Rights Reserved.
Evolution of Circulating Wild Poliovirus and of Vaccine-Derived Poliovirus in an Immunodeficient Patient: a Unifying Model

PubMed Central

Gavrilin, Gene V.; Cherkasova, Elena A.; Lipskaya, Galina Y.; Kew, Olen M.; Agol, Vadim I.

2000-01-01

We determined nucleotide sequences of the VP1 and 2AB genes and portions of the 2C and 3D genes of two evolving poliovirus lineages: circulating wild viruses of T geotype and Sabin vaccine-derived isolates from an immunodeficient patient. Different regions of the viral RNA were found to evolve nonsynchronously, and the rate of evolution of the 2AB region in the vaccine-derived population was not constant throughout its history. Synonymous replacements occurred not completely randomly, suggesting the need for conservation of certain rare codons (possibly to control translation elongation) and the existence of unidentified constraints in the viral RNA structure. Nevertheless the major contribution to the evolution of the two lineages came from linear accumulation of synonymous substitutions. Therefore, in agreement with current theories of viral evolution, we suggest that the majority of the mutations in both lineages were fixed as a result of successive sampling, from the heterogeneous populations, of random portions containing predominantly neutral and possibly adverse mutations. As a result of such a mode of evolution, the virus fitness may be maintained at a more or less constant level or may decrease unless more-fit variants are stochastically generated. The proposed unifying model of natural poliovirus evolution has important implications for the epidemiology of poliomyelitis. PMID:10906191
Complete Genome Sequence of Zucchini Yellow Mosaic Virus Strain Kurdistan, Iran.

PubMed

Maghamnia, Hamid Reza; Hajizadeh, Mohammad; Azizi, Abdolbaset

2018-03-01

The complete genome sequence of Zucchini yellow mosaic virus strain Kurdistan (ZYMV-Kurdistan) infecting squash from Iran was determined from 13 overlapping fragments. Excluding the poly (A) tail, ZYMV-Kurdistan genome consisted of 9593 nucleotides (nt), with 138 and 211 nt at the 5' and 3' non-translated regions, respectively. It contained two open-reading frames (ORFs), the large ORF encoding a polyprotein of 3080 amino acids (aa) and the small overlapping ORF encoding a P3N-PIPO protein of 74 aa. This isolate had six unique aa differences compared to other ZYMV isolates and shared 79.6-98.8% identities with other ZYMV genome sequences at the nt level and 90.1-99% identities at the aa level. A phylogenetic tree of ZYMV complete genomic sequences showed that Iranian and Central European isolates are closely related and form a phylogenetically homogenous group. All values in the ratio of substitution rates at non-synonymous and synonymous sites ( d N / d S ) were below 1, suggestive of strong negative selection forces during ZYMV protein history. This is the first report of complete genome sequence information of the most prevalent virus in the west of Iran. This study helps our understanding of the genetic diversity of ZYMV isolates infecting cucurbit plants in Iran, virus evolution and epidemiology and can assist in designing better diagnostic tools.
The Single Nucleotide Polymorphism Consortium

NASA Technical Reports Server (NTRS)

Morgan, Michael

2003-01-01

I want to discuss both the Single Nucleotide Polymorphism (SNP) Consortium and the Human Genome Project. I am afraid most of my presentation will be thin on law and possibly too high on rhetoric. Having been engaged in a personal and direct way with these issues as a trained scientist, I find it quite difficult to be always as objective as I ought to be.
Analysis of single nucleotide polymorphisms in case-control studies.

PubMed

Li, Yonghong; Shiffman, Dov; Oberbauer, Rainer

2011-01-01

Single nucleotide polymorphisms (SNPs) are the most common type of genetic variants in the human genome. SNPs are known to modify susceptibility to complex diseases. We describe and discuss methods used to identify SNPs associated with disease in case-control studies. An outline on study population selection, sample collection and genotyping platforms is presented, complemented by SNP selection, data preprocessing and analysis.
A lateral flow biosensor for detection of single nucleotide polymorphism by circular strand displacement reaction.

PubMed

Xiao, Zhuo; Lie, Puchang; Fang, Zhiyuan; Yu, Luxin; Chen, Junhua; Liu, Jie; Ge, Chenchen; Zhou, Xuemeng; Zeng, Lingwen

2012-09-04

A lateral flow biosensor for detection of single nucleotide polymorphism based on circular strand displacement reaction (CSDPR) has been developed. Taking advantage of high fidelity of T4 DNA ligase, signal amplification by CSDPR, and the optical properties of gold nanoparticles, this assay has reached a detection limit of 0.01 fM.
A Laboratory Exercise for Genotyping Two Human Single Nucleotide Polymorphisms

ERIC Educational Resources Information Center

Fernando, James; Carlson, Bradley; LeBard, Timothy; McCarthy, Michael; Umali, Finianne; Ashton, Bryce; Rose, Ferrill F., Jr.

2016-01-01

The dramatic decrease in the cost of sequencing a human genome is leading to an era in which a wide range of students will benefit from having an understanding of human genetic variation. Since over 90% of sequence variation between humans is in the form of single nucleotide polymorphisms (SNPs), a laboratory exercise has been devised in order to…
The effects of single nucleotide polymorphisms (SNPs) of calpastatin (CAST) gene on meat tenderness of yak.

USDA-ARS?s Scientific Manuscript database

The association of single nucleotide polymorphisms (SNPs) of calpastatin (CAST) gene with shear force of 2.54 cm steaks from M. longissimus dorsi from Gannan yaks (Bos grunniens, n=181) was studied. Yaks were harvested at 2, 3, and 4 yr of age (n=51, 59, and 71, respectively), and samples of each ya...
Single nucleotide polymorphism analysis reveals heterogeneity within a seedling tree population of a polyembryonic mango cultivar.

PubMed

Winterhagen, Patrick; Wünsche, Jens-Norbert

2016-05-01

Within a polyembryonic mango seedling tree population, the genetic background of individuals should be identical because vigorous plants for cultivation are expected to develop from nucellar embryos representing maternal clones. Due to the fact that the mango cultivar 'Hôi' is assigned to the polyembryonic ecotype, an intra-cultivar variability of ethylene receptor genes was unexpected. Ethylene receptors in plants are conserved, but the number of receptors or receptor isoforms is variable regarding different plant species. However, it is shown here that the ethylene receptor MiETR1 is present in various isoforms within the mango cultivar 'Hôi'. The investigation of single nucleotide polymorphisms revealed that different MiETR1 isoforms can not be discriminated simply by individual single nucleotide exchanges but by the specific arrangement of single nucleotide polymorphisms at certain positions in the exons of MiETR1. Furthermore, an MiETR1 isoform devoid of introns in the genomic sequence was identified. The investigation demonstrates some limitations of high resolution melting and ScreenClust analysis and points out the necessity of sequencing to identify individual isoforms and to determine the variability within the tree population.
Pharmacogenomic prediction of anthracycline-induced cardiotoxicity in children.

PubMed

Visscher, Henk; Ross, Colin J D; Rassekh, S Rod; Barhdadi, Amina; Dubé, Marie-Pierre; Al-Saloos, Hesham; Sandor, George S; Caron, Huib N; van Dalen, Elvira C; Kremer, Leontien C; van der Pal, Helena J; Brown, Andrew M K; Rogers, Paul C; Phillips, Michael S; Rieder, Michael J; Carleton, Bruce C; Hayden, Michael R

2012-05-01

Anthracycline-induced cardiotoxicity (ACT) is a serious adverse drug reaction limiting anthracycline use and causing substantial morbidity and mortality. Our aim was to identify genetic variants associated with ACT in patients treated for childhood cancer. We carried out a study of 2,977 single-nucleotide polymorphisms (SNPs) in 220 key drug biotransformation genes in a discovery cohort of 156 anthracycline-treated children from British Columbia, with replication in a second cohort of 188 children from across Canada and further replication of the top SNP in a third cohort of 96 patients from Amsterdam, the Netherlands. We identified a highly significant association of a synonymous coding variant rs7853758 (L461L) within the SLC28A3 gene with ACT (odds ratio, 0.35; P = 1.8 × 10(-5) for all cohorts combined). Additional associations (P < .01) with risk and protective variants in other genes including SLC28A1 and several adenosine triphosphate-binding cassette transporters (ABCB1, ABCB4, and ABCC1) were present. We further explored combining multiple variants into a single-prediction model together with clinical risk factors and classification of patients into three risk groups. In the high-risk group, 75% of patients were accurately predicted to develop ACT, with 36% developing this within the first year alone, whereas in the low-risk group, 96% of patients were accurately predicted not to develop ACT. We have identified multiple genetic variants in SLC28A3 and other genes associated with ACT. Combined with clinical risk factors, genetic risk profiling might be used to identify high-risk patients who can then be provided with safer treatment options.
Analysis of the mitochondrial genome of cheetahs (Acinonyx jubatus) with neurodegenerative disease.

PubMed

Burger, Pamela A; Steinborn, Ralf; Walzer, Christian; Petit, Thierry; Mueller, Mathias; Schwarzenberger, Franz

2004-08-18

The complete mitochondrial genome of Acinonyx jubatus was sequenced and mitochondrial DNA (mtDNA) regions were screened for polymorphisms as candidates for the cause of a neurodegenerative demyelinating disease affecting captive cheetahs. The mtDNA reference sequences were established on the basis of the complete sequences of two diseased and two nondiseased animals as well as partial sequences of 26 further individuals. The A. jubatus mitochondrial genome is 17,047-bp long and shows a high sequence similarity (91%) to the domestic cat. Based on single nucleotide polymorphisms (SNPs) in the control region (CR) and pedigree information, the 18 myelopathic and 12 non-myelopathic cheetahs included in this study were classified into haplotypes I, II and III. In view of the phenotypic comparability of the neurodegenerative disease observed in cheetahs and human mtDNA-associated diseases, specific coding regions including the tRNAs leucine UUR, lysine, serine UCN, and partial complex I and V sequences were screened. We identified a heteroplasmic and a homoplasmic SNP at codon 507 in the subunit 5 (MTND5) of complex I. The heteroplasmic haplotype I-specific valine to methionine substitution represents a nonconservative amino acid change and was found in 11 myelopathic and eight non-myelopathic cheetahs with levels ranging from 29% to 79%. The homoplasmic conservative amino acid substitution valine to alanine was identified in two myelopathic animals of haplotype II. In addition, a synonymous SNP in the codon 76 of the MTND4L gene was found in the single haplotype III animal. The amino acid exchanges in the MTND5 gene were not associated with the occurrence of neurodegenerative disease in captive cheetahs.
Phylogenetic relationships among Perissodactyla: secretoglobin 1A1 gene duplication and triplication in the Equidae family.

PubMed

Côté, Olivier; Viel, Laurent; Bienzle, Dorothee

2013-12-01

Secretoglobin family 1A member 1 (SCGB 1A1) is a small anti-inflammatory and immunomodulatory protein that is abundantly secreted in airway surface fluids. We recently reported the existence of three distinct SCGB1A1 genes in the domestic horse genome as opposed to the single gene copy consensus present in other mammals. The origin of SCGB1A1 gene triplication and the evolutionary relationship of the three genes amongst Equidae family members are unknown. For this study, SCGB1A1 genomic data were collected from various Equus individuals including E. caballus, E. przewalskii, E. asinus, E. grevyi, and E. quagga. Three SCGB1A1 genes in E. przewalskii, two SCGB1A1 genes in E. asinus, and a single SCGB1A1 gene in E. grevyi and E. quagga were identified. Sequence analysis revealed that the non-synonymous nucleotide substitutions between the different equid genes coded for 17 amino acid changes. Most of these changes localized to the SCGB 1A1 central cavity that binds hydrophobic ligands, suggesting that this area of SCGB 1A1 evolved to accommodate diverse molecular interactions. Three-dimensional modeling of the proteins revealed that the size of the SCGB 1A1 central cavity is larger than that of SCGB 1A1A. Altogether, these findings suggest that evolution of the SCGB1A1 gene may parallel the separation of caballine and non-caballine species amongst Equidae, and may indicate an expansion of function for SCGB1A1 gene products. Copyright © 2013 Elsevier Inc. All rights reserved.
Protein-based forensic identification using genetically variant peptides in human bone.

PubMed

Mason, Katelyn Elizabeth; Anex, Deon; Grey, Todd; Hart, Bradley; Parker, Glendon

2018-04-22

Bone tissue contains organic material that is useful for forensic investigations and may contain preserved endogenous protein that can persist in the environment for extended periods of time over a range of conditions. Single amino acid polymorphisms in these proteins reflect genetic information since they result from non-synonymous single nucleotide polymorphisms (SNPs) in DNA. Detection of genetically variant peptides (GVPs) - those peptides that contain amino acid polymorphisms - in digests of bone proteins allows for the corresponding SNP alleles to be inferred. Resulting genetic profiles can be used to calculate statistical measures of association between a bone sample and an individual. In this study proteomic analysis on rib cortical bone samples from 10 recently deceased individuals demonstrates this concept. A straight-forward acidic demineralization protocol yielded proteins that were digested with trypsin. Tryptic digests were analyzed by liquid chromatography mass spectrometry. A total of 1736 different proteins were identified across all resulting datasets. On average, individual samples contained 454±121 (x¯±σ) proteins. Thirty-five genetically variant peptides were identified from 15 observed proteins. Overall, 134 SNP inferences were made based on proteomically detected GVPs, which were confirmed by sequencing of subject DNA. Inferred individual SNP genetic profiles ranged in random match probability (RMP) from 1/6 to 1/42,472 when calculated with European population frequencies in the 1000 Genomes Project, Phase 3. Similarly, RMPs based on African population frequencies were calculated for each SNP genetic profile and likelihood ratios (LR) were obtained by dividing each European RMP by the corresponding African RMP. Resulting LR values ranged from 1.4 to 825 with a median value of 16. GVP markers offer a basis for the identification of compromised skeletal remains independent of the presence of DNA template. Published by Elsevier B.V.
Harmonising the Microsystem of the Educational Concept "Competence"

ERIC Educational Resources Information Center

Pukelis, Kestutis; Smetona, Antanas

2012-01-01

Various texts related to education policy and education research have recently started using two conjugate concepts "competency" and "competence". These terms are sometimes treated as synonyms of a single concept; however, in other cases they are separated and used as two terms of different educational concepts. English also…
Protected DNA strand displacement for enhanced single nucleotide discrimination in double-stranded DNA.

PubMed

Khodakov, Dmitriy A; Khodakova, Anastasia S; Huang, David M; Linacre, Adrian; Ellis, Amanda V

2015-03-04

Single nucleotide polymorphisms (SNPs) are a prime source of genetic diversity. Discriminating between different SNPs provides an enormous leap towards the better understanding of the uniqueness of biological systems. Here we report on a new approach for SNP discrimination using toehold-mediated DNA strand displacement. The distinctiveness of the approach is based on the combination of both 3- and 4-way branch migration mechanisms, which allows for reliable discrimination of SNPs within double-stranded DNA generated from real-life human mitochondrial DNA samples. Aside from the potential diagnostic value, the current study represents an additional way to control the strand displacement reaction rate without altering other reaction parameters and provides new insights into the influence of single nucleotide substitutions on 3- and 4-way branch migration efficiency and kinetics.
Single nucleotide polymorphism analysis using different colored dye dimer probes

NASA Astrophysics Data System (ADS)

Marmé, Nicole; Friedrich, Achim; Denapaite, Dalia; Hakenbeck, Regine; Knemeyer, Jens-Peter

2006-09-01

Fluorescence quenching by dye dimer formation has been utilized to develop hairpin-structured DNA probes for the detection of a single nucleotide polymorphism (SNP) in the penicillin target gene pbp2x, which is implicated in the penicillin resistance of Streptococcus pneumoniae. We designed two specific DNA probes for the identification of the pbp2x genes from a penicillin susceptible strain R6 and a resistant strain Streptococcus mitis 661 using green-fluorescent tetramethylrhodamine (TMR) and red-fluorescent DY-636, respectively. Hybridization of each of the probes to its respective target DNA sequence opened the DNA hairpin probes, consequently breaking the nonfluorescent dye dimers into fluorescent species. This hybridization of the target with the hairpin probe achieved single nucleotide specific detection at nanomolar concentrations via increased fluorescence.

Intracellular nucleotide and nucleotide sugar contents of cultured CHO cells determined by a fast, sensitive, and high-resolution ion-pair RP-HPLC.

PubMed

Kochanowski, N; Blanchard, F; Cacan, R; Chirat, F; Guedon, E; Marc, A; Goergen, J-L

2006-01-15

Analysis of intracellular nucleotide and nucleotide sugar contents is essential in studying protein glycosylation of mammalian cells. Nucleotides and nucleotide sugars are the donor substrates of glycosyltransferases, and nucleotides are involved in cellular energy metabolism and its regulation. A sensitive and reproducible ion-pair reverse-phase high-performance liquid chromatography (RP-HPLC) method has been developed, allowing the direct and simultaneous detection and quantification of some essential nucleotides and nucleotide sugars. After a perchloric acid extraction, 13 molecules (8 nucleotides and 5 nucleotide sugars) were separated, including activated sugars such as UDP-glucose, UDP-galactose, GDP-mannose, UDP-N-acetylglucosamine, and UDP-N-acetylgalactosamine. To validate the analytical parameters, the reproducibility, linearity of calibration curves, detection limits, and recovery were evaluated for standard mixtures and cell extracts. The developed method is capable of resolving picomolar quantities of nucleotides and nucleotide sugars in a single chromatographic run. The HPLC method was then applied to quantify intracellular levels of nucleotides and nucleotide sugars of Chinese hamster ovary (CHO) cells cultivated in a bioreactor batch process. Evolutions of the titers of nucleotides and nucleotide sugars during the batch process are discussed.
Gender and single nucleotide polymorphisms in MTHFR, BHMT, SPTLC1, CRBP2R, and SCARB1 are significant predictors of plasma homocysteine normalized by RBC folate in healthy adults.

USDA-ARS?s Scientific Manuscript database

Using linear regression models, we studied the main and two-way interaction effects of the predictor variables gender, age, BMI, and 64 folate/vitamin B-12/homocysteine/lipid/cholesterol-related single nucleotide polymorphisms (SNP) on log-transformed plasma homocysteine normalized by red blood cell...
Brief Report: Glutamate Transporter Gene ("SLC1A1") Single Nucleotide Polymorphism (rs301430) and Repetitive Behaviors and Anxiety in Children with Autism Spectrum Disorder

ERIC Educational Resources Information Center

Gadow, Kenneth D.; Roohi, Jasmin; DeVincent, Carla J.; Kirsch, Sarah; Hatchwell, Eli

2010-01-01

Investigated association of single nucleotide polymorphism (SNP) rs301430 in glutamate transporter gene ("SLC1A1") with severity of repetitive behaviors (obsessive-compulsive behaviors, tics) and anxiety in children with autism spectrum disorder (ASD). Mothers and/or teachers completed a validated DSM-IV-referenced rating scale for 67 children…
Effect of increasing the number of single-nucleotide polymorphisms from 60,000 to 85,000 in genomic evaluation of Holsteins

USDA-ARS?s Scientific Manuscript database

The periodic need to restock reagent pools for genotyping chips provides an opportunity to increase the number of single-nucleotide polymorphisms (SNP) on a chip at no increase in cost. A high-density chip with >140,000 SNP has been developed by GeneSeek Inc. (Lincoln, NE) to increase accuracy of ge...
Development of single-nucleotide polymorphism markers for Bromus tectorum (Poaceae) from a partially sequenced transcriptome

Treesearch

Keith R. Merrill; Craig E. Coleman; Susan E. Meyer; Elizabeth A. Leger; Katherine A. Collins

2016-01-01

Premise of the study: Bromus tectorum (Poaceae) is an annual grass species that is invasive in many areas of the world but most especially in the U.S. Intermountain West. Single-nucleotide polymorphism (SNP) markers were developed for use in investigating the geospatial and ecological diversity of B. tectorum in the Intermountain West to better understand the...
A Comprehensive Experiment for Molecular Biology: Determination of Single Nucleotide Polymorphism in Human REV3 Gene Using PCR-RFLP

ERIC Educational Resources Information Center

Zhang, Xu; Shao, Meng; Gao, Lu; Zhao, Yuanyuan; Sun, Zixuan; Zhou, Liping; Yan, Yongmin; Shao, Qixiang; Xu, Wenrong; Qian, Hui

2017-01-01

Laboratory exercise is helpful for medical students to understand the basic principles of molecular biology and to learn about the practical applications of molecular biology. We have designed a lab course on molecular biology about the determination of single nucleotide polymorphism (SNP) in human REV3 gene, the product of which is a subunit of…
Antibiotic Resistance and Single-Nucleotide Polymorphism Cluster Grouping Type in a Multinational Sample of Resistant Mycobacterium tuberculosis Isolates▿

PubMed Central

Brimacombe, M.; Hazbon, M.; Motiwala, A. S.; Alland, D.

2007-01-01

A single-nucleotide polymorphism-based cluster grouping (SCG) classification system for Mycobacterium tuberculosis was used to examine antibiotic resistance type and resistance mutations in relationship to specific evolutionary lineages. Drug resistance and resistance mutations were seen across all SCGs. SCG-2 had higher proportions of katG codon 315 mutations and resistance to four drugs. PMID:17846140
Design and characterization of a nanopore-coupled polymerase for single-molecule DNA sequencing by synthesis on an electrode array

PubMed Central

Stranges, P. Benjamin; Palla, Mirkó; Kalachikov, Sergey; Nivala, Jeff; Dorwart, Michael; Trans, Andrew; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Tao, Chuanjuan; Morozova, Irina; Li, Zengmin; Shi, Shundi; Aberra, Aman; Arnold, Cleoma; Yang, Alexander; Aguirre, Anne; Harada, Eric T.; Korenblum, Daniel; Pollard, James; Bhat, Ashwini; Gremyachinskiy, Dmitriy; Bibillo, Arek; Chen, Roger; Davis, Randy; Russo, James J.; Fuller, Carl W.; Roever, Stefan; Ju, Jingyue; Church, George M.

2016-01-01

Scalable, high-throughput DNA sequencing is a prerequisite for precision medicine and biomedical research. Recently, we presented a nanopore-based sequencing-by-synthesis (Nanopore-SBS) approach, which used a set of nucleotides with polymer tags that allow discrimination of the nucleotides in a biological nanopore. Here, we designed and covalently coupled a DNA polymerase to an α-hemolysin (αHL) heptamer using the SpyCatcher/SpyTag conjugation approach. These porin–polymerase conjugates were inserted into lipid bilayers on a complementary metal oxide semiconductor (CMOS)-based electrode array for high-throughput electrical recording of DNA synthesis. The designed nanopore construct successfully detected the capture of tagged nucleotides complementary to a DNA base on a provided template. We measured over 200 tagged-nucleotide signals for each of the four bases and developed a classification method to uniquely distinguish them from each other and background signals. The probability of falsely identifying a background event as a true capture event was less than 1.2%. In the presence of all four tagged nucleotides, we observed sequential additions in real time during polymerase-catalyzed DNA synthesis. Single-polymerase coupling to a nanopore, in combination with the Nanopore-SBS approach, can provide the foundation for a low-cost, single-molecule, electronic DNA-sequencing platform. PMID:27729524
Heated oligonucleotide ligation assay (HOLA): an affordable single nucleotide polymorphism assay.

PubMed

Black, W C; Gorrochotegui-Escalante, N; Duteau, N M

2006-03-01

Most single nucleotide polymorphism (SNP) detection requires expensive equipment and reagents. The oligonucleotide ligation assay (OLA) is an inexpensive SNP assay that detects ligation between a biotinylated "allele-specific detector" and a 3' fluorescein-labeled "reporter" oligonucleotide. No ligation occurs unless the 3' detector nucleotide is complementary to the SNP nucleotide. The original OLA used chemical denaturation and neutralization. Heated OLA (HOLA) instead uses a thermal stable ligase and cycles of denaturing and hybridization for ligation and SNP detection. The cost per genotype is approximately US$1.25 with two-allele SNPs or approximately US$1.75 with three-allele SNPs. We illustrate the development of HOLA for SNP detection in the Early Trypsin and Abundant Trypsin loci in the mosquito Aedes aegypti (L.) and at the a-glycerophosphate dehydrogenase locus in the mosquito Anopheles gambiae s.s.
Identifying predictors of time-inhomogeneous viral evolutionary processes.

PubMed

Bielejec, Filip; Baele, Guy; Rodrigo, Allen G; Suchard, Marc A; Lemey, Philippe

2016-07-01

Various factors determine the rate at which mutations are generated and fixed in viral genomes. Viral evolutionary rates may vary over the course of a single persistent infection and can reflect changes in replication rates and selective dynamics. Dedicated statistical inference approaches are required to understand how the complex interplay of these processes shapes the genetic diversity and divergence in viral populations. Although evolutionary models accommodating a high degree of complexity can now be formalized, adequately informing these models by potentially sparse data, and assessing the association of the resulting estimates with external predictors, remains a major challenge. In this article, we present a novel Bayesian evolutionary inference method, which integrates multiple potential predictors and tests their association with variation in the absolute rates of synonymous and non-synonymous substitutions along the evolutionary history. We consider clinical and virological measures as predictors, but also changes in population size trajectories that are simultaneously inferred using coalescent modelling. We demonstrate the potential of our method in an application to within-host HIV-1 sequence data sampled throughout the infection of multiple patients. While analyses of individual patient populations lack statistical power, we detect significant evidence for an abrupt drop in non-synonymous rates in late stage infection and a more gradual increase in synonymous rates over the course of infection in a joint analysis across all patients. The former is predicted by the immune relaxation hypothesis while the latter may be in line with increasing replicative fitness during the asymptomatic stage.
Nonsynonymous substitution in abalone sperm fertilization genes exceeds substitution in introns and mitochondrial DNA

PubMed Central

Metz, Edward C.; Robles-Sikisaka, Refugio; Vacquier, Victor D.

1998-01-01

Strong positive Darwinian selection acts on two sperm fertilization proteins, lysin and 18-kDa protein, from abalone (Haliotis). To understand the phylogenetic context for this dramatic molecular evolution, we obtained sequences of mitochondrial cytochrome c oxidase subunit I (mtCOI), and genomic sequences of lysin, 18-kDa, and a G protein subunit. Based on mtDNA differentiation, four north Pacific abalone species diverged within the past 2 million years (Myr), and remaining north Pacific species diverged over a period of 4–20 Myr. Between-species nonsynonymous differences in lysin and 18-kDa exons exceed nucleotide differences in introns by 3.5- to 24-fold. Remarkably, in some comparisons nonsynonymous substitutions in lysin and 18-kDa genes exceed synonymous substitutions in mtCOI. Lysin and 18-kDa intron/exon segments were sequenced from multiple red abalone individuals collected over a 1,200-km range. Only two nucleotide changes and two sites of slippage variation were detected in a total of >29,000 nucleotides surveyed. However, polymorphism in mtCOI and a G protein intron was found in this species. This finding suggests that positive selection swept one lysin allele and one 18-kDa allele to fixation. Similarities between mtCOI and lysin gene trees indicate that rapid adaptive evolution of lysin has occurred consistently through the history of the group. Comparisons with mtCOI molecular clock calibrations suggest that nonsynonymous substitutions accumulate 2–50 times faster in lysin and 18-kDa genes than in rapidly evolving mammalian genes. PMID:9724763
Receptor-like genes in the major resistance locus of lettuce are subject to divergent selection.

PubMed Central

Meyers, B C; Shen, K A; Rohani, P; Gaut, B S; Michelmore, R W

1998-01-01

Disease resistance genes in plants are often found in complex multigene families. The largest known cluster of disease resistance specificities in lettuce contains the RGC2 family of genes. We compared the sequences of nine full-length genomic copies of RGC2 representing the diversity in the cluster to determine the structure of genes within this family and to examine the evolution of its members. The transcribed regions range from at least 7.0 to 13.1 kb, and the cDNAs contain deduced open reading frames of approximately 5. 5 kb. The predicted RGC2 proteins contain a nucleotide binding site and irregular leucine-rich repeats (LRRs) that are characteristic of resistance genes cloned from other species. Unique features of the RGC2 gene products include a bipartite LRR region with >40 repeats. At least eight members of this family are transcribed. The level of sequence diversity between family members varied in different regions of the gene. The ratio of nonsynonymous (Ka) to synonymous (Ks) nucleotide substitutions was lowest in the region encoding the nucleotide binding site, which is the presumed effector domain of the protein. The LRR-encoding region showed an alternating pattern of conservation and hypervariability. This alternating pattern of variation was also found in all comparisons within families of resistance genes cloned from other species. The Ka /Ks ratios indicate that diversifying selection has resulted in increased variation at these codons. The patterns of variation support the predicted structure of LRR regions with solvent-exposed hypervariable residues that are potentially involved in binding pathogen-derived ligands. PMID:9811792
Detection of Strand Cleavage And Oxidation Damage Using Model DNA Molecules Captured in a Nanoscale Pore

NASA Technical Reports Server (NTRS)

Vercoutere, W.; Solbrig, A.; DeGuzman, V.; Deamer, D.; Akeson, M.

2003-01-01

We use a biological nano-scale pore to distinguish among individual DNA hairpins that differ by a single site of oxidation or a nick in the sugar-phosphate backbone. In earlier work we showed that the protein ion channel alpha-hemolysin can be used as a detector to distinguish single-stranded from double-stranded DNA, single base pair and single nucleotide differences. This resolution is in part a result of sensitivity to structural changes that influence the molecular dynamics of nucleotides within DNA. The strand cleavage products we examined here included a 5-base-pair (5-bp) hairpin with a 5-prime five-nucleotide overhang, and a complementary five-nucleotide oligomer. These produced predictable shoulder-spike and rapid near-full blockade signatures, respectively. When combined, strand annealing was monitored in real time. The residual current level dropped to a lower discrete level in the shoulder-spike blockade signatures, and the duration lengthened. However, these blockade signatures had a shorter duration than the unmodified l0bp hairpin. To test the pore sensitivity to nucleotide oxidation, we examined a 9-bp hairpin with a terminal 8-oxo-deoxyguanosine (8-oxo-dG), or a penultimate 8-oxo-dG. Each produced blockade signatures that differed from the otherwise identical control 9bp hairpins. This study showed that DNA structure is modified sufficiently by strand cleavage or oxidation damage at a single site to alter in a predictable manner the ionic current blockade signatures produced. This technique improves the ability to assess damage to DNA, and can provide a simple means to help characterize the risks of radiation exposure. It may also provide a method to test radiation protection.
Identification of rs7350481 at chromosome 11q23.3 as a novel susceptibility locus for metabolic syndrome in Japanese individuals by an exome-wide association study.

PubMed

Yamada, Yoshiji; Sakuma, Jun; Takeuchi, Ichiro; Yasukochi, Yoshiki; Kato, Kimihiko; Oguri, Mitsutoshi; Fujimaki, Tetsuo; Horibe, Hideki; Muramatsu, Masaaki; Sawabe, Motoji; Fujiwara, Yoshinori; Taniguchi, Yu; Obuchi, Shuichi; Kawai, Hisashi; Shinkai, Shoji; Mori, Seijiro; Arai, Tomio; Tanaka, Masashi

2017-06-13

We have performed exome-wide association studies to identify genetic variants that influence body mass index or confer susceptibility to obesity or metabolic syndrome in Japanese. The exome-wide association study for body mass index included 12,890 subjects, and those for obesity and metabolic syndrome included 12,968 subjects (3954 individuals with obesity, 9014 controls) and 6817 subjects (3998 individuals with MetS, 2819 controls), respectively. Exome-wide association studies were performed with Illumina HumanExome-12 DNA Analysis BeadChip or Infinium Exome-24 BeadChip arrays. The relation of genotypes of single nucleotide polymorphisms to body mass index was examined by linear regression analysis, and that of allele frequencies of single nucleotide polymorphisms to obesity or metabolic syndrome was evaluated with Fisher's exact test. The exome-wide association studies identified six, 11, and 40 single nucleotide polymorphisms as being significantly associated with body mass index, obesity (P <1.21 × 10-6), or metabolic syndrome (P <1.20 × 10-6), respectively. Subsequent multivariable logistic regression analysis with adjustment for age and sex revealed that three and five single nucleotide polymorphisms were related (P < 0.05) to obesity or metabolic syndrome, respectively, with one of these latter polymorphisms-rs7350481 (C/T) at chromosome 11q23.3-also being significantly (P < 3.13 × 10-4) associated with metabolic syndrome. The polymorphism rs7350481 may thus be a novel susceptibility locus for metabolic syndrome in Japanese. In addition, single nucleotide polymorphisms in three genes (CROT, TSC1, RIN3) and at four loci (ANKK1, ZNF804B, CSRNP3, 17p11.2) were implicated as candidate determinants of obesity and metabolic syndrome, respectively.
Meta-analysis of the relationship between single nucleotide polymorphism of IL-10-1082G/A and rheumatic heart disease.

PubMed

Dai, Weiran; Ye, Ziliang; Lu, Haili; Su, Qiang; Li, Hui; Li, Lang

2018-02-23

The results showed that there was a certain correlation between the single nucleotide polymorphism of IL-10-1082G/A and rheumatic heart disease, but there was no systematic study to verify this conclusion. Systematic review of the association between single nucleotide polymorphism of IL-10-1082G/A locus and rheumatic heart disease. Computer retrieval PubMed, EMbase, Cochrane Library, CBM, CNKI, VIP and Data WanFang, the retrieval time limit from inception to June 2017. A case control study of single nucleotide polymorphisms and rheumatic heart disease in patients with rheumatic heart disease in the IL-10-1082G/A was collected. Two researchers independently screened the literature, extracted data and evaluated the risk of bias in the study, and using RevMan5.3 software for data analysis. A total of 3 case control studies were included, including 318 patients with rheumatic heart disease and 502 controls. Meta-analysis showed that there was no correlation between IL-10-1082G/A gene polymorphism and rheumatic heart disease [AA+AG VS GG: OR = 0.62, 95% CI (0.28, 1.39), P = 0.25; AA VS AG+GG: OR = 0.73, 95% CI (0.54, 1.00), P = 0.05; AA VS GG: OR = 0.70, 95% CI(0.47, 1.05), P = 0.08; AG VS GG: OR = 0.65, 95% CI (0.22, 1.92), P = 0.43; A VS G: OR = 0.87, 95% CI (0.71, 1.06), P = 0.17]. When AA is a recessive gene, the single nucleotide polymorphism of IL-10-1082G/A is associated with the presence of rheumatic heart disease. Due to the limitations of the quantity and quality of the included literatures, the further research results were still needed.
OmpF, a nucleotide-sensing nanoprobe, computational evaluation of single channel activities

NASA Astrophysics Data System (ADS)

Abdolvahab, R. H.; Mobasheri, H.; Nikouee, A.; Ejtehadi, M. R.

2016-09-01

The results of highthroughput practical single channel experiments should be formulated and validated by signal analysis approaches to increase the recognition precision of translocating molecules. For this purpose, the activities of the single nano-pore forming protein, OmpF, in the presence of nucleotides were recorded in real time by the voltage clamp technique and used as a means for nucleotide recognition. The results were analyzed based on the permutation entropy of current Time Series (TS), fractality, autocorrelation, structure function, spectral density, and peak fraction to recognize each nucleotide, based on its signature effect on the conductance, gating frequency and voltage sensitivity of channel at different concentrations and membrane potentials. The amplitude and frequency of ion current fluctuation increased in the presence of Adenine more than Cytosine and Thymine in milli-molar (0.5 mM) concentrations. The variance of the current TS at various applied voltages showed a non-monotonic trend whose initial increasing slope in the presence of Thymine changed to a decreasing one in the second phase and was different from that of Adenine and Cytosine; e.g., by increasing the voltage from 40 to 140 mV in the 0.5 mM concentration of Adenine or Cytosine, the variance decreased by one third while for the case of Thymine it was doubled. Moreover, according to the structure function of TS, the fractality of current TS differed as a function of varying membrane potentials (pd) and nucleotide concentrations. Accordingly, the calculated permutation entropy of the TS, validated the biophysical approach defined for the recognition of different nucleotides at various concentrations, pd's and polarities. Thus, the promising outcomes of the combined experimental and theoretical methodologies presented here can be implemented as a complementary means in pore-based nucleotide recognition approaches.
Teaching Entrepreneurship and Micro-Entrepreneurship: An International Perspective

ERIC Educational Resources Information Center

Mondal, Wali I.; Jimenez, Lizandra

2015-01-01

Entrepreneurship is an integral part of business education. However, the concept is often confused or used synonymously with capitalism, perhaps because entrepreneurship is one of the four factors of production and profit maximization is considered as the single most important topic in teaching theory of the firm. Using risk as the key variable in…
Complete chloroplast genome of Tetragonia tetragonioides: Molecular phylogenetic relationships and evolution in Caryophyllales.

PubMed

Choi, Kyoung Su; Kwak, Myounghai; Lee, Byoungyoon; Park, SeonJoo

2018-01-01

The chloroplast genome of Tetragonia tetragonioides (Aizoaceae; Caryophyllales) was sequenced to provide information for studies on phylogeny and evolution within Caryophyllales. The chloroplast genome of Tetragonia tetragonioides is 149,506 bp in length and includes a pair of inverted repeats (IRs) of 24,769 bp that separate a large single copy (LSC) region of 82,780 bp and a small single copy (SSC) region of 17,188 bp. Comparative analysis of the chloroplast genome showed that Caryphyllales species have lost many genes. In particular, the rpl2 intron and infA gene were not found in T. tetragonioides, and core Caryophyllales lack the rpl2 intron. Phylogenetic analyses were conducted using 55 genes in 16 complete chloroplast genomes. Caryophyllales was found to divide into two clades; core Caryophyllales and noncore Caryophyllales. The genus Tetragonia is closely related to Mesembryanthemum. Comparisons of the synonymous (Ks), nonsynonymous (Ka), and Ka/Ks substitution rates revealed that nonsynonymous substitution rates were lower than synonymous substitution rates and that Ka/Ks rates were less than 1. The findings of the present study suggest that most genes are a purified selection.
Single-Molecule Counting of Point Mutations by Transient DNA Binding

NASA Astrophysics Data System (ADS)

Su, Xin; Li, Lidan; Wang, Shanshan; Hao, Dandan; Wang, Lei; Yu, Changyuan

2017-03-01

High-confidence detection of point mutations is important for disease diagnosis and clinical practice. Hybridization probes are extensively used, but are hindered by their poor single-nucleotide selectivity. Shortening the length of DNA hybridization probes weakens the stability of the probe-target duplex, leading to transient binding between complementary sequences. The kinetics of probe-target binding events are highly dependent on the number of complementary base pairs. Here, we present a single-molecule assay for point mutation detection based on transient DNA binding and use of total internal reflection fluorescence microscopy. Statistical analysis of single-molecule kinetics enabled us to effectively discriminate between wild type DNA sequences and single-nucleotide variants at the single-molecule level. A higher single-nucleotide discrimination is achieved than in our previous work by optimizing the assay conditions, which is guided by statistical modeling of kinetics with a gamma distribution. The KRAS c.34 A mutation can be clearly differentiated from the wild type sequence (KRAS c.34 G) at a relative abundance as low as 0.01% mutant to WT. To demonstrate the feasibility of this method for analysis of clinically relevant biological samples, we used this technology to detect mutations in single-stranded DNA generated from asymmetric RT-PCR of mRNA from two cancer cell lines.
Synonym Success--Thanks to the Thesaurus

ERIC Educational Resources Information Center

Mountain, Lee

2007-01-01

After a class of ninth graders discovered the helpfulness of the thesaurus in such synonym activities as Synonym Tic-Tac-Toe and Cross-Synonym Puzzles, they started using the thesaurus to locate "the exactly right word" while drafting compositions. They also enriched their oral vocabularies during these activities by discussing synonyms from the…

Adjusting for background mutation frequency biases improves the identification of cancer driver genes.

PubMed

Evans, Perry; Avey, Stefan; Kong, Yong; Krauthammer, Michael

2013-09-01

A common goal of tumor sequencing projects is finding genes whose mutations are selected for during tumor development. This is accomplished by choosing genes that have more non-synonymous mutations than expected from an estimated background mutation frequency. While this background frequency is unknown, it can be estimated using both the observed synonymous mutation frequency and the non-synonymous to synonymous mutation ratio. The synonymous mutation frequency can be determined across all genes or in a gene-specific manner. This choice introduces an interesting trade-off. A gene-specific frequency adjusts for an underlying mutation bias, but is difficult to estimate given missing synonymous mutation counts. Using a genome-wide synonymous frequency is more robust, but is less suited for adjusting biases. Studying four evaluation criteria for identifying genes with high non-synonymous mutation burden (reflecting preferential selection of expressed genes, genes with mutations in conserved bases, genes with many protein interactions, and genes that show loss of heterozygosity), we find that the gene-specific synonymous frequency is superior in the gene expression and protein interaction tests. In conclusion, the use of the gene-specific synonymous mutation frequency is well suited for assessing a gene's non-synonymous mutation burden.
Methods and kits for nucleic acid analysis using fluorescence resonance energy transfer

DOEpatents

Kwok, Pui-Yan; Chen, Xiangning

1999-01-01

A method for detecting the presence of a target nucleotide or sequence of nucleotides in a nucleic acid is disclosed. The method is comprised of forming an oligonucleotide labeled with two fluorophores on the nucleic acid target site. The doubly labeled oligonucleotide is formed by addition of a singly labeled dideoxynucleoside triphosphate to a singly labeled polynucleotide or by ligation of two singly labeled polynucleotides. Detection of fluorescence resonance energy transfer upon denaturation indicates the presence of the target. Kits are also provided. The method is particularly applicable to genotyping.
A new species of Tamenes Gounelle, 1912 (Coleoptera, Cerambycidae).

PubMed

Bezark, Larry G; Santos-Silva, Antonio; Galileo, Maria Helena M

2016-04-26

After the original description of Tamenes sarda Gounelle, 1912, based on a single male from Panama, the species was rarely mentioned in the literature except in catalogues and checklists. An exception appeared in Monné & Martins (1973) who speculated that Palaeotrachyderes Tippmann, 1960 may be equal to Tamenes (translated): "In 1960 Tippmann described Palaeotrachyderes, relating it with Lissonotini. The type species, P. laticornis, is also from Chiriqui [the type locality of T. sarda]. Those facts lead us to assume that the genera may be very close or even synonyms, which must be proven upon examination of material of that area." However, the scutellum of P. laticornis is notably small, while in T. sarda it is distinctly large. This feature makes it unlikely that those genera are synonymous.
Redescription of the Indo-West Pacific scorpionfish (Scorpaenidae), Neomerinthe erostris (Alcock 1896), a senior synonym of Scorpaena gibbifrons Fowler 1938, N. rotunda Chen 1981, and N. bathyperimensis Zajonz & Klausewitz 2002.

PubMed

Motomura, Hiroyuki; Causse, Romain; Béarez, Philippe; Mishra, Subhrendu Sekhar

2015-09-29

The Indo-West Pacific species, Neomerinthe erostris (Alcock 1896), originally described as Scorpaena erostris, is redescribed as a senior synonym of Scorpaena gibbifrons Fowler 1938, N. rotunda Chen 1981, and N. bathyperimensis Zajonz & Klausewitz 2002. Although the latter three nominal species have been regarded as valid species and N. erostris has not been reported since 1898, examinations of type specimens of the four nominal species revealed that they represent a single species. A lectotype of Scorpaena erostris is herein designated. Neomerinthe erostris is characterized by having a distinct longitudinal ridge on the lateral surface of the maxilla and a strongly rounded dorsal profile of the head.
Single nucleotide polymorphisms in CETP, SLC46A1, SLC19A1, CD36, BCOM1, APOA5, and ABCA1 are significant predictors of plasma HDL in healthy adults

USDA-ARS?s Scientific Manuscript database

In a marker-trait association study we estimated the statistical significance of 65 single nucleotide polymorphisms (SNP) in 23 candidate genes on HDL levels of two independent Caucasian populations. Each population consisted of men and women and their HDL levels were adjusted for gender and body we...
Rhabdomyolysis After Out-of-Water Exercise in an Elite Adolescent Water Polo Player Carrying the IL-6 174C Allele Single-Nucleotide Polymorphism.

PubMed

Eliakim, Alon; Ben Zaken, Sigal; Meckel, Yoav; Yamin, Chen; Dror, Nitzan; Nemet, Dan

2015-12-01

We present an adolescent elite water polo player who despite a genetic predisposition to develop exercise-induced severe muscle damage due to carrying the IL-6 174C allele single-nucleotide polymorphism, developed acute rhabdomyolysis only after a vigorous out-of-water training, suggesting that water polo training may be more suitable for genetically predisposed athletes.
Decreased necrotizing fasciitis capacity caused by a single nucleotide mutation that alters a multiple gene virulence axis

PubMed Central

Olsen, Randall J.; Sitkiewicz, Izabela; Ayeras, Ara A.; Gonulal, Vedia E.; Cantu, Concepcion; Beres, Stephen B.; Green, Nicole M.; Lei, Benfang; Humbird, Tammy; Greaver, Jamieson; Chang, Ellen; Ragasa, Willie P.; Montgomery, Charles A.; Cartwright, Joiner; McGeer, Allison; Low, Donald E.; Whitney, Adeline R.; Cagle, Philip T.; Blasdel, Terry L.; DeLeo, Frank R.; Musser, James M.

2010-01-01

Single-nucleotide changes are the most common cause of natural genetic variation among members of the same species, but there is remarkably little information bearing on how they alter bacterial virulence. We recently discovered a single-nucleotide mutation in the group A Streptococcus genome that is epidemiologically associated with decreased human necrotizing fasciitis (“flesh-eating disease”). Working from this clinical observation, we find that wild-type mtsR function is required for group A Streptococcus to cause necrotizing fasciitis in mice and nonhuman primates. Expression microarray analysis revealed that mtsR inactivation results in overexpression of PrsA, a chaperonin involved in posttranslational maturation of SpeB, an extracellular cysteine protease. Isogenic mutant strains that overexpress prsA or lack speB had decreased secreted protease activity in vivo and recapitulated the necrotizing fasciitis-negative phenotype of the ΔmtsR mutant strain in mice and monkeys. mtsR inactivation results in increased PrsA expression, which in turn causes decreased SpeB secreted protease activity and reduced necrotizing fasciitis capacity. Thus, a naturally occurring single-nucleotide mutation dramatically alters virulence by dysregulating a multiple gene virulence axis. Our discovery has broad implications for the confluence of population genomics and molecular pathogenesis research. PMID:20080771
Decreased necrotizing fasciitis capacity caused by a single nucleotide mutation that alters a multiple gene virulence axis.

PubMed

Olsen, Randall J; Sitkiewicz, Izabela; Ayeras, Ara A; Gonulal, Vedia E; Cantu, Concepcion; Beres, Stephen B; Green, Nicole M; Lei, Benfang; Humbird, Tammy; Greaver, Jamieson; Chang, Ellen; Ragasa, Willie P; Montgomery, Charles A; Cartwright, Joiner; McGeer, Allison; Low, Donald E; Whitney, Adeline R; Cagle, Philip T; Blasdel, Terry L; DeLeo, Frank R; Musser, James M

2010-01-12

Single-nucleotide changes are the most common cause of natural genetic variation among members of the same species, but there is remarkably little information bearing on how they alter bacterial virulence. We recently discovered a single-nucleotide mutation in the group A Streptococcus genome that is epidemiologically associated with decreased human necrotizing fasciitis ("flesh-eating disease"). Working from this clinical observation, we find that wild-type mtsR function is required for group A Streptococcus to cause necrotizing fasciitis in mice and nonhuman primates. Expression microarray analysis revealed that mtsR inactivation results in overexpression of PrsA, a chaperonin involved in posttranslational maturation of SpeB, an extracellular cysteine protease. Isogenic mutant strains that overexpress prsA or lack speB had decreased secreted protease activity in vivo and recapitulated the necrotizing fasciitis-negative phenotype of the DeltamtsR mutant strain in mice and monkeys. mtsR inactivation results in increased PrsA expression, which in turn causes decreased SpeB secreted protease activity and reduced necrotizing fasciitis capacity. Thus, a naturally occurring single-nucleotide mutation dramatically alters virulence by dysregulating a multiple gene virulence axis. Our discovery has broad implications for the confluence of population genomics and molecular pathogenesis research.
Genetic variants associated with the root system architecture of oilseed rape (Brassica napus L.) under contrasting phosphate supply.

PubMed

Wang, Xiaohua; Chen, Yanling; Thomas, Catherine L; Ding, Guangda; Xu, Ping; Shi, Dexu; Grandke, Fabian; Jin, Kemo; Cai, Hongmei; Xu, Fangsen; Yi, Bin; Broadley, Martin R; Shi, Lei

2017-08-01

Breeding crops with ideal root system architecture for efficient absorption of phosphorus is an important strategy to reduce the use of phosphate fertilizers. To investigate genetic variants leading to changes in root system architecture, 405 oilseed rape cultivars were genotyped with a 60K Brassica Infinium SNP array in low and high P environments. A total of 285 single-nucleotide polymorphisms were associated with root system architecture traits at varying phosphorus levels. Nine single-nucleotide polymorphisms corroborate a previous linkage analysis of root system architecture quantitative trait loci in the BnaTNDH population. One peak single-nucleotide polymorphism region on A3 was associated with all root system architecture traits and co-localized with a quantitative trait locus for primary root length at low phosphorus. Two more single-nucleotide polymorphism peaks on A5 for root dry weight at low phosphorus were detected in both growth systems and co-localized with a quantitative trait locus for the same trait. The candidate genes identified on A3 form a haplotype 'BnA3Hap', that will be important for understanding the phosphorus/root system interaction and for the incorporation into Brassica napus breeding programs. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Ancient mitochondrial haplotypes and evidence for intragenic recombination in a gynodioecious plant.

PubMed

Städler, Thomas; Delph, Lynda F

2002-09-03

Because of their extremely low nucleotide mutation rates, plant mitochondrial genes are generally not expected to show variation within species. Remarkably, we found nine distinct cytochrome b sequence haplotypes in the gynodioecious alpine plant Silene acaulis, with two or more haplotypes coexisting locally in each of three sampled regions. Moreover, there is evidence for intragenic recombination in the history of the haplotype sample, implying at least transient heteroplasmy of mitochondrial DNA (mtDNA). Heteroplasmy might be achieved by one of two potential mechanisms, either continuous coexistence of subgenomic fragments in low stoichiometry, or occasional paternal leakage of mtDNA. On the basis of levels of synonymous nucleotide substitutions, the average divergence time between haplotypes is estimated to be at least 15 million years. Ancient coalescence of extant haplotypes is further indicated by the paucity of fixed differences in haplotypes obtained from related species, a pattern expected under trans-specific evolution. Our data are consistent with models of frequency-dependent selection on linked cytoplasmic male-sterility factors, the putative molecular basis of females in gynodioecious populations. However, associations between marker loci and the inferred male-sterility genes can be maintained only with very low rates of recombination. Heteroplasmy and recombination between divergent haplotypes imply unexplored consequences for the evolutionary dynamics of gynodioecy, a widespread plant breeding system.
[Sequencing and analysis of complete genome of rabies viruses isolated from Chinese Ferret-Badger and dog in Zhejiang province].

PubMed

Lei, Yong-Liang; Wang, Xiao-Guang; Tao, Xiao-Yan; Li, Hao; Meng, Sheng-Li; Chen, Xiu-Ying; Liu, Fu-Ming; Ye, Bi-Feng; Tang, Qing

2010-01-01

Based on sequencing the full-length genomes of four Chinese Ferret-Badger and dog, we analyze the properties of rabies viruses genetic variation in molecular level, get the information about rabies viruses prevalence and variation in Zhejiang, and enrich the genome database of rabies viruses street strains isolated from China. Rabies viruses in suckling mice were isolated, overlapped fragments were amplified by RT-PCR and full-length genomes were assembled to analyze the nucleotide and deduced protein similarities and phylogenetic analyses from Chinese Ferret-Badger, dog, sika deer, vole, used vaccine strain were determined. The four full-length genomes were sequenced completely and had the same genetic structure with the length of 11, 923 nts or 11, 925 nts including 58 nts-Leader, 1353 nts-NP, 894 nts-PP, 609 nts-MP, 1575 nts-GP, 6386 nts-LP, and 2, 5, 5 nts- intergenic regions(IGRs), 423 nts-Pseudogene-like sequence (psi), 70 nts-Trailer. The four full-length genomes were in accordance with the properties of Rhabdoviridae Lyssa virus by BLAST and multi-sequence alignment. The nucleotide and amino acid sequences among Chinese strains had the highest similarity, especially among animals of the same species. Of the four full-length genomes, the similarity in amino acid level was dramatically higher than that in nucleotide level, so the nucleotide mutations happened in these four genomes were most synonymous mutations. Compared with the reference rabies viruses, the lengths of the five protein coding regions had no change, no recombination, only with a few point mutations. It was evident that the five proteins appeared to be stable. The variation sites and types of the four genomes were similar to the reference vaccine or street strains. And the four strains were genotype 1 according to the multi-sequence and phylogenetic analyses, which possessed the distinct district characteristics of China. Therefore, these four rabies viruses are likely to be street viruses already existing in the natural world.
Histology of the Urogenital System in the American Bullfrog (Rana catesbeiana), with Emphasis on Male Reproductive Morphology.

PubMed

Rheubert, Justin L; Cook, Hanna E; Siegel, Dustin S; Trauth, Stanley E

2017-10-01

Previous studies have revealed variations in the urogenital system morphology of amphibians. Recently, the urogenital system of salamanders was reviewed and terminology was synonymized across taxa. Discrepancies exist in the terminology describing the urogenital system of anurans, which prompted our group to develop a complete, detailed description of the urogenital system in an anuran species and provide nomenclature that is synonymous with those of other amphibian taxa. In Rana catesbeiana, sperm mature within spermatocysts of the seminiferous tubule epithelia and are transported to a series of intratesticular ducts that exit the testes and merge to form vasa efferentia. Vasa efferentia converge into single longitudinal ducts (Bidder's ducts) on the lateral aspects of the kidneys. Branches from the longitudinal ducts merge with genital kidney renal tubules through renal corpuscles. The nephrons travel caudally and empty into the Wöffian ducts. Similar to salamanders, the caudal portion of the kidneys (termed the pelvic kidneys in salamanders) only possesses nephrons involved in urine formation, not sperm transport. Data from the present study provide a detailed description and synonymous nomenclature that can be used to make future comparative analyses between taxa more efficient.
Integrative taxonomy allows the identification of synonymous species and the erection of a new genus of Echiniscidae (Tardigrada, Heterotardigrada).

PubMed

Vicente, Filipe; Fontoura, Paulo; Cesari, Michele; Rebecchi, Lorena; Guidetti, Roberto; Serrano, Artur; Bertolani, Roberto

2013-02-14

The taxonomy of tardigrades is challenging as these animals demonstrate a limited number of useful morphological characters, therefore several species descriptions are supported by only minor differences. For example, Echiniscus oihonnae and Echiniscus multispinosus are separated exclusively by the absence or presence of dorsal spines at position Bd. Doubts were raised on the validity of these two species, which were often sampled together. Using an integrative approach, based on genetic and morphological investigations, we studied two new Portuguese populations, and compared these with archived collections. We have determined that the two species must be considered synonymous with Echiniscus oihonnae the senior synonym. Our study showed generally low genetic distances of cox1 gene (with a maximum of 4.1%), with specimens displaying both morphologies sharing the same haplotype, and revealed character Bd to be variable. Addition-ally, a more detailed morphological and phylogenetic study based on the 18S gene uncovered a new evolutionary line within the Echiniscidae, which justified the erection of Diploechiniscus gen. nov. The new genus is in a sister group relationship with Echiniscus and is, for the moment, composed of a single species.
Synonyms for some species of Mexican anoles (Squamata: Dactyloidae).

PubMed

De Oca, Adrián Nieto Montes; Poe, Steven; Scarpetta, Simon; Gray, Levi; Lieb, Carl S

2013-01-01

We studied type material and freshly collected topotypical specimens to assess the taxonomic status of five names associated with species of Mexican Anolis. We find A. schmidti to be a junior synonym of A. nebulosus, A. breedlovei to be a junior synonym of A. cuprinus, A. polyrhachis to be a junior synonym of A. rubiginosus, A. simmonsi to be a junior synonym of A. nebuloides, and A. adleri to be a junior synonym of A. liogaster.
Association of Cytokine Candidate Genes with Severity of Pain and Co-Occurring Symptoms in Breast Cancer Patients Receiving Chemotherapy

DTIC Science & Technology

2013-10-01

identify common genetic variations (i.e., single nucleotide polymorphisms [ SNPs ] and haplotypes) in cytokine genes, as well demographic, clinical, and...Center. The purpose of the proposed project is to identify common genetic variations (i.e., single nucleotide polymorphisms [ SNPs ] and haplotypes) in...research team continues to meet monthly to discuss progress with regards to recruitment, enrollment, and data collection. Training in Genetics In year
Single-cell analysis of intercellular heteroplasmy of mtDNA in Leber hereditary optic neuropathy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kobayashi, Y.; Sharpe, H.; Brown, N.

1994-07-01

The authors have investigated the distribution of mutant mtDNA molecules in single cells from a patient with Leber hereditary optic neuropathy (LHON). LHON is a maternally inherited disease that is characterized by a sudden-onset bilateral loss of central vision, which typically occurs in early adulthood. More than 50% of all LHON patients carry an mtDNA mutation at nucleotide position 11778. This nucleotide change converts a highly conserved arginine residue to histidine at codon 340 in the NADH-ubiquinone oxidoreductase subunit 4 (ND4) gene of mtDNA. In the present study, the authors used PCR amplification of mtDNA from lymphocytes to investigate mtDNAmore » heteroplasmy at the single-cell level in a LHON patient. They found that most cells were either homoplasmic normal or homoplasmic mutant at nucleotide position 11778. Some (16%) cells contained both mutant and normal mtDNA.« less
Single Locked Nucleic Acid-Enhanced Nanopore Genetic Discrimination of Pathogenic Serotypes and Cancer Driver Mutations.

PubMed

Tian, Kai; Chen, Xiaowei; Luan, Binquan; Singh, Prashant; Yang, Zhiyu; Gates, Kent S; Lin, Mengshi; Mustapha, Azlin; Gu, Li-Qun

2018-05-22

Accurate and rapid detection of single-nucleotide polymorphism (SNP) in pathogenic mutants is crucial for many fields such as food safety regulation and disease diagnostics. Current detection methods involve laborious sample preparations and expensive characterizations. Here, we investigated a single locked nucleic acid (LNA) approach, facilitated by a nanopore single-molecule sensor, to accurately determine SNPs for detection of Shiga toxin producing Escherichia coli (STEC) serotype O157:H7, and cancer-derived EGFR L858R and KRAS G12D driver mutations. Current LNA applications that require incorporation and optimization of multiple LNA nucleotides. But we found that in the nanopore system, a single LNA introduced in the probe is sufficient to enhance the SNP discrimination capability by over 10-fold, allowing accurate detection of the pathogenic mutant DNA mixed in a large amount of the wild-type DNA. Importantly, the molecular mechanistic study suggests that such a significant improvement is due to the effect of the single-LNA that both stabilizes the fully matched base-pair and destabilizes the mismatched base-pair. This sensitive method, with a simplified, low cost, easy-to-operate LNA design, could be generalized for various applications that need rapid and accurate identification of single-nucleotide variations.
Leveraging Paraphrase Labels to Extract Synonyms from Twitter

DOE Office of Scientific and Technical Information (OSTI.GOV)

Antoniak, Maria A.; Bell, Eric B.; Xia, Fei

2015-05-18

We present an approach for automatically learning synonyms from a paraphrase corpus of tweets. This work shows improvement on the task of paraphrase detection when we substitute our extracted synonyms into the training set. The synonyms are learned by using chunks from a shallow parse to create candidate synonyms and their context windows, and the synonyms are incorporated into a paraphrase detection system that uses machine translation metrics as features for a classifier. We demonstrate a 2.29% improvement in F1 when we train and test on the paraphrase training set, providing better coverage than previous systems, which shows the potentialmore » power of synonyms that are representative of a specific topic.« less
An NB-LRR gene, TYNBS1, is responsible for resistance mediated by the Ty-2 Begomovirus resistance locus of tomato.

PubMed

Yamaguchi, Hirotaka; Ohnishi, Jun; Saito, Atsushi; Ohyama, Akio; Nunome, Tsukasa; Miyatake, Koji; Fukuoka, Hiroyuki

2018-06-01

An NB-LRR gene, TYNBS1, was isolated from Begomovirus-resistance locus Ty-2. Transgenic plant analysis revealed that TYNBS1 is a functional resistance gene. TYNBS1 is considered to be synonymous with Ty-2. Tomato yellow leaf curl disease caused by Tomato yellow leaf curl virus (TYLCV) is a serious threat to tomato (Solanum lycopersicum L.) production worldwide. A Begomovirus resistance gene, Ty-2, was introduced into cultivated tomato from Solanum habrochaites by interspecific crossing. To identify the Ty-2 gene, we performed genetic analysis. Identification of recombinant line 3701 confirmed the occurrence of a chromosome inversion in the Ty-2 region of the resistant haplotype. Genetic analysis revealed that the Ty-2 gene is linked to an introgression encompassing two markers, SL11_25_54277 and repeat A (approximately 200 kb). Genomic sequences of the upper and lower border of the inversion section of susceptible and resistant haplotypes were determined. Two nucleotide-binding domain and leucine-rich repeat-containing (NB-LRR) genes, TYNBS1 and TYNBS2, were identified around the upper and lower ends of the inversion section, respectively. TYNBS1 strictly co-segregated with TYLCV resistance, whereas TYNBS2 did not. Genetic introduction of genomic fragments containing the TYNBS1 gene into susceptible tomato plants conferred TYLCV resistance. These results demonstrate that TYNBS1 is a functional resistance gene for TYLCV, and is synonymous with the Ty-2 gene.
Species Based Synonymous Codon Usage in Fusion Protein Gene of Newcastle Disease Virus

PubMed Central

Kumar, Chandra Shekhar; Kumar, Sachin

2014-01-01

Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of F gene have been explored for the development of vaccine and diagnostics against NDV. Although the F protein is well studied but the codon usage and its nucleotide composition from NDV isolated from different species have not yet been explored. In present study, we have analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV is analyzed for its base composition and its correlation with the bias in codon usage. Our result showed that random mutational pressure is responsible for codon usage bias in F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found responsible for species based synonymous codon usage bias in F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found in corroboration with the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species. PMID:25479071

Codon Usage Patterns of Tyrosinase Genes in Clonorchis sinensis.

PubMed

Bae, Young-An

2017-04-01

Codon usage bias (CUB) is a unique property of genomes and has contributed to the better understanding of the molecular features and the evolution processes of particular gene. In this study, genetic indices associated with CUB, including relative synonymous codon usage and effective numbers of codons, as well as the nucleotide composition, were investigated in the Clonorchis sinensis tyrosinase genes and their platyhelminth orthologs, which play an important role in the eggshell formation. The relative synonymous codon usage patterns substantially differed among tyrosinase genes examined. In a neutrality analysis, the correlation between GC 12 and GC 3 was statistically significant, and the regression line had a relatively gradual slope (0.218). NC-plot, i.e., GC 3 vs effective number of codons (ENC), showed that most of the tyrosinase genes were below the expected curve. The codon adaptation index (CAI) values of the platyhelminth tyrosinases had a narrow distribution between 0.685/0.714 and 0.797/0.837, and were negatively correlated with their ENC. Taken together, these results suggested that CUB in the tyrosinase genes seemed to be basically governed by selection pressures rather than mutational bias, although the latter factor provided an additional force in shaping CUB of the C. sinensis and Opisthorchis viverrini genes. It was also apparent that the equilibrium point between selection pressure and mutational bias is much more inclined to selection pressure in highly expressed C. sinensis genes, than in poorly expressed genes.
3-base periodicity in coding DNA is affected by intercodon dinucleotides

PubMed Central

Sánchez, Joaquín

2011-01-01

All coding DNAs exhibit 3-base periodicity (TBP), which may be defined as the tendency of nucleotides and higher order n-tuples, e.g. trinucleotides (triplets), to be preferentially spaced by 3, 6, 9 etc, bases, and we have proposed an association between TBP and clustering of same-phase triplets. We here investigated if TBP was affected by intercodon dinucleotide tendencies and whether clustering of same-phase triplets was involved. Under constant protein sequence intercodon dinucleotide frequencies depend on the distribution of synonymous codons. So, possible effects were revealed by randomly exchanging synonymous codons without altering protein sequences to subsequently document changes in TBP via frequency distribution of distances (FDD) of DNA triplets. A tripartite positive correlation was found between intercodon dinucleotide frequencies, clustering of same-phase triplets and TBP. So, intercodon C|A (where “|” indicates the boundary between codons) was more frequent in native human DNA than in the codon-shuffled sequences; higher C|A frequency occurred along with more frequent clustering of C|AN triplets (where N jointly represents A, C, G and T) and with intense CAN TBP. The opposite was found for C|G, which was less frequent in native than in shuffled sequences; lower C|G frequency occurred together with reduced clustering of C|GN triplets and with less intense CGN TBP. We hence propose that intercodon dinucleotides affect TBP via same-phase triplet clustering. A possible biological relevance of our findings is briefly discussed. PMID:21814388
Rice Ferredoxin-Dependent Glutamate Synthase Regulates Nitrogen-Carbon Metabolomes and Is Genetically Differentiated between japonica and indica Subspecies.

PubMed

Yang, Xiaolu; Nian, Jinqiang; Xie, Qingjun; Feng, Jian; Zhang, Fengxia; Jing, Hongwei; Zhang, Jian; Dong, Guojun; Liang, Yan; Peng, Juli; Wang, Guodong; Qian, Qian; Zuo, Jianru

2016-11-07

Plants assimilate inorganic nitrogen absorbed from soil into organic forms as Gln and Glu through the glutamine synthetase/glutamine:2-oxoglutarate amidotransferase (GS/GOGAT) cycle. Whereas GS catalyzes the formation of Gln from Glu and ammonia, GOGAT catalyzes the transfer of an amide group from Gln to 2-oxoglutarate to produce two molecules of Glu. However, the regulatory role of the GS/GOGAT cycle in the carbon-nitrogen balance is not well understood. Here, we report the functional characterization of rice ABNORMAL CYTOKININ RESPONSE 1 (ABC1) gene that encodes a ferredoxin-dependent (Fd)-GOGAT. The weak mutant allele abc1-1 mutant shows a typical nitrogen-deficient syndrome, whereas the T-DNA insertional mutant abc1-2 is seedling lethal. Metabolomics analysis revealed the accumulation of an excessive amount of amino acids with high N/C ratio (Gln and Asn) and several intermediates in the tricarboxylic acid cycle in abc1-1, suggesting that ABC1 plays a critical role in nitrogen assimilation and carbon-nitrogen balance. Five non-synonymous single-nucleotide polymorphisms were identified in the ABC1 coding region and characterized as three distinct haplotypes, which have been highly and specifically differentiated between japonica and indica subspecies. Collectively, these results suggest that ABC1/OsFd-GOGAT is essential for plant growth and development by modulating nitrogen assimilation and the carbon-nitrogen balance. Copyright © 2016 The Author. Published by Elsevier Inc. All rights reserved.
Novel Compound Heterozygous CLCNKB Gene Mutations (c.1755A>G/ c.848_850delTCT) Cause Classic Bartter Syndrome.

PubMed

Wang, Chunli; Chen, Ying; Zheng, Bixia; Zhu, Mengshu; Fan, Jia; Wang, Juejin; Jia, Zhanjun; Huang, Songming; Zhang, Aihua

2018-02-14

Inactivated variants in CLCNKB gene encoding the basolateral chloride channel ClC-Kb cause classic Bartter syndrome characterized by hypokalemic metabolic alkalosis and hyperreninemic hyperaldosteronism. Here we identified two cBS siblings presenting hypokalemia in a Chinese family due to novel compound heterozygous CLCNKB mutations (c.848_850delTCT/c.1755A>G). Compound heterozygosity was confirmed by amplifying and sequencing the patient's genomic DNA. The synonymous mutation c.1755A>G (Thr585Thr) was located at +2bp from the 5' splice donor site in exon 15, further transcript analysis demonstrated that this single nucleotide mutation causes exclusion of exon 15 in the cDNA from the proband and his mother. Furthermore, we investigated the expression and protein trafficking change of c.848_850delTCT (TCT) and exon 15 deletion（E15）mutation in vitro. The E15 mutation markedly decreased the expression of ClC-Kb and resulted in a low-molecular-weight band (~55kD) trapping in the endoplasmic reticulum, while the TCT mutant only decreased the total and plasma membrane ClC-Kb protein expression but did not affect the subcellular localization. Finally, we studied the physiological functions of mutations by using whole-cell patch clamp and found that E15 or TCT mutation decreased the current of ClC-Kb/barttin channel. These results suggested that the compound defective mutations of CLCNKB gene are the molecular mechanism of the two cBS siblings.
A genetic variant of the NTCP gene is associated with HBV infection status in a Chinese population.

PubMed

Yang, Jingmin; Yang, Yuan; Xia, Mingying; Wang, Lianghui; Zhou, Weiping; Yang, Yajun; Jiang, Yueming; Wang, Hongyang; Qian, Ji; Jin, Li; Wang, Xiaofeng

2016-03-12

To investigate whether genetic variants of the HBV receptor gene NTCP are associated with HBV infection in the Han Chinese population. We sequenced the entire 23 kb NTCP gene from 111 HBeAg-positive HBsAg carriers (PSE group), 110 HBeAg-negative HBsAg carriers (PS group), and 110 control subjects. Then, we performed association analyses of suggestively significant SNPs with HBV infection in 1075 controls, 1936 PSs and 639 PSEs. In total, 109 rare variants (74 novel) and 38 single nucleotide polymorphisms (SNPs, one novel) were screened. Of the seven non-synonymous rare variants, six were singletons and one was a double hit. All three damaging rare singletons presented exclusively in the PSE group. Of the five SNPs validated in all 3650 subjects, the T allele of rs4646287 was significantly decreased (p = 0.002) in the PS group (10.1%) and PSE group (8.1%) compared to the controls (10.9%) and was decreased to 7.4% in the PSE hepatocellular carcinoma (HCC) subgroup. Additionally, rs4646287-T was associated with a 0.68-fold (95% CI = 0.51-0.89, p = 0.006) decreased risk of PSE compared with the controls. The NTCP mRNA level was lower in HCC tissues in "CT + TT" carriers than in "CC" carriers. We found a genetic variant (rs4646287) located in intron 1 of NTCP that may be associated with increased risk of HBV infection in Han Chinese.
Novel gene-by-environment interactions: APOB and NPC1L1 variants affect the relationship between dietary and total plasma cholesterol[S

PubMed Central

Kim, Daniel S.; Burt, Amber A.; Ranchalis, Jane E.; Jarvik, Ella R.; Rosenthal, Elisabeth A.; Hatsukami, Thomas S.; Furlong, Clement E.; Jarvik, Gail P.

2013-01-01

Cardiovascular disease (CVD) is the leading cause of death in developed countries. Plasma cholesterol level is a key risk factor in CVD pathogenesis. Genetic and dietary variation both influence plasma cholesterol; however, little is known about dietary interactions with genetic variants influencing the absorption and transport of dietary cholesterol. We sought to determine whether gut expressed variants predicting plasma cholesterol differentially affected the relationship between dietary and plasma cholesterol levels in 1,128 subjects (772/356 in the discovery/replication cohorts, respectively). Four single nucleotide polymorphisms (SNPs) within three genes (APOB, CETP, and NPC1L1) were significantly associated with plasma cholesterol in the discovery cohort. These were subsequently evaluated for gene-by-environment (GxE) interactions with dietary cholesterol for the prediction of plasma cholesterol, with significant findings tested for replication. Novel GxE interactions were identified and replicated for two variants: rs1042034, an APOB Ser4338Asn missense SNP and rs2072183 (in males only), a synonymous NPC1L1 SNP in linkage disequilibrium with SNPs 5′ of NPC1L1. This study identifies the presence of novel GxE and gender interactions implying that differential gut absorption is the basis for the variant associations with plasma cholesterol. These GxE interactions may account for part of the “missing heritability” not accounted for by genetic associations. PMID:23482652
Major Breeding Plumage Color Differences of Male Ruffs (Philomachus pugnax) Are Not Associated With Coding Sequence Variation in the MC1R Gene

PubMed Central

Küpper, Clemens; Burke, Terry; Lank, David B.

2015-01-01

Sequence variation in the melanocortin-1 receptor (MC1R) gene explains color morph variation in several species of birds and mammals. Ruffs (Philomachus pugnax) exhibit major dark/light color differences in melanin-based male breeding plumage which is closely associated with alternative reproductive behavior. A previous study identified a microsatellite marker (Ppu020) near the MC1R locus associated with the presence/absence of ornamental plumage. We investigated whether coding sequence variation in the MC1R gene explains major dark/light plumage color variation and/or the presence/absence of ornamental plumage in ruffs. Among 821bp of the MC1R coding region from 44 male ruffs we found 3 single nucleotide polymorphisms, representing 1 nonsynonymous and 2 synonymous amino acid substitutions. None were associated with major dark/light color differences or the presence/absence of ornamental plumage. At all amino acid sites known to be functionally important in other avian species with dark/light plumage color variation, ruffs were either monomorphic or the shared polymorphism did not coincide with color morph. Neither ornamental plumage color differences nor the presence/absence of ornamental plumage in ruffs are likely to be caused entirely by amino acid variation within the coding regions of the MC1R locus. Regulatory elements and structural variation at other loci may be involved in melanin expression and contribute to the extreme plumage polymorphism observed in this species. PMID:25534935
Tolerant industrial yeast Saccharomyces cerevisiae posses a more robust cell wall integrity signaling pathway against 2-furaldehyde and 5-(hydroxymethyl)-2-furaldehyde.

PubMed

Liu, Z Lewis; Wang, Xu; Weber, Scott A

2018-06-20

Cell wall integrity signaling pathway in Saccharomyces cerevisiae is a conserved function for detecting and responding to cell stress conditions but less understood for industrial yeast. We examined gene expression dynamics for a tolerant industrial yeast strain NRRL Y-50049 in response to challenges of furfural and HMF through comparative quantitative gene expression analysis using pathway-based qRT-PCR array assays. All tested genes from Y-50049, except for MLP2, demonstrated more resistant and significantly increased gene expression than that from a laboratory strain BY4741. While all five sensor encoding genes WSC1, WSC2, WSC3, MID2 and MTL1 from both strains were activated in response to the furfural-HMF treatment, WSC3 from Y-50049 demonstrated the most increased expression over time compared with any other sensor genes. These results suggested the industrial yeast poses more robust cell wall integrity pathway, and gene WSC3 could have the special capability for signal transmission against furfural and HMF. Among five single nucleotide variations discovered in WSC3 from Y-50049, three were found to be non-synonymous mutations resulting in amino acid alterations of Ser 158  → Tyr 158 , Val 186  → Ile 186 , and Glu 430  → Asp 430 . Our results suggest the industrial yeast as a more desirable delivery vehicle for the next-generation biocatalyst development. Published by Elsevier B.V.
Vkorc1 sequencing suggests anticoagulant resistance in rats in New Zealand.

PubMed

Cowan, Phil E; Gleeson, Dianne M; Howitt, Robyn Lj; Ramón-Laca, Ana; Esther, Alexandra; Pelz, Hans-Joachim

2017-01-01

Anticoagulant toxins are used globally to control rats. Resistance of Rattus species to these toxins now occurs in at least 18 countries in Europe, America and Asia. Resistance is often associated with single nucleotide polymorphisms (SNPs) in the Vkorc1 gene. This study gives a first overview of the distribution and frequency of Vkorc1 SNPs in rats in New Zealand. New Zealand is unusual in having no native rodents but three species of introduced Rattus - norvegicus Berk., rattus L. and exulans Peale. Sequence variants occurred in at least one species of rat at all 30 of the sites sampled. Three new SNPs were identified, one in kiore and two in ship rats. No SNPs previously associated with resistance were found in Norway rats or kiore, but seven ship rats were heterozygous and one homozygous for the A74T variant. Its resultant Tyr25Phe mutation has previously been associated with resistance to both first- and second-generation anticoagulants in ship rats in Spain. This is the first evidence of potential resistance to anticoagulant toxins in rats in New Zealand. Further testing using blood clotting response times in dosed rats is needed to confirm resistance potentially conferred by the Tyr25Phe mutation. Assessment is also needed of the potential of the other non-synonymous variants (Ala14Val, Ala26Val) recorded in this study to confer resistance to anticoagulant toxins. © 2016 Society of Chemical Industry. © 2016 Society of Chemical Industry.
Diversity in the Toll-Like Receptor Genes of the African Penguin (Spheniscus demersus).

PubMed

Dalton, Desiré Lee; Vermaak, Elaine; Roelofse, Marli; Kotze, Antoinette

2016-01-01

The African penguin, Spheniscus demersus, is listed as Endangered by the IUCN Red List of Threatened Species due to the drastic reduction in population numbers over the last 20 years. To date, the only studies on immunogenetic variation in penguins have been conducted on the major histocompatibility complex (MHC) genes. It was shown in humans that up to half of the genetic variability in immune responses to pathogens are located in non-MHC genes. Toll-like receptors (TLRs) are now increasingly being studied in a variety of taxa as a broader approach to determine functional genetic diversity. In this study, we confirm low genetic diversity in the innate immune region of African penguins similar to that observed in New Zealand robin that has undergone several severe population bottlenecks. Single nucleotide polymorphism (SNP) diversity across TLRs varied between ex situ and in situ penguins with the number of non-synonymous alterations in ex situ populations (n = 14) being reduced in comparison to in situ populations (n = 16). Maintaining adaptive diversity is of vital importance in the assurance populations as these animals may potentially be used in the future for re-introductions. Therefore, this study provides essential data on immune gene diversity in penguins and will assist in providing an additional monitoring tool for African penguin in the wild, as well as to monitor diversity in ex situ populations and to ensure that diversity found in the in situ populations are captured in the assurance populations.
Association of Arg194Trp, Arg280His and Arg399Gln Polymorphisms in X-ray Repair Cross-Complementing Group 1 Gene and Risk of Differentiated Thyroid Carcinoma in Iran

PubMed Central

Fard-Esfahani, Pezhman; Fard-Esfahani, Armaghan; Fayaz, Shima; Ghanbarzadeh, Bahareh; Saidi, Parinaz; Mohabati, Reyhaneh; Bidoki, Seyed Kazem; Majdi, Mina

2011-01-01

Background: X-ray repair cross-complementing group 1 (XRCC1) gene is a DNA repair gene and its non-synonymous single nucleotide polymorphisms (SNP) may influence DNA repair capacity which has been considered as a modifying risk factor for cancer development. Methods: A case-control study was conducted to investigate impact of three frequently studied polymorphisms (Arg194Trp, Arg280His and Arg399Gln) on developing differentiated thyroid carcinoma (DTC). Results: Increased risks for DTC were shown in homozygous (odds ratio [OR]: 3.66, 95% confidence interval [CI]: 0.38-35.60) and in dominant trait (OR: 1.22, 95% CI: 1.64-2.32) of Arg194Trp genotype. Also, for Arg280His genotype, an increased risk for DTC was shown in dominant trait (OR: 1.42, 95% confidence interval [CI]: 0.76-2.68), while a mildly reduction of risk for DTC (OR: 0.77, 95% [CI]: 0.50-1.17) was estimated in dominant Gln genotype of Arg399Gln. Considering combinatory effects of Arg194Trp and Arg280His genotypes on DTC, the calculated OR and 95% CI for being heterozygous for one of Arg194Trp or Arg280His genotypes were 1.57 and 0.90-2.74, respectively. Conclusion: Genotyping of codons 194, 280 and 399 in XRCC1 gene may use in risk assessment of DTC. PMID:21987112
Selection of functional 2A sequences within foot-and-mouth disease virus; requirements for the NPGP motif with a distinct codon bias.

PubMed

Kjær, Jonas; Belsham, Graham J

2018-01-01

Foot-and-mouth disease virus (FMDV) has a positive-sense ssRNA genome including a single, large, open reading frame. Splitting of the encoded polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues long), which induces a nonproteolytic, cotranslational "cleavage" at its own C terminus. A conserved feature among variants of 2A is the C-terminal motif N 16 P 17 G 18 /P 19 , where P 19 is the first residue of 2B. It has been shown previously that certain amino acid substitutions can be tolerated at residues E 14 , S 15 , and N 16 within the 2A sequence of infectious FMDVs, but no variants at residues P 17 , G 18 , or P 19 have been identified. In this study, using highly degenerate primers, we analyzed if any other residues can be present at each position of the NPG/P motif within infectious FMDV. No alternative forms of this motif were found to be encoded by rescued FMDVs after two, three, or four passages. However, surprisingly, a clear codon preference for the wt nucleotide sequence encoding the NPGP motif within these viruses was observed. Indeed, the codons selected to code for P 17 and P 19 within this motif were distinct; thus the synonymous codons are not equivalent. © 2018 Kjær and Belsham; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Sex dependent influence of a functional polymorphism in steroid 5-α-reductase type 2 (SRD5A2) on post-traumatic stress symptoms.

PubMed

Gillespie, Charles F; Almli, Lynn M; Smith, Alicia K; Bradley, Bekh; Kerley, Kimberly; Crain, Daniel F; Mercer, Kristina B; Weiss, Tamara; Phifer, Justine; Tang, Yilang; Cubells, Joseph F; Binder, Elisabeth B; Conneely, Karen N; Ressler, Kerry J

2013-04-01

A non-synonymous, single nucleotide polymorphism (SNP) in the gene coding for steroid 5-α-reductase type 2 (SRD5A2) is associated with reduced conversion of testosterone to dihydrotestosterone (DHT). Because SRD5A2 participates in the regulation of testosterone and cortisol metabolism, hormones shown to be dysregulated in patients with PTSD, we examined whether the V89L variant (rs523349) influences risk for post-traumatic stress disorder (PTSD). Study participants (N = 1,443) were traumatized African-American patients of low socioeconomic status with high rates of lifetime trauma exposure recruited from the primary care clinics of a large, urban hospital. PTSD symptoms were measured with the post-traumatic stress symptom scale (PSS). Subjects were genotyped for the V89L variant (rs523349) of SRD5A2. We initially found a significant sex-dependent effect of genotype in male but not female subjects on symptoms. Associations with PTSD symptoms were confirmed using a separate internal replication sample with identical methods of data analysis, followed by pooled analysis of the combined samples (N = 1,443, sex × genotype interaction P < 0.002; males: n = 536, P < 0.001). These data support the hypothesis that functional variation within SRD5A2 influences, in a sex-specific way, the severity of post-traumatic stress symptoms and risk for diagnosis of PTSD. Copyright © 2013 Wiley Periodicals, Inc.
No association of the Arg51Gln and Leu72Met polymorphisms of the ghrelin gene and polycystic ovary syndrome.

PubMed

Wang, Kehua; Wang, Leiguang; Zhao, Yueran; Shi, Yuhua; Wang, Laicheng; Chen, Zi-Jiang

2009-02-01

Ghrelin plays a role in regulating glucose metabolism and energy balance. Polymorphisms in preproghrelin and ghrelin gene could be responsible for obesity, insulin resistance and low ghrelin levels observed in some individuals. The objective of this study was to evaluate the influence of two single-nucleotide polymorphisms (SNPs) of ghrelin gene on the clinical, the hormonal and metabolic features in women with polycystic ovary syndrome (PCOS) in a Chinese population. A large sample of Chinese PCOS (n = 271) women and a control group (n = 296) of healthy women matched for age were studied. Hormone and metabolic profiles were measured and blood samples were collected for genotype and allelic frequency analysis. Non-synonymous SNPs in the coding region (exon 2) of the preproghrelin gene (Arg51Gln (346 G>A) and Leu72Met (408 C>A) were studied using PCR and restriction fragment length polymorphism analysis. The polymorphism Arg51Gln was not found in the cohorts studied. The distribution of Leu72Met was similar in PCOS group and in healthy controls. There was no significant difference in age, BMI, waist-hip-ratio and levels of FSH, LH, estradiol, testosterone and prolactin between PCOS patients with different genotypes, and the level of plasma glucose and insulin was also similar. No association was found between Leu72Met and Arg51Gln polymorphisms in the ghrelin gene and PCOS in Chinese population.
Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality.

PubMed

Ali, Shahin S; Shao, Jonathan; Strem, Mary D; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W; Bailey, Bryan A

2015-01-01

Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri.
Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality

PubMed Central

Ali, Shahin S.; Shao, Jonathan; Strem, Mary D.; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W.; Bailey, Bryan A.

2015-01-01

Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri. PMID:26379633
MicroRNA biogenesis pathway from the salmon louse (Caligus rogercresseyi): emerging role in delousing drug response.

PubMed

Valenzuela-Miranda, Diego; Nuñez-Acuña, Gustavo; Valenzuela-Muñoz, Valentina; Asgari, Sassan; Gallardo-Escárate, Cristian

2015-01-25

Despite the increasing evidence of the importance of microRNAs (miRNAs) in the regulation of multiple biological processes, the molecular bases supporting this regulation are still barely understood in crustaceans. Therefore, the molecular characterization and transcriptome modulation of the miRNA biogenesis pathway were evaluated in the salmon louse Caligus rogercresseyi, an ectoparasite that constitutes one of the biggest concerns for salmonid aquaculture industry. Hence, RNA-Seq analysis was conducted from six different developmental stages, and also after bioassays with delousing drugs Deltamethrin and Azamethiphos using adult individuals. In silico analysis evidenced 24 putative genes involved in the miRNA pathway such as biogenesis, transport, maturation and miRNA-target interaction. Moreover, 243 putative single nucleotide polymorphisms (SNPs) were identified, 15 of which showed non-synonym mutations. RNA-Seq analysis revealed that CCR4-Not complex subunit 3 (CNOT3) was upregulated at earlier developmental stages (nauplius I-II and copepodid), and also after the exposure to Azamethiphos, but not to Deltamethrin. In contrast, the subunit 7 (CNOT7) showed an inverse expression pattern. Different Argonaute transcripts were associated to chalimus and adult stages, revealing specific expression patterns in response to antiparasitic drugs. Our results suggest novel insights into the regulatory network of the post-transcriptional gene regulation in C. rogercresseyi mediated by miRNAs, evidencing a putative role during the ontogeny and drug response. Copyright © 2014 Elsevier B.V. All rights reserved.
Allelic variations and differential expressions detected at quantitative trait loci for salt stress tolerance in wheat.

PubMed

Oyiga, Benedict C; Sharma, Ram C; Baum, Michael; Ogbonnaya, Francis C; Léon, Jens; Ballvora, Agim

2018-05-01

The increasing salinization of agricultural lands is a threat to global wheat production. Understanding of the mechanistic basis of salt tolerance (ST) is essential for developing breeding and selection strategies that would allow for increased wheat production under saline conditions to meet the increasing global demand. We used a set that consists of 150 internationally derived winter and facultative wheat cultivars genotyped with a 90K SNP chip and phenotyped for ST across three growth stages and for ionic (leaf K + and Na + contents) traits to dissect the genetic architecture regulating ST in wheat. Genome-wide association mapping revealed 187 Single Nucleotide Polymorphism (SNPs) (R 2 = 3.00-30.67%), representing 37 quantitative trait loci (QTL), significantly associated with the ST traits. Of these, four QTL on 1BS, 2AL, 2BS and 3AL were associated with ST across the three growth stages and with the ionic traits. Novel QTL were also detected on 1BS and 1DL. Candidate genes linked to these polymorphisms were uncovered, and expression analyses were performed and validated on them under saline and non-saline conditions using transcriptomics and qRT-PCR data. Expressed sequence comparisons in contrasting ST wheat genotypes identified several non-synonymous/missense mutation sites that are contributory to the ST trait variations, indicating the biological relevance of these polymorphisms that can be exploited in breeding for ST in wheat. © 2017 The Authors. Plant, Cell & Environment published by JohnWiley & Sons Ltd.
An Evolutionary Landscape of A-to-I RNA Editome across Metazoan Species

PubMed Central

Hung, Li-Yuan; Chen, Yen-Ju; Mai, Te-Lun; Chen, Chia-Ying; Yang, Min-Yu; Chiang, Tai-Wei; Wang, Yi-Da

2018-01-01

Abstract Adenosine-to-inosine (A-to-I) editing is widespread across the kingdom Metazoa. However, for the lack of comprehensive analysis in nonmodel animals, the evolutionary history of A-to-I editing remains largely unexplored. Here, we detect high-confidence editing sites using clustering and conservation strategies based on RNA sequencing data alone, without using single-nucleotide polymorphism information or genome sequencing data from the same sample. We thereby unveil the first evolutionary landscape of A-to-I editing maps across 20 metazoan species (from worm to human), providing unprecedented evidence on how the editing mechanism gradually expands its territory and increases its influence along the history of evolution. Our result revealed that highly clustered and conserved editing sites tended to have a higher editing level and a higher magnitude of the ADAR motif. The ratio of the frequencies of nonsynonymous editing to that of synonymous editing remarkably increased with increasing the conservation level of A-to-I editing. These results thus suggest potentially functional benefit of highly clustered and conserved editing sites. In addition, spatiotemporal dynamics analyses reveal a conserved enrichment of editing and ADAR expression in the central nervous system throughout more than 300 Myr of divergent evolution in complex animals and the comparability of editing patterns between invertebrates and between vertebrates during development. This study provides evolutionary and dynamic aspects of A-to-I editome across metazoan species, expanding this important but understudied class of nongenomically encoded events for comprehensive characterization. PMID:29294013
The Plasmodium berghei RC strain is highly diverged and harbors putatively novel drug resistance variants

PubMed Central

Kulawonganunchai, Supasak; Wilantho, Alisa; Koonyosying, Pongpisid; Uthaipibull, Chairat

2017-01-01

Background The current first line drugs for treating uncomplicated malaria are artemisinin (ART) combination therapies. However, Plasmodium falciparum parasites resistant to ART and partner drugs are spreading, which threatens malaria control efforts. Rodent malaria species are useful models for understanding antimalarial resistance, in particular genetic variants responsible for cross resistance to different compounds. Methods The Plasmodium berghei RC strain (PbRC) is described as resistant to different antimalarials, including chloroquine (CQ) and ART. In an attempt to identify the genetic basis for the antimalarial resistance trait in PbRC, its genome was sequenced and compared with five other previously sequenced P. berghei strains. Results We found that PbRC is eight-fold less sensitive to the ART derivative artesunate than the reference strain PbANKA. The genome of PbRC is markedly different from other strains, and 6,974 single nucleotide variants private to PbRC were identified. Among these PbRC private variants, non-synonymous changes were identified in genes known to modulate antimalarial sensitivity in rodent malaria species, including notably the ubiquitin carboxyl-terminal hydrolase 1 gene. However, no variants were found in some genes with strong evidence of association with ART resistance in P. falciparum such as K13 propeller protein. Discussion The variants identified in PbRC provide insight into P. berghei genome diversity and genetic factors that could modulate CQ and ART resistance in Plasmodium spp. PMID:29018598

Deciphering the recent phylogenetic expansion of the originally deeply rooted Mycobacterium tuberculosis lineage 7.

PubMed

Yimer, Solomon A; Namouchi, Amine; Zegeye, Ephrem Debebe; Holm-Hansen, Carol; Norheim, Gunnstein; Abebe, Markos; Aseffa, Abraham; Tønjum, Tone

2016-06-30

A deeply rooted phylogenetic lineage of Mycobacterium tuberculosis (M. tuberculosis) termed lineage 7 was discovered in Ethiopia. Whole genome sequencing of 30 lineage 7 strains from patients in Ethiopia was performed. Intra-lineage genome variation was defined and unique characteristics identified with a focus on genes involved in DNA repair, recombination and replication (3R genes). More than 800 mutations specific to M. tuberculosis lineage 7 strains were identified. The proportion of non-synonymous single nucleotide polymorphisms (nsSNPs) in 3R genes was higher after the recent expansion of M. tuberculosis lineage 7 strain started. The proportion of nsSNPs in genes involved in inorganic ion transport and metabolism was significantly higher before the expansion began. A total of 22346 bp deletions were observed. Lineage 7 strains also exhibited a high number of mutations in genes involved in carbohydrate transport and metabolism, transcription, energy production and conversion. We have identified unique genomic signatures of the lineage 7 strains. The high frequency of nsSNP in 3R genes after the phylogenetic expansion may have contributed to recent variability and adaptation. The abundance of mutations in genes involved in inorganic ion transport and metabolism before the expansion period may indicate an adaptive response of lineage 7 strains to enable survival, potentially under environmental stress exposure. As lineage 7 strains originally were phylogenetically deeply rooted, this may indicate fundamental adaptive genomic pathways affecting the fitness of M. tuberculosis as a species.
Integrating transcriptome and genome re-sequencing data to identify key genes and mutations affecting chicken eggshell qualities.

PubMed

Zhang, Quan; Zhu, Feng; Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua

2015-01-01

Eggshell damages lead to economic losses in the egg production industry and are a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways as revealed by gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate the transcriptome and genome re-sequencing to target the genetic variations which decreased the eggshell qualities. These findings further advance our understanding of eggshell calcification in the chicken uterus.
Tracking the roots of cellulase hyperproduction by the fungus Trichoderma reesei using massively parallel DNA sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Le Crom, Stphane; Schackwitz, Wendy; Pennacchiod, Len

2009-09-22

Trichoderma reesei (teleomorph Hypocrea jecorina) is the main industrial source of cellulases and hemicellulases harnessed for the hydrolysis of biomass to simple sugars, which can then be converted to biofuels, such as ethanol, and other chemicals. The highly productive strains in use today were generated by classical mutagenesis. To learn how cellulase production was improved by these techniques, we performed massively parallel sequencing to identify mutations in the genomes of two hyperproducing strains (NG14, and its direct improved descendant, RUT C30). We detected a surprisingly high number of mutagenic events: 223 single nucleotides variants, 15 small deletions or insertions andmore » 18 larger deletions leading to the loss of more than 100 kb of genomic DNA. From these events we report previously undocumented non-synonymous mutations in 43 genes that are mainly involved in nuclear transport, mRNA stability, transcription, secretion/vacuolar targeting, and metabolism. This homogeneity of functional categories suggests that multiple changes are necessary to improve cellulase production and not simply a few clear-cut mutagenic events. Phenotype microarrays show that some of these mutations result in strong changes in the carbon assimilation pattern of the two mutants with respect to the wild type strain QM6a. Our analysis provides the first genome-wide insights into the changes induced by classical mutagenesis in a filamentous fungus, and suggests new areas for the generation of enhanced T. reesei strains for industrial applications such as biofuel production.« less
Whole genome sequencing and integrative genomic analysis approach on two 22q11.2 deletion syndrome family trios for genotype to phenotype correlations

PubMed Central

Chung, Jonathan H.; Cai, Jinlu; Suskin, Barrie G.; Zhang, Zhengdong; Coleman, Karlene

2015-01-01

The 22q11.2 deletion syndrome (22q11DS) affects 1:4000 live births and presents with highly variable phenotype expressivity. In this study, we developed an analytical approach utilizing whole genome sequencing and integrative analysis to discover genetic modifiers. Our pipeline combined available tools in order to prioritize rare, predicted deleterious, coding and non-coding single nucleotide variants (SNVs) and insertion/deletions (INDELs) from whole genome sequencing (WGS). We sequenced two unrelated probands with 22q11DS, with contrasting clinical findings, and their unaffected parents. Proband P1 had cognitive impairment, psychotic episodes, anxiety, and tetralogy of Fallot (TOF); while proband P2 had juvenile rheumatoid arthritis but no other major clinical findings. In P1, we identified common variants in COMT and PRODH on 22q11.2 as well as rare potentially deleterious DNA variants in other behavioral/neurocognitive genes. We also identified a de novo SNV in ADNP2 (NM_014913.3:c.2243G>C), encoding a neuroprotective protein that may be involved in behavioral disorders. In P2, we identified a novel non-synonymous SNV in ZFPM2 (NM_012082.3:c.1576C>T), a known causative gene for TOF, which may act as a protective variant downstream of TBX1, haploinsufficiency of which is responsible for congenital heart disease in individuals with 22q11DS. PMID:25981510
GenProBiS: web server for mapping of sequence variants to protein binding sites.

PubMed

Konc, Janez; Skrlj, Blaz; Erzen, Nika; Kunej, Tanja; Janezic, Dusanka

2017-07-03

Discovery of potentially deleterious sequence variants is important and has wide implications for research and generation of new hypotheses in human and veterinary medicine, and drug discovery. The GenProBiS web server maps sequence variants to protein structures from the Protein Data Bank (PDB), and further to protein-protein, protein-nucleic acid, protein-compound, and protein-metal ion binding sites. The concept of a protein-compound binding site is understood in the broadest sense, which includes glycosylation and other post-translational modification sites. Binding sites were defined by local structural comparisons of whole protein structures using the Protein Binding Sites (ProBiS) algorithm and transposition of ligands from the similar binding sites found to the query protein using the ProBiS-ligands approach with new improvements introduced in GenProBiS. Binding site surfaces were generated as three-dimensional grids encompassing the space occupied by predicted ligands. The server allows intuitive visual exploration of comprehensively mapped variants, such as human somatic mis-sense mutations related to cancer and non-synonymous single nucleotide polymorphisms from 21 species, within the predicted binding sites regions for about 80 000 PDB protein structures using fast WebGL graphics. The GenProBiS web server is open and free to all users at http://genprobis.insilab.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Electron attachment to DNA single strands: gas phase and aqueous solution.

PubMed

Gu, Jiande; Xie, Yaoming; Schaefer, Henry F

2007-01-01

The 2'-deoxyguanosine-3',5'-diphosphate, 2'-deoxyadenosine-3',5'-diphosphate, 2'-deoxycytidine-3',5'-diphosphate and 2'-deoxythymidine-3',5'-diphosphate systems are the smallest units of a DNA single strand. Exploring these comprehensive subunits with reliable density functional methods enables one to approach reasonable predictions of the properties of DNA single strands. With these models, DNA single strands are found to have a strong tendency to capture low-energy electrons. The vertical attachment energies (VEAs) predicted for 3',5'-dTDP (0.17 eV) and 3',5'-dGDP (0.14 eV) indicate that both the thymine-rich and the guanine-rich DNA single strands have the ability to capture electrons. The adiabatic electron affinities (AEAs) of the nucleotides considered here range from 0.22 to 0.52 eV and follow the order 3',5'-dTDP > 3',5'-dCDP > 3',5'-dGDP > 3',5'-dADP. A substantial increase in the AEA is observed compared to that of the corresponding nucleic acid bases and the corresponding nucleosides. Furthermore, aqueous solution simulations dramatically increase the electron attracting properties of the DNA single strands. The present investigation illustrates that in the gas phase, the excess electron is situated both on the nucleobase and on the phosphate moiety for DNA single strands. However, the distribution of the extra negative charge is uneven. The attached electron favors the base moiety for the pyrimidine, while it prefers the 3'-phosphate subunit for the purine DNA single strands. In contrast, the attached electron is tightly bound to the base fragment for the cytidine, thymidine and adenosine nucleotides, while it almost exclusively resides in the vicinity of the 3'-phosphate group for the guanosine nucleotides due to the solvent effects. The comparatively low vertical detachment energies (VDEs) predicted for 3',5'-dADP(-) (0.26 eV) and 3',5'-dGDP(-) (0.32 eV) indicate that electron detachment might compete with reactions having high activation barriers such as glycosidic bond breakage. However, the radical anions of the pyrimidine nucleotides with high VDE are expected to be electronically stable. Thus the base-centered radical anions of the pyrimidine nucleotides might be the possible intermediates for DNA single-strand breakage.
Major histocompatibility complex alleles associated with parasite susceptibility in wild giant pandas.

PubMed

Zhang, L; Wu, Q; Hu, Y; Wu, H; Wei, F

2015-01-01

Major histocompatibility complex (MHC) polymorphism is thought to be driven by antagonistic coevolution between pathogens and hosts, mediated through either overdominance or frequency-dependent selection. However, investigations under natural conditions are still rare for endangered mammals which often exhibit depleted variation, and the mechanism of selection underlying the maintenance of characteristics remains a considerable debate. In this study, 87 wild giant pandas were used to investigate MHC variation associated with parasite load. With the knowledge of the MHC profile provided by the genomic data of the giant panda, seven DRB1, seven DQA1 and eight DQA2 alleles were identified at each single locus. Positive selection evidenced by a significantly higher number of non-synonymous substitutions per non-synonymous codon site relative to synonymous substitutions per synonymous codon site could only be detected at the DRB1 locus, which leads to the speculation that DRB1 may have a more important role in dealing with parasite infection for pandas. Coprological analyses revealed that 55.17% of individuals exhibited infection with 1-2 helminthes and 95.3% of infected pandas carried Baylisascaris shroederi. Using a generalized linear model, we found that Aime-DRB1*10 was significantly associated with parasite infection, but no resistant alleles could be detected. MHC heterozygosity of the pandas was found to be uncorrelated with the infection status or the infection intensity. These results suggested that the possible selection mechanisms in extant wild pandas may be frequency dependent rather than being determined by overdominance selection. Our findings could guide the candidate selection for the ongoing reintroduction or translocation of pandas.
Major histocompatibility complex alleles associated with parasite susceptibility in wild giant pandas

PubMed Central

Zhang, L; Wu, Q; Hu, Y; Wu, H; Wei, F

2015-01-01

Major histocompatibility complex (MHC) polymorphism is thought to be driven by antagonistic coevolution between pathogens and hosts, mediated through either overdominance or frequency-dependent selection. However, investigations under natural conditions are still rare for endangered mammals which often exhibit depleted variation, and the mechanism of selection underlying the maintenance of characteristics remains a considerable debate. In this study, 87 wild giant pandas were used to investigate MHC variation associated with parasite load. With the knowledge of the MHC profile provided by the genomic data of the giant panda, seven DRB1, seven DQA1 and eight DQA2 alleles were identified at each single locus. Positive selection evidenced by a significantly higher number of non-synonymous substitutions per non-synonymous codon site relative to synonymous substitutions per synonymous codon site could only be detected at the DRB1 locus, which leads to the speculation that DRB1 may have a more important role in dealing with parasite infection for pandas. Coprological analyses revealed that 55.17% of individuals exhibited infection with 1–2 helminthes and 95.3% of infected pandas carried Baylisascaris shroederi. Using a generalized linear model, we found that Aime-DRB1*10 was significantly associated with parasite infection, but no resistant alleles could be detected. MHC heterozygosity of the pandas was found to be uncorrelated with the infection status or the infection intensity. These results suggested that the possible selection mechanisms in extant wild pandas may be frequency dependent rather than being determined by overdominance selection. Our findings could guide the candidate selection for the ongoing reintroduction or translocation of pandas. PMID:25248466
Genetic risk profiling and gene signature modeling to predict risk of complications after IPAA.

PubMed

Sehgal, Rishabh; Berg, Arthur; Polinski, Joseph I; Hegarty, John P; Lin, Zhenwu; McKenna, Kevin J; Stewart, David B; Poritz, Lisa S; Koltun, Walter A

2012-03-01

Severe pouchitis and Crohn's disease-like complications are 2 adverse postoperative complications that confound the success of the IPAA in patients with ulcerative colitis. To date, approximately 83 single nucleotide polymorphisms within 55 genes have been associated with IBD. The aim of this study was to identify single-nucleotide polymorphisms that correlate with complications after IPAA that could be utilized in a gene signature fashion to predict postoperative complications and aid in preoperative surgical decision making. One hundred forty-two IPAA patients were retrospectively classified as "asymptomatic" (n = 104, defined as no Crohn's disease-like complications or severe pouchitis for at least 2 years after IPAA) and compared with a "severe pouchitis" group (n = 12, ≥ 4 episodes pouchitis per year for 2 years including the need for long-term therapy to maintain remission) and a "Crohn's disease-like" group (n = 26, presence of fistulae, pouch inlet stricture, proximal small-bowel disease, or pouch granulomata, occurring at least 6 months after surgery). Genotyping for 83 single-nucleotide polymorphisms previously associated with Crohn's disease and/or ulcerative colitis was performed on a customized Illumina genotyping platform. The top 2 single-nucleotide polymorphisms statistically identified as being independently associated with each of Crohn's disease-like and severe pouchitis were used in a multivariate logistic regression model. These single-nucleotide polymorphisms were then used to create probability equations to predict overall chance of a positive or negative outcome for that complication. The top 2 single-nucleotide polymorphisms for Crohn's disease-like complications were in the 10q21 locus and the gene for PTGER4 (p = 0.006 and 0.007), whereas for severe pouchitis it was NOD2 and TNFSF15 (p = 0.003 and 0.011). Probability equations suggested that the risk of these 2 complications greatly increased with increasing number of risk alleles, going as high as 92% for severe pouchitis and 65% for Crohn's disease-like complications. In this IPAA patient cohort, mutations in the 10q21 locus and the PTGER4 gene were associated with Crohn's disease-like complications, whereas mutations in NOD2 and TNFSF15 correlated with severe pouchitis. Preoperative genetic analysis and use of such gene signatures hold promise for improved preoperative surgical patient selection to minimize these IPAA complications.
Pre-steady-state Kinetic Analysis of a Family D DNA Polymerase from Thermococcus sp. 9°N Reveals Mechanisms for Archaeal Genomic Replication and Maintenance*

PubMed Central

Schermerhorn, Kelly M.; Gardner, Andrew F.

2015-01-01

Family D DNA polymerases (polDs) have been implicated as the major replicative polymerase in archaea, excluding the Crenarchaeota branch, and bear little sequence homology to other DNA polymerase families. Here we report a detailed kinetic analysis of nucleotide incorporation and exonuclease activity for a Family D DNA polymerase from Thermococcus sp. 9°N. Pre-steady-state single-turnover nucleotide incorporation assays were performed to obtain the kinetic parameters, kpol and Kd, for correct nucleotide incorporation, incorrect nucleotide incorporation, and ribonucleotide incorporation by exonuclease-deficient polD. Correct nucleotide incorporation kinetics revealed a relatively slow maximal rate of polymerization (kpol ∼2.5 s−1) and especially tight nucleotide binding (Kd(dNTP) ∼1.7 μm), compared with DNA polymerases from Families A, B, C, X, and Y. Furthermore, pre-steady-state nucleotide incorporation assays revealed that polD prevents the incorporation of incorrect nucleotides and ribonucleotides primarily through reduced nucleotide binding affinity. Pre-steady-state single-turnover assays on wild-type 9°N polD were used to examine 3′-5′ exonuclease hydrolysis activity in the presence of Mg2+ and Mn2+. Interestingly, substituting Mn2+ for Mg2+ accelerated hydrolysis rates >40-fold (kexo ≥110 s−1 versus ≥2.5 s−1). Preference for Mn2+ over Mg2+ in exonuclease hydrolysis activity is a property unique to the polD family. The kinetic assays performed in this work provide critical insight into the mechanisms that polD employs to accurately and efficiently replicate the archaeal genome. Furthermore, despite the unique properties of polD, this work suggests that a conserved polymerase kinetic pathway is present in all known DNA polymerase families. PMID:26160179
Comprehensive thermodynamic analysis of 3′ double-nucleotide overhangs neighboring Watson–Crick terminal base pairs

PubMed Central

O'Toole, Amanda S.; Miller, Stacy; Haines, Nathan; Zink, M. Coleen; Serra, Martin J.

2006-01-01

Thermodynamic parameters are reported for duplex formation of 48 self-complementary RNA duplexes containing Watson–Crick terminal base pairs (GC, AU and UA) with all 16 possible 3′ double-nucleotide overhangs; mimicking the structures of short interfering RNAs (siRNA) and microRNAs (miRNA). Based on nearest-neighbor analysis, the addition of a second dangling nucleotide to a single 3′ dangling nucleotide increases stability of duplex formation up to 0.8 kcal/mol in a sequence dependent manner. Results from this study in conjunction with data from a previous study [A. S. O'Toole, S. Miller and M. J. Serra (2005) RNA, 11, 512.] allows for the development of a refined nearest-neighbor model to predict the influence of 3′ double-nucleotide overhangs on the stability of duplex formation. The model improves the prediction of free energy and melting temperature when tested against five oligomers with various core duplex sequences. Phylogenetic analysis of naturally occurring miRNAs was performed to support our results. Selection of the effector miR strand of the mature miRNA duplex appears to be dependent upon the identity of the 3′ double-nucleotide overhang. Thermodynamic parameters for 3′ single terminal overhangs adjacent to a UA pair are also presented. PMID:16820533
Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision.

PubMed

Bao, Zehua; HamediRad, Mohammad; Xue, Pu; Xiao, Han; Tasan, Ipek; Chao, Ran; Liang, Jing; Zhao, Huimin

2018-07-01

We developed a CRISPR-Cas9- and homology-directed-repair-assisted genome-scale engineering method named CHAnGE that can rapidly output tens of thousands of specific genetic variants in yeast. More than 98% of target sequences were efficiently edited with an average frequency of 82%. We validate the single-nucleotide resolution genome-editing capability of this technology by creating a genome-wide gene disruption collection and apply our method to improve tolerance to growth inhibitors.
Mutations that Cause Human Disease: A Computational/Experimental Approach

DOE Office of Scientific and Technical Information (OSTI.GOV)

Beernink, P; Barsky, D; Pesavento, B

International genome sequencing projects have produced billions of nucleotides (letters) of DNA sequence data, including the complete genome sequences of 74 organisms. These genome sequences have created many new scientific opportunities, including the ability to identify sequence variations among individuals within a species. These genetic differences, which are known as single nucleotide polymorphisms (SNPs), are particularly important in understanding the genetic basis for disease susceptibility. Since the report of the complete human genome sequence, over two million human SNPs have been identified, including a large-scale comparison of an entire chromosome from twenty individuals. Of the protein coding SNPs (cSNPs), approximatelymore » half leads to a single amino acid change in the encoded protein (non-synonymous coding SNPs). Most of these changes are functionally silent, while the remainder negatively impact the protein and sometimes cause human disease. To date, over 550 SNPs have been found to cause single locus (monogenic) diseases and many others have been associated with polygenic diseases. SNPs have been linked to specific human diseases, including late-onset Parkinson disease, autism, rheumatoid arthritis and cancer. The ability to predict accurately the effects of these SNPs on protein function would represent a major advance toward understanding these diseases. To date several attempts have been made toward predicting the effects of such mutations. The most successful of these is a computational approach called ''Sorting Intolerant From Tolerant'' (SIFT). This method uses sequence conservation among many similar proteins to predict which residues in a protein are functionally important. However, this method suffers from several limitations. First, a query sequence must have a sufficient number of relatives to infer sequence conservation. Second, this method does not make use of or provide any information on protein structure, which can be used to understand how an amino acid change affects the protein. The experimental methods that provide the most detailed structural information on proteins are X-ray crystallography and NMR spectroscopy. However, these methods are labor intensive and currently cannot be carried out on a genomic scale. Nonetheless, Structural Genomics projects are being pursued by more than a dozen groups and consortia worldwide and as a result the number of experimentally determined structures is rising exponentially. Based on the expectation that protein structures will continue to be determined at an ever-increasing rate, reliable structure prediction schemes will become increasingly valuable, leading to information on protein function and disease for many different proteins. Given known genetic variability and experimentally determined protein structures, can we accurately predict the effects of single amino acid substitutions? An objective assessment of this question would involve comparing predicted and experimentally determined structures, which thus far has not been rigorously performed. The completed research leveraged existing expertise at LLNL in computational and structural biology, as well as significant computing resources, to address this question.« less
Synonymous ABCA3 Variants Do Not Increase Risk for Neonatal Respiratory Distress Syndrome

PubMed Central

Wambach, Jennifer A.; Wegner, Daniel J.; Heins, Hillary B.; Druley, Todd E.; Mitra, Robi D.; Hamvas, Aaron; Cole, F. Sessions

2014-01-01

Objective To determine whether synonymous variants in the adenosine triphosphate-binding cassette A3 transporter (ABCA3) gene increase the risk for neonatal respiratory distress syndrome (RDS) in term and late preterm infants of European and African descent. Study design Using next-generation pooled sequencing of race-stratified DNA samples from infants of European and African descent at $34 weeks gestation with and without RDS (n = 503), we scanned all exons of ABCA3, validated each synonymous variant with an independent genotyping platform, and evaluated race-stratified disease risk associated with common synonymous variants and collapsed frequencies of rare synonymous variants. Results The synonymous ABCA3 variant frequency spectrum differs between infants of European descent and those of African descent. Using in silico prediction programs and statistical strategies, we found no potentially disruptive synonymous ABCA3 variants or evidence of selection pressure. Individual common synonymous variants and collapsed frequencies of rare synonymous variants did not increase disease risk in term and late-preterm infants of European or African descent. Conclusion In contrast to rare, nonsynonymous ABCA3 mutations, synonymous ABCA3 variants do not increase the risk for neonatal RDS among term and late-preterm infants of European or African descent. PMID:24657120
Comparative genomic analysis of the Lipase3 gene family in five plant species reveals distinct evolutionary origins.

PubMed

Wang, Dan; Zhang, Lin; Hu, JunFeng; Gao, Dianshuai; Liu, Xin; Sha, Yan

2018-04-01

Lipases are physiologically important and ubiquitous enzymes that share a conserved domain and are classified into eight different families based on their amino acid sequences and fundamental biological properties. The Lipase3 family of lipases was reported to possess a canonical fold typical of α/β hydrolases and a typical catalytic triad, suggesting a distinct evolutionary origin for this family. Genes in the Lipase3 family do not have the same functions, but maintain the conserved Lipase3 domain. There have been extensive studies of Lipase3 structures and functions, but little is known about their evolutionary histories. In this study, all lipases within five plant species were identified, and their phylogenetic relationships and genetic properties were analyzed and used to group them into distinct evolutionary families. Each identified lipase family contained at least one dicot and monocot Lipase3 protein, indicating that the gene family was established before the split of dicots and monocots. Similar intron/exon numbers and predicted protein sequence lengths were found within individual groups. Twenty-four tandem Lipase3 gene duplications were identified, implying that the distinctive function of Lipase3 genes appears to be a consequence of translocation and neofunctionalization after gene duplication. The functional genes EDS1, PAD4, and SAG101 that are reportedly involved in pathogen response were all located in the same group. The nucleotide diversity (Dxy) and the ratio of nonsynonymous to synonymous nucleotide substitutions rates (Ka/Ks) of the three genes were significantly greater than the average across the genomes. We further observed evidence for selection maintaining diversity on three genes in the Toll-Interleukin-1 receptor type of nucleotide binding/leucine-rich repeat immune receptor (TIR-NBS LRR) immunity-response signaling pathway, indicating that they could be vulnerable to pathogen effectors.
Structural and functional effects of nucleotide variation on the human TB drug metabolizing enzyme arylamine N-acetyltransferase 1.

PubMed

Cloete, Ruben; Akurugu, Wisdom A; Werely, Cedric J; van Helden, Paul D; Christoffels, Alan

2017-08-01

The human arylamine N-acetyltransferase 1 (NAT1) enzyme plays a vital role in determining the duration of action of amine-containing drugs such as para-aminobenzoic acid (PABA) by influencing the balance between detoxification and metabolic activation of these drugs. Recently, four novel single nucleotide polymorphisms (SNPs) were identified within a South African mixed ancestry population. Modeling the effects of these SNPs within the structural protein was done to assess possible structure and function changes in the enzyme. The use of molecular dynamics simulations and stability predictions indicated less thermodynamically stable protein structures containing E264K and V231G, while the N245I change showed a stabilizing effect. Coincidently the N245I change displayed a similar free energy landscape profile to the known R64W amino acid substitution (slow acetylator), while the R242M displayed a similar profile to the published variant, I263V (proposed fast acetylator), and the wild type protein structure. Similarly, principal component analysis indicated that two amino acid substitutions (E264K and V231G) occupied less conformational clusters of folded states as compared to the WT and were found to be destabilizing (may affect protein function). However, two of the four novel SNPs that result in amino acid changes: (V231G and N245I) were predicted by both SIFT and POLYPHEN-2 algorithms to affect NAT1 protein function, while two other SNPs that result in R242M and E264K substitutions showed contradictory results based on SIFT and POLYPHEN-2 analysis. In conclusion, the structural methods were able to verify that two non-synonymous substitutions (E264K and V231G) can destabilize the protein structure, and are in agreement with mCSM predictions, and should therefore be experimentally tested for NAT1 activity. These findings could inform a strategy of incorporating genotypic data (i.e., functional SNP alleles) with phenotypic information (slow or fast acetylator) to better prescribe effective treatment using drugs metabolized by NAT1. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Effects of a novel SNP of IGF2R gene on growth traits and expression rate of IGF2R and IGF2 genes in gluteus medius muscle of Egyptian buffalo.

PubMed

El-Magd, Mohammed Abu; Abo-Al-Ela, Haitham G; El-Nahas, Abeer; Saleh, Ayman A; Mansour, Ali A

2014-05-01

Insulin-like growth factor 2 receptor (IGF2R) is responsible for degradation of the muscle development initiator, IGF2, and thus it can be used as a marker for selection strategies in the farm animals. The aim of this study was to search for polymorphisms in three coding loci of IGF2R, and to analyze their effect on the growth traits and on the expression levels of IGF2R and IGF2 genes in the gluteus medius muscle of Egyptian buffaloes. A novel A266C SNP was detected in the coding sequences of the third IGF2R locus (at nucleotide number 51 of exon 23) among Egyptian water buffaloes. This SNP was non-synonymous mutation and led to replacement of Y (tyrosine) amino acid (aa) by D (aspartic acid) aa. Three different single-strand conformation polymorphism patterns were observed in the third IGF2R locus: AA, AC, and CC with frequencies of 0.555, 0.195, and 0.250, respectively. Statistical analysis showed that the homozygous AA genotype significantly associated with the average daily gain than AC and CC genotypes from birth to 9 mo of age. Expression analysis showed that the A266C SNP was correlated with IGF2, but not with IGF2R, mRNA levels in the gluteus medius muscle of Egyptian buffaloes. The highest IGF2 mRNA level was estimated in the muscle of animals with the AA homozygous genotype as compared to the AC heterozygotes and CC homozygotes. We conclude that A266C SNP at nucleotide number 51 of exon 23 of the IGF2R gene is associated with the ADG during the early stages of life (from birth to 9 mo of age) and this effect is accompanied by, and may be caused by, increased expression levels of the IGF2 gene. Copyright © 2014 Elsevier B.V. All rights reserved.
Common non-synonymous SNPs associated with breast cancer susceptibility: findings from the Breast Cancer Association Consortium.

PubMed

Milne, Roger L; Burwinkel, Barbara; Michailidou, Kyriaki; Arias-Perez, Jose-Ignacio; Zamora, M Pilar; Menéndez-Rodríguez, Primitiva; Hardisson, David; Mendiola, Marta; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Dennis, Joe; Wang, Qin; Bolla, Manjeet K; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk; Ko, Yon-Dschun; Brauch, Hiltrud; Hamann, Ute; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Tajima, Kazuo; Li, Jingmei; Brand, Judith S; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lambrechts, Diether; Peuteman, Gilian; Christiaens, Marie-Rose; Smeets, Ann; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katazyna; Hartman, Mikael; Hui, Miao; Yen Lim, Wei; Wan Chan, Ching; Marme, Federick; Yang, Rongxi; Bugert, Peter; Lindblom, Annika; Margolin, Sara; García-Closas, Montserrat; Chanock, Stephen J; Lissowska, Jolanta; Figueroa, Jonine D; Bojesen, Stig E; Nordestgaard, Børge G; Flyger, Henrik; Hooning, Maartje J; Kriege, Mieke; van den Ouweland, Ans M W; Koppert, Linetta B; Fletcher, Olivia; Johnson, Nichola; dos-Santos-Silva, Isabel; Peto, Julian; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha J; Long, Jirong; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Schmidt, Marjanka K; Broeks, Annegien; Cornelissen, Sten; Braaf, Linde; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Noh, Dong-Young; Simard, Jacques; Dumont, Martine; Goldberg, Mark S; Labrèche, France; Fasching, Peter A; Hein, Alexander; Ekici, Arif B; Beckmann, Matthias W; Radice, Paolo; Peterlongo, Paolo; Azzollini, Jacopo; Barile, Monica; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Hopper, John L; Schmidt, Daniel F; Makalic, Enes; Southey, Melissa C; Hwang Teo, Soo; Har Yip, Cheng; Sivanandan, Kavitta; Tay, Wan-Ting; Shen, Chen-Yang; Hsiung, Chia-Ni; Yu, Jyh-Cherng; Hou, Ming-Feng; Guénel, Pascal; Truong, Therese; Sanchez, Marie; Mulot, Claire; Blot, William; Cai, Qiuyin; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Bogdanova, Natalia; Dörk, Thilo; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Zhang, Ben; Couch, Fergus J; Toland, Amanda E; Yannoukakos, Drakoulis; Sangrajrang, Suleeporn; McKay, James; Wang, Xianshu; Olson, Janet E; Vachon, Celine; Purrington, Kristen; Severi, Gianluca; Baglietto, Laura; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Ahmed, Shahana; Shah, Mitul; Pharoah, Paul D P; Hall, Per; Giles, Graham G; Benítez, Javier; Dunning, Alison M; Chenevix-Trench, Georgia; Easton, Douglas F

2014-11-15

Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility variants, although most studies have been underpowered to detect associations of a realistic magnitude. We assessed 41 common non-synonymous single-nucleotide polymorphisms (nsSNPs) for which evidence of association with breast cancer risk had been previously reported. Case-control data were combined from 38 studies of white European women (46 450 cases and 42 600 controls) and analyzed using unconditional logistic regression. Strong evidence of association was observed for three nsSNPs: ATXN7-K264R at 3p21 [rs1053338, per allele OR = 1.07, 95% confidence interval (CI) = 1.04-1.10, P = 2.9 × 10(-6)], AKAP9-M463I at 7q21 (rs6964587, OR = 1.05, 95% CI = 1.03-1.07, P = 1.7 × 10(-6)) and NEK10-L513S at 3p24 (rs10510592, OR = 1.10, 95% CI = 1.07-1.12, P = 5.1 × 10(-17)). The first two associations reached genome-wide statistical significance in a combined analysis of available data, including independent data from nine genome-wide association studies (GWASs): for ATXN7-K264R, OR = 1.07 (95% CI = 1.05-1.10, P = 1.0 × 10(-8)); for AKAP9-M463I, OR = 1.05 (95% CI = 1.04-1.07, P = 2.0 × 10(-10)). Further analysis of other common variants in these two regions suggested that intronic SNPs nearby are more strongly associated with disease risk. We have thus identified a novel susceptibility locus at 3p21, and confirmed previous suggestive evidence that rs6964587 at 7q21 is associated with risk. The third locus, rs10510592, is located in an established breast cancer susceptibility region; the association was substantially attenuated after adjustment for the known GWAS hit. Thus, each of the associated nsSNPs is likely to be a marker for another, non-coding, variant causally related to breast cancer risk. Further fine-mapping and functional studies are required to identify the underlying risk-modifying variants and the genes through which they act. © The Author 2014. Published by Oxford University Press.
Common non-synonymous SNPs associated with breast cancer susceptibility: findings from the Breast Cancer Association Consortium

PubMed Central

Milne, Roger L.; Burwinkel, Barbara; Michailidou, Kyriaki; Arias-Perez, Jose-Ignacio; Zamora, M. Pilar; Menéndez-Rodríguez, Primitiva; Hardisson, David; Mendiola, Marta; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Dennis, Joe; Wang, Qin; Bolla, Manjeet K.; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk; Ko, Yon-Dschun; Brauch, Hiltrud; Hamann, Ute; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Tchatchou, Sandrine; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Tajima, Kazuo; Li, Jingmei; Brand, Judith S.; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lambrechts, Diether; Peuteman, Gilian; Christiaens, Marie-Rose; Smeets, Ann; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katazyna; Hartman, Mikael; Hui, Miao; Yen Lim, Wei; Wan Chan, Ching; Marme, Federick; Yang, Rongxi; Bugert, Peter; Lindblom, Annika; Margolin, Sara; García-Closas, Montserrat; Chanock, Stephen J.; Lissowska, Jolanta; Figueroa, Jonine D.; Bojesen, Stig E.; Nordestgaard, Børge G.; Flyger, Henrik; Hooning, Maartje J.; Kriege, Mieke; van den Ouweland, Ans M.W.; Koppert, Linetta B.; Fletcher, Olivia; Johnson, Nichola; dos-Santos-Silva, Isabel; Peto, Julian; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha J.; Long, Jirong; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Cox, Angela; Cross, Simon S.; Reed, Malcolm W.R.; Schmidt, Marjanka K.; Broeks, Annegien; Cornelissen, Sten; Braaf, Linde; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K.; Noh, Dong-Young; Simard, Jacques; Dumont, Martine; Goldberg, Mark S.; Labrèche, France; Fasching, Peter A.; Hein, Alexander; Ekici, Arif B.; Beckmann, Matthias W.; Radice, Paolo; Peterlongo, Paolo; Azzollini, Jacopo; Barile, Monica; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Hopper, John L.; Schmidt, Daniel F.; Makalic, Enes; Southey, Melissa C.; Hwang Teo, Soo; Har Yip, Cheng; Sivanandan, Kavitta; Tay, Wan-Ting; Shen, Chen-Yang; Hsiung, Chia-Ni; Yu, Jyh-Cherng; Hou, Ming-Feng; Guénel, Pascal; Truong, Therese; Sanchez, Marie; Mulot, Claire; Blot, William; Cai, Qiuyin; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Wu, Anna H.; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O.; Bogdanova, Natalia; Dörk, Thilo; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Zhang, Ben; Couch, Fergus J.; Toland, Amanda E.; Yannoukakos, Drakoulis; Sangrajrang, Suleeporn; McKay, James; Wang, Xianshu; Olson, Janet E.; Vachon, Celine; Purrington, Kristen; Severi, Gianluca; Baglietto, Laura; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Ahmed, Shahana; Shah, Mitul; Pharoah, Paul D.P.; Hall, Per; Giles, Graham G.; Benítez, Javier; Dunning, Alison M.; Chenevix-Trench, Georgia; Easton, Douglas F.; Berchuck, Andrew; Eeles, Rosalind A.; Olama, Ali Amin Al; Kote-Jarai, Zsofia; Benlloch, Sara; Antoniou, Antonis; McGuffog, Lesley; Offit, Ken; Lee, Andrew; Dicks, Ed; Luccarini, Craig; Tessier, Daniel C.; Bacot, Francois; Vincent, Daniel; LaBoissière, Sylvie; Robidoux, Frederic; Nielsen, Sune F.; Cunningham, Julie M.; Windebank, Sharon A.; Hilker, Christopher A.; Meyer, Jeffrey; Angelakos, Maggie; Maskiell, Judi; van der Schoot, Ellen; Rutgers, Emiel; Verhoef, Senno; Hogervorst, Frans; Boonyawongviroj, Prat; Siriwanarungsan, Pornthep; Schrauder, Michael; Rübner, Matthias; Oeser, Sonja; Landrith, Silke; Williams, Eileen; Ryder-Mills, Elaine; Sargus, Kara; McInerney, Niall; Colleran, Gabrielle; Rowan, Andrew; Jones, Angela; Sohn, Christof; Schneeweiß, Andeas; Bugert, Peter; Álvarez, Núria; Lacey, James; Wang, Sophia; Ma, Huiyan; Lu, Yani; Deapen, Dennis; Pinder, Rich; Lee, Eunjung; Schumacher, Fred; Horn-Ross, Pam; Reynolds, Peggy; Nelson, David; Ziegler, Hartwig; Wolf, Sonja; Hermann, Volker; Lo, Wing-Yee; Justenhoven, Christina; Baisch, Christian; Fischer, Hans-Peter; Brüning, Thomas; Pesch, Beate; Rabstein, Sylvia; Lotz, Anne; Harth, Volker; Heikkinen, Tuomas; Erkkilä, Irja; Aaltonen, Kirsimari; von Smitten, Karl; Antonenkova, Natalia; Hillemanns, Peter; Christiansen, Hans; Myöhänen, Eija; Kemiläinen, Helena; Thorne, Heather; Niedermayr, Eveline; Bowtell, D; Chenevix-Trench, G; deFazio, A; Gertig, D; Green, A; Webb, P; Green, A.; Parsons, P.; Hayward, N.; Webb, P.; Whiteman, D.; Fung, Annie; Yashiki, June; Peuteman, Gilian; Smeets, Dominiek; Brussel, Thomas Van; Corthouts, Kathleen; Obi, Nadia; Heinz, Judith; Behrens, Sabine; Eilber, Ursula; Celik, Muhabbet; Olchers, Til; Manoukian, Siranoush; Peissel, Bernard; Scuvera, Giulietta; Zaffaroni, Daniela; Bonanni, Bernardo; Feroce, Irene; Maniscalco, Angela; Rossi, Alessandra; Bernard, Loris; Tranchant, Martine; Valois, Marie-France; Turgeon, Annie; Heguy, Lea; Sze Yee, Phuah; Kang, Peter; Nee, Kang In; Mariapun, Shivaani; Sook-Yee, Yoon; Lee, Daphne; Ching, Teh Yew; Taib, Nur Aishah Mohd; Otsukka, Meeri; Mononen, Kari; Selander, Teresa; Weerasooriya, Nayana; staff, OFBCR; Krol-Warmerdam, E.; Molenaar, J.; Blom, J.; Brinton, Louise; Szeszenia-Dabrowska, Neonila; Peplonska, Beata; Zatonski, Witold; Chao, Pei; Stagner, Michael; Bos, Petra; Blom, Jannet; Crepin, Ellen; Nieuwlaat, Anja; Heemskerk, Annette; Higham, Sue; Cross, Simon; Cramp, Helen; Connley, Dan; Balasubramanian, Sabapathy; Brock, Ian; Luccarini, Craig; Conroy, Don; Baynes, Caroline; Chua, Kimberley

2014-01-01

Candidate variant association studies have been largely unsuccessful in identifying common breast cancer susceptibility variants, although most studies have been underpowered to detect associations of a realistic magnitude. We assessed 41 common non-synonymous single-nucleotide polymorphisms (nsSNPs) for which evidence of association with breast cancer risk had been previously reported. Case-control data were combined from 38 studies of white European women (46 450 cases and 42 600 controls) and analyzed using unconditional logistic regression. Strong evidence of association was observed for three nsSNPs: ATXN7-K264R at 3p21 [rs1053338, per allele OR = 1.07, 95% confidence interval (CI) = 1.04–1.10, P = 2.9 × 10−6], AKAP9-M463I at 7q21 (rs6964587, OR = 1.05, 95% CI = 1.03–1.07, P = 1.7 × 10−6) and NEK10-L513S at 3p24 (rs10510592, OR = 1.10, 95% CI = 1.07–1.12, P = 5.1 × 10−17). The first two associations reached genome-wide statistical significance in a combined analysis of available data, including independent data from nine genome-wide association studies (GWASs): for ATXN7-K264R, OR = 1.07 (95% CI = 1.05–1.10, P = 1.0 × 10−8); for AKAP9-M463I, OR = 1.05 (95% CI = 1.04–1.07, P = 2.0 × 10−10). Further analysis of other common variants in these two regions suggested that intronic SNPs nearby are more strongly associated with disease risk. We have thus identified a novel susceptibility locus at 3p21, and confirmed previous suggestive evidence that rs6964587 at 7q21 is associated with risk. The third locus, rs10510592, is located in an established breast cancer susceptibility region; the association was substantially attenuated after adjustment for the known GWAS hit. Thus, each of the associated nsSNPs is likely to be a marker for another, non-coding, variant causally related to breast cancer risk. Further fine-mapping and functional studies are required to identify the underlying risk-modifying variants and the genes through which they act. PMID:24943594
Hot-spot KIF5A mutations cause familial ALS

PubMed Central

Yilmaz, Rüstem; Müller, Kathrin; Grehl, Torsten; Petri, Susanne; Meyer, Thomas; Grosskreutz, Julian; Weydt, Patrick; Ruf, Wolfgang; Neuwirth, Christoph; Weber, Markus; Pinto, Susana; Claeys, Kristl G; Schrank, Berthold; Jordan, Berit; Knehr, Antje; Günther, Kornelia; Hübers, Annemarie; Zeller, Daniel; Kubisch, Christian; Jablonka, Sibylle; Klopstock, Thomas; de Carvalho, Mamede; Sperfeld, Anne; Borck, Guntram; Volk, Alexander E; Dorst, Johannes; Weis, Joachim; Otto, Markus; Schuster, Joachim; Del Tredici, Kelly; Braak, Heiko; Danzer, Karin M; Freischmidt, Axel; Meitinger, Thomas; Strom, Tim M; Ludolph, Albert C; Andersen, Peter M; Weishaupt, Jochen H; Weyen, Ute; Hermann, Andreas; Hagenacker, Tim; Koch, Jan Christoph; Lingor, Paul; Göricke, Bettina; Zierz, Stephan; Baum, Petra; Wolf, Joachim; Winkler, Andrea; Young, Peter; Bogdahn, Ulrich; Prudlo, Johannes; Kassubek, Jan

2018-01-01

Abstract Heterozygous missense mutations in the N-terminal motor or coiled-coil domains of the kinesin family member 5A (KIF5A) gene cause monogenic spastic paraplegia (HSP10) and Charcot-Marie-Tooth disease type 2 (CMT2). Moreover, heterozygous de novo frame-shift mutations in the C-terminal domain of KIF5A are associated with neonatal intractable myoclonus, a neurodevelopmental syndrome. These findings, together with the observation that many of the disease genes associated with amyotrophic lateral sclerosis disrupt cytoskeletal function and intracellular transport, led us to hypothesize that mutations in KIF5A are also a cause of amyotrophic lateral sclerosis. Using whole exome sequencing followed by rare variant analysis of 426 patients with familial amyotrophic lateral sclerosis and 6137 control subjects, we detected an enrichment of KIF5A splice-site mutations in amyotrophic lateral sclerosis (2/426 compared to 0/6137 in controls; P = 4.2 × 10−3), both located in a hot-spot in the C-terminus of the protein and predicted to affect splicing exon 27. We additionally show co-segregation with amyotrophic lateral sclerosis of two canonical splice-site mutations in two families. Investigation of lymphoblast cell lines from patients with KIF5A splice-site mutations revealed the loss of mutant RNA expression and suggested haploinsufficiency as the most probable underlying molecular mechanism. Furthermore, mRNA sequencing of a rare non-synonymous missense mutation (predicting p.Arg1007Gly) located in the C-terminus of the protein shortly upstream of the splice donor of exon 27 revealed defective KIF5A pre-mRNA splicing in respective patient-derived cell lines owing to abrogation of the donor site. Finally, the non-synonymous single nucleotide variant rs113247976 (minor allele frequency = 1.00% in controls, n = 6137), also located in the C-terminal region [p.(Pro986Leu) in exon 26], was significantly enriched in familial amyotrophic lateral sclerosis patients (minor allele frequency = 3.40%; P = 1.28 × 10−7). Our study demonstrates that mutations located specifically in a C-terminal hotspot of KIF5A can cause a classical amyotrophic lateral sclerosis phenotype, and underline the involvement of intracellular transport processes in amyotrophic lateral sclerosis pathogenesis. PMID:29342275

Some links on this page may take you to non-federal websites. Their policies may differ from this site.