Analysis of single nucleotide polymorphisms in case-control studies.
Li, Yonghong; Shiffman, Dov; Oberbauer, Rainer
2011-01-01
Single nucleotide polymorphisms (SNPs) are the most common type of genetic variants in the human genome. SNPs are known to modify susceptibility to complex diseases. We describe and discuss methods used to identify SNPs associated with disease in case-control studies. An outline on study population selection, sample collection and genotyping platforms is presented, complemented by SNP selection, data preprocessing and analysis.
Allen, Alexandra M; Barker, Gary L A; Berry, Simon T; Coghill, Jane A; Gwilliam, Rhian; Kirby, Susan; Robinson, Phil; Brenchley, Rachel C; D'Amore, Rosalinda; McKenzie, Neil; Waite, Darren; Hall, Anthony; Bevan, Michael; Hall, Neil; Edwards, Keith J
2011-12-01
Food security is a global concern and substantial yield increases in cereal crops are required to feed the growing world population. Wheat is one of the three most important crops for human and livestock feed. However, the complexity of the genome coupled with a decline in genetic diversity within modern elite cultivars has hindered the application of marker-assisted selection (MAS) in breeding programmes. A crucial step in the successful application of MAS in breeding programmes is the development of cheap and easy to use molecular markers, such as single-nucleotide polymorphisms. To mine selected elite wheat germplasm for intervarietal single-nucleotide polymorphisms, we have used expressed sequence tags derived from public sequencing programmes and next-generation sequencing of normalized wheat complementary DNA libraries, in combination with a novel sequence alignment and assembly approach. Here, we describe the development and validation of a panel of 1114 single-nucleotide polymorphisms in hexaploid bread wheat using competitive allele-specific polymerase chain reaction genotyping technology. We report the genotyping results of these markers on 23 wheat varieties, selected to represent a broad cross-section of wheat germplasm including a number of elite UK varieties. Finally, we show that, using relatively simple technology, it is possible to rapidly generate a linkage map containing several hundred single-nucleotide polymorphism markers in the doubled haploid mapping population of Avalon × Cadenza. © 2011 The Authors. Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.
USDA-ARS?s Scientific Manuscript database
Unfavorable genetic correlations between production and fertility traits are well documented. Genetic selection for fertility traits is slow, however, due to low heritabilities. Identification of single nucleotide polymorphisms (SNP) involved in reproduction could improve reliability of genomic esti...
USDA-ARS?s Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
Chen, Sherry Xi; Seelig, Georg
2016-04-20
Even a single-nucleotide difference between the sequences of two otherwise identical biological nucleic acids can have dramatic functional consequences. Here, we use model-guided reaction pathway engineering to quantitatively improve the performance of selective hybridization probes in recognizing single nucleotide variants (SNVs). Specifically, we build a detection system that combines discrimination by competition with DNA strand displacement-based catalytic amplification. We show, both mathematically and experimentally, that the single nucleotide selectivity of such a system in binding to single-stranded DNA and RNA is quadratically better than discrimination due to competitive hybridization alone. As an additional benefit the integrated circuit inherits the property of amplification and provides at least 10-fold better sensitivity than standard hybridization probes. Moreover, we demonstrate how the detection mechanism can be tuned such that the detection reaction is agnostic to the position of the SNV within the target sequence. in contrast, prior strand displacement-based probes designed for kinetic discrimination are highly sensitive to position effects. We apply our system to reliably discriminate between different members of the let-7 microRNA family that differ in only a single base position. Our results demonstrate the power of systematic reaction network design to quantitatively improve biotechnology.
USDA-ARS?s Scientific Manuscript database
Large datasets containing single nucleotide polymorphisms (SNPs) are used to analyze genome-wide diversity in a robust collection of cultivars from representative accessions, across the world. The extent of linkage disequilibrium (LD) within a population determines the number of markers required fo...
Nelson, Chase W; Moncla, Louise H; Hughes, Austin L
2015-11-15
New applications of next-generation sequencing technologies use pools of DNA from multiple individuals to estimate population genetic parameters. However, no publicly available tools exist to analyse single-nucleotide polymorphism (SNP) calling results directly for evolutionary parameters important in detecting natural selection, including nucleotide diversity and gene diversity. We have developed SNPGenie to fill this gap. The user submits a FASTA reference sequence(s), a Gene Transfer Format (.GTF) file with CDS information and a SNP report(s) in an increasing selection of formats. The program estimates nucleotide diversity, distance from the reference and gene diversity. Sites are flagged for multiple overlapping reading frames, and are categorized by polymorphism type: nonsynonymous, synonymous, or ambiguous. The results allow single nucleotide, single codon, sliding window, whole gene and whole genome/population analyses that aid in the detection of positive and purifying natural selection in the source population. SNPGenie version 1.2 is a Perl program with no additional dependencies. It is free, open-source, and available for download at https://github.com/hugheslab/snpgenie. nelsoncw@email.sc.edu or austin@biol.sc.edu Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Molee, A.; Kongroi, K.; Kuadsantia, P.; Poompramun, C.; Likitdecharote, B.
2016-01-01
The aim of the present study was to investigate the effect of single nucleotide polymorphisms in the major histocompatibility complex (MHC) class II gene on resistance to Newcastle disease virus and body weight of the Thai indigenous chicken, Leung Hang Khao (Gallus gallus domesticus). Blood samples were collected for single nucleotide polymorphism analysis from 485 chickens. Polymerase chain reaction sequencing was used to classify single nucleotide polymorphisms of class II MHC. Body weights were measured at the ages of 3, 4, 5, and 7 months. Titres of Newcastle disease virus at 2 weeks to 7 months were determined and the correlation between body weight and titre was analysed. The association between single nucleotide polymorphisms and body weight and titre were analysed by a generalized linear model. Seven single nucleotide polymorphisms were identified: C125T, A126T, C209G, C242T, A243T, C244T, and A254T. Significant correlations between log titre and body weight were found at 2 and 4 weeks. Associations between single nucleotide polymorphisms and titre were found for C209G and A254T, and between all single nucleotide polymorphisms (except A243T) and body weight. The results showed that class II MHC is associated with both titre of Newcastle disease virus and body weight in Leung Hang Khao chickens. This is of concern because improved growth traits are the main goal of breeding selection. Moreover, the results suggested that MHC has a pleiotropic effect on the titre and growth performance. This mechanism should be investigated in a future study. PMID:26732325
Khrustaleva, A M; Gritsenko, O F; Klovach, N V
2013-11-01
The genetic polymorphism of 45 single-nucleotide polymorphism loci was examined in the four largest wild populations of sockeye salmon Oncorhynchusnerka from drainages of the Asian coast of the Pacific Ocean (Eastern and Western Kamchatka). It was demonstrated that sockeye salmon from the Palana River were considerably different from all other populations examined. The most probable explanation of the observed differences is the suggestion on possible demographic events in the history of this population associated with the decrease in its effective number. To study the origin, colonization patterns, and evolution of Asian sockeye salmon, as well as to resolve some of the applied tasks, like population assignment and genetic identification, a differentiation approach to SNP-marker selection was suggested. Adaptively important loci that evolve under the pressure of balancing (stabilizing) selection were identified, thanks to which the number of loci that provide the baseline classification error rates in the population assignment tests was reduced to 30. It was demonstrated that SNPs located in the MHC2 and GPH genes were affected by diversifying selection. Procedures for selecting single-nucleotide polymorphisms for phylogenetic studies of Asian sockeye salmon were suggested. Using principal-component analysis, 17 loci that adequately reproduce genetic differentiation within arid among the regions of the origin of Kamchatka sockeye salmon, were selected.
Lima, L S; Gramacho, K P; Carels, N; Novais, R; Gaiotto, F A; Lopes, U V; Gesteira, A S; Zaidan, H A; Cascardo, J C M; Pires, J L; Micheli, F
2009-07-14
In order to increase the efficiency of cacao tree resistance to witches' broom disease, which is caused by Moniliophthora perniciosa (Tricholomataceae), we looked for molecular markers that could help in the selection of resistant cacao genotypes. Among the different markers useful for developing marker-assisted selection, single nucleotide polymorphisms (SNPs) constitute the most common type of sequence difference between alleles and can be easily detected by in silico analysis from expressed sequence tag libraries. We report the first detection and analysis of SNPs from cacao-M. perniciosa interaction expressed sequence tags, using bioinformatics. Selection based on analysis of these SNPs should be useful for developing cacao varieties resistant to this devastating disease.
Choudhry, Shweta; Baskin, Laurence S; Lammer, Edward J; Witte, John S; Dasgupta, Sudeshna; Ma, Chen; Surampalli, Abhilasha; Shen, Joel; Shaw, Gary M; Carmichael, Suzan L
2015-05-01
Estrogenic endocrine disruptors acting via estrogen receptors α (ESR1) and β (ESR2) have been implicated in the etiology of hypospadias, a common congenital malformation of the male external genitalia. We determined the association of single nucleotide polymorphisms in ESR1 and ESR2 genes with hypospadias in a racially/ethnically diverse study population of California births. We investigated the relationship between hypospadias and 108 ESR1 and 36 ESR2 single nucleotide polymorphisms in 647 cases and 877 population based nonmalformed controls among infants born in selected California counties from 1990 to 2003. Subgroup analyses were performed by race/ethnicity (nonHispanic white and Hispanic subjects) and by hypospadias severity (mild to moderate and severe). Odds ratios for 33 of the 108 ESR1 single nucleotide polymorphisms had p values less than 0.05 (p = 0.05 to 0.007) for risk of hypospadias. However, none of the 36 ESR2 single nucleotide polymorphisms was significantly associated. In stratified analyses the association results were consistent by disease severity but different sets of single nucleotide polymorphisms were significantly associated with hypospadias in nonHispanic white and Hispanic subjects. Due to high linkage disequilibrium across the single nucleotide polymorphisms, haplotype analyses were conducted and identified 6 haplotype blocks in ESR1 gene that had haplotypes significantly associated with an increased risk of hypospadias (OR 1.3 to 1.8, p = 0.04 to 0.00001). Similar to single nucleotide polymorphism analysis, different ESR1 haplotypes were associated with risk of hypospadias in nonHispanic white and Hispanic subjects. No significant haplotype association was observed for ESR2. The data provide evidence that ESR1 single nucleotide polymorphisms and haplotypes influence the risk of hypospadias in white and Hispanic subjects, and warrant further examination in other study populations. Copyright © 2015 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
3'-End labeling of nucleic acids by a polymerase ribozyme.
Samanta, Biswajit; Horning, David P; Joyce, Gerald F
2018-06-13
A polymerase ribozyme can be used to label the 3' end of RNA or DNA molecules by incorporating a variety of functionalized nucleotide analogs. Guided by a complementary template, the ribozyme adds a single nucleotide that may contain a fluorophore, biotin, azide or alkyne moiety, thus enabling the detection and/or capture of selectively labeled materials. Employing a variety of commercially available nucleotide analogs, efficient labeling was demonstrated for model RNAs and DNAs, human microRNAs and natural tRNA.
Huang, C.; Chien, M.S.; Landolt, M.L.; Batts, W.; Winton, J.
1996-01-01
Twelve neutralizing monoclonal antibodies (MAbs) against the fish rhabdovirus, infectious haematopoietic necrosis virus (IHNV), were used to select 20 MAb escape mutants. The nucleotide sequence of the entire glycoprotein (G) gene was determined for six mutants representing differing cross-neutralization patterns and each had a single nucleotide change leading to a single amino acid substitution within one of three regions of the protein. These data were used to design nested PCR primers to amplify portions of the G gene of the 14 remaining mutants. When the PCR products from these mutants were sequenced, they also had single nucleotide substitutions coding for amino acid substitutions at the same, or nearby, locations. Of the 20 mutants for which all or part of the glycoprotein gene was sequenced, two MAbs selected mutants with substitutions at amino acids 230-231 (antigenic site I) and the remaining MAbs selected mutants with substitutions at amino acids 272-276 (antigenic site II). Two MAbs that selected mutants mapping to amino acids 272-276, selected other mutants that mapped to amino acids 78-81, raising the possibility that this portion of the N terminus of the protein was part of a discontinuous epitope defining antigenic site II. CLUSTAL alignment of the glycoproteins of rabies virus, vesicular stomatitis virus and IHNV revealed similarities in the location of the neutralizing epitopes and a high degree of conservation among cysteine residues, indicating that the glycoproteins of three different genera of animal rhabdoviruses may share a similar three-dimensional structure in spite of extensive sequence divergence.
Nucleotide Selectivity in Abiotic RNA Polymerization Reactions.
Coari, Kristin M; Martin, Rebecca C; Jain, Kopal; McGown, Linda B
2017-09-01
In order to establish an RNA world on early Earth, the nucleotides must form polymers through chemical rather than biochemical reactions. The polymerization products must be long enough to perform catalytic functions, including self-replication, and to preserve genetic information. These functions depend not only on the length of the polymers, but also on their sequences. To date, studies of abiotic RNA polymerization generally have focused on routes to polymerization of a single nucleotide and lengths of the homopolymer products. Less work has been done the selectivity of the reaction toward incorporation of some nucleotides over others in nucleotide mixtures. Such information is an essential step toward understanding the chemical evolution of RNA. To address this question, in the present work RNA polymerization reactions were performed in the presence of montmorillonite clay catalyst. The nucleotides included the monophosphates of adenosine, cytosine, guanosine, uridine and inosine. Experiments included reactions of mixtures of an imidazole-activated nucleotide (ImpX) with one or more unactivated nucleotides (XMP), of two or more ImpX, and of XMP that were activated in situ in the polymerization reaction itself. The reaction products were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to identify the lengths and nucleotide compositions of the polymerization products. The results show that the extent of polymerization, the degree of heteropolymerization vs. homopolymerization, and the composition of the polymeric products all vary among the different nucleotides and depend upon which nucleotides and how many different nucleotides are present in the mixture.
Nucleotide Selectivity in Abiotic RNA Polymerization Reactions
NASA Astrophysics Data System (ADS)
Coari, Kristin M.; Martin, Rebecca C.; Jain, Kopal; McGown, Linda B.
2017-09-01
In order to establish an RNA world on early Earth, the nucleotides must form polymers through chemical rather than biochemical reactions. The polymerization products must be long enough to perform catalytic functions, including self-replication, and to preserve genetic information. These functions depend not only on the length of the polymers, but also on their sequences. To date, studies of abiotic RNA polymerization generally have focused on routes to polymerization of a single nucleotide and lengths of the homopolymer products. Less work has been done the selectivity of the reaction toward incorporation of some nucleotides over others in nucleotide mixtures. Such information is an essential step toward understanding the chemical evolution of RNA. To address this question, in the present work RNA polymerization reactions were performed in the presence of montmorillonite clay catalyst. The nucleotides included the monophosphates of adenosine, cytosine, guanosine, uridine and inosine. Experiments included reactions of mixtures of an imidazole-activated nucleotide (ImpX) with one or more unactivated nucleotides (XMP), of two or more ImpX, and of XMP that were activated in situ in the polymerization reaction itself. The reaction products were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to identify the lengths and nucleotide compositions of the polymerization products. The results show that the extent of polymerization, the degree of heteropolymerization vs. homopolymerization, and the composition of the polymeric products all vary among the different nucleotides and depend upon which nucleotides and how many different nucleotides are present in the mixture.
Grotegut, Chad A; Ngan, Emily; Garrett, Melanie E; Miranda, Marie Lynn; Ashley-Koch, Allison E; Swamy, Geeta K
2017-09-01
Oxytocin is a potent uterotonic agent that is widely used for induction and augmentation of labor. Oxytocin has a narrow therapeutic index and the optimal dosing for any individual woman varies widely. The objective of this study was to determine whether genetic variation in the oxytocin receptor (OXTR) or in the gene encoding G protein-coupled receptor kinase 6 (GRK6), which regulates desensitization of the oxytocin receptor, could explain variation in oxytocin dosing and labor outcomes among women being induced near term. Pregnant women with a singleton gestation residing in Durham County, NC, were prospectively enrolled as part of the Healthy Pregnancy, Healthy Baby cohort study. Those women undergoing an induction of labor at 36 weeks or greater were genotyped for 18 haplotype-tagging single-nucleotide polymorphisms in OXTR and 7 haplotype-tagging single-nucleotide polymorphisms in GRK6 using TaqMan assays. Linear regression was used to examine the relationship between maternal genotype and maximal oxytocin infusion rate, total oxytocin dose received, and duration of labor. Logistic regression was used to test for the association of maternal genotype with mode of delivery. For each outcome, backward selection techniques were utilized to control for important confounding variables and additive genetic models were used. Race/ethnicity was included in all models because of differences in allele frequencies across populations, and Bonferroni correction for multiple testing was used. DNA was available from 482 women undergoing induction of labor at 36 weeks or greater. Eighteen haplotype-tagging single-nucleotide polymorphisms within OXTR and 7 haplotype-tagging single-nucleotide polymorphisms within GRK6 were examined. Five single-nucleotide polymorphisms in OXTR showed nominal significance with maximal infusion rate of oxytocin, and two single-nucleotide polymorphisms in OXTR were associated with total oxytocin dose received. One single-nucleotide polymorphism in OXTR and two single-nucleotide polymorphisms in GRK6 were associated with duration of labor, one of which met the multiple testing threshold (P = .0014, rs2731664 [GRK6], mean duration of labor, 17.7 hours vs 20.2 hours vs 23.5 hours for AA, AC, and CC genotypes, respectively). Three single-nucleotide polymorphisms, two in OXTR and one in GRK6, showed nominal significance with mode of delivery. Genetic variation in OXTR and GRK6 is associated with the amount of oxytocin required as well as the duration of labor and risk for cesarean delivery among women undergoing induction of labor near term. With further research, pharmacogenomic approaches may potentially be utilized to develop personalized treatment to improve safety and efficacy outcomes among women undergoing induction of labor. Copyright © 2017 Elsevier Inc. All rights reserved.
USDA-ARS?s Scientific Manuscript database
Genotyping by sequencing (GBS) technology was used to identify a set of 9,933 single nucleotide polymorphism (SNP) markers for constructing a high-resolution genetic map of 1,087 cM for watermelon. The genome-wide variation of recombination rate (GWRR) across the map was evaluated and a positive co...
Kurushima, J. D.; Lipinski, M. J.; Gandolfi, B.; Froenicke, L.; Grahn, J. C.; Grahn, R. A.; Lyons, L. A.
2012-01-01
Summary Both cat breeders and the lay public have interests in the origins of their pets, not only in the genetic identity of the purebred individuals, but also the historical origins of common household cats. The cat fancy is a relatively new institution with over 85% of its 40–50 breeds arising only in the past 75 years, primarily through selection on single-gene aesthetic traits. The short, yet intense cat breed history poses a significant challenge to the development of a genetic marker-based breed identification strategy. Using different breed assignment strategies and methods, 477 cats representing 29 fancy breeds were analysed with 38 short tandem repeats, 148 intergenic and five phenotypic single nucleotide polymorphisms. Results suggest the frequentist method of Paetkau (accuracy single nucleotide polymorphisms = 0.78, short tandem repeats = 0.88) surpasses the Bayesian method of Rannala and Mountain (single nucleotide polymorphisms = 0.56, short tandem repeats = 0.83) for accurate assignment of individuals to the correct breed. Additionally, a post-assignment verification step with the five phenotypic single nucleotide polymorphisms accurately identified between 0.31 and 0.58 of the mis-assigned individuals raising the sensitivity of assignment with the frequentist method to 0.89 and 0.92 single nucleotide polymorphisms and short tandem repeats respectively. This study provides a novel multi-step assignment strategy and suggests that, despite their short breed history and breed family groupings, a majority of cats can be assigned to their proper breed or population of origin, i.e. race. PMID:23171373
Sub-micro-liter Electrochemical Single-Nucleotide-Polymorphism Detector for Lab-on-a-Chip System
NASA Astrophysics Data System (ADS)
Tanaka, Hiroyuki; Fiorini, Paolo; Peeters, Sara; Majeed, Bivragh; Sterken, Tom; de Beeck, Maaike Op; Hayashi, Miho; Yaku, Hidenobu; Yamashita, Ichiro
2012-04-01
A sub-micro-liter single-nucleotide-polymorphism (SNP) detector for lab-on-a-chip applications is developed. This detector enables a fast, sensitive, and selective SNP detection directly from human blood. The detector is fabricated on a Si substrate by a standard complementary metal oxide semiconductor/micro electro mechanical systems (CMOS/MEMS) process and Polydimethylsiloxane (PDMS) molding. Stable and reproducible measurements are obtained by implementing an on-chip Ag/AgCl electrode and encapsulating the detector. The detector senses the presence of SNPs by measuring the concentration of pyrophosphoric acid generated during selective DNA amplification. A 0.5-µL-volume detector enabled the successful performance of the typing of a SNP within the ABO gene using human blood. The measured sensitivity is 566 pA/µM.
Genomic selection in dairy cattle: the USDA experience
USDA-ARS?s Scientific Manuscript database
Genomic selection has revolutionized dairy cattle breeding. Since 2000, assays have been developed to genotype large numbers of single nucleotide polymorphisms (SNP) at relatively low cost. The first commercial SNP genotyping chip was released with a set of 54,001 SNP in December 2007. Over 15,000 ...
E6 and E7 Gene Polymorphisms in Human Papillomavirus Types-58 and 33 Identified in Southwest China
Wen, Qiang; Wang, Tao; Mu, Xuemei; Chenzhang, Yuwei; Cao, Man
2017-01-01
Cancer of the cervix is associated with infection by certain types of human papillomavirus (HPV). The gene variants differ in immune responses and oncogenic potential. The E6 and E7 proteins encoded by high-risk HPV play a key role in cellular transformation. HPV-33 and HPV-58 types are highly prevalent among Chinese women. To study the gene intratypic variations, polymorphisms and positive selections of HPV-33 and HPV-58 E6/E7 in southwest China, HPV-33 (E6, E7: n = 216) and HPV-58 (E6, E7: n = 405) E6 and E7 genes were sequenced and compared to others submitted to GenBank. Phylogenetic trees were constructed by Maximum-likelihood and the Kimura 2-parameters methods by MEGA 6 (Molecular Evolutionary Genetics Analysis version 6.0). The diversity of secondary structure was analyzed by PSIPred software. The selection pressures acting on the E6/E7 genes were estimated by PAML 4.8 (Phylogenetic Analyses by Maximun Likelihood version4.8) software. The positive sites of HPV-33 and HPV-58 E6/E7 were contrasted by ClustalX 2.1. Among 216 HPV-33 E6 sequences, 8 single nucleotide mutations were observed with 6/8 non-synonymous and 2/8 synonymous mutations. The 216 HPV-33 E7 sequences showed 3 single nucleotide mutations that were non-synonymous. The 405 HPV-58 E6 sequences revealed 8 single nucleotide mutations with 4/8 non-synonymous and 4/8 synonymous mutations. Among 405 HPV-58 E7 sequences, 13 single nucleotide mutations were observed with 10/13 non-synonymous mutations and 3/13 synonymous mutations. The selective pressure analysis showed that all HPV-33 and 4/6 HPV-58 E6/E7 major non-synonymous mutations were sites of positive selection. All variations were observed in sites belonging to major histocompatibility complex and/or B-cell predicted epitopes. K93N and R145 (I/N) were observed in both HPV-33 and HPV-58 E6. PMID:28141822
Single-Molecule Counting of Point Mutations by Transient DNA Binding
NASA Astrophysics Data System (ADS)
Su, Xin; Li, Lidan; Wang, Shanshan; Hao, Dandan; Wang, Lei; Yu, Changyuan
2017-03-01
High-confidence detection of point mutations is important for disease diagnosis and clinical practice. Hybridization probes are extensively used, but are hindered by their poor single-nucleotide selectivity. Shortening the length of DNA hybridization probes weakens the stability of the probe-target duplex, leading to transient binding between complementary sequences. The kinetics of probe-target binding events are highly dependent on the number of complementary base pairs. Here, we present a single-molecule assay for point mutation detection based on transient DNA binding and use of total internal reflection fluorescence microscopy. Statistical analysis of single-molecule kinetics enabled us to effectively discriminate between wild type DNA sequences and single-nucleotide variants at the single-molecule level. A higher single-nucleotide discrimination is achieved than in our previous work by optimizing the assay conditions, which is guided by statistical modeling of kinetics with a gamma distribution. The KRAS c.34 A mutation can be clearly differentiated from the wild type sequence (KRAS c.34 G) at a relative abundance as low as 0.01% mutant to WT. To demonstrate the feasibility of this method for analysis of clinically relevant biological samples, we used this technology to detect mutations in single-stranded DNA generated from asymmetric RT-PCR of mRNA from two cancer cell lines.
Yang, Yong; Wu, Zhihong; Zhao, Taimao; Wang, Hai; Zhao, Dong; Zhang, Jianguo; Wang, Yipeng; Ding, Yaozhong; Qiu, Guixing
2009-06-01
The etiology of adolescent idiopathic scoliosis is undetermined despite years of research. A number of hypotheses have been postulated to explain its development, including growth abnormalities. The irregular expression of growth hormone and insulin-like growth factor-1 (IGF-1) may disturb hormone metabolism, result in a gross asymmetry, and promote the progress of adolescent idiopathic scoliosis. Initial association studies in complex diseases have demonstrated the power of candidate gene association. Prior to our study, 1 study in this field had a negative result. A replicable study is vital for reliability. To determine the relationship of growth hormone receptor and IGF-1 genes with adolescent idiopathic scoliosis, a population-based association study was performed. Single nucleotide polymorphisms with potential function were selected from candidate genes and a distribution analysis was performed. A conclusion was made confirming the insufficiency of an association between adolescent idiopathic scoliosis and the single-nucleotide polymorphism of the growth hormone receptor and IGF-1 genes in Han Chinese.
Selection and Management of DNA Markers for Use in Genomic Evaluation
USDA-ARS?s Scientific Manuscript database
A database was constructed to store genotypes for 50,972 single-nucleotide polymorphisms (SNP) from the Illumina BovineSNP50 BeadChip for over 30,000 animals. The database allows storage of multiple samples per animal and stores all SNP genotypes for a sample in a single row. An indicator specifies ...
Shirasu, Naoto; Kuroki, Masahide
2014-01-01
We developed a time- and cost-effective multiplex allele-specific polymerase chain reaction (AS-PCR) method based on the two-step PCR thermal cycles for genotyping single-nucleotide polymorphisms in three alcoholism-related genes: alcohol dehydrogenase 1B, aldehyde dehydrogenase 2 and μ-opioid receptor. Applying MightyAmp(®) DNA polymerase with optimized AS-primers and PCR conditions enabled us to achieve effective and selective amplification of the target alleles from alkaline lysates of a human hair root, and simultaneously to determine the genotypes within less than 1.5 h using minimal lab equipment.
USDA-ARS?s Scientific Manuscript database
The family Rutaceae encompasses several genera including the economically important genus Citrus. In this study, we selected 22 citrus relatives belonging to the various sub groups of Rutaceae and compared the sequences of three gene fragments. The accessions selected belong to the subfamily Rutoide...
USDA-ARS?s Scientific Manuscript database
The promise of genomic selection is accurate prediction of animals' genetic potential from their genotypes. Simple DNA tests might replace low accuracy predictions for expensive or lowly heritable measures of puberty and fertility based on performance and pedigree. Knowing which DNA variants affec...
O'Toole, Amanda S.; Miller, Stacy; Haines, Nathan; Zink, M. Coleen; Serra, Martin J.
2006-01-01
Thermodynamic parameters are reported for duplex formation of 48 self-complementary RNA duplexes containing Watson–Crick terminal base pairs (GC, AU and UA) with all 16 possible 3′ double-nucleotide overhangs; mimicking the structures of short interfering RNAs (siRNA) and microRNAs (miRNA). Based on nearest-neighbor analysis, the addition of a second dangling nucleotide to a single 3′ dangling nucleotide increases stability of duplex formation up to 0.8 kcal/mol in a sequence dependent manner. Results from this study in conjunction with data from a previous study [A. S. O'Toole, S. Miller and M. J. Serra (2005) RNA, 11, 512.] allows for the development of a refined nearest-neighbor model to predict the influence of 3′ double-nucleotide overhangs on the stability of duplex formation. The model improves the prediction of free energy and melting temperature when tested against five oligomers with various core duplex sequences. Phylogenetic analysis of naturally occurring miRNAs was performed to support our results. Selection of the effector miR strand of the mature miRNA duplex appears to be dependent upon the identity of the 3′ double-nucleotide overhang. Thermodynamic parameters for 3′ single terminal overhangs adjacent to a UA pair are also presented. PMID:16820533
Sequence of a cDNA encoding pancreatic preprosomatostatin-22.
Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E
1982-01-01
We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. Images PMID:6127673
A genetic variation map for chicken with 2.8 million single nucleotide polymorphisms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wong, G K; Hillier, L; Brandstrom, M
2005-02-20
We describe a genetic variation map for the chicken genome containing 2.8 million single nucleotide polymorphisms (SNPs), based on a comparison of the sequences of 3 domestic chickens (broiler, layer, Silkie) to their wild ancestor Red Jungle Fowl (RJF). Subsequent experiments indicate that at least 90% are true SNPs, and at least 70% are common SNPs that segregate in many domestic breeds. Mean nucleotide diversity is about 5 SNP/kb for almost every possible comparison between RJF and domestic lines, between two different domestic lines, and within domestic lines--contrary to the idea that domestic animals are highly inbred relative to theirmore » wild ancestors. In fact, most of the SNPs originated prior to domestication, and there is little to no evidence of selective sweeps for adaptive alleles on length scales of greater than 100 kb.« less
Using single nucleotide polymorphism to detect selection signature in Hereford beef cattle
USDA-ARS?s Scientific Manuscript database
The objective of this study was to investigate selection signature in 2 sources of purebred Hereford beef cattle. Data were available from 240 Line 1 Herefords (L1) born between 1953 to 2008, and 311 Industry Herefords (IH) born between 1970 and 2008. Line 1 Herefords were sampled from a closed line...
Efficient selection of tagging single-nucleotide polymorphisms in multiple populations.
Howie, Bryan N; Carlson, Christopher S; Rieder, Mark J; Nickerson, Deborah A
2006-08-01
Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
Structure of a eukaryotic cyclic nucleotide-gated channel
Li, Minghui; Zhou, Xiaoyuan; Wang, Shu; Michailidis, Ioannis; Gong, Ye; Su, Deyuan; Li, Huan; Li, Xueming; Yang, Jian
2018-01-01
Summary Cyclic nucleotide-gated (CNG) channels are essential for vision and olfaction. They belong to the voltage-gated ion channel superfamily but their activities are controlled by intracellular cyclic nucleotides instead of transmembrane voltage. Here we report a 3.5 Å-resolution single-particle electron cryomicroscopy structure of a CNG channel from C. elegans in the cGMP-bound open state. The channel has an unusual voltage-sensor-like domain (VSLD), accounting for its deficient voltage dependence. A C-terminal linker connecting S6 and the cyclic nucleotide-binding domain interacts directly with both the VSLD and pore domain, forming a gating ring that couples conformational changes triggered by cyclic nucleotide binding to the gate. The selectivity filter is lined by the carboxylate side chains of a functionally important glutamate and three rings of backbone carbonyls. This structure provides a new framework for understanding mechanisms of ion permeation, gating and channelopathy of CNG channels and cyclic nucleotide modulation of related channels. PMID:28099415
Lipi, Farhana; Chen, Suxiang; Chakravarthy, Madhuri; Rakesh, Shilpa; Veedu, Rakesh N
2016-12-01
Nucleic acid aptamers are single-stranded DNA or RNA oligonucleotide sequences that bind to a specific target molecule with high affinity and specificity through their ability to adopt 3-dimensional structure in solution. Aptamers have huge potential as targeted therapeutics, diagnostics, delivery agents and as biosensors. However, aptamers composed of natural nucleotide monomers are quickly degraded in vivo and show poor pharmacodynamic properties. To overcome this, chemically-modified nucleic acid aptamers are developed by incorporating modified nucleotides after or during the selection process by Systematic Evolution of Ligands by EXponential enrichment (SELEX). This review will discuss the development of chemically-modified aptamers and provide the pros and cons, and new insights on in vitro aptamer selection strategies by using chemically-modified nucleic acid libraries.
Chen, Suxiang; Chakravarthy, Madhuri; Rakesh, Shilpa; Veedu, Rakesh N.
2016-01-01
ABSTRACT Nucleic acid aptamers are single-stranded DNA or RNA oligonucleotide sequences that bind to a specific target molecule with high affinity and specificity through their ability to adopt 3-dimensional structure in solution. Aptamers have huge potential as targeted therapeutics, diagnostics, delivery agents and as biosensors. However, aptamers composed of natural nucleotide monomers are quickly degraded in vivo and show poor pharmacodynamic properties. To overcome this, chemically-modified nucleic acid aptamers are developed by incorporating modified nucleotides after or during the selection process by Systematic Evolution of Ligands by EXponential enrichment (SELEX). This review will discuss the development of chemically-modified aptamers and provide the pros and cons, and new insights on in vitro aptamer selection strategies by using chemically-modified nucleic acid libraries. PMID:27715478
Bellucci, Elisa; Bitocchi, Elena; Ferrarini, Alberto; Benazzo, Andrea; Biagetti, Eleonora; Klie, Sebastian; Minio, Andrea; Rau, Domenico; Rodriguez, Monica; Panziera, Alex; Venturini, Luca; Attene, Giovanna; Albertini, Emidio; Jackson, Scott A.; Nanni, Laura; Fernie, Alisdair R.; Nikoloski, Zoran; Bertorelle, Giorgio; Delledonne, Massimo; Papa, Roberto
2014-01-01
Using RNA sequencing technology and de novo transcriptome assembly, we compared representative sets of wild and domesticated accessions of common bean (Phaseolus vulgaris) from Mesoamerica. RNA was extracted at the first true-leaf stage, and de novo assembly was used to develop a reference transcriptome; the final data set consists of ∼190,000 single nucleotide polymorphisms from 27,243 contigs in expressed genomic regions. A drastic reduction in nucleotide diversity (∼60%) is evident for the domesticated form, compared with the wild form, and almost 50% of the contigs that are polymorphic were brought to fixation by domestication. In parallel, the effects of domestication decreased the diversity of gene expression (18%). While the coexpression networks for the wild and domesticated accessions demonstrate similar seminal network properties, they show distinct community structures that are enriched for different molecular functions. After simulating the demographic dynamics during domestication, we found that 9% of the genes were actively selected during domestication. We also show that selection induced a further reduction in the diversity of gene expression (26%) and was associated with 5-fold enrichment of differentially expressed genes. While there is substantial evidence of positive selection associated with domestication, in a few cases, this selection has increased the nucleotide diversity in the domesticated pool at target loci associated with abiotic stress responses, flowering time, and morphology. PMID:24850850
NASA Astrophysics Data System (ADS)
Rachmatia, H.; Kusuma, W. A.; Hasibuan, L. S.
2017-05-01
Selection in plant breeding could be more effective and more efficient if it is based on genomic data. Genomic selection (GS) is a new approach for plant-breeding selection that exploits genomic data through a mechanism called genomic prediction (GP). Most of GP models used linear methods that ignore effects of interaction among genes and effects of higher order nonlinearities. Deep belief network (DBN), one of the architectural in deep learning methods, is able to model data in high level of abstraction that involves nonlinearities effects of the data. This study implemented DBN for developing a GP model utilizing whole-genome Single Nucleotide Polymorphisms (SNPs) as data for training and testing. The case study was a set of traits in maize. The maize dataset was acquisitioned from CIMMYT’s (International Maize and Wheat Improvement Center) Global Maize program. Based on Pearson correlation, DBN is outperformed than other methods, kernel Hilbert space (RKHS) regression, Bayesian LASSO (BL), best linear unbiased predictor (BLUP), in case allegedly non-additive traits. DBN achieves correlation of 0.579 within -1 to 1 range.
Bataillon, Thomas; Duan, Jinjie; Hvilsom, Christina; Jin, Xin; Li, Yingrui; Skov, Laurits; Glemin, Sylvain; Munch, Kasper; Jiang, Tao; Qian, Yu; Hobolth, Asger; Wang, Jun; Mailund, Thomas; Siegismund, Hans R; Schierup, Mikkel H
2015-03-30
We study genome-wide nucleotide diversity in three subspecies of extant chimpanzees using exome capture. After strict filtering, Single Nucleotide Polymorphisms and indels were called and genotyped for greater than 50% of exons at a mean coverage of 35× per individual. Central chimpanzees (Pan troglodytes troglodytes) are the most polymorphic (nucleotide diversity, θw = 0.0023 per site) followed by Eastern (P. t. schweinfurthii) chimpanzees (θw = 0.0016) and Western (P. t. verus) chimpanzees (θw = 0.0008). A demographic scenario of divergence without gene flow fits the patterns of autosomal synonymous nucleotide diversity well except for a signal of recent gene flow from Western into Eastern chimpanzees. The striking contrast in X-linked versus autosomal polymorphism and divergence previously reported in Central chimpanzees is also found in Eastern and Western chimpanzees. We show that the direction of selection statistic exhibits a strong nonmonotonic relationship with the strength of purifying selection S, making it inappropriate for estimating S. We instead use counts in synonymous versus nonsynonymous frequency classes to infer the distribution of S coefficients acting on nonsynonymous mutations in each subspecies. The strength of purifying selection we infer is congruent with the differences in effective sizes of each subspecies: Central chimpanzees are undergoing the strongest purifying selection followed by Eastern and Western chimpanzees. Coding indels show stronger selection against indels changing the reading frame than observed in human populations. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Gardner, Andrew F; Wang, Jinchun; Wu, Weidong; Karouby, Jennifer; Li, Hong; Stupi, Brian P; Jack, William E; Hersh, Megan N; Metzker, Michael L
2012-08-01
Recent developments of unique nucleotide probes have expanded our understanding of DNA polymerase function, providing many benefits to techniques involving next-generation sequencing (NGS) technologies. The cyclic reversible termination (CRT) method depends on efficient base-selective incorporation of reversible terminators by DNA polymerases. Most terminators are designed with 3'-O-blocking groups but are incorporated with low efficiency and fidelity. We have developed a novel class of 3'-OH unblocked nucleotides, called Lightning Terminators™, which have a terminating 2-nitrobenzyl moiety attached to hydroxymethylated nucleobases. A key structural feature of this photocleavable group displays a 'molecular tuning' effect with respect to single-base termination and improved nucleotide fidelity. Using Therminator DNA polymerase, we demonstrate that these 3'-OH unblocked terminators exhibit superior enzymatic performance compared to two other reversible terminators, 3'-O-amino-TTP and 3'-O-azidomethyl-TTP. Lightning Terminators show maximum incorporation rates (k(pol)) that range from 35 to 45 nt/s, comparable to the fastest NGS chemistries, yet with catalytic efficiencies (k(pol)/K(D)) comparable to natural nucleotides. Pre-steady-state kinetic studies of thymidine analogs revealed that the major determinant for improved nucleotide selectivity is a significant reduction in k(pol) by >1000-fold over TTP misincorporation. These studies highlight the importance of structure-function relationships of modified nucleotides in dictating polymerase performance.
Arnedo, Mireia; Taffé, Patrick; Sahli, Roland; Furrer, Hansjakob; Hirschel, Bernard; Elzi, Luigia; Weber, Rainer; Vernazza, Pietro; Bernasconi, Enos; Darioli, Roger; Bergmann, Sven; Beckmann, Jacques S; Telenti, Amalio; Tarr, Philip E
2007-09-01
HIV-1 infected individuals have an increased cardiovascular risk which is partially mediated by dyslipidemia. Single nucleotide polymorphisms in multiple genes involved in lipid transport and metabolism are presumed to modulate the risk of dyslipidemia in response to antiretroviral therapy. The contribution to dyslipidemia of 20 selected single nucleotide polymorphisms of 13 genes reported in the literature to be associated with plasma lipid levels (ABCA1, ADRB2, APOA5, APOC3, APOE, CETP, LIPC, LIPG, LPL, MDR1, MTP, SCARB1, and TNF) was assessed by longitudinally modeling more than 4400 plasma lipid determinations in 438 antiretroviral therapy-treated participants during a median period of 4.8 years. An exploratory genetic score was tested that takes into account the cumulative contribution of multiple gene variants to plasma lipids. Variants of ABCA1, APOA5, APOC3, APOE, and CETP contributed to plasma triglyceride levels, particularly in the setting of ritonavir-containing antiretroviral therapy. Variants of APOA5 and CETP contributed to high-density lipoprotein-cholesterol levels. Variants of CETP and LIPG contributed to non-high-density lipoprotein-cholesterol levels, a finding not reported previously. Sustained hypertriglyceridemia and low high-density lipoprotein-cholesterol during the study period was significantly associated with the genetic score. Single nucleotide polymorphisms of ABCA1, APOA5, APOC3, APOE, and CETP contribute to plasma triglyceride and high-density lipoprotein-cholesterol levels during antiretroviral therapy exposure. Genetic profiling may contribute to the identification of patients at risk for antiretroviral therapy-related dyslipidemia.
González-Martínez, Santiago C; Ersoz, Elhan; Brown, Garth R; Wheeler, Nicholas C; Neale, David B
2006-03-01
Genetic association studies are rapidly becoming the experimental approach of choice to dissect complex traits, including tolerance to drought stress, which is the most common cause of mortality and yield losses in forest trees. Optimization of association mapping requires knowledge of the patterns of nucleotide diversity and linkage disequilibrium and the selection of suitable polymorphisms for genotyping. Moreover, standard neutrality tests applied to DNA sequence variation data can be used to select candidate genes or amino acid sites that are putatively under selection for association mapping. In this article, we study the pattern of polymorphism of 18 candidate genes for drought-stress response in Pinus taeda L., an important tree crop. Data analyses based on a set of 21 putatively neutral nuclear microsatellites did not show population genetic structure or genomewide departures from neutrality. Candidate genes had moderate average nucleotide diversity at silent sites (pi(sil) = 0.00853), varying 100-fold among single genes. The level of within-gene LD was low, with an average pairwise r2 of 0.30, decaying rapidly from approximately 0.50 to approximately 0.20 at 800 bp. No apparent LD among genes was found. A selective sweep may have occurred at the early-response-to-drought-3 (erd3) gene, although population expansion can also explain our results and evidence for selection was not conclusive. One other gene, ccoaomt-1, a methylating enzyme involved in lignification, showed dimorphism (i.e., two highly divergent haplotype lineages at equal frequency), which is commonly associated with the long-term action of balancing selection. Finally, a set of haplotype-tagging SNPs (htSNPs) was selected. Using htSNPs, a reduction of genotyping effort of approximately 30-40%, while sampling most common allelic variants, can be gained in our ongoing association studies for drought tolerance in pine.
Quantum Point Contact Single-Nucleotide Conductance for DNA and RNA Sequence Identification.
Afsari, Sepideh; Korshoj, Lee E; Abel, Gary R; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant
2017-11-28
Several nanoscale electronic methods have been proposed for high-throughput single-molecule nucleic acid sequence identification. While many studies display a large ensemble of measurements as "electronic fingerprints" with some promise for distinguishing the DNA and RNA nucleobases (adenine, guanine, cytosine, thymine, and uracil), important metrics such as accuracy and confidence of base calling fall well below the current genomic methods. Issues such as unreliable metal-molecule junction formation, variation of nucleotide conformations, insufficient differences between the molecular orbitals responsible for single-nucleotide conduction, and lack of rigorous base calling algorithms lead to overlapping nanoelectronic measurements and poor nucleotide discrimination, especially at low coverage on single molecules. Here, we demonstrate a technique for reproducible conductance measurements on conformation-constrained single nucleotides and an advanced algorithmic approach for distinguishing the nucleobases. Our quantum point contact single-nucleotide conductance sequencing (QPICS) method uses combed and electrostatically bound single DNA and RNA nucleotides on a self-assembled monolayer of cysteamine molecules. We demonstrate that by varying the applied bias and pH conditions, molecular conductance can be switched ON and OFF, leading to reversible nucleotide perturbation for electronic recognition (NPER). We utilize NPER as a method to achieve >99.7% accuracy for DNA and RNA base calling at low molecular coverage (∼12×) using unbiased single measurements on DNA/RNA nucleotides, which represents a significant advance compared to existing sequencing methods. These results demonstrate the potential for utilizing simple surface modifications and existing biochemical moieties in individual nucleobases for a reliable, direct, single-molecule, nanoelectronic DNA and RNA nucleotide identification method for sequencing.
USDA-ARS?s Scientific Manuscript database
Unfavorable genetic correlations between production and fertility traits are well documented. Genetic selection for fertility traits is slow, however, due to low heritabilities. Identification of single nucleotide polymorphisms (SNP) involved in reproduction has improved the reliability of genomic e...
Maternal grandsire confirmation and discovery in dairy cattle
USDA-ARS?s Scientific Manuscript database
Accurate pedigree information is essential for selecting dairy animals to improve economically important traits. Two methods of maternal grandsire (MGS) discovery were compared. The first compared one single nucleotide polymorphism (SNP) at a time using a genotype from one or both parents (SNP metho...
USDA-ARS?s Scientific Manuscript database
Reproductive success is an important component of commercial beef cattle production, and identification of DNA markers with predictive merit for reproductive success would facilitate accurate prediction of mean daughter pregnancy rate, enabling effective selection of bulls to improve female fertilit...
USDA-ARS?s Scientific Manuscript database
Hereford is a major beef breed in the USA and has been subjected to selection for a variety of goals. A sub-population, known as Line 1 (L1), was established in 1934 by joining two paternal half-sib bulls with 50 unrelated females. L1 has since been maintained as a closed population and selected p...
USDA-ARS?s Scientific Manuscript database
Bacterial cold water disease (BCWD), caused by Flavobacterium psychrophilum, is an endemic and problematic disease in rainbow trout (Oncorhynchus mykiss) aquaculture. Previously, we have identified SNPs (single nucleotide polymorphisms) associated with BCWD resistance in rainbow trout. The objective...
Optimal design of low-density SNP arrays for genomic prediction: algorithm and applications
USDA-ARS?s Scientific Manuscript database
Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for their optimal design. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optim...
Development and Applications of a Bovine 50,000 SNP Chip
USDA-ARS?s Scientific Manuscript database
To develop an Illumina iSelect high density single nucleotide polymorphism (SNP) assay for cattle, the collaborative iBMC (Illumina, USDA ARS Beltsville, University of Missouri, USDA ARS Clay Center) Consortium first performed a de novo SNP discovery project in which genomic reduced representation l...
Single Color Multiplexed ddPCR Copy Number Measurements and Single Nucleotide Variant Genotyping.
Wood-Bouwens, Christina M; Ji, Hanlee P
2018-01-01
Droplet digital PCR (ddPCR) allows for accurate quantification of genetic events such as copy number variation and single nucleotide variants. Probe-based assays represent the current "gold-standard" for detection and quantification of these genetic events. Here, we introduce a cost-effective single color ddPCR assay that allows for single genome resolution quantification of copy number and single nucleotide variation.
Structural basis for the D-stereoselectivity of human DNA polymerase β
Vyas, Rajan; Reed, Andrew J.; Raper, Austin T.; Zahurancik, Walter J.; Wallenmeyer, Petra C.
2017-01-01
Abstract Nucleoside reverse transcriptase inhibitors (NRTIs) with L-stereochemistry have long been an effective treatment for viral infections because of the strong D-stereoselectivity exhibited by human DNA polymerases relative to viral reverse transcriptases. The D-stereoselectivity of DNA polymerases has only recently been explored structurally and all three DNA polymerases studied to date have demonstrated unique stereochemical selection mechanisms. Here, we have solved structures of human DNA polymerase β (hPolβ), in complex with single-nucleotide gapped DNA and L-nucleotides and performed pre-steady-state kinetic analysis to determine the D-stereoselectivity mechanism of hPolβ. Beyond a similar 180° rotation of the L-nucleotide ribose ring seen in other studies, the pre-catalytic ternary crystal structures of hPolβ, DNA and L-dCTP or the triphosphate forms of antiviral drugs lamivudine ((-)3TC-TP) and emtricitabine ((-)FTC-TP) provide little structural evidence to suggest that hPolβ follows the previously characterized mechanisms of D-stereoselectivity. Instead, hPolβ discriminates against L-stereochemistry through accumulation of several active site rearrangements that lead to a decreased nucleotide binding affinity and incorporation rate. The two NRTIs escape some of the active site selection through the base and sugar modifications but are selected against through the inability of hPolβ to complete thumb domain closure. PMID:28402499
Genetic risk profiling and gene signature modeling to predict risk of complications after IPAA.
Sehgal, Rishabh; Berg, Arthur; Polinski, Joseph I; Hegarty, John P; Lin, Zhenwu; McKenna, Kevin J; Stewart, David B; Poritz, Lisa S; Koltun, Walter A
2012-03-01
Severe pouchitis and Crohn's disease-like complications are 2 adverse postoperative complications that confound the success of the IPAA in patients with ulcerative colitis. To date, approximately 83 single nucleotide polymorphisms within 55 genes have been associated with IBD. The aim of this study was to identify single-nucleotide polymorphisms that correlate with complications after IPAA that could be utilized in a gene signature fashion to predict postoperative complications and aid in preoperative surgical decision making. One hundred forty-two IPAA patients were retrospectively classified as "asymptomatic" (n = 104, defined as no Crohn's disease-like complications or severe pouchitis for at least 2 years after IPAA) and compared with a "severe pouchitis" group (n = 12, ≥ 4 episodes pouchitis per year for 2 years including the need for long-term therapy to maintain remission) and a "Crohn's disease-like" group (n = 26, presence of fistulae, pouch inlet stricture, proximal small-bowel disease, or pouch granulomata, occurring at least 6 months after surgery). Genotyping for 83 single-nucleotide polymorphisms previously associated with Crohn's disease and/or ulcerative colitis was performed on a customized Illumina genotyping platform. The top 2 single-nucleotide polymorphisms statistically identified as being independently associated with each of Crohn's disease-like and severe pouchitis were used in a multivariate logistic regression model. These single-nucleotide polymorphisms were then used to create probability equations to predict overall chance of a positive or negative outcome for that complication. The top 2 single-nucleotide polymorphisms for Crohn's disease-like complications were in the 10q21 locus and the gene for PTGER4 (p = 0.006 and 0.007), whereas for severe pouchitis it was NOD2 and TNFSF15 (p = 0.003 and 0.011). Probability equations suggested that the risk of these 2 complications greatly increased with increasing number of risk alleles, going as high as 92% for severe pouchitis and 65% for Crohn's disease-like complications. In this IPAA patient cohort, mutations in the 10q21 locus and the PTGER4 gene were associated with Crohn's disease-like complications, whereas mutations in NOD2 and TNFSF15 correlated with severe pouchitis. Preoperative genetic analysis and use of such gene signatures hold promise for improved preoperative surgical patient selection to minimize these IPAA complications.
USDA-ARS?s Scientific Manuscript database
Cacao is an economically important commodity in Jamaica. Knowledge of the genetic diversity in Jamaican cacao germplasm is essential for their conservation and management. In spite of cacao's economic importance in Jamaica, the crop is understudied therefore limiting sound decisions towards improvin...
Wang, Lin; Liu, Simin; Niu, Tianhua; Xu, Xin
2005-03-18
Single nucleotide polymorphisms (SNPs) provide an important tool in pinpointing susceptibility genes for complex diseases and in unveiling human molecular evolution. Selection and retrieval of an optimal SNP set from publicly available databases have emerged as the foremost bottlenecks in designing large-scale linkage disequilibrium studies, particularly in case-control settings. We describe the architectural structure and implementations of a novel software program, SNPHunter, which allows for both ad hoc-mode and batch-mode SNP search, automatic SNP filtering, and retrieval of SNP data, including physical position, function class, flanking sequences at user-defined lengths, and heterozygosity from NCBI dbSNP. The SNP data extracted from dbSNP via SNPHunter can be exported and saved in plain text format for further down-stream analyses. As an illustration, we applied SNPHunter for selecting SNPs for 10 major candidate genes for type 2 diabetes, including CAPN10, FABP4, IL6, NOS3, PPARG, TNF, UCP2, CRP, ESR1, and AR. SNPHunter constitutes an efficient and user-friendly tool for SNP screening, selection, and acquisition. The executable and user's manual are available at http://www.hsph.harvard.edu/ppg/software.htm
Generalization of Associations of Kidney-Related Genetic Loci to American Indians
Haack, Karin; Almasy, Laura; Laston, Sandra; Lee, Elisa T.; Best, Lyle G.; Fabsitz, Richard R.; MacCluer, Jean W.; Howard, Barbara V.; Umans, Jason G.; Cole, Shelley A.
2014-01-01
Summary Background and objectives CKD disproportionally affects American Indians, who similar to other populations, show genetic susceptibility to kidney outcomes. Recent studies have identified several loci associated with kidney traits, but their relevance in American Indians is unknown. Design, setting, participants, & measurements This study used data from a large, family-based genetic study of American Indians (the Strong Heart Family Study), which includes 94 multigenerational families enrolled from communities located in Oklahoma, the Dakotas, and Arizona. Individuals were recruited from the Strong Heart Study, a population-based study of cardiovascular disease in American Indians. This study selected 25 single nucleotide polymorphisms in 23 loci identified from recently published kidney-related genome-wide association studies in individuals of European ancestry to evaluate their associations with kidney function (estimated GFR; individuals 18 years or older, up to 3282 individuals) and albuminuria (urinary albumin to creatinine ratio; n=3552) in the Strong Heart Family Study. This study also examined the association of single nucleotide polymorphisms in the APOL1 region with estimated GFR in 1121 Strong Heart Family Study participants. GFR was estimated using the abbreviated Modification of Diet in Renal Disease Equation. Additive genetic models adjusted for age and sex were used. Results This study identified significant associations of single nucleotide polymorphisms with estimated GFR in or nearby PRKAG2, SLC6A13, UBE2Q2, PIP5K1B, and WDR72 (P<2.1 × 10-3 to account for multiple testing). Single nucleotide polymorphisms in these loci explained 2.2% of the estimated GFR total variance and 2.9% of its heritability. An intronic variant of BCAS3 was significantly associated with urinary albumin to creatinine ratio. APOL1 single nucleotide polymorphisms were not associated with estimated GFR in a single variant test or haplotype analyses, and the at-risk variants identified in individuals with African ancestry were not detected in DNA sequencing of American Indians. Conclusion This study extends the genetic associations of loci affecting kidney function to American Indians, a population at high risk of kidney disease, and provides additional support for a potential biologic relevance of these loci across ancestries. PMID:24311711
Zhang, Xi; Zhang, Jing; Wu, Dongzhi; Liu, Zhijing; Cai, Shuxian; Chen, Mei; Zhao, Yanping; Li, Chunyan; Yang, Huanghao; Chen, Jinghua
2014-12-07
Locked nucleic acid (LNA) is applied in toehold-mediated strand displacement reaction (TMSDR) to develop a junction-probe electrochemiluminescence (ECL) biosensor for single-nucleotide polymorphism (SNP) detection in the BRCA1 gene related to breast cancer. More than 65-fold signal difference can be observed with perfectly matched target sequence to single-base mismatched sequence under the same conditions, indicating good selectivity of the ECL biosensor.
Duellman, Tyler; Warren, Christopher; Yang, Jay
2014-01-01
Microribonucleic acids (miRNAs) work with exquisite specificity and are able to distinguish a target from a non-target based on a single nucleotide mismatch in the core nucleotide domain. We questioned whether miRNA regulation of gene expression could occur in a single nucleotide polymorphism (SNP)-specific manner, manifesting as a post-transcriptional control of expression of genetic polymorphisms. In our recent study of the functional consequences of matrix metalloproteinase (MMP)-9 SNPs, we discovered that expression of a coding exon SNP in the pro-domain of the protein resulted in a profound decrease in the secreted protein. This missense SNP results in the N38S amino acid change and a loss of an N-glycosylation site. A systematic study demonstrated that the loss of secreted protein was due not to the loss of an N-glycosylation site, but rather an SNP-specific targeting by miR-671-3p and miR-657. Bioinformatics analysis identified 41 SNP-specific miRNA targeting MMP-9 SNPs, mostly in the coding exon and an extension of the analysis to chromosome 20, where the MMP-9 gene is located, suggesting that SNP-specific miRNAs targeting the coding exon are prevalent. This selective post-transcriptional regulation of a target messenger RNA harboring genetic polymorphisms by miRNAs offers an SNP-dependent post-transcriptional regulatory mechanism, allowing for polymorphic-specific differential gene regulation. PMID:24627221
Wang, Xiaodan; Ma, Dehong; Huang, Xinwei; Li, Lihua; Li, Duo; Zhao, Yujiao; Qiu, Lijuan; Pan, Yue; Chen, Junying; Xi, Juemin; Shan, Xiyun; Sun, Qiangming
2017-06-15
In the past few decades, dengue has spread rapidly and is an emerging disease in China. An unexpected dengue outbreak occurred in Xishuangbanna, Yunnan, China, resulting in 1331 patients in 2013. In order to obtain the complete genome information and perform mutation and evolutionary analysis of causative agent related to this largest outbreak of dengue fever. The viruses were isolated by cell culture and evaluated by genome sequence analysis. Phylogenetic trees were then constructed by Neighbor-Joining methods (MEGA6.0), followed by analysis of nucleotide mutation and amino acid substitution. The analysis of the diversity of secondary structure for E and NS1 protein were also performed. Then selection pressures acting on the coding sequences were estimated by PAML software. The complete genome sequences of two isolated strains (YNSW1, YNSW2) were 10,710 and 10,702 nucleotides in length, respectively. Phylogenetic analysis revealed both strain were classified as genotype II of DENV-3. The results indicated that both isolated strains of Xishuangbanna in 2013 and Laos 2013 stains (KF816161.1, KF816158.1, LC147061.1, LC147059.1, KF816162.1) were most similar to Bangladesh (AY496873.2) in 2002. After comparing with the DENV-3SS (H87) 62 amino acid substitutions were identified in translated regions, and 38 amino acid substitutions were identified in translated regions compared with DENV-3 genotype II stains Bangladesh (AY496873.2). 27(YNSW1) or 28(YNSW2) single nucleotide changes were observed in structural protein sequences with 7(YNSW1) or 8(YNSW2) non-synonymous mutations compared with AY496873.2. Of them, 4 non-synonymous mutations were identified in E protein sequences with (2 in the β-sheet, 2 in the coil). Meanwhile, 117(YNSW1) or 115 (YNSW2) single nucleotide changes were observed in non-structural protein sequences with 31(YNSW1) or 30 (YNSW2) non-synonymous mutations. Particularly, 14 single nucleotide changes were observed in NS1 sequences with 4/14 non-synonymous substitutions (4 in the coil). Selection pressure analysis revealed no positive selection in the amino acid sites of the genes encoding for structural and non-structural proteins. This study may help understand the intrinsic geographical relatedness of dengue virus 3 and contributes further to research on their infectivity, pathogenicity and vaccine development. Copyright © 2017 Elsevier B.V. All rights reserved.
An abbreviated SNP panel for ancestry assignment of honeybees (Apis mellifera)
USDA-ARS?s Scientific Manuscript database
This paper examines whether an abbreviated panel of 37 single nucleotide polymorphisms (SNPs) has the same power as a larger and more expensive panel of 95 SNPs to assign ancestry of honeybees (Apis mellifera) to three ancestral lineages. We selected 37 SNPs from the original 95 SNP panel using alle...
Methods for discovering and validating relationships among genotyped animals
USDA-ARS?s Scientific Manuscript database
Genomic selection based on single-nucleotide polymorphisms (SNPs) has led to the collection of genotypes for over 2.2 million animals by the Council on Dairy Cattle Breeding in the United States. To assure that a genotype is assigned to the correct animal and that the animal’s pedigree is correct, t...
USDA-ARS?s Scientific Manuscript database
The development of resources for genomic studies in Mangifera indica (mango) will allow marker-assisted selection and identification of genetically diverse germplasm, greatly aiding mango breeding programs. We report here a first step in developing such resources, our identification of thousands una...
USDA-ARS?s Scientific Manuscript database
Deep sequencing of viruses isolated from infected hosts is an efficient way to measure population-genetic variation and can reveal patterns of dispersal and natural selection. In this study, we mined existing Illumina sequence reads to investigate single-nucleotide polymorphisms (SNPs) within two RN...
Wu, Lucia R.; Chen, Sherry X.; Wu, Yalei; Patel, Abhijit A.; Zhang, David Yu
2018-01-01
Rare DNA-sequence variants hold important clinical and biological information, but existing detection techniques are expensive, complex, allele-specific, or don’t allow for significant multiplexing. Here, we report a temperature-robust polymerase-chain-reaction method, which we term blocker displacement amplification (BDA), that selectively amplifies all sequence variants, including single-nucleotide variants (SNVs), within a roughly 20-nucleotide window by 1,000-fold over wild-type sequences. This allows for easy detection and quantitation of hundreds of potential variants originally at ≤0.1% in allele frequency. BDA is compatible with inexpensive thermocycler instrumentation and employs a rationally designed competitive hybridization reaction to achieve comparable enrichment performance across annealing temperatures ranging from 56 °C to 64 °C. To show the sequence generality of BDA, we demonstrate enrichment of 156 SNVs and the reliable detection of single-digit copies. We also show that the BDA detection of rare driver mutations in cell-free DNA samples extracted from the blood plasma of lung-cancer patients is highly consistent with deep sequencing using molecular lineage tags, with a receiver operator characteristic accuracy of 95%. PMID:29805844
Defining the mRNA recognition signature of a bacterial toxin protein
Schureck, Marc A.; Dunkle, Jack A.; Maehigashi, Tatsuya; ...
2015-10-27
Bacteria contain multiple type II toxins that selectively degrade mRNAs bound to the ribosome to regulate translation and growth and facilitate survival during the stringent response. Ribosome-dependent toxins recognize a variety of three-nucleotide codons within the aminoacyl (A) site, but how these endonucleases achieve substrate specificity remains poorly understood. In this paper, we identify the critical features for how the host inhibition of growth B (HigB) toxin recognizes each of the three A-site nucleotides for cleavage. X-ray crystal structures of HigB bound to two different codons on the ribosome illustrate how HigB uses a microbial RNase-like nucleotide recognition loop tomore » recognize either cytosine or adenosine at the second A-site position. Strikingly, a single HigB residue and 16S rRNA residue C1054 form an adenosine-specific pocket at the third A-site nucleotide, in contrast to how tRNAs decode mRNA. Finally, our results demonstrate that the most important determinant for mRNA cleavage by ribosome-dependent toxins is interaction with the third A-site nucleotide.« less
Defining the mRNA recognition signature of a bacterial toxin protein
DOE Office of Scientific and Technical Information (OSTI.GOV)
Schureck, Marc A.; Dunkle, Jack A.; Maehigashi, Tatsuya
Bacteria contain multiple type II toxins that selectively degrade mRNAs bound to the ribosome to regulate translation and growth and facilitate survival during the stringent response. Ribosome-dependent toxins recognize a variety of three-nucleotide codons within the aminoacyl (A) site, but how these endonucleases achieve substrate specificity remains poorly understood. In this paper, we identify the critical features for how the host inhibition of growth B (HigB) toxin recognizes each of the three A-site nucleotides for cleavage. X-ray crystal structures of HigB bound to two different codons on the ribosome illustrate how HigB uses a microbial RNase-like nucleotide recognition loop tomore » recognize either cytosine or adenosine at the second A-site position. Strikingly, a single HigB residue and 16S rRNA residue C1054 form an adenosine-specific pocket at the third A-site nucleotide, in contrast to how tRNAs decode mRNA. Finally, our results demonstrate that the most important determinant for mRNA cleavage by ribosome-dependent toxins is interaction with the third A-site nucleotide.« less
Transient state kinetics of transcription elongation by T7 RNA polymerase.
Anand, Vasanti Subramanian; Patel, Smita S
2006-11-24
The single subunit DNA-dependent RNA polymerase (RNAP) from bacteriophage T7 catalyzes both promoter-dependent transcription initiation and promoter-independent elongation. Using a promoter-free substrate, we have dissected the kinetic pathway of single nucleotide incorporation during elongation. We show that T7 RNAP undergoes a slow conformational change (0.01-0.03 s(-1)) to form an elongation competent complex with the promoter-free substrate (dissociation constant (Kd) of 96 nM). The complex binds to a correct NTP (Kd of 80 microM) and incorporates the nucleoside monophosphate (NMP) into RNA primer very efficiently (220 s(-1) at 25 degrees C). An overall free energy change (-5.5 kcal/mol) and internal free energy change (-3.7 kcal/mol) of single NMP incorporation was calculated from the measured equilibrium constants. In the presence of inorganic pyrophosphate (PPi), the elongation complex catalyzes the reverse pyrophosphorolysis reaction at a maximum rate of 0.8 s(-1) with PPi Kd of 1.2 mM. Several experiments were designed to investigate the rate-limiting step in the pathway of single nucleotide addition. Acid-quench and pulse-chase kinetics indicated that an isomerization step before chemistry is rate-limiting. The very similar rate constants of sequential incorporation of two nucleotides indicated that the steps after chemistry are fast. Based on available data, we propose that the preinsertion to insertion isomerization of NTP observed in the crystallographic studies of T7 RNAP is a likely candidate for the rate-limiting step. The studies here provide a kinetic framework to investigate structure-function and fidelity of RNA synthesis and to further explore the role of the conformational change in nucleotide selection during RNA synthesis.
Screening of reproduction-related single-nucleotide variations from MeDIP-seq data in sheep.
Cao, Jiaxue; Wei, Caihong; Zhang, Shuzhen; Capellini, Terence D; Zhang, Li; Zhao, Fuping; Li, Li; Zhong, Tao; Wang, Linjie; Du, Lixin; Zhang, Hongping
2016-11-01
Extensive variation in reproduction has arisen in Chinese Mongolian sheep during recent domestication. Hu and Small-tailed Han sheep, for example, have become non-seasonal breeders and exhibit higher fecundity than Tan and Ujumqin breeds. We therefore scanned reproduction-related single-nucleotide variations from methylated DNA-immunoprecipitation sequencing data generated from each of those four breeds to uncover potential mechanisms underlying this breed variation. We generated a high-quality map of single nucleotide variations (SNVs) in DNA methylation enriched regions, and found that the majority of variants are located within non-coding regions. We identified 359 SNVs within the Sheep Quantitative Trait Locus (QTL) database. Nineteen of these SNVs associated with the Aseasonal Reproduction QTL, and 10 out of the 19 reside close to genes with known reproduction functions. We also identified the well-known FecB mutation in high-fecundity sheep (Hu and Small-tailed Han sheep). When we applied these FecB finding to our breeding system, we improved lambing rate by 175%. In summary, this study provided strong candidate SNVs associated with sheep fecundity that can serve as targets for functional testing and to enhance selective breeding strategies. Mol. Reprod. Dev. 83: 958-967, 2016 © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Diopere, Eveline; Hellemans, Bart; Volckaert, Filip A M; Maes, Gregory E
2013-03-01
Genomic methodologies applied in evolutionary and fisheries research have been of great benefit to understand the marine ecosystem and the management of natural resources. Although single nucleotide polymorphisms (SNPs) are attractive for the study of local adaptation, spatial stock management and traceability, and investigating the effects of fisheries-induced selection, they have rarely been exploited in non-model organisms. This is partly due to difficulties in finding and validating SNPs in species with limited or no genomic resources. Complementary to random genome-scan approaches, a targeted candidate gene approach has the potential to unveil pre-selected functional diversity and provides more in depth information on the action of selection at specific genes. For example genes can be under selective pressure due to climate change and sustained periods of heavy fishing pressure. In this study, we applied a candidate gene approach in sole (Solea solea L.), an important member of the demersal ecosystem. As consumption flatfish it is heavy exploited and has experienced associated life-history changes over the last 60years. To discover novel genetic polymorphisms in or around genes linked to important life history traits in sole, we screened a total of 76 candidate genes related to growth and maturation using a targeted resequencing approach. We identified in total 86 putative SNPs in 22 genes and validated 29 SNPs using a multiplex single-base extension genotyping assay. We found 22 informative SNPs, of which two represent non-synonymous mutations, potentially of functional relevance. These novel markers should be rapidly and broadly applicable in analyses of natural sole populations, as a measure of the evolutionary signature of overfishing and for initiatives on marker assisted selection. Copyright © 2012 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
He, Feng; Wen, Haishen; Yu, Dahui; Li, Jifang; Shi, Bao; Chen, Caifang; Zhang, Jiaren; Jin, Guoxiong; Chen, Xiaoyan; Shi, Dan; Yang, Yanping
2010-12-01
Follicle stimulating hormone β (FSHβ) of Japanese flounder ( Paralichthys olivaceus) plays a key role in the regulation of gonadal development. This study aimed to investigate molecular genetic characteristics of the FSHβ gene and elucidate the effects of single nucleotide polymorphisms (SNPs) of FSHβ on reproductive traits in Japanese flounder. We used polymerase chain reaction single-strand conformation polymorphism (PCR-SSCP) and sequencing of the FSHβ gene in 60 individuals. We identified only an SNP (T/C) in the coding region of exon3 of FSHβ. The SNP (T/C) did not lead to amino acid changes at the position 340 bp of FSHβ gene. Statistical analysis showed that the SNP was significantly associated with testosterone (T) level and gonadosomatic index (GSI) ( P < 0.05). Individuals with genotype TC of the SNP had significantly higher serum T levels and GSI ( P < 0.05) than that of genotype CC. Therefore, FSHβ gene could be a useful molecular marker in selection for prominent reproductive trait in Japanese Flounder.
Quantitative Understanding of SHAPE Mechanism from RNA Structure and Dynamics Analysis.
Hurst, Travis; Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie
2018-05-10
The selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) method probes RNA local structural and dynamic information at single nucleotide resolution. To gain quantitative insights into the relationship between nucleotide flexibility, RNA 3D structure, and SHAPE reactivity, we develop a 3D Structure-SHAPE Relationship model (3DSSR) to rebuild SHAPE profiles from 3D structures. The model starts from RNA structures and combines nucleotide interaction strength and conformational propensity, ligand (SHAPE reagent) accessibility, and base-pairing pattern through a composite function to quantify the correlation between SHAPE reactivity and nucleotide conformational stability. The 3DSSR model shows the relationship between SHAPE reactivity and RNA structure and energetics. Comparisons between the 3DSSR-predicted SHAPE profile and the experimental SHAPE data show correlation, suggesting that the extracted analytical function may have captured the key factors that determine the SHAPE reactivity profile. Furthermore, the theory offers an effective method to sieve RNA 3D models and exclude models that are incompatible with experimental SHAPE data.
Energy efficiency trade-offs drive nucleotide usage in transcribed regions
Chen, Wei-Hua; Lu, Guanting; Bork, Peer; Hu, Songnian; Lercher, Martin J.
2016-01-01
Efficient nutrient usage is a trait under universal selection. A substantial part of cellular resources is spent on making nucleotides. We thus expect preferential use of cheaper nucleotides especially in transcribed sequences, which are often amplified thousand-fold compared with genomic sequences. To test this hypothesis, we derive a mutation-selection-drift equilibrium model for nucleotide skews (strand-specific usage of ‘A' versus ‘T' and ‘G' versus ‘C'), which explains nucleotide skews across 1,550 prokaryotic genomes as a consequence of selection on efficient resource usage. Transcription-related selection generally favours the cheaper nucleotides ‘U' and ‘C' at synonymous sites. However, the information encoded in mRNA is further amplified through translation. Due to unexpected trade-offs in the codon table, cheaper nucleotides encode on average energetically more expensive amino acids. These trade-offs apply to both strand-specific nucleotide usage and GC content, causing a universal bias towards the more expensive nucleotides ‘A' and ‘G' at non-synonymous coding sites. PMID:27098217
The role of protozoa-driven selection in shaping human genetic variability.
Pozzoli, Uberto; Fumagalli, Matteo; Cagliani, Rachele; Comi, Giacomo P; Bresolin, Nereo; Clerici, Mario; Sironi, Manuela
2010-03-01
Protozoa exert a strong selective pressure in humans. The selection signatures left by these pathogens can be exploited to identify genetic modulators of infection susceptibility. We show that protozoa diversity in different geographic locations is a good measure of protozoa-driven selective pressure; protozoa diversity captured selection signatures at known malaria resistance loci and identified several selected single nucleotide polymorphisms in immune and hemolytic anemia genes. A genome-wide search enabled us to identify 5180 variants mapping to 1145 genes that are subjected to protozoa-driven selective pressure. We provide a genome-wide estimate of protozoa-driven selective pressure and identify candidate susceptibility genes for protozoa-borne diseases. Copyright 2010 Elsevier Ltd. All rights reserved.
Liu, Y; Yan, L; Li, Z; Huang, W-F; Pokhrel, S; Liu, X; Su, S
2016-06-01
Chalkbrood is a disease affecting honey bees that seriously impairs brood growth and productivity of diseased colonies. Although honey bees can develop chalkbrood resistance naturally, the details underlying the mechanisms of resistance are not fully understood, and no easy method is currently available for selecting and breeding resistant bees. Finding the genes involved in the development of resistance and identifying single nucleotide polymorphisms (SNPs) that can be used as molecular markers of resistance is therefore a high priority. We conducted genome resequencing to compare resistant (Res) and susceptible (Sus) larvae that were selected following in vitro chalkbrood inoculation. Twelve genomic libraries, including 14.4 Gb of sequence data, were analysed using SNP-finding algorithms. Unique SNPs derived from chromosomes 2 and 11 were analysed in this study. SNPs from resistant individuals were confirmed by PCR and Sanger sequencing using in vitro reared larvae and resistant colonies. We found strong support for an association between the C allele at SNP C2587245T and chalkbrood resistance. SNP C2587245T may be useful as a genetic marker for the selection of chalkbrood resistance and high royal jelly production honey bee lines, thereby helping to minimize the negative effects of chalkbrood on managed honey bees. © 2016 The Royal Entomological Society.
GenoCore: A simple and fast algorithm for core subset selection from large genotype datasets.
Jeong, Seongmun; Kim, Jae-Yoon; Jeong, Soon-Chun; Kang, Sung-Taeg; Moon, Jung-Kyung; Kim, Namshin
2017-01-01
Selecting core subsets from plant genotype datasets is important for enhancing cost-effectiveness and to shorten the time required for analyses of genome-wide association studies (GWAS), and genomics-assisted breeding of crop species, etc. Recently, a large number of genetic markers (>100,000 single nucleotide polymorphisms) have been identified from high-density single nucleotide polymorphism (SNP) arrays and next-generation sequencing (NGS) data. However, there is no software available for picking out the efficient and consistent core subset from such a huge dataset. It is necessary to develop software that can extract genetically important samples in a population with coherence. We here present a new program, GenoCore, which can find quickly and efficiently the core subset representing the entire population. We introduce simple measures of coverage and diversity scores, which reflect genotype errors and genetic variations, and can help to select a sample rapidly and accurately for crop genotype dataset. Comparison of our method to other core collection software using example datasets are performed to validate the performance according to genetic distance, diversity, coverage, required system resources, and the number of selected samples. GenoCore selects the smallest, most consistent, and most representative core collection from all samples, using less memory with more efficient scores, and shows greater genetic coverage compared to the other software tested. GenoCore was written in R language, and can be accessed online with an example dataset and test results at https://github.com/lovemun/Genocore.
Weak Negative and Positive Selection and the Drift Load at Splice Sites
Denisov, Stepan V.; Bazykin, Georgii A.; Sutormin, Roman; Favorov, Alexander V.; Mironov, Andrey A.; Gelfand, Mikhail S.; Kondrashov, Alexey S.
2014-01-01
Splice sites (SSs) are short sequences that are crucial for proper mRNA splicing in eukaryotic cells, and therefore can be expected to be shaped by strong selection. Nevertheless, in mammals and in other intron-rich organisms, many of the SSs often involve nonconsensus (Nc), rather than consensus (Cn), nucleotides, and beyond the two critical nucleotides, the SSs are not perfectly conserved between species. Here, we compare the SS sequences between primates, and between Drosophila fruit flies, to reveal the pattern of selection acting at SSs. Cn-to-Nc substitutions are less frequent, and Nc-to-Cn substitutions are more frequent, than neutrally expected, indicating, respectively, negative and positive selection. This selection is relatively weak (1 < |4Nes| < 4), and has a similar efficiency in primates and in Drosophila. Within some nucleotide positions, the positive selection in favor of Nc-to-Cn substitutions is weaker than the negative selection maintaining already established Cn nucleotides; this difference is due to site-specific negative selection favoring current Nc nucleotides. In general, however, the strength of negative selection protecting the Cn alleles is similar in magnitude to the strength of positive selection favoring replacement of Nc alleles, as expected under the simple nearly neutral turnover. In summary, although a fraction of the Nc nucleotides within SSs is maintained by selection, the abundance of deleterious nucleotides in this class suggests a substantial genome-wide drift load. PMID:24966225
Lou, Jing; Wang, Zhaoyin; Wang, Xiao; Bao, Jianchun; Tu, Wenwen; Dai, Zhihui
2015-10-07
A "signal-on" electrochemiluminescent DNA biosensing platform was proposed based on the dual quenching and strand displacement reaction. This novel "signal-on" detection strategy revealed its sensitivity in achieving a detection limit of 2.4 aM and its selectivity in distinguishing single nucleotide polymorphism of target DNA.
Gallium plasmonic nanoparticles for label-free DNA and single nucleotide polymorphism sensing
NASA Astrophysics Data System (ADS)
Marín, Antonio García; García-Mendiola, Tania; Bernabeu, Cristina Navio; Hernández, María Jesús; Piqueras, Juan; Pau, Jose Luis; Pariente, Félix; Lorenzo, Encarnación
2016-05-01
A label-free DNA and single nucleotide polymorphism (SNP) sensing method is described. It is based on the use of the pseudodielectric function of gallium plasmonic nanoparticles (GaNPs) deposited on Si (100) substrates under reversal of the polarization handedness condition. Under this condition, the pseudodielectric function is extremely sensitive to changes in the surrounding medium of the nanoparticle surface providing an excellent sensing platform competitive to conventional surface plasmon resonance. DNA sensing has been carried out by immobilizing a thiolated capture probe sequence from Helicobacter pylori onto GaNP/Si substrates; complementary target sequences of Helicobacter pylori can be quantified over the range of 10 pM to 3.0 nM with a detection limit of 6.0 pM and a linear correlation coefficient of R2 = 0.990. The selectivity of the device allows the detection of a single nucleotide polymorphism (SNP) in a specific sequence of Helicobacter pylori, without the need for a hybridization suppressor in solution such as formamide. Furthermore, it also allows the detection of this sequence in the presence of other pathogens, such as Escherichia coli in the sample. The broad applicability of the system was demonstrated by the detection of a specific gene mutation directly associated with cystic fibrosis in large genomic DNA isolated from blood cells.A label-free DNA and single nucleotide polymorphism (SNP) sensing method is described. It is based on the use of the pseudodielectric function of gallium plasmonic nanoparticles (GaNPs) deposited on Si (100) substrates under reversal of the polarization handedness condition. Under this condition, the pseudodielectric function is extremely sensitive to changes in the surrounding medium of the nanoparticle surface providing an excellent sensing platform competitive to conventional surface plasmon resonance. DNA sensing has been carried out by immobilizing a thiolated capture probe sequence from Helicobacter pylori onto GaNP/Si substrates; complementary target sequences of Helicobacter pylori can be quantified over the range of 10 pM to 3.0 nM with a detection limit of 6.0 pM and a linear correlation coefficient of R2 = 0.990. The selectivity of the device allows the detection of a single nucleotide polymorphism (SNP) in a specific sequence of Helicobacter pylori, without the need for a hybridization suppressor in solution such as formamide. Furthermore, it also allows the detection of this sequence in the presence of other pathogens, such as Escherichia coli in the sample. The broad applicability of the system was demonstrated by the detection of a specific gene mutation directly associated with cystic fibrosis in large genomic DNA isolated from blood cells. Electronic supplementary information (ESI) available. See DOI: 10.1039/c6nr00926c
Functional analysis of regulatory single-nucleotide polymorphisms.
Pampín, Sandra; Rodríguez-Rey, José C
2007-04-01
The identification of regulatory polymorphisms has become a key problem in human genetics. In the past few years there has been a conceptual change in the way in which regulatory single-nucleotide polymorphisms are studied. We revise the new approaches and discuss how gene expression studies can contribute to a better knowledge of the genetics of common diseases. New techniques for the association of single-nucleotide polymorphisms with changes in gene expression have been recently developed. This, together with a more comprehensive use of the old in-vitro methods, has produced a great amount of genetic information. When added to current databases, it will help to design better tools for the detection of regulatory single-nucleotide polymorphisms. The identification of functional regulatory single-nucleotide polymorphisms cannot be done by the simple inspection of DNA sequence. In-vivo techniques, based on primer-extension, and the more recently developed 'haploChIP' allow the association of gene variants to changes in gene expression. Gene expression analysis by conventional in-vitro techniques is the only way to identify the functional consequences of regulatory single-nucleotide polymorphisms. The amount of information produced in the last few years will help to refine the tools for the future analysis of regulatory gene variants.
Erdoğan, Onur; Aydin Son, Yeşim
2014-01-01
Single Nucleotide Polymorphisms (SNPs) are the most common genomic variations where only a single nucleotide differs between individuals. Individual SNPs and SNP profiles associated with diseases can be utilized as biological markers. But there is a need to determine the SNP subsets and patients' clinical data which is informative for the diagnosis. Data mining approaches have the highest potential for extracting the knowledge from genomic datasets and selecting the representative SNPs as well as most effective and informative clinical features for the clinical diagnosis of the diseases. In this study, we have applied one of the widely used data mining classification methodology: "decision tree" for associating the SNP biomarkers and significant clinical data with the Alzheimer's disease (AD), which is the most common form of "dementia". Different tree construction parameters have been compared for the optimization, and the most accurate tree for predicting the AD is presented.
Cruz, Vanessa P; Vera, Manuel; Pardo, Belén G; Taggart, John; Martinez, Paulino; Oliveira, Claudio; Foresti, Fausto
2017-05-01
Single nucleotide polymorphism (SNP) markers were identified and validated for two stingrays species, Potamotrygon motoro and Potamotrygon falkneri, using double digest restriction-site associated DNA (ddRAD) reads using 454-Roche technology. A total of 226 774 reads (65.5 Mb) were obtained (mean read length 289 ± 183 bp) detecting a total of 5399 contigs (mean contig length: 396 ± 91 bp). Mining this data set, a panel of 143 in silico SNPs was selected. Eighty-two of these SNPs were successfully validated and 61 were polymorphic: 14 in P. falkneri, 21 in P. motoro, 3 in both species and 26 fixed for alternative variants in both species, thus being useful for population analyses and hybrid detection. © 2016 John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Maity, Sourav; Mazzolini, Monica; Arcangeletti, Manuel; Valbuena, Alejandro; Fabris, Paolo; Lazzarino, Marco; Torre, Vincent
2015-05-01
Cyclic nucleotide-gated (CNG) channels are activated by binding of cyclic nucleotides. Although structural studies have identified the channel pore and selectivity filter, conformation changes associated with gating remain poorly understood. Here we combine single-molecule force spectroscopy (SMFS) with mutagenesis, bioinformatics and electrophysiology to study conformational changes associated with gating. By expressing functional channels with SMFS fingerprints in Xenopus laevis oocytes, we were able to investigate gating of CNGA1 in a physiological-like membrane. Force spectra determined that the S4 transmembrane domain is mechanically coupled to S5 in the closed state, but S3 in the open state. We also show there are multiple pathways for the unfolding of the transmembrane domains, probably caused by a different degree of α-helix folding. This approach demonstrates that CNG transmembrane domains have dynamic structure and establishes SMFS as a tool for probing conformational change in ion channels.
Chávez-Galarza, Julio; Henriques, Dora; Johnston, J Spencer; Azevedo, João C; Patton, John C; Muñoz, Irene; De la Rúa, Pilar; Pinto, M Alice
2013-12-01
Understanding the genetic mechanisms of adaptive population divergence is one of the most fundamental endeavours in evolutionary biology and is becoming increasingly important as it will allow predictions about how organisms will respond to global environmental crisis. This is particularly important for the honey bee, a species of unquestionable ecological and economical importance that has been exposed to increasing human-mediated selection pressures. Here, we conducted a single nucleotide polymorphism (SNP)-based genome scan in honey bees collected across an environmental gradient in Iberia and used four FST -based outlier tests to identify genomic regions exhibiting signatures of selection. Additionally, we analysed associations between genetic and environmental data for the identification of factors that might be correlated or act as selective pressures. With these approaches, 4.4% (17 of 383) of outlier loci were cross-validated by four FST -based methods, and 8.9% (34 of 383) were cross-validated by at least three methods. Of the 34 outliers, 15 were found to be strongly associated with one or more environmental variables. Further support for selection, provided by functional genomic information, was particularly compelling for SNP outliers mapped to different genes putatively involved in the same function such as vision, xenobiotic detoxification and innate immune response. This study enabled a more rigorous consideration of selection as the underlying cause of diversity patterns in Iberian honey bees, representing an important first step towards the identification of polymorphisms implicated in local adaptation and possibly in response to recent human-mediated environmental changes. © 2013 John Wiley & Sons Ltd.
Chen, Jianchi; Civerolo, Edwin L; Jarret, Robert L; Van Sluys, Marie-Anne; de Oliveira, Mariana C
2005-02-01
Xylella fastidiosa causes many important plant diseases including Pierce's disease (PD) in grape and almond leaf scorch disease (ALSD). DNA-based methodologies, such as randomly amplified polymorphic DNA (RAPD) analysis, have been playing key roles in genetic information collection of the bacterium. This study further analyzed the nucleotide sequences of selected RAPDs from X. fastidiosa strains in conjunction with the available genome sequence databases and unveiled several previously unknown novel genetic traits. These include a sequence highly similar to those in the phage family of Podoviridae. Genome comparisons among X. fastidiosa strains suggested that the "phage" is currently active. Two other RAPDs were also related to horizontal gene transfer: one was part of a broadly distributed cryptic plasmid and the other was associated with conjugal transfer. One RAPD inferred a genomic rearrangement event among X. fastidiosa PD strains and another identified a single nucleotide polymorphism of evolutionary value.
SiNoPsis: Single Nucleotide Polymorphisms selection and promoter profiling.
Boloc, Daniel; Rodríguez, Natalia; Gassó, Patricia; Abril, Josep F; Bernardo, Miquel; Lafuente, Amalia; Mas, Sergi
2017-09-14
The selection of a Single Nucleotide Polymorphism (SNP) using bibliographic methods can be a very time-consuming task. Moreover, a SNP selected in this way may not be easily visualized in its genomic context by a standard user hoping to correlate it with other valuable information. Here we propose a web form built on top of Circos that can assist SNP-centred screening, based on their location in the genome and the regulatory modules they can disrupt. Its use may allow researchers to prioritize SNPs in genotyping and disease studies. SiNoPsis is bundled as a web portal. It focuses on the different structures involved in the genomic expression of a gene, especially those found in the core promoter upstream region. These structures include transcription factor binding sites (for promoter and enhancer signals), histones, and promoter flanking regions. Additionally, the tool provides eQTL and linkage disequilibrium (LD) properties for a given SNP query, yielding further clues about other indirectly associated SNPs. Possible disruptions of the aforementioned structures affecting gene transcription are reported using multiple resource databases. SiNoPsis has a simple user-friendly interface, which allows single queries by gene symbol, genomic coordinates, Ensembl gene identifiers, RefSeq transcript identifiers and SNPs. It is the only portal providing useful SNP selection based on regulatory modules and LD with functional variants in both textual and graphic modes (by properly defining the arguments and parameters needed to run Circos). SiNoPsis is freely available at https://compgen.bio.ub.edu/SiNoPsis /. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Genetic Variants of TPCN2 Associated with Type 2 Diabetes Risk in the Chinese Population
Zhang, Yu; Fan, Xiaofang; Zhang, Ning; Zheng, Hui; Song, Yuping; Shen, Chunfang; Shen, Jiayi; Ren, Fengdong; Yang, Jialin
2016-01-01
Objective The aim of this study was to determine whether TPCN2 genetic variants are associated with type 2 diabetes and to elucidate which variants in TPCN2 confer diabetes susceptibility in the Chinese population. Research Design and Methods The sample population included 384 patients with type 2 diabetes and 1468 controls. Anthropometric parameters, glycemic and lipid profiles and insulin resistance were measured. We selected 6 TPCN2 tag single nucleotide polymorphisms (rs35264875, rs267603153, rs267603154, rs3829241, rs1551305, and rs3750965). Genotypes were determined using a Sequenom MassARRAY SNP genotyping system. Results Ultimately, we genotyped 3 single nucleotide polymorphisms (rs3750965, rs3829241, and rs1551305) in all individuals. There was a 5.1% higher prevalence of the rs1551305 variant allele in type 2 diabetes individuals (A) compared with wild-type homozygous individuals (G). The AA genotype of rs1551305 was associated with a higher diabetes risk (p<0.05). The distributions of rs3829241 and rs3750965 polymorphisms were not significantly different between the two groups. HOMA-%B of subjects harboring the AA genotype of rs1551305 decreased by 14.87% relative to the GG genotype. Conclusions TPCN2 plays a role in metabolic regulation, and the rs1551305 single nucleotide polymorphism is associated with type 2 diabetes risk. Future work will begin to unravel the underlying mechanisms. PMID:26918892
Khrustaleva, A M; Klovach, N V; Vedischeva, E V; Seeb, J E
2015-10-01
The variability of 45 single nucleotide polymorphism loci (SNP) was studied in sockeye salmon from the Kamchatka River basin and four lake-river systems of the west coast of the Bering Sea. Based on the genetic differentiation estimates for the largest sockeye salmon populations of Eastern Kamchatka and Chukotka, the examined samples were combined into two regional groups represented by the population of the Kamchatka River drainage, which included numerous local subpopulations and seasonal races, and the northern population grouping from the rivers of Olutorsko-Navarinsky raion, wherein the sockeye salmon from Maynypilginskaya Lake-River system was relatively isolated. Considerable divergence was observed between the island (Sarannoe Lake, Bering Island) and continental populations. Genetic heterogeneity was revealed and groups of early- and late-maturing individuals were isolated in the sample of late-run sockeye salmon from Kamchatka River. In Apuka River, subdivision of the spawning run into two genetically distinct spatial and temporal groupings was also observed. The results suggest that the differentiation of sockeye salmon samples by single nucleotide substitution frequencies was largely due to differences in the direction and strength of local selection at some loci in the population complexes and intrapopulation groupings from the examined river basins of Eastern Kamchatka, Chukotka, and Commander Islands.
Chen, Zhongxue; Ng, Hon Keung Tony; Li, Jing; Liu, Qingzhong; Huang, Hanwen
2017-04-01
In the past decade, hundreds of genome-wide association studies have been conducted to detect the significant single-nucleotide polymorphisms that are associated with certain diseases. However, most of the data from the X chromosome were not analyzed and only a few significant associated single-nucleotide polymorphisms from the X chromosome have been identified from genome-wide association studies. This is mainly due to the lack of powerful statistical tests. In this paper, we propose a novel statistical approach that combines the information of single-nucleotide polymorphisms on the X chromosome from both males and females in an efficient way. The proposed approach avoids the need of making strong assumptions about the underlying genetic models. Our proposed statistical test is a robust method that only makes the assumption that the risk allele is the same for both females and males if the single-nucleotide polymorphism is associated with the disease for both genders. Through simulation study and a real data application, we show that the proposed procedure is robust and have excellent performance compared to existing methods. We expect that many more associated single-nucleotide polymorphisms on the X chromosome will be identified if the proposed approach is applied to current available genome-wide association studies data.
USDA-ARS?s Scientific Manuscript database
Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to show the distribution of these 2 important incompatible cultivated pepper species. Estimated mean nucleotide...
USDA-ARS?s Scientific Manuscript database
The genome-wide association study (GWAS) is a useful tool for detecting and characterizing traits of interest including those associated with disease resistance in soybean. The availability of 50,000 single nucleotide polymorphism (SNP) markers (SoySNP50K iSelect BeadChip; www.soybase.org) on 19,652...
ERIC Educational Resources Information Center
Shah, Kushani; Thomas, Shelby; Stein, Arnold
2013-01-01
In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C…
USDA-ARS?s Scientific Manuscript database
Background: Our goal is to produce a high-throughput SNP genotyping platform for genomic analyses in rainbow trout that will enable fine mapping of QTL, whole genome association studies, genomic selection for improved aquaculture production traits, and genetic analyses of wild populations that aid ...
USDA-ARS?s Scientific Manuscript database
Soybean [Glycine max (L.) Merr.] cultivars with elevated concentrations of the a' subunit of ß-conglycinin (BC) may provide health benefits to soy protein consumers. Two Monsanto single nucleotide polymorphism markers were used to classify F2 plants in four segregating populations as having elevate...
Increasing the number of single nucleotide polymorphisms used in genomic evaluations of dairy cattle
USDA-ARS?s Scientific Manuscript database
A small increase in the accuracy of genomic evaluations of dairy cattle was achieved by increasing the number of SNP used to 61,013. All the 45,195 SNP used previously were retained, and 15,818 SNP were selected from higher density genotyping chips if the magnitude of the SNP effect was among the to...
USDA-ARS?s Scientific Manuscript database
Favorable associations between magnesium intake and glycemic traits, such as fasting glucose and insulin, are observed in observational and clinical studies, but whether genetic variation affects these associations is largely unknown. We hypothesized that single nucleotide polymorphisms (SNPs) assoc...
USDA-ARS?s Scientific Manuscript database
Prior knowledge on heading date enables the selection of parents for synthetic cultivars that are well-matched with respect to heading date, which is necessary to ensure plants put together will successfully cross with each other. Heading date of individual plants can be determined directly, which h...
Smola, Matthew J; Rice, Greggory M; Busan, Steven; Siegfried, Nathan A; Weeks, Kevin M
2015-11-01
Selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistries exploit small electrophilic reagents that react with 2'-hydroxyl groups to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues by using reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as can be done for simple model RNAs. This protocol describes the experimental steps, implemented over 3 d, that are required to perform SHAPE probing and to construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots and provides useful troubleshooting information. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures and visualize probable and alternative helices, often in under 1 d. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles and entire transcriptomes.
Shared Genetic Signals of Hypoxia Adaptation in Drosophila and in High-Altitude Human Populations
Jha, Aashish R.; Zhou, Dan; Brown, Christopher D.; Kreitman, Martin; Haddad, Gabriel G.; White, Kevin P.
2016-01-01
The ability to withstand low oxygen (hypoxia tolerance) is a polygenic and mechanistically conserved trait that has important implications for both human health and evolution. However, little is known about the diversity of genetic mechanisms involved in hypoxia adaptation in evolving populations. We used experimental evolution and whole-genome sequencing in Drosophila melanogaster to investigate the role of natural variation in adaptation to hypoxia. Using a generalized linear mixed model we identified significant allele frequency differences between three independently evolved hypoxia-tolerant populations and normoxic control populations for approximately 3,800 single nucleotide polymorphisms. Around 50% of these variants are clustered in 66 distinct genomic regions. These regions contain genes that are differentially expressed between hypoxia-tolerant and normoxic populations and several of the differentially expressed genes are associated with metabolic processes. Additional genes associated with respiratory and open tracheal system development also show evidence of directional selection. RNAi-mediated knockdown of several candidate genes’ expression significantly enhanced survival in severe hypoxia. Using genomewide single nucleotide polymorphism data from four high-altitude human populations—Sherpas, Tibetans, Ethiopians, and Andeans, we found that several human orthologs of the genes under selection in flies are also likely under positive selection in all four high-altitude human populations. Thus, our results indicate that selection for hypoxia tolerance can act on standing genetic variation in similar genes and pathways present in organisms diverged by hundreds of millions of years. PMID:26576852
Campbell, Nathan R.; LaPatra, Scott E.; Overturf, Ken; Towner, Richard; Narum, Shawn R.
2014-01-01
Recent advances in genotyping-by-sequencing have enabled genome-wide association studies in nonmodel species including those in aquaculture programs. As with other aquaculture species, rainbow trout and steelhead (Oncorhynchus mykiss) are susceptible to disease and outbreaks can lead to significant losses. Fish culturists have therefore been pursuing strategies to prevent losses to common pathogens such as Flavobacterium psychrophilum (the etiological agent for bacterial cold water disease [CWD]) and infectious hematopoietic necrosis virus (IHNV) by adjusting feed formulations, vaccine development, and selective breeding. However, discovery of genetic markers linked to disease resistance offers the potential to use marker-assisted selection to increase resistance and reduce outbreaks. For this study we sampled juvenile fish from 40 families from 2-yr classes that either survived or died after controlled exposure to either CWD or IHNV. Restriction site−associated DNA sequencing produced 4661 polymorphic single-nucleotide polymorphism loci after strict filtering. Genotypes from individual survivors and mortalities were then used to test for association between disease resistance and genotype at each locus using the program TASSEL. After we accounted for kinship and stratification of the samples, tests revealed 12 single-nucleotide polymorphism markers that were highly associated with resistance to CWD and 19 markers associated with resistance to IHNV. These markers are candidates for further investigation and are expected to be useful for marker assisted selection in future broodstock selection for various aquaculture programs. PMID:25354781
2017-01-01
The advent of next-generation sequencing tools has made it possible to conduct fine-scale surveys of population differentiation and genome-wide scans for signatures of selection in non-model organisms. Such surveys are of particular importance in sharply declining coral species, since knowledge of population boundaries and signs of local adaptation can inform restoration and conservation efforts. Here, we use genome-wide surveys of single-nucleotide polymorphisms in the threatened Caribbean elkhorn coral, Acropora palmata, to reveal fine-scale population structure and infer the major barrier to gene flow that separates the eastern and western Caribbean populations between the Bahamas and Puerto Rico. The exact location of this break had been subject to discussion because two previous studies based on microsatellite data had come to differing conclusions. We investigate this contradiction by analyzing an extended set of 11 microsatellite markers including the five previously employed and discovered that one of the original microsatellite loci is apparently under selection. Exclusion of this locus reconciles the results from the SNP and the microsatellite datasets. Scans for outlier loci in the SNP data detected 13 candidate loci under positive selection, however there was no correlation between available environmental parameters and genetic distance. Together, these results suggest that reef restoration efforts should use local sources and utilize existing functional variation among geographic regions in ex situ crossing experiments to improve stress resistance of this species. PMID:29181279
Gritz, L; Davies, J
1983-11-01
The plasmid-borne gene hph coding for hygromycin B phosphotransferase (HPH) in Escherichia coli has been identified and its nucleotide sequence determined. The hph gene is 1026 nucleotides long, coding for a protein with a predicted Mr of 39 000. The hph gene was placed in a shuttle plasmid vector, downstream from the promoter region of the cyc 1 gene of Saccharomyces cerevisiae, and an hph construction containing a single AUG in the 5' noncoding region allowed direct selection following transformation in yeast and in E. coli. Thus the hph gene can be used in cloning vectors for both pro- and eukaryotes.
Wang, Dingzhong; Tang, Wei; Wu, Xiaojie; Wang, Xinyi; Chen, Gengjia; Chen, Qiang; Li, Na; Liu, Feng
2012-08-21
Toehold-mediated strand displacement reaction (SDR) is first introduced to develop a simple quartz crystal microbalance (QCM) biosensor without an enzyme or label at normal temperature for highly selective and sensitive detection of single-nucleotide polymorphism (SNP) in the p53 tumor suppressor gene. A hairpin capture probe with an external toehold is designed and immobilized on the gold electrode surface of QCM. A successive SDR is initiated by the target sequence hybridization with the toehold domain and ends with the unfolding of the capture probe. Finally, the open-loop capture probe hybridizes with the streptavidin-coupled reporter probe as an efficient mass amplifier to enhance the QCM signal. The proposed biosensor displays remarkable specificity to target the p53 gene fragment against single-base mutant sequences (e.g., the largest discrimination factor is 63 to C-C mismatch) and high sensitivity with the detection limit of 0.3 nM at 20 °C. As the crucial component of the fabricated biosensor for providing the high discrimination capability, the design rationale of the capture probe is further verified by fluorescence sensing and atomic force microscopy imaging. Additionally, a recovery of 84.1% is obtained when detecting the target sequence in spiked HeLa cells lysate, demonstrating the feasibility of employing this biosensor in detecting SNPs in biological samples.
Compositions and methods for detecting single nucleotide polymorphisms
Yeh, Hsin-Chih; Werner, James; Martinez, Jennifer S.
2016-11-22
Described herein are nucleic acid based probes and methods for discriminating and detecting single nucleotide variants in nucleic acid molecules (e.g., DNA). The methods include use of a pair of probes can be used to detect and identify polymorphisms, for example single nucleotide polymorphism in DNA. The pair of probes emit a different fluorescent wavelength of light depending on the association and alignment of the probes when hybridized to a target nucleic acid molecule. Each pair of probes is capable of discriminating at least two different nucleic acid molecules that differ by at least a single nucleotide difference. The methods can probes can be used, for example, for detection of DNA polymorphisms that are indicative of a particular disease or condition.
Kim, Kiyeon; Omori, Ryosuke; Ueno, Keisuke; Iida, Sayaka; Ito, Kimihito
2016-01-01
Understanding the evolutionary dynamics of influenza viruses is essential to control both avian and human influenza. Here, we analyze host-specific and segment-specific Tajima's D trends of influenza A virus through a systematic review using viral sequences registered in the National Center for Biotechnology Information. To avoid bias from viral population subdivision, viral sequences were stratified according to their sampling locations and sampling years. As a result, we obtained a total of 580 datasets each of which consists of nucleotide sequences of influenza A viruses isolated from a single population of hosts at a single sampling site within a single year. By analyzing nucleotide sequences in the datasets, we found that Tajima's D values of viral sequences were different depending on hosts and gene segments. Tajima's D values of viruses isolated from chicken and human samples showed negative, suggesting purifying selection or a rapid population growth of the viruses. The negative Tajima's D values in rapidly growing viral population were also observed in computer simulations. Tajima's D values of PB2, PB1, PA, NP, and M genes of the viruses circulating in wild mallards were close to zero, suggesting that these genes have undergone neutral selection in constant-sized population. On the other hand, Tajima's D values of HA and NA genes of these viruses were positive, indicating HA and NA have undergone balancing selection in wild mallards. Taken together, these results indicated the existence of unknown factors that maintain viral subtypes in wild mallards.
Can, Ceren; Yazıcıoğlu, Mehtap; Gürkan, Hakan; Tozkır, Hilmi; Görgülü, Adnan; Süt, Necdet Hilmi
2017-01-01
Background: Atopic dermatitis is the most common chronic inflammatory skin disease. A complex interaction of both genetic and environmental factors is thought to contribute to the disease. Aims: To evaluate whether single nucleotide polymorphisms in the TLR2 gene c.2258C>T (R753Q) (rs5743708) and TLR2 c.-148+1614T>A (A-16934T) (rs4696480) (NM_0032643) are associated with atopic dermatitis in Turkish children. Study Design: Case-control study. Methods: The study was conducted on 70 Turkish children with atopic dermatitis aged 0.5-18 years. The clinical severity of atopic dermatitis was evaluated by the severity scoring of atopic dermatitis index. Serum total IgE levels, specific IgE antibodies to inhalant and food allergens were measured in both atopic dermatitis patients and controls, skin prick tests were done on 70 children with atopic dermatitis. Genotyping for TLR2 (R753Q and A-16934T) single nucleotide polymorphisms was performed in both atopic dermatitis patients and controls. Results: Cytosine-cytosine and cytosin-thymine genotype frequencies of the TLR2 R753Q single nucleotide polymorphism in the atopic dermatitis group were determined as being 98.6% and 1.4%, cytosine allele frequency for TLR2 R753Q single nucleotide polymorphism was determined as 99.29% and the thymine allele frequency was 0.71%, thymine-thymine, thymine-adenine, and adenine-adenine genotype frequencies of the TLR2 A-16934T single nucleotide polymorphism were 24.3%, 44.3%, and 31.4%. The thymine allele frequency for the TLR2 A-16934T single nucleotide polymorphism in the atopic dermatitis group was 46.43%, and the adenine allele frequency was 53.57%, respectively. There was not statistically significant difference between the groups for all investigated polymorphisms (p>0.05). For all single nucleotide polymorphisms studied, allelic distribution was analogous among atopic dermatitis patients and controls, and no significant statistical difference was observed. No homozygous carriers of the TLR2 R753Q single nucleotide polymorphism were found in the atopic dermatitis and control groups. Conclusion: The TLR2 (R753Q and A-16934T) single nucleotide polymorphisms are not associated with atopic dermatitis in a group of Turkish patients. PMID:28443596
Can, Ceren; Yazıcıoğlu, Mehtap; Gürkan, Hakan; Tozkır, Hilmi; Görgülü, Adnan; Süt, Necdet Hilmi
2017-05-05
Atopic dermatitis is the most common chronic inflammatory skin disease. A complex interaction of both genetic and environmental factors is thought to contribute to the disease. To evaluate whether single nucleotide polymorphisms in the TLR2 gene c.2258C>T (R753Q) (rs5743708) and TLR2 c.-148+1614T>A (A-16934T) (rs4696480) (NM_0032643) are associated with atopic dermatitis in Turkish children. Case-control study. The study was conducted on 70 Turkish children with atopic dermatitis aged 0.5-18 years. The clinical severity of atopic dermatitis was evaluated by the severity scoring of atopic dermatitis index. Serum total IgE levels, specific IgE antibodies to inhalant and food allergens were measured in both atopic dermatitis patients and controls, skin prick tests were done on 70 children with atopic dermatitis. Genotyping for TLR2 (R753Q and A-16934T) single nucleotide polymorphisms was performed in both atopic dermatitis patients and controls. Cytosine-cytosine and cytosin-thymine genotype frequencies of the TLR2 R753Q single nucleotide polymorphism in the atopic dermatitis group were determined as being 98.6% and 1.4%, cytosine allele frequency for TLR2 R753Q single nucleotide polymorphism was determined as 99.29% and the thymine allele frequency was 0.71%, thymine-thymine, thymine-adenine, and adenine-adenine genotype frequencies of the TLR2 A-16934T single nucleotide polymorphism were 24.3%, 44.3%, and 31.4%. The thymine allele frequency for the TLR2 A-16934T single nucleotide polymorphism in the atopic dermatitis group was 46.43%, and the adenine allele frequency was 53.57%, respectively. There was not statistically significant difference between the groups for all investigated polymorphisms (p>0.05). For all single nucleotide polymorphisms studied, allelic distribution was analogous among atopic dermatitis patients and controls, and no significant statistical difference was observed. No homozygous carriers of the TLR2 R753Q single nucleotide polymorphism were found in the atopic dermatitis and control groups. The TLR2 (R753Q and A-16934T) single nucleotide polymorphisms are not associated with atopic dermatitis in a group of Turkish patients.
Improved prediction of biochemical recurrence after radical prostatectomy by genetic polymorphisms.
Morote, Juan; Del Amo, Jokin; Borque, Angel; Ars, Elisabet; Hernández, Carlos; Herranz, Felipe; Arruza, Antonio; Llarena, Roberto; Planas, Jacques; Viso, María J; Palou, Joan; Raventós, Carles X; Tejedor, Diego; Artieda, Marta; Simón, Laureano; Martínez, Antonio; Rioja, Luis A
2010-08-01
Single nucleotide polymorphisms are inherited genetic variations that can predispose or protect individuals against clinical events. We hypothesized that single nucleotide polymorphism profiling may improve the prediction of biochemical recurrence after radical prostatectomy. We performed a retrospective, multi-institutional study of 703 patients treated with radical prostatectomy for clinically localized prostate cancer who had at least 5 years of followup after surgery. All patients were genotyped for 83 prostate cancer related single nucleotide polymorphisms using a low density oligonucleotide microarray. Baseline clinicopathological variables and single nucleotide polymorphisms were analyzed to predict biochemical recurrence within 5 years using stepwise logistic regression. Discrimination was measured by ROC curve AUC, specificity, sensitivity, predictive values, net reclassification improvement and integrated discrimination index. The overall biochemical recurrence rate was 35%. The model with the best fit combined 8 covariates, including the 5 clinicopathological variables prostate specific antigen, Gleason score, pathological stage, lymph node involvement and margin status, and 3 single nucleotide polymorphisms at the KLK2, SULT1A1 and TLR4 genes. Model predictive power was defined by 80% positive predictive value, 74% negative predictive value and an AUC of 0.78. The model based on clinicopathological variables plus single nucleotide polymorphisms showed significant improvement over the model without single nucleotide polymorphisms, as indicated by 23.3% net reclassification improvement (p = 0.003), integrated discrimination index (p <0.001) and likelihood ratio test (p <0.001). Internal validation proved model robustness (bootstrap corrected AUC 0.78, range 0.74 to 0.82). The calibration plot showed close agreement between biochemical recurrence observed and predicted probabilities. Predicting biochemical recurrence after radical prostatectomy based on clinicopathological data can be significantly improved by including patient genetic information. Copyright (c) 2010 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Evidence for a Complex Class of Nonadenylated mRNA in Drosophila
Zimmerman, J. Lynn; Fouts, David L.; Manning, Jerry E.
1980-01-01
The amount, by mass, of poly(A+) mRNA present in the polyribosomes of third-instar larvae of Drosophila melanogaster, and the relative contribution of the poly(A+) mRNA to the sequence complexity of total polysomal RNA, has been determined. Selective removal of poly(A+) mRNA from total polysomal RNA by use of either oligo-dT-cellulose, or poly(U)-sepharose affinity chromatography, revealed that only 0.15% of the mass of the polysomal RNA was present as poly(A+) mRNA. The present study shows that this RNA hybridized at saturation with 3.3% of the single-copy DNA in the Drosophila genome. After correction for asymmetric transcription and reactability of the DNA, 7.4% of the single-copy DNA in the Drosophila genome is represented in larval poly(A+) mRNA. This corresponds to 6.73 x 106 nucleotides of mRNA coding sequences, or approximately 5,384 diverse RNA sequences of average size 1,250 nucleotides. However, total polysomal RNA hybridizes at saturation to 10.9% of the single-copy DNA sequences. After correcting this value for asymmetric transcription and tracer DNA reactability, 24% of the single-copy DNA in Drosophila is represented in total polysomal RNA. This corresponds to 2.18 x 107 nucleotides of RNA coding sequences or 17,440 diverse RNA molecules of size 1,250 nucleotides. This value is 3.2 times greater than that observed for poly(A+) mRNA, and indicates that ≃69% of the polysomal RNA sequence complexity is contributed by nonadenylated RNA. Furthermore, if the number of different structural genes represented in total polysomal RNA is ≃1.7 x 104, then the number of genes expressed in third-instar larvae exceeds the number of chromomeres in Drosophila by about a factor of three. This numerology indicates that the number of chromomeres observed in polytene chromosomes does not reflect the number of structural gene sequences in the Drosophila genome. PMID:6777246
NASA Astrophysics Data System (ADS)
Hoffert, M.; Anderson, R. E.; Stepanauskas, R.; Huber, J. A.
2017-12-01
Deep-sea hydrothermal vents sustain diverse communities of microorganisms. The effects of geochemical and biological interactions on the process of evolution in these ecosystems remains poorly understood because the majority of subsurface microorganisms remain uncultivated. By examining metagenomic samples from hydrothermal fluids and mapping the samples to closely-related genomes found in vent sites, we can better understand how the process of evolution is affected by the geochemical and environmental context in deep-sea vents. The Mid-Cayman Rise is a spreading ridge that hosts both mafic-influenced and ultramafic-influenced vent fields. Previous research on metagenomic samples from sites in the Mid-Cayman Rise has shown that these vents contain metabolically and taxonomically diverse microbial communities. Here, we investigate five single cell amplified Methanothermococcus genomes (SAGs) to investigate patterns in pangenomic variation and molecular evolution in these methanogens. Mappings of metagenomic reads from 15 sample sites to the SAGs reveal substantial variation in Methanothermococcus population abundance, nucleotide variability and selection pressure among the 15 geochemically distinct sample sites. Within each sample site, we observed distinct patterns of single nucleotide variant (SNV) accumulation and selection pressure within the SAG populations. Closely related genomes showed similar patterns of SNV accumulation. Analysis of open reading frames (ORFs) from the SAGs indicated that homologous genes accumulated variation at the same rate. For example, a genomic island for Nif genes was identified in three of the five genomes with significantly elevated SNV counts. dN/dS analyses revealed evidence for frequency-dependent selection, in which genes unique to individual SAGs displayed elevated diversifying selection relative to other genes. These results indicate that different strains of Methanothermococcus outcompete others in specific environmental settings, and that these fitness advantages may result from variation in the pangenome, as revealed by dN/dS and SNV analyses. By examining variation and the scale of nucleotide and genes, we aim to gain insight into the roles of genetic diversity and environmental selection on microbial evolution in these ecosystems.
Chen, Shanyuan; Gomes, Rui; Costa, Vânia; Santos, Pedro; Charneca, Rui; Zhang, Ya-ping; Liu, Xue-hong; Wang, Shao-qing; Bento, Pedro; Nunes, Jose-Luis; Buzgó, József; Varga, Gyula; Anton, István; Zsolnai, Attila; Beja-Pereira, Albano
2013-10-01
The coexistence of wild boars and domestic pigs across Eurasia makes it feasible to conduct comparative genetic or genomic analyses for addressing how genetically different a domestic species is from its wild ancestor. To test whether there are differences in patterns of genetic variability between wild and domestic pigs at immunity-related genes and to detect outlier loci putatively under selection that may underlie differences in immune responses, here we analyzed 54 single-nucleotide polymorphisms (SNPs) of 19 immunity-related candidate genes on 11 autosomes in three pairs of wild boar and domestic pig populations from China, Iberian Peninsula, and Hungary. Our results showed no statistically significant differences in allele frequency and heterozygosity across SNPs between three pairs of wild and domestic populations. This observation was more likely due to the widespread and long-lasting gene flow between wild boars and domestic pigs across Eurasia. In addition, we detected eight coding SNPs from six genes as outliers being under selection consistently by three outlier tests (BayeScan2.1, FDIST2, and Arlequin3.5). Among four non-synonymous outlier SNPs, one from TLR4 gene was identified as being subject to positive (diversifying) selection and three each from CD36, IFNW1, and IL1B genes were suggested as under balancing selection. All of these four non-synonymous variants were predicted as being benign by PolyPhen-2. Our results were supported by other independent lines of evidence for positive selection or balancing selection acting on these four immune genes (CD36, IFNW1, IL1B, and TLR4). Our study showed an example applying a candidate gene approach to identify functionally important mutations (i.e., outlier loci) in wild and domestic pigs for subsequent functional experiments.
Wong, Gerard; Leckie, Christopher; Kowalczyk, Adam
2012-01-15
Feature selection is a key concept in machine learning for microarray datasets, where features represented by probesets are typically several orders of magnitude larger than the available sample size. Computational tractability is a key challenge for feature selection algorithms in handling very high-dimensional datasets beyond a hundred thousand features, such as in datasets produced on single nucleotide polymorphism microarrays. In this article, we present a novel feature set reduction approach that enables scalable feature selection on datasets with hundreds of thousands of features and beyond. Our approach enables more efficient handling of higher resolution datasets to achieve better disease subtype classification of samples for potentially more accurate diagnosis and prognosis, which allows clinicians to make more informed decisions in regards to patient treatment options. We applied our feature set reduction approach to several publicly available cancer single nucleotide polymorphism (SNP) array datasets and evaluated its performance in terms of its multiclass predictive classification accuracy over different cancer subtypes, its speedup in execution as well as its scalability with respect to sample size and array resolution. Feature Set Reduction (FSR) was able to reduce the dimensions of an SNP array dataset by more than two orders of magnitude while achieving at least equal, and in most cases superior predictive classification performance over that achieved on features selected by existing feature selection methods alone. An examination of the biological relevance of frequently selected features from FSR-reduced feature sets revealed strong enrichment in association with cancer. FSR was implemented in MATLAB R2010b and is available at http://ww2.cs.mu.oz.au/~gwong/FSR.
Wu, Lei; He, Yao; Zhang, Di
2015-11-01
To systematically evaluate the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout in East Asian population. The literature retrieval was conducted by using English databases (Medline, EMbase), Chinese databases (CNKI, Vip, Wanfang, SinaMed) and others to collect the published papers on the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout by the end of December 2014. Meta-analysis was performed with software Stata 12.0. Nine studies were included. There were significant associations between increased risk of gout and single nucleotide polymorphism of rs2231142, the combined OR was 2.04 (95%CI: 1.82-2.28) for A allele and C allele, 1.97 (95%CI: 1.57-2.48) for CA and CC, 3.71 (95%CI: 3.07-4.47) for AA and CC. Sex and region specific subgroup analysis showed less heterogeneity. There is significant association between gout and single nucleotide polymorphism of rs2231142 in East Asian population, and A allele is a high risk gene for gout.
CNTNAP2 Is Significantly Associated With Speech Sound Disorder in the Chinese Han Population.
Zhao, Yun-Jing; Wang, Yue-Ping; Yang, Wen-Zhu; Sun, Hong-Wei; Ma, Hong-Wei; Zhao, Ya-Ru
2015-11-01
Speech sound disorder is the most common communication disorder. Some investigations support the possibility that the CNTNAP2 gene might be involved in the pathogenesis of speech-related diseases. To investigate single-nucleotide polymorphisms in the CNTNAP2 gene, 300 unrelated speech sound disorder patients and 200 normal controls were included in the study. Five single-nucleotide polymorphisms were amplified and directly sequenced. Significant differences were found in the genotype (P = .0003) and allele (P = .0056) frequencies of rs2538976 between patients and controls. The excess frequency of the A allele in the patient group remained significant after Bonferroni correction (P = .0280). A significant haplotype association with rs2710102T/+rs17236239A/+2538976A/+2710117A (P = 4.10e-006) was identified. A neighboring single-nucleotide polymorphism, rs10608123, was found in complete linkage disequilibrium with rs2538976, and the genotypes exactly corresponded to each other. The authors propose that these CNTNAP2 variants increase the susceptibility to speech sound disorder. The single-nucleotide polymorphisms rs10608123 and rs2538976 may merge into one single-nucleotide polymorphism. © The Author(s) 2015.
USDA-ARS?s Scientific Manuscript database
Cacao (Theobroma cacao L.) is the source of cocoa powder and butter used for chocolate and this species originated in the rainforests of South America. Indonesia is the 3rd largest cacao producer in the world with an annual cacao output of 0.55 million tons. Knowledge of on-farm genetic diversity is...
USDA-ARS?s Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are the most common genetic markers in Theobroma cacao, occurring approximately once in every 200 nucleotides. SNPs, like microsatellites, are co-dominant and PCR-based, but they have several advantages over microsatellites. They are unambiguous, so that a SN...
McCutchen-Maloney, Sandra L.
2002-01-01
DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.
Park, Ji Hye
2018-01-01
Estimation of postmortem interval (PMI) is paramount in modern forensic investigation. After the disappearance of the early postmortem phenomena conventionally used to estimate PMI, entomologic evidence provides important indicators for PMI estimation. The age of the oldest fly larvae or pupae can be estimated to pinpoint the time of oviposition, which is considered the minimum PMI (PMImin). The development rate of insects is usually temperature dependent and species specific. Therefore, species identification is mandatory for PMImin estimation using entomological evidence. The classical morphological identification method cannot be applied when specimens are damaged or have not yet matured. To overcome this limitation, some investigators employ molecular identification using mitochondrial cytochrome c oxidase subunit I (COI) nucleotide sequences. The molecular identification method commonly uses Sanger's nucleotide sequencing and molecular phylogeny, which are complex and time consuming and constitute another obstacle for forensic investigators. In this study, instead of using conventional Sanger's nucleotide sequencing, single-nucleotide polymorphisms (SNPs) in the COI gene region, which are unique between fly species, were selected and targeted for single-base extension (SBE) technology. These SNPs were genotyped using a SNaPshot® kit. Eleven Calliphoridae and seven Sarcophagidae species were covered. To validate this genotyping, fly DNA samples (103 adults, 84 larvae, and 4 pupae) previously confirmed by DNA barcoding were used. This method worked quickly with minimal DNA, providing a potential alternative to conventional DNA barcoding. Consisting of only a few simple electropherogram peaks, the results were more straightforward compared with those of the conventional DNA barcoding produced by Sanger's nucleotide sequencing. PMID:29682531
Chamala, Srikar; Beckstead, Wesley A; Rowe, Mark J; McClellan, David A
2007-01-01
We investigated whether the effect of evolutionary selection on three recent Single Nucleotide Polymorphisms (SNPs) in the mitochondrial sub-haplogroups of Pima Indians is consistent with their effects on metabolic efficiency. The mitochondrial SNPs impact metabolic rate and respiratory quotient, and may be adaptations to caloric restriction in a desert habitat. Using TreeSAAP software, we examined evolutionary selection in 107 mammalian species at these SNPs, characterising the biochemical shifts produced by the amino acid substitutions. Our results suggest that two SNPs were affected by selection during mammalian evolution in a manner consistent with their effects on metabolic efficiency in Pima Indians.
2012-01-01
Background Water stress limits plant survival and production in many parts of the world. Identification of genes and alleles responding to water stress conditions is important in breeding plants better adapted to drought. Currently there are no studies examining the transcriptome wide gene and allelic expression patterns under water stress conditions. We used RNA sequencing (RNA-seq) to identify the candidate genes and alleles and to explore the evolutionary signatures of selection. Results We studied the effect of water stress on gene expression in Eucalyptus camaldulensis seedlings derived from three natural populations. We used reference-guided transcriptome mapping to study gene expression. Several genes showed differential expression between control and stress conditions. Gene ontology (GO) enrichment tests revealed up-regulation of 140 stress-related gene categories and down-regulation of 35 metabolic and cell wall organisation gene categories. More than 190,000 single nucleotide polymorphisms (SNPs) were detected and 2737 of these showed differential allelic expression. Allelic expression of 52% of these variants was correlated with differential gene expression. Signatures of selection patterns were studied by estimating the proportion of nonsynonymous to synonymous substitution rates (Ka/Ks). The average Ka/Ks ratio among the 13,719 genes was 0.39 indicating that most of the genes are under purifying selection. Among the positively selected genes (Ka/Ks > 1.5) apoptosis and cell death categories were enriched. Of the 287 positively selected genes, ninety genes showed differential expression and 27 SNPs from 17 positively selected genes showed differential allelic expression between treatments. Conclusions Correlation of allelic expression of several SNPs with total gene expression indicates that these variants may be the cis-acting variants or in linkage disequilibrium with such variants. Enrichment of apoptosis and cell death gene categories among the positively selected genes reveals the past selection pressures experienced by the populations used in this study. PMID:22853646
Karimi, Mehran; Zarei, Tahereh; Haghpanah, Sezaneh; Moghadam, Mohamad; Ebrahimi, Ahmad; Rezaei, Narges; Heidari, Ghazaleh; Vazin, Afsaneh; Khavari, Maryam; Miri, Hamid R
2017-05-01
To evaluate the possible relationship between hydroxyurea (HU) response and some single-nucleotide polymorphism (SNP) in patients affected by β-thalassemia intermedia. In this cross-sectional study, 100 β-thalassemia intermedia patients who were taking HU with a dose of 8 to 15 mg/kg body weight per day for a period of at least 6 months were randomly selected between February 2013 and October 2014 in southern Iran. HU response was defined based on decrease or cessation of the blood transfusion need and evaluation of Hb level. In univariate analysis, from all evaluated SNPs, only rs10837814 SNP of olfactory receptors (ORs) OR51B2 showed a significant association with HU response (P=0.038) and from laboratory characteristics, only nucleated red blood cells showed significant associations (116%±183%) in good responders versus (264%±286%) in poor responders (P=0.045). In multiple logistic regression, neither laboratory variables nor different SNPs, showed significant association with HU response. Three novel nucleotide variations (-665 [A→C], -1301 [T→G],-1199 delA) in OR51B2 gene were found in good responders. None of the evaluated SNPs in our study showed significant association with HU response. Further larger studies and evaluation of other genes are suggested.
A high-throughput approach to profile RNA structure.
Delli Ponti, Riccardo; Marti, Stefanie; Armaos, Alexandros; Tartaglia, Gian Gaetano
2017-03-17
Here we introduce the Computational Recognition of Secondary Structure (CROSS) method to calculate the structural profile of an RNA sequence (single- or double-stranded state) at single-nucleotide resolution and without sequence length restrictions. We trained CROSS using data from high-throughput experiments such as Selective 2΄-Hydroxyl Acylation analyzed by Primer Extension (SHAPE; Mouse and HIV transcriptomes) and Parallel Analysis of RNA Structure (PARS; Human and Yeast transcriptomes) as well as high-quality NMR/X-ray structures (PDB database). The algorithm uses primary structure information alone to predict experimental structural profiles with >80% accuracy, showing high performances on large RNAs such as Xist (17 900 nucleotides; Area Under the ROC Curve AUC of 0.75 on dimethyl sulfate (DMS) experiments). We integrated CROSS in thermodynamics-based methods to predict secondary structure and observed an increase in their predictive power by up to 30%. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Does Marriage Moderate Genetic Effects on Delinquency and Violence?
Li, Yi; Liu, Hexuan; Guo, Guang
2015-01-01
Using data from the National Longitudinal Study of Adolescent to Adult Health (N = 1,254), the authors investigated whether marriage can foster desistance from delinquency and violence by moderating genetic effects. In contrast to existing gene–environment research that typically focuses on one or a few genetic polymorphisms, they extended a recently developed mixed linear model to consider the collective influence of 580 single nucleotide polymorphisms in 64 genes related to aggression and risky behavior. The mixed linear model estimates the proportion of variance in the phenotype that is explained by the single nucleotide polymorphisms. The authors found that the proportion of variance in delinquency/violence explained was smaller among married individuals than unmarried individuals. Because selection, confounding, and heterogeneity may bias the estimate of the Gene × Marriage interaction, they conducted a series of analyses to address these issues. The findings suggest that the Gene × Marriage interaction results were not seriously affected by these issues. PMID:26549892
[Association between single-nucleotide polymorphisms in the IRAK-4 gene and allergic rhinitis].
Zhang, Yuan; Xi, Lin; Zhao, Yan-ming; Zhao, Li-ping; Zhang, Luo
2012-06-01
To investigate the genetic association pattern between single-nucleotide polymorphisms (SNP) in the interleukin-1 receptor-associated kinase 4 (IRAK-4) gene and allergic rhinitis (AR). A population of 379 patients with the diagnosis of AR and 333 healthy controls who lived in Beijing region was recruited. A total of 8 reprehensive marker SNP which were in IRAK-4 gene region were selected according to the Beijing people database from Hapmap website. The individual genotyping was performed by MassARRAY platform. SPSS 13.0 software was used for statistic analysis. Subgroup analysis for the presence of different allergen sensitivities displayed associations only in the house dust mite-allergic cohorts (rs3794262: P = 0.0034, OR = 1.7388; rs4251481: P = 0.0023, OR = 2.6593), but not in subjects who were allergic to pollens as well as mix allergens. The potential genetic contribution of the IRAK-4 gene to AR demonstrated an allergen-dependant association pattern in Chinese population.
Saad, Mohamed N.; Mabrouk, Mai S.; Eldeib, Ayman M.; Shaker, Olfat G.
2015-01-01
Genetics of autoimmune diseases represent a growing domain with surpassing biomarker results with rapid progress. The exact cause of Rheumatoid Arthritis (RA) is unknown, but it is thought to have both a genetic and an environmental bases. Genetic biomarkers are capable of changing the supervision of RA by allowing not only the detection of susceptible individuals, but also early diagnosis, evaluation of disease severity, selection of therapy, and monitoring of response to therapy. This review is concerned with not only the genetic biomarkers of RA but also the methods of identifying them. Many of the identified genetic biomarkers of RA were identified in populations of European and Asian ancestries. The study of additional human populations may yield novel results. Most of the researchers in the field of identifying RA biomarkers use single nucleotide polymorphism (SNP) approaches to express the significance of their results. Although, haplotype block methods are expected to play a complementary role in the future of that field. PMID:26843965
Lado, Bettina; Matus, Ivan; Rodríguez, Alejandra; Inostroza, Luis; Poland, Jesse; Belzile, François; del Pozo, Alejandro; Quincke, Martín; Castro, Marina; von Zitzewitz, Jarislav
2013-12-09
In crop breeding, the interest of predicting the performance of candidate cultivars in the field has increased due to recent advances in molecular breeding technologies. However, the complexity of the wheat genome presents some challenges for applying new technologies in molecular marker identification with next-generation sequencing. We applied genotyping-by-sequencing, a recently developed method to identify single-nucleotide polymorphisms, in the genomes of 384 wheat (Triticum aestivum) genotypes that were field tested under three different water regimes in Mediterranean climatic conditions: rain-fed only, mild water stress, and fully irrigated. We identified 102,324 single-nucleotide polymorphisms in these genotypes, and the phenotypic data were used to train and test genomic selection models intended to predict yield, thousand-kernel weight, number of kernels per spike, and heading date. Phenotypic data showed marked spatial variation. Therefore, different models were tested to correct the trends observed in the field. A mixed-model using moving-means as a covariate was found to best fit the data. When we applied the genomic selection models, the accuracy of predicted traits increased with spatial adjustment. Multiple genomic selection models were tested, and a Gaussian kernel model was determined to give the highest accuracy. The best predictions between environments were obtained when data from different years were used to train the model. Our results confirm that genotyping-by-sequencing is an effective tool to obtain genome-wide information for crops with complex genomes, that these data are efficient for predicting traits, and that correction of spatial variation is a crucial ingredient to increase prediction accuracy in genomic selection models.
Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y F; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie; Martin, Darren Patrick
2014-02-01
Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here.
Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y. F.; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie
2014-01-01
Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here. PMID:24284329
Jafari, Naghmeh; Broer, Linda; Hoppenbrouwers, Ilse A; van Duijn, Cornelia M; Hintzen, Rogier Q
2010-11-01
Multiple sclerosis is a presumed autoimmune disease associated with genetic and environmental risk factors such as infectious mononucleosis. Recent research has shown infectious mononucleosis to be associated with a specific HLA class I polymorphism. Our aim was to test if the infectious mononucleosis-linked HLA class I single nucleotide polymorphism (rs6457110) is also associated with multiple sclerosis. Genotyping of the HLA-A single nucleotide polymorphism rs6457110 using TaqMan was performed in 591 multiple sclerosis cases and 600 controls. The association of multiple sclerosis with the HLA-A single nucleotide polymorphism was tested using logistic regression adjusted for age, sex and HLA-DRB1*1501. HLA-A minor allele (A) is associated with multiple sclerosis (OR = 0.68; p = 4.08 × 10( -5)). After stratification for HLA-DRB1*1501 risk allele (T) carrier we showed a significant OR of 0.70 (p = 0.003) for HLA-A. HLA class I single nucleotide polymorphism rs6457110 is associated with infectious mononucleosis and multiple sclerosis, independent of the major class II allele, supporting the hypothesis that shared genetics may contribute to the association between infectious mononucleosis and multiple sclerosis.
CLUSTAG: hierarchical clustering and graph methods for selecting tag SNPs.
Ao, S I; Yip, Kevin; Ng, Michael; Cheung, David; Fong, Pui-Yee; Melhado, Ian; Sham, Pak C
2005-04-15
Cluster and set-cover algorithms are developed to obtain a set of tag single nucleotide polymorphisms (SNPs) that can represent all the known SNPs in a chromosomal region, subject to the constraint that all SNPs must have a squared correlation R2>C with at least one tag SNP, where C is specified by the user. http://hkumath.hku.hk/web/link/CLUSTAG/CLUSTAG.html mng@maths.hku.hk.
USDA-ARS?s Scientific Manuscript database
Background: DNA methylation is influenced by diet and single nucleotide polymorphisms (SNPs), and methylation modulates gene expression. Objective: We aimed to explore whether the gene-by-diet interactions on blood lipids act through DNA methylation. Design: We selected 7 SNPs on the basis of predic...
Veale, Andrew J.
2017-01-01
Mechanisms underlying adaptive evolution can best be explored using paired populations displaying similar phenotypic divergence, illuminating the genomic changes associated with specific life history traits. Here, we used paired migratory [anadromous vs. resident (kokanee)] and reproductive [shore- vs. stream-spawning] ecotypes of sockeye salmon (Oncorhynchus nerka) sampled from seven lakes and two rivers spanning three catchments (Columbia, Fraser, and Skeena) in British Columbia, Canada to investigate the patterns and processes underlying their divergence. Restriction-site associated DNA sequencing was used to genotype this sampling at 7,347 single nucleotide polymorphisms, 334 of which were identified as outlier loci and candidates for divergent selection within at least one ecotype comparison. Sixty-eight of these outliers were present in two or more comparisons, with 33 detected across multiple catchments. Of particular note, one locus was detected as the most significant outlier between shore and stream-spawning ecotypes in multiple comparisons and across catchments (Columbia, Fraser, and Snake). We also detected several genomic islands of divergence, some shared among comparisons, potentially showing linked signals of differential selection. The single nucleotide polymorphisms and genomic regions identified in our study offer a range of mechanistic hypotheses associated with the genetic basis of O. nerka life history variation and provide novel tools for informing fisheries management. PMID:29045601
Investigation of Genetic Variants Associated with Alzheimer Disease in Parkinson Disease Cognition.
Barrett, Matthew J; Koeppel, Alexander F; Flanigan, Joseph L; Turner, Stephen D; Worrall, Bradford B
2016-01-01
Meta-analysis of genome-wide association studies have implicated multiple single nucleotide polymorphisms (SNPs) and associated genes with Alzheimer disease. The role of these SNPs in cognitive impairment in Parkinson disease (PD) remains incompletely evaluated. The objective of this study was to test alleles associated with risk of Alzheimer disease for association with cognitive impairment in Parkinson disease (PD). Two datasets with PD subjects accessed through the NIH database of Genotypes and Phenotypes contained both single nucleotide polymorphism (SNP) arrays and mini-mental state exam (MMSE) scores. Genetic data underwent rigorous quality control and we selected SNPs for genes associated with AD other than APOE. We constructed logistic regression and ordinal regression models, adjusted for sex, age at MMSE, and duration of PD, to assess the association between selected SNPs and MMSE score. In one dataset, PICALM rs3851179 was associated with cognitive impairment (MMSE < 24) in PD subjects > 70 years old (OR = 2.3; adjusted p-value = 0.017; n = 250) but not in PD subjects ≤ 70 years old. Our finding suggests that PICALM rs3851179 could contribute to cognitive impairment in older patients with PD. It is important that future studies consider the interaction of age and genetic risk factors in the development of cognitive impairment in PD.
Theodorou, Panagiotis; Radzevičiūtė, Rita; Kahnt, Belinda; Soro, Antonella; Grosse, Ivo; Paxton, Robert J
2018-04-25
Urbanization is considered a global threat to biodiversity; the growth of cities results in an increase in impervious surfaces, soil and air pollution, fragmentation of natural vegetation and invasion of non-native species, along with numerous environmental changes, including the heat island phenomenon. The combination of these effects constitutes a challenge for both the survival and persistence of many native species, while also imposing altered selective regimes. Here, using 110 314 single nucleotide polymorphisms generated by restriction-site-associated DNA sequencing, we investigated the genome-wide effects of urbanization on putative neutral and adaptive genomic diversity in a major insect pollinator, Bombus lapidarius , collected from nine German cities and nine paired rural sites. Overall, genetic differentiation among sites was low and there was no obvious genome-wide genetic structuring, suggesting the absence of strong effects of urbanization on gene flow. We nevertheless identified several loci under directional selection, a subset of which was associated with urban land use, including the percentage of impervious surface surrounding each sampling site. Overall, our results provide evidence of local adaptation to urbanization in the face of gene flow in a highly mobile insect pollinator. © 2018 The Author(s).
Wang, Junxiu; Xiong, Guoliang; Ma, Liang; Wang, Shihui; Zhou, Xu; Wang, Lei; Xiao, Lehui; Su, Xin; Yu, Changyuan
2017-08-15
Single-nucleotide mutation (SNM) has proven to be associated with a variety of human diseases. Development of reliable methods for the detection of SNM is crucial for molecular diagnosis and personalized medicine. The sandwich assays are widely used tools for detecting nucleic acid biomarkers due to their low cost and rapid signaling. However, the poor hybridization specificity of signal probe at room temperature hampers the discrimination of mutant and wild type. Here, we demonstrate a dynamic sandwich assay on magnetic beads for SNM detection based on the transient binding between signal probe and target. By taking the advantage of mismatch sensitive thermodynamics of transient DNA binding, the dynamic sandwich assay exhibits high discrimination factor for mutant with a broad range of salt concentration at room temperature. The beads used in this assay serve as a tool for separation, and might be helpful to enhance SNM selectivity. Flexible design of signal probe and facile magnetic separation allow multiple-mode downstream analysis including colorimetric detection and isothermal amplification. With this method, BRAF mutations in the genomic DNA extracted from cancer cell lines were tested, allowing sensitive detection of SNM at very low abundances (0.1-0.5% mutant/wild type). Copyright © 2017 Elsevier B.V. All rights reserved.
Chromosome-scale selective sweeps shape Caenorhabditis elegans genomic diversity
Andersen, Erik C.; Gerke, Justin P.; Shapiro, Joshua A.; Crissman, Jonathan R.; Ghosh, Rajarshi; Bloom, Joshua S.; Félix, Marie-Anne; Kruglyak, Leonid
2011-01-01
The nematode Caenorhabditis elegans is central to research in molecular, cell, and developmental biology, but nearly all of this research has been conducted on a single strain. Comparatively little is known about the population genomic and evolutionary history of this species. We characterized C. elegans genetic variation by high-throughput selective sequencing of a worldwide collection of 200 wild strains, identifying 41,188 single nucleotide polymorphisms. Unexpectedly, C. elegans genome variation is dominated by a set of commonly shared haplotypes on four of the six chromosomes, each spanning many megabases. Population-genetic modeling shows that this pattern was generated by chromosome-scale selective sweeps that have reduced variation worldwide; at least one of these sweeps likely occurred in the past few hundred years. These sweeps, which we hypothesize to be a result of human activity, have dramatically reshaped the global C. elegans population in the recent past. PMID:22286215
A Tradeoff Drives the Evolution of Reduced Metal Resistance in Natural Populations of Yeast
Chang, Shang-Lin; Leu, Jun-Yi
2011-01-01
Various types of genetic modification and selective forces have been implicated in the process of adaptation to novel or adverse environments. However, the underlying molecular mechanisms are not well understood in most natural populations. Here we report that a set of yeast strains collected from Evolution Canyon (EC), Israel, exhibit an extremely high tolerance to the heavy metal cadmium. We found that cadmium resistance is primarily caused by an enhanced function of a metal efflux pump, PCA1. Molecular analyses demonstrate that this enhancement can be largely attributed to mutations in the promoter sequence, while mutations in the coding region have a minor effect. Reconstruction experiments show that three single nucleotide substitutions in the PCA1 promoter quantitatively increase its activity and thus enhance the cells' cadmium resistance. Comparison among different yeast species shows that the critical nucleotides found in EC strains are conserved and functionally important for cadmium resistance in other species, suggesting that they represent an ancestral type. However, these nucleotides had diverged in most Saccharomyces cerevisiae populations, which gave cells growth advantages under conditions where cadmium is low or absent. Our results provide a rare example of a selective sweep in yeast populations driven by a tradeoff in metal resistance. PMID:21483812
NASA Astrophysics Data System (ADS)
Luo, Qingying; Liu, Lin; Yang, Cai; Yuan, Jing; Feng, Hongtao; Chen, Yan; Zhao, Peng; Yu, Zhiqiang; Jin, Zongwen
2018-03-01
MicroRNAs (miRNAs) are single stranded endogenous molecules composed of only 18-24 nucleotides which are critical for gene expression regulating the translation of messenger RNAs. Conventional methods based on enzyme-assisted nucleic acid amplification techniques have many problems, such as easy contamination, high cost, susceptibility to false amplification, and tendency to have sequence mismatches. Here we report a rapid, ratiometric, enzyme-free, sensitive, and highly selective single-step miRNA detection using three-way junction assembled (or self-assembled) FRET probes. The developed strategy can be operated within the linear range from subnanomolar to hundred nanomolar concentrations of miRNAs. In comparison with the traditional approaches, our method showed high sensitivity for the miRNA detection and extreme selectivity for the efficient discrimination of single-base mismatches. The results reveal that the strategy paved a new avenue for the design of novel highly specific probes applicable in diagnostics and potentially in microscopic imaging of miRNAs in real biological environments.
Gomez, M; Kioussis, D; Cantrell, D A
2001-11-01
The positive selection of CD4 or CD8 single-positive mature peripheral T lymphocytes and the deletion of self-reactive cells are crucial for central tolerance in the peripheral immune system. Previously, the guanine nucleotide binding protein Rac-1 has been shown to control pre-T cell development. The present report now describes the actions of Rac-1 in thymocyte selection. The study reveals that this molecule has the striking and unique ability to efficiently divert cells from positive selection into a pathway of negative selection and deletion. The ability of Rac-1 to switch thymocytes from a destiny of positive to negative selection identifies this molecule as a critical regulator of the developmental processes in T cells that are essential for immune homeostasis.
Electrical detection and quantification of single and mixed DNA nucleotides in suspension
NASA Astrophysics Data System (ADS)
Ahmad, Mahmoud Al; Panicker, Neena G.; Rizvi, Tahir A.; Mustafa, Farah
2016-09-01
High speed sequential identification of the building blocks of DNA, (deoxyribonucleotides or nucleotides for short) without labeling or processing in long reads of DNA is the need of the hour. This can be accomplished through exploiting their unique electrical properties. In this study, the four different types of nucleotides that constitute a DNA molecule were suspended in a buffer followed by performing several types of electrical measurements. These electrical parameters were then used to quantify the suspended DNA nucleotides. Thus, we present a purely electrical counting scheme based on the semiconductor theory that allows one to determine the number of nucleotides in a solution by measuring their capacitance-voltage dependency. The nucleotide count was observed to be similar to the multiplication of the corresponding dopant concentration and debye volume after de-embedding the buffer contribution. The presented approach allows for a fast and label-free quantification of single and mixed nucleotides in a solution.
Guirao-Rico, Sara; Aguadé, Montserrat
2013-01-01
In Drosophila, the insulin-signaling pathway controls some life history traits, such as fertility and lifespan, and it is considered to be the main metabolic pathway involved in establishing adult body size. Several observations concerning variation in body size in the Drosophila genus are suggestive of its adaptive character. Genes encoding proteins in this pathway are, therefore, good candidates to have experienced adaptive changes and to reveal the footprint of positive selection. The Drosophila insulin-like peptides (DILPs) are the ligands that trigger the insulin-signaling cascade. In Drosophila melanogaster, there are several peptides that are structurally similar to the single mammalian insulin peptide. The footprint of recent adaptive changes on nucleotide variation can be unveiled through the analysis of polymorphism and divergence. With this aim, we have surveyed nucleotide sequence variation at the dilp1-7 genes in a natural population of D. melanogaster. The comparison of polymorphism in D. melanogaster and divergence from D. simulans at different functional classes of the dilp genes provided no evidence of adaptive protein evolution after the split of the D. melanogaster and D. simulans lineages. However, our survey of polymorphism at the dilp gene regions of D. melanogaster has provided some evidence for the action of positive selection at or near these genes. The regions encompassing the dilp1-4 genes and the dilp6 gene stand out as likely affected by recent adaptive events. PMID:23308258
NASA Astrophysics Data System (ADS)
Liu, Siwei; Li, Qi; Yu, Hong; Kong, Lingfeng
2017-02-01
Glycogen is important not only for the energy supplementary of oysters, but also for human consumption. High glycogen content can improve the stress survival of oyster. A key enzyme in glycogenesis is glycogen synthase that is encoded by glycogen synthase gene GYS. In this study, the relationship between single nucleotide polymorphisms (SNPs) in coding regions of Crassostrea gigas GYS (Cg-GYS) and individual glycogen content was investigated with 321 individuals from five full-sib families. Single-strand conformation polymorphism (SSCP) procedure was combined with sequencing to confirm individual SNP genotypes of Cg-GYS. Least-square analysis of variance was performed to assess the relationship of variation in glycogen content of C. gigas with single SNP genotype and SNP haplotype. As a consequence, six SNPs were found in coding regions to be significantly associated with glycogen content ( P < 0.01), from which we constructed four main haplotypes due to linkage disequilibrium. Furthermore, the most effective haplotype H2 (GAGGAT) had extremely significant relationship with high glycogen content ( P < 0.0001). These findings revealed the potential influence of Cg-GYS polymorphism on the glycogen content and provided molecular biological information for the selective breeding of good quality traits of C. gigas.
An atypical CNG channel activated by a single cGMP molecule controls sperm chemotaxis.
Bönigk, Wolfgang; Loogen, Astrid; Seifert, Reinhard; Kashikar, Nachiket; Klemm, Clementine; Krause, Eberhard; Hagen, Volker; Kremmer, Elisabeth; Strünker, Timo; Kaupp, U Benjamin
2009-10-27
Sperm of the sea urchin Arbacia punctulata can respond to a single molecule of chemoattractant released by an egg. The mechanism underlying this extreme sensitivity is unknown. Crucial signaling events in the response of A. punctulata sperm to chemoattractant include the rapid synthesis of the intracellular messenger guanosine 3',5'-monophosphate (cGMP) and the ensuing membrane hyperpolarization that results from the opening of potassium-selective cyclic nucleotide-gated (CNGK) channels. Here, we use calibrated photolysis of caged cGMP to show that approximately 45 cGMP molecules are generated during the response to a single molecule of chemoattractant. The CNGK channel can respond to such small cGMP changes because it is exquisitely sensitive to cGMP and activated in a noncooperative fashion. Like voltage-activated Ca(v) and Na(v) channels, the CNGK polypeptide consists of four homologous repeat sequences. Disabling each of the four cyclic nucleotide-binding sites through mutagenesis revealed that binding of a single cGMP molecule to repeat 3 is necessary and sufficient to activate the CNGK channel. Thus, CNGK has developed a mechanism of activation that is different from the activation of other CNG channels, which requires the cooperative binding of several ligands and operates in the micromolar rather than the nanomolar range.
Beaulieu, Jean; Doerksen, Trevor; Boyle, Brian; Clément, Sébastien; Deslauriers, Marie; Beauseigle, Stéphanie; Blais, Sylvie; Poulin, Pier-Luc; Lenz, Patrick; Caron, Sébastien; Rigault, Philippe; Bicho, Paul; Bousquet, Jean; MacKay, John
2011-01-01
Marker-assisted selection holds promise for highly influencing tree breeding, especially for wood traits, by considerably reducing breeding cycles and increasing selection accuracy. In this study, we used a candidate gene approach to test for associations between 944 single-nucleotide polymorphism markers from 549 candidate genes and 25 wood quality traits in white spruce. A mixed-linear model approach, including a weak but nonsignificant population structure, was implemented for each marker–trait combination. Relatedness among individuals was controlled using a kinship matrix estimated either from the known half-sib structure or from the markers. Both additive and dominance effect models were tested. Between 8 and 21 single-nucleotide polymorphisms (SNPs) were found to be significantly associated (P ≤ 0.01) with each of earlywood, latewood, or total wood traits. After controlling for multiple testing (Q ≤ 0.10), 13 SNPs were still significant across as many genes belonging to different families, each accounting for between 3 and 5% of the phenotypic variance in 10 wood characters. Transcript accumulation was determined for genes containing SNPs associated with these traits. Significantly different transcript levels (P ≤ 0.05) were found among the SNP genotypes of a 1-aminocyclopropane-1-carboxylate oxidase, a β-tonoplast intrinsic protein, and a long-chain acyl-CoA synthetase 9. These results should contribute toward the development of efficient marker-assisted selection in an economically important tree species. PMID:21385726
Weng, Jianfeng; Li, Bo; Liu, Changlin; Yang, Xiaoyan; Wang, Hongwei; Hao, Zhuanfang; Li, Mingshun; Zhang, Degui; Ci, Xiaoke; Li, Xinhai; Zhang, Shihuang
2013-07-05
Kernel weight, controlled by quantitative trait loci (QTL), is an important component of grain yield in maize. Cytokinins (CKs) participate in determining grain morphology and final grain yield in crops. ZmIPT2, which is expressed mainly in the basal transfer cell layer, endosperm, and embryo during maize kernel development, encodes an isopentenyl transferase (IPT) that is involved in CK biosynthesis. The coding region of ZmIPT2 was sequenced across a panel of 175 maize inbred lines that are currently used in Chinese maize breeding programs. Only 16 single nucleotide polymorphisms (SNPs) and seven haplotypes were detected among these inbred lines. Nucleotide diversity (π) within the ZmIPT2 window and coding region were 0.347 and 0.0047, respectively, and they were significantly lower than the mean nucleotide diversity value of 0.372 for maize Chromosome 2 (P < 0.01). Association mapping revealed that a single nucleotide change from cytosine (C) to thymine (T) in the ZmIPT2 coding region, which converted a proline residue into a serine residue, was significantly associated with hundred kernel weight (HKW) in three environments (P <0.05), and explained 4.76% of the total phenotypic variation. In vitro characterization suggests that the dimethylallyl diphospate (DMAPP) IPT activity of ZmIPT2-T is higher than that of ZmIPT2-C, as the amounts of adenosine triphosphate (ATP), adenosine diphosphate (ADP), and adenosine monophosphate (AMP) consumed by ZmIPT2-T were 5.48-, 2.70-, and 1.87-fold, respectively, greater than those consumed by ZmIPT2-C. The effects of artificial selection on the ZmIPT2 coding region were evaluated using Tajima's D tests across six subgroups of Chinese maize germplasm, with the most frequent favorable allele identified in subgroup PB (Partner B). These results showed that ZmIPT2, which is associated with kernel weight, was subjected to artificial selection during the maize breeding process. ZmIPT2-T had higher IPT activity than ZmIPT2-C, and this favorable allele for kernel weight could be used in molecular marker-assisted selection for improvement of grain yield components in Chinese maize breeding programs.
Positive selection in the SLC11A1 gene in the family Equidae.
Bayerova, Zuzana; Janova, Eva; Matiasovic, Jan; Orlando, Ludovic; Horin, Petr
2016-05-01
Immunity-related genes are a suitable model for studying effects of selection at the genomic level. Some of them are highly conserved due to functional constraints and purifying selection, while others are variable and change quickly to cope with the variation of pathogens. The SLC11A1 gene encodes a transporter protein mediating antimicrobial activity of macrophages. Little is known about the patterns of selection shaping this gene during evolution. Although it is a typical evolutionarily conserved gene, functionally important polymorphisms associated with various diseases were identified in humans and other species. We analyzed the genomic organization, genetic variation, and evolution of the SLC11A1 gene in the family Equidae to identify patterns of selection within this important gene. Nucleotide SLC11A1 sequences were shown to be highly conserved in ten equid species, with more than 97 % sequence identity across the family. Single nucleotide polymorphisms (SNPs) were found in the coding and noncoding regions of the gene. Seven codon sites were identified to be under strong purifying selection. Codons located in three regions, including the glycosylated extracellular loop, were shown to be under diversifying selection. A 3-bp indel resulting in a deletion of the amino acid 321 in the predicted protein was observed in all horses, while it has been maintained in all other equid species. This codon comprised in an N-glycosylation site was found to be under positive selection. Interspecific variation in the presence of predicted N-glycosylation sites was observed.
Data on polymorphisms in CYP2A6 associated to risk and predispose to smoking related variables.
López-Flores, Luis A; Pérez-Rubio, Gloria; Ramírez-Venegas, Alejandra; Ambrocio-Ortiz, Enrique; Sansores, Raúl H; Falfán-Valencia, Ramcés
2017-12-01
This article contains data on the single nucleotide polymorphisms (SNPs) rs1137115, rs1801272 and rs28399433 rs4105144 in CYP2A6 associated to smoking related variables in Mexican Mestizo smokers (Pérez-Rubio et al., 2017) [1]. These SNPs were selected due to previous associations with other populations. Mexican Mestizo smokers were classified according their smoking pattern. A genetic association test was performed.
Shetova, I M; Timofeev, D Iu; Shamalov, N A; Bondarenko, E A; Slominskiĭ, P A; Limborskaia, S A; Skvortsova, V I
2012-01-01
The analysis of association between DNA markers and total stroke risk was performed in 950 Slavonic patients. Patients with cardioembolic stroke were selected for a genome-wide association study. The HUMANCYTOSNP12 v.2 microchip was used to analyze all DNA samples on a panel of 301 000 single nucleotide polymorphisms. SNP rs1842993 on chromosome 7 was found to be associated with cardioembolic stroke risk.
Yi, Ping; Chen, Zhuqin; Zhao, Yan; Guo, Jianxin; Fu, Huabin; Zhou, Yuanguo; Yu, Lili; Li, Li
2009-03-01
The discovery of fetal DNA in maternal plasma has opened up an approach for noninvasive diagnosis. We have now assessed the possibility of detecting single-nucleotide differences between fetal and maternal DNA in maternal plasma by polymerase chain reaction (PCR)/ligase detection reaction((LDR)/capillary electrophoresis. PCR/LDR/capillary electrophoresis was applied to detect the genotype of c.454-397T>gene (ESR1) from experimental DNA models of maternal plasma at different sensitivity levels and 13 maternal plasma samples.alphaC in estrogen receptor. (1) Our results demonstrated that the technique could discriminate low abundance single-nucleotide mutation with a mutant/normal allele ratio up to 1:10 000. (2) Examination of ESR1 c.454-397T>C genotypes by using the method of restriction fragment length analysis was performed in 25 pregnant women, of whom 13 pregnant women had homozygous genotypes. The c.454-397T>C genotypes of paternally inherited fetal DNA in maternal plasma of these 13 women were detected by PCR/LDR/capillary electrophoresis, which were accordant with the results of umbilical cord blood. PCR/LDR/capillary electrophoresis has very high sensitivity to distinguish low abundance single nucleotide differences and can discriminate point mutations and single-nucleotide polymorphisms(SNPs) of paternally inherited fetal DNA in maternal plasma.
Kim, Kwondo; Jung, Jaehoon; Caetano-Anollés, Kelsey; Sung, Samsun; Yoo, DongAhn; Choi, Bong-Hwan; Kim, Hyung-Chul; Jeong, Jin-Young; Cho, Yong-Min; Park, Eung-Woo; Choi, Tae-Jeong; Park, Byoungho; Lim, Dajeong
2018-01-01
Artificial selection has been demonstrated to have a rapid and significant effect on the phenotype and genome of an organism. However, most previous studies on artificial selection have focused solely on genomic sequences modified by artificial selection or genomic sequences associated with a specific trait. In this study, we generated whole genome sequencing data of 126 cattle under artificial selection, and 24,973,862 single nucleotide variants to investigate the relationship among artificial selection, genomic sequences and trait. Using runs of homozygosity detected by the variants, we showed increase of inbreeding for decades, and at the same time demonstrated a little influence of recent inbreeding on body weight. Also, we could identify ~0.2 Mb runs of homozygosity segment which may be created by recent artificial selection. This approach may aid in development of genetic markers directly influenced by artificial selection, and provide insight into the process of artificial selection. PMID:29561881
Kovaliov, Marina; Weitman, Michal; Major, Dan Thomas; Fischer, Bilha
2014-08-01
To expand the arsenal of fluorescent cytidine analogues for the detection of genetic material, we synthesized para-substituted phenyl-imidazolo-cytidine ((Ph)ImC) analogues 5a-g and established a relationship between their structure and fluorescence properties. These analogues were more emissive than cytidine (λem 398-420 nm, Φ 0.009-0.687), and excellent correlation was found between Φ of 5a-g and σp(-) of the substituent on the phenyl-imidazolo moiety (R(2) = 0.94). Calculations suggested that the dominant tautomer of (Ph)ImC in methanol solution is identical to that of cytidine. DFT calculations of the stable tautomer of selected (Ph)ImC analogues suggested a relationship between the HOMO-LUMO gap and Φ and explained the loss of fluorescence in the nitro analogue. Incorporation of the CF3-(Ph)ImdC analogue into a DNA probe resulted in 6-fold fluorescence quenching of the former. A 17-fold reduction of fluorescence was observed for the G-matched duplex vs ODN(CF3-(Ph)ImdC), while for A-mismatched duplex, only a 2-fold decrease was observed. Furthermore, since the quantum yield of ODN(CF3-(Ph)ImdC):ODN(G) was reduced 17-fold vs that of a single strand, whereas that of ODN(CF3-(Ph)ImdC):ORN(G) was reduced only 3.8-fold, ODN(CF3-(Ph)ImdC) appears to be a DNA-selective probe. We conclude that the ODN(CF3-(Ph)ImdC) probe, exhibiting emission sensitivity upon single nucleotide replacement, may be potentially useful for DNA single nucleotide polymorphism (SNP) typing.
van Binsbergen, R; Veerkamp, R F; Calus, M P L
2012-04-01
The correlated responses between traits may differ depending on the makeup of genetic covariances, and may differ from the predictions of polygenic covariances. Therefore, the objective of the present study was to investigate the makeup of the genetic covariances between the well-studied traits: milk yield, fat yield, protein yield, and their percentages in more detail. Phenotypic records of 1,737 heifers of research farms in 4 different countries were used after homogenizing and adjusting for management effects. All cows had a genotype for 37,590 single nucleotide polymorphisms (SNP). A bayesian stochastic search variable selection model was used to estimate the SNP effects for each trait. About 0.5 to 1.0% of the SNP had a significant effect on 1 or more traits; however, the SNP without a significant effect explained most of the genetic variances and covariances of the traits. Single nucleotide polymorphism correlations differed from the polygenic correlations, but only 10 regions were found with an effect on multiple traits; in 1 of these regions the DGAT1 gene was previously reported with an effect on multiple traits. This region explained up to 41% of the variances of 4 traits and explained a major part of the correlation between fat yield and fat percentage and contributes to asymmetry in correlated response between fat yield and fat percentage. Overall, for the traits in this study, the infinitesimal model is expected to be sufficient for the estimation of the variances and covariances. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Precise detection of de novo single nucleotide variants in human genomes.
Gómez-Romero, Laura; Palacios-Flores, Kim; Reyes, José; García, Delfino; Boege, Margareta; Dávila, Guillermo; Flores, Margarita; Schatz, Michael C; Palacios, Rafael
2018-05-22
The precise determination of de novo genetic variants has enormous implications across different fields of biology and medicine, particularly personalized medicine. Currently, de novo variations are identified by mapping sample reads from a parent-offspring trio to a reference genome, allowing for a certain degree of differences. While widely used, this approach often introduces false-positive (FP) results due to misaligned reads and mischaracterized sequencing errors. In a previous study, we developed an alternative approach to accurately identify single nucleotide variants (SNVs) using only perfect matches. However, this approach could be applied only to haploid regions of the genome and was computationally intensive. In this study, we present a unique approach, coverage-based single nucleotide variant identification (COBASI), which allows the exploration of the entire genome using second-generation short sequence reads without extensive computing requirements. COBASI identifies SNVs using changes in coverage of exactly matching unique substrings, and is particularly suited for pinpointing de novo SNVs. Unlike other approaches that require population frequencies across hundreds of samples to filter out any methodological biases, COBASI can be applied to detect de novo SNVs within isolated families. We demonstrate this capability through extensive simulation studies and by studying a parent-offspring trio we sequenced using short reads. Experimental validation of all 58 candidate de novo SNVs and a selection of non-de novo SNVs found in the trio confirmed zero FP calls. COBASI is available as open source at https://github.com/Laura-Gomez/COBASI for any researcher to use. Copyright © 2018 the Author(s). Published by PNAS.
Deng, Hong-Zhu; You, Cong; Xing, Yu; Chen, Kai-Yun; Zou, Xiao-Bing
2016-05-01
Autism spectrum disorder is a group of neurodevelopmental disorders with the higher prevalence in males. Our previous studies have indicated lower progesterone levels in the children with autism spectrum disorder, suggesting involvement of the cytochrome P-450scc gene (CYP11A1) and cytochrome P-45011beta gene (CYP11B1) as candidate genes in autism spectrum disorder. The aim of this study was to investigate the family-based genetic association between single-nucleotide polymorphisms, rs2279357 in the CYP11A1 gene and rs4534 and rs4541 in the CYP11B1 gene and autism spectrum disorder in Chinese children, which were selected according to the location in the coding region and 5' and 3' regions and minor allele frequencies of greater than 0.05 in the Chinese populations. The transmission disequilibrium test and case-control association analyses were performed in 100 Chinese Han autism spectrum disorder family trios. The genotype and allele frequency of the 3 single-nucleotide polymorphisms had no statistical difference between the children with autism spectrum disorder and their parents (P> .05). Transmission disequilibrium test analysis showed transmission disequilibrium of CYP11A1 gene rs2279357 single-nucleotide polymorphisms (χ(2)= 5.038,P< .001). Our findings provide further support for the hypothesis that a susceptibility gene for autism spectrum disorder exists within or near the CYP11A1 gene in the Han Chinese population. © The Author(s) 2015.
Precision-engineering the Pseudomonas aeruginosa genome with two-step allelic exchange
Hmelo, Laura R.; Borlee, Bradley R.; Almblad, Henrik; Love, Michelle E.; Randall, Trevor E.; Tseng, Boo Shan; Lin, Chuyang; Irie, Yasuhiko; Storek, Kelly M.; Yang, Jaeun Jane; Siehnel, Richard J.; Howell, P. Lynne; Singh, Pradeep K.; Tolker-Nielsen, Tim; Parsek, Matthew R.; Schweizer, Herbert P.; Harrison, Joe J.
2016-01-01
Allelic exchange is an efficient method of bacterial genome engineering. This protocol describes the use of this technique to make gene knockouts and knockins, as well as single nucleotide insertions, deletions and substitutions in Pseudomonas aeruginosa. Unlike other approaches to allelic exchange, this protocol does not require heterologous recombinases to insert or excise selective markers from the target chromosome. Rather, positive and negative selection are enabled solely by suicide vector-encoded functions and host cell proteins. Here, mutant alleles, which are flanked by regions of homology to the recipient chromosome, are synthesized in vitro and then cloned into allelic exchange vectors using standard procedures. These suicide vectors are then introduced into recipient cells by conjugation. Homologous recombination then results in antibiotic resistant single-crossover mutants in which the plasmid has integrated site-specifically into the chromosome. Subsequently, unmarked double-crossover mutants are isolated directly using sucrose-mediated counter-selection. This two-step process yields seamless mutations that are precise to a single base pair of DNA. The entire procedure requires ~2 weeks. PMID:26492139
Aspergillus and Penicillium identification using DNA sequences: Barcode or MLST?
USDA-ARS?s Scientific Manuscript database
Current methods in DNA technology can detect single nucleotide polymorphisms with measurable accuracy using several different approaches appropriate for different uses. If there are even single nucleotide differences that are invariant markers of the species, we can accomplish identification through...
A Single IGF1 Allele Is a Major Determinant of Small Size in Dogs
Sutter, Nathan B.; Bustamante, Carlos D.; Chase, Kevin; Gray, Melissa M.; Zhao, Keyan; Zhu, Lan; Padhukasahasram, Badri; Karlins, Eric; Davis, Sean; Jones, Paul G.; Quignon, Pascale; Johnson, Gary S.; Parker, Heidi G.; Fretwell, Neale; Mosher, Dana S.; Lawler, Dennis F.; Satyaraj, Ebenezer; Nordborg, Magnus; Lark, K. Gordon; Wayne, Robert K.; Ostrander, Elaine A.
2009-01-01
The domestic dog exhibits greater diversity in body size than any other terrestrial vertebrate. We used a strategy that exploits the breed structure of dogs to investigate the genetic basis of size. First, through a genome-wide scan, we identified a major quantitative trait locus (QTL) on chromosome 15 influencing size variation within a single breed. Second, we examined genetic variation in the 15-megabase interval surrounding the QTL in small and giant breeds and found marked evidence for a selective sweep spanning a single gene (IGF1), encoding insulin-like growth factor 1. A single IGF1 single-nucleotide polymorphism haplotype is common to all small breeds and nearly absent from giant breeds, suggesting that the same causal sequence variant is a major contributor to body size in all small dogs. PMID:17412960
A single IGF1 allele is a major determinant of small size in dogs.
Sutter, Nathan B; Bustamante, Carlos D; Chase, Kevin; Gray, Melissa M; Zhao, Keyan; Zhu, Lan; Padhukasahasram, Badri; Karlins, Eric; Davis, Sean; Jones, Paul G; Quignon, Pascale; Johnson, Gary S; Parker, Heidi G; Fretwell, Neale; Mosher, Dana S; Lawler, Dennis F; Satyaraj, Ebenezer; Nordborg, Magnus; Lark, K Gordon; Wayne, Robert K; Ostrander, Elaine A
2007-04-06
The domestic dog exhibits greater diversity in body size than any other terrestrial vertebrate. We used a strategy that exploits the breed structure of dogs to investigate the genetic basis of size. First, through a genome-wide scan, we identified a major quantitative trait locus (QTL) on chromosome 15 influencing size variation within a single breed. Second, we examined genetic variation in the 15-megabase interval surrounding the QTL in small and giant breeds and found marked evidence for a selective sweep spanning a single gene (IGF1), encoding insulin-like growth factor 1. A single IGF1 single-nucleotide polymorphism haplotype is common to all small breeds and nearly absent from giant breeds, suggesting that the same causal sequence variant is a major contributor to body size in all small dogs.
2013-01-01
Demand for nonnutritive sweeteners continues to increase due to their ability to provide desirable sweetness with minimal calories. Acesulfame potassium and saccharin are well-studied nonnutritive sweeteners commonly found in food products. Some individuals report aversive sensations from these sweeteners, such as bitter and metallic side tastes. Recent advances in molecular genetics have provided insight into the cause of perceptual differences across people. For example, common alleles for the genes TAS2R9 and TAS2R38 explain variable response to the bitter drugs ofloxacin in vitro and propylthiouracil in vivo. Here, we wanted to determine whether differences in the bitterness of acesulfame potassium could be predicted by common polymorphisms (genetic variants) in bitter taste receptor genes (TAS2Rs). We genotyped participants (n = 108) for putatively functional single nucleotide polymorphisms in 5 TAS2Rs and asked them to rate the bitterness of 25 mM acesulfame potassium on a general labeled magnitude scale. Consistent with prior reports, we found 2 single nucleotide polymorphisms in TAS2R31 were associated with acesulfame potassium bitterness. However, TAS2R9 alleles also predicted additional variation in acesulfame potassium bitterness. Conversely, single nucleotide polymorphisms in TAS2R4, TAS2R38, and near TAS2R16 were not significant predictors. Using 1 single nucleotide polymorphism each from TAS2R9 and TAS2R31, we modeled the simultaneous influence of these single nucleotide polymorphisms on acesulfame potassium bitterness; together, these 2 single nucleotide polymorphisms explained 13.4% of the variance in perceived bitterness. These data suggest multiple polymorphisms within TAS2Rs contribute to the ability to perceive the bitterness from acesulfame potassium. PMID:23599216
Nucleotide cleaving agents and method
Que, Jr., Lawrence; Hanson, Richard S.; Schnaith, Leah M. T.
2000-01-01
The present invention provides a unique series of nucleotide cleaving agents and a method for cleaving a nucleotide sequence, whether single-stranded or double-stranded DNA or RNA, using and a cationic metal complex having at least one polydentate ligand to cleave the nucleotide sequence phosphate backbone to yield a hydroxyl end and a phosphate end.
Conformational transitions in DNA polymerase I revealed by single-molecule FRET
Santoso, Yusdi; Joyce, Catherine M.; Potapova, Olga; Le Reste, Ludovic; Hohlbein, Johannes; Torella, Joseph P.; Grindley, Nigel D. F.; Kapanidis, Achillefs N.
2010-01-01
The remarkable fidelity of most DNA polymerases depends on a series of early steps in the reaction pathway which allow the selection of the correct nucleotide substrate, while excluding all incorrect ones, before the enzyme is committed to the chemical step of nucleotide incorporation. The conformational transitions that are involved in these early steps are detectable with a variety of fluorescence assays and include the fingers-closing transition that has been characterized in structural studies. Using DNA polymerase I (Klenow fragment) labeled with both donor and acceptor fluorophores, we have employed single-molecule fluorescence resonance energy transfer to study the polymerase conformational transitions that precede nucleotide addition. Our experiments clearly distinguish the open and closed conformations that predominate in Pol-DNA and Pol-DNA-dNTP complexes, respectively. By contrast, the unliganded polymerase shows a broad distribution of FRET values, indicating a high degree of conformational flexibility in the protein in the absence of its substrates; such flexibility was not anticipated on the basis of the available crystallographic structures. Real-time observation of conformational dynamics showed that most of the unliganded polymerase molecules sample the open and closed conformations in the millisecond timescale. Ternary complexes formed in the presence of mismatched dNTPs or complementary ribonucleotides show unique FRET species, which we suggest are relevant to kinetic checkpoints that discriminate against these incorrect substrates. PMID:20080740
2014-01-01
The Bactrian camel (Camelus bactrianus) and the dromedary (Camelus dromedarius) are among the last species that have been domesticated around 3000–6000 years ago. During domestication, strong artificial (anthropogenic) selection has shaped the livestock, creating a huge amount of phenotypes and breeds. Hence, domestic animals represent a unique resource to understand the genetic basis of phenotypic variation and adaptation. Similar to its late domestication history, the Bactrian camel is also among the last livestock animals to have its genome sequenced and deciphered. As no genomic data have been available until recently, we generated a de novo assembly by shotgun sequencing of a single male Bactrian camel. We obtained 1.6 Gb genomic sequences, which correspond to more than half of the Bactrian camel’s genome. The aim of this study was to identify heterozygous single-nucleotide polymorphisms (SNPs) and to estimate population parameters and nucleotide diversity based on an individual camel. With an average 6.6-fold coverage, we detected over 116 000 heterozygous SNPs and recorded a genome-wide nucleotide diversity similar to that of other domesticated ungulates. More than 20 000 (85%) dromedary expressed sequence tags successfully aligned to our genomic draft. Our results provide a template for future association studies targeting economically relevant traits and to identify changes underlying the process of camel domestication and environmental adaptation. PMID:23454912
Van, K; Onoda, S; Kim, M Y; Kim, K D; Lee, S-H
2008-03-01
The Waxy (Wx) gene product controls the formation of a straight chain polymer of amylose in the starch pathway. Dominance/recessiveness of the Wx allele is associated with amylose content, leading to non-waxy/waxy phenotypes. For a total of 113 foxtail millet accessions, agronomic traits and the molecular differences of the Wx gene were surveyed to evaluate genetic diversities. Molecular types were associated with phenotypes determined by four specific primer sets (non-waxy, Type I; low amylose, Type VI; waxy, Type IV or V). Additionally, the insertion of transposable element in waxy was confirmed by ex1/TSI2R, TSI2F/ex2, ex2int2/TSI7R and TSI7F/ex4r. Seventeen single nucleotide polymorphims (SNPs) were observed from non-coding regions, while three SNPs from coding regions were non-synonymous. Interestingly, the phenotype of No. 88 was still non-waxy, although seven nucleotides (AATTGGT) insertion at 2,993 bp led to 78 amino acids shorter. The rapid decline of r (2) in the sequenced region (exon 1-intron 1-exon 2) suggested a low level of linkage disequilibrium and limited haplotype structure. K (s) values and estimation of evolutionary events indicate early divergence of S. italica among cereal crops. This study suggested the Wx gene was one of the targets in the selection process during domestication.
Lado, Bettina; Matus, Ivan; Rodríguez, Alejandra; Inostroza, Luis; Poland, Jesse; Belzile, François; del Pozo, Alejandro; Quincke, Martín; Castro, Marina; von Zitzewitz, Jarislav
2013-01-01
In crop breeding, the interest of predicting the performance of candidate cultivars in the field has increased due to recent advances in molecular breeding technologies. However, the complexity of the wheat genome presents some challenges for applying new technologies in molecular marker identification with next-generation sequencing. We applied genotyping-by-sequencing, a recently developed method to identify single-nucleotide polymorphisms, in the genomes of 384 wheat (Triticum aestivum) genotypes that were field tested under three different water regimes in Mediterranean climatic conditions: rain-fed only, mild water stress, and fully irrigated. We identified 102,324 single-nucleotide polymorphisms in these genotypes, and the phenotypic data were used to train and test genomic selection models intended to predict yield, thousand-kernel weight, number of kernels per spike, and heading date. Phenotypic data showed marked spatial variation. Therefore, different models were tested to correct the trends observed in the field. A mixed-model using moving-means as a covariate was found to best fit the data. When we applied the genomic selection models, the accuracy of predicted traits increased with spatial adjustment. Multiple genomic selection models were tested, and a Gaussian kernel model was determined to give the highest accuracy. The best predictions between environments were obtained when data from different years were used to train the model. Our results confirm that genotyping-by-sequencing is an effective tool to obtain genome-wide information for crops with complex genomes, that these data are efficient for predicting traits, and that correction of spatial variation is a crucial ingredient to increase prediction accuracy in genomic selection models. PMID:24082033
Imincan, Gülnur; Pei, Fen; Yu, Lijia; Jin, Hongwei; Zhang, Liangren; Yang, Xiaoda; Zhang, Lihe; Tang, XinJing
2016-04-19
2'-O-(1-Pyrenylmethyl)uridine modified oligoribonucleotides provide highly sensitive pyrene fluorescent probes for detecting specific nucleotide mutation of RNA targets. To develop more stable and cost-effective oligonucleotide probes, we investigated the local microenvironmental effects of nearby nucleobases on pyrene fluorescence in duplexes of RNAs and 2'-O-(1-pyrenylmethyl)uridine modified oligonucleotides. By incorporation of deoxyribonucleotides, ribonucleotides, 2'-MeO-nucleotides and 2'-F-nucleotides at both sides of 2'-O-(1-pyrenylmethyl)uridine (U(p)) in oligodeoxynucleotide probes, we synthesized a series of pyrene modified oligonucleotide probes. Their pyrene fluorescence emission spectra indicated that only two proximal nucleotides have a substantial effect on the pyrene fluorescence properties of these oligonucleotide probes hybridized with target RNA with an order of fluorescence sensitivity of 2'-F-nucleotides > 2'-MeO-nucleotides > ribonucleotides ≫ deoxyribonucleotides. While based on circular dichroism spectra, overall helix conformations (either A- or B-form) of the duplexes have marginal effects on the sensitivity of the probes. Instead, the local substitution reflected the propensity of the nucleotide sugar ring to adopt North type conformation and, accordingly, shifted their helix geometry toward a more A-type like conformation in local microenvironments. Thus, higher enhancement of pyrene fluorescence emission favored local A-type helix structures and more polar and hydrophobic environments (F > MeO > OH at 2' substitution) of duplex minor grooves of probes with the target RNA. Further dynamic simulation revealed that local microenvironmental effect of 2'-F-nucleotides or ribonucleotides was enough for pyrene moiety to move out of nucleobases to the minor groove of duplexes; in addition, 2'-F-nucleotide had less effect on π-stack of pyrene-modified uridine with upstream and downstream nucleobases. The present oligonucleotide probes successfully distinguished target RNA from single-mutated RNA analyte during an in vitro assay of RNA synthesis.
Mustafa, Saima; Fatima, Hira; Fatima, Sadia; Khosa, Tafheem; Akbar, Atif; Shaikh, Rehan Sadiq; Iqbal, Furhan
2018-01-01
To find out a correlation between the single nucleotide polymorphisms in cluster of differentiation 28 and cluster of differentiation 40 genes with Graves' disease, if any. This case-control study was conducted at the Multan Institute of Nuclear Medicine and Radiotherapy, Multan, Pakistan, and comprised blood samples of Graves' disease patients and controls. Various risk factors were also correlated either with the genotype at each single-nucleotide polymorphism or with various combinations of genotypes studied during present investigation. Of the 160 samples, there were 80(50%) each from patients and controls. Risk factor analysis revealed that gender (p=0.008), marital status (p<0.001), education (p<0.001), smoking (p<0.001), tri-iodothyronine (P <0.001), thyroxin (p<0.001) and thyroid-stimulating hormone (p<0.000) levels in blood were associated with Graves' disease. Both single-nucleotide polymorphisms in both genes were not associated with Graves' disease, either individually or in any combined form.
Hwang, Hanshin; Taylor, John-Stephen
2005-03-29
We have recently reported that pyrene nucleotide is preferentially inserted opposite an abasic site, the 3'-T of a thymine dimer, and most undamaged bases by yeast DNA polymerase eta (pol eta). Because pyrene is a nonpolar molecule with no H-bonding ability, the unusually high efficiencies of dPMP insertion are ascribed to its superior base stacking ability, and underscore the importance of base stacking in the selection of nucleotides by pol eta. To investigate the role of H-bonding and base pair geometry in the selection of nucleotides by pol eta, we determined the insertion efficiencies of the base-modified nucleotides 2,6-diaminopurine, 2-aminopurine, 6-chloropurine, and inosine which would make a different number of H-bonds with the template base depending on base pair geometry. Watson-Crick base pairing appears to play an important role in the selection of nucleotide analogues for insertion opposite C and T as evidenced by the decrease in the relative insertion efficiencies with a decrease in the number of Watson-Crick H-bonds and an increase in the number of donor-donor and acceptor-acceptor interactions. The selectivity of nucleotide insertion is greater opposite the 5'-T than the 3'-T of the thymine dimer, in accord with previous work suggesting that the 5'-T is held more rigidly than the 3'-T. Furthermore, insertion of A opposite both Ts of the dimer appears to be mediated by Watson-Crick base pairing and not by Hoogsteen base pairing based on the almost identical insertion efficiencies of A and 7-deaza-A, the latter of which lacks H-bonding capability at N7. The relative efficiencies for insertion of nucleotides that can form Watson-Crick base pairs parallel those for the Klenow fragment, whereas the Klenow fragment more strongly discriminates against mismatches, in accord with its greater shape selectivity. These results underscore the importance of H-bonding and Watson-Crick base pair geometry in the selection of nucleotides by both pol eta and the Klenow fragment, and the lesser role of shape selection in insertion by pol eta due to its more open and less constrained active site.
Stockley, Jacqueline; Nisar, Shaista P; Leo, Vincenzo C; Sabi, Essa; Cunningham, Margaret R; Eikenboom, Jeroen C; Lethagen, Stefan; Schneppenheim, Reinhard; Goodeve, Anne C; Watson, Steve P; Mundell, Stuart J; Daly, Martina E
2015-01-01
The clinical expression of type 1 von Willebrand disease may be modified by co-inheritance of other mild bleeding diatheses. We previously showed that mutations in the platelet P2Y12 ADP receptor gene (P2RY12) could contribute to the bleeding phenotype in patients with type 1 von Willebrand disease. Here we investigated whether variations in platelet G protein-coupled receptor genes other than P2RY12 also contributed to the bleeding phenotype. Platelet G protein-coupled receptor genes P2RY1, F2R, F2RL3, TBXA2R and PTGIR were sequenced in 146 index cases with type 1 von Willebrand disease and the potential effects of identified single nucleotide variations were assessed using in silico methods and heterologous expression analysis. Seven heterozygous single nucleotide variations were identified in 8 index cases. Two single nucleotide variations were detected in F2R; a novel c.-67G>C transversion which reduced F2R transcriptional activity and a rare c.1063C>T transition predicting a p.L355F substitution which did not interfere with PAR1 expression or signalling. Two synonymous single nucleotide variations were identified in F2RL3 (c.402C>G, p.A134 =; c.1029 G>C p.V343 =), both of which introduced less commonly used codons and were predicted to be deleterious, though neither of them affected PAR4 receptor expression. A third single nucleotide variation in F2RL3 (c.65 C>A; p.T22N) was co-inherited with a synonymous single nucleotide variation in TBXA2R (c.6680 C>T, p.S218 =). Expression and signalling of the p.T22N PAR4 variant was similar to wild-type, while the TBXA2R variation introduced a cryptic splice site that was predicted to cause premature termination of protein translation. The enrichment of single nucleotide variations in G protein-coupled receptor genes among type 1 von Willebrand disease patients supports the view of type 1 von Willebrand disease as a polygenic disorder.
Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue
2016-01-01
DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962
NASA Astrophysics Data System (ADS)
Liu, Chengzhang; Wang, Xia; Xiang, Jianhai; Li, Fuhua
2012-09-01
Pacific white shrimp has become a major aquaculture and fishery species worldwide. Although a large scale EST resource has been publicly available since 2008, the data have not yet been widely used for SNP discovery or transcriptome-wide assessment of selective pressure. In this study, a set of 155 411 expressed sequence tags (ESTs) from the NCBI database were computationally analyzed and 17 225 single nucleotide polymorphisms (SNPs) were predicted, including 9 546 transitions, 5 124 transversions and 2 481 indels. Among the 7 298 SNP substitutions located in functionally annotated contigs, 58.4% (4 262) are non-synonymous SNPs capable of introducing amino acid mutations. Two hundred and fifty nonsynonymous SNPs in genes associated with economic traits have been identified as candidates for markers in selective breeding. Diversity estimates among the synonymous nucleotides were on average 3.49 times greater than those in non-synonymous, suggesting negative selection. Distribution of non-synonymous to synonymous substitutions (Ka/Ks) ratio ranges from 0 to 4.01, (average 0.42, median 0.26), suggesting that the majority of the affected genes are under purifying selection. Enrichment analysis identified multiple gene ontology categories under positive or negative selection. Categories involved in innate immune response and male gamete generation are rich in positively selected genes, which is similar to reports in Drosophila and primates. This work is the first transcriptome-wide assessment of selective pressure in a Penaeid shrimp species. The functionally annotated SNPs provide a valuable resource of potential molecular markers for selective breeding.
Detecting Single-Nucleotides by Tunneling Current Measurements at Sub-MHz Temporal Resolution.
Morikawa, Takanori; Yokota, Kazumichi; Tanimoto, Sachie; Tsutsui, Makusu; Taniguchi, Masateru
2017-04-18
Label-free detection of single-nucleotides was performed by fast tunneling current measurements in a polar solvent at 1 MHz sampling rate using SiO₂-protected Au nanoprobes. Short current spikes were observed, suggestive of trapping/detrapping of individual nucleotides between the nanoelectrodes. The fall and rise features of the electrical signatures indicated signal retardation by capacitance effects with a time constant of about 10 microseconds. The high temporal resolution revealed current fluctuations, reflecting the molecular conformation degrees of freedom in the electrode gap. The method presented in this work may enable direct characterizations of dynamic changes in single-molecule conformations in an electrode gap in liquid.
Li, Su-Xia
2004-12-01
Single nucleotide polymorphism (SNP) is the third genetic marker after restriction fragment length polymorphism (RFLP) and short tandem repeat. It represents the most density genetic variability in the human genome and has been widely used in gene location, cloning, and research of heredity variation, as well as parenthood identification in forensic medicine. As steady heredity polymorphism, single nucleotide polymorphism is becoming the focus of attention in monitoring chimerism and minimal residual disease in the patients after allogeneic hematopoietic stem cell transplantation. The article reviews SNP heredity characterization, analysis techniques and its applications in allogeneic stem cell transplantation and other fields.
Jiang, Rui ; Yang, Hua ; Zhou, Linqi ; Kuo, C.-C. Jay ; Sun, Fengzhu ; Chen, Ting
2007-01-01
The increasing demand for the identification of genetic variation responsible for common diseases has translated into a need for sophisticated methods for effectively prioritizing mutations occurring in disease-associated genetic regions. In this article, we prioritize candidate nonsynonymous single-nucleotide polymorphisms (nsSNPs) through a bioinformatics approach that takes advantages of a set of improved numeric features derived from protein-sequence information and a new statistical learning model called “multiple selection rule voting” (MSRV). The sequence-based features can maximize the scope of applications of our approach, and the MSRV model can capture subtle characteristics of individual mutations. Systematic validation of the approach demonstrates that this approach is capable of prioritizing causal mutations for both simple monogenic diseases and complex polygenic diseases. Further studies of familial Alzheimer diseases and diabetes show that the approach can enrich mutations underlying these polygenic diseases among the top of candidate mutations. Application of this approach to unclassified mutations suggests that there are 10 suspicious mutations likely to cause diseases, and there is strong support for this in the literature. PMID:17668383
Zhu, Xiao; Kong, Qingming; Xie, Liwei; Chen, Zhihong; Li, Hongmei; Zhu, Zhu; Huang, Yongmei; Lan, Feifei; Luo, Haiqing; Zhan, Jingting; Ding, Hongrong; Lei, Jinli; Xiao, Qin; Fu, Weiming; Fan, Wenguo; Zhang, Jinfang; Luo, Hui
2018-01-01
Previous studies showed that the low expressions of chromodomain-helicase-DNA-binding protein 5 (CHD5) were intensively associated with deteriorative biologic and clinical characteristics as well as outcomes in many tumors. The aim of this study is to determine whether CHD5 single nucleotide polymorphisms (SNPs) contribute to the prognosis of hepatocellular carcima (HCC). The SNPs were selected according to their linkage disequilibrium (LD) in the targeted next-generation sequencing (NGS) and then genotyped with TaqMan probers. We revealed a rare haplotype AG in CHD5 (SNPs: rs12564469-rs9434711) was markedly associated with HCC prognosis. The univariate and multivariate regression analyses revealed the patients with worse overall survival time were those with tumor metastasis and haplotype AG, as well as cirrhosis, poor differentiation and IV-TNM stage. Based on the available public databases, we discovered the significant association between haplotype AG and CHD5 mRNA expressions only existed in Chinese. These data proposed that the potentially genetic haplotype might functionally contribute to HCC prognosis and CHD5 mRNA expressions. PMID:29568352
Han, Lin; Xin, Ruosai; Sun, Jian; Hou, Feng; Li, Changgui; Hu, Xinlin; Liu, Zhen; Wang, Yao; Li, Xinde; Ren, Wei; Wang, Xuefeng; Jia, Zhaotong
2015-10-01
OBJECTIVE To assess the association of single nucleotide polymorphisms (SNPs) of susceptibility genes of type 2 diabetes mellitus (T2DM) with liability to gout among ethnic Han Chinese males from coastal region of Shandong province. METHODS Seven SNPs within the susceptibility genes of T2DM, including rs10773971(G/C) and rs4766398(G/C) of WNT5B gene, rs10225163(G/C) of JAZF1 gene, rs2069590(T/A) of BDKRB2 gene, rs5745709(G/A) of HGF gene, rs1991914(C/A) of OTOP1 gene and rs2236479(G/A) of COL18A1 gene, were typed with a custom-made Illumina GoldenGate Genotyping assay in 480 male patients with gout and 480 male controls. Potential association was assessed with the chi-square test. RESULTS No significant difference was detected for the 7 selected SNPs in terms of genotypic and allelic frequencies (P > 0.05). When age and body mass index (BMI) were adjusted, the 7 genetic variants still showed no significant association with gout. CONCLUSION The genotypes of the 7 selected SNPs are not associated with gout in ethnic Han Chinese male patients from the coastal region of Shandong province. However, the results need to be replicated in larger sets of patients collected from other regions and populations.
Bhattarai, Dinesh; Chen, Xing; Ur Rehman, Zia; Hao, Xingjie; Ullah, Farman; Dad, Rahim; Talpur, Hira Sajjad; Kadariya, Ishwari; Cui, Lu; Fan, Mingxia; Zhang, Shujun
2017-02-01
The objective of the studies presented in this Research Communication was to investigate the association of single nucleotide polymorphisms present in the MAP4K4 gene with different milk traits in dairy cows. Based on previous QTL fine mapping results on bovine chromosome 11, the MAP4K4 gene was selected as a candidate gene to evaluate its effect on somatic cell count and milk traits in ChineseHolstein cows. Milk production traits including milk yield, fat percentage, and protein percentage of each cow were collected using 305 d lactation records. Association between MAP4K4 genotype and different traits and Somatic Cell Score (SCS) was performed using General Linear Regression Model of R. Two SNPs at exon 18 (c.2061T > G and c.2196T > C) with genotype TT in both SNPs were found significantly higher for somatic SCS. We found the significant effect of exon 18 (c.2061T > G) on protein percentage, milk yield and SCS. We identified SNPs at different location of MAP4K4 gene of the cattle and several of them were significantly associated with the somatic cell score and other different milk traits. Thus, MAP4K4 gene could be a useful candidate gene for selection of dairy cattle against mastitis and the identified polymorphisms might potentially be strong genetic markers.
Veale, Andrew J; Russello, Michael A
2017-10-01
Mechanisms underlying adaptive evolution can best be explored using paired populations displaying similar phenotypic divergence, illuminating the genomic changes associated with specific life history traits. Here, we used paired migratory [anadromous vs. resident (kokanee)] and reproductive [shore- vs. stream-spawning] ecotypes of sockeye salmon (Oncorhynchus nerka) sampled from seven lakes and two rivers spanning three catchments (Columbia, Fraser, and Skeena) in British Columbia, Canada to investigate the patterns and processes underlying their divergence. Restriction-site associated DNA sequencing was used to genotype this sampling at 7,347 single nucleotide polymorphisms, 334 of which were identified as outlier loci and candidates for divergent selection within at least one ecotype comparison. Sixty-eight of these outliers were present in two or more comparisons, with 33 detected across multiple catchments. Of particular note, one locus was detected as the most significant outlier between shore and stream-spawning ecotypes in multiple comparisons and across catchments (Columbia, Fraser, and Snake). We also detected several genomic islands of divergence, some shared among comparisons, potentially showing linked signals of differential selection. The single nucleotide polymorphisms and genomic regions identified in our study offer a range of mechanistic hypotheses associated with the genetic basis of O. nerka life history variation and provide novel tools for informing fisheries management. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Matsumoto, Toshimi; Okumura, Naohiko; Uenishi, Hirohide; Hayashi, Takeshi; Hamasima, Noriyuki; Awata, Takashi
2012-01-01
We have collected more than 190000 porcine expressed sequence tags (ESTs) from full-length complementary DNA (cDNA) libraries and identified more than 2800 single nucleotide polymorphisms (SNPs). In this study, we tentatively chose 222 SNPs observed in assembled ESTs to study pigs of different breeds; 104 were selected by comparing the cDNA sequences of a Meishan pig and samples of three-way cross pigs (Landrace, Large White, and Duroc: LWD), and 118 were selected from LWD samples. To evaluate the genetic variation between the chosen SNPs from pig breeds, we determined the genotypes for 192 pig samples (11 pig groups) from our DNA reference panel with matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Of the 222 reference SNPs, 186 were successfully genotyped. A neighbor-joining tree showed that the pig groups were classified into two large clusters, namely, Euro-American and East Asian pig populations. F-statistics and the analysis of molecular variance of Euro-American pig groups revealed that approximately 25% of the genetic variations occurred because of intergroup differences. As the F(IS) values were less than the F(ST) values(,) the clustering, based on the Bayesian inference, implied that there was strong genetic differentiation among pig groups and less divergence within the groups in our samples. © 2011 The Authors. Animal Science Journal © 2011 Japanese Society of Animal Science.
Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay
2013-09-23
Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.
Yáñez, J M; Naswa, S; López, M E; Bassini, L; Correa, K; Gilbey, J; Bernatchez, L; Norris, A; Neira, R; Lhorente, J P; Schnable, P S; Newman, S; Mileham, A; Deeb, N; Di Genova, A; Maass, A
2016-07-01
A considerable number of single nucleotide polymorphisms (SNPs) are required to elucidate genotype-phenotype associations and determine the molecular basis of important traits. In this work, we carried out de novo SNP discovery accounting for both genome duplication and genetic variation from American and European salmon populations. A total of 9 736 473 nonredundant SNPs were identified across a set of 20 fish by whole-genome sequencing. After applying six bioinformatic filtering steps, 200 K SNPs were selected to develop an Affymetrix Axiom(®) myDesign Custom Array. This array was used to genotype 480 fish representing wild and farmed salmon from Europe, North America and Chile. A total of 159 099 (79.6%) SNPs were validated as high quality based on clustering properties. A total of 151 509 validated SNPs showed a unique position in the genome. When comparing these SNPs against 238 572 markers currently available in two other Atlantic salmon arrays, only 4.6% of the SNP overlapped with the panel developed in this study. This novel high-density SNP panel will be very useful for the dissection of economically and ecologically relevant traits, enhancing breeding programmes through genomic selection as well as supporting genetic studies in both wild and farmed populations of Atlantic salmon using high-resolution genomewide information. © 2016 John Wiley & Sons Ltd.
USDA-ARS?s Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...
USDA-ARS?s Scientific Manuscript database
High-density single nucleotide polymorphism (SNP) genotyping chips are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships among individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array includ...
Shirakawa, I; Chaen, S; Bagshaw, C R; Sugi, H
2000-01-01
The kinetics of displacement of a fluorescent nucleotide, 2'(3')-O-[N[2-[[Cy3]amido]ethyl]carbamoyl]-adenosine 5'-triphosphate (Cy3-EDA-ATP), bound to rabbit soleus muscle myofibrils were studied using flash photolysis of caged ATP. Use of myofibrils from this slow twitch muscle allowed better resolution of the kinetics of nucleotide exchange than previous studies with psoas muscle myofibrils (, Biophys. J. 73:2033-2042). Soleus myofibrils in the presence of Cy3-EDA-nucleotides (Cy3-EDA-ATP or Cy3-EDA-ADP) showed selective fluorescence staining of the A-band. The K(m) for Cy3-EDA-ATP and the K(d) for Cy3-EDA-ADP binding to the myofibril A-band were 1.9 microM and 3.8 microM, respectively, indicating stronger binding of nucleotide to soleus cross-bridges compared to psoas cross-bridges (2.6 microM and 50 microM, respectively). After flash photolysis of caged ATP, the A-band fluorescence of the myofibril in the Cy3-EDA-ATP solution under isometric conditions decayed exponentially with a rate constant of 0.045 +/- 0.007 s(-1) (n = 32) at 10 degrees C, which was about seven times slower than that for psoas myofibrils. When a myofibril was allowed to shorten with a constant velocity, the nucleotide displacement rate constant increased from 0.066 s(-1) (isometric) to 0.14 s(-1) at 20 degrees C with increasing shortening velocity up to 0.1 myofibril length/s (V(max), the shortening velocity under no load was approximately 0. 2 myofibril lengths/s). The rate constant was not significantly affected by an isovelocity stretch of up to 0.1 myofibril lengths/s. These results suggest that the cross-bridge kinetics are not significantly affected at higher strain during lengthening but depend on the lower strain during shortening. These data also indicate that the interaction distance between a cross-bridge and the actin filament is at least 16 nm for a single cycle of the ATPase. PMID:10653804
Genome-wide scans for loci under selection in humans
2005-01-01
Natural selection, which can be defined as the differential contribution of genetic variants to future generations, is the driving force of Darwinian evolution. Identifying regions of the human genome that have been targets of natural selection is an important step in clarifying human evolutionary history and understanding how genetic variation results in phenotypic diversity, it may also facilitate the search for complex disease genes. Technological advances in high-throughput DNA sequencing and single nucleotide polymorphism genotyping have enabled several genome-wide scans of natural selection to be undertaken. Here, some of the observations that are beginning to emerge from these studies will be reviewed, including evidence for geographically restricted selective pressures (ie local adaptation) and a relationship between genes subject to natural selection and human disease. In addition, the paper will highlight several important problems that need to be addressed in future genome-wide studies of natural selection. PMID:16004726
Structural insights into translational recoding by frameshift suppressor tRNASufJ
Fagan, Crystal E.; Maehigashi, Tatsuya; Dunkle, Jack A.; Miles, Stacey J.
2014-01-01
The three-nucleotide mRNA reading frame is tightly regulated during translation to ensure accurate protein expression. Translation errors that lead to aberrant protein production can result from the uncoupled movement of the tRNA in either the 5′ or 3′ direction on mRNA. Here, we report the biochemical and structural characterization of +1 frameshift suppressor tRNASufJ, a tRNA known to decode four, instead of three, nucleotides. Frameshift suppressor tRNASufJ contains an insertion 5′ to its anticodon, expanding the anticodon loop from seven to eight nucleotides. Our results indicate that the expansion of the anticodon loop of either ASLSufJ or tRNASufJ does not affect its affinity for the A site of the ribosome. Structural analyses of both ASLSufJ and ASLThr bound to the Thermus thermophilus 70S ribosome demonstrate both ASLs decode in the zero frame. Although the anticodon loop residues 34–37 are superimposable with canonical seven-nucleotide ASLs, the single C31.5 insertion between nucleotides 31 and 32 in ASLSufJ imposes a conformational change of the anticodon stem, that repositions and tilts the ASL toward the back of the A site. Further modeling analyses reveal that this tilting would cause a distortion in full-length A-site tRNASufJ during tRNA selection and possibly impede gripping of the anticodon stem by 16S rRNA nucleotides in the P site. Together, these data implicate tRNA distortion as a major driver of noncanonical translation events such as frameshifting. PMID:25352689
Taira, Chiaki; Matsuda, Kazuyuki; Yamaguchi, Akemi; Sueki, Akane; Koeda, Hiroshi; Takagi, Fumio; Kobayashi, Yukihiro; Sugano, Mitsutoshi; Honda, Takayuki
2013-09-23
Single nucleotide alterations such as single nucleotide polymorphisms (SNP) and single nucleotide mutations are associated with responses to drugs and predisposition to several diseases, and they contribute to the pathogenesis of malignancies. We developed a rapid genotyping assay based on the allele-specific polymerase chain reaction (AS-PCR) with our droplet-PCR machine (droplet-AS-PCR). Using 8 SNP loci, we evaluated the specificity and sensitivity of droplet-AS-PCR. Buccal cells were pretreated with proteinase K and subjected directly to the droplet-AS-PCR without DNA extraction. The genotypes determined using the droplet-AS-PCR were then compared with those obtained by direct sequencing. Specific PCR amplifications for the 8 SNP loci were detected, and the detection limit of the droplet-AS-PCR was found to be 0.1-5.0% by dilution experiments. Droplet-AS-PCR provided specific amplification when using buccal cells, and all the genotypes determined within 9 min were consistent with those obtained by direct sequencing. Our novel droplet-AS-PCR assay enabled high-speed amplification retaining specificity and sensitivity and provided ultra-rapid genotyping. Crude samples such as buccal cells were available for the droplet-AS-PCR assay, resulting in the reduction of the total analysis time. Droplet-AS-PCR may therefore be useful for genotyping or the detection of single nucleotide alterations. Copyright © 2013 Elsevier B.V. All rights reserved.
Kerner, Berit; North, Kari E; Fallin, M Daniele
2010-01-01
Participants analyzed actual and simulated longitudinal data from the Framingham Heart Study for various metabolic and cardiovascular traits. The genetic information incorporated into these investigations ranged from selected single-nucleotide polymorphisms to genome-wide association arrays. Genotypes were incorporated using a broad range of methodological approaches including conditional logistic regression, linear mixed models, generalized estimating equations, linear growth curve estimation, growth modeling, growth mixture modeling, population attributable risk fraction based on survival functions under the proportional hazards models, and multivariate adaptive splines for the analysis of longitudinal data. The specific scientific questions addressed by these different approaches also varied, ranging from a more precise definition of the phenotype, bias reduction in control selection, estimation of effect sizes and genotype associated risk, to direct incorporation of genetic data into longitudinal modeling approaches and the exploration of population heterogeneity with regard to longitudinal trajectories. The group reached several overall conclusions: 1) The additional information provided by longitudinal data may be useful in genetic analyses. 2) The precision of the phenotype definition as well as control selection in nested designs may be improved, especially if traits demonstrate a trend over time or have strong age-of-onset effects. 3) Analyzing genetic data stratified for high-risk subgroups defined by a unique development over time could be useful for the detection of rare mutations in common multi-factorial diseases. 4) Estimation of the population impact of genomic risk variants could be more precise. The challenges and computational complexity demanded by genome-wide single-nucleotide polymorphism data were also discussed. PMID:19924713
Kawakami, Takeshi; Backström, Niclas; Burri, Reto; Husby, Arild; Olason, Pall; Rice, Amber M; Ålund, Murielle; Qvarnström, Anna; Ellegren, Hans
2014-01-01
With the access to draft genome sequence assemblies and whole-genome resequencing data from population samples, molecular ecology studies will be able to take truly genome-wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million single-nucleotide polymorphism (SNP)s in population resequencing of this species and its sister species, pied flycatcher. Here, we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. Using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. Linkage disequilibrium declined quickly to the background level at an average distance of 17 kb, but the extent of LD varied markedly within the genome and was more than 10-fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later-generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from much fewer number of markers. With an estimated divergence time as recently as <1 Ma, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system. PMID:24784959
Structural Basis of Cyclic Nucleotide Selectivity in cGMP-dependent Protein Kinase II
Campbell, James C.; Kim, Jeong Joo; Li, Kevin Y.; ...
2016-01-14
Membrane-bound cGMP-dependent protein kinase (PKG) II is an important regulator of bone growth, renin secretion, and memory formation. Despite its crucial physiological roles, little is known about its cyclic nucleotide selectivity mechanism due to a lack of structural information. Here, we find that the C-terminal cyclic nucleotide binding (CNB-B) domain of PKGII binds cGMP with higher affinity and selectivity when compared with its N-terminal CNB (CNB-A) domain. To understand the structural basis of cGMP selectivity, we solved co-crystal structures of the CNB domains with cyclic nucleotides. Our structures combined with mutagenesis demonstrate that the guanine-specific contacts at Asp-412 and Arg-415more » of the αC-helix of CNB-B are crucial for cGMP selectivity and activation of PKG II. Structural comparison with the cGMP selective CNB domains of human PKG I and Plasmodium falciparum PKG (PfPKG) shows different contacts with the guanine moiety, revealing a unique cGMP selectivity mechanism for PKG II.« less
Brown, Jessica A.; Pack, Lindsey R.; Sherrer, Shanen M.; Kshetry, Ajay K.; Newmister, Sean A.; Fowler, Jason D.; Taylor, John-Stephen; Suo, Zucai
2010-01-01
DNA polymerase λ (Pol λ) is a novel X-family DNA polymerase that shares 34% sequence identity with DNA polymerase β (Pol β). Pre-steady state kinetic studies have shown that the Pol λ•DNA complex binds both correct and incorrect nucleotides 130-fold tighter on average than the Pol β•DNA complex, although, the base substitution fidelity of both polymerases is 10−4 to 10−5. To better understand Pol λ’s tight nucleotide binding affinity, we created single- and double-substitution mutants of Pol λ to disrupt interactions between active site residues and an incoming nucleotide or a template base. Single-turnover kinetic assays showed that Pol λ binds to an incoming nucleotide via cooperative interactions with active site residues (R386, R420, K422, Y505, F506, A510, and R514). Disrupting protein interactions with an incoming correct or incorrect nucleotide impacted binding with each of the common structural moieties in the following order: triphosphate ≫ base > ribose. In addition, the loss of Watson-Crick hydrogen bonding between the nucleotide and template base led to a moderate increase in the Kd. The fidelity of Pol λ was maintained predominantly by a single residue, R517, which has minor groove interactions with the DNA template. PMID:20851705
Vasudevan, Kumar; Vera Cruz, Casiana M.; Gruissem, Wilhelm; Bhullar, Navreet K.
2016-01-01
Rice blast is caused by Magnaporthe oryzae, which is the most destructive fungal pathogen affecting rice growing regions worldwide. The rice blast resistance gene Pib confers broad-spectrum resistance against Southeast Asian M. oryzae races. We investigated the allelic diversity of Pib in rice germplasm originating from 12 major rice growing countries. Twenty-five new Pib alleles were identified that have unique single nucleotide polymorphisms (SNPs), insertions and/or deletions, in addition to the polymorphic nucleotides that are shared between the different alleles. These partially or completely shared polymorphic nucleotides indicate frequent sequence exchange events between the Pib alleles. In some of the new Pib alleles, nucleotide diversity is high in the LRR domain, whereas, in others it is distributed among the NB-ARC and LRR domains. Most of the polymorphic amino acids in LRR and NB-ARC2 domains are predicted as solvent-exposed. Several of the alleles and the unique SNPs are country specific, suggesting a diversifying selection of alleles in various geographical locations in response to the locally prevalent M. oryzae population. Together, the new Pib alleles are an important genetic resource for rice blast resistance breeding programs and provide new information on rice-M. oryzae interactions at the molecular level. PMID:27446145
Genetic characterization of strains of Saccharomyces uvarum from New Zealand wineries.
Zhang, Hanyao; Richards, Keith D; Wilson, Sandra; Lee, Soon A; Sheehan, Hester; Roncoroni, Miguel; Gardner, Richard C
2015-04-01
We present a genetic characterization of 65 isolates of Saccharomyces uvarum isolated from wineries in New Zealand, along with the complete nucleotide sequence of a single sulfite-tolerant isolate. The genome of the New Zealand isolate averaged 99.85% nucleotide identity to CBS7001, the previously sequenced strain of S. uvarum. However, three genomic segments (37-87 kb) showed 10% nucleotide divergence from CBS7001 but 99% identity to Saccharomyces eubayanus. We conclude that these three segments appear to have been introgressed from that species. The nucleotide sequence of the internal transcribed spacer (ITS) region from other New Zealand isolates were also very similar to that of CBS7001, and hybrids showed complete genetic compatibility for some strains, with tetrads giving four viable progeny that showed 2:2 segregations of marker genes. Some strains showed high tolerance to sulfite, with genetic analysis indicating linkage of this trait to the transcription factor FZF1, but not to SSU1, the sulfite efflux pump that it regulates in order to confer sulfite tolerance in Saccharomyces cerevisiae. The fermentation characteristics of selected strains of S. uvarum showed exceptionally good cold fermentation characteristics, superior to the best commercially available strains of S. cerevisiae. Copyright © 2014 Elsevier Ltd. All rights reserved.
Solution to a gene divergence problem under arbitrary stable nucleotide transition probabilities
NASA Technical Reports Server (NTRS)
Holmquist, R.
1976-01-01
A nucleic acid chain, L nucleotides in length, with the specific base sequence B(1)B(2) ... B(L) is defined by the L-dimensional vector B = (B(1), B(2), ..., B(L)). For twelve given constant non-negative transition probabilities that, in a specified position, the base B is replaced by the base B' in a single step, an exact analytical expression is derived for the probability that the position goes from base B to B' in X steps. Assuming that each base mutates independently of the others, an exact expression is derived for the probability that the initial gene sequence B goes to a sequence B' = (B'(1), B'(2), ..., B'(L)) after X = (X(1), X(2), ..., X(L)) base replacements. The resulting equations allow a more precise accounting for the effects of Darwinian natural selection in molecular evolution than does the idealized (biologically less accurate) assumption that each of the four nucleotides is equally likely to mutate to and be fixed as one of the other three. Illustrative applications of the theory to some problems of biological evolution are given.
Discovery, Validation and Characterization of 1039 Cattle Single Nucleotide Polymorphisms
USDA-ARS?s Scientific Manuscript database
We identified approximately 13000 putative single nucleotide polymorphisms (SNPs) by comparison of repeat-masked BAC-end sequences from the cattle RPCI-42 BAC library with whole-genome shotgun contigs of cattle genome assembly Btau 1.0. Genotyping of a subset of these SNPs was performed on a panel ...
USDA-ARS?s Scientific Manuscript database
Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...
USDA-ARS?s Scientific Manuscript database
Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) were genotyped using a high-density array and DNAs from individual plants from important onion populations from major production regions world-wide and the likely progenitor of onion, Allium vavilovii. Genotypes at 1226 SNPs were used to estimate genetic relati...
USDA-ARS?s Scientific Manuscript database
Genome scans in the pig have identified a region on chromosome 2 (SSC2) associated with tenderness. Calpastatin is a likely positional candidate gene in this region because of its inhibitory role in the calpain system that is involved in postmortem tenderization. Novel single nucleotide polymorphism...
Lineage and genogroup-defining single nucleotide polymorphisms of Escherichia coli 0157:H7
USDA-ARS?s Scientific Manuscript database
Escherichia coli O157:H7 is a zoonotic human pathogen for which cattle are an important reservoir host. Using both previously published and new sequencing data, a 48-locus single nucleotide polymorphism (SNP) based typing panel was developed that redundantly identified eleven genogroups that span ...
Getting it Right: How DNA Polymerases Select the Right Nucleotide.
Ludmann, Samra; Marx, Andreas
2016-01-01
All living organisms are defined by their genetic code encrypted in their DNA. DNA polymerases are the enzymes that are responsible for all DNA syntheses occurring in nature. For DNA replication, repair and recombination these enzymes have to read the parental DNA and recognize the complementary nucleotide out of a pool of four structurally similar deoxynucleotide triphosphates (dNTPs) for a given template. The selection of the nucleotide is in accordance with the Watson-Crick rule. In this process the accuracy of DNA synthesis is crucial for the maintenance of the genome stability. However, to spur evolution a certain degree of freedom must be allowed. This brief review highlights the mechanistic basis for selecting the right nucleotide by DNA polymerases.
The waiting time problem in a model hominin population.
Sanford, John; Brewer, Wesley; Smith, Franzine; Baumgardner, John
2015-09-17
Functional information is normally communicated using specific, context-dependent strings of symbolic characters. This is true within the human realm (texts and computer programs), and also within the biological realm (nucleic acids and proteins). In biology, strings of nucleotides encode much of the information within living cells. How do such information-bearing nucleotide strings arise and become established? This paper uses comprehensive numerical simulation to understand what types of nucleotide strings can realistically be established via the mutation/selection process, given a reasonable timeframe. The program Mendel's Accountant realistically simulates the mutation/selection process, and was modified so that a starting string of nucleotides could be specified, and a corresponding target string of nucleotides could be specified. We simulated a classic pre-human hominin population of at least 10,000 individuals, with a generation time of 20 years, and with very strong selection (50% selective elimination). Random point mutations were generated within the starting string. Whenever an instance of the target string arose, all individuals carrying the target string were assigned a specified reproductive advantage. When natural selection had successfully amplified an instance of the target string to the point of fixation, the experiment was halted, and the waiting time statistics were tabulated. Using this methodology we tested the effect of mutation rate, string length, fitness benefit, and population size on waiting time to fixation. Biologically realistic numerical simulations revealed that a population of this type required inordinately long waiting times to establish even the shortest nucleotide strings. To establish a string of two nucleotides required on average 84 million years. To establish a string of five nucleotides required on average 2 billion years. We found that waiting times were reduced by higher mutation rates, stronger fitness benefits, and larger population sizes. However, even using the most generous feasible parameters settings, the waiting time required to establish any specific nucleotide string within this type of population was consistently prohibitive. We show that the waiting time problem is a significant constraint on the macroevolution of the classic hominin population. Routine establishment of specific beneficial strings of two or more nucleotides becomes very problematic.
Silla, Toomas; Kepp, Katrin; Tai, E Shyong; Goh, Liang; Davila, Sonia; Catela Ivkovic, Tina; Calin, George A; Voorhoeve, P Mathijs
2014-01-01
Ultra-conserved genes or elements (UCGs/UCEs) in the human genome are extreme examples of conservation. We characterized natural variations in 2884 UCEs and UCGs in two distinct populations; Singaporean Chinese (n = 280) and Italian (n = 501) by using a pooled sample, targeted capture, sequencing approach. We identify, with high confidence, in these regions the abundance of rare SNVs (MAF<0.5%) of which 75% is not present in dbSNP137. UCEs association studies for complex human traits can use this information to model expected background variation and thus necessary power for association studies. By combining our data with 1000 Genome Project data, we show in three independent datasets that prevalent UCE variants (MAF>5%) are more often found in relatively less-conserved nucleotides within UCEs, compared to rare variants. Moreover, prevalent variants are less likely to overlap transcription factor binding site. Using SNPfold we found no significant influence of RNA secondary structure on UCE conservation. All together, these results suggest UCEs are not under selective pressure as a stretch of DNA but are under differential evolutionary pressure on the single nucleotide level.
2012-01-01
Background Carcass fatness is an important trait in most pig breeding programs. Following market requests, breeding plans for fresh pork consumption are usually designed to reduce carcass fat content and increase lean meat deposition. However, the Italian pig industry is mainly devoted to the production of Protected Designation of Origin dry cured hams: pigs are slaughtered at around 160 kg of live weight and the breeding goal aims at maintaining fat coverage, measured as backfat thickness to avoid excessive desiccation of the hams. This objective has shaped the genetic pool of Italian heavy pig breeds for a few decades. In this study we applied a selective genotyping approach within a population of ~ 12,000 performance tested Italian Large White pigs. Within this population, we selectively genotyped 304 pigs with extreme and divergent backfat thickness estimated breeding value by the Illumina PorcineSNP60 BeadChip and performed a genome wide association study to identify loci associated to this trait. Results We identified 4 single nucleotide polymorphisms with P≤5.0E-07 and additional 119 ones with 5.0E-07
Technologies in the Whole-Genome Age: MALDI-TOF-Based Genotyping.
Vogel, Nicolas; Schiebel, Katrin; Humeny, Andreas
2009-01-01
With the decipherment of the human genome, new questions have moved into the focus of today's research. One key aspect represents the discovery of DNA variations capable to influence gene transcription, RNA splicing, or regulating processes, and their link to pathology. Matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF-MS) is a powerful tool for the qualitative investigation and relative quantification of variations like single nucleotide polymorphisms, DNA methylation, microsatellite instability, or loss of heterozygosity. After its introduction into proteomics, efforts were made to adopt this technique to DNA analysis. Initially intended for peptide/protein analysis, it held several difficulties for application to nucleic acids. Today, MALDI-TOF-MS has reached worldwide acceptance and application in nucleic acid research, with a wide spectrum of methods being available. One of the most versatile approaches relies on primer extension to genotype single alleles, microsatellite repeat lengths or the methylation status of a given cytosine. Optimized methods comprising intelligent primer design and proper nucleotide selection for primer extension enabled multiplexing of reactions, rendering the analysis more economic due to parallel genotyping of several alleles in a single experiment. Laboratories equipped with MALDI-TOF-MS possess a universal technical platform for the analysis of a large variety of different molecules.
A novel MALDI–TOF based methodology for genotyping single nucleotide polymorphisms
Blondal, Thorarinn; Waage, Benedikt G.; Smarason, Sigurdur V.; Jonsson, Frosti; Fjalldal, Sigridur B.; Stefansson, Kari; Gulcher, Jeffery; Smith, Albert V.
2003-01-01
A new MALDI–TOF based detection assay was developed for analysis of single nucleotide polymorphisms (SNPs). It is a significant modification on the classic three-step minisequencing method, which includes a polymerase chain reaction (PCR), removal of excess nucleotides and primers, followed by primer extension in the presence of dideoxynucleotides using modified thermostable DNA polymerase. The key feature of this novel assay is reliance upon deoxynucleotide mixes, lacking one of the nucleotides at the polymorphic position. During primer extension in the presence of depleted nucleotide mixes, standard thermostable DNA polymerases dissociate from the template at positions requiring a depleted nucleotide; this principal was harnessed to create a genotyping assay. The assay design requires a primer- extension primer having its 3′-end one nucleotide upstream from the interrogated site. The assay further utilizes the same DNA polymerase in both PCR and the primer extension step. This not only simplifies the assay but also greatly reduces the cost per genotype compared to minisequencing methodology. We demonstrate accurate genotyping using this methodology for two SNPs run in both singleplex and duplex reactions. We term this assay nucleotide depletion genotyping (NUDGE). Nucleotide depletion genotyping could be extended to other genotyping assays based on primer extension such as detection by gel or capillary electrophoresis. PMID:14654708
Biological nanopore MspA for DNA sequencing
NASA Astrophysics Data System (ADS)
Manrao, Elizabeth A.
Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.
Statistical analysis of nucleotide sequences of the hemagglutinin gene of human influenza A viruses.
Ina, Y; Gojobori, T
1994-01-01
To examine whether positive selection operates on the hemagglutinin 1 (HA1) gene of human influenza A viruses (H1 subtype), 21 nucleotide sequences of the HA1 gene were statistically analyzed. The nucleotide sequences were divided into antigenic and nonantigenic sites. The nucleotide diversities for antigenic and nonantigenic sites of the HA1 gene were computed at synonymous and nonsynonymous sites separately. For nonantigenic sites, the nucleotide diversities were larger at synonymous sites than at nonsynonymous sites. This is consistent with the neutral theory of molecular evolution. For antigenic sites, however, the nucleotide diversities at nonsynonymous sites were larger than those at synonymous sites. These results suggest that positive selection operates on antigenic sites of the HA1 gene of human influenza A viruses (H1 subtype). PMID:8078892
Integrated Cox's model for predicting survival time of glioblastoma multiforme.
Ai, Zhibing; Li, Longti; Fu, Rui; Lu, Jing-Min; He, Jing-Dong; Li, Sen
2017-04-01
Glioblastoma multiforme is the most common primary brain tumor and is highly lethal. This study aims to figure out signatures for predicting the survival time of patients with glioblastoma multiforme. Clinical information, messenger RNA expression, microRNA expression, and single-nucleotide polymorphism array data of patients with glioblastoma multiforme were retrieved from The Cancer Genome Atlas. Patients were separated into two groups by using 1 year as a cutoff, and a logistic regression model was used to figure out any variables that can predict whether the patient was able to live longer than 1 year. Furthermore, Cox's model was used to find out features that were correlated with the survival time. Finally, a Cox model integrated the significant clinical variables, messenger RNA expression, microRNA expression, and single-nucleotide polymorphism was built. Although the classification method failed, signatures of clinical features, messenger RNA expression levels, and microRNA expression levels were figured out by using Cox's model. However, no single-nucleotide polymorphisms related to prognosis were found. The selected clinical features were age at initial diagnosis, Karnofsky score, and race, all of which had been suggested to correlate with survival time. Both of the two significant microRNAs, microRNA-221 and microRNA-222, were targeted to p27 Kip1 protein, which implied the important role of p27 Kip1 on the prognosis of glioblastoma multiforme patients. Our results suggested that survival modeling was more suitable than classification to figure out prognostic biomarkers for patients with glioblastoma multiforme. An integrated model containing clinical features, messenger RNA levels, and microRNA expression levels was built, which has the potential to be used in clinics and thus to improve the survival status of glioblastoma multiforme patients.
Gerreth, Karolina; Zaorska, Katarzyna; Zabel, Maciej; Borysewicz-Lewicka, Maria; Nowicki, Michał
2017-09-01
It is increasingly emphasized that the influence of a host's factors in the etiology of dental caries are of most interest, particularly those concerned with genetic aspect. The aim of the study was to analyze the genotype and allele frequencies of single nucleotide polymorphisms (SNPs) in AMELX, AMBN, TUFT1, TFIP11, MMP20 and KLK4 genes and to prove their association with dental caries occurrence in a population of Polish children. The study was performed in 96 children (48 individuals with caries - "cases" and 48 free of this disease - "controls"), aged 20-42 months, chosen out of 262 individuals who had dental examination performed and attended 4 day nurseries located in Poznań (Poland). From both groups oral swab was collected for molecular evaluation. Eleven selected SNPs markers were genotyped by Sanger sequencing. Genotype and allele frequencies were calculated and a standard χ2 analysis was used to test for deviation from Hardy-Weinberg equilibrium. The association of genetic variations with caries susceptibility or resistance was assessed by the Fisher's exact test and p ≤ 0.05 was considered statistically significant. Five markers were significantly associated with caries incidence in children in the study: rs17878486 in AMELX (p < 0.0001), rs34538475 in AMBN (p < 0.0001), rs2337360 in TUFT1 (p < 0.0001), and rs2235091 (p = 0.0085) and rs198969 (p = 0.0069) in KLK4. Genotype and allele frequencies indicated both risk and protective variants for these markers. Single nucleotide polymorphisms in AMELX, AMBN, TUFT1, KLK4 genes may be considered as a risk factor for dental caries occurrence in Polish children.
Palmer, RHC; Brick, L; Nugent, NR; Bidwell, LC; McGeary, JE; Knopik, VS; Keller, MC
2014-01-01
Background and Aims Twin and family studies suggest that genetic influences are shared across substances of abuse. However, despite evidence of heritability, genome-wide association and candidate gene studies have indicated numerous markers of limited effects, suggesting that much of the heritability remains missing. We estimated (1) the aggregate effect of common single nucleotide polymorphisms (SNPs) on multiple indicators of comorbid drug problems that are typically employed across community and population-based samples, and (2) the genetic covariance across these measures. Participants 2596 unrelated subjects from the “Study of Addiction: Genetics and Environment” provided information on alcohol, tobacco, cocaine, cannabis, and other illicit substance dependence. Phenotypic measures included: (1) a factor score based on DSM-IV drug dependence diagnoses (DD), (2) a factor score based on problem use (PU; i.e., 1+ DSM-IV symptoms), and (3) dependence vulnerability (DV; a ratio of DSM-IV symptoms to the number of substances used). Findings Univariate and bivariate Genome-wide complex trait analyses of this selected sample indicated that common SNPs explained 25-36% of the variance across measures, with DD and DV having the largest effects [h2SNP (CI)=0.36 (0.11-0.62) and 0.33(0.07-0.58), respectively; PU = 0.25 (-0.01-0.51)]. Genetic effects were shared across the three phenotypic measures of comorbid drug problems (rSNP; rDD-PU = 0.92 (0.76-1.00), rDD-DV = 0.97 (0.87-1.00), and rPU-DV = 0.96 (0.82-1.00)). Conclusion At least 20% of the variance in the generalized vulnerability to substance dependence is attributable to common single nucleotide polymorphisms. The additive effect of common single nucleotide polymorphisms is shared across important indicators of comorbid drug problems. PMID:25424661
Association between polymorphisms in prostanoid receptor genes and aspirin-intolerant asthma.
Kim, Sang-Heon; Kim, Yoon-Keun; Park, Heung-Woo; Jee, Young-Koo; Kim, Sang-Hoon; Bahn, Joon-Woo; Chang, Yoon-Seok; Kim, Seung-Hyun; Ye, Young-Min; Shin, Eun-Soon; Lee, Jong-Eun; Park, Hae-Sim; Min, Kyung-Up
2007-04-01
Genetic predisposition is linked to the pathogenesis of aspirin-intolerant asthma. Most candidate gene approaches have focused on leukotriene-related pathways, whereas there have been relatively few studies evaluating the effects of polymorphisms in prostanoid receptor genes on the development of aspirin-intolerant asthma. Therefore, we investigated the potential association between prostanoid receptor gene polymorphisms and the aspirin-intolerant asthma phenotype. We screened for genetic variations in the prostanoid receptor genes PTGER1, PTGER2, PTGER3, PTGER4, PTGDR, PTGIR, PTGFR, and TBXA2R using direct sequencing, and selected 32 tagging single nucleotide polymorphisms among the 77 polymorphisms with frequencies >0.02 based on linkage disequilibrium for genotyping. We compared the genotype distributions and allele frequencies of three participant groups (108 patients with aspirin-intolerant asthma, 93 patients with aspirin-tolerant asthma, and 140 normal controls). Through association analyses studies of the 32 single nucleotide polymorphisms, the following single nucleotide polymorphisms were found to have significant associations with the aspirin-intolerant asthma phenotype: -616C>G (P=0.038) and -166G>A (P=0.023) in PTGER2; -1709T>A (P=0.043) in PTGER3; -1254A>G (P=0.018) in PTGER4; 1915T>C (P=0.015) in PTGIR; and -4684C>T (P=0.027), and 795T>C (P=0.032) in TBXA2R. In the haplotype analysis of each gene, the frequency of PTGIR ht3[G-G-C-C], which includes 1915T>C, differed significantly between the aspirin-intolerant asthma patients and aspirin-tolerant asthma patients (P=0.015). These findings suggest that genetic polymorphisms in PTGER2, PTGER3, PTGER4, PTGIR, and TBXA2R play important roles in the pathogenesis of aspirin-intolerant asthma.
Schmidt, Börge; Dragano, Nico; Scherag, André; Pechlivanis, Sonali; Hoffmann, Per; Nöthen, Markus M; Erbel, Raimund; Jöckel, Karl-Heinz; Moebus, Susanne
2014-06-16
The relevance of disease-related genetic variants for the explanation of social inequalities in complex diseases is unclear and empirical analyses are largely missing. The aim of our study was to examine whether genetic variants predisposing to diabetes mellitus are associated with socioeconomic status in a population-based cohort. We genotyped 11 selected diabetes-related single nucleotide polymorphisms in 4655 participants (age 45-75 years) of the Heinz Nixdorf Recall study. Diabetes status was self-reported or defined by blood glucose levels. Education, income and paternal occupation were assessed as indicators of socioeconomic status. Multiple regression analyses were used to examine the association of socioeconomic status and diabetes by estimating sex-specific and age-adjusted prevalence ratios and their corresponding 95%-confidence intervals. To explore the relationship between individual single nucleotide polymorphisms and socioeconomic status sex- and age-adjusted odds ratios were computed. We adjusted the alpha-level for multiple testing of 11 single nucleotide polymorphisms using Bonferroni's method (α(BF) ~ 0.005). In addition, we explored the association of a genetic risk score with socioeconomic status. Social inequalities in diabetes were observed for all indicators of socioeconomic status. However, there were no significant associations between individual diabetes-related risk alleles and socioeconomic status with odds ratios ranging from 0.87 to 1.23. Similarly, the genetic risk score analysis revealed no evidence for an association. Our data provide no evidence for an association between 11 diabetes-related risk alleles and different indicators of socioeconomic status in a population-based cohort, suggesting that the explored genetic variants do not contribute to health inequalities in diabetes.
Kavakiotis, Ioannis; Samaras, Patroklos; Triantafyllidis, Alexandros; Vlahavas, Ioannis
2017-11-01
Single Nucleotide Polymorphism (SNPs) are, nowadays, becoming the marker of choice for biological analyses involving a wide range of applications with great medical, biological, economic and environmental interest. Classification tasks i.e. the assignment of individuals to groups of origin based on their (multi-locus) genotypes, are performed in many fields such as forensic investigations, discrimination between wild and/or farmed populations and others. Τhese tasks, should be performed with a small number of loci, for computational as well as biological reasons. Thus, feature selection should precede classification tasks, especially for Single Nucleotide Polymorphism (SNP) datasets, where the number of features can amount to hundreds of thousands or millions. In this paper, we present a novel data mining approach, called FIFS - Frequent Item Feature Selection, based on the use of frequent items for selection of the most informative markers from population genomic data. It is a modular method, consisting of two main components. The first one identifies the most frequent and unique genotypes for each sampled population. The second one selects the most appropriate among them, in order to create the informative SNP subsets to be returned. The proposed method (FIFS) was tested on a real dataset, which comprised of a comprehensive coverage of pig breed types present in Britain. This dataset consisted of 446 individuals divided in 14 sub-populations, genotyped at 59,436 SNPs. Our method outperforms the state-of-the-art and baseline methods in every case. More specifically, our method surpassed the assignment accuracy threshold of 95% needing only half the number of SNPs selected by other methods (FIFS: 28 SNPs, Delta: 70 SNPs Pairwise FST: 70 SNPs, In: 100 SNPs.) CONCLUSION: Our approach successfully deals with the problem of informative marker selection in high dimensional genomic datasets. It offers better results compared to existing approaches and can aid biologists in selecting the most informative markers with maximum discrimination power for optimization of cost-effective panels with applications related to e.g. species identification, wildlife management, and forensics. Copyright © 2017 Elsevier Ltd. All rights reserved.
Togashi, K; Hagiya, K; Osawa, T; Nakanishi, T; Yamazaki, T; Nagamine, Y; Lin, C Y; Matsumoto, S; Aihara, M; Hayasaka, K
2012-08-01
We first sought to clarify the effects of discounted rate, survival rate, and lactation persistency as a component trait of the selection index on net merit, defined as the first five lactation milks and herd life (HL) weighted by 1 and 0.389 (currently used in Japan), respectively, in units of genetic standard deviation. Survival rate increased the relative economic importance of later lactation traits and the first five lactation milk yields during the first 120 months from the start of the breeding scheme. In contrast, reliabilities of the estimated breeding value (EBV) in later lactation traits are lower than those of earlier lactation traits. We then sought to clarify the effects of applying single nucleotide polymorphism (SNP) on net merit to improve the reliability of EBV of later lactation traits to maximize their increased economic importance due to increase in survival rate. Net merit, selection accuracy, and HL increased by adding lactation persistency to the selection index whose component traits were only milk yields. Lactation persistency of the second and (especially) third parities contributed to increasing HL while maintaining the first five lactation milk yields compared with the selection index whose only component traits were milk yields. A selection index comprising the first three lactation milk yields and persistency accounted for 99.4% of net merit derived from a selection index whose components were identical to those for net merit. We consider that the selection index comprising the first three lactation milk yields and persistency is a practical method for increasing lifetime milk yield in the absence of data regarding HL. Applying SNP to the second- and third-lactation traits and HL increased net merit and HL by maximizing the increased economic importance of later lactation traits, reducing the effect of first-lactation milk yield on HL (genetic correlation (rG) = -0.006), and by augmenting the effects of the second- and third-lactation milk yields on HL (rG = 0.118 and 0.257, respectively).
2013-07-01
as a statistical graphic, and Pearson product moment correlation coefficients as measures of the strength of linear association; 4) performing SNP ...determine if there are differences in single nucleotide polymorphisms ( SNPs ) in selected candidate genes implicated in metabolic syndrome, obesity, chronic...samples for the serum and SNP analyses. We have reached a target of 500 patients at the end of year 2; however, some of the patients turned out to be
USDA-ARS?s Scientific Manuscript database
The objective of this study is to investigate single nucleotide polymorphism (SNP) genotypes imputation of Hereford cattle. Purebred Herefords were from two sources, Line 1 Hereford (N=240) and representatives of Industry Herefords (N=311). Using different reference panels of 62 and 494 males with 1...
USDA-ARS?s Scientific Manuscript database
Salmonid genomes are considered to be in a pseudo-tetraploid state as a result of an evolutionarily recent genome duplication event. This situation complicates single nucleotide polymorphism (SNP) discovery in rainbow trout as many putative SNPs are actually paralogous sequence variants (PSVs) and ...
USDA-ARS?s Scientific Manuscript database
Fertilization and development of the preimplantation embryo is under genetic control. The goal of the current study was to test 434 single nucleotide polymorphisms (SNPs) for association with genetic variation in fertilization and early embryonic development. The approach was to produce embryos from...
Prospects for inferring pairwise relationships with single nucleotide polymorphisms
Jeffery C. Glaubitz; O. Eugene, Jr. Rhodes; J. Andrew DeWoody
2003-01-01
An extraordinarily large number of single nucleotide polymorphisms (SNPs) are now available in humans as well as in other model organisms. Technological advancements may soon make it feasible to assay hundreds of SNPs in virtually any organism of interest. One potential application of SNPs is the determination of pairwise genetic relationships in populations without...
USDA-ARS?s Scientific Manuscript database
Call rate has been used as a measure of quality on both a single nucleotide polymorphism (SNP) and animal basis since SNP genotypes were first used in genomic evaluation of dairy cattle. The genotyping laboratories perform initial quality control screening and genotypes that fail are usually exclude...
Yu, Hong; Liu, Jun; Yang, Aiping; Yang, Guohui; Yang, Wenjun; Lei, Heyue; Quan, Jianjun; Zhang, Zengyu
2016-04-01
Genetic factors play an important role in childhood autism. This study is to determine the association of single-nucleotide polymorphisms in dopa decarboxylase (DDC) and dopamine receptor-1 (DRD1) genes with childhood autism, in a Chinese Han population. A total of 211 autistic children and 250 age- and gender-matched healthy controls were recruited. The severity of disease was determined by Children Autism Rating Scale scores. TaqMan Probe by real-time polymerase chain reaction was used to determine genotypes and allele frequencies of single-nucleotide polymorphism rs6592961 in DDC and rs251937 in DRD1. Case-control and case-only studies were respectively performed, to determine the contribution of both single-nucleotide polymorphisms to the predisposition of disease and its severity. Our results showed that there was no significant association of the genotypes and allele frequencies of both single-nucleotide polymorphisms concerning childhood autism and its severity. More studies with larger samples are needed to corroborate their predicting roles. © The Author(s) 2015.
Single-molecule comparison of DNA Pol I activity with native and analog nucleotides
NASA Astrophysics Data System (ADS)
Gul, Osman; Olsen, Tivoli; Choi, Yongki; Corso, Brad; Weiss, Gregory; Collins, Philip
2014-03-01
DNA polymerases are critical enzymes for DNA replication, and because of their complex catalytic cycle they are excellent targets for investigation by single-molecule experimental techniques. Recently, we studied the Klenow fragment (KF) of DNA polymerase I using a label-free, electronic technique involving single KF molecules attached to carbon nanotube transistors. The electronic technique allowed long-duration monitoring of a single KF molecule while processing thousands of template strands. Processivity of up to 42 nucleotide bases was directly observed, and statistical analysis of the recordings determined key kinetic parameters for the enzyme's open and closed conformations. Subsequently, we have used the same technique to compare the incorporation of canonical nucleotides like dATP to analogs like 1-thio-2'-dATP. The analog had almost no affect on duration of the closed conformation, during which the nucleotide is incorporated. On the other hand, the analog increased the rate-limiting duration of the open conformation by almost 40%. We propose that the thiolated analog interferes with KF's recognition and binding, two key steps that determine its ensemble turnover rate.
Xu, Zhi; Reynolds, Gavin P; Yuan, Yonggui; Shi, Yanyan; Pu, Mengjia; Zhang, Zhijun
2016-11-01
Variation in genes implicated in monoamine neurotransmission may interact with environmental factors to influence antidepressant response. We aimed to determine how a range of single nucleotide polymorphisms in monoaminergic genes influence this response to treatment and how they interact with childhood trauma and recent life stress in a Chinese sample. An initial study of monoaminergic coding region single nucleotide polymorphisms identified significant associations of TPH2 and HTR1B single nucleotide polymorphisms with treatment response that showed interactions with childhood and recent life stress, respectively (Xu et al., 2012). A total of 47 further single nucleotide polymorphisms in 17 candidate monoaminergic genes were genotyped in 281 Chinese Han patients with major depressive disorder. Response to 6 weeks' antidepressant treatment was determined by change in the 17-item Hamilton Depression Rating Scale score, and previous stressful events were evaluated by the Life Events Scale and Childhood Trauma Questionnaire-Short Form. Three TPH2 single nucleotide polymorphisms (rs11178998, rs7963717, and rs2171363) were significantly associated with antidepressant response in this Chinese sample, as was a haplotype in TPH2 (rs2171363 and rs1487278). One of these, rs2171363, showed a significant interaction with childhood adversity in its association with antidepressant response. These findings provide further evidence that variation in TPH2 is associated with antidepressant response and may also interact with childhood trauma to influence outcome of antidepressant treatment. © The Author 2016. Published by Oxford University Press on behalf of CINP.
Reynolds, Gavin P.; Yuan, Yonggui; Shi, Yanyan; Pu, Mengjia; Zhang, Zhijun
2016-01-01
Background: Variation in genes implicated in monoamine neurotransmission may interact with environmental factors to influence antidepressant response. We aimed to determine how a range of single nucleotide polymorphisms in monoaminergic genes influence this response to treatment and how they interact with childhood trauma and recent life stress in a Chinese sample. An initial study of monoaminergic coding region single nucleotide polymorphisms identified significant associations of TPH2 and HTR1B single nucleotide polymorphisms with treatment response that showed interactions with childhood and recent life stress, respectively (Xu et al., 2012). Methods: A total of 47 further single nucleotide polymorphisms in 17 candidate monoaminergic genes were genotyped in 281 Chinese Han patients with major depressive disorder. Response to 6 weeks’ antidepressant treatment was determined by change in the 17-item Hamilton Depression Rating Scale score, and previous stressful events were evaluated by the Life Events Scale and Childhood Trauma Questionnaire-Short Form. Results: Three TPH2 single nucleotide polymorphisms (rs11178998, rs7963717, and rs2171363) were significantly associated with antidepressant response in this Chinese sample, as was a haplotype in TPH2 (rs2171363 and rs1487278). One of these, rs2171363, showed a significant interaction with childhood adversity in its association with antidepressant response. Conclusions: These findings provide further evidence that variation in TPH2 is associated with antidepressant response and may also interact with childhood trauma to influence outcome of antidepressant treatment. PMID:27521242
Empirical Performance of Cross-Validation With Oracle Methods in a Genomics Context.
Martinez, Josue G; Carroll, Raymond J; Müller, Samuel; Sampson, Joshua N; Chatterjee, Nilanjan
2011-11-01
When employing model selection methods with oracle properties such as the smoothly clipped absolute deviation (SCAD) and the Adaptive Lasso, it is typical to estimate the smoothing parameter by m-fold cross-validation, for example, m = 10. In problems where the true regression function is sparse and the signals large, such cross-validation typically works well. However, in regression modeling of genomic studies involving Single Nucleotide Polymorphisms (SNP), the true regression functions, while thought to be sparse, do not have large signals. We demonstrate empirically that in such problems, the number of selected variables using SCAD and the Adaptive Lasso, with 10-fold cross-validation, is a random variable that has considerable and surprising variation. Similar remarks apply to non-oracle methods such as the Lasso. Our study strongly questions the suitability of performing only a single run of m-fold cross-validation with any oracle method, and not just the SCAD and Adaptive Lasso.
A large-scale study of the random variability of a coding sequence: a study on the CFTR gene.
Modiano, Guido; Bombieri, Cristina; Ciminelli, Bianca Maria; Belpinati, Francesca; Giorgi, Silvia; Georges, Marie des; Scotet, Virginie; Pompei, Fiorenza; Ciccacci, Cinzia; Guittard, Caroline; Audrézet, Marie Pierre; Begnini, Angela; Toepfer, Michael; Macek, Milan; Ferec, Claude; Claustres, Mireille; Pignatti, Pier Franco
2005-02-01
Coding single nucleotide substitutions (cSNSs) have been studied on hundreds of genes using small samples (n(g) approximately 100-150 genes). In the present investigation, a large random European population sample (average n(g) approximately 1500) was studied for a single gene, the CFTR (Cystic Fibrosis Transmembrane conductance Regulator). The nonsynonymous (NS) substitutions exhibited, in accordance with previous reports, a mean probability of being polymorphic (q > 0.005), much lower than that of the synonymous (S) substitutions, but they showed a similar rate of subpolymorphic (q < 0.005) variability. This indicates that, in autosomal genes that may have harmful recessive alleles (nonduplicated genes with important functions), genetic drift overwhelms selection in the subpolymorphic range of variability, making disadvantageous alleles behave as neutral. These results imply that the majority of the subpolymorphic nonsynonymous alleles of these genes are selectively negative or even pathogenic.
Jo, Joon-Jung; Kim, Min-Ji; Son, Jung-Tae; Kim, Jandi; Shin, Jong-Shik
2009-07-17
Nucleic acid hybridization is one of the essential biological processes involved in storage and transmission of genetic information. Here we quantitatively determined the effect of secondary structure on the hybridization activation energy using structurally defined oligonucleotides. It turned out that activation energy is linearly proportional to the length of a single-stranded region flanking a nucleation site, generating a 0.18 kcal/mol energy barrier per nucleotide. Based on this result, we propose that the presence of single-stranded segments available for non-productive base pairing with a nucleation counterpart extends the searching process for nucleation sites to find a perfect match. This result may provide insights into rational selection of a target mRNA site for siRNA and antisense gene silencing.
Structural insights into translational recoding by frameshift suppressor tRNA SufJ
Fagan, Crystal E.; Maehigashi, Tatsuya; Dunkle, Jack A.; ...
2014-10-28
The three-nucleotide mRNA reading frame is tightly regulated during translation to ensure accurate protein expression. Translation errors that lead to aberrant protein production can result from the uncoupled movement of the tRNA in either the 5' or 3' direction on mRNA. Here, we report the biochemical and structural characterization of +1 frameshift suppressor tRNA SufJ, a tRNA known to decode four, instead of three, nucleotides. Frameshift suppressor tRNA SufJ contains an insertion 5' to its anticodon, expanding the anticodon loop from seven to eight nucleotides. Our results indicate that the expansion of the anticodon loop of either ASL SufJ ormore » tRNA SufJ does not affect its affinity for the A site of the ribosome. Structural analyses of both ASL SufJ and ASL Thr bound to the Thermus thermophilus 70S ribosome demonstrate both ASLs decode in the zero frame. Although the anticodon loop residues 34–37 are superimposable with canonical seven-nucleotide ASLs, the single C31.5 insertion between nucleotides 31 and 32 in ASL SufJ imposes a conformational change of the anticodon stem, that repositions and tilts the ASL toward the back of the A site. Further modeling analyses reveal that this tilting would cause a distortion in full-length A-site tRNA SufJ during tRNA selection and possibly impede gripping of the anticodon stem by 16S rRNA nucleotides in the P site. Together, these data implicate tRNA distortion as a major driver of noncanonical translation events such as frameshifting.« less
NASA Astrophysics Data System (ADS)
An, Le; Adeli, Ehsan; Liu, Mingxia; Zhang, Jun; Lee, Seong-Whan; Shen, Dinggang
2017-03-01
Classification is one of the most important tasks in machine learning. Due to feature redundancy or outliers in samples, using all available data for training a classifier may be suboptimal. For example, the Alzheimer’s disease (AD) is correlated with certain brain regions or single nucleotide polymorphisms (SNPs), and identification of relevant features is critical for computer-aided diagnosis. Many existing methods first select features from structural magnetic resonance imaging (MRI) or SNPs and then use those features to build the classifier. However, with the presence of many redundant features, the most discriminative features are difficult to be identified in a single step. Thus, we formulate a hierarchical feature and sample selection framework to gradually select informative features and discard ambiguous samples in multiple steps for improved classifier learning. To positively guide the data manifold preservation process, we utilize both labeled and unlabeled data during training, making our method semi-supervised. For validation, we conduct experiments on AD diagnosis by selecting mutually informative features from both MRI and SNP, and using the most discriminative samples for training. The superior classification results demonstrate the effectiveness of our approach, as compared with the rivals.
Single Nucleotide Polymorphisms Predict Symptom Severity of Autism Spectrum Disorder
ERIC Educational Resources Information Center
Jiao, Yun; Chen, Rong; Ke, Xiaoyan; Cheng, Lu; Chu, Kangkang; Lu, Zuhong; Herskovits, Edward H.
2012-01-01
Autism is widely believed to be a heterogeneous disorder; diagnosis is currently based solely on clinical criteria, although genetic, as well as environmental, influences are thought to be prominent factors in the etiology of most forms of autism. Our goal is to determine whether a predictive model based on single-nucleotide polymorphisms (SNPs)…
USDA-ARS?s Scientific Manuscript database
Background/Objectives: The misincorporation of uracil into DNA leads to genomic instability. In a previous study, some of us identified four common single nucleotide polymorphisms (SNPs) in uracil-processing genes (rs2029166 and rs7296239 in SMUG1, rs34259 in UNG and rs4775748 in DUT) that were asso...
USDA-ARS?s Scientific Manuscript database
Single-nucleotide Polymorphism (SNP) markers are by far the most common form of DNA polymorphism in a genome. The objectives of this study were to discover SNPs in common bean comparing sequences from coding and non-coding regions obtained from Genbank and genomic DNA and to compare sequencing resu...
USDA-ARS?s Scientific Manuscript database
Previously, a candidate gene approach identified 51 single nucleotide polymorphisms (SNP) associated with genetic merit for reproductive traits and 26 associated with genetic merit for production in dairy bulls. We evaluated association of the 77 SNPs with days open (DO) for first lactation in a pop...
USDA-ARS?s Scientific Manuscript database
Watermelon (Citrullus lanatus var. lanatus) is an important vegetable fruit throughout the world. A high number of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers should provide large coverage of the watermelon genome and high phylogenetic resolution of germplasm acces...
Demirci, Berna; Lee, Yoosook; Lanzaro, Gregory C; Alten, Bulent
2012-05-01
Culex theileri Theobald (Diptera: Culicidae) is one of the most common mosquito species in northeastern Turkey and serves as a vector for various zoonotic diseases including West Nile virus. Although there have been some studies on the ecology of Cx. theileri, very little genetic data has been made available. We successfully sequenced 11 gene fragments from Cx. theileri specimens collected from the northeastern part of Turkey. On average, we found a Single nucleotide polymorphism every 45 bp. Transitions outnumbered transversions, at a ratio of 2:1. This is the first report of genetic polymorphisms in Cx. theileri and Single nucleotide polymorphism discovered from this study can be used to investigate population structure and gene-environmental interactions.
Dutoit, Ludovic; Burri, Reto; Nater, Alexander; Mugal, Carina F; Ellegren, Hans
2017-07-01
Properly estimating genetic diversity in populations of nonmodel species requires a basic understanding of how diversity is distributed across the genome and among individuals. To this end, we analysed whole-genome resequencing data from 20 collared flycatchers (genome size ≈1.1 Gb; 10.13 million single nucleotide polymorphisms detected). Genomewide nucleotide diversity was almost identical among individuals (mean = 0.00394, range = 0.00384-0.00401), but diversity levels varied extensively across the genome (95% confidence interval for 200-kb windows = 0.0013-0.0053). Diversity was related to selective constraint such that in comparison with intergenic DNA, diversity at fourfold degenerate sites was reduced to 85%, 3' UTRs to 82%, 5' UTRs to 70% and nondegenerate sites to 12%. There was a strong positive correlation between diversity and chromosome size, probably driven by a higher density of targets for selection on smaller chromosomes increasing the diversity-reducing effect of linked selection. Simulations exploring the ability of sequence data from a small number of genetic markers to capture the observed diversity clearly demonstrated that diversity estimation from finite sampling of such data is bound to be associated with large confidence intervals. Nevertheless, we show that precision in diversity estimation in large outbred population benefits from increasing the number of loci rather than the number of individuals. Simulations mimicking RAD sequencing showed that this approach gives accurate estimates of genomewide diversity. Based on the patterns of observed diversity and the performed simulations, we provide broad recommendations for how genetic diversity should be estimated in natural populations. © 2016 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
Okamura, Kohji; Sakaguchi, Hironari; Sakamoto-Abutani, Rie; Nakanishi, Mahito; Nishimura, Ken; Yamazaki-Inoue, Mayu; Ohtaka, Manami; Periasamy, Vaiyapuri Subbarayan; Alshatwi, Ali Abdullah; Higuchi, Akon; Hanaoka, Kazunori; Nakabayashi, Kazuhiko; Takada, Shuji; Hata, Kenichiro; Toyoda, Masashi; Umezawa, Akihiro
2016-01-01
Disease-specific induced pluripotent stem cells (iPSCs) have been used as a model to analyze pathogenesis of disease. In this study, we generated iPSCs derived from a fibroblastic cell line of xeroderma pigmentosum (XP) group A (XPA-iPSCs), a rare autosomal recessive hereditary disease in which patients develop skin cancer in the areas of skin exposed to sunlight. XPA-iPSCs exhibited hypersensitivity to ultraviolet exposure and accumulation of single-nucleotide substitutions when compared with ataxia telangiectasia-derived iPSCs that were established in a previous study. However, XPA-iPSCs did not show any chromosomal instability in vitro, i.e. intact chromosomes were maintained. The results were mutually compensating for examining two major sources of mutations, nucleotide excision repair deficiency and double-strand break repair deficiency. Like XP patients, XPA-iPSCs accumulated single-nucleotide substitutions that are associated with malignant melanoma, a manifestation of XP. These results indicate that XPA-iPSCs may serve a monitoring tool (analogous to the Ames test but using mammalian cells) to measure single-nucleotide alterations, and may be a good model to clarify pathogenesis of XP. In addition, XPA-iPSCs may allow us to facilitate development of drugs that delay genetic alteration and decrease hypersensitivity to ultraviolet for therapeutic applications. PMID:27197874
2017-01-01
ABSTRACT RNA viruses are one of the fastest-evolving biological entities. Within their hosts, they exist as genetically diverse populations (i.e., viral mutant swarms), which are sculpted by different evolutionary mechanisms, such as mutation, natural selection, and genetic drift, and also the interactions between genetic variants within the mutant swarms. To elucidate the mechanisms that modulate the population diversity of an important plant-pathogenic virus, we performed evolution experiments with Potato virus Y (PVY) in potato genotypes that differ in their defense response against the virus. Using deep sequencing of small RNAs, we followed the temporal dynamics of standing and newly generated variations in the evolving viral lineages. A time-sampled approach allowed us to (i) reconstruct theoretical haplotypes in the starting population by using clustering of single nucleotide polymorphisms' trajectories and (ii) use quantitative population genetics approaches to estimate the contribution of selection and genetic drift, and their interplay, to the evolution of the virus. We detected imprints of strong selective sweeps and narrow genetic bottlenecks, followed by the shift in frequency of selected haplotypes. Comparison of patterns of viral evolution in differently susceptible host genotypes indicated possible diversifying evolution of PVY in the less-susceptible host (efficient in the accumulation of salicylic acid). IMPORTANCE High diversity of within-host populations of RNA viruses is an important aspect of their biology, since they represent a reservoir of genetic variants, which can enable quick adaptation of viruses to a changing environment. This study focuses on an important plant virus, Potato virus Y, and describes, at high resolution, temporal changes in the structure of viral populations within different potato genotypes. A novel and easy-to-implement computational approach was established to cluster single nucleotide polymorphisms into viral haplotypes from very short sequencing reads. During the experiment, a shift in the frequency of selected viral haplotypes was observed after a narrow genetic bottleneck, indicating an important role of the genetic drift in the evolution of the virus. On the other hand, a possible case of diversifying selection of the virus was observed in less susceptible host genotypes. PMID:28592544
High-throughput discovery of rare human nucleotide polymorphisms by Ecotilling
Till, Bradley J.; Zerr, Troy; Bowers, Elisabeth; Greene, Elizabeth A.; Comai, Luca; Henikoff, Steven
2006-01-01
Human individuals differ from one another at only ∼0.1% of nucleotide positions, but these single nucleotide differences account for most heritable phenotypic variation. Large-scale efforts to discover and genotype human variation have been limited to common polymorphisms. However, these efforts overlook rare nucleotide changes that may contribute to phenotypic diversity and genetic disorders, including cancer. Thus, there is an increasing need for high-throughput methods to robustly detect rare nucleotide differences. Toward this end, we have adapted the mismatch discovery method known as Ecotilling for the discovery of human single nucleotide polymorphisms. To increase throughput and reduce costs, we developed a universal primer strategy and implemented algorithms for automated band detection. Ecotilling was validated by screening 90 human DNA samples for nucleotide changes in 5 gene targets and by comparing results to public resequencing data. To increase throughput for discovery of rare alleles, we pooled samples 8-fold and found Ecotilling to be efficient relative to resequencing, with a false negative rate of 5% and a false discovery rate of 4%. We identified 28 new rare alleles, including some that are predicted to damage protein function. The detection of rare damaging mutations has implications for models of human disease. PMID:16893952
Robinson, James; Guethlein, Lisbeth A; Cereb, Nezih; Yang, Soo Young; Norman, Paul J; Marsh, Steven G E; Parham, Peter
2017-06-01
HLA class I glycoproteins contain the functional sites that bind peptide antigens and engage lymphocyte receptors. Recently, clinical application of sequence-based HLA typing has uncovered an unprecedented number of novel HLA class I alleles. Here we define the nature and extent of the variation in 3,489 HLA-A, 4,356 HLA-B and 3,111 HLA-C alleles. This analysis required development of suites of methods, having general applicability, for comparing and analyzing large numbers of homologous sequences. At least three amino-acid substitutions are present at every position in the polymorphic α1 and α2 domains of HLA-A, -B and -C. A minority of positions have an incidence >1% for the 'second' most frequent nucleotide, comprising 70 positions in HLA-A, 85 in HLA-B and 54 in HLA-C. The majority of these positions have three or four alternative nucleotides. These positions were subject to positive selection and correspond to binding sites for peptides and receptors. Most alleles of HLA class I (>80%) are very rare, often identified in one person or family, and they differ by point mutation from older, more common alleles. These alleles with single nucleotide polymorphisms reflect the germ-line mutation rate. Their frequency predicts the human population harbors 8-9 million HLA class I variants. The common alleles of human populations comprise 42 core alleles, which represent all selected polymorphism, and recombinants that have assorted this polymorphism.
Cereb, Nezih; Yang, Soo Young; Marsh, Steven G. E.; Parham, Peter
2017-01-01
HLA class I glycoproteins contain the functional sites that bind peptide antigens and engage lymphocyte receptors. Recently, clinical application of sequence-based HLA typing has uncovered an unprecedented number of novel HLA class I alleles. Here we define the nature and extent of the variation in 3,489 HLA-A, 4,356 HLA-B and 3,111 HLA-C alleles. This analysis required development of suites of methods, having general applicability, for comparing and analyzing large numbers of homologous sequences. At least three amino-acid substitutions are present at every position in the polymorphic α1 and α2 domains of HLA-A, -B and -C. A minority of positions have an incidence >1% for the ‘second’ most frequent nucleotide, comprising 70 positions in HLA-A, 85 in HLA-B and 54 in HLA-C. The majority of these positions have three or four alternative nucleotides. These positions were subject to positive selection and correspond to binding sites for peptides and receptors. Most alleles of HLA class I (>80%) are very rare, often identified in one person or family, and they differ by point mutation from older, more common alleles. These alleles with single nucleotide polymorphisms reflect the germ-line mutation rate. Their frequency predicts the human population harbors 8–9 million HLA class I variants. The common alleles of human populations comprise 42 core alleles, which represent all selected polymorphism, and recombinants that have assorted this polymorphism. PMID:28650991
Volkán-Kacsó, Sándor; Marcus, Rudolph A
2016-10-25
A recently proposed chemomechanical group transfer theory of rotary biomolecular motors is applied to treat single-molecule controlled rotation experiments. In these experiments, single-molecule fluorescence is used to measure the binding and release rate constants of nucleotides by monitoring the occupancy of binding sites. It is shown how missed events of nucleotide binding and release in these experiments can be corrected using theory, with F 1 -ATP synthase as an example. The missed events are significant when the reverse rate is very fast. Using the theory the actual rate constants in the controlled rotation experiments and the corrections are predicted from independent data, including other single-molecule rotation and ensemble biochemical experiments. The effective torsional elastic constant is found to depend on the binding/releasing nucleotide, and it is smaller for ADP than for ATP. There is a good agreement, with no adjustable parameters, between the theoretical and experimental results of controlled rotation experiments and stalling experiments, for the range of angles where the data overlap. This agreement is perhaps all the more surprising because it occurs even though the binding and release of fluorescent nucleotides is monitored at single-site occupancy concentrations, whereas the stalling and free rotation experiments have multiple-site occupancy.
Yang, Cheng-Hong; Wu, Kuo-Chuan; Chuang, Li-Yeh; Chang, Hsueh-Wei
2018-01-01
DNA barcode sequences are accumulating in large data sets. A barcode is generally a sequence larger than 1000 base pairs and generates a computational burden. Although the DNA barcode was originally envisioned as straightforward species tags, the identification usage of barcode sequences is rarely emphasized currently. Single-nucleotide polymorphism (SNP) association studies provide us an idea that the SNPs may be the ideal target of feature selection to discriminate between different species. We hypothesize that SNP-based barcodes may be more effective than the full length of DNA barcode sequences for species discrimination. To address this issue, we tested a r ibulose diphosphate carboxylase ( rbcL ) S NP b arcoding (RSB) strategy using a decision tree algorithm. After alignment and trimming, 31 SNPs were discovered in the rbcL sequences from 38 Brassicaceae plant species. In the decision tree construction, these SNPs were computed to set up the decision rule to assign the sequences into 2 groups level by level. After algorithm processing, 37 nodes and 31 loci were required for discriminating 38 species. Finally, the sequence tags consisting of 31 rbcL SNP barcodes were identified for discriminating 38 Brassicaceae species based on the decision tree-selected SNP pattern using RSB method. Taken together, this study provides the rational that the SNP aspect of DNA barcode for rbcL gene is a useful and effective sequence for tagging 38 Brassicaceae species.
Payen, Thibaut; Murat, Claude; Gigant, Anaïs; Morin, Emmanuelle; De Mita, Stéphane; Martin, Francis
2015-09-01
The Périgord black truffle (Tuber melanosporum Vittad.), considered a gastronomic delicacy worldwide, is an ectomycorrhizal filamentous fungus that is ecologically important in Mediterranean French, Italian and Spanish woodlands. In this study, we developed a novel resource of single nucleotide polymorphisms (SNPs) for T. melanosporum using Illumina high-throughput resequencing. The genome from six T. melanosporum geographical accessions was sequenced to a depth of approximately 20×. These geographical accessions were selected from different populations within the northern and southern regions of the geographical species distribution. Approximately 80% of the reads for each of the six resequenced geographical accessions mapped against the reference T. melanosporum genome assembly, estimating the core genome size of this organism to be approximately 110 Mbp. A total of 442 326 SNPs corresponding to 3540 SNPs/Mbps were identified as being included in all seven genomes. The SNPs occurred more frequently in repeated sequences (85%), although 4501 SNPs were also identified in the coding regions of 2587 genes. Using the ratio of nonsynonymous mutations per nonsynonymous site (pN) to synonymous mutations per synonymous site (pS) and Tajima's D index scanning the whole genome, we were able to identify genomic regions and genes potentially subjected to positive or purifying selection. The SNPs identified represent a valuable resource for future population genetics and genomics studies. © 2015 John Wiley & Sons Ltd.
Development of a single nucleotide polymorphism barcode to genotype Plasmodium vivax infections.
Baniecki, Mary Lynn; Faust, Aubrey L; Schaffner, Stephen F; Park, Daniel J; Galinsky, Kevin; Daniels, Rachel F; Hamilton, Elizabeth; Ferreira, Marcelo U; Karunaweera, Nadira D; Serre, David; Zimmerman, Peter A; Sá, Juliana M; Wellems, Thomas E; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E; Volkman, Sarah K; Wirth, Dyann F; Sabeti, Pardis C
2015-03-01
Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25-40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections.
Development of a Single Nucleotide Polymorphism Barcode to Genotype Plasmodium vivax Infections
Baniecki, Mary Lynn; Faust, Aubrey L.; Schaffner, Stephen F.; Park, Daniel J.; Galinsky, Kevin; Daniels, Rachel F.; Hamilton, Elizabeth; Ferreira, Marcelo U.; Karunaweera, Nadira D.; Serre, David; Zimmerman, Peter A.; Sá, Juliana M.; Wellems, Thomas E.; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E.; Volkman, Sarah K.; Wirth, Dyann F.; Sabeti, Pardis C.
2015-01-01
Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25–40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890
Zhang, Yiwen; Cao, Man; Wang, Mengting; Ding, Xianping; Jing, Yaling; Chen, Zuyi; Ma, Tengjiao; Chen, Honghan
2016-07-01
Human papillomavirus (HPV) is the major causative agent of cervical cancer, which accounts for the second highest cancer burden in women worldwide. HPV-52, the prevalent subtype in Asia, especially in southwest China, was analyzed in this study. To analyze polymorphisms, intratypic variants, and genetic variability in the E6-E7 (n=26) and L1 (n=53) genes of HPV-52, these genes were sequenced and the sequences were submitted to GenBank. Phylogenetic trees were constructed using the neighbor-joining and Kimura 2-parameters methods, followed by analysis of the diversity of secondary structure. Finally, we estimated the selection pressures acting on the E6-E7 and L1 genes. Fifty-one novel variants of HPV-52 L1, and two novel variants of HPV-52 E6-E7 were identified in this study. Thirty single nucleotide changes were observed in HPV-52 E6-E7 sequences with 19/30 non-synonymous mutations and 11/30 synonymous mutations (five in the alpha helix and five in the beta sheet). Fifty-five single nucleotide changes were observed in HPV-52 L1 sequences with 17/55 non-synonymous mutations (seven in the alpha helix and fourteen in the beta sheet) and 38/55 synonymous mutations. Selective pressure analysis predicted that most of these mutations reflect positive selection. Identifying new variants in HPV-52 may inform the rational design of new vaccines specifically for women in southwest China. Knowledge of genetic variation in HPV may be useful as an epidemiologic correlate of cervical cancer risk, or may even provide critical information for developing diagnostic probes. Copyright © 2016 Elsevier B.V. All rights reserved.
Kozlov, Konstantin N.; Kulakovskiy, Ivan V.; Zubair, Asif; Marjoram, Paul; Lawrie, David S.; Nuzhdin, Sergey V.; Samsonova, Maria G.
2017-01-01
Annotating the genotype-phenotype relationship, and developing a proper quantitative description of the relationship, requires understanding the impact of natural genomic variation on gene expression. We apply a sequence-level model of gap gene expression in the early development of Drosophila to analyze single nucleotide polymorphisms (SNPs) in a panel of natural sequenced D. melanogaster lines. Using a thermodynamic modeling framework, we provide both analytical and computational descriptions of how single-nucleotide variants affect gene expression. The analysis reveals that the sequence variants increase (decrease) gene expression if located within binding sites of repressors (activators). We show that the sign of SNP influence (activation or repression) may change in time and space and elucidate the origin of this change in specific examples. The thermodynamic modeling approach predicts non-local and non-linear effects arising from SNPs, and combinations of SNPs, in individual fly genotypes. Simulation of individual fly genotypes using our model reveals that this non-linearity reduces to almost additive inputs from multiple SNPs. Further, we see signatures of the action of purifying selection in the gap gene regulatory regions. To infer the specific targets of purifying selection, we analyze the patterns of polymorphism in the data at two phenotypic levels: the strengths of binding and expression. We find that combinations of SNPs show evidence of being under selective pressure, while individual SNPs do not. The model predicts that SNPs appear to accumulate in the genotypes of the natural population in a way biased towards small increases in activating action on the expression pattern. Taken together, these results provide a systems-level view of how genetic variation translates to the level of gene regulatory networks via combinatorial SNP effects. PMID:28898266
He, Daniel; Lorenz, Robin; Kim, Choel; Herberg, Friedrich W; Lim, Chinten James
2017-12-15
The cyclic adenosine monophosphate (cAMP)- and cyclic guanosine monophosphate (cGMP)-dependent protein kinases (PKA and PKG) are key effectors of cyclic nucleotide signaling. Both share structural features that include tandem cyclic nucleotide-binding (CNB) domains, CNB-A and CNB-B, yet their functions are separated through preferential activation by either cAMP or cGMP. Based on structural studies and modeling, key CNB contact residues have been identified for both kinases. In this study, we explored the requirements for conversion of PKA activation from cAMP-dependent to cGMP-dependent. The consequences of the residue substitutions T192R/A212T within CNB-A or G316R/A336T within CNB-B of PKA-RIα on cyclic nucleotide binding and holoenzyme activation were assessed in vitro using purified recombinant proteins, and ex vivo using RIα-deficient mouse embryonic fibroblasts genetically reconstituted with wild-type or mutant PKA-RIα. In vitro, a loss of binding and activation selectivity was observed when residues in either one of the CNB domains were mutated, while mutations in both CNB domains resulted in a complete switch of selectivity from cAMP to cGMP. The switch in selectivity was also recapitulated ex vivo, confirming their functional roles in cells. Our results highlight the importance of key cyclic nucleotide contacts within each CNB domain and suggest that these domains may have evolved from an ancestral gene product to yield two distinct cyclic nucleotide-dependent protein kinases.
GAPIT: genome association and prediction integrated tool.
Lipka, Alexander E; Tian, Feng; Wang, Qishan; Peiffer, Jason; Li, Meng; Bradbury, Peter J; Gore, Michael A; Buckler, Edward S; Zhang, Zhiwu
2012-09-15
Software programs that conduct genome-wide association studies and genomic prediction and selection need to use methodologies that maximize statistical power, provide high prediction accuracy and run in a computationally efficient manner. We developed an R package called Genome Association and Prediction Integrated Tool (GAPIT) that implements advanced statistical methods including the compressed mixed linear model (CMLM) and CMLM-based genomic prediction and selection. The GAPIT package can handle large datasets in excess of 10 000 individuals and 1 million single-nucleotide polymorphisms with minimal computational time, while providing user-friendly access and concise tables and graphs to interpret results. http://www.maizegenetics.net/GAPIT. zhiwu.zhang@cornell.edu Supplementary data are available at Bioinformatics online.
Demographic history, selection and functional diversity of the canine genome.
Ostrander, Elaine A; Wayne, Robert K; Freedman, Adam H; Davis, Brian W
2017-12-01
The domestic dog represents one of the most dramatic long-term evolutionary experiments undertaken by humans. From a large wolf-like progenitor, unparalleled diversity in phenotype and behaviour has developed in dogs, providing a model for understanding the developmental and genomic mechanisms of diversification. We discuss pattern and process in domestication, beginning with general findings about early domestication and problems in documenting selection at the genomic level. Furthermore, we summarize genotype-phenotype studies based first on single nucleotide polymorphism (SNP) genotyping and then with whole-genome data and show how an understanding of evolution informs topics as different as human history, adaptive and deleterious variation, morphological development, ageing, cancer and behaviour.
Detecting signatures of selection within the Tibetan sheep mitochondrial genome.
Niu, Lili; Chen, Xiaoyong; Xiao, Ping; Zhao, Qianjun; Zhou, Jingxuan; Hu, Jiangtao; Sun, Hongxin; Guo, Jiazhong; Li, Li; Wang, Linjie; Zhang, Hongping; Zhong, Tao
2017-11-01
Tibetan sheep, a Chinese indigenous breed, are mainly distributed in plateau and mountain-valley areas at a terrestrial elevation between 2260 and 4100 m. The herd is genetically distinct from the other domestic sheep and undergoes acclimatization to adapt to the hypoxic environment. To date, whether the mitochondrial DNA modification of Tibetan sheep shares the same feature as the other domestic breed remains unknown. In this study, we compared the whole mitogenome sequences from 32 Tibetan sheep, 22 domestic sheep and 24 commercial sheep to identify the selection signatures of hypoxic-tolerant in Tibetan sheep. Nucleotide diversity analysis using the sliding window method showed that the highest level of nucleotide diversity was observed in the control region with a peak value of π = 0.05215, while the lowest π value was detected in the tRNAs region. qPCR results showed that the relative mtDNA copy number in Tibetan sheep was significantly lower than that in Suffolk sheep. None of the mutations in 12S rRNA were fixed in Tibetan sheep, which indicated that there has been less artificial selection in this herd than the other domestic and commercial breeds. Although one site (1277G) might undergo the purifying selection, it was not identified as the breed-specific allele in Tibetan sheep. We proposed that nature selection was the main drive during the domestication of Tibetan sheep and single mutation (or locus) could not reveal the signature of selection as for the high diversity in the mitogenome of Tibetan sheep.
regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution.
Zhang, Xinjun; Li, Meng; Lin, Hai; Rao, Xi; Feng, Weixing; Yang, Yuedong; Mort, Matthew; Cooper, David N; Wang, Yue; Wang, Yadong; Wells, Clark; Zhou, Yaoqi; Liu, Yunlong
2017-09-01
While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization.
Shi, Ainong; Qin, Jun; Mou, Beiquan; Correll, James; Weng, Yuejin; Brenner, David; Feng, Chunda; Motes, Dennis; Yang, Wei; Dong, Lingdi; Bhattarai, Gehendra; Ravelombola, Waltram
2017-01-01
Spinach (Spinacia oleracea L., 2n = 2x = 12) is an economically important vegetable crop worldwide and one of the healthiest vegetables due to its high concentrations of nutrients and minerals. The objective of this research was to conduct genetic diversity and population structure analysis of a collection of world-wide spinach genotypes using single nucleotide polymorphisms (SNPs) markers. Genotyping by sequencing (GBS) was used to discover SNPs in spinach genotypes. Three sets of spinach genotypes were used: 1) 268 USDA GRIN spinach germplasm accessions originally collected from 30 countries; 2) 45 commercial spinach F1 hybrids from three countries; and 3) 30 US Arkansas spinach cultivars/breeding lines. The results from this study indicated that there was genetic diversity among the 343 spinach genotypes tested. Furthermore, the genetic background in improved commercial F1 hybrids and in Arkansas cultivars/lines had a different structured populations from the USDA germplasm. In addition, the genetic diversity and population structures were associated with geographic origin and germplasm from the US Arkansas breeding program had a unique genetic background. These data could provide genetic diversity information and the molecular markers for selecting parents in spinach breeding programs.
Qin, Jun; Mou, Beiquan; Correll, James; Weng, Yuejin; Brenner, David; Feng, Chunda; Motes, Dennis; Yang, Wei; Dong, Lingdi; Bhattarai, Gehendra; Ravelombola, Waltram
2017-01-01
Spinach (Spinacia oleracea L., 2n = 2x = 12) is an economically important vegetable crop worldwide and one of the healthiest vegetables due to its high concentrations of nutrients and minerals. The objective of this research was to conduct genetic diversity and population structure analysis of a collection of world-wide spinach genotypes using single nucleotide polymorphisms (SNPs) markers. Genotyping by sequencing (GBS) was used to discover SNPs in spinach genotypes. Three sets of spinach genotypes were used: 1) 268 USDA GRIN spinach germplasm accessions originally collected from 30 countries; 2) 45 commercial spinach F1 hybrids from three countries; and 3) 30 US Arkansas spinach cultivars/breeding lines. The results from this study indicated that there was genetic diversity among the 343 spinach genotypes tested. Furthermore, the genetic background in improved commercial F1 hybrids and in Arkansas cultivars/lines had a different structured populations from the USDA germplasm. In addition, the genetic diversity and population structures were associated with geographic origin and germplasm from the US Arkansas breeding program had a unique genetic background. These data could provide genetic diversity information and the molecular markers for selecting parents in spinach breeding programs. PMID:29190770
The Single Nucleotide Polymorphism Consortium
NASA Technical Reports Server (NTRS)
Morgan, Michael
2003-01-01
I want to discuss both the Single Nucleotide Polymorphism (SNP) Consortium and the Human Genome Project. I am afraid most of my presentation will be thin on law and possibly too high on rhetoric. Having been engaged in a personal and direct way with these issues as a trained scientist, I find it quite difficult to be always as objective as I ought to be.
Xiao, Zhuo; Lie, Puchang; Fang, Zhiyuan; Yu, Luxin; Chen, Junhua; Liu, Jie; Ge, Chenchen; Zhou, Xuemeng; Zeng, Lingwen
2012-09-04
A lateral flow biosensor for detection of single nucleotide polymorphism based on circular strand displacement reaction (CSDPR) has been developed. Taking advantage of high fidelity of T4 DNA ligase, signal amplification by CSDPR, and the optical properties of gold nanoparticles, this assay has reached a detection limit of 0.01 fM.
A Laboratory Exercise for Genotyping Two Human Single Nucleotide Polymorphisms
ERIC Educational Resources Information Center
Fernando, James; Carlson, Bradley; LeBard, Timothy; McCarthy, Michael; Umali, Finianne; Ashton, Bryce; Rose, Ferrill F., Jr.
2016-01-01
The dramatic decrease in the cost of sequencing a human genome is leading to an era in which a wide range of students will benefit from having an understanding of human genetic variation. Since over 90% of sequence variation between humans is in the form of single nucleotide polymorphisms (SNPs), a laboratory exercise has been devised in order to…
USDA-ARS?s Scientific Manuscript database
The association of single nucleotide polymorphisms (SNPs) of calpastatin (CAST) gene with shear force of 2.54 cm steaks from M. longissimus dorsi from Gannan yaks (Bos grunniens, n=181) was studied. Yaks were harvested at 2, 3, and 4 yr of age (n=51, 59, and 71, respectively), and samples of each ya...
Winterhagen, Patrick; Wünsche, Jens-Norbert
2016-05-01
Within a polyembryonic mango seedling tree population, the genetic background of individuals should be identical because vigorous plants for cultivation are expected to develop from nucellar embryos representing maternal clones. Due to the fact that the mango cultivar 'Hôi' is assigned to the polyembryonic ecotype, an intra-cultivar variability of ethylene receptor genes was unexpected. Ethylene receptors in plants are conserved, but the number of receptors or receptor isoforms is variable regarding different plant species. However, it is shown here that the ethylene receptor MiETR1 is present in various isoforms within the mango cultivar 'Hôi'. The investigation of single nucleotide polymorphisms revealed that different MiETR1 isoforms can not be discriminated simply by individual single nucleotide exchanges but by the specific arrangement of single nucleotide polymorphisms at certain positions in the exons of MiETR1. Furthermore, an MiETR1 isoform devoid of introns in the genomic sequence was identified. The investigation demonstrates some limitations of high resolution melting and ScreenClust analysis and points out the necessity of sequencing to identify individual isoforms and to determine the variability within the tree population.
Method: a single nucleotide polymorphism genotyping method for Wheat streak mosaic virus.
Rogers, Stephanie M; Payton, Mark; Allen, Robert W; Melcher, Ulrich; Carver, Jesse; Fletcher, Jacqueline
2012-05-17
The September 11, 2001 attacks on the World Trade Center and the Pentagon increased the concern about the potential for terrorist attacks on many vulnerable sectors of the US, including agriculture. The concentrated nature of crops, easily obtainable biological agents, and highly detrimental impacts make agroterrorism a potential threat. Although procedures for an effective criminal investigation and attribution following such an attack are available, important enhancements are still needed, one of which is the capability for fine discrimination among pathogen strains. The purpose of this study was to develop a molecular typing assay for use in a forensic investigation, using Wheat streak mosaic virus (WSMV) as a model plant virus. This genotyping technique utilizes single base primer extension to generate a genetic fingerprint. Fifteen single nucleotide polymorphisms (SNPs) within the coat protein and helper component-protease genes were selected as the genetic markers for this assay. Assay optimization and sensitivity testing was conducted using synthetic targets. WSMV strains and field isolates were collected from regions around the world and used to evaluate the assay for discrimination. The assay specificity was tested against a panel of near-neighbors consisting of genetic and environmental near-neighbors. Each WSMV strain or field isolate tested produced a unique SNP fingerprint, with the exception of three isolates collected within the same geographic location that produced indistinguishable fingerprints. The results were consistent among replicates, demonstrating the reproducibility of the assay. No SNP fingerprints were generated from organisms included in the near-neighbor panel, suggesting the assay is specific for WSMV. Using synthetic targets, a complete profile could be generated from as low as 7.15 fmoles of cDNA. The molecular typing method presented is one tool that could be incorporated into the forensic science tool box after a thorough validation study. This method incorporates molecular biology techniques that are already well established in research and diagnostic laboratories, allowing for an easy introduction of this method into existing laboratories. single nucleotide polymorphisms, genotyping, plant pathology, viruses, microbial forensics, Single base primer extension, SNaPshot Multiplex Kit.
Method: a single nucleotide polymorphism genotyping method for Wheat streak mosaic virus
2012-01-01
Background The September 11, 2001 attacks on the World Trade Center and the Pentagon increased the concern about the potential for terrorist attacks on many vulnerable sectors of the US, including agriculture. The concentrated nature of crops, easily obtainable biological agents, and highly detrimental impacts make agroterrorism a potential threat. Although procedures for an effective criminal investigation and attribution following such an attack are available, important enhancements are still needed, one of which is the capability for fine discrimination among pathogen strains. The purpose of this study was to develop a molecular typing assay for use in a forensic investigation, using Wheat streak mosaic virus (WSMV) as a model plant virus. Method This genotyping technique utilizes single base primer extension to generate a genetic fingerprint. Fifteen single nucleotide polymorphisms (SNPs) within the coat protein and helper component-protease genes were selected as the genetic markers for this assay. Assay optimization and sensitivity testing was conducted using synthetic targets. WSMV strains and field isolates were collected from regions around the world and used to evaluate the assay for discrimination. The assay specificity was tested against a panel of near-neighbors consisting of genetic and environmental near-neighbors. Result Each WSMV strain or field isolate tested produced a unique SNP fingerprint, with the exception of three isolates collected within the same geographic location that produced indistinguishable fingerprints. The results were consistent among replicates, demonstrating the reproducibility of the assay. No SNP fingerprints were generated from organisms included in the near-neighbor panel, suggesting the assay is specific for WSMV. Using synthetic targets, a complete profile could be generated from as low as 7.15 fmoles of cDNA. Conclusion The molecular typing method presented is one tool that could be incorporated into the forensic science tool box after a thorough validation study. This method incorporates molecular biology techniques that are already well established in research and diagnostic laboratories, allowing for an easy introduction of this method into existing laboratories. Keywords: single nucleotide polymorphisms, genotyping, plant pathology, viruses, microbial forensics, Single base primer extension, SNaPshot Multiplex Kit PMID:22594601
Dimeric PROP1 binding to diverse palindromic TAAT sequences promotes its transcriptional activity.
Nakayama, Michie; Kato, Takako; Susa, Takao; Sano, Akiko; Kitahara, Kousuke; Kato, Yukio
2009-08-13
Mutations in the Prop1 gene are responsible for murine Ames dwarfism and human combined pituitary hormone deficiency with hypogonadism. Recently, we reported that PROP1 is a possible transcription factor for gonadotropin subunit genes through plural cis-acting sites composed of AT-rich sequences containing a TAAT motif which differs from its consensus binding sequence known as PRDQ9 (TAATTGAATTA). This study aimed to verify the binding specificity and sequence of PROP1 by applying the method of SELEX (Systematic Evolution of Ligands by EXponential enrichment), EMSA (electrophoretic mobility shift assay) and transient transfection assay. SELEX, after 5, 7 and 9 generations of selection using a random sequence library, showed that nucleotides containing one or two TAAT motifs were accumulated and accounted for 98.5% at the 9th generation. Aligned sequences and EMSA demonstrated that PROP1 binds preferentially to 11 nucleotides composed of an inverted TAAT motif separated by 3 nucleotides with variation in the half site of palindromic TAAT motifs and with preferential requirement of T at the nucleotide number 5 immediately 3' to a TAAT motif. Transient transfection assay demonstrated first that dimeric binding of PROP1 to an inverted TAAT motif and its cognates resulted in transcriptional activation, whereas monomeric binding of PROP1 to a single TAAT motif and an inverted ATTA motif did not mediate activation. Thus, this study demonstrated that dimeric binding of PROP1 is able to recognize diverse palindromic TAAT sequences separated by 3 nucleotides and to exhibit its transcriptional activity.
Dynamic variable selection in SNP genotype autocalling from APEX microarray data.
Podder, Mohua; Welch, William J; Zamar, Ruben H; Tebbutt, Scott J
2006-11-30
Single nucleotide polymorphisms (SNPs) are DNA sequence variations, occurring when a single nucleotide--adenine (A), thymine (T), cytosine (C) or guanine (G)--is altered. Arguably, SNPs account for more than 90% of human genetic variation. Our laboratory has developed a highly redundant SNP genotyping assay consisting of multiple probes with signals from multiple channels for a single SNP, based on arrayed primer extension (APEX). This mini-sequencing method is a powerful combination of a highly parallel microarray with distinctive Sanger-based dideoxy terminator sequencing chemistry. Using this microarray platform, our current genotype calling system (known as SNP Chart) is capable of calling single SNP genotypes by manual inspection of the APEX data, which is time-consuming and exposed to user subjectivity bias. Using a set of 32 Coriell DNA samples plus three negative PCR controls as a training data set, we have developed a fully-automated genotyping algorithm based on simple linear discriminant analysis (LDA) using dynamic variable selection. The algorithm combines separate analyses based on the multiple probe sets to give a final posterior probability for each candidate genotype. We have tested our algorithm on a completely independent data set of 270 DNA samples, with validated genotypes, from patients admitted to the intensive care unit (ICU) of St. Paul's Hospital (plus one negative PCR control sample). Our method achieves a concordance rate of 98.9% with a 99.6% call rate for a set of 96 SNPs. By adjusting the threshold value for the final posterior probability of the called genotype, the call rate reduces to 94.9% with a higher concordance rate of 99.6%. We also reversed the two independent data sets in their training and testing roles, achieving a concordance rate up to 99.8%. The strength of this APEX chemistry-based platform is its unique redundancy having multiple probes for a single SNP. Our model-based genotype calling algorithm captures the redundancy in the system considering all the underlying probe features of a particular SNP, automatically down-weighting any 'bad data' corresponding to image artifacts on the microarray slide or failure of a specific chemistry. In this regard, our method is able to automatically select the probes which work well and reduce the effect of other so-called bad performing probes in a sample-specific manner, for any number of SNPs.
Zimmermann, Aleksandra; Greco, Roberto; Walker, Isabel; Horak, Jeannie; Cavazzini, Alberto; Lämmerhofer, Michael
2014-08-08
Synthetic oligonucleotides gain increasing importance in new therapeutic concepts and as probes in biological sciences. If pharmaceutical-grade purities are required, chromatographic purification using ion-pair reversed-phase chromatography is commonly carried out. However, separation selectivity for structurally closely related impurities is often insufficient, especially at high sample loads. In this study, a "mixed-mode" reversed-phase/weak anion exchanger stationary phase has been investigated as an alternative tool for chromatographic separation of synthetic oligonucleotides with minor sequence variations. The employed mixed-mode phase shows great flexibility in method development. It has been run in various gradient elution modes, viz. one, two or three parameter (mixed) gradients (altering buffer pH, buffer concentration, and organic modifier) to find optimal elution conditions and gain further insight into retention mechanisms. Compared to ion-pair reversed-phase and mere anion-exchange separation, enhanced selectivities were observed with the mixed-mode phase for 20-23 nucleotide (nt) long oligonucleotides with similar sequences. Oligonucleotides differing by 1, 2 or 3 nucleotides in length could be readily resolved and separation factors for single nucleotide replacements declined in the order Cytosine (C)/Guanine (G)>Adenine (A)/Guanine∼Guanine/Thymine (T)>Adenine/Cytosine∼Cytosine/Thymine>Adenine/Thymine. Selectivities were larger when the modification was at the 3' terminal-end, declined when it was in the middle of the sequence and was smallest when it was located at the 5' terminus. Due to the lower surface area of the 200Å pore size mixed-mode stationary phase compared to the corresponding 100Å material, lower retention times with equal selectivities under milder elution conditions were achievable. Considering high sample loading capacities of the mixed-mode anion-exchanger phase, it should have great potential for chromatographic oligonucleotide separation and purification. Copyright © 2014 Elsevier B.V. All rights reserved.
The GS (genetic selection) Principle.
Abel, David L
2009-01-01
The GS (Genetic Selection) Principle states that biological selection must occur at the nucleotide-sequencing molecular-genetic level of 3'5' phosphodiester bond formation. After-the-fact differential survival and reproduction of already-living phenotypic organisms (ordinary natural selection) does not explain polynucleotide prescription and coding. All life depends upon literal genetic algorithms. Even epigenetic and "genomic" factors such as regulation by DNA methylation, histone proteins and microRNAs are ultimately instructed by prior linear digital programming. Biological control requires selection of particular configurable switch-settings to achieve potential function. This occurs largely at the level of nucleotide selection, prior to the realization of any integrated biofunction. Each selection of a nucleotide corresponds to the setting of two formal binary logic gates. The setting of these switches only later determines folding and binding function through minimum-free-energy sinks. These sinks are determined by the primary structure of both the protein itself and the independently prescribed sequencing of chaperones. The GS Principle distinguishes selection of existing function (natural selection) from selection for potential function (formal selection at decision nodes, logic gates and configurable switch-settings).
Khodakov, Dmitriy A; Khodakova, Anastasia S; Huang, David M; Linacre, Adrian; Ellis, Amanda V
2015-03-04
Single nucleotide polymorphisms (SNPs) are a prime source of genetic diversity. Discriminating between different SNPs provides an enormous leap towards the better understanding of the uniqueness of biological systems. Here we report on a new approach for SNP discrimination using toehold-mediated DNA strand displacement. The distinctiveness of the approach is based on the combination of both 3- and 4-way branch migration mechanisms, which allows for reliable discrimination of SNPs within double-stranded DNA generated from real-life human mitochondrial DNA samples. Aside from the potential diagnostic value, the current study represents an additional way to control the strand displacement reaction rate without altering other reaction parameters and provides new insights into the influence of single nucleotide substitutions on 3- and 4-way branch migration efficiency and kinetics.
Single nucleotide polymorphism analysis using different colored dye dimer probes
NASA Astrophysics Data System (ADS)
Marmé, Nicole; Friedrich, Achim; Denapaite, Dalia; Hakenbeck, Regine; Knemeyer, Jens-Peter
2006-09-01
Fluorescence quenching by dye dimer formation has been utilized to develop hairpin-structured DNA probes for the detection of a single nucleotide polymorphism (SNP) in the penicillin target gene pbp2x, which is implicated in the penicillin resistance of Streptococcus pneumoniae. We designed two specific DNA probes for the identification of the pbp2x genes from a penicillin susceptible strain R6 and a resistant strain Streptococcus mitis 661 using green-fluorescent tetramethylrhodamine (TMR) and red-fluorescent DY-636, respectively. Hybridization of each of the probes to its respective target DNA sequence opened the DNA hairpin probes, consequently breaking the nonfluorescent dye dimers into fluorescent species. This hybridization of the target with the hairpin probe achieved single nucleotide specific detection at nanomolar concentrations via increased fluorescence.
Detection of nucleotide-specific CRISPR/Cas9 modified alleles using multiplex ligation detection
KC, R.; Srivastava, A.; Wilkowski, J. M.; Richter, C. E.; Shavit, J. A.; Burke, D. T.; Bielas, S. L.
2016-01-01
CRISPR/Cas9 genome-editing has emerged as a powerful tool to create mutant alleles in model organisms. However, the precision with which these mutations are created has introduced a new set of complications for genotyping and colony management. Traditional gene-targeting approaches in many experimental organisms incorporated exogenous DNA and/or allele specific sequence that allow for genotyping strategies based on binary readout of PCR product amplification and size selection. In contrast, alleles created by non-homologous end-joining (NHEJ) repair of double-stranded DNA breaks generated by Cas9 are much less amenable to such strategies. Here we describe a novel genotyping strategy that is cost effective, sequence specific and allows for accurate and efficient multiplexing of small insertion-deletions and single-nucleotide variants characteristic of CRISPR/Cas9 edited alleles. We show that ligation detection reaction (LDR) can be used to generate products that are sequence specific and uniquely detected by product size and/or fluorescent tags. The method works independently of the model organism and will be useful for colony management as mutant alleles differing by a few nucleotides become more prevalent in experimental animal colonies. PMID:27557703
Kochanowski, N; Blanchard, F; Cacan, R; Chirat, F; Guedon, E; Marc, A; Goergen, J-L
2006-01-15
Analysis of intracellular nucleotide and nucleotide sugar contents is essential in studying protein glycosylation of mammalian cells. Nucleotides and nucleotide sugars are the donor substrates of glycosyltransferases, and nucleotides are involved in cellular energy metabolism and its regulation. A sensitive and reproducible ion-pair reverse-phase high-performance liquid chromatography (RP-HPLC) method has been developed, allowing the direct and simultaneous detection and quantification of some essential nucleotides and nucleotide sugars. After a perchloric acid extraction, 13 molecules (8 nucleotides and 5 nucleotide sugars) were separated, including activated sugars such as UDP-glucose, UDP-galactose, GDP-mannose, UDP-N-acetylglucosamine, and UDP-N-acetylgalactosamine. To validate the analytical parameters, the reproducibility, linearity of calibration curves, detection limits, and recovery were evaluated for standard mixtures and cell extracts. The developed method is capable of resolving picomolar quantities of nucleotides and nucleotide sugars in a single chromatographic run. The HPLC method was then applied to quantify intracellular levels of nucleotides and nucleotide sugars of Chinese hamster ovary (CHO) cells cultivated in a bioreactor batch process. Evolutions of the titers of nucleotides and nucleotide sugars during the batch process are discussed.
Peptide biomarkers used for the selective breeding of a complex polygenic trait in honey bees.
Guarna, M Marta; Hoover, Shelley E; Huxter, Elizabeth; Higo, Heather; Moon, Kyung-Mee; Domanski, Dominik; Bixby, Miriam E F; Melathopoulos, Andony P; Ibrahim, Abdullah; Peirson, Michael; Desai, Suresh; Micholson, Derek; White, Rick; Borchers, Christoph H; Currie, Robert W; Pernal, Stephen F; Foster, Leonard J
2017-08-21
We present a novel way to select for highly polygenic traits. For millennia, humans have used observable phenotypes to selectively breed stronger or more productive livestock and crops. Selection on genotype, using single-nucleotide polymorphisms (SNPs) and genome profiling, is also now applied broadly in livestock breeding programs; however, selection on protein/peptide or mRNA expression markers has not yet been proven useful. Here we demonstrate the utility of protein markers to select for disease-resistant hygienic behavior in the European honey bee (Apis mellifera L.). Robust, mechanistically-linked protein expression markers, by integrating cis- and trans- effects from many genomic loci, may overcome limitations of genomic markers to allow for selection. After three generations of selection, the resulting marker-selected stock outperformed an unselected benchmark stock in terms of hygienic behavior, and had improved survival when challenged with a bacterial disease or a parasitic mite, similar to bees selected using a phenotype-based assessment for this trait. This is the first demonstration of the efficacy of protein markers for industrial selective breeding in any agricultural species, plant or animal.
USDA-ARS?s Scientific Manuscript database
Using linear regression models, we studied the main and two-way interaction effects of the predictor variables gender, age, BMI, and 64 folate/vitamin B-12/homocysteine/lipid/cholesterol-related single nucleotide polymorphisms (SNP) on log-transformed plasma homocysteine normalized by red blood cell...
ERIC Educational Resources Information Center
Gadow, Kenneth D.; Roohi, Jasmin; DeVincent, Carla J.; Kirsch, Sarah; Hatchwell, Eli
2010-01-01
Investigated association of single nucleotide polymorphism (SNP) rs301430 in glutamate transporter gene ("SLC1A1") with severity of repetitive behaviors (obsessive-compulsive behaviors, tics) and anxiety in children with autism spectrum disorder (ASD). Mothers and/or teachers completed a validated DSM-IV-referenced rating scale for 67 children…
USDA-ARS?s Scientific Manuscript database
The periodic need to restock reagent pools for genotyping chips provides an opportunity to increase the number of single-nucleotide polymorphisms (SNP) on a chip at no increase in cost. A high-density chip with >140,000 SNP has been developed by GeneSeek Inc. (Lincoln, NE) to increase accuracy of ge...
Keith R. Merrill; Craig E. Coleman; Susan E. Meyer; Elizabeth A. Leger; Katherine A. Collins
2016-01-01
Premise of the study: Bromus tectorum (Poaceae) is an annual grass species that is invasive in many areas of the world but most especially in the U.S. Intermountain West. Single-nucleotide polymorphism (SNP) markers were developed for use in investigating the geospatial and ecological diversity of B. tectorum in the Intermountain West to better understand the...
ERIC Educational Resources Information Center
Zhang, Xu; Shao, Meng; Gao, Lu; Zhao, Yuanyuan; Sun, Zixuan; Zhou, Liping; Yan, Yongmin; Shao, Qixiang; Xu, Wenrong; Qian, Hui
2017-01-01
Laboratory exercise is helpful for medical students to understand the basic principles of molecular biology and to learn about the practical applications of molecular biology. We have designed a lab course on molecular biology about the determination of single nucleotide polymorphism (SNP) in human REV3 gene, the product of which is a subunit of…
Brimacombe, M.; Hazbon, M.; Motiwala, A. S.; Alland, D.
2007-01-01
A single-nucleotide polymorphism-based cluster grouping (SCG) classification system for Mycobacterium tuberculosis was used to examine antibiotic resistance type and resistance mutations in relationship to specific evolutionary lineages. Drug resistance and resistance mutations were seen across all SCGs. SCG-2 had higher proportions of katG codon 315 mutations and resistance to four drugs. PMID:17846140
Archaic Adaptive Introgression in TBX15/WARS2
Gokhman, David; Fumagalli, Matteo; Ko, Amy; Hansen, Torben; Moltke, Ida; Albrechtsen, Anders; Carmel, Liran; Huerta-Sánchez, Emilia
2017-01-01
A recent study conducted the first genome-wide scan for selection in Inuit from Greenland using single nucleotide polymorphism chip data. Here, we report that selection in the region with the second most extreme signal of positive selection in Greenlandic Inuit favored a deeply divergent haplotype that is closely related to the sequence in the Denisovan genome, and was likely introgressed from an archaic population. The region contains two genes, WARS2 and TBX15, and has previously been associated with adipose tissue differentiation and body-fat distribution in humans. We show that the adaptively introgressed allele has been under selection in a much larger geographic region than just Greenland. Furthermore, it is associated with changes in expression of WARS2 and TBX15 in multiple tissues including the adrenal gland and subcutaneous adipose tissue, and with regional DNA methylation changes in TBX15. PMID:28007980
Stranges, P. Benjamin; Palla, Mirkó; Kalachikov, Sergey; Nivala, Jeff; Dorwart, Michael; Trans, Andrew; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Tao, Chuanjuan; Morozova, Irina; Li, Zengmin; Shi, Shundi; Aberra, Aman; Arnold, Cleoma; Yang, Alexander; Aguirre, Anne; Harada, Eric T.; Korenblum, Daniel; Pollard, James; Bhat, Ashwini; Gremyachinskiy, Dmitriy; Bibillo, Arek; Chen, Roger; Davis, Randy; Russo, James J.; Fuller, Carl W.; Roever, Stefan; Ju, Jingyue; Church, George M.
2016-01-01
Scalable, high-throughput DNA sequencing is a prerequisite for precision medicine and biomedical research. Recently, we presented a nanopore-based sequencing-by-synthesis (Nanopore-SBS) approach, which used a set of nucleotides with polymer tags that allow discrimination of the nucleotides in a biological nanopore. Here, we designed and covalently coupled a DNA polymerase to an α-hemolysin (αHL) heptamer using the SpyCatcher/SpyTag conjugation approach. These porin–polymerase conjugates were inserted into lipid bilayers on a complementary metal oxide semiconductor (CMOS)-based electrode array for high-throughput electrical recording of DNA synthesis. The designed nanopore construct successfully detected the capture of tagged nucleotides complementary to a DNA base on a provided template. We measured over 200 tagged-nucleotide signals for each of the four bases and developed a classification method to uniquely distinguish them from each other and background signals. The probability of falsely identifying a background event as a true capture event was less than 1.2%. In the presence of all four tagged nucleotides, we observed sequential additions in real time during polymerase-catalyzed DNA synthesis. Single-polymerase coupling to a nanopore, in combination with the Nanopore-SBS approach, can provide the foundation for a low-cost, single-molecule, electronic DNA-sequencing platform. PMID:27729524
Heated oligonucleotide ligation assay (HOLA): an affordable single nucleotide polymorphism assay.
Black, W C; Gorrochotegui-Escalante, N; Duteau, N M
2006-03-01
Most single nucleotide polymorphism (SNP) detection requires expensive equipment and reagents. The oligonucleotide ligation assay (OLA) is an inexpensive SNP assay that detects ligation between a biotinylated "allele-specific detector" and a 3' fluorescein-labeled "reporter" oligonucleotide. No ligation occurs unless the 3' detector nucleotide is complementary to the SNP nucleotide. The original OLA used chemical denaturation and neutralization. Heated OLA (HOLA) instead uses a thermal stable ligase and cycles of denaturing and hybridization for ligation and SNP detection. The cost per genotype is approximately US$1.25 with two-allele SNPs or approximately US$1.75 with three-allele SNPs. We illustrate the development of HOLA for SNP detection in the Early Trypsin and Abundant Trypsin loci in the mosquito Aedes aegypti (L.) and at the a-glycerophosphate dehydrogenase locus in the mosquito Anopheles gambiae s.s.
NASA Technical Reports Server (NTRS)
Vercoutere, W.; Solbrig, A.; DeGuzman, V.; Deamer, D.; Akeson, M.
2003-01-01
We use a biological nano-scale pore to distinguish among individual DNA hairpins that differ by a single site of oxidation or a nick in the sugar-phosphate backbone. In earlier work we showed that the protein ion channel alpha-hemolysin can be used as a detector to distinguish single-stranded from double-stranded DNA, single base pair and single nucleotide differences. This resolution is in part a result of sensitivity to structural changes that influence the molecular dynamics of nucleotides within DNA. The strand cleavage products we examined here included a 5-base-pair (5-bp) hairpin with a 5-prime five-nucleotide overhang, and a complementary five-nucleotide oligomer. These produced predictable shoulder-spike and rapid near-full blockade signatures, respectively. When combined, strand annealing was monitored in real time. The residual current level dropped to a lower discrete level in the shoulder-spike blockade signatures, and the duration lengthened. However, these blockade signatures had a shorter duration than the unmodified l0bp hairpin. To test the pore sensitivity to nucleotide oxidation, we examined a 9-bp hairpin with a terminal 8-oxo-deoxyguanosine (8-oxo-dG), or a penultimate 8-oxo-dG. Each produced blockade signatures that differed from the otherwise identical control 9bp hairpins. This study showed that DNA structure is modified sufficiently by strand cleavage or oxidation damage at a single site to alter in a predictable manner the ionic current blockade signatures produced. This technique improves the ability to assess damage to DNA, and can provide a simple means to help characterize the risks of radiation exposure. It may also provide a method to test radiation protection.
Yamada, Yoshiji; Sakuma, Jun; Takeuchi, Ichiro; Yasukochi, Yoshiki; Kato, Kimihiko; Oguri, Mitsutoshi; Fujimaki, Tetsuo; Horibe, Hideki; Muramatsu, Masaaki; Sawabe, Motoji; Fujiwara, Yoshinori; Taniguchi, Yu; Obuchi, Shuichi; Kawai, Hisashi; Shinkai, Shoji; Mori, Seijiro; Arai, Tomio; Tanaka, Masashi
2017-06-13
We have performed exome-wide association studies to identify genetic variants that influence body mass index or confer susceptibility to obesity or metabolic syndrome in Japanese. The exome-wide association study for body mass index included 12,890 subjects, and those for obesity and metabolic syndrome included 12,968 subjects (3954 individuals with obesity, 9014 controls) and 6817 subjects (3998 individuals with MetS, 2819 controls), respectively. Exome-wide association studies were performed with Illumina HumanExome-12 DNA Analysis BeadChip or Infinium Exome-24 BeadChip arrays. The relation of genotypes of single nucleotide polymorphisms to body mass index was examined by linear regression analysis, and that of allele frequencies of single nucleotide polymorphisms to obesity or metabolic syndrome was evaluated with Fisher's exact test. The exome-wide association studies identified six, 11, and 40 single nucleotide polymorphisms as being significantly associated with body mass index, obesity (P <1.21 × 10-6), or metabolic syndrome (P <1.20 × 10-6), respectively. Subsequent multivariable logistic regression analysis with adjustment for age and sex revealed that three and five single nucleotide polymorphisms were related (P < 0.05) to obesity or metabolic syndrome, respectively, with one of these latter polymorphisms-rs7350481 (C/T) at chromosome 11q23.3-also being significantly (P < 3.13 × 10-4) associated with metabolic syndrome. The polymorphism rs7350481 may thus be a novel susceptibility locus for metabolic syndrome in Japanese. In addition, single nucleotide polymorphisms in three genes (CROT, TSC1, RIN3) and at four loci (ANKK1, ZNF804B, CSRNP3, 17p11.2) were implicated as candidate determinants of obesity and metabolic syndrome, respectively.
Dai, Weiran; Ye, Ziliang; Lu, Haili; Su, Qiang; Li, Hui; Li, Lang
2018-02-23
The results showed that there was a certain correlation between the single nucleotide polymorphism of IL-10-1082G/A and rheumatic heart disease, but there was no systematic study to verify this conclusion. Systematic review of the association between single nucleotide polymorphism of IL-10-1082G/A locus and rheumatic heart disease. Computer retrieval PubMed, EMbase, Cochrane Library, CBM, CNKI, VIP and Data WanFang, the retrieval time limit from inception to June 2017. A case control study of single nucleotide polymorphisms and rheumatic heart disease in patients with rheumatic heart disease in the IL-10-1082G/A was collected. Two researchers independently screened the literature, extracted data and evaluated the risk of bias in the study, and using RevMan5.3 software for data analysis. A total of 3 case control studies were included, including 318 patients with rheumatic heart disease and 502 controls. Meta-analysis showed that there was no correlation between IL-10-1082G/A gene polymorphism and rheumatic heart disease [AA+AG VS GG: OR = 0.62, 95% CI (0.28, 1.39), P = 0.25; AA VS AG+GG: OR = 0.73, 95% CI (0.54, 1.00), P = 0.05; AA VS GG: OR = 0.70, 95% CI(0.47, 1.05), P = 0.08; AG VS GG: OR = 0.65, 95% CI (0.22, 1.92), P = 0.43; A VS G: OR = 0.87, 95% CI (0.71, 1.06), P = 0.17]. When AA is a recessive gene, the single nucleotide polymorphism of IL-10-1082G/A is associated with the presence of rheumatic heart disease. Due to the limitations of the quantity and quality of the included literatures, the further research results were still needed.
NASA Astrophysics Data System (ADS)
Schechinger, Linda Sue
I. To investigate the delivery of nucleotide-based drugs, we are studying molecular recognition of nucleotide derivatives in environments that are similar to cell membranes. The Nowick group previously discovered that membrane-like surfactant micelles tetradecyltrimethylammonium bromide (TTAB) micelle facilitate molecular of adenosine monophosphate (AMP) recognition. The micelles bind nucleotides by means of electrostatic interactions and hydrogen bonding. We observed binding by following 1H NMR chemical shift changes of unique hexylthymine protons upon addition of AMP. Cationic micelles are required for binding. In surfactant-free or sodium dodecylsulfate solutions, no hydrogen bonding is observed. These observations suggest that the cationic surfactant headgroups bind the nucleotide phosphate group, while the intramicellar base binds the nucleotide base. The micellar system was optimized to enhance binding and selectivity for adenosine nucleotides. The selectivity for adenosine and the number of phosphate groups attached to the adenosine were both investigated. Addition of cytidine, guanidine, or uridine monophosphates, results in no significant downfield shifting of the NH resonance. Selectivity for the phosphate is limited, since adenosine mono-, di-, and triphosphates all have similar binding constants. We successfully achieved molecular recognition of adenosine nucleotides in micellar environments. There is significant difference in the binding interactions between the adenosine nucleotides and three other natural nucleotides. II. The UCI Chemistry Outreach Program (UCICOP) addresses the declining interest of the nations youth for science. UCICOP brings fun and exciting chemistry experiments to local high schools, to remind students that science is fun and has many practical uses. Volunteer students and alumni of UCI perform the demonstrations using scripts and material provided by UCICOP. The preparation of scripts and materials is done by two coordinators. These coordinators organize the program and provide continuity to the program. The success of UCICOP can be measured by the high praise and gratitude expressed by the teachers, students and volunteers.
Selective fluorescence quenching of 2,3-diazabicyclo[2.2.2]oct-2-ene by nucleotides.
Marquez, Cesar; Pischel, Uwe; Nau, Werner M
2003-10-16
[reaction: see text] The fluorescence quenching of 2,3-diazabicyclo[2.2.2]oct-2-ene (DBO) by nucleotides has been studied. The quenching mechanism was analyzed on the basis of deuterium isotope effects, tendencies for exciplex formation, and the quenching efficiency in the presence of a molecular container (cucurbit[7]uril). Exciplex-induced quenching appears to prevail for adenosine, cytidine, and uridine, while hydrogen abstraction becomes competitive for thymidine and guanosine. Compared to other fluorescent probes, DBO responds very selectively to the type of nucleotide.
OmpF, a nucleotide-sensing nanoprobe, computational evaluation of single channel activities
NASA Astrophysics Data System (ADS)
Abdolvahab, R. H.; Mobasheri, H.; Nikouee, A.; Ejtehadi, M. R.
2016-09-01
The results of highthroughput practical single channel experiments should be formulated and validated by signal analysis approaches to increase the recognition precision of translocating molecules. For this purpose, the activities of the single nano-pore forming protein, OmpF, in the presence of nucleotides were recorded in real time by the voltage clamp technique and used as a means for nucleotide recognition. The results were analyzed based on the permutation entropy of current Time Series (TS), fractality, autocorrelation, structure function, spectral density, and peak fraction to recognize each nucleotide, based on its signature effect on the conductance, gating frequency and voltage sensitivity of channel at different concentrations and membrane potentials. The amplitude and frequency of ion current fluctuation increased in the presence of Adenine more than Cytosine and Thymine in milli-molar (0.5 mM) concentrations. The variance of the current TS at various applied voltages showed a non-monotonic trend whose initial increasing slope in the presence of Thymine changed to a decreasing one in the second phase and was different from that of Adenine and Cytosine; e.g., by increasing the voltage from 40 to 140 mV in the 0.5 mM concentration of Adenine or Cytosine, the variance decreased by one third while for the case of Thymine it was doubled. Moreover, according to the structure function of TS, the fractality of current TS differed as a function of varying membrane potentials (pd) and nucleotide concentrations. Accordingly, the calculated permutation entropy of the TS, validated the biophysical approach defined for the recognition of different nucleotides at various concentrations, pd's and polarities. Thus, the promising outcomes of the combined experimental and theoretical methodologies presented here can be implemented as a complementary means in pore-based nucleotide recognition approaches.
Huebner, Claudia; Ferguson, Lynnette R; Han, Dug Yeo; Philpott, Martin; Barclay, Murray L; Gearry, Richard B; McCulloch, Alan; Demmers, Pieter S; Browning, Brian L
2009-01-01
Background The nucleotide-binding oligomerization domain containing 1 (NOD1) gene encodes a pattern recognition receptor that senses pathogens, leading to downstream responses characteristic of innate immunity. We investigated the role of NOD1 single nucleotide polymorphisms (SNPs) on IBD risk in a New Zealand Caucasian population, and studied Nod1 expression in response to bacterial invasion in the Caco2 cell line. Findings DNA samples from 388 Crohn's disease (CD), 405 ulcerative colitis (UC), 27 indeterminate colitis patients and 201 randomly selected controls, from Canterbury, New Zealand were screened for 3 common SNPs in NOD1, using the MassARRAY® iPLEX Gold assay. Transcriptional activation of the protein produced by NOD1 (Nod1) was studied after infection of Caco2 cells with Escherichia coli LF82. Carrying the rs2075818 G allele decreased the risk of CD (OR = 0.66, 95% CI = 0.50–0.88, p < 0.002) but not UC. There was an increased frequency of the three SNP (rs2075818, rs2075822, rs2907748) haplotype, CTG (p = 0.004) and a decreased frequency of the GTG haplotype (p = 0.02).in CD. The rs2075822 CT or TT genotypes were at an increased frequency (genotype p value = 0.02), while the rs2907748 AA or AG genotypes showed decreased frequencies in UC (p = 0.04), but not in CD. Functional assays showed that Nod1 is produced 6 hours after bacterial invasion of the Caco2 cell line. Conclusion The NOD1 gene is important in signalling invasion of colonic cells by pathogenic bacteria, indicative of its' key role in innate immunity. Carrying specific SNPs in this gene significantly modifies the risk of CD and/or UC in a New Zealand Caucasian population. PMID:19327158
2014-01-01
Background During the domestication of crops, individual plants with traits desirable for human needs have been selected from their wild progenitors. Consequently, genetic and nucleotide diversity of genes associated with these selected traits in crop plants are expected to be lower than their wild progenitors. In the present study, we surveyed the pattern of nucleotide diversity of two selected trait specific genes, Wx and OsC1, which regulate amylose content and apiculus coloration respectively in cultivated rice varieties. The analyzed samples were collected from a wide geographic area in Northeast (NE) India, and included contrasting phenotypes considered to be associated with selected genes, namely glutinous and nonglutinous grains and colored and colorless apiculus. Results No statistically significant selection signatures were detected in both Wx and OsC1gene sequences. However, low level of selection that varied across the length of each gene was evident. The glutinous type varieties showed higher levels of nucleotide diversity at the Wx locus (πtot = 0.0053) than nonglutinous type varieties (πtot = 0.0043). The OsC1 gene revealed low levels of selection among the colorless apiculus varieties with lower nucleotide diversity (πtot = 0.0010) than in the colored apiculus varieties (πtot = 0.0023). Conclusions The results revealed that functional mutations at Wx and OsC1genes considered to be associated with specific phenotypes do not necessarily correspond to the phenotypes in indigenous rice varieties in NE India. This suggests that other than previously reported genomic regions may also be involved in determination of these phenotypes. PMID:24935343
Global diversity, population stratification, and selection of human copy number variation
Sudmant, Peter H.; Mallick, Swapan; Nelson, Bradley J.; Hormozdiari, Fereydoun; Krumm, Niklas; Huddleston, John; Coe, Bradley P.; Baker, Carl; Nordenfelt, Susanne; Bamshad, Michael; Jorde, Lynn B.; Posukh, Olga L.; Sahakyan, Hovhannes; Watkins, W. Scott; Yepiskoposyan, Levon; Abdullah, M. Syafiq; Bravi, Claudio M.; Capelli, Cristian; Hervig, Tor; Wee, Joseph T. S.; Tyler-Smith, Chris; van Driem, George; Romero, Irene Gallego; Jha, Aashish R.; Karachanak-Yankova, Sena; Toncheva, Draga; Comas, David; Henn, Brenna; Kivisild, Toomas; Ruiz-Linares, Andres; Sajantila, Antti; Metspalu, Ene; Parik, Jüri; Villems, Richard; Starikovskaya, Elena B.; Ayodo, George; Beall, Cynthia M.; Di Rienzo, Anna; Hammer, Michael; Khusainova, Rita; Khusnutdinova, Elza; Klitz, William; Winkler, Cheryl; Labuda, Damian; Metspalu, Mait; Tishkoff, Sarah A.; Dryomov, Stanislav; Sukernik, Rem; Patterson, Nick; Reich, David; Eichler, Evan E.
2015-01-01
In order to explore the diversity and selective signatures of duplication and deletion human copy number variants (CNVs), we sequenced 236 individuals from 125 distinct human populations. We observed that duplications exhibit fundamentally different population genetic and selective signatures than deletions and are more likely to be stratified between human populations. Through reconstruction of the ancestral human genome, we identify megabases of DNA lost in different human lineages and pinpoint large duplications that introgressed from the extinct Denisova lineage now found at high frequency exclusively in Oceanic populations. We find that the proportion of CNV base pairs to single nucleotide variant base pairs is greater among non-Africans than it is among African populations, but we conclude that this difference is likely due to unique aspects of non-African population history as opposed to differences in CNV load. PMID:26249230
Methods and kits for nucleic acid analysis using fluorescence resonance energy transfer
Kwok, Pui-Yan; Chen, Xiangning
1999-01-01
A method for detecting the presence of a target nucleotide or sequence of nucleotides in a nucleic acid is disclosed. The method is comprised of forming an oligonucleotide labeled with two fluorophores on the nucleic acid target site. The doubly labeled oligonucleotide is formed by addition of a singly labeled dideoxynucleoside triphosphate to a singly labeled polynucleotide or by ligation of two singly labeled polynucleotides. Detection of fluorescence resonance energy transfer upon denaturation indicates the presence of the target. Kits are also provided. The method is particularly applicable to genotyping.
USDA-ARS?s Scientific Manuscript database
In a marker-trait association study we estimated the statistical significance of 65 single nucleotide polymorphisms (SNP) in 23 candidate genes on HDL levels of two independent Caucasian populations. Each population consisted of men and women and their HDL levels were adjusted for gender and body we...
Eliakim, Alon; Ben Zaken, Sigal; Meckel, Yoav; Yamin, Chen; Dror, Nitzan; Nemet, Dan
2015-12-01
We present an adolescent elite water polo player who despite a genetic predisposition to develop exercise-induced severe muscle damage due to carrying the IL-6 174C allele single-nucleotide polymorphism, developed acute rhabdomyolysis only after a vigorous out-of-water training, suggesting that water polo training may be more suitable for genetically predisposed athletes.
Olsen, Randall J.; Sitkiewicz, Izabela; Ayeras, Ara A.; Gonulal, Vedia E.; Cantu, Concepcion; Beres, Stephen B.; Green, Nicole M.; Lei, Benfang; Humbird, Tammy; Greaver, Jamieson; Chang, Ellen; Ragasa, Willie P.; Montgomery, Charles A.; Cartwright, Joiner; McGeer, Allison; Low, Donald E.; Whitney, Adeline R.; Cagle, Philip T.; Blasdel, Terry L.; DeLeo, Frank R.; Musser, James M.
2010-01-01
Single-nucleotide changes are the most common cause of natural genetic variation among members of the same species, but there is remarkably little information bearing on how they alter bacterial virulence. We recently discovered a single-nucleotide mutation in the group A Streptococcus genome that is epidemiologically associated with decreased human necrotizing fasciitis (“flesh-eating disease”). Working from this clinical observation, we find that wild-type mtsR function is required for group A Streptococcus to cause necrotizing fasciitis in mice and nonhuman primates. Expression microarray analysis revealed that mtsR inactivation results in overexpression of PrsA, a chaperonin involved in posttranslational maturation of SpeB, an extracellular cysteine protease. Isogenic mutant strains that overexpress prsA or lack speB had decreased secreted protease activity in vivo and recapitulated the necrotizing fasciitis-negative phenotype of the ΔmtsR mutant strain in mice and monkeys. mtsR inactivation results in increased PrsA expression, which in turn causes decreased SpeB secreted protease activity and reduced necrotizing fasciitis capacity. Thus, a naturally occurring single-nucleotide mutation dramatically alters virulence by dysregulating a multiple gene virulence axis. Our discovery has broad implications for the confluence of population genomics and molecular pathogenesis research. PMID:20080771
Olsen, Randall J; Sitkiewicz, Izabela; Ayeras, Ara A; Gonulal, Vedia E; Cantu, Concepcion; Beres, Stephen B; Green, Nicole M; Lei, Benfang; Humbird, Tammy; Greaver, Jamieson; Chang, Ellen; Ragasa, Willie P; Montgomery, Charles A; Cartwright, Joiner; McGeer, Allison; Low, Donald E; Whitney, Adeline R; Cagle, Philip T; Blasdel, Terry L; DeLeo, Frank R; Musser, James M
2010-01-12
Single-nucleotide changes are the most common cause of natural genetic variation among members of the same species, but there is remarkably little information bearing on how they alter bacterial virulence. We recently discovered a single-nucleotide mutation in the group A Streptococcus genome that is epidemiologically associated with decreased human necrotizing fasciitis ("flesh-eating disease"). Working from this clinical observation, we find that wild-type mtsR function is required for group A Streptococcus to cause necrotizing fasciitis in mice and nonhuman primates. Expression microarray analysis revealed that mtsR inactivation results in overexpression of PrsA, a chaperonin involved in posttranslational maturation of SpeB, an extracellular cysteine protease. Isogenic mutant strains that overexpress prsA or lack speB had decreased secreted protease activity in vivo and recapitulated the necrotizing fasciitis-negative phenotype of the DeltamtsR mutant strain in mice and monkeys. mtsR inactivation results in increased PrsA expression, which in turn causes decreased SpeB secreted protease activity and reduced necrotizing fasciitis capacity. Thus, a naturally occurring single-nucleotide mutation dramatically alters virulence by dysregulating a multiple gene virulence axis. Our discovery has broad implications for the confluence of population genomics and molecular pathogenesis research.
Wang, Xiaohua; Chen, Yanling; Thomas, Catherine L; Ding, Guangda; Xu, Ping; Shi, Dexu; Grandke, Fabian; Jin, Kemo; Cai, Hongmei; Xu, Fangsen; Yi, Bin; Broadley, Martin R; Shi, Lei
2017-08-01
Breeding crops with ideal root system architecture for efficient absorption of phosphorus is an important strategy to reduce the use of phosphate fertilizers. To investigate genetic variants leading to changes in root system architecture, 405 oilseed rape cultivars were genotyped with a 60K Brassica Infinium SNP array in low and high P environments. A total of 285 single-nucleotide polymorphisms were associated with root system architecture traits at varying phosphorus levels. Nine single-nucleotide polymorphisms corroborate a previous linkage analysis of root system architecture quantitative trait loci in the BnaTNDH population. One peak single-nucleotide polymorphism region on A3 was associated with all root system architecture traits and co-localized with a quantitative trait locus for primary root length at low phosphorus. Two more single-nucleotide polymorphism peaks on A5 for root dry weight at low phosphorus were detected in both growth systems and co-localized with a quantitative trait locus for the same trait. The candidate genes identified on A3 form a haplotype 'BnA3Hap', that will be important for understanding the phosphorus/root system interaction and for the incorporation into Brassica napus breeding programs. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Sperm Bindin Divergence under Sexual Selection and Concerted Evolution in Sea Stars.
Patiño, Susana; Keever, Carson C; Sunday, Jennifer M; Popovic, Iva; Byrne, Maria; Hart, Michael W
2016-08-01
Selection associated with competition among males or sexual conflict between mates can create positive selection for high rates of molecular evolution of gamete recognition genes and lead to reproductive isolation between species. We analyzed coding sequence and repetitive domain variation in the gene encoding the sperm acrosomal protein bindin in 13 diverse sea star species. We found that bindin has a conserved coding sequence domain structure in all 13 species, with several repeated motifs in a large central region that is similar among all sea stars in organization but highly divergent among genera in nucleotide and predicted amino acid sequence. More bindin codons and lineages showed positive selection for high relative rates of amino acid substitution in genera with gonochoric outcrossing adults (and greater expected strength of sexual selection) than in selfing hermaphrodites. That difference is consistent with the expectation that selfing (a highly derived mating system) may moderate the strength of sexual selection and limit the accumulation of bindin amino acid differences. The results implicate both positive selection on single codons and concerted evolution within the repetitive region in bindin divergence, and suggest that both single amino acid differences and repeat differences may affect sperm-egg binding and reproductive compatibility. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Replacement of RNA hairpins by in vitro selected tetranucleotides.
Dichtl, B; Pan, T; DiRenzo, A B; Uhlenbeck, O C
1993-01-01
An in vitro selection method based on the autolytic cleavage of yeast tRNA(Phe) by Pb2+ was applied to obtain tRNA derivatives with the anticodon hairpin replaced by four single-stranded nucleotides. Based on the rates of the site-specific cleavage by Pb2+ and the presence of a specific UV-induced crosslink, certain tetranucleotide sequences allow proper folding of the rest of the tRNA molecule, whereas others do not. One such successful tetramer sequence was also used to replace the acceptor stem of yeast tRNA(Phe) and the anticodon hairpin of E.coli tRNA(Phe) without disrupting folding. These experiments suggest that certain tetramers may be able to replace structurally nonessential hairpins in any RNA. Images PMID:7680121
Fluorescence In situ Hybridization: Cell-Based Genetic Diagnostic and Research Applications.
Cui, Chenghua; Shu, Wei; Li, Peining
2016-01-01
Fluorescence in situ hybridization (FISH) is a macromolecule recognition technology based on the complementary nature of DNA or DNA/RNA double strands. Selected DNA strands incorporated with fluorophore-coupled nucleotides can be used as probes to hybridize onto the complementary sequences in tested cells and tissues and then visualized through a fluorescence microscope or an imaging system. This technology was initially developed as a physical mapping tool to delineate genes within chromosomes. Its high analytical resolution to a single gene level and high sensitivity and specificity enabled an immediate application for genetic diagnosis of constitutional common aneuploidies, microdeletion/microduplication syndromes, and subtelomeric rearrangements. FISH tests using panels of gene-specific probes for somatic recurrent losses, gains, and translocations have been routinely applied for hematologic and solid tumors and are one of the fastest-growing areas in cancer diagnosis. FISH has also been used to detect infectious microbias and parasites like malaria in human blood cells. Recent advances in FISH technology involve various methods for improving probe labeling efficiency and the use of super resolution imaging systems for direct visualization of intra-nuclear chromosomal organization and profiling of RNA transcription in single cells. Cas9-mediated FISH (CASFISH) allowed in situ labeling of repetitive sequences and single-copy sequences without the disruption of nuclear genomic organization in fixed or living cells. Using oligopaint-FISH and super-resolution imaging enabled in situ visualization of chromosome haplotypes from differentially specified single-nucleotide polymorphism loci. Single molecule RNA FISH (smRNA-FISH) using combinatorial labeling or sequential barcoding by multiple round of hybridization were applied to measure mRNA expression of multiple genes within single cells. Research applications of these single molecule single cells DNA and RNA FISH techniques have visualized intra-nuclear genomic structure and sub-cellular transcriptional dynamics of many genes and revealed their functions in various biological processes.
Empirical Performance of Cross-Validation With Oracle Methods in a Genomics Context
Martinez, Josue G.; Carroll, Raymond J.; Müller, Samuel; Sampson, Joshua N.; Chatterjee, Nilanjan
2012-01-01
When employing model selection methods with oracle properties such as the smoothly clipped absolute deviation (SCAD) and the Adaptive Lasso, it is typical to estimate the smoothing parameter by m-fold cross-validation, for example, m = 10. In problems where the true regression function is sparse and the signals large, such cross-validation typically works well. However, in regression modeling of genomic studies involving Single Nucleotide Polymorphisms (SNP), the true regression functions, while thought to be sparse, do not have large signals. We demonstrate empirically that in such problems, the number of selected variables using SCAD and the Adaptive Lasso, with 10-fold cross-validation, is a random variable that has considerable and surprising variation. Similar remarks apply to non-oracle methods such as the Lasso. Our study strongly questions the suitability of performing only a single run of m-fold cross-validation with any oracle method, and not just the SCAD and Adaptive Lasso. PMID:22347720
Lehman, Donna M; Fu, Dong-Jing; Freeman, Angela B; Hunt, Kelly J; Leach, Robin J; Johnson-Pais, Teresa; Hamlington, Jeanette; Dyer, Thomas D; Arya, Rector; Abboud, Hanna; Göring, Harald H H; Duggirala, Ravindranath; Blangero, John; Konrad, Robert J; Stern, Michael P
2005-04-01
Excess O-glycosylation of proteins by O-linked beta-N-acetylglucosamine (O-GlcNAc) may be involved in the pathogenesis of type 2 diabetes. The enzyme O-GlcNAc-selective N-acetyl-beta-d glucosaminidase (O-GlcNAcase) encoded by MGEA5 on 10q24.1-q24.3 reverses this modification by catalyzing the removal of O-GlcNAc. We have previously reported the linkage of type 2 diabetes and age at diabetes onset to an overlapping region on chromosome 10q in the San Antonio Family Diabetes Study (SAFADS). In this study, we investigated menangioma-expressed antigen-5 (MGEA5) as a positional candidate gene. Twenty-four single nucleotide polymorphisms (SNPs), identified by sequencing 44 SAFADS subjects, were genotyped in 436 individuals from 27 families whose data were used in the original linkage report. Association tests indicated significant association of a novel SNP with the traits diabetes (P = 0.0128, relative risk = 2.77) and age at diabetes onset (P = 0.0017). The associated SNP is located in intron 10, which contains an alternate stop codon and may lead to decreased expression of the 130-kDa isoform, the isoform predicted to contain the O-GlcNAcase activity. We investigated whether this variant was responsible for the original linkage signal. The variance attributed to this SNP accounted for approximately 25% of the logarithm of odds. These results suggest that this variant within the MGEA5 gene may increase diabetes risk in Mexican Americans.
Sookoian, Silvia; Gianotti, Tomas Fernandez; Gemma, Carolina; Burgueño, Adriana L; Pirola, Carlos J
2010-06-01
To perform a two-stage study to explore the role of gene variants in the risk of insulin resistance and arterial hypertension. The selection of variants was performed by a first stage of in-silico analysis of the original genome-wide association data sets on genes involved in metabolic syndrome components, granted by the Diabetes Genetics Initiative and the Wellcome Trust Case-Control Consortium. We started by identifying single-nucleotide polymorphisms with a cutoff for association (P < 0.05) in both data sets after the application of a computational algorithm of gene prioritization. Among the more promising variants, six single-nucleotide polymorphisms in IGF1R (rs11247362, rs10902606, rs1317459, rs11854132, rs2684761, and rs2715416) were selected for further evaluation in our population. Altogether, 1094 men, aged 34.4 +/- 8.6 years, were included in a population-based study. Genotypes of rs2684761 showed significant association with insulin resistance (as a discrete trait, odds ratio per G allele 1.27, 95% confidence interval 1.03-1.56, P = 0.026; and homeostasis model assessment-insulin resistance as a continuous trait, P = 0.01). A significant association of rs2684761 with arterial hypertension was also observed (odds ratio per G allele 1.29, 95% confidence interval 1.02-1.64, P = 0.037) after adjusting for age and homeostasis model assessment-insulin resistance. Our study suggests for the first time a putative role of IGF1R variants in individual susceptibility to metabolic syndrome-related phenotypes, in particular on the risk of having insulin resistance and arterial hypertension.
Statistical method to compare massive parallel sequencing pipelines.
Elsensohn, M H; Leblay, N; Dimassi, S; Campan-Fournier, A; Labalme, A; Roucher-Boulez, F; Sanlaville, D; Lesca, G; Bardel, C; Roy, P
2017-03-01
Today, sequencing is frequently carried out by Massive Parallel Sequencing (MPS) that cuts drastically sequencing time and expenses. Nevertheless, Sanger sequencing remains the main validation method to confirm the presence of variants. The analysis of MPS data involves the development of several bioinformatic tools, academic or commercial. We present here a statistical method to compare MPS pipelines and test it in a comparison between an academic (BWA-GATK) and a commercial pipeline (TMAP-NextGENe®), with and without reference to a gold standard (here, Sanger sequencing), on a panel of 41 genes in 43 epileptic patients. This method used the number of variants to fit log-linear models for pairwise agreements between pipelines. To assess the heterogeneity of the margins and the odds ratios of agreement, four log-linear models were used: a full model, a homogeneous-margin model, a model with single odds ratio for all patients, and a model with single intercept. Then a log-linear mixed model was fitted considering the biological variability as a random effect. Among the 390,339 base-pairs sequenced, TMAP-NextGENe® and BWA-GATK found, on average, 2253.49 and 1857.14 variants (single nucleotide variants and indels), respectively. Against the gold standard, the pipelines had similar sensitivities (63.47% vs. 63.42%) and close but significantly different specificities (99.57% vs. 99.65%; p < 0.001). Same-trend results were obtained when only single nucleotide variants were considered (99.98% specificity and 76.81% sensitivity for both pipelines). The method allows thus pipeline comparison and selection. It is generalizable to all types of MPS data and all pipelines.
Renal epithelial cells can release ATP by vesicular fusion
Bjaelde, Randi G.; Arnadottir, Sigrid S.; Overgaard, Morten T.; Leipziger, Jens; Praetorius, Helle A.
2013-01-01
Renal epithelial cells have the ability to release nucleotides as paracrine factors. In the intercalated cells of the collecting duct, ATP is released by connexin30 (cx30), which is selectively expressed in this cell type. However, ATP is released by virtually all renal epithelia and the aim of the present study was to identify possible alternative nucleotide release pathways in a renal epithelial cell model. We used MDCK (type1) cells to screen for various potential ATP release pathways. In these cells, inhibition of the vesicular H+-ATPases (bafilomycin) reduced both the spontaneous and hypotonically (80%)-induced nucleotide release. Interference with vesicular fusion using N-ethylamide markedly reduced the spontaneous nucleotide release, as did interference with trafficking from the endoplasmic reticulum to the Golgi apparatus (brefeldin A1) and vesicular transport (nocodazole). These findings were substantiated using a siRNA directed against SNAP-23, which significantly reduced spontaneous ATP release. Inhibition of pannexin and connexins did not affect the spontaneous ATP release in this cell type, which consists of ~90% principal cells. TIRF-microscopy of either fluorescently-labeled ATP (MANT-ATP) or quinacrine-loaded vesicles, revealed that spontaneous release of single vesicles could be promoted by either hypoosmolality (50%) or ionomycin. This vesicular release decreased the overall cellular fluorescence by 5.8 and 7.6% respectively. In summary, this study supports the notion that spontaneous and induced ATP release can occur via exocytosis in renal epithelial cells. PMID:24065923
Beddows, Amanda; Patel, Nikesh; Finger, L David; Atack, John M; Williams, David M; Grasby, Jane A
2012-09-14
Flap endonucleases (FENs) are proposed to select their target phosphate diester by unpairing the two terminal nucleotides of duplex. Interstrand disulfide crosslinks, introduced by oxidation of thiouracil and thioguanine bases, abolished the specificity of human FEN1 for hydrolysis one nucleotide into the 5'-duplex.
Non-additive Effects in Genomic Selection
Varona, Luis; Legarra, Andres; Toro, Miguel A.; Vitezica, Zulma G.
2018-01-01
In the last decade, genomic selection has become a standard in the genetic evaluation of livestock populations. However, most procedures for the implementation of genomic selection only consider the additive effects associated with SNP (Single Nucleotide Polymorphism) markers used to calculate the prediction of the breeding values of candidates for selection. Nevertheless, the availability of estimates of non-additive effects is of interest because: (i) they contribute to an increase in the accuracy of the prediction of breeding values and the genetic response; (ii) they allow the definition of mate allocation procedures between candidates for selection; and (iii) they can be used to enhance non-additive genetic variation through the definition of appropriate crossbreeding or purebred breeding schemes. This study presents a review of methods for the incorporation of non-additive genetic effects into genomic selection procedures and their potential applications in the prediction of future performance, mate allocation, crossbreeding, and purebred selection. The work concludes with a brief outline of some ideas for future lines of that may help the standard inclusion of non-additive effects in genomic selection. PMID:29559995
Non-additive Effects in Genomic Selection.
Varona, Luis; Legarra, Andres; Toro, Miguel A; Vitezica, Zulma G
2018-01-01
In the last decade, genomic selection has become a standard in the genetic evaluation of livestock populations. However, most procedures for the implementation of genomic selection only consider the additive effects associated with SNP (Single Nucleotide Polymorphism) markers used to calculate the prediction of the breeding values of candidates for selection. Nevertheless, the availability of estimates of non-additive effects is of interest because: (i) they contribute to an increase in the accuracy of the prediction of breeding values and the genetic response; (ii) they allow the definition of mate allocation procedures between candidates for selection; and (iii) they can be used to enhance non-additive genetic variation through the definition of appropriate crossbreeding or purebred breeding schemes. This study presents a review of methods for the incorporation of non-additive genetic effects into genomic selection procedures and their potential applications in the prediction of future performance, mate allocation, crossbreeding, and purebred selection. The work concludes with a brief outline of some ideas for future lines of that may help the standard inclusion of non-additive effects in genomic selection.
Guo, Juan; Wang, Yunsheng; Song, Chi; Zhou, Jianfeng; Qiu, Lijuan; Huang, Hongwen; Wang, Ying
2010-01-01
Background and Aims It is essential to illuminate the evolutionary history of crop domestication in order to understand further the origin and development of modern cultivation and agronomy; however, despite being one of the most important crops, the domestication origin and bottleneck of soybean (Glycine max) are poorly understood. In the present study, microsatellites and nucleotide sequences were employed to elucidate the domestication genetics of soybean. Methods The genomes of 79 landrace soybeans (endemic cultivated soybeans) and 231 wild soybeans (G. soja) that represented the species-wide distribution of wild soybean in East Asia were scanned with 56 microsatellites to identify the genetic structure and domestication origin of soybean. To understand better the domestication bottleneck, four nucleotide sequences were selected to simulate the domestication bottleneck. Key Results Model-based analysis revealed that most of the landrace genotypes were assigned to the inferred wild soybean cluster of south China, South Korea and Japan. Phylogeny for wild and landrace soybeans showed that all landrace soybeans formed a single cluster supporting a monophyletic origin of all the cultivars. The populations of the nearest branches which were basal to the cultivar lineage were wild soybeans from south China. The coalescent simulation detected a bottleneck severity of K′ = 2 during soybean domestication, which could be explained by a foundation population of 6000 individuals if domestication duration lasted 3000 years. Conclusions As a result of integrating geographic distribution with microsatellite genotype assignment and phylogeny between landrace and wild soybeans, a single origin of soybean in south China is proposed. The coalescent simulation revealed a moderate genetic bottleneck with an effective wild soybean population used for domestication estimated to be ≈2 % of the total number of ancestral wild soybeans. Wild soybeans in Asia, especially in south China contain tremendous genetic resources for cultivar improvement. PMID:20566681
Rodríguez, Alejandra; Gonzalez, Luis; Ko, Arthur; Alvarez, Marcus; Miao, Zong; Bhagat, Yash; Nikkola, Elina; Cruz-Bautista, Ivette; Arellano-Campos, Olimpia; Muñoz-Hernández, Linda L; Ordóñez-Sánchez, Maria-Luisa; Rodriguez-Guillen, Rosario; Mohlke, Karen L; Laakso, Markku; Tusie-Luna, Teresa; Aguilar-Salinas, Carlos A; Pajukanta, Päivi
2016-07-01
We recently identified a locus on chromosome 18q11.2 for high serum triglycerides in Mexicans. We hypothesize that the lead genome-wide association study single-nucleotide polymorphism rs9949617, or its linkage disequilibrium proxies, regulates 1 of the 5 genes in the triglyceride-associated region. We performed a linkage disequilibrium analysis and found 9 additional variants in linkage disequilibrium (r(2)>0.7) with the lead single-nucleotide polymorphism. To select the variants for functional analyses, we annotated the 10 variants using DNase I hypersensitive sites, transcription factor and chromatin states and identified rs17259126 as the lead candidate variant for functional in vitro validation. Using luciferase transcriptional reporter assay in liver HepG2 cells, we found that the G allele exhibits a significantly lower effect on transcription (P<0.05). The electrophoretic mobility shift and ChIPqPCR (chromatin immunoprecipitation coupled with quantitative polymerase chain reaction) assays confirmed that the minor G allele of rs17259126 disrupts an hepatocyte nuclear factor 4 α-binding site. To find the regional candidate gene, we performed a local expression quantitative trait locus analysis and found that rs17259126 and its linkage disequilibrium proxies alter expression of the regional transmembrane protein 241 (TMEM241) gene in 795 adipose RNAs from the Metabolic Syndrome In Men (METSIM) cohort (P=6.11×10(-07)-5.80×10(-04)). These results were replicated in expression profiles of TMEM241 from the Multiple Tissue Human Expression Resource (MuTHER; n=856). The Mexican genome-wide association study signal for high serum triglycerides on chromosome 18q11.2 harbors a regulatory single-nucleotide polymorphism, rs17259126, which disrupts normal hepatocyte nuclear factor 4 α binding and decreases the expression of the regional TMEM241 gene. Our data suggest that decreased transcript levels of TMEM241 contribute to increased triglyceride levels in Mexicans. © 2016 American Heart Association, Inc.
Zhang, Zhen; Shang, Haihong; Shi, Yuzhen; Huang, Long; Li, Junwen; Ge, Qun; Gong, Juwu; Liu, Aiying; Chen, Tingting; Wang, Dan; Wang, Yanling; Palanga, Koffi Kibalou; Muhammad, Jamshed; Li, Weijie; Lu, Quanwei; Deng, Xiaoying; Tan, Yunna; Song, Weiwu; Cai, Juan; Li, Pengtao; Rashid, Harun or; Gong, Wankui; Yuan, Youlu
2016-04-11
Upland Cotton (Gossypium hirsutum) is one of the most important worldwide crops it provides natural high-quality fiber for the industrial production and everyday use. Next-generation sequencing is a powerful method to identify single nucleotide polymorphism markers on a large scale for the construction of a high-density genetic map for quantitative trait loci mapping. In this research, a recombinant inbred lines population developed from two upland cotton cultivars 0-153 and sGK9708 was used to construct a high-density genetic map through the specific locus amplified fragment sequencing method. The high-density genetic map harbored 5521 single nucleotide polymorphism markers which covered a total distance of 3259.37 cM with an average marker interval of 0.78 cM without gaps larger than 10 cM. In total 18 quantitative trait loci of boll weight were identified as stable quantitative trait loci and were detected in at least three out of 11 environments and explained 4.15-16.70 % of the observed phenotypic variation. In total, 344 candidate genes were identified within the confidence intervals of these stable quantitative trait loci based on the cotton genome sequence. These genes were categorized based on their function through gene ontology analysis, Kyoto Encyclopedia of Genes and Genomes analysis and eukaryotic orthologous groups analysis. This research reported the first high-density genetic map for Upland Cotton (Gossypium hirsutum) with a recombinant inbred line population using single nucleotide polymorphism markers developed by specific locus amplified fragment sequencing. We also identified quantitative trait loci of boll weight across 11 environments and identified candidate genes within the quantitative trait loci confidence intervals. The results of this research would provide useful information for the next-step work including fine mapping, gene functional analysis, pyramiding breeding of functional genes as well as marker-assisted selection.
Whole genome SNP discovery and analysis of genetic diversity in Turkey (Meleagris gallopavo)
2012-01-01
Background The turkey (Meleagris gallopavo) is an important agricultural species and the second largest contributor to the world’s poultry meat production. Genetic improvement is attributed largely to selective breeding programs that rely on highly heritable phenotypic traits, such as body size and breast muscle development. Commercial breeding with small effective population sizes and epistasis can result in loss of genetic diversity, which in turn can lead to reduced individual fitness and reduced response to selection. The presence of genomic diversity in domestic livestock species therefore, is of great importance and a prerequisite for rapid and accurate genetic improvement of selected breeds in various environments, as well as to facilitate rapid adaptation to potential changes in breeding goals. Genomic selection requires a large number of genetic markers such as e.g. single nucleotide polymorphisms (SNPs) the most abundant source of genetic variation within the genome. Results Alignment of next generation sequencing data of 32 individual turkeys from different populations was used for the discovery of 5.49 million SNPs, which subsequently were used for the analysis of genetic diversity among the different populations. All of the commercial lines branched from a single node relative to the heritage varieties and the South Mexican turkey population. Heterozygosity of all individuals from the different turkey populations ranged from 0.17-2.73 SNPs/Kb, while heterozygosity of populations ranged from 0.73-1.64 SNPs/Kb. The average frequency of heterozygous SNPs in individual turkeys was 1.07 SNPs/Kb. Five genomic regions with very low nucleotide variation were identified in domestic turkeys that showed state of fixation towards alleles different than wild alleles. Conclusion The turkey genome is much less diverse with a relatively low frequency of heterozygous SNPs as compared to other livestock species like chicken and pig. The whole genome SNP discovery study in turkey resulted in the detection of 5.49 million putative SNPs compared to the reference genome. All commercial lines appear to share a common origin. Presence of different alleles/haplotypes in the SM population highlights that specific haplotypes have been selected in the modern domesticated turkey. PMID:22891612
2013-10-01
identify common genetic variations (i.e., single nucleotide polymorphisms [ SNPs ] and haplotypes) in cytokine genes, as well demographic, clinical, and...Center. The purpose of the proposed project is to identify common genetic variations (i.e., single nucleotide polymorphisms [ SNPs ] and haplotypes) in...research team continues to meet monthly to discuss progress with regards to recruitment, enrollment, and data collection. Training in Genetics In year
Single-cell analysis of intercellular heteroplasmy of mtDNA in Leber hereditary optic neuropathy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kobayashi, Y.; Sharpe, H.; Brown, N.
1994-07-01
The authors have investigated the distribution of mutant mtDNA molecules in single cells from a patient with Leber hereditary optic neuropathy (LHON). LHON is a maternally inherited disease that is characterized by a sudden-onset bilateral loss of central vision, which typically occurs in early adulthood. More than 50% of all LHON patients carry an mtDNA mutation at nucleotide position 11778. This nucleotide change converts a highly conserved arginine residue to histidine at codon 340 in the NADH-ubiquinone oxidoreductase subunit 4 (ND4) gene of mtDNA. In the present study, the authors used PCR amplification of mtDNA from lymphocytes to investigate mtDNAmore » heteroplasmy at the single-cell level in a LHON patient. They found that most cells were either homoplasmic normal or homoplasmic mutant at nucleotide position 11778. Some (16%) cells contained both mutant and normal mtDNA.« less
Tian, Kai; Chen, Xiaowei; Luan, Binquan; Singh, Prashant; Yang, Zhiyu; Gates, Kent S; Lin, Mengshi; Mustapha, Azlin; Gu, Li-Qun
2018-05-22
Accurate and rapid detection of single-nucleotide polymorphism (SNP) in pathogenic mutants is crucial for many fields such as food safety regulation and disease diagnostics. Current detection methods involve laborious sample preparations and expensive characterizations. Here, we investigated a single locked nucleic acid (LNA) approach, facilitated by a nanopore single-molecule sensor, to accurately determine SNPs for detection of Shiga toxin producing Escherichia coli (STEC) serotype O157:H7, and cancer-derived EGFR L858R and KRAS G12D driver mutations. Current LNA applications that require incorporation and optimization of multiple LNA nucleotides. But we found that in the nanopore system, a single LNA introduced in the probe is sufficient to enhance the SNP discrimination capability by over 10-fold, allowing accurate detection of the pathogenic mutant DNA mixed in a large amount of the wild-type DNA. Importantly, the molecular mechanistic study suggests that such a significant improvement is due to the effect of the single-LNA that both stabilizes the fully matched base-pair and destabilizes the mismatched base-pair. This sensitive method, with a simplified, low cost, easy-to-operate LNA design, could be generalized for various applications that need rapid and accurate identification of single-nucleotide variations.
Substitution rate and natural selection in parvovirus B19
Stamenković, Gorana G.; Ćirković, Valentina S.; Šiljić, Marina M.; Blagojević, Jelena V.; Knežević, Aleksandra M.; Joksić, Ivana D.; Stanojević, Maja P.
2016-01-01
The aim of this study was to estimate substitution rate and imprints of natural selection on parvovirus B19 genotype 1. Studied datasets included 137 near complete coding B19 genomes (positions 665 to 4851) for phylogenetic and substitution rate analysis and 146 and 214 partial genomes for selection analyses in open reading frames ORF1 and ORF2, respectively, collected 1973–2012 and including 9 newly sequenced isolates from Serbia. Phylogenetic clustering assigned majority of studied isolates to G1A. Nucleotide substitution rate for total coding DNA was 1.03 (0.6–1.27) x 10−4 substitutions/site/year, with higher values for analyzed genome partitions. In spite of the highest evolutionary rate, VP2 codons were found to be under purifying selection with rare episodic positive selection, whereas codons under diversifying selection were found in the unique part of VP1, known to contain B19 immune epitopes important in persistent infection. Analyses of overlapping gene regions identified nucleotide positions under opposite selective pressure in different ORFs, suggesting complex evolutionary mechanisms of nucleotide changes in B19 viral genomes. PMID:27775080
Gao, Yan; Ni, Xiaohui; Guo, Hua; Su, Zhe; Ba, Yi; Tong, Zhongsheng; Guo, Zhi; Yao, Xin; Chen, Xixi; Yin, Jian; Yan, Zhao; Guo, Lin; Liu, Ying; Bai, Fan; Xie, X Sunney; Zhang, Ning
2017-08-01
Copy number alteration (CNA) is a major contributor to genome instability, a hallmark of cancer. Here, we studied genomic alterations in single primary tumor cells and circulating tumor cells (CTCs) from the same patient. Single-nucleotide variants (SNVs) in single cells from both samples occurred sporadically, whereas CNAs among primary tumor cells emerged accumulatively rather than abruptly, converging toward the CNA in CTCs. Focal CNAs affecting the MYC gene and the PTEN gene were observed only in a minor portion of primary tumor cells but were present in all CTCs, suggesting a strong selection toward metastasis. Single-cell structural variant (SV) analyses revealed a two-step mechanism, a complex rearrangement followed by gene amplification, for the simultaneous formation of anomalous CNAs in multiple chromosome regions. Integrative CNA analyses of 97 CTCs from 23 patients confirmed the convergence of CNAs and revealed single, concurrent, and mutually exclusive CNAs that could be the driving events in cancer metastasis. © 2017 Gao et al.; Published by Cold Spring Harbor Laboratory Press.
Vendrami, David L J; Shah, Abhijeet; Telesca, Luca; Hoffman, Joseph I
2016-06-01
Transcriptional profiling not only provides insights into patterns of gene expression, but also generates sequences that can be mined for molecular markers, which in turn can be used for population genetic studies. As part of a large-scale effort to better understand how commercially important European shellfish species may respond to ocean acidification, we therefore mined the transcriptomes of four species (the Pacific oyster Crassostrea gigas, the blue mussel Mytilus edulis, the great scallop Pecten maximus and the blunt gaper Mya truncata) for single nucleotide polymorphisms (SNPs). Illumina data for C. gigas, M. edulis and P. maximus and 454 data for M. truncata were interrogated using GATK and SWAP454 respectively to identify between 8267 and 47,159 high quality SNPs per species (total=121,053 SNPs residing within 34,716 different contigs). We then annotated the transcripts containing SNPs to reveal homology to diverse genes. Finally, as oceanic pH affects the ability of organisms to incorporate calcium carbonate, we honed in on genes implicated in the biomineralization process to identify a total of 1899 SNPs in 157 genes. These provide good candidates for biomarkers with which to study patterns of selection in natural or experimental populations. Copyright © 2016 Elsevier B.V. All rights reserved.
Freedman, Jennifer A; Wang, Yanru; Li, Xuechan; Liu, Hongliang; Moorman, Patricia G; George, Daniel J; Lee, Norman H; Hyslop, Terry; Wei, Qingyi; Patierno, Steven R
2018-05-03
Prostate cancer is a clinically and molecularly heterogeneous disease, with variation in outcomes only partially predicted by grade and stage. Additional tools to distinguish indolent from aggressive disease are needed. Phenotypic characteristics of stemness correlate with poor cancer prognosis. Given this correlation, we identified single nucleotide polymorphisms (SNPs) of stemness-related genes and examined their associations with prostate cancer survival. SNPs within stemness-related genes were analyzed for association with overall survival of prostate cancer in the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial. Significant SNPs predicted to be functional were selected for linkage disequilibrium analysis and combined and stratified analyses. Identified SNPs were evaluated for association with gene expression. SNPs of CD44 (rs9666607), ABCC1 (rs35605 and rs212091) and GDF15 (rs1058587) were associated with prostate cancer survival and predicted to be functional. A role for rs9666607 of CD44 and rs35605 of ABCC1 in RNA splicing regulation, rs212091 of ABCC1 in miRNA binding site activity and rs1058587 of GDF15 in causing an amino acid change was predicted. These SNPs represent potential novel prognostic markers for overall survival of prostate cancer and support a contribution of the stemness pathway to prostate cancer patient outcome.
Xu, L; Shi, Y; Gu, J; Wang, Y; Wang, L; You, L; Qi, X; Ye, Y; Chen, Z
2014-03-01
To investigate the association between 2 single nucleotide polymorphisms (SNP501A/C and 604 G/A) in the promoter of the ghrelin gene and the hormonal and metabolic phenotypes of polycystic ovary syndrome (PCOS) in a Chinese population. 285 patients with PCOS and 260 healthy controls were selected for a prospective, case-control study at Shandong Provincial Hospital, Jinan, China. All subjects underwent genotype analysis of the 2 single nucleotide polymorphisms of the ghrelin gene. Measurements were also taken of blood lipids, glucose, and hormone levels, and calculations of body mass index (BMI) and waist-to-hip ratio (WHR) were performed to detect hormonal and metabolic phenotypes. No significant diff erences in polymorphism genotypes were found between PCOS patients and healthy controls. However, the frequency of the -501 A/C A allele was significantly higher in the PCOS group than in the control group. PCOS -501 A/C A carriers had significantly higher BMI and WHR than PCOS women with the CC genotype. -604 G/A polymorphisms were not associated with clinical or biochemical characteristics of PCOS. The -501 A/C polymorphism of the ghrelin gene is associated with metabolic features of PCOS in a Chinese population. © J. A. Barth Verlag in Georg Thieme Verlag KG Stuttgart · New York.
NASA Astrophysics Data System (ADS)
Li, Jiqin; Bao, Zhenmin; Li, Ling; Wang, Xiaojian; Wang, Shi; Hu, Xiaoli
2013-09-01
Zhikong scallop ( Chlamys farreri) is an important maricultured species in China. Many researches on this species, such as population genetics and QTL fine-mapping, need a large number of molecular markers. In this study, based on the expressed sequence tags (EST), a total of 300 putative single nucleotide polymorphisms (SNPs) were selected and validated using high resolution melting (HRM) technology with unlabeled probe. Of them, 101 (33.7%) were found to be polymorphic in 48 individuals from 4 populations. Further evaluation with 48 individuals from Qingdao population showed that all the polymorphic loci had two alleles with the minor allele frequency ranged from 0.046 to 0.500. The observed and expected heterozygosities ranged from 0.000 to 0.925 and from 0.089 to 0.505, respectively. Fifteen loci deviated significantly from Hardy-Weinberg equilibrium and significant linkage disequilibrate was detected in one pair of markers. BLASTx gave significant hits for 72 of the 101 polymorphic SNP-containing ESTs. Thirty four polymorphic SNP loci were predicted to be non-synonymous substitutions as they caused either the change of codons (33 SNPs) or pretermination of translation (1 SNP). The markers developed can be used for the population studies and genetic improvement on Zhikong scallop.
Association between RTEL1 gene polymorphisms and COPD susceptibility in a Chinese Han population.
Ding, Yipeng; Xu, Heping; Yao, Jinjian; Xu, Dongchuan; He, Ping; Yi, Shengyang; Li, Quanni; Liu, Yuanshui; Wu, Cibing; Tian, Zhongjie
2017-01-01
We investigated the association between single-nucleotide polymorphisms in regulation of telomere elongation helicase 1 ( RTEL1 ), which has been associated with telomere length in several brain cancers and age-related diseases, and the risk of chronic obstructive pulmonary disease (COPD) in a Chinese Han population. In a case-control study that included 279 COPD cases and 290 healthy controls, five single-nucleotide polymorphisms in RTEL1 were selected and genotyped using the Sequenom MassARRAY platform. Odds ratios (ORs) and 95% confidence intervals (CIs) were calculated using unconditional logistic regression after adjusting for age and gender. In the genotype model analysis, we determined that rs4809324 polymorphism had a decreased effect on the risk of COPD (CC versus TT: OR =0.28; 95% CI =0.10-0.82; P =0.02). In the genetic model analysis, we found that the "C/C" genotype of rs4809324 was associated with a decreased risk of COPD based on the codominant model (OR =0.33; 95% CI =0.13-0.86; P =0.022) and recessive model (OR =0.32; 95% CI =0.12-0.80; P =0.009). Our data shed new light on the association between genetic polymorphisms of RTEL1 and COPD susceptibility in the Chinese Han population.
Gaunt, Tom R; Rodriguez, Santiago; Zapata, Carlos; Day, Ian NM
2006-01-01
Background Various software tools are available for the display of pairwise linkage disequilibrium across multiple single nucleotide polymorphisms. The HapMap project also presents these graphics within their website. However, these approaches are limited in their use of data from multiallelic markers and provide limited information in a graphical form. Results We have developed a software package (MIDAS – Multiallelic Interallelic Disequilibrium Analysis Software) for the estimation and graphical display of interallelic linkage disequilibrium. Linkage disequilibrium is analysed for each allelic combination (of one allele from each of two loci), between all pairwise combinations of any type of multiallelic loci in a contig (or any set) of many loci (including single nucleotide polymorphisms, microsatellites, minisatellites and haplotypes). Data are presented graphically in a novel and informative way, and can also be exported in tabular form for other analyses. This approach facilitates visualisation of patterns of linkage disequilibrium across genomic regions, analysis of the relationships between different alleles of multiallelic markers and inferences about patterns of evolution and selection. Conclusion MIDAS is a linkage disequilibrium analysis program with a comprehensive graphical user interface providing novel views of patterns of linkage disequilibrium between all types of multiallelic and biallelic markers. Availability Available from and PMID:16643648
Range-wide parallel climate-associated genomic clines in Atlantic salmon
Stanley, Ryan R. E.; Wringe, Brendan F.; Guijarro-Sabaniel, Javier; Bourret, Vincent; Bernatchez, Louis; Bentzen, Paul; Beiko, Robert G.; Gilbey, John; Clément, Marie; Bradbury, Ian R.
2017-01-01
Clinal variation across replicated environmental gradients can reveal evidence of local adaptation, providing insight into the demographic and evolutionary processes that shape intraspecific diversity. Using 1773 genome-wide single nucleotide polymorphisms we evaluated latitudinal variation in allele frequency for 134 populations of North American and European Atlantic salmon (Salmo salar). We detected 84 (4.74%) and 195 (11%) loci showing clinal patterns in North America and Europe, respectively, with 12 clinal loci in common between continents. Clinal single nucleotide polymorphisms were evenly distributed across the salmon genome and logistic regression revealed significant associations with latitude and seasonal temperatures, particularly average spring temperature in both continents. Loci displaying parallel clines were associated with several metabolic and immune functions, suggesting a potential basis for climate-associated adaptive differentiation. These climate-based clines collectively suggest evidence of large-scale environmental associated differences on either side of the North Atlantic. Our results support patterns of parallel evolution on both sides of the North Atlantic, with evidence of both similar and divergent underlying genetic architecture. The identification of climate-associated genomic clines illuminates the role of selection and demographic processes on intraspecific diversity in this species and provides a context in which to evaluate the impacts of climate change. PMID:29291123
Electron attachment to DNA single strands: gas phase and aqueous solution.
Gu, Jiande; Xie, Yaoming; Schaefer, Henry F
2007-01-01
The 2'-deoxyguanosine-3',5'-diphosphate, 2'-deoxyadenosine-3',5'-diphosphate, 2'-deoxycytidine-3',5'-diphosphate and 2'-deoxythymidine-3',5'-diphosphate systems are the smallest units of a DNA single strand. Exploring these comprehensive subunits with reliable density functional methods enables one to approach reasonable predictions of the properties of DNA single strands. With these models, DNA single strands are found to have a strong tendency to capture low-energy electrons. The vertical attachment energies (VEAs) predicted for 3',5'-dTDP (0.17 eV) and 3',5'-dGDP (0.14 eV) indicate that both the thymine-rich and the guanine-rich DNA single strands have the ability to capture electrons. The adiabatic electron affinities (AEAs) of the nucleotides considered here range from 0.22 to 0.52 eV and follow the order 3',5'-dTDP > 3',5'-dCDP > 3',5'-dGDP > 3',5'-dADP. A substantial increase in the AEA is observed compared to that of the corresponding nucleic acid bases and the corresponding nucleosides. Furthermore, aqueous solution simulations dramatically increase the electron attracting properties of the DNA single strands. The present investigation illustrates that in the gas phase, the excess electron is situated both on the nucleobase and on the phosphate moiety for DNA single strands. However, the distribution of the extra negative charge is uneven. The attached electron favors the base moiety for the pyrimidine, while it prefers the 3'-phosphate subunit for the purine DNA single strands. In contrast, the attached electron is tightly bound to the base fragment for the cytidine, thymidine and adenosine nucleotides, while it almost exclusively resides in the vicinity of the 3'-phosphate group for the guanosine nucleotides due to the solvent effects. The comparatively low vertical detachment energies (VDEs) predicted for 3',5'-dADP(-) (0.26 eV) and 3',5'-dGDP(-) (0.32 eV) indicate that electron detachment might compete with reactions having high activation barriers such as glycosidic bond breakage. However, the radical anions of the pyrimidine nucleotides with high VDE are expected to be electronically stable. Thus the base-centered radical anions of the pyrimidine nucleotides might be the possible intermediates for DNA single-strand breakage.
DNA sequence alignment by microhomology sampling during homologous recombination
Qi, Zhi; Redding, Sy; Lee, Ja Yil; Gibb, Bryan; Kwon, YoungHo; Niu, Hengyao; Gaines, William A.; Sung, Patrick
2015-01-01
Summary Homologous recombination (HR) mediates the exchange of genetic information between sister or homologous chromatids. During HR, members of the RecA/Rad51 family of recombinases must somehow search through vast quantities of DNA sequence to align and pair ssDNA with a homologous dsDNA template. Here we use single-molecule imaging to visualize Rad51 as it aligns and pairs homologous DNA sequences in real-time. We show that Rad51 uses a length-based recognition mechanism while interrogating dsDNA, enabling robust kinetic selection of 8-nucleotide (nt) tracts of microhomology, which kinetically confines the search to sites with a high probability of being a homologous target. Successful pairing with a 9th nucleotide coincides with an additional reduction in binding free energy and subsequent strand exchange occurs in precise 3-nt steps, reflecting the base triplet organization of the presynaptic complex. These findings provide crucial new insights into the physical and evolutionary underpinnings of DNA recombination. PMID:25684365
Kinetic gating mechanism of DNA damage recognition by Rad4/XPC
NASA Astrophysics Data System (ADS)
Chen, Xuejing; Velmurugu, Yogambigai; Zheng, Guanqun; Park, Beomseok; Shim, Yoonjung; Kim, Youngchang; Liu, Lili; van Houten, Bennett; He, Chuan; Ansari, Anjum; Min, Jung-Hyun
2015-01-01
The xeroderma pigmentosum C (XPC) complex initiates nucleotide excision repair by recognizing DNA lesions before recruiting downstream factors. How XPC detects structurally diverse lesions embedded within normal DNA is unknown. Here we present a crystal structure that captures the yeast XPC orthologue (Rad4) on a single register of undamaged DNA. The structure shows that a disulphide-tethered Rad4 flips out normal nucleotides and adopts a conformation similar to that seen with damaged DNA. Contrary to many DNA repair enzymes that can directly reject non-target sites as structural misfits, our results suggest that Rad4/XPC uses a kinetic gating mechanism whereby lesion selectivity arises from the kinetic competition between DNA opening and the residence time of Rad4/XPC per site. This mechanism is further supported by measurements of Rad4-induced lesion-opening times using temperature-jump perturbation spectroscopy. Kinetic gating may be a general mechanism used by site-specific DNA-binding proteins to minimize time-consuming interrogations of non-target sites.
Gao, Zhong Feng; Ling, Yu; Lu, Lu; Chen, Ning Yu; Luo, Hong Qun; Li, Nian Bing
2014-03-04
Although various strategies have been reported for single-nucleotide polymorphisms (SNPs) detection, development of a time-saving, specific, and regenerated electrochemical sensing platform still remains a realistic goal. In this study, an ON-OFF switching of a regenerated biosensor based on a locked nucleic acid (LNA)-integrated and toehold-mediated strand displacement reaction technique is constructed for detection of SNPs. The LNA-integrated and methylene blue-labeled capture probe with an external toehold is designed to switch on the sensing system. The mutant-type DNA probe completes complementary with the capture probe to trigger the strand displacement reaction, which switches off the sensing system. However, when the single-base mismatched wild-type DNA probe is presented, the strand displacement reaction cannot be achieved; therefore, the sensing system still keeps the ON state. This DNA sensor is stable over five reuses. We further testify that the LNA-integrated sequence has better recognition ability for SNPs detection compared to the DNA-integrated sequence. Moreover, this DNA senor exhibits a remarkable discrimination capability of SNPs among abundant wild-type targets and 6000-fold (m/m) excess of genomic DNA. In addition, it is selective enough in complex and contaminant-ridden samples, such as human urine, soil, saliva, and beer. Overall, these results demonstrate that this reliable DNA sensor is easy to be fabricated, simple to operate, and stable enough to be readily regenerated.
Schermerhorn, Kelly M.; Gardner, Andrew F.
2015-01-01
Family D DNA polymerases (polDs) have been implicated as the major replicative polymerase in archaea, excluding the Crenarchaeota branch, and bear little sequence homology to other DNA polymerase families. Here we report a detailed kinetic analysis of nucleotide incorporation and exonuclease activity for a Family D DNA polymerase from Thermococcus sp. 9°N. Pre-steady-state single-turnover nucleotide incorporation assays were performed to obtain the kinetic parameters, kpol and Kd, for correct nucleotide incorporation, incorrect nucleotide incorporation, and ribonucleotide incorporation by exonuclease-deficient polD. Correct nucleotide incorporation kinetics revealed a relatively slow maximal rate of polymerization (kpol ∼2.5 s−1) and especially tight nucleotide binding (Kd(dNTP) ∼1.7 μm), compared with DNA polymerases from Families A, B, C, X, and Y. Furthermore, pre-steady-state nucleotide incorporation assays revealed that polD prevents the incorporation of incorrect nucleotides and ribonucleotides primarily through reduced nucleotide binding affinity. Pre-steady-state single-turnover assays on wild-type 9°N polD were used to examine 3′-5′ exonuclease hydrolysis activity in the presence of Mg2+ and Mn2+. Interestingly, substituting Mn2+ for Mg2+ accelerated hydrolysis rates >40-fold (kexo ≥110 s−1 versus ≥2.5 s−1). Preference for Mn2+ over Mg2+ in exonuclease hydrolysis activity is a property unique to the polD family. The kinetic assays performed in this work provide critical insight into the mechanisms that polD employs to accurately and efficiently replicate the archaeal genome. Furthermore, despite the unique properties of polD, this work suggests that a conserved polymerase kinetic pathway is present in all known DNA polymerase families. PMID:26160179
Maximization of Markers Linked in Coupling for Tetraploid Potatoes via Monoparental Haploids
Bartkiewicz, Annette M.; Chilla, Friederike; Terefe-Ayana, Diro; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Linde, Marcus; Debener, Thomas
2018-01-01
Haploid potato populations derived from a single tetraploid donor constitute an efficient strategy to analyze markers segregating from a single donor genotype. Analysis of marker segregation in populations derived from crosses between polysomic tetraploids is complicated by a maximum of eight segregating alleles, multiple dosages of the markers and problems related to linkage analysis of marker segregation in repulsion. Here, we present data on two monoparental haploid populations generated by prickle pollination of two tetraploid cultivars with Solanum phureja and genotyped with the 12.8 k SolCAP single nucleotide polymorphism (SNP) array. We show that in a population of monoparental haploids, the number of biallelic SNP markers segregating in linkage to loci from the tetraploid donor genotype is much larger than in putative crosses of this genotype to a diverse selection of 125 tetraploid cultivars. Although this strategy is more laborious than conventional breeding, the generation of haploid progeny for efficient marker analysis is straightforward if morphological markers and flow cytometry are utilized to select true haploid progeny. The level of introgressed fragments from S. phureja, the haploid inducer, is very low, supporting its suitability for genetic analysis. Mapping with single-dose markers allowed the analysis of quantitative trait loci (QTL) for four phenotypic traits. PMID:29868076
Genomic Signatures Reveal New Evidences for Selection of Important Traits in Domestic Cattle
Xu, Lingyang; Bickhart, Derek M.; Cole, John B.; Schroeder, Steven G.; Song, Jiuzhou; Tassell, Curtis P. Van; Sonstegard, Tad S.; Liu, George E.
2015-01-01
We investigated diverse genomic selections using high-density single nucleotide polymorphism data of five distinct cattle breeds. Based on allele frequency differences, we detected hundreds of candidate regions under positive selection across Holstein, Angus, Charolais, Brahman, and N'Dama. In addition to well-known genes such as KIT, MC1R, ASIP, GHR, LCORL, NCAPG, WIF1, and ABCA12, we found evidence for a variety of novel and less-known genes under selection in cattle, such as LAP3, SAR1B, LRIG3, FGF5, and NUDCD3. Selective sweeps near LAP3 were then validated by next-generation sequencing. Genome-wide association analysis involving 26,362 Holsteins confirmed that LAP3 and SAR1B were related to milk production traits, suggesting that our candidate regions were likely functional. In addition, haplotype network analyses further revealed distinct selective pressures and evolution patterns across these five cattle breeds. Our results provided a glimpse into diverse genomic selection during cattle domestication, breed formation, and recent genetic improvement. These findings will facilitate genome-assisted breeding to improve animal production and health. PMID:25431480
König, S; Swalve, H H
2009-10-01
The availability of genomic estimated breeding values (GEBV) allows for possible modifications to existing dairy cattle breeding programs. Selection index calculations including genomic and phenotypic observations as index sources were used to determine the optimal number of offspring per genotyped sire with a focus on functional traits and the design of cooperator herds, and to evaluate the importance of a central station test for genotyped bull dams. Evaluation criteria to compare different breeding strategies were correlations between index and aggregate genotype (r(TI)), and the relative selection response percentage (RSR) of an index without single nucleotide polymorphism information in relation to a single nucleotide polymorphism-based index. The number of required daughter records per sire to achieve a predefined r(TI) strongly depends on the accuracy of GEBV (r(mg)) and the heritability of the trait. For a desired r(TI) of 0.8, h(2) = 0.10, and r(mg) = 0.5, at least 57 additional daughters have to be included in the genetic evaluation. Daughter records of genotyped sires are not necessary for optimal scenarios where r(mg) is greater than or equal to r(TI). There still is a substantial need for phenotypic daughter records, especially for low-heritability functional traits and r(mg) < 0.7. Phenotypic records from genotyped potential bull dams have no relevance for increasing r(TI), even with a low value for r(mg) of 0.5. Hence, genomic breeding programs should focus on recording functional traits within progeny groups, preferably in cooperator herds. For low-heritability traits and with r(mg) > 0.7, the RSR of conventional breeding programs was only 10% of RSR from genomic breeding strategies. As shown in scenarios including 2 traits in the index as well as in the aggregate genotype, the availability of highly accurate GEBV for production traits and low-accuracy GEBV for functional traits increased the risk of widening the gap between selection responses in production and functionality. Counteractions are possible, such as via higher economic weights for low-heritability functional traits. Finally, an alternative selection strategy considering only 2 pathways of selection for genotyped male calves and for cow dams was evaluated. This strategy is competitive with a 4-pathway genomic breeding program if the fraction of selected male calves for the artificial insemination program is below 1% and if selection is focused on functionality, thus pointing to substantial insufficiencies caused by low reliabilities of breeding values for cows for such traits in conventional bull dam selection schemes.
Mallik, Saurav; Kundu, Sudip
2017-04-01
Understanding the molecular evolution of macromolecular complexes in the light of their structure, assembly, and stability is of central importance. Here, we address how the modular organization of native molecular contacts shapes the selection pressure on individual residue sites of ribosomal complexes. The bacterial ribosomal complex is represented as a residue contact network where nodes represent amino acid/nucleotide residues and edges represent their van der Waals interactions. We find statistically overrepresented native amino acid-nucleotide contacts (OaantC, one amino acid contacts one or multiple nucleotides, internucleotide contacts are disregarded). Contact number is defined as the number of nucleotides contacted. Involvement of individual amino acids in OaantCs with smaller contact numbers is more random, whereas only a few amino acids significantly contribute to OaantCs with higher contact numbers. An investigation of structure, stability, and assembly of bacterial ribosome depicts the involvement of these OaantCs in diverse biophysical interactions stabilizing the complex, including high-affinity protein-RNA contacts, interprotein cooperativity, intersubunit bridge, packing of multiple ribosomal RNA domains, etc. Amino acid-nucleotide constituents of OaantCs with higher contact numbers are generally associated with significantly slower substitution rates compared with that of OaantCs with smaller contact numbers. This evolutionary rate heterogeneity emerges from the strong purifying selection pressure that conserves the respective amino acid physicochemical properties relevant to the stabilizing interaction with OaantC nucleotides. An analysis of relative molecular orientations of OaantC residues and their interaction energetics provides the biophysical ground of purifying selection conserving OaantC amino acid physicochemical properties. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Structure and Temporal Dynamics of Populations within Wheat Streak Mosaic Virus Isolates
Hall, Jeffrey S.; French, Roy; Morris, T. Jack; Stenger, Drake C.
2001-01-01
Variation within the Type and Sidney 81 strains of wheat streak mosaic virus was assessed by single-strand conformation polymorphism (SSCP) analysis and confirmed by nucleotide sequencing. Limiting-dilution subisolates (LDSIs) of each strain were evaluated for polymorphism in the P1, P3, NIa, and CP cistrons. Different SSCP patterns among LDSIs of a strain were associated with single-nucleotide substitutions. Sidney 81 LDSI-S10 was used as founding inoculum to establish three lineages each in wheat, corn, and barley. The P1, HC-Pro, P3, CI, NIa, NIb, and CP cistrons of LDSI-S10 and each lineage at passages 1, 3, 6, and 9 were evaluated for polymorphism. By passage 9, each lineage differed in consensus sequence from LDSI-S10. The majority of substitutions occurred within NIa and CP, although at least one change occurred in each cistron except HC-Pro and P3. Most consensus sequence changes among lineages were independent, with substitutions accumulating over time. However, LDSI-S10 bore a variant nucleotide (G6016) in NIa that was restored to A6016 in eight of nine lineages by passage 6. This near-global reversion is most easily explained by selection. Examination of nonconsensus variation revealed a pool of unique substitutions (singletons) that remained constant in frequency during passage, regardless of the host species examined. These results suggest that mutations arising by viral polymerase error are generated at a constant rate but that most newly generated mutants are sequestered in virions and do not serve as replication templates. Thus, a substantial fraction of variation generated is static and has yet to be tested for relative fitness. In contrast, nonsingleton variation increased upon passage, suggesting that some mutants do serve as replication templates and may become established in a population. Replicated mutants may or may not rise to prominence to become the consensus sequence in a lineage, with the fate of any particular mutant subject to selection and stochastic processes such as genetic drift and population growth factors. PMID:11581391
Crystal structures of the methyltransferase and helicase from the ZIKA 1947 MR766 Uganda strain
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bukrejewska, Malgorzata; Derewenda, Urszula; Radwanska, Malwina
2017-08-15
Two nonstructural proteins encoded byZika virusstrain MR766 RNA, a methyltransferase and a helicase, were crystallized and their structures were solved and refined at 2.10 and 2.01 Å resolution, respectively. The NS5 methyltransferase contains a boundS-adenosyl-L-methionine (SAM) co-substrate. The NS3 helicase is in the apo form. Comparison with published crystal structures of the helicase in the apo, nucleotide-bound and single-stranded RNA (ssRNA)-bound states suggests that binding of ssRNA to the helicase may occur through conformational selection rather than induced fit.
Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision.
Bao, Zehua; HamediRad, Mohammad; Xue, Pu; Xiao, Han; Tasan, Ipek; Chao, Ran; Liang, Jing; Zhao, Huimin
2018-07-01
We developed a CRISPR-Cas9- and homology-directed-repair-assisted genome-scale engineering method named CHAnGE that can rapidly output tens of thousands of specific genetic variants in yeast. More than 98% of target sequences were efficiently edited with an average frequency of 82%. We validate the single-nucleotide resolution genome-editing capability of this technology by creating a genome-wide gene disruption collection and apply our method to improve tolerance to growth inhibitors.
Genomic Selection in Dairy Cattle: The USDA Experience.
Wiggans, George R; Cole, John B; Hubbard, Suzanne M; Sonstegard, Tad S
2017-02-08
Genomic selection has revolutionized dairy cattle breeding. Since 2000, assays have been developed to genotype large numbers of single-nucleotide polymorphisms (SNPs) at relatively low cost. The first commercial SNP genotyping chip was released with a set of 54,001 SNPs in December 2007. Over 15,000 genotypes were used to determine which SNPs should be used in genomic evaluation of US dairy cattle. Official USDA genomic evaluations were first released in January 2009 for Holsteins and Jerseys, in August 2009 for Brown Swiss, in April 2013 for Ayrshires, and in April 2016 for Guernseys. Producers have accepted genomic evaluations as accurate indications of a bull's eventual daughter-based evaluation. The integration of DNA marker technology and genomics into the traditional evaluation system has doubled the rate of genetic progress for traits of economic importance, decreased generation interval, increased selection accuracy, reduced previous costs of progeny testing, and allowed identification of recessive lethals.
Medeiros, J D; Leite, L R; Pylro, V S; Oliveira, F S; Almeida, V M; Fernandes, G R; Salim, A C M; Araújo, F M G; Volpini, A C; Oliveira, G; Cuadros-Orellana, S
2017-10-01
Acid mine drainage (AMD) is characterized by an acid and metal-rich run-off that originates from mining systems. Despite having been studied for many decades, much remains unknown about the microbial community dynamics in AMD sites, especially during their early development, when the acidity is moderate. Here, we describe draft genome assemblies from single cells retrieved from an early-stage AMD sample. These cells belong to the genus Hydrotalea and are closely related to Hydrotalea flava. The phylogeny and average nucleotide identity analysis suggest that all single amplified genomes (SAGs) form two clades that may represent different strains. These cells have the genomic potential for denitrification, copper and other metal resistance. Two coexisting CRISPR-Cas loci were recovered across SAGs, and we observed heterogeneity in the population with regard to the spacer sequences, together with the loss of trailer-end spacers. Our results suggest that the genomes of Hydrotalea sp. strains studied here are adjusting to a quickly changing selective pressure at the microhabitat scale, and an important form of this selective pressure is infection by foreign DNA. © 2017 John Wiley & Sons Ltd.
Genome amplification of single sperm using multiple displacement amplification.
Jiang, Zhengwen; Zhang, Xingqi; Deka, Ranjan; Jin, Li
2005-06-07
Sperm typing is an effective way to study recombination rate on a fine scale in regions of interest. There are two strategies for the amplification of single meiotic recombinants: repulsion-phase allele-specific PCR and whole genome amplification (WGA). The former can selectively amplify single recombinant molecules from a batch of sperm but is not scalable for high-throughput operation. Currently, primer extension pre-amplification is the only method used in WGA of single sperm, whereas it has limited capacity to produce high-coverage products enough for the analysis of local recombination rate in multiple large regions. Here, we applied for the first time a recently developed WGA method, multiple displacement amplification (MDA), to amplify single sperm DNA, and demonstrated its great potential for producing high-yield and high-coverage products. In a 50 mul reaction, 76 or 93% of loci can be amplified at least 2500- or 250-fold, respectively, from single sperm DNA, and second-round MDA can further offer >200-fold amplification. The MDA products are usable for a variety of genetic applications, including sequencing and microsatellite marker and single nucleotide polymorphism (SNP) analysis. The use of MDA in single sperm amplification may open a new era for studies on local recombination rates.
Zhao, Xiaoyang; Liu, Bo; Yan, Jing; Yuan, Ying; An, Liwen; Guan, Yifu
2014-10-01
Thrombin binding aptamer (TBA), a 15-mer oligonucleotide of d(GGTTGGTGTGGTTGG) sequence, folds into a chair-type antiparallel G-quadruplex in the K(+) environment, and each of two G-tetrads is characterized by a syn-anti-syn-anti glycosidic conformation arrangement. To explore its folding topology and structural stability, 2'-O-methyl nucleotide (OMe) with the C3'-endo sugar pucker conformation and anti glycosidic angle was used to selectively substitute for the guanine residues of G-tetrads of TBA, and these substituted TBAs were characterized using a circular dichroism spectrum, thermally differential spectrum, ultraviolet stability analysis, electrophoresis mobility shift assay, and thermodynamic analysis in K(+) and Ca(2+) environments. Results showed that single substitutions for syn-dG residues destabilized the G-quadruplex structure, while single substitutions for anti-dG residues could preserve the G-quadruplex in the K(+) environment. When one or two G-tetrads were modified with OMe, TBA became unstructured. In contrast, in Ca(2+) environment, the native TBA appeared to be unstructured. When two G-tetrads were substituted with OMe, TBA seemed to become a more stable parallel G-4 structure. Further thermodynamic data suggested that OMe-substitutions were an enthalpy-driven event. The results in this study enrich our understanding about the effects of nucleotide derivatives on the G-quadruplex structure stability in different ionic environments, which will help to design G-quadruplex for biological and medical applications. © The Author 2014. Published by ABBS Editorial Office in association with Oxford University Press on behalf of the Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences.
Hepatitis C: progress and problems.
Cuthbert, J A
1994-01-01
The hepatitis C virus (HCV), a single-stranded RNA virus, is the major cause of posttransfusion hepatitis. HCV isolates differ in nucleotide and amino acid sequences. Nucleotide changes are concentrated in hypervariable regions and may be related to immune selection. In most immunocompetent persons, HCV infection is diagnosed serologically, using antigens from conserved regions. Amplification of RNA may be necessary to detect infection in immunosuppressed patients. Transmission by known parenteral routes is frequent; other means of spread are less common and may represent inapparent, percutaneous dissemination. Infection can lead to classical acute hepatitis, but most infected persons have no history of acute disease. Once infected, most individuals apparently remain carriers of the virus, with varying degrees of hepatocyte damage and fibrosis ensuing. Chronic hepatitis may lead to cirrhosis and hepatocellular carcinoma. However, disease progression varies widely, from less than 2 years to cirrhosis in some patients to more than 30 years with only chronic hepatitis in others. Determinants important in deciding outcome are unknown. Alpha interferon, which results in sustained remission in selected patients, is the only available therapy. Long-term benefits from such therapy have not been demonstrated. Prevention of HCV infection by vaccination is likely to be challenging if ongoing viral mutation results in escape from neutralization and clearance. PMID:7834603
Yang, Zujun; Zhang, Tao; Li, Guangrong; Nevo, Eviatar
2011-12-01
Dehydrins are one of the major stress-induced gene families, and the expression of dehydrin 6 (Dhn6) is strictly related to drought in barley. In order to investigate how the evolution of the Dhn6 gene is associated with adaptation to environmental changes, we examined 48 genotypes of wild barley, Hordeum spontaneum, from "Evolution Canyon" at Mount Carmel, Israel. The Dhn6 sequences of the 48 genotypes were identified, and a recent insertion of 342 bp at 5'UTR was found in the sequences of 11 genotypes. Both nucleotide and haplotype diversity of single nucleotide polymorphism in Dhn6 coding regions were higher on the AS ("African" slope or dry slope) than on the ES ("European" slope or humid slope), and the applied Tajima D and Fu-Li test rejected neutrality of SNP diversity. Expression analysis indicated that the 342 bp insertion at 5'UTR was associated with the earlier up-regulation of Dhn6 after dehydration. The genetic divergence of amino acids sequences indicated significant positive selection of Dhn6 among the wild barley populations. The diversity of Dhn6 in microclimatic divergence slopes suggested that Dhn6 has been subjected to natural selection and adaptively associated with drought resistance of wild barley at "Evolution Canyon".
Kanony, Claire; Fabiano-Tixier, Anne-Sylvie; Ravanat, Jean-Luc; Vicendo, Patricia; Paillous, Nicole
2003-06-01
Pyropheophorbides are red-absorbing porphyrin-like photosensitizers that may interact with DNA either by intercalation or by external binding with self-stacking according to the value of the nucleotide to chromophore molar ratio (N/C). This article reports on the nature and sequence selectivity of the DNA damage photoinduced by a water-soluble chlorhydrate of aminopyropheophorbide. First, this pyropheophorbide is shown to induce on irradiation the cleavage of phiX174 DNA by both Type-I and -II mechanisms, suggested by scavengers and D2O effects. These conclusions are then improved by sequencing experiments performed on a 20-mer oligodeoxynucleotide (ODN) irradiated at wavelengths >345 nm in the presence of the dye, N/C varying from 2.5 to 0.5. Oxidation of all guanine residues to the same extent is observed after piperidine treatment on both single- and double-stranded ODN. Moreover, unexpectedly, a remarkable sequence-selective cleavage occurring at a 5'-CG-3' site is detected before alkali treatment. This frank break is clearly predominant for a low nucleotide to chromophore molar ratio, corresponding to a self-stacking of the dye along the DNA helix. The electrophoretic properties of the band suggest that this lesion results from a sugar oxidation, which leads via a base release to a ribonolactone residue. The proposal is supported by high-performance liquid chromatography-matrix-assisted laser desorption-ionization mass spectrometry experiments that also reveal other sequence-selective frank scissions of lower intensity at 5'-GC-3' or other 5'-CG-3' sites. This sequence selectivity is discussed with regard to the binding selectivity of cationic porphyrins.
Schlötterer, C; Kofler, R; Versace, E; Tobler, R; Franssen, S U
2015-05-01
Evolve and resequence (E&R) is a new approach to investigate the genomic responses to selection during experimental evolution. By using whole genome sequencing of pools of individuals (Pool-Seq), this method can identify selected variants in controlled and replicable experimental settings. Reviewing the current state of the field, we show that E&R can be powerful enough to identify causative genes and possibly even single-nucleotide polymorphisms. We also discuss how the experimental design and the complexity of the trait could result in a large number of false positive candidates. We suggest experimental and analytical strategies to maximize the power of E&R to uncover the genotype-phenotype link and serve as an important research tool for a broad range of evolutionary questions.
Ryu, J; Lee, C
2016-04-01
Selection signals of Korean cattle might be attributed largely to artificial selection for meat quality. Rapidly increased intragenic markers of newly annotated genes in the bovine genome would help overcome limited findings of genetic markers associated with meat quality at the selection signals in a previous study. The present study examined genetic associations of marbling score (MS) with intragenic nucleotide variants at selection signals of Korean cattle. A total of 39 092 nucleotide variants of 407 Korean cattle were utilized in the association analysis. A total of 129 variants were selected within newly annotated genes in the bovine genome. Their genetic associations were analyzed using the mixed model with random polygenic effects based on identical-by-state genetic relationships among animals in order to control for spurious associations produced by population structure. Genetic associations of MS were found (P<3.88×10-4) with six intragenic nucleotide variants on bovine autosomes 3 (cache domain containing 1, CACHD1), 5 (like-glycosyltransferase, LARGE), 16 (cell division cycle 42 binding protein kinase alpha, CDC42BPA) and 21 (snurportin 1, SNUPN; protein tyrosine phosphatase, non-receptor type 9, PTPN9; chondroitin sulfate proteoglycan 4, CSPG4). In particular, the genetic associations with CDC42BPA and LARGE were confirmed using an independent data set of Korean cattle. The results implied that allele frequencies of functional variants and their proximity variants have been augmented by directional selection for greater MS and remain selection signals in the bovine genome. Further studies of fine mapping would be useful to incorporate favorable alleles in marker-assisted selection for MS of Korean cattle.
Lühr, B; Scheller, J; Meyer, P; Kramer, W
1998-02-01
We have analysed the correction of defined mismatches in wild-type and msh2, msh3, msh6 and msh3 msh6 mutants of Saccharomyces cerevisiae in two different yeast strain backgrounds by transformation with plasmid heteroduplex DNA constructs. Ten different base/base mismatches, two single-nucleotide loops and a 38-nucleotide loop were tested. Repair of all types of mismatches was severely impaired in msh2 and msh3 msh6 mutants. In msh6 mutants, repair efficiency of most base/base mismatches was reduced to a similar extent as in msh3 msh6 double mutants. G/T and A/C mismatches, however, displayed residual repair in msh6 mutants in one strain background, implying a role for Msh3p in recognition of base/base mismatches. Furthermore, the efficiency of repair of base/base mismatches was considerably reduced in msh3 mutants in one strain background, indicating a requirement for MSH3 for fully efficient mismatch correction. Also the efficiency of repair of the 38-nucleotide loop was reduced in msh3 mutants, and to a lesser extent in msh6 mutants. The single-nucleotide loop with an unpaired A was less efficiently repaired in msh3 mutants and that with an unpaired T was less efficiently corrected in msh6 mutants, indicating non-redundant functions for the two proteins in the recognition of single-nucleotide loops.
Richardson, Kris; Schnitzler, Gavin R; Lai, Chao-Qiang; Ordovas, Jose M
2015-12-01
Cardiovascular disease and type 2 diabetes mellitus represent overlapping diseases where a large portion of the variation attributable to genetics remains unexplained. An important player in their pathogenesis is peroxisome proliferator-activated receptor γ (PPARγ) that is involved in lipid and glucose metabolism and maintenance of metabolic homeostasis. We used a functional genomics methodology to interrogate human chromatin immunoprecipitation-sequencing, genome-wide association studies, and expression quantitative trait locus data to inform selection of candidate functional single nucleotide polymorphisms (SNPs) falling in PPARγ motifs. We derived 27 328 chromatin immunoprecipitation-sequencing peaks for PPARγ in human adipocytes through meta-analysis of 3 data sets. The PPARγ consensus motif showed greatest enrichment and mapped to 8637 peaks. We identified 146 SNPs in these motifs. This number was significantly less than would be expected by chance, and Inference of Natural Selection from Interspersed Genomically coHerent elemenTs analysis indicated that these motifs are under weak negative selection. A screen of these SNPs against genome-wide association studies for cardiometabolic traits revealed significant enrichment with 16 SNPs. A screen against the MuTHER expression quantitative trait locus data revealed 8 of these were significantly associated with altered gene expression in human adipose, more than would be expected by chance. Several SNPs fall close, or are linked by expression quantitative trait locus to lipid-metabolism loci including CYP26A1. We demonstrated the use of functional genomics to identify SNPs of potential function. Specifically, that SNPs within PPARγ motifs that bind PPARγ in adipocytes are significantly associated with cardiometabolic disease and with the regulation of transcription in adipose. This method may be used to uncover functional SNPs that do not reach significance thresholds in the agnostic approach of genome-wide association studies. © 2015 American Heart Association, Inc.
Jing, Yaling; Wang, Tao; Chen, Zuyi; Ding, Xianping; Xu, Jianju; Mu, Xuemei; Cao, Man; Chen, Honghan
2018-01-01
Globally, human papillomavirus (HPV)-56 accounts for a small proportion of all high-risk HPV types; however, HPV-56 is detected at a higher rate in Asia, particularly in southwest China. The present study analyzed polymorphisms, intratypic variants, and genetic variability in the long control regions (LCR), E6, E7, and L1 of HPV-56 (n=75). The LCRs, E6, E7 and L1 were sequenced using a polymerase chain reaction and the sequences were submitted to GenBank. Maximum-likelihood trees were constructed using Kimura's two-parameter model, followed by secondary structure analysis and protein damaging prediction. Additionally, in order to assess the effect of variations in the LCR on putative binding sites for cellular proteins, MATCH server was used. Finally, the selection pressures of the E6-E7 and L1 genes were estimated. A total of 18 point substitutions, a 42-bp deletion and a 19-bp deletion of LCR were identified. Some of those mutations are embedded in the putative binding sites for transcription factors. 18 single nucleotide changes occurred in the E6-E7 sequence, 11/18 were non-synonymous substitutions and 7/18 were synonymous mutations. A total 24 single nucleotide changes were identified in the L1 sequence, 6/24 being non-synonymous mutations and 18/24 synonymous mutations. Selective pressure analysis predicted that the majority of mutations of HPV-56 E6, E7 and L1 were of positive selection. The phylogenetic tree demonstrated that the isolates distributed in two lineages. Data on the prevalence and genetic variation of HPV-56 types in southwest China may aid future studies on viral molecular mechanisms and contribute to future investigations of diagnostic probes and therapeutic vaccines. PMID:29568922
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tucker, Susan L., E-mail: sltucker@mdanderson.org; Li Minghuan; Xu Ting
2013-01-01
Purpose: To determine whether single-nucleotide polymorphisms (SNPs) in genes associated with DNA repair, cell cycle, transforming growth factor-{beta}, tumor necrosis factor and receptor, folic acid metabolism, and angiogenesis can significantly improve the fit of the Lyman-Kutcher-Burman (LKB) normal-tissue complication probability (NTCP) model of radiation pneumonitis (RP) risk among patients with non-small cell lung cancer (NSCLC). Methods and Materials: Sixteen SNPs from 10 different genes (XRCC1, XRCC3, APEX1, MDM2, TGF{beta}, TNF{alpha}, TNFR, MTHFR, MTRR, and VEGF) were genotyped in 141 NSCLC patients treated with definitive radiation therapy, with or without chemotherapy. The LKB model was used to estimate the risk ofmore » severe (grade {>=}3) RP as a function of mean lung dose (MLD), with SNPs and patient smoking status incorporated into the model as dose-modifying factors. Multivariate analyses were performed by adding significant factors to the MLD model in a forward stepwise procedure, with significance assessed using the likelihood-ratio test. Bootstrap analyses were used to assess the reproducibility of results under variations in the data. Results: Five SNPs were selected for inclusion in the multivariate NTCP model based on MLD alone. SNPs associated with an increased risk of severe RP were in genes for TGF{beta}, VEGF, TNF{alpha}, XRCC1 and APEX1. With smoking status included in the multivariate model, the SNPs significantly associated with increased risk of RP were in genes for TGF{beta}, VEGF, and XRCC3. Bootstrap analyses selected a median of 4 SNPs per model fit, with the 6 genes listed above selected most often. Conclusions: This study provides evidence that SNPs can significantly improve the predictive ability of the Lyman MLD model. With a small number of SNPs, it was possible to distinguish cohorts with >50% risk vs <10% risk of RP when they were exposed to high MLDs.« less
Posada, David
2006-01-01
ModelTest server is a web-based application for the selection of models of nucleotide substitution using the program ModelTest. The server takes as input a text file with likelihood scores for the set of candidate models. Models can be selected with hierarchical likelihood ratio tests, or with the Akaike or Bayesian information criteria. The output includes several statistics for the assessment of model selection uncertainty, for model averaging or to estimate the relative importance of model parameters. The server can be accessed at . PMID:16845102
Demonstration of Protein-Based Human Identification Using the Hair Shaft Proteome
Leppert, Tami; Anex, Deon S.; Hilmer, Jonathan K.; Matsunami, Nori; Baird, Lisa; Stevens, Jeffery; Parsawar, Krishna; Durbin-Johnson, Blythe P.; Rocke, David M.; Nelson, Chad; Fairbanks, Daniel J.; Wilson, Andrew S.; Rice, Robert H.; Woodward, Scott R.; Bothner, Brian; Hart, Bradley R.; Leppert, Mark
2016-01-01
Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 single nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). This study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts. PMID:27603779
Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K
2017-04-01
There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Mohammed, Manal A F; Galbraith, Sareen E; Radford, Alan D; Dove, Winifred; Takasaki, Tomohiko; Kurane, Ichiro; Solomon, Tom
2011-07-01
Japanese encephalitis virus (JEV) is the most important cause of epidemic encephalitis worldwide but its origin is unknown. Epidemics of encephalitis suggestive of Japanese encephalitis (JE) were described in Japan from the 1870s onwards. Four genotypes of JEV have been characterised and representatives of each genotype have been fully sequenced. Based on limited information, a single isolate from Malaysia is thought to represent a putative fifth genotype. We have determined the complete nucleotide and amino acid sequence of Muar strain and compared it with other fully sequenced JEV genomes. Muar was the least similar, with nucleotide divergence ranging from 20.2 to 21.2% and amino acid divergence ranging from 8.5 to 9.9%. Phylogenetic analysis of Muar strain revealed that it does represent a distinct fifth genotype of JEV. We elucidated Muar signature amino acids in the envelope (E) protein, including E327 Glu on the exposed lateral surface of the putative receptor binding domain which distinguishes Muar strain from the other four genotypes. Evolutionary analysis of full-length JEV genomes revealed that the mean evolutionary rate is 4.35 × 10(-4) (3.4906 × 10(-4) to 5.303 × 10(-4)) nucleotides substitutions per site per year and suggests JEV originated from its ancestral virus in the mid 1500s in the Indonesia-Malaysia region and evolved there into different genotypes, which then spread across Asia. No strong evidence for positive selection was found between JEV strains of the five genotypes and the E gene has generally been subjected to strong purifying selection. Copyright © 2011 Elsevier B.V. All rights reserved.
Zimmerman, Matthew D.; Proudfoot, Michael; Yakunin, Alexander; Minor, Wladek
2008-01-01
Summary HD-domain phosphohydrolases have nucleotidase and phosphodiesterase activities and play important roles in the metabolism of nucleotides and in signaling. We present three 2.1 Å resolution crystal structures (one in the free state and two complexed with natural substrates) of a HD-domain phosphohydrolase, the E. coli 5′-nucleotidase YfbR. The free-state structure of YfbR contains a large cavity accommodating the metal-coordinating HD motif (H33, H68, D69, and D137) and other conserved residues (R18, E72, and D77). Alanine scanning mutagenesis confirms that these residues are important for activity. Two structures of the catalytically inactive mutant E72A complexed with Co2+ and either TMP or dAMP disclose the novel binding mode of deoxyribonucleotides in the active site. Residue R18 stabilizes the phosphate on the Co2+, and residue D77 forms a strong hydrogen bond critical for binding the ribose. The indole side chain of W19 is located close to the 2′-carbon atom of the deoxyribose moiety and is proposed to act as the selectivity switch for deoxyribonucleotide, which is supported by comparison to YfdR, another 5′-nucleotidase in E. coli. The nucleotide bases of both dAMP and TMP make no specific hydrogen bonds with the protein, explaining the lack of nucleotide base selectivity. The YfbR E72A substrate complex structures also suggest a plausible single-step nucleophilic substitution mechanism. This is the first proposed molecular mechanism for a HD-domain phosphohydrolase based directly on substrate-bound crystal structures. PMID:18353368
Orozco-terWengel, Pablo; Kapun, Martin; Nolte, Viola; Kofler, Robert; Flatt, Thomas; Schlötterer, Christian
2012-10-01
The genomic basis of adaptation to novel environments is a fundamental problem in evolutionary biology that has gained additional importance in the light of the recent global change discussion. Here, we combined laboratory natural selection (experimental evolution) in Drosophila melanogaster with genome-wide next generation sequencing of DNA pools (Pool-Seq) to identify alleles that are favourable in a novel laboratory environment and traced their trajectories during the adaptive process. Already after 15 generations, we identified a pronounced genomic response to selection, with almost 5000 single nucleotide polymorphisms (SNP; genome-wide false discovery rates < 0.005%) deviating from neutral expectation. Importantly, the evolutionary trajectories of the selected alleles were heterogeneous, with the alleles falling into two distinct classes: (i) alleles that continuously rise in frequency; and (ii) alleles that at first increase rapidly but whose frequencies then reach a plateau. Our data thus suggest that the genomic response to selection can involve a large number of selected SNPs that show unexpectedly complex evolutionary trajectories, possibly due to nonadditive effects. © 2012 Blackwell Publishing Ltd.
Zeron-Medina, Jorge; Wang, Xuting; Repapi, Emmanouela; Campbell, Michelle R.; Su, Dan; Castro-Giner, Francesc; Davies, Benjamin; Peterse, Elisabeth F.P.; Sacilotto, Natalia; Walker, Graeme J.; Terzian, Tamara; Tomlinson, Ian P.; Box, Neil F.; Meinshausen, Nicolai; De Val, Sarah; Bell, Douglas A.; Bond, Gareth L.
2014-01-01
SUMMARY The ability of p53 to regulate transcription is crucial for tumor suppression and implies that inherited polymorphisms in functional p53-binding sites could influence cancer. Here, we identify a polymorphic p53 responsive element and demonstrate its influence on cancer risk using genome-wide data sets of cancer susceptibility loci, genetic variation, p53 occupancy, and p53-binding sites. We uncover a single-nucleotide polymorphism (SNP) in a functional p53-binding site and establish its influence on the ability of p53 to bind to and regulate transcription of the KITLG gene. The SNP resides in KITLG and associates with one of the largest risks identified among cancer genome-wide association studies. We establish that the SNP has undergone positive selection throughout evolution, signifying a selective benefit, but go on to show that similar SNPs are rare in the genome due to negative selection, indicating that polymorphisms in p53-binding sites are primarily detrimental to humans. PMID:24120139
Signatures of negative selection in the genetic architecture of human complex traits.
Zeng, Jian; de Vlaming, Ronald; Wu, Yang; Robinson, Matthew R; Lloyd-Jones, Luke R; Yengo, Loic; Yap, Chloe X; Xue, Angli; Sidorenko, Julia; McRae, Allan F; Powell, Joseph E; Montgomery, Grant W; Metspalu, Andres; Esko, Tonu; Gibson, Greg; Wray, Naomi R; Visscher, Peter M; Yang, Jian
2018-05-01
We develop a Bayesian mixed linear model that simultaneously estimates single-nucleotide polymorphism (SNP)-based heritability, polygenicity (proportion of SNPs with nonzero effects), and the relationship between SNP effect size and minor allele frequency for complex traits in conventionally unrelated individuals using genome-wide SNP data. We apply the method to 28 complex traits in the UK Biobank data (N = 126,752) and show that on average, 6% of SNPs have nonzero effects, which in total explain 22% of phenotypic variance. We detect significant (P < 0.05/28) signatures of natural selection in the genetic architecture of 23 traits, including reproductive, cardiovascular, and anthropometric traits, as well as educational attainment. The significant estimates of the relationship between effect size and minor allele frequency in complex traits are consistent with a model of negative (or purifying) selection, as confirmed by forward simulation. We conclude that negative selection acts pervasively on the genetic variants associated with human complex traits.
Murillo, Gabriel H; You, Na; Su, Xiaoquan; Cui, Wei; Reilly, Muredach P; Li, Mingyao; Ning, Kang; Cui, Xinping
2016-05-15
Single nucleotide variant (SNV) detection procedures are being utilized as never before to analyze the recent abundance of high-throughput DNA sequencing data, both on single and multiple sample datasets. Building on previously published work with the single sample SNV caller genotype model selection (GeMS), a multiple sample version of GeMS (MultiGeMS) is introduced. Unlike other popular multiple sample SNV callers, the MultiGeMS statistical model accounts for enzymatic substitution sequencing errors. It also addresses the multiple testing problem endemic to multiple sample SNV calling and utilizes high performance computing (HPC) techniques. A simulation study demonstrates that MultiGeMS ranks highest in precision among a selection of popular multiple sample SNV callers, while showing exceptional recall in calling common SNVs. Further, both simulation studies and real data analyses indicate that MultiGeMS is robust to low-quality data. We also demonstrate that accounting for enzymatic substitution sequencing errors not only improves SNV call precision at low mapping quality regions, but also improves recall at reference allele-dominated sites with high mapping quality. The MultiGeMS package can be downloaded from https://github.com/cui-lab/multigems xinping.cui@ucr.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Knüppel, Sven; Meidtner, Karina; Arregui, Maria; Holzhütter, Hermann-Georg; Boeing, Heiner
2015-07-01
Analyzing multiple single nucleotide polymorphisms (SNPs) is a promising approach to finding genetic effects beyond single-locus associations. We proposed the use of multilocus stepwise regression (MSR) to screen for allele combinations as a method to model joint effects, and compared the results with the often used genetic risk score (GRS), conventional stepwise selection, and the shrinkage method LASSO. In contrast to MSR, the GRS, conventional stepwise selection, and LASSO model each genotype by the risk allele doses. We reanalyzed 20 unlinked SNPs related to type 2 diabetes (T2D) in the EPIC-Potsdam case-cohort study (760 cases, 2193 noncases). No SNP-SNP interactions and no nonlinear effects were found. Two SNP combinations selected by MSR (Nagelkerke's R² = 0.050 and 0.048) included eight SNPs with mean allele combination frequency of 2%. GRS and stepwise selection selected nearly the same SNP combinations consisting of 12 and 13 SNPs (Nagelkerke's R² ranged from 0.020 to 0.029). LASSO showed similar results. The MSR method showed the best model fit measured by Nagelkerke's R² suggesting that further improvement may render this method a useful tool in genetic research. However, our comparison suggests that the GRS is a simple way to model genetic effects since it does not consider linkage, SNP-SNP interactions, and no non-linear effects. © 2015 John Wiley & Sons Ltd/University College London.
Simultaneous determination of nucleotide sugars with ion-pair reversed-phase HPLC.
Nakajima, Kazuki; Kitazume, Shinobu; Angata, Takashi; Fujinawa, Reiko; Ohtsubo, Kazuaki; Miyoshi, Eiji; Taniguchi, Naoyuki
2010-07-01
Nucleotide sugars are important in determining cell surface glycoprotein glycosylation, which can modulate cellular properties such as growth and arrest. We have developed a conventional HPLC method for simultaneous determination of nucleotide sugars. A mixture of nucleotide sugars (CMP-NeuAc, UDP-Gal, UDP-Glc, UDP-GalNAc, UDP-GlcNAc, GDP-Man, GDP-Fuc and UDP-GlcUA) and relevant nucleotides were perfectly separated in an optimized ion-pair reversed-phase mode using Inertsil ODS-4 and ODS-3 columns. The newly developed method enabled us to determine the nucleotide sugars in cellular extracts from 1 x 10(6) cells in a single run. We applied this method to characterize nucleotide sugar levels in breast and pancreatic cancer cell lines and revealed that the abundance of UDP-GlcNAc, UDP-GalNAc, UDP-GlcUA and GDP-Fuc were a cell-type-specific feature. To determine the physiological significance of changes in nucleotide sugar levels, we analyzed their changes by glucose deprivation and found that the determination of nucleotide sugar levels provided us with valuable information with respect to studying the overview of cellular glycosylation status.
Concerted evolution of life stage performances signals recent selection on yeast nitrogen use.
Ibstedt, Sebastian; Stenberg, Simon; Bagés, Sara; Gjuvsland, Arne B; Salinas, Francisco; Kourtchenko, Olga; Samy, Jeevan K A; Blomberg, Anders; Omholt, Stig W; Liti, Gianni; Beltran, Gemma; Warringer, Jonas
2015-01-01
Exposing natural selection driving phenotypic and genotypic adaptive differentiation is an extraordinary challenge. Given that an organism's life stages are exposed to the same environmental variations, we reasoned that fitness components, such as the lag, rate, and efficiency of growth, directly reflecting performance in these life stages, should often be selected in concert. We therefore conjectured that correlations between fitness components over natural isolates, in a particular environmental context, would constitute a robust signal of recent selection. Critically, this test for selection requires fitness components to be determined by different genetic loci. To explore our conjecture, we exhaustively evaluated the lag, rate, and efficiency of asexual population growth of natural isolates of the model yeast Saccharomyces cerevisiae in a large variety of nitrogen-limited environments. Overall, fitness components were well correlated under nitrogen restriction. Yeast isolates were further crossed in all pairwise combinations and coinheritance of each fitness component and genetic markers were traced. Trait variations tended to map to quantitative trait loci (QTL) that were private to a single fitness component. We further traced QTLs down to single-nucleotide resolution and uncovered loss-of-function mutations in RIM15, PUT4, DAL1, and DAL4 as the genetic basis for nitrogen source use variations. Effects of SNPs were unique for a single fitness component, strongly arguing against pleiotropy between lag, rate, and efficiency of reproduction under nitrogen restriction. The strong correlations between life stage performances that cannot be explained by pleiotropy compellingly support adaptive differentiation of yeast nitrogen source use and suggest a generic approach for detecting selection. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Population Structure Shapes Copy Number Variation in Malaria Parasites.
Cheeseman, Ian H; Miller, Becky; Tan, John C; Tan, Asako; Nair, Shalini; Nkhoma, Standwell C; De Donato, Marcos; Rodulfo, Hectorina; Dondorp, Arjen; Branch, Oralee H; Mesia, Lastenia Ruiz; Newton, Paul; Mayxay, Mayfong; Amambua-Ngwa, Alfred; Conway, David J; Nosten, François; Ferdig, Michael T; Anderson, Tim J C
2016-03-01
If copy number variants (CNVs) are predominantly deleterious, we would expect them to be more efficiently purged from populations with a large effective population size (Ne) than from populations with a small Ne. Malaria parasites (Plasmodium falciparum) provide an excellent organism to examine this prediction, because this protozoan shows a broad spectrum of population structures within a single species, with large, stable, outbred populations in Africa, small unstable inbred populations in South America and with intermediate population characteristics in South East Asia. We characterized 122 single-clone parasites, without prior laboratory culture, from malaria-infected patients in seven countries in Africa, South East Asia and South America using a high-density single-nucleotide polymorphism/CNV microarray. We scored 134 high-confidence CNVs across the parasite exome, including 33 deletions and 102 amplifications, which ranged in size from <500 bp to 59 kb, as well as 10,107 flanking, biallelic single-nucleotide polymorphisms. Overall, CNVs were rare, small, and skewed toward low frequency variants, consistent with the deleterious model. Relative to African and South East Asian populations, CNVs were significantly more common in South America, showed significantly less skew in allele frequencies, and were significantly larger. On this background of low frequency CNV, we also identified several high-frequency CNVs under putative positive selection using an FST outlier analysis. These included known adaptive CNVs containing rh2b and pfmdr1, and several other CNVs (e.g., DNA helicase and three conserved proteins) that require further investigation. Our data are consistent with a significant impact of genetic structure on CNV burden in an important human pathogen. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Structural and Biochemical Determinants of Ligand Binding by the c-di-GMP Riboswitch
DOE Office of Scientific and Technical Information (OSTI.GOV)
Smith, K.; Lipchock, S; Livingston,
2010-01-01
The bacterial second messenger c-di-GMP is used in many species to control essential processes that allow the organism to adapt to its environment. The c-di-GMP riboswitch (GEMM) is an important downstream target in this signaling pathway and alters gene expression in response to changing concentrations of c-di-GMP. The riboswitch selectively recognizes its second messenger ligand primarily through contacts with two critical nucleotides. However, these two nucleotides are not the most highly conserved residues within the riboswitch sequence. Instead, nucleotides that stack with c-di-GMP and that form tertiary RNA contacts are the most invariant. Biochemical and structural evidence reveals that themore » most common natural variants are able to make alternative pairing interactions with both guanine bases of the ligand. Additionally, a high-resolution (2.3 {angstrom}) crystal structure of the native complex reveals that a single metal coordinates the c-di-GMP backbone. Evidence is also provided that after transcription of the first nucleotide on the 3{prime}-side of the P1 helix, which is predicted to be the molecular switch, the aptamer is functional for ligand binding. Although large energetic effects occur when several residues in the RNA are altered, mutations at the most conserved positions, rather than at positions that base pair with c-di-GMP, have the most detrimental effects on binding. Many mutants retain sufficient c-di-GMP affinity for the RNA to remain biologically relevant, which suggests that this motif is quite resilient to mutation.« less
Zhang, Wenting; Mirlohi, Shirin; Li, Xiaorong; He, Yuke
2018-06-01
Leaf traits affect plant agronomic performance; for example, leaf hair number provides a morphological indicator of drought and insect resistance. Brassica rapa crops have diverse phenotypes, and many B. rapa single-nucleotide polymorphisms (SNPs) have been identified and used as molecular markers for plant breeding. However, which SNPs are functional for leaf hair traits and, therefore, effective for breeding purposes remains unknown. Here, we identify a set of SNPs in the B. rapa ssp. pekinenesis candidate gene BrpHAIRY LEAVES1 ( BrpHL1 ) and a number of SNPs of BrpHL1 in a natural population of 210 B. rapa accessions that have hairy, margin-only hairy, and hairless leaves. BrpHL1 genes and their orthologs and paralogs have many SNPs. By intensive mutagenesis and genetic transformation, we selected the functional SNPs for leaf hairs by the exclusion of nonfunctional SNPs and the orthologous and paralogous genes. The residue tryptophan-92 of BrpHL1a was essential for direct interaction with GLABROUS3 and, thus, necessary for the formation of leaf hairs. The accessions with the functional SNP leading to substitution of the tryptophan-92 residue had hairless leaves. The orthologous BrcHL1b from B. rapa ssp. chinensis regulates hair formation on leaf margins rather than leaf surfaces. The selected SNP for the hairy phenotype could be adopted as a molecular marker for insect resistance in Brassica spp. crops. Moreover, the procedures optimized here can be used to explain the molecular mechanisms of natural variation and to facilitate the molecular breeding of many crops. © 2018 American Society of Plant Biologists. All rights reserved.
Heritability, linkage, and genetic associations of exercise treadmill test responses.
Ingelsson, Erik; Larson, Martin G; Vasan, Ramachandran S; O'Donnell, Christopher J; Yin, Xiaoyan; Hirschhorn, Joel N; Newton-Cheh, Christopher; Drake, Jared A; Musone, Stacey L; Heard-Costa, Nancy L; Benjamin, Emelia J; Levy, Daniel; Atwood, Larry D; Wang, Thomas J; Kathiresan, Sekar
2007-06-12
The blood pressure (BP) and heart rate responses to exercise treadmill testing predict incidence of cardiovascular disease, but the genetic determinants of hemodynamic and chronotropic responses to exercise are largely unknown. We assessed systolic BP, diastolic BP, and heart rate during the second stage of the Bruce protocol and at the third minute of recovery in 2982 Framingham Offspring participants (mean age 43 years; 53% women). With use of residuals from multivariable models adjusted for clinical correlates of exercise treadmill testing responses, we estimated the heritability (variance-components methods), genetic linkage (multipoint quantitative trait analyses), and association with 235 single-nucleotide polymorphisms in 14 candidate genes selected a priori from neurohormonal pathways for their potential role in exercise treadmill testing responses. Heritability estimates for heart rate during exercise and during recovery were 0.32 and 0.34, respectively. Heritability estimates for BP variables during exercise were 0.25 and 0.26 (systolic and diastolic BP) and during recovery, 0.16 and 0.13 (systolic and diastolic BP), respectively. Suggestive linkage was found for systolic BP during recovery from exercise (locus 1q43-44, log-of-the-odds score 2.59) and diastolic BP during recovery from exercise (locus 4p15.3, log-of-the-odds score 2.37). Among 235 single-nucleotide polymorphisms tested for association with exercise treadmill testing responses, the minimum nominal probability value was 0.003, which was nonsignificant after adjustment for multiple testing. Hemodynamic and chronotropic responses to exercise are heritable and demonstrate suggestive linkage to select loci. Genetic mapping with newer approaches such as genome-wide association may yield novel insights into the physiological responses to exercise.
Namroud, Marie-Claire; Beaulieu, Jean; Juge, Nicolas; Laroche, Jérôme; Bousquet, Jean
2008-01-01
Conifers are characterized by a large genome size and a rapid decay of linkage disequilibrium, most often within gene limits. Genome scans based on noncoding markers are less likely to detect molecular adaptation linked to genes in these species. In this study, we assessed the effectiveness of a genome-wide single nucleotide polymorphism (SNP) scan focused on expressed genes in detecting local adaptation in a conifer species. Samples were collected from six natural populations of white spruce (Picea glauca) moderately differentiated for several quantitative characters. A total of 534 SNPs representing 345 expressed genes were analysed. Genes potentially under natural selection were identified by estimating the differentiation in SNP frequencies among populations (FST) and identifying outliers, and by estimating local differentiation using a Bayesian approach. Both average expected heterozygosity and population differentiation estimates (HE = 0.270 and FST = 0.006) were comparable to those obtained with other genetic markers. Of all genes, 5.5% were identified as outliers with FST at the 95% confidence level, while 14% were identified as candidates for local adaptation with the Bayesian method. There was some overlap between the two gene sets. More than half of the candidate genes for local adaptation were specific to the warmest population, about 20% to the most arid population, and 15% to the coldest and most humid higher altitude population. These adaptive trends were consistent with the genes’ putative functions and the divergence in quantitative traits noted among the populations. The results suggest that an approach separating the locus and population effects is useful to identify genes potentially under selection. These candidates are worth exploring in more details at the physiological and ecological levels. PMID:18662225
Xiao, Shijun; Wang, Panpan; Dong, Linsong; Zhang, Yaguang; Han, Zhaofang; Wang, Qiurong
2016-01-01
Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for the association and conservation studies. Genome-wide SNP development in many teleost species are still challenging because of the genome complexity and the cost of re-sequencing. Genotyping-By-Sequencing (GBS) provided an efficient reduced representative method to squeeze cost for SNP detection; however, most of recent GBS applications were reported on plant organisms. In this work, we used an EcoRI-NlaIII based GBS protocol to teleost large yellow croaker, an important commercial fish in China and East-Asia, and reported the first whole-genome SNP development for the species. 69,845 high quality SNP markers that evenly distributed along genome were detected in at least 80% of 500 individuals. Nearly 95% randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. The association studies with the muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, contributing as high up to ∼63% genetic variance that explained by all markers. Functional genes that involved in fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, PPT2 Gene, previously identified in the association study of the plasma n-3 and n-6 polyunsaturated fatty acid level in human, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS could produce quality SNP markers in a cost-efficient manner in teleost genome. The developed SNP markers and the EPA and DHA associated SNP loci provided invaluable resources for the population structure, conservation genetics and genomic selection of large yellow croaker and other fish organisms. PMID:28028455
Piao, Wei; Wang, Li; Zhang, Ting; Wang, Zhen; Shangguan, Shaofang; Sun, Jing; Huo, Junsheng
2017-01-01
Associations between genetic variants in the hepcidin regulation pathway and iron status have been reported in previous studies. Most of these studies were conducted in populations of European descent and relatively few studies have been conducted in Chinese populations. In this study, we evaluated associations between single-nucleotide polymorphisms (SNPs) in the hepcidin regulation pathway, serum ferritin (SF) and soluble transferrin receptor (sTfR) in Chinese adolescents. In total, 692 students from rural boarding schools were selected from six cities in China. The participants were divided into case and control groups according to criteria for SF and sTfR. Furthermore, 33 SNPs in TMPRSS6, TF, TFR2, BMP2, BMP4, HJV, CYBRD1, HFE, IL6, PCSK7, HAMP, KIAA1468, and SRPRB were selected. Associations between the genetic variants and SF or sTfR were detected. For SF, rs4820268 in TMPRSS6 was associated with an SF <25 ng/mL status. Carriers of the G/G genotype of rs4820268 exhibited significantly lower SF levels than A allele carriers did (p=0.047). For sTfR, rs1880669 in TF, rs4901474 in BMP4, and rs7536827 in HJV were significantly associated with an sTfR >=4.4 mg/L status. However, in general linear model analysis, after adjustment for age, sex, and location, only rs1880669 exhibited a stable association with higher sTfR levels (p=0.032). We found rs4820268, in TMPRSS6 that was associated with a low SF level, as previously reported, and a new association between 1880669 in TF and sTfR.
Masiran, Ruziana; Sidi, Hatta; Mohamed, Zahurin; Mohd Nazree, Nur Elia; Nik Jaafar, Nik Ruzyanei; Midin, Marhani; Das, Srijit; Mohamed Saini, Suriati
2014-04-01
Selective serotonin reuptake inhibitors (SSRIs) are known for their sexual side effects. Different SSRIs may affect different areas of sexual function at different rates. The study aimed to determine the prevalence of female sexual dysfunction (FSD), its clinical correlates, and association with 5HT2A (rs6311) single nucleotide polymorphisms (SNPs) in patients with major depressive disorder (MDD) who were on SSRI therapy. This was a cross-sectional study on 95 female outpatients with MDD treated with SSRI. The patients were in remission as determined by Montgomery-Asberg Depression Rating Scale. Genomic DNA was isolated from buccal swabs and samples were processed using a real time polymerase chain reaction. The presence or absence of FSD as measured by the Malay Version of Female Sexual Function Index and 5HT2A-1438 G/A (rs6311) SNP. The overall prevalence of FSD was 32.6%. After controlling for age, number of children, education level, total monthly income, SSRI types, and SSRI dosing, being employed significantly enhanced FSD by 4.5 times (odds ratio [OR] = 4.51; 95% confidence interval [CI] 1.00, 20.30; P = 0.05). Those having marital problems were 6.7 times more likely to have FSD (OR = 6.67; 95% CI 1.57, 28.34). 5HT2A-1438 G/A (rs6311) SNP was not significantly associated with FSD. There was no significant association between FSD and the 5HT2A (rs6311) SNP in patients with MDD on SSRI therapy. Employment status and marital state were significantly associated with FSD among these patients. © 2014 International Society for Sexual Medicine.
Li, M-H; Tiirikka, T; Kantanen, J
2014-01-01
In sheep, coat colour (and pattern) is one of the important traits of great biological, economic and social importance. However, the genetics of sheep coat colour has not yet been fully clarified. We conducted a genome-wide association study of sheep coat colours by genotyping 47 303 single-nucleotide polymorphisms (SNPs) in the Finnsheep population in Finland. We identified 35 SNPs associated with all the coat colours studied, which cover genomic regions encompassing three known pigmentation genes (TYRP1, ASIP and MITF) in sheep. Eighteen of these associations were confirmed in further tests between white versus non-white individuals, but none of the 35 associations were significant in the analysis of only non-white colours. Across the tests, the s66432.1 in ASIP showed significant association (P=4.2 × 10−11 for all the colours; P=2.3 × 10−11 for white versus non-white colours) with the variation in coat colours and strong linkage disequilibrium with other significant variants surrounding the ASIP gene. The signals detected around the ASIP gene were explained by differences in white versus non-white alleles. Further, a genome scan for selection for white coat pigmentation identified a strong and striking selection signal spanning ASIP. Our study identified the main candidate gene for the coat colour variation between white and non-white as ASIP, an autosomal gene that has been directly implicated in the pathway regulating melanogenesis. Together with ASIP, the two other newly identified genes (TYRP1 and MITF) in the Finnsheep, bordering associated SNPs, represent a new resource for enriching sheep coat-colour genetics and breeding. PMID:24022497
2013-01-01
Background Obesity, excess fat tissue in the body, can underlie a variety of medical complaints including heart disease, stroke and cancer. The pig is an excellent model organism for the study of various human disorders, including obesity, as well as being the foremost agricultural species. In order to identify genetic variants associated with fatness, we used a selective genomic approach sampling DNA from animals at the extreme ends of the fat and lean spectrum using estimated breeding values derived from a total population size of over 70,000 animals. DNA from 3 breeds (Sire Line Large White, Duroc and a white Pietrain composite line (Titan)) was used to interrogate the Illumina Porcine SNP60 Genotyping Beadchip in order to identify significant associations in terms of single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Results By sampling animals at each end of the fat/lean EBV (estimate breeding value) spectrum the whole population could be assessed using less than 300 animals, without losing statistical power. Indeed, several significant SNPs (at the 5% genome wide significance level) were discovered, 4 of these linked to genes with ontologies that had previously been correlated with fatness (NTS, FABP6, SST and NR3C2). Quantitative analysis of the data identified putative CNV regions containing genes whose ontology suggested fatness related functions (MCHR1, PPARα, SLC5A1 and SLC5A4). Conclusions Selective genotyping of EBVs at either end of the phenotypic spectrum proved to be a cost effective means of identifying SNPs and CNVs associated with fatness and with estimated major effects in a large population of animals. PMID:24225222
A single splice site mutation in human-specific ARHGAP11B causes basal progenitor amplification
Florio, Marta; Namba, Takashi; Pääbo, Svante; Hiller, Michael; Huttner, Wieland B.
2016-01-01
The gene ARHGAP11B promotes basal progenitor amplification and is implicated in neocortex expansion. It arose on the human evolutionary lineage by partial duplication of ARHGAP11A, which encodes a Rho guanosine triphosphatase–activating protein (RhoGAP). However, a lack of 55 nucleotides in ARHGAP11B mRNA leads to loss of RhoGAP activity by GAP domain truncation and addition of a human-specific carboxy-terminal amino acid sequence. We show that these 55 nucleotides are deleted by mRNA splicing due to a single C→G substitution that creates a novel splice donor site. We reconstructed an ancestral ARHGAP11B complementary DNA without this substitution. Ancestral ARHGAP11B exhibits RhoGAP activity but has no ability to increase basal progenitors during neocortex development. Hence, a single nucleotide substitution underlies the specific properties of ARHGAP11B that likely contributed to the evolutionary expansion of the human neocortex. PMID:27957544
Li, Ming; Ohi, Kazutaka; Chen, Chunhui; He, Qinghua; Liu, Jie-Wei; Chen, Chuansheng; Luo, Xiong-Jian; Dong, Qi; Hashimoto, Ryota; Su, Bing
2014-12-01
Hippocampal volume is a key brain structure for learning ability and memory process, and hippocampal atrophy is a recognized biological marker of Alzheimer's disease. However, the genetic bases of hippocampal volume are still unclear although it is a heritable trait. Genome-wide association studies (GWASs) on hippocampal volume have implicated several significantly associated genetic variants in Europeans. Here, to test the contributions of these GWASs identified genetic variants to hippocampal volume in different ethnic populations, we screened the GWAS-identified candidate single-nucleotide polymorphisms in 3 independent healthy Asian brain imaging samples (a total of 990 subjects). The results showed that none of these single-nucleotide polymorphisms were associated with hippocampal volume in either individual or combined Asian samples. The replication results suggested a complexity of genetic architecture for hippocampal volume and potential genetic heterogeneity between different ethnic populations. Copyright © 2014 Elsevier Inc. All rights reserved.
Detecting Single-Nucleotide Substitutions Induced by Genome Editing.
Miyaoka, Yuichiro; Chan, Amanda H; Conklin, Bruce R
2016-08-01
The detection of genome editing is critical in evaluating genome-editing tools or conditions, but it is not an easy task to detect genome-editing events-especially single-nucleotide substitutions-without a surrogate marker. Here we introduce a procedure that significantly contributes to the advancement of genome-editing technologies. It uses droplet digital polymerase chain reaction (ddPCR) and allele-specific hydrolysis probes to detect single-nucleotide substitutions generated by genome editing (via homology-directed repair, or HDR). HDR events that introduce substitutions using donor DNA are generally infrequent, even with genome-editing tools, and the outcome is only one base pair difference in 3 billion base pairs of the human genome. This task is particularly difficult in induced pluripotent stem (iPS) cells, in which editing events can be very rare. Therefore, the technological advances described here have implications for therapeutic genome editing and experimental approaches to disease modeling with iPS cells. © 2016 Cold Spring Harbor Laboratory Press.
Sypabekova, Marzhan; Bekmurzayeva, Aliya; Wang, Ronghui; Li, Yanbin; Nogues, Claude; Kanayeva, Damira
2017-05-01
Rapid detection of Mycobacterium tuberculosis (Mtb), an etiological agent of tuberculosis (TB), is important for global control of this disease. Aptamers have emerged as a potential rival for antibodies in therapeutics, diagnostics and biosensing due to their inherent characteristics. The aim of the current study was to select and characterize single-stranded DNA aptamers against MPT64 protein, one of the predominant secreted proteins of Mtb pathogen. Aptamers specific to MPT64 protein were selected in vitro using systematic evolution of ligands through exponential enrichment (SELEX) method. The selection was started with a pool of ssDNA library with randomized 40-nucleotide region. A total of 10 cycles were performed and seventeen aptamers with unique sequences were identified by sequencing. Dot Blot analysis was performed to monitor the SELEX process and to conduct the preliminary tests on the affinity and specificity of aptamers. Enzyme linked oligonucleotide assay (ELONA) showed that most of the aptamers were specific to the MPT64 protein with a linear correlation of R 2 = 0.94 for the most selective. Using Surface Plasmon Resonance (SPR), dissociation equilibrium constant K D of 8.92 nM was obtained. Bioinformatics analysis of the most specific aptamers revealed the existence of a conserved as well as distinct sequences and possible binding site on MPT64. The specificity was determined by testing non-target ESAT-6 and CFP-10. Negligible cross-reactivity confirmed the high specificity of the selected aptamer. The selected aptamer was further tested on clinical sputum samples using ELONA and had sensitivity and specificity of 91.3% and 90%, respectively. Microscopy, culture positivity and nucleotide amplification methods were used as reference standards. The aptamers studied could be further used for the development of medical diagnostic tools and detection assays for Mtb. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Gomez-Uchida, Daniel; Seeb, James E; Smith, Matt J; Habicht, Christopher; Quinn, Thomas P; Seeb, Lisa W
2011-02-18
Disentangling the roles of geography and ecology driving population divergence and distinguishing adaptive from neutral evolution at the molecular level have been common goals among evolutionary and conservation biologists. Using single nucleotide polymorphism (SNP) multilocus genotypes for 31 sockeye salmon (Oncorhynchus nerka) populations from the Kvichak River, Alaska, we assessed the relative roles of geography (discrete boundaries or continuous distance) and ecology (spawning habitat and timing) driving genetic divergence in this species at varying spatial scales within the drainage. We also evaluated two outlier detection methods to characterize candidate SNPs responding to environmental selection, emphasizing which mechanism(s) may maintain the genetic variation of outlier loci. For the entire drainage, Mantel tests suggested a greater role of geographic distance on population divergence than differences in spawn timing when each variable was correlated with pairwise genetic distances. Clustering and hierarchical analyses of molecular variance indicated that the largest genetic differentiation occurred between populations from distinct lakes or subdrainages. Within one population-rich lake, however, Mantel tests suggested a greater role of spawn timing than geographic distance on population divergence when each variable was correlated with pairwise genetic distances. Variable spawn timing among populations was linked to specific spawning habitats as revealed by principal coordinate analyses. We additionally identified two outlier SNPs located in the major histocompatibility complex (MHC) class II that appeared robust to violations of demographic assumptions from an initial pool of eight candidates for selection. First, our results suggest that geography and ecology have influenced genetic divergence between Alaskan sockeye salmon populations in a hierarchical manner depending on the spatial scale. Second, we found consistent evidence for diversifying selection in two loci located in the MHC class II by means of outlier detection methods; yet, alternative scenarios for the evolution of these loci were also evaluated. Both conclusions argue that historical contingency and contemporary adaptation have likely driven differentiation between Kvichak River sockeye salmon populations, as revealed by a suite of SNPs. Our findings highlight the need for conservation of complex population structure, because it provides resilience in the face of environmental change, both natural and anthropogenic.
2011-01-01
Background Disentangling the roles of geography and ecology driving population divergence and distinguishing adaptive from neutral evolution at the molecular level have been common goals among evolutionary and conservation biologists. Using single nucleotide polymorphism (SNP) multilocus genotypes for 31 sockeye salmon (Oncorhynchus nerka) populations from the Kvichak River, Alaska, we assessed the relative roles of geography (discrete boundaries or continuous distance) and ecology (spawning habitat and timing) driving genetic divergence in this species at varying spatial scales within the drainage. We also evaluated two outlier detection methods to characterize candidate SNPs responding to environmental selection, emphasizing which mechanism(s) may maintain the genetic variation of outlier loci. Results For the entire drainage, Mantel tests suggested a greater role of geographic distance on population divergence than differences in spawn timing when each variable was correlated with pairwise genetic distances. Clustering and hierarchical analyses of molecular variance indicated that the largest genetic differentiation occurred between populations from distinct lakes or subdrainages. Within one population-rich lake, however, Mantel tests suggested a greater role of spawn timing than geographic distance on population divergence when each variable was correlated with pairwise genetic distances. Variable spawn timing among populations was linked to specific spawning habitats as revealed by principal coordinate analyses. We additionally identified two outlier SNPs located in the major histocompatibility complex (MHC) class II that appeared robust to violations of demographic assumptions from an initial pool of eight candidates for selection. Conclusions First, our results suggest that geography and ecology have influenced genetic divergence between Alaskan sockeye salmon populations in a hierarchical manner depending on the spatial scale. Second, we found consistent evidence for diversifying selection in two loci located in the MHC class II by means of outlier detection methods; yet, alternative scenarios for the evolution of these loci were also evaluated. Both conclusions argue that historical contingency and contemporary adaptation have likely driven differentiation between Kvichak River sockeye salmon populations, as revealed by a suite of SNPs. Our findings highlight the need for conservation of complex population structure, because it provides resilience in the face of environmental change, both natural and anthropogenic. PMID:21332997
Seal, B S; Neill, J D; Ridpath, J F
1994-07-01
Caliciviruses are nonenveloped with a polyadenylated genome of approximately 7.6 kb and a single capsid protein. The "RNA Fold" computer program was used to analyze 3'-terminal noncoding sequences of five feline calicivirus (FCV), rabbit hemorrhagic disease virus (RHDV), and two San Miguel sea lion virus (SMSV) isolates. The FCV 3'-terminal sequences are 40-46 nucleotides in length and 72-91% similar. The FCV sequences were predicted to contain two possible duplex structures and one stem-loop structure with free energies of -2.1 to -18.2 kcal/mole. The RHDV genomic 3'-terminal RNA sequences are 54 nucleotides in length and share 49% sequence similarity to homologous regions of the FCV genome. The RHDV sequence was predicted to form two duplex structures in the 3'-terminal noncoding region with a single stem-loop structure, resembling that of FCV. In contrast, the SMSV 1 and 4 genomic 3'-terminal noncoding sequences were 185 and 182 nucleotides in length, respectively. Ten possible duplex structures were predicted with an average structural free energy of -35 kcal/mole. Sequence similarity between the two SMSV isolates was 75%. Furthermore, extensive cloverleaflike structures are predicted in the 3' noncoding region of the SMSV genome, in contrast to the predicted single stem-loop structures of FCV or RHDV.
Base Preferences in Non-Templated Nucleotide Incorporation by MMLV-Derived Reverse Transcriptases
Zajac, Pawel; Islam, Saiful; Hochgerner, Hannah; Lönnerberg, Peter; Linnarsson, Sten
2013-01-01
Reverse transcriptases derived from Moloney Murine Leukemia Virus (MMLV) have an intrinsic terminal transferase activity, which causes the addition of a few non-templated nucleotides at the 3´ end of cDNA, with a preference for cytosine. This mechanism can be exploited to make the reverse transcriptase switch template from the RNA molecule to a secondary oligonucleotide during first-strand cDNA synthesis, and thereby to introduce arbitrary barcode or adaptor sequences in the cDNA. Because the mechanism is relatively efficient and occurs in a single reaction, it has recently found use in several protocols for single-cell RNA sequencing. However, the base preference of the terminal transferase activity is not known in detail, which may lead to inefficiencies in template switching when starting from tiny amounts of mRNA. Here, we used fully degenerate oligos to determine the exact base preference at the template switching site up to a distance of ten nucleotides. We found a strong preference for guanosine at the first non-templated nucleotide, with a greatly reduced bias at progressively more distant positions. Based on this result, and a number of careful optimizations, we report conditions for efficient template switching for cDNA amplification from single cells. PMID:24392002
Urschitz, Johann; Sultan, Omar; Ward, Kenneth
2011-01-01
Objective Various Asian and Pacifific Islander groups have higher prevalence rates of type 2 diabetes and gestational diabetes. This increased incidence is likely to include genetic factors. Single nucleotide polymorphisms in the retinol binding protein 4 gene have been linked to the occurrence of type 2 diabetes. Hypothesizing a link between retinol binding protein 4 and gestational diabetes, we performed a candidate gene study to look for an association between an important retinol binding protein gene polymorphism (rs3758539) and gestational diabetes. Study Design Blood was collected from Caucasian, Asian, and Pacific Islander women diagnosed with gestational diabetes and from ethnically matched non-diabetic controls. DNA was extracted and real time PCR technology (TaqMan, Applied Biosystems) used to screen for the rs3758539 single nucleotide polymorphism located 5′ of exon 1 of the retinol binding protein 4 gene. Results Genotype and allele frequencies in the controls and gestational diabetes cases were tested using chi-square contingency tests. Genotype frequencies were in Hardy-Weinberg equilibrium. There was no association between the rs3758539 retinol binding protein 4 single nucleotide polymorphism and gestational diabetes in the Caucasian, Filipino, or Pacific Islander groups. Conclusion Interestingly, the rs3758539 retinol binding protein 4 single nucleotide polymorphism was not found to be associated with gestational diabetes. The absence of association suggests that gestational and type 2 diabetes may have more divergent molecular pathophysiology than previously suspected. PMID:21886308
Pooled genome wide association detects association upstream of FCRL3 with Graves' disease.
Khong, Jwu Jin; Burdon, Kathryn P; Lu, Yi; Laurie, Kate; Leonardos, Lefta; Baird, Paul N; Sahebjada, Srujana; Walsh, John P; Gajdatsy, Adam; Ebeling, Peter R; Hamblin, Peter Shane; Wong, Rosemary; Forehan, Simon P; Fourlanos, Spiros; Roberts, Anthony P; Doogue, Matthew; Selva, Dinesh; Montgomery, Grant W; Macgregor, Stuart; Craig, Jamie E
2016-11-18
Graves' disease is an autoimmune thyroid disease of complex inheritance. Multiple genetic susceptibility loci are thought to be involved in Graves' disease and it is therefore likely that these can be identified by genome wide association studies. This study aimed to determine if a genome wide association study, using a pooling methodology, could detect genomic loci associated with Graves' disease. Nineteen of the top ranking single nucleotide polymorphisms including HLA-DQA1 and C6orf10, were clustered within the Major Histo-compatibility Complex region on chromosome 6p21, with rs1613056 reaching genome wide significance (p = 5 × 10 -8 ). Technical validation of top ranking non-Major Histo-compatablity complex single nucleotide polymorphisms with individual genotyping in the discovery cohort revealed four single nucleotide polymorphisms with p ≤ 10 -4 . Rs17676303 on chromosome 1q23.1, located upstream of FCRL3, showed evidence of association with Graves' disease across the discovery, replication and combined cohorts. A second single nucleotide polymorphism rs9644119 downstream of DPYSL2 showed some evidence of association supported by finding in the replication cohort that warrants further study. Pooled genome wide association study identified a genetic variant upstream of FCRL3 as a susceptibility locus for Graves' disease in addition to those identified in the Major Histo-compatibility Complex. A second locus downstream of DPYSL2 is potentially a novel genetic variant in Graves' disease that requires further confirmation.
Gu, Hong; Sun, Erdan; Cui, Lei; Yang, Xiufen; Lim, Apiradee; Xu, Jun; Snellingen, Torkel; Liu, Xipu; Wang, Ningli; Liu, Ningpu
2012-10-01
To investigate the association between single-nucleotide polymorphisms in the pi isoform of glutathione S-transferase (GSTP1) gene and the risk of exudative age-related macular degeneration (AMD) in a Chinese case-control cohort. A total of 131 Chinese patients with exudative AMD and 138 control individuals were recruited. Genomic DNA was extracted from venous blood leukocytes. Two common nonsynonymous single-nucleotide polymorphisms in GSTP1 (rs1695 and rs1138272) were genotyped by polymerase chain reaction followed by allele-specific restriction enzyme digestion and direct sequencing. Significant association with exudative AMD was detected for single-nucleotide polymorphism, rs1695 (P = 0.019). The risk G allele frequencies were 21.8% in AMD patients and 12.7% in control subjects (P = 0.007). Compared with the wild-type AA genotype, odds ratio for the risk of AMD was 1.91 (95% confidence interval, 1.09-3.35) for the heterozygous AG genotype and 2.52 (95% confidence interval, 0.6-10.61) for the homozygous GG genotype. In contrast, rs1138272 was not associated with exudative AMD (P = 1.00). The risk G allele frequencies of rs1138272 were 0.4% in AMD patients and 0.4% in control subjects (P = 1.00). Our data suggest that the GSTP1 variant rs1695 moderately increases the risk of exudative AMD. The variant rs1138272 was rare and was not associated with exudative AMD in this Chinese cohort.
Electron attachment to DNA single strands: gas phase and aqueous solution
Gu, Jiande; Xie, Yaoming; Schaefer, Henry F.
2007-01-01
The 2′-deoxyguanosine-3′,5′-diphosphate, 2′-deoxyadenosine-3′,5′-diphosphate, 2′-deoxycytidine-3′,5′-diphosphate and 2′-deoxythymidine-3′,5′-diphosphate systems are the smallest units of a DNA single strand. Exploring these comprehensive subunits with reliable density functional methods enables one to approach reasonable predictions of the properties of DNA single strands. With these models, DNA single strands are found to have a strong tendency to capture low-energy electrons. The vertical attachment energies (VEAs) predicted for 3′,5′-dTDP (0.17 eV) and 3′,5′-dGDP (0.14 eV) indicate that both the thymine-rich and the guanine-rich DNA single strands have the ability to capture electrons. The adiabatic electron affinities (AEAs) of the nucleotides considered here range from 0.22 to 0.52 eV and follow the order 3′,5′-dTDP > 3′,5′-dCDP > 3′,5′-dGDP > 3′,5′-dADP. A substantial increase in the AEA is observed compared to that of the corresponding nucleic acid bases and the corresponding nucleosides. Furthermore, aqueous solution simulations dramatically increase the electron attracting properties of the DNA single strands. The present investigation illustrates that in the gas phase, the excess electron is situated both on the nucleobase and on the phosphate moiety for DNA single strands. However, the distribution of the extra negative charge is uneven. The attached electron favors the base moiety for the pyrimidine, while it prefers the 3′-phosphate subunit for the purine DNA single strands. In contrast, the attached electron is tightly bound to the base fragment for the cytidine, thymidine and adenosine nucleotides, while it almost exclusively resides in the vicinity of the 3′-phosphate group for the guanosine nucleotides due to the solvent effects. The comparatively low vertical detachment energies (VDEs) predicted for 3′,5′-dADP− (0.26 eV) and 3′,5′-dGDP− (0.32 eV) indicate that electron detachment might compete with reactions having high activation barriers such as glycosidic bond breakage. However, the radical anions of the pyrimidine nucleotides with high VDE are expected to be electronically stable. Thus the base-centered radical anions of the pyrimidine nucleotides might be the possible intermediates for DNA single-strand breakage. PMID:17660189
Agonists and antagonists for P2 receptors
Jacobson, Kenneth A.; Costanzi, Stefano; Joshi, Bhalchandra V.; Besada, Pedro; Shin, Dae Hong; Ko, Hyojin; Ivanov, Andrei A.; Mamedova, Liaman
2015-01-01
Recent work has identified nucleotide agonists selective for P2Y1, P2Y2 and P2Y6 receptors and nucleotide antagonists selective for P2Y1, P2Y12 and P2X1 receptors. Selective non-nucleotide antagonists have been reported for P2Y1, P2Y2, P2Y6, P2Y12, P2Y13, P2X2/3/P2X3 and P2X7 receptors. For example, the dinucleotide INS 37217 (Up4dC) potently activates the P2Y2 receptor, and the non-nucleotide antagonist A-317491 is selective for P2X2/3/P2X3 receptors. Nucleotide analogues in which the ribose moiety is substituted by a variety of novel ring systems, including conformation-ally locked moieties, have been synthesized as ligands for P2Y receptors. The focus on conformational factors of the ribose-like moiety allows the inclusion of general modifications that lead to enhanced potency and selectivity. At P2Y1,2,4,11 receptors, there is a preference for the North conformation as indicated with (N)-methanocarba analogues. The P2Y1 antagonist MRS2500 inhibited ADP-induced human platelet aggregation with an IC50 of 0.95 nM. MRS2365, an (N)-methanocarba analogue of 2-MeSADP, displayed potency (EC50) of 0.4 nM at the P2Y1 receptor, with >10 000-fold selectivity in comparison to P2Y12 and P2Y13 receptors. At P2Y6 receptors there is a dramatic preference for the South conformation. Three-dimensional structures of P2Y receptors have been deduced from structure activity relationships (SAR), mutagenesis and modelling studies. Detailed three-dimensional structures of P2X receptors have not yet been proposed. PMID:16805423
Molecular recognition at adenine nucleotide (P2) receptors in platelets.
Jacobson, Kenneth A; Mamedova, Liaman; Joshi, Bhalchandra V; Besada, Pedro; Costanzi, Stefano
2005-04-01
Transmembrane signaling through P2Y receptors for extracellular nucleotides controls a diverse array of cellular processes, including thrombosis. Selective agonists and antagonists of the two P2Y receptors present on the platelet surface-the G (q)-coupled P2Y (1) subtype and the G (i)-coupled P2Y (12) subtype-are now known. High-affinity antagonists of each have been developed from nucleotide structures. The (N)-methanocarba bisphosphate derivatives MRS2279 and MRS2500 are potent and selective P2Y (1) receptor antagonists. The carbocyclic nucleoside AZD6140 is an uncharged, orally active P2Y (12) receptor antagonist of nM affinity. Another nucleotide receptor on the platelet surface, the P2X (1) receptor, the activation of which may also be proaggregatory, especially under conditions of high shear stress, has high-affinity ligands, although high selectivity has not yet been achieved. Although alpha,beta-methylene-adenosine triphosphate (ATP) is the classic agonist for the P2X (1) receptor, where it causes rapid desensitization, the agonist BzATP is among the most potent in activating this subtype. The aromatic sulfonates NF279 and NF449 are potent antagonists of the P2X (1) receptor. The structures of the two platelet P2Y receptors have been modeled, based on a rhodopsin template, to explain the basis for nucleotide recognition within the putative transmembrane binding sites. The P2Y (1) receptor model, especially, has been exploited in the design and optimization of antagonists targeted to interact selectively with that subtype.
USDA-ARS?s Scientific Manuscript database
Background: Folate is an essential nutrient which supports nucleotide synthesis and biological methylation reactions. Diminished folate status results in chromosome breakage and is associated with several diseases including colorectal cancer. Folate status is also inversely related to plasma homocys...
Zhao, Y; Mette, M F; Gowda, M; Longin, C F H; Reif, J C
2014-06-01
Based on data from field trials with a large collection of 135 elite winter wheat inbred lines and 1604 F1 hybrids derived from them, we compared the accuracy of prediction of marker-assisted selection and current genomic selection approaches for the model traits heading time and plant height in a cross-validation approach. For heading time, the high accuracy seen with marker-assisted selection severely dropped with genomic selection approaches RR-BLUP (ridge regression best linear unbiased prediction) and BayesCπ, whereas for plant height, accuracy was low with marker-assisted selection as well as RR-BLUP and BayesCπ. Differences in the linkage disequilibrium structure of the functional and single-nucleotide polymorphism markers relevant for the two traits were identified in a simulation study as a likely explanation for the different trends in accuracies of prediction. A new genomic selection approach, weighted best linear unbiased prediction (W-BLUP), designed to treat the effects of known functional markers more appropriately, proved to increase the accuracy of prediction for both traits and thus closes the gap between marker-assisted and genomic selection.
Zhao, Y; Mette, M F; Gowda, M; Longin, C F H; Reif, J C
2014-01-01
Based on data from field trials with a large collection of 135 elite winter wheat inbred lines and 1604 F1 hybrids derived from them, we compared the accuracy of prediction of marker-assisted selection and current genomic selection approaches for the model traits heading time and plant height in a cross-validation approach. For heading time, the high accuracy seen with marker-assisted selection severely dropped with genomic selection approaches RR-BLUP (ridge regression best linear unbiased prediction) and BayesCπ, whereas for plant height, accuracy was low with marker-assisted selection as well as RR-BLUP and BayesCπ. Differences in the linkage disequilibrium structure of the functional and single-nucleotide polymorphism markers relevant for the two traits were identified in a simulation study as a likely explanation for the different trends in accuracies of prediction. A new genomic selection approach, weighted best linear unbiased prediction (W-BLUP), designed to treat the effects of known functional markers more appropriately, proved to increase the accuracy of prediction for both traits and thus closes the gap between marker-assisted and genomic selection. PMID:24518889
Single Endemic Genotype of Measles Virus Continuously Circulating in China for at Least 16 Years
Wang, Huiling; Zhu, Zhen; Ji, Yixin; Liu, Chunyu; Zhang, Xiaojie; Sun, Liwei; Zhou, Jianhui; Lu, Peishan; Hu, Ying; Feng, Daxing; Zhang, Zhenying; Wang, Changyin; Fang, Xueqiang; Zheng, Huanying; Liu, Leng; Sun, Xiaodong; Tang, Wei; Wang, Yan; Liu, Yan; Gao, Hui; Tian, Hong; Ma, Jiangtao; Gu, Suyi; Wang, Shuang; Feng, Yan; Bo, Fang; Liu, Jianfeng; Si, Yuan; Zhou, Shujie; Ma, Yuyan; Wu, Shengwei; Zhou, Shunde; Li, Fangcai; Ding, Zhengrong; Yang, Zhaohui; Rota, Paul A.; Featherstone, David; Jee, Youngmee; Bellini, William J.; Xu, Wenbo
2012-01-01
The incidence of measles in China from 1991 to 2008 was reviewed, and the nucleotide sequences from 1507 measles viruses (MeV) isolated during 1993 to 2008 were phylogenetically analyzed. The results showed that measles epidemics peaked approximately every 3 to 5 years with the range of measles cases detected between 56,850 and 140,048 per year. The Chinese MeV strains represented three genotypes; 1501 H1, 1 H2 and 5 A. Genotype H1 was the predominant genotype throughout China continuously circulating for at least 16 years. Genotype H1 sequences could be divided into two distinct clusters, H1a and H1b. A 4.2% average nucleotide divergence was found between the H1a and H1b clusters, and the nucleotide sequence and predicted amino acid homologies of H1a viruses were 92.3%–100% and 84.7%–100%, H1b were 97.1%–100% and 95.3%–100%, respectively. Viruses from both clusters were distributed throughout China with no apparent geographic restriction and multiple co-circulating lineages were present in many provinces. Cluster H1a and H1b viruses were co-circulating during 1993 to 2005, while no H1b viruses were detected after 2005 and the transmission of that cluster has presumably been interrupted. Analysis of the nucleotide and predicted amino acid changes in the N proteins of H1a and H1b viruses showed no evidence of selective pressure. This study investigated the genotype and cluster distribution of MeV in China over a 16-year period to establish a genetic baseline before MeV elimination in Western Pacific Region (WPR). Continuous and extensive MeV surveillance and the ability to quickly identify imported cases of measles will become more critical as measles elimination goals are achieved in China in the near future. This is the first report that a single endemic genotype of measles virus has been found to be continuously circulating in one country for at least 16 years. PMID:22532829
Single endemic genotype of measles virus continuously circulating in China for at least 16 years.
Zhang, Yan; Xu, Songtao; Wang, Huiling; Zhu, Zhen; Ji, Yixin; Liu, Chunyu; Zhang, Xiaojie; Sun, Liwei; Zhou, Jianhui; Lu, Peishan; Hu, Ying; Feng, Daxing; Zhang, Zhenying; Wang, Changyin; Fang, Xueqiang; Zheng, Huanying; Liu, Leng; Sun, Xiaodong; Tang, Wei; Wang, Yan; Liu, Yan; Gao, Hui; Tian, Hong; Ma, Jiangtao; Gu, Suyi; Wang, Shuang; Feng, Yan; Bo, Fang; Liu, Jianfeng; Si, Yuan; Zhou, Shujie; Ma, Yuyan; Wu, Shengwei; Zhou, Shunde; Li, Fangcai; Ding, Zhengrong; Yang, Zhaohui; Rota, Paul A; Featherstone, David; Jee, Youngmee; Bellini, William J; Xu, Wenbo
2012-01-01
The incidence of measles in China from 1991 to 2008 was reviewed, and the nucleotide sequences from 1507 measles viruses (MeV) isolated during 1993 to 2008 were phylogenetically analyzed. The results showed that measles epidemics peaked approximately every 3 to 5 years with the range of measles cases detected between 56,850 and 140,048 per year. The Chinese MeV strains represented three genotypes; 1501 H1, 1 H2 and 5 A. Genotype H1 was the predominant genotype throughout China continuously circulating for at least 16 years. Genotype H1 sequences could be divided into two distinct clusters, H1a and H1b. A 4.2% average nucleotide divergence was found between the H1a and H1b clusters, and the nucleotide sequence and predicted amino acid homologies of H1a viruses were 92.3%-100% and 84.7%-100%, H1b were 97.1%-100% and 95.3%-100%, respectively. Viruses from both clusters were distributed throughout China with no apparent geographic restriction and multiple co-circulating lineages were present in many provinces. Cluster H1a and H1b viruses were co-circulating during 1993 to 2005, while no H1b viruses were detected after 2005 and the transmission of that cluster has presumably been interrupted. Analysis of the nucleotide and predicted amino acid changes in the N proteins of H1a and H1b viruses showed no evidence of selective pressure. This study investigated the genotype and cluster distribution of MeV in China over a 16-year period to establish a genetic baseline before MeV elimination in Western Pacific Region (WPR). Continuous and extensive MeV surveillance and the ability to quickly identify imported cases of measles will become more critical as measles elimination goals are achieved in China in the near future. This is the first report that a single endemic genotype of measles virus has been found to be continuously circulating in one country for at least 16 years.
Sorimachi, Kenji; Okayasu, Teiji
2015-01-01
The complete vertebrate mitochondrial genome consists of 13 coding genes. We used this genome to investigate the existence of natural selection in vertebrate evolution. From the complete mitochondrial genomes, we predicted nucleotide contents and then separated these values into coding and non-coding regions. When nucleotide contents of a coding or non-coding region were plotted against the nucleotide content of the complete mitochondrial genomes, we obtained linear regression lines only between homonucleotides and their analogs. On every plot using G or A content purine, G content in aquatic vertebrates was higher than that in terrestrial vertebrates, while A content in aquatic vertebrates was lower than that in terrestrial vertebrates. Based on these relationships, vertebrates were separated into two groups, terrestrial and aquatic. However, using C or T content pyrimidine, clear separation between these two groups was not obtained. The hagfish (Eptatretus burgeri) was further separated from both terrestrial and aquatic vertebrates. Based on these results, nucleotide content relationships predicted from the complete vertebrate mitochondrial genomes reveal the existence of natural selection based on evolutionary separation between terrestrial and aquatic vertebrate groups. In addition, we propose that separation of the two groups might be linked to ammonia detoxification based on high G and low A contents, which encode Glu rich and Lys poor proteins.
Fraley, Stephanie I; Hardick, Justin; Masek, Billie J; Jo Masek, Billie; Athamanolap, Pornpat; Rothman, Richard E; Gaydos, Charlotte A; Carroll, Karen C; Wakefield, Teresa; Wang, Tza-Huei; Yang, Samuel
2013-10-01
Comprehensive profiling of nucleic acids in genetically heterogeneous samples is important for clinical and basic research applications. Universal digital high-resolution melt (U-dHRM) is a new approach to broad-based PCR diagnostics and profiling technologies that can overcome issues of poor sensitivity due to contaminating nucleic acids and poor specificity due to primer or probe hybridization inaccuracies for single nucleotide variations. The U-dHRM approach uses broad-based primers or ligated adapter sequences to universally amplify all nucleic acid molecules in a heterogeneous sample, which have been partitioned, as in digital PCR. Extensive assay optimization enables direct sequence identification by algorithm-based matching of melt curve shape and Tm to a database of known sequence-specific melt curves. We show that single-molecule detection and single nucleotide sensitivity is possible. The feasibility and utility of U-dHRM is demonstrated through detection of bacteria associated with polymicrobial blood infection and microRNAs (miRNAs) associated with host response to infection. U-dHRM using broad-based 16S rRNA gene primers demonstrates universal single cell detection of bacterial pathogens, even in the presence of larger amounts of contaminating bacteria; U-dHRM using universally adapted Lethal-7 miRNAs in a heterogeneous mixture showcases the single copy sensitivity and single nucleotide specificity of this approach.
Efficiency and Fidelity of Human DNA Polymerases λ and β during Gap-Filling DNA Synthesis
Brown, Jessica A.; Pack, Lindsey R.; Sanman, Laura E.; Suo, Zucai
2010-01-01
The base excision repair (BER) pathway coordinates the replacement of 1 to 10 nucleotides at sites of single-base lesions. This process generates DNA substrates with various gap sizes which can alter the catalytic efficiency and fidelity of a DNA polymerase during gap-filling DNA synthesis. Here, we quantitatively determined the substrate specificity and base substitution fidelity of human DNA polymerase λ (Pol λ), an enzyme proposed to support the known BER DNA polymerase β (Pol β), as it filled 1- to 10-nucleotide gaps at 1-nucleotide intervals. Pol λ incorporated a correct nucleotide with relatively high efficiency until the gap size exceeded 9 nucleotides. Unlike Pol λ, Pol β did not have an absolute threshold on gap size as the catalytic efficiency for a correct dNTP gradually decreased as the gap size increased from 2 to 10 nucleotides and then recovered for non-gapped DNA. Surprisingly, an increase in gap size resulted in lower polymerase fidelity for Pol λ, and this downregulation of fidelity was controlled by its non-enzymatic N-terminal domains. Overall, Pol λ was up to 160-fold more error-prone than Pol β, thereby suggesting Pol λ would be more mutagenic during long gap-filling DNA synthesis. In addition, dCTP was the preferred misincorporation for Pol λ and its N-terminal domain truncation mutants. This nucleotide preference was shown to be dependent upon the identity of the adjacent 5′-template base. Our results suggested that both Pol λ and Pol β would catalyze nucleotide incorporation with the highest combination of efficiency and accuracy when the DNA substrate contains a single-nucleotide gap. Thus, Pol λ, like Pol β, is better suited to catalyze gap-filling DNA synthesis during short-patch BER in vivo, although, Pol λ may play a role in long-patch BER. PMID:20961817
McAllister, Christine A; Miller, Allison J
2016-07-01
Autopolyploidy, genome duplication within a single lineage, can result in multiple cytotypes within a species. Geographic distributions of cytotypes may reflect the evolutionary history of autopolyploid formation and subsequent population dynamics including stochastic (drift) and deterministic (differential selection among cytotypes) processes. Here, we used a population genomic approach to investigate whether autopolyploidy occurred once or multiple times in Andropogon gerardii, a widespread, North American grass with two predominant cytotypes. Genotyping by sequencing was used to identify single nucleotide polymorphisms (SNPs) in individuals collected from across the geographic range of A. gerardii. Two independent approaches to SNP calling were used: the reference-free UNEAK pipeline and a reference-guided approach based on the sequenced Sorghum bicolor genome. SNPs generated using these pipelines were analyzed independently with genetic distance and clustering. Analyses of the two SNP data sets showed very similar patterns of population-level clustering of A. gerardii individuals: a cluster of A. gerardii individuals from the southern Plains, a northern Plains cluster, and a western cluster. Groupings of individuals corresponded to geographic localities regardless of cytotype: 6x and 9x individuals from the same geographic area clustered together. SNPs generated using reference-guided and reference-free pipelines in A. gerardii yielded unique subsets of genomic data. Both data sets suggest that the 9x cytotype in A. gerardii likely evolved multiple times from 6x progenitors across the range of the species. Genomic approaches like GBS and diverse bioinformatics pipelines used here facilitate evolutionary analyses of complex systems with multiple ploidy levels. © 2016 Botanical Society of America.
Association between RTEL1 gene polymorphisms and COPD susceptibility in a Chinese Han population
Ding, Yipeng; Xu, Heping; Yao, Jinjian; Xu, Dongchuan; He, Ping; Yi, Shengyang; Li, Quanni; Liu, Yuanshui; Wu, Cibing; Tian, Zhongjie
2017-01-01
Objective We investigated the association between single-nucleotide polymorphisms in regulation of telomere elongation helicase 1 (RTEL1), which has been associated with telomere length in several brain cancers and age-related diseases, and the risk of chronic obstructive pulmonary disease (COPD) in a Chinese Han population. Methods In a case–control study that included 279 COPD cases and 290 healthy controls, five single-nucleotide polymorphisms in RTEL1 were selected and genotyped using the Sequenom MassARRAY platform. Odds ratios (ORs) and 95% confidence intervals (CIs) were calculated using unconditional logistic regression after adjusting for age and gender. Results In the genotype model analysis, we determined that rs4809324 polymorphism had a decreased effect on the risk of COPD (CC versus TT: OR =0.28; 95% CI =0.10–0.82; P=0.02). In the genetic model analysis, we found that the “C/C” genotype of rs4809324 was associated with a decreased risk of COPD based on the codominant model (OR =0.33; 95% CI =0.13–0.86; P=0.022) and recessive model (OR =0.32; 95% CI =0.12–0.80; P=0.009). Conclusion Our data shed new light on the association between genetic polymorphisms of RTEL1 and COPD susceptibility in the Chinese Han population. PMID:28360516
Functional Reconstitution of a Fungal Natural Product Gene Cluster by Advanced Genome Editing.
Weber, Jakob; Valiante, Vito; Nødvig, Christina S; Mattern, Derek J; Slotkowski, Rebecca A; Mortensen, Uffe H; Brakhage, Axel A
2017-01-20
Filamentous fungi produce varieties of natural products even in a strain dependent manner. However, the genetic basis of chemical speciation between strains is still widely unknown. One example is trypacidin, a natural product of the opportunistic human pathogen Aspergillus fumigatus, which is not produced among different isolates. Combining computational analysis with targeted gene editing, we could link a single nucleotide insertion in the polyketide synthase of the trypacidin biosynthetic pathway and reconstitute its production in a nonproducing strain. Thus, we present a CRISPR/Cas9-based tool for advanced molecular genetic studies in filamentous fungi, exploiting selectable markers separated from the edited locus.
Hayatsu, H; Yamashita, Y; Yui, S; Yamagata, Y; Tomita, K; Negishi, K
1982-10-25
When guanine-, adenine- and cytosine-nucleosides and nucleotides were treated with formaldehyde and then with bisulfite, stable N-sulfomethyl compounds were formed. N2-Sulfomethylguanine, N6-sulfomethyladenine, N4-sulfomthylcytosine and N6-sulfomethyl-9-beta-D-arabinofuranosyladenine were isolated as crystals and characterized. A guanine-specific sulfomethylation was brought about by treatment and denatured single-stranded DNA with formaldehyde and then with bisulfite at pH 7 and 4 degrees C. Since native double-stranded DNA was not modified by this treatment, this new method of modification is expected to be useful as a conformational probe for polynucleotides.
Hayatsu, H; Yamashita, Y; Yui, S; Yamagata, Y; Tomita, K; Negishi, K
1982-01-01
When guanine-, adenine- and cytosine-nucleosides and nucleotides were treated with formaldehyde and then with bisulfite, stable N-sulfomethyl compounds were formed. N2-Sulfomethylguanine, N6-sulfomethyladenine, N4-sulfomthylcytosine and N6-sulfomethyl-9-beta-D-arabinofuranosyladenine were isolated as crystals and characterized. A guanine-specific sulfomethylation was brought about by treatment and denatured single-stranded DNA with formaldehyde and then with bisulfite at pH 7 and 4 degrees C. Since native double-stranded DNA was not modified by this treatment, this new method of modification is expected to be useful as a conformational probe for polynucleotides. PMID:7177848
Hu, Bo; Guo, Jing; Xu, Ying; Wei, Hua; Zhao, Guojie; Guan, Yifu
2017-08-01
Rapid and accurate detection of microRNAs in biological systems is of great importance. Here, we report the development of a visual colorimetric assay which possesses the high amplification capabilities and high selectivity of the rolling circle amplification (RCA) method and the simplicity and convenience of gold nanoparticles used as a signal indicator. The designed padlock probe recognizes the target miRNA and is circularized, and then acts as the template to extend the target miRNA into a long single-stranded nucleotide chain of many tandem repeats of nucleotide sequences. Next, the RCA product is hybridized with oligonucleotides tagged onto gold nanoparticles. This interaction leads to the aggregation of gold nanoparticles, and the color of the system changes from wine red to dark blue according to the abundance of miRNA. A linear correlation between fluorescence and target oligonucleotide content was obtained in the range 0.3-300 pM, along with a detection limit of 0.13 pM (n = 7) and a RSD of 3.9% (30 pM, n = 9). The present approach provides a simple, rapid, and accurate visual colorimetric assay that allows sensitive biodetection and bioanalysis of DNA and RNA nucleotides of interest in biologically important samples. Graphical abstract The colorimetric assay system for analyzing target oligonucleotides.
Kumar, Bharath; Abdel-Ghani, Adel H; Pace, Jordon; Reyes-Matamoros, Jenaro; Hochholdinger, Frank; Lübberstedt, Thomas
2014-07-01
Several genes involved in maize root development have been isolated. Identification of SNPs associated with root traits would enable the selection of maize lines with better root architecture that might help to improve N uptake, and consequently plant growth particularly under N deficient conditions. In the present study, an association study (AS) panel consisting of 74 maize inbred lines was screened for seedling root traits in 6, 10, and 14-day-old seedlings. Allele re-sequencing of candidate root genes Rtcl, Rth3, Rum1, and Rul1 was also carried out in the same AS panel lines. All four candidate genes displayed different levels of nucleotide diversity, haplotype diversity and linkage disequilibrium. Gene based association analyses were carried out between individual polymorphisms in candidate genes, and root traits measured in 6, 10, and 14-day-old maize seedlings. Association analyses revealed several polymorphisms within the Rtcl, Rth3, Rum1, and Rul1 genes associated with seedling root traits. Several nucleotide polymorphisms in Rtcl, Rth3, Rum1, and Rul1 were significantly (P<0.05) associated with seedling root traits in maize suggesting that all four tested genes are involved in the maize root development. Thus considerable allelic variation present in these root genes can be exploited for improving maize root characteristics. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc'h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine
2017-01-01
Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus's but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies.
Guinoiseau, Thibault; Moreau, Alain; Hohnadel, Guillaume; Ngo-Giang-Huong, Nicole; Brulard, Celine; Vourc’h, Patrick; Goudeau, Alain; Gaudy-Graffin, Catherine
2017-01-01
Hepatitis C virus (HCV) evolves rapidly in a single host and circulates as a quasispecies wich is a complex mixture of genetically distinct virus’s but closely related namely variants. To identify intra-individual diversity and investigate their functional properties in vitro, it is necessary to define their quasispecies composition and isolate the HCV variants. This is possible using single genome amplification (SGA). This technique, based on serially diluted cDNA to amplify a single cDNA molecule (clonal amplicon), has already been used to determine individual HCV diversity. In these studies, positive PCR reactions from SGA were directly sequenced using Sanger technology. The detection of non-clonal amplicons is necessary for excluding them to facilitate further functional analysis. Here, we compared Next Generation Sequencing (NGS) with De Novo assembly and Sanger sequencing for their ability to distinguish clonal and non-clonal amplicons after SGA on one plasma specimen. All amplicons (n = 42) classified as clonal by NGS were also classified as clonal by Sanger sequencing. No double peaks were seen on electropherograms for non-clonal amplicons with position-specific nucleotide variation below 15% by NGS. Altogether, NGS circumvented many of the difficulties encountered when using Sanger sequencing after SGA and is an appropriate tool to reliability select clonal amplicons for further functional studies. PMID:28362878
Rubinstein, M; Japón, M A; Low, M J
1993-06-11
The introduction of small mutations instead of null alleles into the mouse genome has broad applications to the study of protein structure-function relationships and the creation of animal models of human genetic diseases. To test a simple mutational strategy we designed a targeting vector for the mouse proopiomelanocortin (POMC) gene containing a single nucleotide insertion that converts the initial tyrosine codon of beta-endorphin 1-31 to a premature translational termination codon and introduces a unique Hpal endonuclease restriction site. The targeting vector also contains a neo cassette immediately 3' to the last POMC exon and a herpes simplex virus thymidine kinase cassette to allow positive and negative selection. Homologous recombination occurred at a frequency of 1/30 clones of electroporated embryonic stem cells selected in G418 and gancyclovir. 10/11 clones identified initially by a polymerase chain reaction (PCR) strategy had the predicted structure without evidence of concatemer formation by Southern blot analysis. We used a combination of Hpa I digestion of PCR amplified fragments and direct nucleotide sequencing to further confirm that the point mutation was retained in 9/10 clones. The POMC gene was transcriptionally silent in embryonic stem cells and the targeted allele was not activated by the downstream phosphoglycerate kinase-1 promoter that transcribed the neo gene. Under the electroporation conditions used, we have demonstrated that a point mutation can be introduced with high efficiency and precision into the POMC gene using a replacement type vector containing a retained selectable marker without affecting expression of the allele in the embryonic stem cells. A similar strategy may be useful for a wide range of genes.
Rubinstein, M; Japón, M A; Low, M J
1993-01-01
The introduction of small mutations instead of null alleles into the mouse genome has broad applications to the study of protein structure-function relationships and the creation of animal models of human genetic diseases. To test a simple mutational strategy we designed a targeting vector for the mouse proopiomelanocortin (POMC) gene containing a single nucleotide insertion that converts the initial tyrosine codon of beta-endorphin 1-31 to a premature translational termination codon and introduces a unique Hpal endonuclease restriction site. The targeting vector also contains a neo cassette immediately 3' to the last POMC exon and a herpes simplex virus thymidine kinase cassette to allow positive and negative selection. Homologous recombination occurred at a frequency of 1/30 clones of electroporated embryonic stem cells selected in G418 and gancyclovir. 10/11 clones identified initially by a polymerase chain reaction (PCR) strategy had the predicted structure without evidence of concatemer formation by Southern blot analysis. We used a combination of Hpa I digestion of PCR amplified fragments and direct nucleotide sequencing to further confirm that the point mutation was retained in 9/10 clones. The POMC gene was transcriptionally silent in embryonic stem cells and the targeted allele was not activated by the downstream phosphoglycerate kinase-1 promoter that transcribed the neo gene. Under the electroporation conditions used, we have demonstrated that a point mutation can be introduced with high efficiency and precision into the POMC gene using a replacement type vector containing a retained selectable marker without affecting expression of the allele in the embryonic stem cells. A similar strategy may be useful for a wide range of genes. Images PMID:8392702
Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang
2015-08-26
The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Joint Identification of Genetic Variants for Physical Activity in Korean Population
Kim, Jayoun; Kim, Jaehee; Min, Haesook; Oh, Sohee; Kim, Yeonjung; Lee, Andy H.; Park, Taesung
2014-01-01
There has been limited research on genome-wide association with physical activity (PA). This study ascertained genetic associations between PA and 344,893 single nucleotide polymorphism (SNP) markers in 8842 Korean samples. PA data were obtained from a validated questionnaire that included information on PA intensity and duration. Metabolic equivalent of tasks were calculated to estimate the total daily PA level for each individual. In addition to single- and multiple-SNP association tests, a pathway enrichment analysis was performed to identify the biological significance of SNP markers. Although no significant SNP was found at genome-wide significance level via single-SNP association tests, 59 genetic variants mapped to 76 genes were identified via a multiple SNP approach using a bootstrap selection stability measure. Pathway analysis for these 59 variants showed that maturity onset diabetes of the young (MODY) was enriched. Joint identification of SNPs could enable the identification of multiple SNPs with good predictive power for PA and a pathway enriched for PA. PMID:25026172
Lam, Angela M.; Espiritu, Christine; Bansal, Shalini; Micolochick Steuer, Holly M.; Zennou, Veronique; Otto, Michael J.; Furman, Phillip A.
2011-01-01
PSI-352938, a cyclic phosphate nucleotide, and PSI-353661, a phosphoramidate nucleotide, are prodrugs of β-d-2′-deoxy-2′-α-fluoro-2′-β-C-methylguanosine-5′-monophosphate. Both compounds are metabolized to the same active 5′-triphosphate, PSI-352666, which serves as an alternative substrate inhibitor of the NS5B RNA-dependent RNA polymerase during HCV replication. PSI-352938 and PSI-353661 retained full activity against replicons containing the S282T substitution, which confers resistance to certain 2′-substituted nucleoside/nucleotide analogs. PSI-352666 was also similarly active against both wild-type and S282T NS5B polymerases. In order to identify mutations that confer resistance to these compounds, in vitro selection studies were performed using HCV replicon cells. While no resistant genotype 1a or 1b replicons could be selected, cells containing genotype 2a JFH-1 replicons cultured in the presence of PSI-352938 or PSI-353661 developed resistance to both compounds. Sequencing of the NS5B region identified a number of amino acid changes, including S15G, R222Q, C223Y/H, L320I, and V321I. Phenotypic evaluation of these mutations indicated that single amino acid changes were not sufficient to significantly reduce the activity of PSI-352938 and PSI-353661. Instead, a combination of three amino acid changes, S15G/C223H/V321I, was required to confer a high level of resistance. No cross-resistance exists between the 2′-F-2′-C-methylguanosine prodrugs and other classes of HCV inhibitors, including 2′-modified nucleoside/-tide analogs such as PSI-6130, PSI-7977, INX-08189, and IDX-184. Finally, we determined that in genotype 1b replicons, the C223Y/H mutation failed to support replication, and although the A15G/C223H/V321I triple mutation did confer resistance to PSI-352938 and PSI-353661, this mutant replicated at only about 10% efficiency compared to the wild type. PMID:21957306
Detecting and Analyzing Genetic Recombination Using RDP4.
Martin, Darren P; Murrell, Ben; Khoosal, Arjun; Muhire, Brejnev
2017-01-01
Recombination between nucleotide sequences is a major process influencing the evolution of most species on Earth. The evolutionary value of recombination has been widely debated and so too has its influence on evolutionary analysis methods that assume nucleotide sequences replicate without recombining. When nucleic acids recombine, the evolution of the daughter or recombinant molecule cannot be accurately described by a single phylogeny. This simple fact can seriously undermine the accuracy of any phylogenetics-based analytical approach which assumes that the evolutionary history of a set of recombining sequences can be adequately described by a single phylogenetic tree. There are presently a large number of available methods and associated computer programs for analyzing and characterizing recombination in various classes of nucleotide sequence datasets. Here we examine the use of some of these methods to derive and test recombination hypotheses using multiple sequence alignments.
Extension of the COG and arCOG databases by amino acid and nucleotide sequences
Meereis, Florian; Kaufmann, Michael
2008-01-01
Background The current versions of the COG and arCOG databases, both excellent frameworks for studies in comparative and functional genomics, do not contain the nucleotide sequences corresponding to their protein or protein domain entries. Results Using sequence information obtained from GenBank flat files covering the completely sequenced genomes of the COG and arCOG databases, we constructed NUCOCOG (nucleotide sequences containing COG databases) as an extended version including all nucleotide sequences and in addition the amino acid sequences originally utilized to construct the current COG and arCOG databases. We make available three comprehensive single XML files containing the complete databases including all sequence information. In addition, we provide a web interface as a utility suitable to browse the NUCOCOG database for sequence retrieval. The database is accessible at . Conclusion NUCOCOG offers the possibility to analyze any sequence related property in the context of the COG and arCOG framework simply by using script languages such as PERL applied to a large but single XML document. PMID:19014535
Location of a major antigenic site involved in Ross River virus neutralization.
Vrati, S; Fernon, C A; Dalgarno, L; Weir, R C
1988-02-01
The location of a major antigenic domain involved in the neutralization of an alphavirus, Ross River virus, has been defined in terms of its position in the amino acid sequence of the E2 glycoprotein. The domain encompasses three topographically close epitopes which were identified using three E2-specific neutralizing monoclonal antibodies in competitive binding assays. Nucleotide sequencing of the structural protein genes of monoclonal antibody-selected antigenic variants showed that for each variant there was a single nucleotide change in the E2 gene leading to a nonconservative amino acid substitution in E2. Changes were at positions 216, 234, and 246-251 in the amino acid sequence. The epitopes are in a region of E2 which, though not strongly conserved as to sequence among Ross River virus, Semliki Forest virus, and Sindbis virus, is conserved in its hydropathy profile among the three alphaviruses. The epitopes lie between two asparagine-linked glycosylation sites (residues 200 and 262) in E2. They are conserved as to position between the mouse virulent T48 strain and the mouse avirulent NB5092 strain.
Rate of de novo mutations and the importance of father's age to disease risk.
Kong, Augustine; Frigge, Michael L; Masson, Gisli; Besenbacher, Soren; Sulem, Patrick; Magnusson, Gisli; Gudjonsson, Sigurjon A; Sigurdsson, Asgeir; Jonasdottir, Aslaug; Jonasdottir, Adalbjorg; Wong, Wendy S W; Sigurdsson, Gunnar; Walters, G Bragi; Steinberg, Stacy; Helgason, Hannes; Thorleifsson, Gudmar; Gudbjartsson, Daniel F; Helgason, Agnar; Magnusson, Olafur Th; Thorsteinsdottir, Unnur; Stefansson, Kari
2012-08-23
Mutations generate sequence diversity and provide a substrate for selection. The rate of de novo mutations is therefore of major importance to evolution. Here we conduct a study of genome-wide mutation rates by sequencing the entire genomes of 78 Icelandic parent-offspring trios at high coverage. We show that in our samples, with an average father's age of 29.7, the average de novo mutation rate is 1.20 × 10(-8) per nucleotide per generation. Most notably, the diversity in mutation rate of single nucleotide polymorphisms is dominated by the age of the father at conception of the child. The effect is an increase of about two mutations per year. An exponential model estimates paternal mutations doubling every 16.5 years. After accounting for random Poisson variation, father's age is estimated to explain nearly all of the remaining variation in the de novo mutation counts. These observations shed light on the importance of the father's age on the risk of diseases such as schizophrenia and autism.
Vargas-Rodríguez, Rosa del Carmen Miluska; da Silva Bastos, Melissa; Menezes, Maria José; Orjuela-Sánchez, Pamela; Ferreira, Marcelo U.
2012-01-01
Emerging resistance to chloroquine (CQ) poses a major challenge for Plasmodium vivax malaria control, and nucleotide substitutions and copy number variation in the P. vivax multidrug resistance 1 (pvmdr-1) locus, which encodes a digestive vacuole membrane transporter, may modulate this phenotype. We describe patterns of genetic variation in pvmdr-1 alleles from Acre and Amazonas in northwestern Brazil, and compare then with those reported in other malaria-endemic regions. The pvmdr-1 mutation Y976F, which is associated with CQ resistance in Southeast Asia and Oceania, remains rare in northwestern Brazil (1.8%) and its prevalence mirrors that of CQ resistance worldwide. Gene amplification of pvmdr-1, which is associated with mefloquine resistance but increased susceptibility to CQ, remains relatively rare in northwestern Brazil (0.9%) and globally (< 4%), but became common (> 10%) in Tak Province, Thailand, possibly because of drug-mediated selection. The global database we have assembled provides a baseline for further studies of genetic variation in pvmdr-1 and drug resistance in P. vivax malaria. PMID:22949516
Vargas-Rodríguez, Rosa del Carmen Miluska; da Silva Bastos, Melissa; Menezes, Maria José; Orjuela-Sánchez, Pamela; Ferreira, Marcelo U
2012-11-01
Emerging resistance to chloroquine (CQ) poses a major challenge for Plasmodium vivax malaria control, and nucleotide substitutions and copy number variation in the P. vivax multidrug resistance 1 (pvmdr-1) locus, which encodes a digestive vacuole membrane transporter, may modulate this phenotype. We describe patterns of genetic variation in pvmdr-1 alleles from Acre and Amazonas in northwestern Brazil, and compare then with those reported in other malaria-endemic regions. The pvmdr-1 mutation Y976F, which is associated with CQ resistance in Southeast Asia and Oceania, remains rare in northwestern Brazil (1.8%) and its prevalence mirrors that of CQ resistance worldwide. Gene amplification of pvmdr-1, which is associated with mefloquine resistance but increased susceptibility to CQ, remains relatively rare in northwestern Brazil (0.9%) and globally (< 4%), but became common (> 10%) in Tak Province, Thailand, possibly because of drug-mediated selection. The global database we have assembled provides a baseline for further studies of genetic variation in pvmdr-1 and drug resistance in P. vivax malaria.
Detection limit of intragenic deletions with targeted array comparative genomic hybridization
2013-01-01
Background Pathogenic mutations range from single nucleotide changes to deletions or duplications that encompass a single exon to several genes. The use of gene-centric high-density array comparative genomic hybridization (aCGH) has revolutionized the detection of intragenic copy number variations. We implemented an exon-centric design of high-resolution aCGH to detect single- and multi-exon deletions and duplications in a large set of genes using the OGT 60 K and 180 K arrays. Here we describe the molecular characterization and breakpoint mapping of deletions at the smaller end of the detectable range in several genes using aCGH. Results The method initially implemented to detect single to multiple exon deletions, was able to detect deletions much smaller than anticipated. The selected deletions we describe vary in size, ranging from over 2 kb to as small as 12 base pairs. The smallest of these deletions are only detectable after careful manual review during data analysis. Suspected deletions smaller than the detection size for which the method was optimized, were rigorously followed up and confirmed with PCR-based investigations to uncover the true detection size limit of intragenic deletions with this technology. False-positive deletion calls often demonstrated single nucleotide changes or an insertion causing lower hybridization of probes demonstrating the sensitivity of aCGH. Conclusions With optimizing aCGH design and careful review process, aCGH can uncover intragenic deletions as small as dozen bases. These data provide insight that will help optimize probe coverage in array design and illustrate the true assay sensitivity. Mapping of the breakpoints confirms smaller deletions and contributes to the understanding of the mechanism behind these events. Our knowledge of the mutation spectra of several genes can be expected to change as previously unrecognized intragenic deletions are uncovered. PMID:24304607
Wofford, Austin M.; Finch, Kristen; Bigott, Adam; Willyard, Ann
2014-01-01
• Premise of the study: Recently released Pinus plastome sequences support characterization of 15 plastid simple sequence repeat (cpSSR) loci originally published for P. contorta and P. thunbergii. This allows selection of loci for single-tube PCR multiplexed genotyping in any subsection of the genus. • Methods: Unique placement of primers and primer conservation across the genus were investigated, and a set of six loci were selected for single-tube multiplexing. We compared interspecific variation between cpSSRs and nucleotide sequences of ycf1 and tested intraspecific variation for cpSSRs using 911 samples in the P. ponderosa species complex. • Results: The cpSSR loci contain mononucleotide and complex repeats with additional length variation in flanking regions. They are not located in hypervariable regions, and most primers are conserved across the genus. A single PCR per sample multiplexed for six loci yielded 45 alleles in 911 samples. • Discussion: The protocol allows efficient genotyping of many samples. The cpSSR loci are too variable for Pinus phylogenies but are useful for the study of genetic structure within and among populations. The multiplex method could easily be extended to other plant groups by choosing primers for cpSSR loci in a plastome alignment for the target group. PMID:25202625
Moore, Jason H; Gilbert, Joshua C; Tsai, Chia-Ti; Chiang, Fu-Tien; Holden, Todd; Barney, Nate; White, Bill C
2006-07-21
Detecting, characterizing, and interpreting gene-gene interactions or epistasis in studies of human disease susceptibility is both a mathematical and a computational challenge. To address this problem, we have previously developed a multifactor dimensionality reduction (MDR) method for collapsing high-dimensional genetic data into a single dimension (i.e. constructive induction) thus permitting interactions to be detected in relatively small sample sizes. In this paper, we describe a comprehensive and flexible framework for detecting and interpreting gene-gene interactions that utilizes advances in information theory for selecting interesting single-nucleotide polymorphisms (SNPs), MDR for constructive induction, machine learning methods for classification, and finally graphical models for interpretation. We illustrate the usefulness of this strategy using artificial datasets simulated from several different two-locus and three-locus epistasis models. We show that the accuracy, sensitivity, specificity, and precision of a naïve Bayes classifier are significantly improved when SNPs are selected based on their information gain (i.e. class entropy removed) and reduced to a single attribute using MDR. We then apply this strategy to detecting, characterizing, and interpreting epistatic models in a genetic study (n = 500) of atrial fibrillation and show that both classification and model interpretation are significantly improved.
Obesity-Related Genomic Loci Are Associated with Type 2 Diabetes in a Han Chinese Population
Zhao, Qi; He, Jiang; Chen, Li; Zhao, Zhigang; Li, Qiang; Ge, Jiapu; Chen, Gang; Guo, Xiaohui; Lu, Juming; Weng, Jianping; Jia, Weiping; Ji, Linong; Xiao, Jianzhong; Shan, Zhongyan; Liu, Jie; Tian, Haoming; Ji, Qiuhe; Zhu, Dalong; Zhou, Zhiguang; Shan, Guangliang; Yang, Wenying
2014-01-01
Background and Aims Obesity is a well-known risk factor for type 2 diabetes. Genome-wide association studies have identified a number of genetic loci associated with obesity. The aim of this study is to examine the contribution of obesity-related genomic loci to type 2 diabetes in a Chinese population. Methods We successfully genotyped 18 obesity-related single nucleotide polymorphisms among 5338 type 2 diabetic patients and 4663 controls. Both individual and joint effects of these single nucleotide polymorphisms on type 2 diabetes and quantitative glycemic traits (assessing β-cell function and insulin resistance) were analyzed using logistic and linear regression models, respectively. Results Two single nucleotide polymorphisms near MC4R and GNPDA2 genes were significantly associated with type 2 diabetes before adjusting for body mass index and waist circumference (OR (95% CI) = 1.14 (1.06, 1.22) for the A allele of rs12970134, P = 4.75×10−4; OR (95% CI) = 1.10 (1.03, 1.17) for the G allele of rs10938397, P = 4.54×10−3). When body mass index and waist circumference were further adjusted, the association of MC4R with type 2 diabetes remained significant (P = 1.81×10−2) and that of GNPDA2 was attenuated (P = 1.26×10−1), suggesting the effect of the locus including GNPDA2 on type 2 diabetes may be mediated through obesity. Single nucleotide polymorphism rs2260000 within BAT2 was significantly associated with type 2 diabetes after adjusting for body mass index and waist circumference (P = 1.04×10−2). In addition, four single nucleotide polymorphisms (near or within SEC16B, BDNF, MAF and PRL genes) showed significant associations with quantitative glycemic traits in controls even after adjusting for body mass index and waist circumference (all P values<0.05). Conclusions This study indicates that obesity-related genomic loci were associated with type 2 diabetes and glycemic traits in the Han Chinese population. PMID:25093408
Gu, Xin; Na, Rong; Huang, Tao; Wang, Li; Tao, Sha; Tian, Lu; Chen, Zhuo; Jiao, Yang; Kang, Jian; Zheng, Siqun; Xu, Jianfeng; Sun, Jielin; Qi, Jun
2013-08-01
Common treatments for benign prostatic hyperplasia include 5α-reductase inhibitors and α-adrenergic receptor antagonists. However, these treatments can only partially decrease the risk of benign prostatic hyperplasia progression. SRD5A1 and SRD5A2 are 5α-reductase inhibitor targets. We investigated the association between drug efficacy and single nucleotide polymorphisms in the SRD5A1 and SRD5A2 genes in a Chinese population. We genotyped 11 tagging single nucleotide polymorphisms in the SRD5A1 and SRD5A2 genes in a total of 426 benign prostatic hyperplasia cases and 1,008 controls from Xinhua Hospital, Shanghai, People's Republic of China. Cases were treated with type II 5α-reductase inhibitors and α-adrenergic receptor antagonists. We tested the association of tagging single nucleotide polymorphisms with benign prostatic hyperplasia risk/progression, clinical characteristics at baseline, including the I-PSS (International Prostate Symptom Score) and total prostate volume, and changes in clinical characteristics after treatment. The 11 tagging single nucleotide polymorphisms were not significantly associated with benign prostatic hyperplasia risk or progression (each p >0.05). In the SRD5A1 gene rs6884552 and rs3797177 were significantly associated with baseline I-PSS (p = 0.04 and 0.003, respectively). In the SRD5A2 gene rs523349 (V89L) and rs9332975 were significantly associated with baseline total prostate volume (p = 0.01 and 0.001, respectively). In SRD5A1 rs166050 was significantly associated with the posttreatment change in total prostate volume (p = 0.04). In SRD5A2 rs523349 and rs612224 were significantly associated with the posttreatment I-PSS change (p = 0.03 and 0.009, respectively). SRD5A1 and SRD5A2 single nucleotide polymorphisms are significantly associated with the clinical characteristics of benign prostatic hyperplasia and the efficacy of benign prostatic hyperplasia treatment. Copyright © 2013 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Incorporation of causative quantitative trait nucleotides in single-step GBLUP.
Fragomeni, Breno O; Lourenco, Daniela A L; Masuda, Yutaka; Legarra, Andres; Misztal, Ignacy
2017-07-26
Much effort is put into identifying causative quantitative trait nucleotides (QTN) in animal breeding, empowered by the availability of dense single nucleotide polymorphism (SNP) information. Genomic selection using traditional SNP information is easily implemented for any number of genotyped individuals using single-step genomic best linear unbiased predictor (ssGBLUP) with the algorithm for proven and young (APY). Our aim was to investigate whether ssGBLUP is useful for genomic prediction when some or all QTN are known. Simulations included 180,000 animals across 11 generations. Phenotypes were available for all animals in generations 6 to 10. Genotypes for 60,000 SNPs across 10 chromosomes were available for 29,000 individuals. The genetic variance was fully accounted for by 100 or 1000 biallelic QTN. Raw genomic relationship matrices (GRM) were computed from (a) unweighted SNPs, (b) unweighted SNPs and causative QTN, (c) SNPs and causative QTN weighted with results obtained with genome-wide association studies, (d) unweighted SNPs and causative QTN with simulated weights, (e) only unweighted causative QTN, (f-h) as in (b-d) but using only the top 10% causative QTN, and (i) using only causative QTN with simulated weight. Predictions were computed by pedigree-based BLUP (PBLUP) and ssGBLUP. Raw GRM were blended with 1 or 5% of the numerator relationship matrix, or 1% of the identity matrix. Inverses of GRM were obtained directly or with APY. Accuracy of breeding values for 5000 genotyped animals in the last generation with PBLUP was 0.32, and for ssGBLUP it increased to 0.49 with an unweighted GRM, 0.53 after adding unweighted QTN, 0.63 when QTN weights were estimated, and 0.89 when QTN weights were based on true effects known from the simulation. When the GRM was constructed from causative QTN only, accuracy was 0.95 and 0.99 with blending at 5 and 1%, respectively. Accuracies simulating 1000 QTN were generally lower, with a similar trend. Accuracies using the APY inverse were equal or higher than those with a regular inverse. Single-step GBLUP can account for causative QTN via a weighted GRM. Accuracy gains are maximum when variances of causative QTN are known and blending is at 1%.
Holmes, Michael V; Exeter, Holly J; Folkersen, Lasse; Nelson, Christopher P; Guardiola, Montse; Cooper, Jackie A; Sofat, Reecha; Boekholdt, S Matthijs; Khaw, Kay-Tee; Li, Ka-Wah; Smith, Andrew J P; Van't Hooft, Ferdinand; Eriksson, Per; Franco-Cereceda, Anders; Asselbergs, Folkert W; Boer, Jolanda M A; Onland-Moret, N Charlotte; Hofker, Marten; Erdmann, Jeanette; Kivimaki, Mika; Kumari, Meena; Reiner, Alex P; Keating, Brendan J; Humphries, Steve E; Hingorani, Aroon D; Mallat, Ziad; Samani, Nilesh J; Talmud, Philippa J
2014-04-01
Secretory phospholipase A2 (sPLA2) enzymes are considered to play a role in atherosclerosis. sPLA2 activity encompasses several sPLA2 isoenzymes, including sPLA2-V. Although observational studies show a strong association between elevated sPLA2 activity and CHD, no assay to measure sPLA2-V levels exists, and the only evidence linking the sPLA2-V isoform to atherosclerosis progression comes from animal studies. In the absence of an assay that directly quantifies sPLA2-V levels, we used PLA2G5 mRNA levels in a novel, modified Mendelian randomization approach to investigate the hypothesized causal role of sPLA2-V in coronary heart disease (CHD) pathogenesis. Using data from the Advanced Study of Aortic Pathology, we identified the single-nucleotide polymorphism in PLA2G5 showing the strongest association with PLA2G5 mRNA expression levels as a proxy for sPLA2-V levels. We tested the association of this SNP with sPLA2 activity and CHD events in 4 prospective and 14 case-control studies with 27 230 events and 70 500 controls. rs525380C>A showed the strongest association with PLA2G5 mRNA expression (P=5.1×10(-6)). There was no association of rs525380C>A with plasma sPLA2 activity (difference in geometric mean of sPLA2 activity per rs525380 A-allele 0.4% (95% confidence intervals [-0.9%, 1.6%]; P=0.56). In meta-analyses, the odds ratio for CHD per A-allele was 1.02 (95% confidence intervals [0.99, 1.04]; P=0.20). This novel approach for single-nucleotide polymorphism selection for this modified Mendelian randomization analysis showed no association between rs525380 (the lead single-nucleotide polymorphism for PLA2G5 expression, a surrogate for sPLA2-V levels) and CHD events. The evidence does not support a causal role for sPLA2-V in CHD.
Ponte, Paulo Roberto Lins; de Medeiros, Pedro Henrique Quintela Soares; Havt, Alexandre; Caetano, Joselany Afio; Cid, David A C; Prata, Mara de Moura Gondim; Soares, Alberto Melo; Guerrant, Richard L; Mychaleckyj, Josyf; Lima, Aldo Ângelo Moreira
2016-02-01
This work aimed to evaluate and correlate symptoms, biochemical blood test results and single nucleotide polymorphisms for lactose intolerance diagnosis. A cross-sectional study was conducted in Fortaleza, Ceará, Brazil, with a total of 119 patients, 54 of whom were lactose intolerant. Clinical evaluation and biochemical blood tests were conducted after lactose ingestion and blood samples were collected for genotyping evaluation. In particular, the single nucleotide polymorphisms C>T-13910 and G>A-22018 were analyzed by restriction fragment length polymorphism/polymerase chain reaction and validated by DNA sequencing. Lactose-intolerant patients presented with more symptoms of flatulence (81.4%), bloating (68.5%), borborygmus (59.3%) and diarrhea (46.3%) compared with non-lactose-intolerant patients (p<0.05). We observed a significant association between the presence of the alleles T-13910 and A-22018 and the lactose-tolerant phenotype (p<0.05). After evaluation of the biochemical blood test results for lactose, we found that the most effective cutoff for glucose levels obtained for lactose malabsorbers was <15 mg/dL, presenting an area under the receiver operating characteristic curve greater than 80.3%, with satisfactory values for sensitivity and specificity. These data corroborate the association of these single nucleotide polymorphisms (C>T-13910 and G>A-22018) with lactose tolerance in this population and suggest clinical management for patients with lactose intolerance that considers single nucleotide polymorphism detection and a change in the biochemical blood test cutoff from <25 mg/dL to <15 mg/dL.
Adib-Samii, Poneh; Rost, Natalia; Traylor, Matthew; Devan, William; Biffi, Alessandro; Lanfranconi, Silvia; Fitzpatrick, Kaitlin; Bevan, Steve; Kanakis, Allison; Valant, Valerie; Gschwendtner, Andreas; Malik, Rainer; Richie, Alexa; Gamble, Dale; Segal, Helen; Parati, Eugenio A.; Ciusani, Emilio; Holliday, Elizabeth G.; Maguire, Jane; Wardlaw, Joanna; Worrall, Bradford; Bis, Joshua; Wiggins, Kerri L.; Longstreth, Will; Kittner, Steve J.; Cheng, Yu-Ching; Mosley, Thomas; Falcone, Guido J.; Furie, Karen L.; Leiva-Salinas, Carlos; Lau, Benison C.; Khan, Muhammed Saleem; Sharma, Pankaj; Fornage, Myriam; Mitchell, Braxton D.; Psaty, Bruce M.; Sudlow, Cathie; Levi, Christopher; Boncoraglio, Giorgio B.; Rothwell, Peter M.; Meschia, James; Dichgans, Martin; Rosand, Jonathan; Markus, Hugh S.
2013-01-01
Background and Purpose Recently, a novel locus at 17q25 was associated with white matter hyperintensities (WMH) on MRI in stroke-free individuals. We aimed to replicate the association with WMH volume (WMHV) in patients with ischemic stroke. If the association acts by promoting a small vessel arteriopathy, it might be expected to also associate with lacunar stroke. Methods We quantified WMH on MRI in the stroke-free hemisphere of 2588 ischemic stroke cases. Association between WMHV and 6 single-nucleotide polymorphisms at chromosome 17q25 was assessed by linear regression. These single-nucleotide polymorphisms were also investigated for association with lacunar stroke in 1854 cases and 51 939 stroke-free controls from METASTROKE. Meta-analyses with previous reports and a genetic risk score approach were applied to identify other novel WMHV risk variants and uncover shared genetic contributions to WMHV in community participants without stroke and ischemic stroke. Results Single-nucleotide polymorphisms at 17q25 were associated with WMHV in ischemic stroke, the most significant being rs9894383 (P=0.0006). In contrast, there was no association between any single-nucleotide polymorphism and lacunar stroke. A genetic risk score analysis revealed further genetic components to WMHV shared between community participants without stroke and ischemic stroke. Conclusions This study provides support for an association between the 17q25 locus and WMH. In contrast, it is not associated with lacunar stroke, suggesting that the association does not act by promoting small-vessel arteriopathy or the same arteriopathy responsible for lacunar infarction. PMID:23674528
Leonardo, Daniela P.; Albuquerque, Dulcinéia M.; Lanaro, Carolina; Baptista, Letícia C.; Cecatti, José G.; Surita, Fernanda G.; Parpinelli, Mary A.; Costa, Fernando F.; Franco-Penteado, Carla F.; Fertrin, Kleber Y.; Costa, Maria Laura
2015-01-01
Background Preeclampsia is one of the leading causes of maternal and neonatal morbidity and mortality in the world, but its appearance is still unpredictable and its pathophysiology has not been entirely elucidated. Genetic studies have associated single nucleotide polymorphisms in genes encoding nitric oxide synthase and matrix metalloproteases with preeclampsia, but the results are largely inconclusive across different populations. Objectives To investigate the association of single nucleotide polymorphisms (SNPs) in NOS3 (G894T, T-786C, and a variable number of tandem repetitions VNTR in intron 4), MMP2 (C-1306T), and MMP9 (C-1562T) genes with preeclampsia in patients from Southeastern Brazil. Methods This prospective case-control study enrolled 77 women with preeclampsia and 266 control pregnant women. Clinical data were collected to assess risk factors and the presence of severe complications, such as eclampsia and HELLP (hemolysis, elevated liver enzymes, and low platelets) syndrome. Results We found a significant association between the single nucleotide polymorphism NOS3 T-786C and preeclampsia, independently from age, height, weight, or the other SNPs studied, and no association was found with the other polymorphisms. Age and history of preeclampsia were also identified as risk factors. The presence of at least one polymorphic allele for NOS3 T-786C was also associated with the occurrence of eclampsia or HELLP syndrome among preeclamptic women. Conclusions Our data support that the NOS3 T-786C SNP is associated with preeclampsia and the severity of its complications. PMID:26317342
Ponte, Paulo Roberto Lins; de Medeiros, Pedro Henrique Quintela Soares; Havt, Alexandre; Caetano, Joselany Afio; Cid, David A C; de Moura Gondim Prata, Mara; Soares, Alberto Melo; Guerrant, Richard L; Mychaleckyj, Josyf; Lima, Aldo Ângelo Moreira
2016-01-01
OBJECTIVE: This work aimed to evaluate and correlate symptoms, biochemical blood test results and single nucleotide polymorphisms for lactose intolerance diagnosis. METHOD: A cross-sectional study was conducted in Fortaleza, Ceará, Brazil, with a total of 119 patients, 54 of whom were lactose intolerant. Clinical evaluation and biochemical blood tests were conducted after lactose ingestion and blood samples were collected for genotyping evaluation. In particular, the single nucleotide polymorphisms C>T-13910 and G>A-22018 were analyzed by restriction fragment length polymorphism/polymerase chain reaction and validated by DNA sequencing. RESULTS: Lactose-intolerant patients presented with more symptoms of flatulence (81.4%), bloating (68.5%), borborygmus (59.3%) and diarrhea (46.3%) compared with non-lactose-intolerant patients (p<0.05). We observed a significant association between the presence of the alleles T-13910 and A-22018 and the lactose-tolerant phenotype (p<0.05). After evaluation of the biochemical blood test results for lactose, we found that the most effective cutoff for glucose levels obtained for lactose malabsorbers was <15 mg/dL, presenting an area under the receiver operating characteristic curve greater than 80.3%, with satisfactory values for sensitivity and specificity. CONCLUSIONS: These data corroborate the association of these single nucleotide polymorphisms (C>T-13910 and G>A-22018) with lactose tolerance in this population and suggest clinical management for patients with lactose intolerance that considers single nucleotide polymorphism detection and a change in the biochemical blood test cutoff from <25 mg/dL to <15 mg/dL. PMID:26934237
Marq, Jean-Baptiste; Hausmann, Stéphane; Veillard, Nicolas; Kolakofsky, Daniel; Garcin, Dominique
2011-02-25
Arenavirus RNA genomes are initiated by a "prime and realign" mechanism, such that the initiating GTP is found as a single unpaired (overhanging) nucleotide when the complementary genome ends anneal to form double-stranded (ds) RNA panhandle structures. dsRNAs modeled on these structures do not induce interferon (IFN), as opposed to blunt-ended (5' ppp)dsRNA. This study examines whether these viral structures can also act as decoys, by trapping RIG-I in inactive dsRNA complexes. We examined the ability of various dsRNAs to activate the RIG-I ATPase (presumably a measure of helicase translocation on dsRNA) relative to their ability to induce IFN. We found that there is no simple relationship between these two properties, as if RIG-I can translocate on short dsRNAs without inducing IFN. Moreover, we found that (5' ppp)dsRNAs with a single unpaired 5' ppp-nucleotide can in fact competitively inhibit the ability of blunt-ended (5' ppp)dsRNAs to induce IFN when co-transfected into cells and that this inhibition is strongly dependent on the presence of the 5' ppp. In contrast, (5' ppp)dsRNAs with a single unpaired 5' ppp-nucleotide does not inhibit poly(I-C)-induced IFN activation, which is independent of the presence of a 5' ppp group.
Four Linked Genes Participate in Controlling Sporulation Efficiency in Budding Yeast
Ben-Ari, Giora; Zenvirth, Drora; Sherman, Amir; David, Lior; Klutstein, Michael; Lavi, Uri; Hillel, Jossi; Simchen, Giora
2006-01-01
Quantitative traits are conditioned by several genetic determinants. Since such genes influence many important complex traits in various organisms, the identification of quantitative trait loci (QTLs) is of major interest, but still encounters serious difficulties. We detected four linked genes within one QTL, which participate in controlling sporulation efficiency in Saccharomyces cerevisiae. Following the identification of single nucleotide polymorphisms by comparing the sequences of 145 genes between the parental strains SK1 and S288c, we analyzed the segregating progeny of the cross between them. Through reciprocal hemizygosity analysis, four genes, RAS2, PMS1, SWS2, and FKH2, located in a region of 60 kilobases on Chromosome 14, were found to be associated with sporulation efficiency. Three of the four “high” sporulation alleles are derived from the “low” sporulating strain. Two of these sporulation-related genes were verified through allele replacements. For RAS2, the causative variation was suggested to be a single nucleotide difference in the upstream region of the gene. This quantitative trait nucleotide accounts for sporulation variability among a set of ten closely related winery yeast strains. Our results provide a detailed view of genetic complexity in one “QTL region” that controls a quantitative trait and reports a single nucleotide polymorphism-trait association in wild strains. Moreover, these findings have implications on QTL identification in higher eukaryotes. PMID:17112318
Liu, J; Turnbough, C L
1994-01-01
In Escherichia coli, expression of the pyrC gene is regulated primarily by a translational control mechanism based on nucleotide-sensitive selection of transcriptional start sites at the pyrC promoter. When intracellular levels of CTP are high, pyrC transcripts are initiated predominantly with CTP at a site 7 bases downstream of the Pribnow box. These transcripts form a stable hairpin at their 5' ends that blocks ribosome binding. When the CTP level is low and the GTP level is high, conditions found in pyrimidine-limited cells, transcripts are initiated primarily with GTP at a site 9 bases downstream of the Pribnow box. These shorter transcripts are unable to form a hairpin at their 5' ends and are readily translated. In this study, we examined the effects of nucleotide sequence and position on the selection of transcriptional start sites at the pyrC promoter. We characterized promoter mutations that systematically alter the sequence at position 7 or 9 downstream of the Pribnow box or vary the spacing between the Pribnow box and wild-type transcriptional initiation region. The results reveal preferences for particular initiating nucleotides (ATP > or = GTP > UTP >> CTP) and for starting positions downstream of the Pribnow box (7 >> 6 and 8 > 9 > 10). The results indicate that optimal nucleotide-sensitive start site switching at the wild-type pyrC promoter is the result of competition between the preferred start site (position 7) that uses the poorest initiating nucleotide (CTP) and a weak start site (position 9) that uses a good initiating nucleotide (GTP). The sequence of the pyrC promoter also minimizes the synthesis of untranslatable transcripts and provides for maximum stability of the regulatory transcript hairpin. In addition, the results show that the effects of the mutations on pyrC expression and regulation are consistent with the current model for translational control. Possible effects of preferences for initiating nucleotides and start sites on the expression and regulation of other genes are discussed. Images PMID:7910603
New evidence for positive selection helps explain the paternal age effect observed in achondroplasia
Shinde, Deepali N.; Elmer, Dominik P.; Calabrese, Peter; Boulanger, Jérôme; Arnheim, Norman; Tiemann-Boege, Irene
2013-01-01
There are certain de novo germline mutations associated with genetic disorders whose mutation rates per generation are orders of magnitude higher than the genome average. Moreover, these mutations occur exclusively in the male germ line and older men have a higher probability of having an affected child than younger ones, known as the paternal age effect (PAE). The classic example of a genetic disorder exhibiting a PAE is achondroplasia, caused predominantly by a single-nucleotide substitution (c.1138G>A) in FGFR3. To elucidate what mechanisms might be driving the high frequency of this mutation in the male germline, we examined the spatial distribution of the c.1138G>A substitution in a testis from an 80-year-old unaffected man. Using a technology based on bead-emulsion amplification, we were able to measure mutation frequencies in 192 individual pieces of the dissected testis with a false-positive rate lower than 2.7 × 10−6. We observed that most mutations are clustered in a few pieces with 95% of all mutations occurring in 27% of the total testis. Using computational simulations, we rejected the model proposing an elevated mutation rate per cell division at this nucleotide site. Instead, we determined that the observed mutation distribution fits a germline selection model, where mutant spermatogonial stem cells have a proliferative advantage over unmutated cells. Combined with data on several other PAE mutations, our results support the idea that the PAE, associated with a number of Mendelian disorders, may be explained primarily by a selective mechanism. PMID:23740942
Wang, Juan; Xue, Dong-Xiu; Zhang, Bai-Dong; Li, Yu-Long; Liu, Bing-Jian; Liu, Jin-Xian
2016-01-01
Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs) allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus) is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE) for 30 individuals from two populations. The nucleotide diversity (π) for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001) and the putatively neutral SNPs (FST = 0.0347, P < 0.001). However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001). Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40%) significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus. PMID:27336696
Wang, Juan; Xue, Dong-Xiu; Zhang, Bai-Dong; Li, Yu-Long; Liu, Bing-Jian; Liu, Jin-Xian
2016-01-01
Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs) allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus) is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE) for 30 individuals from two populations. The nucleotide diversity (π) for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001) and the putatively neutral SNPs (FST = 0.0347, P < 0.001). However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001). Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40%) significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus.
Optimal selection of markers for validation or replication from genome-wide association studies.
Greenwood, Celia M T; Rangrej, Jagadish; Sun, Lei
2007-07-01
With reductions in genotyping costs and the fast pace of improvements in genotyping technology, it is not uncommon for the individuals in a single study to undergo genotyping using several different platforms, where each platform may contain different numbers of markers selected via different criteria. For example, a set of cases and controls may be genotyped at markers in a small set of carefully selected candidate genes, and shortly thereafter, the same cases and controls may be used for a genome-wide single nucleotide polymorphism (SNP) association study. After such initial investigations, often, a subset of "interesting" markers is selected for validation or replication. Specifically, by validation, we refer to the investigation of associations between the selected subset of markers and the disease in independent data. However, it is not obvious how to choose the best set of markers for this validation. There may be a prior expectation that some sets of genotyping data are more likely to contain real associations. For example, it may be more likely for markers in plausible candidate genes to show disease associations than markers in a genome-wide scan. Hence, it would be desirable to select proportionally more markers from the candidate gene set. When a fixed number of markers are selected for validation, we propose an approach for identifying an optimal marker-selection configuration by basing the approach on minimizing the stratified false discovery rate. We illustrate this approach using a case-control study of colorectal cancer from Ontario, Canada, and we show that this approach leads to substantial reductions in the estimated false discovery rates in the Ontario dataset for the selected markers, as well as reductions in the expected false discovery rates for the proposed validation dataset. Copyright 2007 Wiley-Liss, Inc.
Genetics of Oxidative Stress in Obesity
Rupérez, Azahara I.; Gil, Angel; Aguilera, Concepción M.
2014-01-01
Obesity is a multifactorial disease characterized by the excessive accumulation of fat in adipose tissue and peripheral organs. Its derived metabolic complications are mediated by the associated oxidative stress, inflammation and hypoxia. Oxidative stress is due to the excessive production of reactive oxygen species or diminished antioxidant defenses. Genetic variants, such as single nucleotide polymorphisms in antioxidant defense system genes, could alter the efficacy of these enzymes and, ultimately, the risk of obesity; thus, studies investigating the role of genetic variations in genes related to oxidative stress could be useful for better understanding the etiology of obesity and its metabolic complications. The lack of existing literature reviews in this field encouraged us to gather the findings from studies focusing on the impact of single nucleotide polymorphisms in antioxidant enzymes, oxidative stress-producing systems and transcription factor genes concerning their association with obesity risk and its phenotypes. In the future, the characterization of these single nucleotide polymorphisms (SNPs) in obese patients could contribute to the development of controlled antioxidant therapies potentially beneficial for the treatment of obesity-derived metabolic complications. PMID:24562334
Genetics of oxidative stress in obesity.
Rupérez, Azahara I; Gil, Angel; Aguilera, Concepción M
2014-02-20
Obesity is a multifactorial disease characterized by the excessive accumulation of fat in adipose tissue and peripheral organs. Its derived metabolic complications are mediated by the associated oxidative stress, inflammation and hypoxia. Oxidative stress is due to the excessive production of reactive oxygen species or diminished antioxidant defenses. Genetic variants, such as single nucleotide polymorphisms in antioxidant defense system genes, could alter the efficacy of these enzymes and, ultimately, the risk of obesity; thus, studies investigating the role of genetic variations in genes related to oxidative stress could be useful for better understanding the etiology of obesity and its metabolic complications. The lack of existing literature reviews in this field encouraged us to gather the findings from studies focusing on the impact of single nucleotide polymorphisms in antioxidant enzymes, oxidative stress-producing systems and transcription factor genes concerning their association with obesity risk and its phenotypes. In the future, the characterization of these single nucleotide polymorphisms (SNPs) in obese patients could contribute to the development of controlled antioxidant therapies potentially beneficial for the treatment of obesity-derived metabolic complications.
A graphene-based platform for single nucleotide polymorphism (SNP) genotyping.
Liu, Meng; Zhao, Huimin; Chen, Shuo; Yu, Hongtao; Zhang, Yaobin; Quan, Xie
2011-06-15
A facile, rapid, stable and sensitive approach for fluorescent detection of single nucleotide polymorphism (SNP) is designed based on DNA ligase reaction and π-stacking between the graphene and the nucleotide bases. In the presence of perfectly matched DNA, DNA ligase can catalyze the linkage of fluorescein amidite-labeled single-stranded DNA (ssDNA) and a phosphorylated ssDNA, and thus the formation of a stable duplex in high yield. However, the catalytic reaction cannot effectively carry out with one-base mismatched DNA target. In this case, we add graphene to the system in order to produce different quenching signals due to its different adsorption affinity for ssDNA and double-stranded DNA. Taking advantage of the unique surface property of graphene and the high discriminability of DNA ligase, the proposed protocol exhibits good performance in SNP genotyping. The results indicate that it is possible to accurately determine SNP with frequency as low as 2.6% within 40 min. Furthermore, the presented flexible strategy facilitates the development of other biosensing applications in the future. Copyright © 2011 Elsevier B.V. All rights reserved.
WEB-server for search of a periodicity in amino acid and nucleotide sequences
NASA Astrophysics Data System (ADS)
E Frenkel, F.; Skryabin, K. G.; Korotkov, E. V.
2017-12-01
A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server operation is based upon a new mathematical method of searching for multiple alignments, which is founded on the position weight matrices optimization, as well as on implementation of the two-dimensional dynamic programming. This approach allows the construction of multiple alignments of the indistinctly similar amino acid and nucleotide sequences that accumulated more than 1.5 substitutions per a single amino acid or a nucleotide without performing the sequences paired comparisons. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as information that could be obtained using the web server.
Selection of a novel CD19 aptamer for targeted delivery of doxorubicin to lymphoma cells.
Hu, Yan; Li, Xiaoou; An, Yacong; Duan, Jinhong; Yang, Xian-Da
2018-06-01
CD19 is overexpressed in most human B cell malignancies and considered an important tumor marker for diagnosis and treatment. Aptamers are oligonucleotides that may potentially serve as tumor-homing ligand for targeted cancer therapy with excellent affinity and specificity. In this study, we selected a novel CD19 aptamer (LC1) that was a 59-nucleotide single strand DNA. The aptamer could bind to recombinant CD19 protein with a K d of 85.4 nM, and had minimal cross reactivity to bovine serum albumin (BSA) or ovalbumin (OVA). Moreover, the aptamer was found capable of binding with the CD19-positive lymphoma cells (Ramos and Raji), but not the CD19-negative cell lines (Jurkat and NB4). An aptamer-doxorubicin complex (Apt-Dox) was also formulated, and selectively delivered doxorubicin to CD19-positive lymphoma cells in vitro . The results indicate that aptamer LC1 can recognize CD19-positive tumor cells and may potentially function as a CD19-targeting ligand.
Portnoy, D S; Puritz, J B; Hollenbeck, C M; Gelsleichter, J; Chapman, D; Gold, J R
2015-12-01
Sex-biased dispersal is expected to homogenize nuclear genetic variation relative to variation in genetic material inherited through the philopatric sex. When site fidelity occurs across a heterogeneous environment, local selective regimes may alter this pattern. We assessed spatial patterns of variation in nuclear-encoded, single nucleotide polymorphisms (SNPs) and sequences of the mitochondrial control region in bonnethead sharks (Sphyrna tiburo), a species thought to exhibit female philopatry, collected from summer habitats used for gestation. Geographic patterns of mtDNA haplotypes and putatively neutral SNPs confirmed female philopatry and male-mediated gene flow along the northeastern coast of the Gulf of Mexico. A total of 30 outlier SNP loci were identified; alleles at over half of these loci exhibited signatures of latitude-associated selection. Our results indicate that in species with sex-biased dispersal, philopatry can facilitate sorting of locally adaptive variation, with the dispersing sex facilitating movement of potentially adaptive variation among locations and environments. © 2015 John Wiley & Sons Ltd.
Cassava genome from a wild ancestor to cultivated varieties
Wang, Wenquan; Feng, Binxiao; Xiao, Jingfa; Xia, Zhiqiang; Zhou, Xincheng; Li, Pinghua; Zhang, Weixiong; Wang, Ying; Møller, Birger Lindberg; Zhang, Peng; Luo, Ming-Cheng; Xiao, Gong; Liu, Jingxing; Yang, Jun; Chen, Songbi; Rabinowicz, Pablo D.; Chen, Xin; Zhang, Hong-Bin; Ceballos, Henan; Lou, Qunfeng; Zou, Meiling; Carvalho, Luiz J.C.B.; Zeng, Changying; Xia, Jing; Sun, Shixiang; Fu, Yuhua; Wang, Haiyan; Lu, Cheng; Ruan, Mengbin; Zhou, Shuigeng; Wu, Zhicheng; Liu, Hui; Kannangara, Rubini Maya; Jørgensen, Kirsten; Neale, Rebecca Louise; Bonde, Maya; Heinz, Nanna; Zhu, Wenli; Wang, Shujuan; Zhang, Yang; Pan, Kun; Wen, Mingfu; Ma, Ping-An; Li, Zhengxu; Hu, Meizhen; Liao, Wenbin; Hu, Wenbin; Zhang, Shengkui; Pei, Jinli; Guo, Anping; Guo, Jianchun; Zhang, Jiaming; Zhang, Zhengwen; Ye, Jianqiu; Ou, Wenjun; Ma, Yaqin; Liu, Xinyue; Tallon, Luke J.; Galens, Kevin; Ott, Sandra; Huang, Jie; Xue, Jingjing; An, Feifei; Yao, Qingqun; Lu, Xiaojing; Fregene, Martin; López-Lavalle, L. Augusto Becerra; Wu, Jiajie; You, Frank M.; Chen, Meili; Hu, Songnian; Wu, Guojiang; Zhong, Silin; Ling, Peng; Chen, Yeyuan; Wang, Qinghuang; Liu, Guodao; Liu, Bin; Li, Kaimian; Peng, Ming
2014-01-01
Cassava is a major tropical food crop in the Euphorbiaceae family that has high carbohydrate production potential and adaptability to diverse environments. Here we present the draft genome sequences of a wild ancestor and a domesticated variety of cassava and comparative analyses with a partial inbred line. We identify 1,584 and 1,678 gene models specific to the wild and domesticated varieties, respectively, and discover high heterozygosity and millions of single-nucleotide variations. Our analyses reveal that genes involved in photosynthesis, starch accumulation and abiotic stresses have been positively selected, whereas those involved in cell wall biosynthesis and secondary metabolism, including cyanogenic glucoside formation, have been negatively selected in the cultivated varieties, reflecting the result of natural selection and domestication. Differences in microRNA genes and retrotransposon regulation could partly explain an increased carbon flux towards starch accumulation and reduced cyanogenic glucoside accumulation in domesticated cassava. These results may contribute to genetic improvement of cassava through better understanding of its biology. PMID:25300236
Atanur, Santosh S; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R; Kaisaki, Pamela J; Otto, Georg W; Ma, Man Chun John; Keane, Thomas M; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J
2013-08-01
Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. Copyright © 2013 The Authors. Published by Elsevier Inc. All rights reserved.
Ma, Xin; Kelley, Joanna L.; Eilertson, Kirsten; Musharoff, Shaila; Degenhardt, Jeremiah D.; Martins, André L.; Vinar, Tomas; Kosiol, Carolin; Siepel, Adam; Gutenkunst, Ryan N.; Bustamante, Carlos D.
2013-01-01
To gain insights into evolutionary forces that have shaped the history of Bornean and Sumatran populations of orang-utans, we compare patterns of variation across more than 11 million single nucleotide polymorphisms found by previous mitochondrial and autosomal genome sequencing of 10 wild-caught orang-utans. Our analysis of the mitochondrial data yields a far more ancient split time between the two populations (∼3.4 million years ago) than estimates based on autosomal data (0.4 million years ago), suggesting a complex speciation process with moderate levels of primarily male migration. We find that the distribution of selection coefficients consistent with the observed frequency spectrum of autosomal non-synonymous polymorphisms in orang-utans is similar to the distribution in humans. Our analysis indicates that 35% of genes have evolved under detectable negative selection. Overall, our findings suggest that purifying natural selection, genetic drift, and a complex demographic history are the dominant drivers of genome evolution for the two orang-utan populations. PMID:24194868
Ma, Xin; Kelley, Joanna L; Eilertson, Kirsten; Musharoff, Shaila; Degenhardt, Jeremiah D; Martins, André L; Vinar, Tomas; Kosiol, Carolin; Siepel, Adam; Gutenkunst, Ryan N; Bustamante, Carlos D
2013-01-01
To gain insights into evolutionary forces that have shaped the history of Bornean and Sumatran populations of orang-utans, we compare patterns of variation across more than 11 million single nucleotide polymorphisms found by previous mitochondrial and autosomal genome sequencing of 10 wild-caught orang-utans. Our analysis of the mitochondrial data yields a far more ancient split time between the two populations (~3.4 million years ago) than estimates based on autosomal data (0.4 million years ago), suggesting a complex speciation process with moderate levels of primarily male migration. We find that the distribution of selection coefficients consistent with the observed frequency spectrum of autosomal non-synonymous polymorphisms in orang-utans is similar to the distribution in humans. Our analysis indicates that 35% of genes have evolved under detectable negative selection. Overall, our findings suggest that purifying natural selection, genetic drift, and a complex demographic history are the dominant drivers of genome evolution for the two orang-utan populations.
2013-01-01
Background High resolution melting analysis (HRM) is a rapid and cost-effective technique for the characterisation of PCR amplicons. Because the reverse genetics of segmented influenza A viruses allows the generation of numerous influenza A virus reassortants within a short time, methods for the rapid selection of the correct recombinants are very useful. Methods PCR primer pairs covering the single nucleotide polymorphism (SNP) positions of two different influenza A H5N1 strains were designed. Reassortants of the two different H5N1 isolates were used as a model to prove the suitability of HRM for the selection of the correct recombinants. Furthermore, two different cycler instruments were compared. Results Both cycler instruments generated comparable average melting peaks, which allowed the easy identification and selection of the correct cloned segments or reassorted viruses. Conclusions HRM is a highly suitable method for the rapid and precise characterisation of cloned influenza A genomes. PMID:24028349
Atanur, Santosh S.; Diaz, Ana Garcia; Maratou, Klio; Sarkis, Allison; Rotival, Maxime; Game, Laurence; Tschannen, Michael R.; Kaisaki, Pamela J.; Otto, Georg W.; Ma, Man Chun John; Keane, Thomas M.; Hummel, Oliver; Saar, Kathrin; Chen, Wei; Guryev, Victor; Gopalakrishnan, Kathirvel; Garrett, Michael R.; Joe, Bina; Citterio, Lorena; Bianchi, Giuseppe; McBride, Martin; Dominiczak, Anna; Adams, David J.; Serikawa, Tadao; Flicek, Paul; Cuppen, Edwin; Hubner, Norbert; Petretto, Enrico; Gauguier, Dominique; Kwitek, Anne; Jacob, Howard; Aitman, Timothy J.
2013-01-01
Summary Large numbers of inbred laboratory rat strains have been developed for a range of complex disease phenotypes. To gain insights into the evolutionary pressures underlying selection for these phenotypes, we sequenced the genomes of 27 rat strains, including 11 models of hypertension, diabetes, and insulin resistance, along with their respective control strains. Altogether, we identified more than 13 million single-nucleotide variants, indels, and structural variants across these rat strains. Analysis of strain-specific selective sweeps and gene clusters implicated genes and pathways involved in cation transport, angiotensin production, and regulators of oxidative stress in the development of cardiovascular disease phenotypes in rats. Many of the rat loci that we identified overlap with previously mapped loci for related traits in humans, indicating the presence of shared pathways underlying these phenotypes in rats and humans. These data represent a step change in resources available for evolutionary analysis of complex traits in disease models. PaperClip PMID:23890820
Quantum-Sequencing: Fast electronic single DNA molecule sequencing
NASA Astrophysics Data System (ADS)
Casamada Ribot, Josep; Chatterjee, Anushree; Nagpal, Prashant
2014-03-01
A major goal of third-generation sequencing technologies is to develop a fast, reliable, enzyme-free, high-throughput and cost-effective, single-molecule sequencing method. Here, we present the first demonstration of unique ``electronic fingerprint'' of all nucleotides (A, G, T, C), with single-molecule DNA sequencing, using Quantum-tunneling Sequencing (Q-Seq) at room temperature. We show that the electronic state of the nucleobases shift depending on the pH, with most distinct states identified at acidic pH. We also demonstrate identification of single nucleotide modifications (methylation here). Using these unique electronic fingerprints (or tunneling data), we report a partial sequence of beta lactamase (bla) gene, which encodes resistance to beta-lactam antibiotics, with over 95% success rate. These results highlight the potential of Q-Seq as a robust technique for next-generation sequencing.
Korshoj, Lee E; Afsari, Sepideh; Chatterjee, Anushree; Nagpal, Prashant
2017-11-01
Electronic conduction or charge transport through single molecules depends primarily on molecular structure and anchoring groups and forms the basis for a wide range of studies from molecular electronics to DNA sequencing. Several high-throughput nanoelectronic methods such as mechanical break junctions, nanopores, conductive atomic force microscopy, scanning tunneling break junctions, and static nanoscale electrodes are often used for measuring single-molecule conductance. In these measurements, "smearing" due to conformational changes and other entropic factors leads to large variances in the observed molecular conductance, especially in individual measurements. Here, we show a method for characterizing smear in single-molecule conductance measurements and demonstrate how binning measurements according to smear can significantly enhance the use of individual conductance measurements for molecular recognition. Using quantum point contact measurements on single nucleotides within DNA macromolecules, we demonstrate that the distance over which molecular junctions are maintained is a measure of smear, and the resulting variance in unbiased single measurements depends on this smear parameter. Our ability to identify individual DNA nucleotides at 20× coverage increases from 81.3% accuracy without smear analysis to 93.9% with smear characterization and binning (SCRIB). Furthermore, merely 7 conductance measurements (7× coverage) are needed to achieve 97.8% accuracy for DNA nucleotide recognition when only low molecular smear measurements are used, which represents a significant improvement over contemporary sequencing methods. These results have important implications in a broad range of molecular electronics applications from designing robust molecular switches to nanoelectronic DNA sequencing.
2012-01-01
Background Efficient, robust, and accurate genotype imputation algorithms make large-scale application of genomic selection cost effective. An algorithm that imputes alleles or allele probabilities for all animals in the pedigree and for all genotyped single nucleotide polymorphisms (SNP) provides a framework to combine all pedigree, genomic, and phenotypic information into a single-stage genomic evaluation. Methods An algorithm was developed for imputation of genotypes in pedigreed populations that allows imputation for completely ungenotyped animals and for low-density genotyped animals, accommodates a wide variety of pedigree structures for genotyped animals, imputes unmapped SNP, and works for large datasets. The method involves simple phasing rules, long-range phasing and haplotype library imputation and segregation analysis. Results Imputation accuracy was high and computational cost was feasible for datasets with pedigrees of up to 25 000 animals. The resulting single-stage genomic evaluation increased the accuracy of estimated genomic breeding values compared to a scenario in which phenotypes on relatives that were not genotyped were ignored. Conclusions The developed imputation algorithm and software and the resulting single-stage genomic evaluation method provide powerful new ways to exploit imputation and to obtain more accurate genetic evaluations. PMID:22462519
Bavykin, Sergei G.; Mirzabekova, legal representative, Natalia V.; Mirzabekov, deceased, Andrei D.
2007-12-04
The present invention relates to methods and compositions for using nucleotide sequence variations of 16S and 23S rRNA within the B. cereus group to discriminate a highly infectious bacterium B. anthracis from closely related microorganisms. Sequence variations in the 16S and 23S rRNA of the B. cereus subgroup including B. anthracis are utilized to construct an array that can detect these sequence variations through selective hybridizations and discriminate B. cereus group that includes B. anthracis. Discrimination of single base differences in rRNA was achieved with a microchip during analysis of B. cereus group isolates from both single and in mixed samples, as well as identification of polymorphic sites. Successful use of a microchip to determine the appropriate subgroup classification using eight reference microorganisms from the B. cereus group as a study set, was demonstrated.
Label-free detection of DNA hybridization using carbon nanotube network field-effect transistors
NASA Astrophysics Data System (ADS)
Star, Alexander; Tu, Eugene; Niemann, Joseph; Gabriel, Jean-Christophe P.; Joiner, C. Steve; Valcke, Christian
2006-01-01
We report carbon nanotube network field-effect transistors (NTNFETs) that function as selective detectors of DNA immobilization and hybridization. NTNFETs with immobilized synthetic oligonucleotides have been shown to specifically recognize target DNA sequences, including H63D single-nucleotide polymorphism (SNP) discrimination in the HFE gene, responsible for hereditary hemochromatosis. The electronic responses of NTNFETs upon single-stranded DNA immobilization and subsequent DNA hybridization events were confirmed by using fluorescence-labeled oligonucleotides and then were further explored for label-free DNA detection at picomolar to micromolar concentrations. We have also observed a strong effect of DNA counterions on the electronic response, thus suggesting a charge-based mechanism of DNA detection using NTNFET devices. Implementation of label-free electronic detection assays using NTNFETs constitutes an important step toward low-cost, low-complexity, highly sensitive and accurate molecular diagnostics. hemochromatosis | SNP | biosensor
Analysis of Genome-Wide Association Studies with Multiple Outcomes Using Penalization
Liu, Jin; Huang, Jian; Ma, Shuangge
2012-01-01
Genome-wide association studies have been extensively conducted, searching for markers for biologically meaningful outcomes and phenotypes. Penalization methods have been adopted in the analysis of the joint effects of a large number of SNPs (single nucleotide polymorphisms) and marker identification. This study is partly motivated by the analysis of heterogeneous stock mice dataset, in which multiple correlated phenotypes and a large number of SNPs are available. Existing penalization methods designed to analyze a single response variable cannot accommodate the correlation among multiple response variables. With multiple response variables sharing the same set of markers, joint modeling is first employed to accommodate the correlation. The group Lasso approach is adopted to select markers associated with all the outcome variables. An efficient computational algorithm is developed. Simulation study and analysis of the heterogeneous stock mice dataset show that the proposed method can outperform existing penalization methods. PMID:23272092
Shi, Chao; Ge, Yujie; Gu, Hongxi; Ma, Cuiping
2011-08-15
Single nucleotide polymorphism (SNP) genotyping is attracting extensive attentions owing to its direct connections with human diseases including cancers. Here, we have developed a highly sensitive chemiluminescence biosensor based on circular strand-displacement amplification and the separation by magnetic beads reducing the background signal for point mutation detection at room temperature. This method took advantage of both the T4 DNA ligase recognizing single-base mismatch with high selectivity and the strand-displacement reaction of polymerase to perform signal amplification. The detection limit of this method was 1.3 × 10(-16)M, which showed better sensitivity than that of most of those reported detection methods of SNP. Additionally, the magnetic beads as carrier of immobility was not only to reduce the background signal, but also may have potential apply in high through-put screening of SNP detection in human genome. Copyright © 2011 Elsevier B.V. All rights reserved.
Genotoxin induced mutagenesis in the model plant Physcomitrella patens.
Holá, Marcela; Kozák, Jaroslav; Vágnerová, Radka; Angelis, Karel J
2013-01-01
The moss Physcomitrella patens is unique for the high frequency of homologous recombination, haploid state, and filamentous growth during early stages of the vegetative growth, which makes it an excellent model plant to study DNA damage responses. We used single cell gel electrophoresis (comet) assay to determine kinetics of response to Bleomycin induced DNA oxidative damage and single and double strand breaks in wild type and mutant lig4 Physcomitrella lines. Moreover, APT gene when inactivated by induced mutations was used as selectable marker to ascertain mutational background at nucleotide level by sequencing of the APT locus. We show that extensive repair of DSBs occurs also in the absence of the functional LIG4, whereas repair of SSBs is seriously compromised. From analysis of induced mutations we conclude that their accumulation rather than remaining lesions in DNA and blocking progression through cell cycle is incompatible with normal plant growth and development and leads to sensitive phenotype.
Genotoxin Induced Mutagenesis in the Model Plant Physcomitrella patens
Holá, Marcela; Kozák, Jaroslav; Vágnerová, Radka; Angelis, Karel J.
2013-01-01
The moss Physcomitrella patens is unique for the high frequency of homologous recombination, haploid state, and filamentous growth during early stages of the vegetative growth, which makes it an excellent model plant to study DNA damage responses. We used single cell gel electrophoresis (comet) assay to determine kinetics of response to Bleomycin induced DNA oxidative damage and single and double strand breaks in wild type and mutant lig4 Physcomitrella lines. Moreover, APT gene when inactivated by induced mutations was used as selectable marker to ascertain mutational background at nucleotide level by sequencing of the APT locus. We show that extensive repair of DSBs occurs also in the absence of the functional LIG4, whereas repair of SSBs is seriously compromised. From analysis of induced mutations we conclude that their accumulation rather than remaining lesions in DNA and blocking progression through cell cycle is incompatible with normal plant growth and development and leads to sensitive phenotype. PMID:24383055
Controlled Microwave Heating Accelerates Rolling Circle Amplification.
Yoshimura, Takeo; Suzuki, Takamasa; Mineki, Shigeru; Ohuchi, Shokichi
2015-01-01
Rolling circle amplification (RCA) generates single-stranded DNAs or RNA, and the diverse applications of this isothermal technique range from the sensitive detection of nucleic acids to analysis of single nucleotide polymorphisms. Microwave chemistry is widely applied to increase reaction rate as well as product yield and purity. The objectives of the present research were to apply microwave heating to RCA and indicate factors that contribute to the microwave selective heating effect. The microwave reaction temperature was strictly controlled using a microwave applicator optimized for enzymatic-scale reactions. Here, we showed that microwave-assisted RCA reactions catalyzed by either of the four thermostable DNA polymerases were accelerated over 4-folds compared with conventional RCA. Furthermore, the temperatures of the individual buffer components were specifically influenced by microwave heating. We concluded that microwave heating accelerated isothermal RCA of DNA because of the differential heating mechanisms of microwaves on the temperatures of reaction components, although the overall reaction temperatures were the same.
Yeast ribonuclease III uses a network of multiple hydrogen bonds for RNA binding and cleavage.
Lavoie, Mathieu; Abou Elela, Sherif
2008-08-19
Members of the bacterial RNase III family recognize a variety of short structured RNAs with few common features. It is not clear how this group of enzymes supports high cleavage fidelity while maintaining a broad base of substrates. Here we show that the yeast orthologue of RNase III (Rnt1p) uses a network of 2'-OH-dependent interactions to recognize substrates with different structures. We designed a series of bipartite substrates permitting the distinction between binding and cleavage defects. Each substrate was engineered to carry a single or multiple 2'- O-methyl or 2'-fluoro ribonucleotide substitutions to prevent the formation of hydrogen bonds with a specific nucleotide or group of nucleotides. Interestingly, introduction of 2'- O-methyl ribonucleotides near the cleavage site increased the rate of catalysis, indicating that 2'-OH are not required for cleavage. Substitution of nucleotides in known Rnt1p binding site with 2'- O-methyl ribonucleotides inhibited cleavage while single 2'-fluoro ribonucleotide substitutions did not. This indicates that while no single 2'-OH is essential for Rnt1p cleavage, small changes in the substrate structure are not tolerated. Strikingly, several nucleotide substitutions greatly increased the substrate dissociation constant with little or no effect on the Michaelis-Menten constant or rate of catalysis. Together, the results indicate that Rnt1p uses a network of nucleotide interactions to identify its substrate and support two distinct modes of binding. One mode is primarily mediated by the dsRNA binding domain and leads to the formation of stable RNA/protein complex, while the other requires the presence of the nuclease and N-terminal domains and leads to RNA cleavage.
Control of apical membrane chloride permeability in the renal A6 cell line by nucleotides
Banderali, U; Brochiero, E; Lindenthal, S; Raschi, C; Bogliolo, S; Ehrenfeld, J
1999-01-01
The effect of extracellular nucleotides applied on the apical side of polarised A6 cells grown on permeant filters was investigated by measuring the changes in (i) the 36Cl efflux through the apical membranes, (ii) the intracellular chloride concentrations (aCli, measured with N-(6-methoxyquinolyl) acetoethyl ester, MQAE), (iii) ICl, the short-circuit current in the absence of Na+ transport and (iv) the characteristics of the apical chloride channels using a patch-clamp approach. ATP or UTP (0.1-500 μm) transiently stimulated ICl. The sequence of purinergic agonist potencies was UTP = ATP > ADP >> the P2X-selective agonist β,γ-methylene ATP = the P2Y-selective agonist 2-methylthioATP. Suramin (100 μm) as the P2Y antagonist Reactive Blue 2 (10 μm) had no effect on the UTP (or ATP)-stimulated current. These findings are consistent with the presence of P2Y2-like receptors located on the apical membranes of A6 cells. Apical application of adenosine also transiently increased ICl. This effect was blocked by theophylline while the UTP-stimulated ICl was not. The existence of a second receptor, of the P1 type is proposed. ATP (or UTP)-stimulated ICl was blocked by apical application of 200 μmN-phenylanthranilic acid (DPC) or 100 μm niflumic acid while 100 μm glibenclamide was ineffective. Ionomycin and thapsigargin both transiently stimulated ICl; the nucleotide stimulation of ICl was not suppressed by pre-treatment with these agents. Chlorpromazin (50 μm), a Ca2+-calmodulin inhibitor strongly inhibited the stimulation of ICl induced either by apical UTP or by ionomycin application. BAPTA-AM pre-treatment of A6 cells blocked the UTP-stimulated ICl. Niflumic acid also blocked the ionomycin stimulated ICl. A fourfold increase in 36Cl effluxes through the apical membranes was observed after ATP or UTP application. These increases of the apical chloride permeability could also be observed when following aCli changes. Apical application of DPC (1 mm) or 5-nitro-2(3-phenylpropylamino)benzoic acid (NPPB; 500 μm) produced an incomplete inhibition of 36Cl effluxes through the apical membranes in ATP-stimulated and in untreated monolayers. In single channel patch-clamp experiments, an apical chloride channel with a unitary single channel conductance of 7.3 ± 0.6 pS (n = 12) was usually observed. ATP application induced the activation of one or more of these channels within a few minutes. These results indicate that multiple purinergic receptor subtypes are present in the apical membranes of A6 cells and that nucleotides can act as modulators of Cl− secretion in renal cells. PMID:10457087
USDA-ARS?s Scientific Manuscript database
One focus of the Sorghum Translational Genomics Lab (part of sorghum CRIS, PSGD, CSRL, USDA-ARS, Lubbock TX) is to utilize nucleotide variation between sorghum germplasm such as those derived from RNA seq for translation and validation of Single Nucleotide Polymorphism (SNP) into easy access DNA m...
Angart, Phillip A.; Carlson, Rebecca J.; Adu-Berchie, Kwasi
2016-01-01
Efficient short interfering RNA (siRNA)-mediated gene silencing requires selection of a sequence that is complementary to the intended target and possesses sequence and structural features that encourage favorable functional interactions with the RNA interference (RNAi) pathway proteins. In this study, we investigated how terminal sequence and structural characteristics of siRNAs contribute to siRNA strand loading and silencing activity and how these characteristics ultimately result in a functionally asymmetric duplex in cultured HeLa cells. Our results reiterate that the most important characteristic in determining siRNA activity is the 5′ terminal nucleotide identity. Our findings further suggest that siRNA loading is controlled principally by the hybridization stability of the 5′ terminus (Nucleotides: 1–2) of each siRNA strand, independent of the opposing terminus. Postloading, RNA-induced silencing complex (RISC)–specific activity was found to be improved by lower hybridization stability in the 5′ terminus (Nucleotides: 3–4) of the loaded siRNA strand and greater hybridization stability toward the 3′ terminus (Nucleotides: 17–18). Concomitantly, specific recognition of the 5′ terminal nucleotide sequence by human Argonaute 2 (Ago2) improves RISC half-life. These findings indicate that careful selection of siRNA sequences can maximize both the loading and the specific activity of the intended guide strand. PMID:27399870
A Lateral Flow Biosensor for the Detection of Single Nucleotide Polymorphisms.
Zeng, Lingwen; Xiao, Zhuo
2017-01-01
A lateral flow biosensor (LFB) is introduced for the detection of single nucleotide polymorphisms (SNPs). The assay is composed of two steps: circular strand displacement reaction and lateral flow biosensor detection. In step 1, the nucleotide at SNP site is recognized by T4 DNA ligase and the signal is amplified by strand displacement DNA polymerase, which can be accomplished at a constant temperature. In step 2, the reaction product of step 1 is detected by a lateral flow biosensor, which is a rapid and cost effective tool for nuclei acid detection. Comparing with conventional methods, it requires no complicated machines. It is suitable for the use of point of care diagnostics. Therefore, this simple, cost effective, robust, and promising LFB detection method of SNP has great potential for the detection of genetic diseases, personalized medicine, cancer related mutations, and drug-resistant mutations of infectious agents.
Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel
Eriksson, Anders; Manica, Andrea
2011-01-01
Although ascertainment bias in single nucleotide polymorphisms is a well-known problem, it is generally accepted that microsatellites have mutation rates too high for bias to be a concern. Here, we analyze in detail the large set of microsatellites typed for the Human Genetic Diversity Panel (HGDP)-CEPH panel. We develop a novel framework based on rarefaction to compare heterozygosity across markers with different mutation rates. We find that, whereas di- and tri-nucleotides show similar patterns of within- and between-population heterozygosity, tetra-nucleotides are inconsistent with the other two motifs. In addition, di- and tri-nucleotides are consistent with 16 unbiased tetra-nucleotide markers, whereas the HPGP-CEPH tetra-nucleotides are significantly different. This discrepancy is due to the HGDP-CEPH tetra-nucleotides being too homogeneous across Eurasia, even after their slower mutation rate is taken into account by rarefying the other markers. The most likely explanation for this pattern is ascertainment bias. We strongly advocate the exclusion of tetra-nucleotides from future population genetics analysis of this dataset, and we argue that other microsatellite datasets should be investigated for the presence of bias using the approach outlined in this article. PMID:22384358
NASA Astrophysics Data System (ADS)
Tsyganov, M. M.; Ibragimova, M. K.; Karabut, I. V.; Freydin, M. B.; Choinzonov, E. L.; Litvyakov, N. V.
2015-11-01
Our previous research establishes that changes of expression of the ATP-binding cassette genes family is connected with the neoadjuvant chemotherapy effect. However, the mechanism of regulation of resistance gene expression remains unclear. As many researchers believe, single nucleotide polymorphisms can be involved in this process. Thereupon, microarray analysis is used to study polymorphisms in ATP-binding cassette genes. It is thus found that MDR gene expression is connected with 5 polymorphisms, i.e. rs241432, rs241429, rs241430, rs3784867, rs59409230, which participate in the regulation of expression of own genes.
Paulish-Miller, Teresa E.; Augostini, Peter; Schuyler, Jessica A.; Smith, William L.; Mordechai, Eli; Adelson, Martin E.; Gygax, Scott E.; Secor, William E.
2014-01-01
Metronidazole resistance in the sexually transmitted parasite Trichomonas vaginalis is a problematic public health issue. We have identified single nucleotide polymorphisms (SNPs) in two nitroreductase genes (ntr4Tv and ntr6Tv) associated with resistance. These SNPs were associated with one of two distinct T. vaginalis populations identified by multilocus sequence typing, yet one SNP (ntr6Tv A238T), which results in a premature stop codon, was associated with resistance independent of population structure and may be of diagnostic value. PMID:24550324
Kondo, Jiro; Westhof, Eric
2011-10-01
Nucleotide bases are recognized by amino acid residues in a variety of DNA/RNA binding and nucleotide binding proteins. In this study, a total of 446 crystal structures of nucleotide-protein complexes are analyzed manually and pseudo pairs together with single and bifurcated hydrogen bonds observed between bases and amino acids are classified and annotated. Only 5 of the 20 usual amino acid residues, Asn, Gln, Asp, Glu and Arg, are able to orient in a coplanar fashion in order to form pseudo pairs with nucleotide bases through two hydrogen bonds. The peptide backbone can also form pseudo pairs with nucleotide bases and presents a strong bias for binding to the adenine base. The Watson-Crick side of the nucleotide bases is the major interaction edge participating in such pseudo pairs. Pseudo pairs between the Watson-Crick edge of guanine and Asp are frequently observed. The Hoogsteen edge of the purine bases is a good discriminatory element in recognition of nucleotide bases by protein side chains through the pseudo pairing: the Hoogsteen edge of adenine is recognized by various amino acids while the Hoogsteen edge of guanine is only recognized by Arg. The sugar edge is rarely recognized by either the side-chain or peptide backbone of amino acid residues.
CNG and HCN channels: two peas, one pod.
Craven, Kimberley B; Zagotta, William N
2006-01-01
Cyclic nucleotide-activated ion channels play a fundamental role in a variety of physiological processes. By opening in response to intracellular cyclic nucleotides, they translate changes in concentrations of signaling molecules to changes in membrane potential. These channels belong to two families: the cyclic nucleotide-gated (CNG) channels and the hyperpolarization-activated cyclic nucleotide-modulated (HCN) channels. The two families exhibit high sequence similarity and belong to the superfamily of voltage-gated potassium channels. Whereas HCN channels are activated by voltage and CNG channels are virtually voltage independent, both channels are activated by cyclic nucleotide binding. Furthermore, the channels are thought to have similar channel structures, leading to similar mechanisms of activation by cyclic nucleotides. However, although these channels are structurally and behaviorally similar, they have evolved to perform distinct physiological functions. This review describes the physiological roles and biophysical behavior of CNG and HCN channels. We focus on how similarities in structure and activation mechanisms result in common biophysical models, allowing CNG and HCN channels to be viewed as a single genre.
Eastwood, Heather; Xia, Fang; Lo, Mei-Chu; Zhou, Jing; Jordan, John B; McCarter, John; Barnhart, Wesley W; Gahm, Kyung-Hyun
2015-11-10
Analysis of nucleotide sugars, nucleoside di- and triphosphates and sugar-phosphates is an essential step in the process of understanding enzymatic pathways. A facile and rapid separation method was developed to analyze these compounds present in an enzymatic reaction mixture utilized to produce nucleotide sugars. The Primesep SB column explored in this study utilizes hydrophobic interactions as well as electrostatic interactions with the phosphoric portion of the nucleotide sugars. Ammonium formate buffer was selected due to its compatibility with mass spectrometry. Negative ion mode mass spectrometry was adopted for detection of the sugar phosphate (fucose-1-phophate), as the compound is not amenable to UV detection. Various mobile phase conditions such as pH, buffer concentration and organic modifier were explored. The semi-preparative separation method was developed to prepare 30mg of the nucleotide sugar. (19)F NMR was utilized to determine purity of the purified fluorinated nucleotide sugar. The collected nucleotide sugar was found to be 99% pure. Published by Elsevier B.V.
MacManes, Matthew D; Eisen, Michael B
2014-01-01
As a direct result of intense heat and aridity, deserts are thought to be among the most harsh of environments, particularly for their mammalian inhabitants. Given that osmoregulation can be challenging for these animals, with failure resulting in death, strong selection should be observed on genes related to the maintenance of water and solute balance. One such animal, Peromyscus eremicus, is native to the desert regions of the southwest United States and may live its entire life without oral fluid intake. As a first step toward understanding the genetics that underlie this phenotype, we present a characterization of the P. eremicus transcriptome. We assay four tissues (kidney, liver, brain, testes) from a single individual and supplement this with population level renal transcriptome sequencing from 15 additional animals. We identified a set of transcripts undergoing both purifying and balancing selection based on estimates of Tajima's D. In addition, we used the branch-site test to identify a transcript-Slc2a9, likely related to desert osmoregulation-undergoing enhanced selection in P. eremicus relative to a set of related non-desert rodents.
Genomic Signatures of Speciation in Sympatric and Allopatric Hawaiian Picture-Winged Drosophila
Kang, Lin; Settlage, Robert; McMahon, Wyatt; Michalak, Katarzyna; Tae, Hongseok; Garner, Harold R.; Stacy, Elizabeth A.; Price, Donald K.; Michalak, Pawel
2016-01-01
The Hawaiian archipelago provides a natural arena for understanding adaptive radiation and speciation. The Hawaiian Drosophila are one of the most diverse endemic groups in Hawaiì with up to 1,000 species. We sequenced and analyzed entire genomes of recently diverged species of Hawaiian picture-winged Drosophila, Drosophila silvestris and Drosophila heteroneura from Hawaiì Island, in comparison with Drosophila planitibia, their sister species from Maui, a neighboring island where a common ancestor of all three had likely occurred. Genome-wide single nucleotide polymorphism patterns suggest the more recent origin of D. silvestris and D. heteroneura, as well as a pervasive influence of positive selection on divergence of the three species, with the signatures of positive selection more prominent in sympatry than allopatry. Positively selected genes were significantly enriched for functional terms related to sensory detection and mating, suggesting that sexual selection played an important role in speciation of these species. In particular, sequence variation in Olfactory receptor and Gustatory receptor genes seems to play a major role in adaptive radiation in Hawaiian pictured-winged Drosophila. PMID:27189993
Bioinformatic analyses to select phenotype affecting polymorphisms in HTR2C gene.
Piva, Francesco; Giulietti, Matteo; Baldelli, Luisa; Nardi, Bernardo; Bellantuono, Cesario; Armeni, Tatiana; Saccucci, Franca; Principato, Giovanni
2011-08-01
Single nucleotide polymorphisms (SNPs) in serotonin related genes influence mental disorders, responses to pharmacological and psychotherapeutic treatments. In planning association studies, researchers that want to investigate new SNPs have to select some among a large number of candidates. Our aim is to guide researchers in the selection of the most likely phenotype affecting polymorphisms. Here, we studied serotonin receptor 2C (HTR2C) SNPs because, till now, only relatively few of about 2000 are investigated. We used the most updated and assessed bioinformatic tools to predict which variations can give rise to biological effects among 2450 HTR2C SNPs. We suggest 48 SNPs that are worth considering in future association studies in the field of psychiatry, psychology and pharmacogenomics. Moreover, our analyses point out the biological level probably affected, such as transcription, splicing, miRNA regulation and protein structure, thus allowing to suggest future molecular investigations. Although few association studies are available in literature, their results are in agreement with our predictions, showing that our selection methods can help to guide future association studies. Copyright © 2011 John Wiley & Sons, Ltd.
Aquaporin-4 polymorphisms and brain/body weight ratio in sudden infant death syndrome (SIDS).
Studer, Jacqueline; Bartsch, Christine; Haas, Cordula
2014-07-01
Failure in the regulation of homeostatic water balance in the brain is associated with severe cerebral edema and increased brain weights and may also play an important role in the pathogenesis of sudden infant death syndrome (SIDS). We genotyped three single-nucleotide polymorphisms in the aquaporin-4 water channel-encoding gene (AQP4), which were previously shown to be associated with (i) SIDS in Norwegian infants (rs2075575), (ii) severe brain edema (rs9951307), and (iii) increased brain water permeability (rs3906956). We also determined whether the brain/body weight ratio is increased in SIDS infants compared with sex- and age-matched controls. Genotyping of the three AQP4 single-nucleotide polymorphisms was performed in 160 Caucasian SIDS infants and 181 healthy Swiss adults using a single-base extension method. Brain and body weights were measured during autopsy in 157 SIDS and 59 non-SIDS infants. No differences were detected in the allelic frequencies of the three AQP4 single-nucleotide polymorphisms between SIDS and adult controls. The brain/body weight ratio was similarly distributed in SIDS and non-SIDS infants. Variations in the AQP4 gene seem of limited significance as predisposing factors in Caucasian SIDS infants. Increased brain weights may only become evident in conjunction with environmental or other genetic risk factors.
Single nucleotide editing without DNA cleavage using CRISPR/Cas9-deaminase in the sea urchin embryo.
Shevidi, Saba; Uchida, Alicia; Schudrowitz, Natalie; Wessel, Gary M; Yajima, Mamiko
2017-12-01
A single base pair mutation in the genome can result in many congenital disorders in humans. The recent gene editing approach using CRISPR/Cas9 has rapidly become a powerful tool to replicate or repair such mutations in the genome. These approaches rely on cleaving DNA, while presenting unexpected risks. In this study, we demonstrate a modified CRISPR/Cas9 system fused to cytosine deaminase (Cas9-DA), which induces a single nucleotide conversion in the genome. Cas9-DA was introduced into sea urchin eggs with sgRNAs targeted for SpAlx1, SpDsh, or SpPks, each of which is critical for skeletogenesis, embryonic axis formation, or pigment formation, respectively. We found that both Cas9 and Cas9-DA edit the genome, and cause predicted phenotypic changes at a similar efficiency. Cas9, however, resulted in significant deletions in the genome centered on the gRNA target sequence, whereas Cas9-DA resulted in single or double nucleotide editing of C to T conversions within the gRNA target sequence. These results suggest that the Cas9-DA approach may be useful for manipulating gene activity with decreased risks of genomic aberrations. Developmental Dynamics 246:1036-1046, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A
2012-05-01
The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.
Does the choice of nucleotide substitution models matter topologically?
Hoff, Michael; Orf, Stefan; Riehm, Benedikt; Darriba, Diego; Stamatakis, Alexandros
2016-03-24
In the context of a master level programming practical at the computer science department of the Karlsruhe Institute of Technology, we developed and make available an open-source code for testing all 203 possible nucleotide substitution models in the Maximum Likelihood (ML) setting under the common Akaike, corrected Akaike, and Bayesian information criteria. We address the question if model selection matters topologically, that is, if conducting ML inferences under the optimal, instead of a standard General Time Reversible model, yields different tree topologies. We also assess, to which degree models selected and trees inferred under the three standard criteria (AIC, AICc, BIC) differ. Finally, we assess if the definition of the sample size (#sites versus #sites × #taxa) yields different models and, as a consequence, different tree topologies. We find that, all three factors (by order of impact: nucleotide model selection, information criterion used, sample size definition) can yield topologically substantially different final tree topologies (topological difference exceeding 10 %) for approximately 5 % of the tree inferences conducted on the 39 empirical datasets used in our study. We find that, using the best-fit nucleotide substitution model may change the final ML tree topology compared to an inference under a default GTR model. The effect is less pronounced when comparing distinct information criteria. Nonetheless, in some cases we did obtain substantial topological differences.
Silva-Junior, Orzenil B; Grattapaglia, Dario
2015-11-01
We used high-density single nucleotide polymorphism (SNP) data and whole-genome pooled resequencing to examine the landscape of population recombination (ρ) and nucleotide diversity (ϴw ), assess the extent of linkage disequilibrium (r(2) ) and build the highest density linkage maps for Eucalyptus. At the genome-wide level, linkage disequilibrium (LD) decayed within c. 4-6 kb, slower than previously reported from candidate gene studies, but showing considerable variation from absence to complete LD up to 50 kb. A sharp decrease in the estimate of ρ was seen when going from short to genome-wide inter-SNP distances, highlighting the dependence of this parameter on the scale of observation adopted. Recombination was correlated with nucleotide diversity, gene density and distance from the centromere, with hotspots of recombination enriched for genes involved in chemical reactions and pathways of the normal metabolic processes. The high nucleotide diversity (ϴw = 0.022) of E. grandis revealed that mutation is more important than recombination in shaping its genomic diversity (ρ/ϴw = 0.645). Chromosome-wide ancestral recombination graphs allowed us to date the split of E. grandis (1.7-4.8 million yr ago) and identify a scenario for the recent demographic history of the species. Our results have considerable practical importance to Genome Wide Association Studies (GWAS), while indicating bright prospects for genomic prediction of complex phenotypes in eucalypt breeding. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Genetic stability of Ross River virus during epidemic spread in nonimmune humans.
Burness, A T; Pardoe, I; Faragher, S G; Vrati, S; Dalgarno, L
1988-12-01
We have examined the rate of evolution of Ross River virus, a mosquito-borne RNA virus, during epidemic spread through tens of thousands of nonimmune humans over a period of 10 months. Two regions of the Ross River virus genome were sequenced: the E2 gene (1.2 kb in length), which encodes the major neutralization determinant of the virus, and 0.4 kb of the 3'-untranslated region. In the E2 gene, a single nucleotide change was selected which led to a predicted amino acid change at residue 219. No changes were selected in the 3'-untranslated region. By comparison with rates of evolution reported for non-arthropod-borne RNA viruses, the rate for Ross River virus is surprisingly low. We identify three features of the Ross River virus replication and transmission cycle which may limit the rate of evolution of arthropod-borne viruses in the field.
Tag SNP selection via a genetic algorithm.
Mahdevar, Ghasem; Zahiri, Javad; Sadeghi, Mehdi; Nowzari-Dalini, Abbas; Ahrabian, Hayedeh
2010-10-01
Single Nucleotide Polymorphisms (SNPs) provide valuable information on human evolutionary history and may lead us to identify genetic variants responsible for human complex diseases. Unfortunately, molecular haplotyping methods are costly, laborious, and time consuming; therefore, algorithms for constructing full haplotype patterns from small available data through computational methods, Tag SNP selection problem, are convenient and attractive. This problem is proved to be an NP-hard problem, so heuristic methods may be useful. In this paper we present a heuristic method based on genetic algorithm to find reasonable solution within acceptable time. The algorithm was tested on a variety of simulated and experimental data. In comparison with the exact algorithm, based on brute force approach, results show that our method can obtain optimal solutions in almost all cases and runs much faster than exact algorithm when the number of SNP sites is large. Our software is available upon request to the corresponding author.
Genomics DNA Profiling in Elite Professional Soccer Players: A Pilot Study
Kambouris, M; Del Buono, A; Maffulli, N
2014-01-01
Functional variants in exonic regions have been associated with development of cardiovascular disease, diabetes and cancer. Athletic performance can be considered a multi-factorial complex phenotype. Genomic DNA was extracted from buccal swabs of seven soccer players from the Fulham football team. Single nucleotide polymorphism (SNPs) genotyping was undertaken. To achieve optimal athletic performance, predictive genomics DNA profiling for sports performance can be used to aid in sport selection and elaboration of personalized training and nutrition programs. Predictive DNA profiling may be able to detect athletes with potential or frank injuries, or screening and selection of future athletes, and can help them to maximize utilization of their potential and improve performance in sports. The aim of this study is to provide a wide scenario of specific genomic variants that an athlete carries, to implement which measures should be taken to maximize the athlete’s potential. PMID:24809029
Modular probes for enriching and detecting complex nucleic acid sequences
NASA Astrophysics Data System (ADS)
Wang, Juexiao Sherry; Yan, Yan Helen; Zhang, David Yu
2017-12-01
Complex DNA sequences are difficult to detect and profile, but are important contributors to human health and disease. Existing hybridization probes lack the capability to selectively bind and enrich hypervariable, long or repetitive sequences. Here, we present a generalized strategy for constructing modular hybridization probes (M-Probes) that overcomes these challenges. We demonstrate that M-Probes can tolerate sequence variations of up to 7 nt at prescribed positions while maintaining single nucleotide sensitivity at other positions. M-Probes are also shown to be capable of sequence-selectively binding a continuous DNA sequence of more than 500 nt. Furthermore, we show that M-Probes can detect genes with triplet repeats exceeding a programmed threshold. As a demonstration of this technology, we have developed a hybrid capture method to determine the exact triplet repeat expansion number in the Huntington's gene of genomic DNA using quantitative PCR.
The nucleotide sequence and genome organization of Plasmopara halstedii virus.
Heller-Dohmen, Marion; Göpfert, Jens C; Pfannstiel, Jens; Spring, Otmar
2011-03-17
Only very few viruses of Oomycetes have been studied in detail. Isometric virions were found in different isolates of the oomycete Plasmopara halstedii, the downy mildew pathogen of sunflower. However, complete nucleotide sequences and data on the genome organization were lacking. Viral RNA of different P. halstedii isolates was subjected to nucleotide sequencing and analysis of the viral genome. The N-terminal sequence of the viral coat protein was determined using Top-Down MALDI-TOF analysis. The complete nucleotide sequences of both single-stranded RNA segments (RNA1 and RNA2) were established. RNA1 consisted of 2793 nucleotides (nt) exclusive its 3' poly(A) tract and a single open-reading frame (ORF1) of 2745 nt. ORF1 was framed by a 5' untranslated region (5' UTR) of 18 nt and a 3' untranslated region (3' UTR) of 30 nt. ORF1 contained motifs of RNA-dependent RNA polymerases (RdRp) and showed similarities to RdRp of Scleropthora macrospora virus A (SmV A) and viruses within the Nodaviridae family. RNA2 consisted of 1526 nt exclusive its 3' poly(A) tract and a second ORF (ORF2) of 1128 nt. ORF2 coded for the single viral coat protein (CP) and was framed by a 5' UTR of 164 nt and a 3' UTR of 234 nt. The deduced amino acid sequence of ORF2 was verified by nano-LC-ESI-MS/MS experiments. Top-Down MALDI-TOF analysis revealed the N-terminal sequence of the CP. The N-terminal sequence represented a region within ORF2 suggesting a proteolytic processing of the CP in vivo. The CP showed similarities to CP of SmV A and viruses within the Tombusviridae family. Fragments of RNA1 (ca. 1.9 kb) and RNA2 (ca. 1.4 kb) were used to analyze the nucleotide sequence variation of virions in different P. halstedii isolates. Viral sequence variation was 0.3% or less regardless of their host's pathotypes, the geographical origin and the sensitivity towards the fungicide metalaxyl. The results showed the presence of a single and new virus type in different P. halstedii isolates. Insignificant viral sequence variation indicated that the virus did not account for differences in pathogenicity of the oomycete P. halstedii.
Mendes-Junior, C T; Castelli, E C; Meyer, D; Simões, A L; Donadi, E A
2013-12-01
HLA-G has an important role in the modulation of the maternal immune system during pregnancy, and evidence that balancing selection acts in the promoter and 3'UTR regions has been previously reported. To determine whether selection acts on the HLA-G coding region in the Amazon Rainforest, exons 2, 3 and 4 were analyzed in a sample of 142 Amerindians from nine villages of five isolated tribes that inhabit the Central Amazon. Six previously described single-nucleotide polymorphisms (SNPs) were identified and the Expectation-Maximization (EM) and PHASE algorithms were used to computationally reconstruct SNP haplotypes (HLA-G alleles). A new HLA-G allele, which originated in Amerindian populations by a crossing-over event between two widespread HLA-G alleles, was identified in 18 individuals. Neutrality tests evidenced that natural selection has a complex part in the HLA-G coding region. Although balancing selection is the type of selection that shapes variability at a local level (Native American populations), we have also shown that purifying selection may occur on a worldwide scale. Moreover, the balancing selection does not seem to act on the coding region as strongly as it acts on the flanking regulatory regions, and such coding signature may actually reflect a hitchhiking effect.
Uncovering drug-responsive regulatory elements
Luizon, Marcelo R; Ahituv, Nadav
2015-01-01
Nucleotide changes in gene regulatory elements can have a major effect on interindividual differences in drug response. For example, by reviewing all published pharmacogenomic genome-wide association studies, we show here that 96.4% of the associated single nucleotide polymorphisms reside in noncoding regions. We discuss how sequencing technologies are improving our ability to identify drug response-associated regulatory elements genome-wide and to annotate nucleotide variants within them. We highlight specific examples of how nucleotide changes in these elements can affect drug response and illustrate the techniques used to find them and functionally characterize them. Finally, we also discuss challenges in the field of drug-responsive regulatory elements that need to be considered in order to translate these findings into the clinic. PMID:26555224
Ye, Jun-jie; Ma, Li; Yang, Li-juan; Wang, Jin-huan; Wang, Yue-li; Guo, Hai; Gong, Ning; Nie, Wen-hui; Zhao, Shu-hua
2013-09-01
There are many reports on associations between spermatogenesis and partial azoospermia factor c (AZFc) deletions as well as duplications; however, results are conflicting, possibly due to differences in methodology and ethnic background. The purpose of this study is to investigate the association of AZFc polymorphisms and male infertility in the Yi ethnic population, residents within Yunnan Province, China. A total of 224 infertile patients and 153 fertile subjects were selected in the Yi ethnic population. The study was performed by sequence-tagged site plus/minus (STS+/-) analysis followed by gene dosage and gene copy definition analysis. Y haplotypes of 215 cases and 115 controls were defined by 12 binary markers using single nucleotide polymorphism on Y chromosome (Y-SNP) multiplex assays based on single base primer extension technology. The distribution of Y haplotypes was not significantly different between the case and control groups. The frequencies of both gr/gr (7.6% vs. 8.5%) and b2/b3 (6.3% vs. 8.5%) deletions do not show significant differences. Similarly, single nucleotide variant (SNV) analysis shows no significant difference of gene copy definition between the cases and controls. However, the frequency of partial duplications in the infertile group (4.0%) is significantly higher than that in the control group (0.7%). Further, we found a case with sY1206 deletion which had two CDY1 copies but removed half of DAZ genes. Our results show that male infertility is associated with partial AZFc duplications, but neither gr/gr nor b2/b3 deletions, suggesting that partial AZFc duplications rather than deletions are risk factors for male infertility in Chinese-Yi population.
Parker, Glendon J.; Leppert, Tami; Anex, Deon S.; ...
2016-09-07
Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 singlemore » nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.« less
Association of α-, β-, and γ-Synuclein With Diffuse Lewy Body Disease
Nishioka, Kenya; Wider, Christian; Vilariño-Güell, Carles; Soto-Ortolaza, Alexandra I.; Lincoln, Sarah J.; Kachergus, Jennifer M.; Jasinska-Myga, Barbara; Ross, Owen A.; Rajput, Alex; Robinson, Christopher A.; Ferman, Tanis J.; Wszolek, Zbigniew K.; Dickson, Dennis W.; Farrer, Matthew J.
2016-01-01
Objective To determine the association of the genes that encode α-, β-, and γ-synuclein (SNCA, SNCB, and SNCG, respectively) with diffuse Lewy body disease (DLBD). Design Case-control study. Subjects A total of 172 patients with DLBD consistent with a clinical diagnosis of Parkinson disease dementia/dementia with Lewy bodies and 350 clinically and 97 pathologically normal controls. Interventions Sequencing of SNCA, SNCB, and SNCG and genotyping of single-nucleotide polymorphisms performed on an Applied Biosystems capillary sequencer and a Sequenom MassArray pLEX platform, respectively. Associations were determined using χ2 or Fisher exact tests. Results Initial sequencing studies of the coding regions of each gene in 89 patients with DLBD did not detect any pathogenic substitutions. Nevertheless, genotyping of known polymorphic variability in sequence-conserved regions detected several single-nucleotide polymorphisms in the SNCA and SNCG genes that were significantly associated with disease (P=.05 to <.001). Significant association was also observed for 3 single-nucleotide polymorphisms located in SNCB when comparing DLBD cases and pathologically confirmed normal controls (P=.03-.01); however, this association was not significant for the clinical controls alone or the combined clinical and pathological controls (P>.05). After correction for multiple testing, only 1 single-nucleotide polymorphism in SNCG (rs3750823) remained significant in all of the analyses (P=.05-.009). Conclusion These findings suggest that variants in all 3 members of the synuclein gene family, particularly SNCA and SNCG, affect the risk of developing DLBD and warrant further investigation in larger, pathologically defined data sets as well as clinically diagnosed Parkinson disease/dementia with Lewy bodies case-control series. PMID:20697047
Refactoring the Genetic Code for Increased Evolvability
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pines, Gur; Winkler, James D.; Pines, Assaf
ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Parker, Glendon J.; Leppert, Tami; Anex, Deon S.
Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 singlemore » nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects’ DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European–American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). Furthermore, this study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.« less
Refactoring the Genetic Code for Increased Evolvability
Pines, Gur; Winkler, James D.; Pines, Assaf; ...
2017-11-14
ABSTRACT The standard genetic code is robust to mutations during transcription and translation. Point mutations are likely to be synonymous or to preserve the chemical properties of the original amino acid. Saturation mutagenesis experiments suggest that in some cases the best-performing mutant requires replacement of more than a single nucleotide within a codon. These replacements are essentially inaccessible to common error-based laboratory engineering techniques that alter a single nucleotide per mutation event, due to the extreme rarity of adjacent mutations. In this theoretical study, we suggest a radical reordering of the genetic code that maximizes the mutagenic potential of singlemore » nucleotide replacements. We explore several possible genetic codes that allow a greater degree of accessibility to the mutational landscape and may result in a hyperevolvable organism that could serve as an ideal platform for directed evolution experiments. We then conclude by evaluating the challenges of constructing such recoded organisms and their potential applications within the field of synthetic biology. IMPORTANCE The conservative nature of the genetic code prevents bioengineers from efficiently accessing the full mutational landscape of a gene via common error-prone methods. Here, we present two computational approaches to generate alternative genetic codes with increased accessibility. These new codes allow mutational transitions to a larger pool of amino acids and with a greater extent of chemical differences, based on a single nucleotide replacement within the codon, thus increasing evolvability both at the single-gene and at the genome levels. Given the widespread use of these techniques for strain and protein improvement, along with more fundamental evolutionary biology questions, the use of recoded organisms that maximize evolvability should significantly improve the efficiency of directed evolution, library generation, and fitness maximization.« less
Nitiyon, Sukanya; Khunnamwong, Pannida; Lertwattanasakul, Noppon; Limtong, Savitree
2018-05-24
Three strains (DMKU-XE11 T , DMKU-XE15 and DMKU-XE20) representing a single novel anamorphic and d-xylose-fermenting yeast species were obtained from three peat samples collected from Khan Thulee peat swamp forest in Surat Thani province, Thailand. The strains differed from each other by one to two nucleotide substitutions in the sequences of the D1/D2 region of the large subunit (LSU) rRNA gene and zero to one nucleotide substitution in the internal transcribed spacer (ITS) region. Phylogenetic analysis based on the combined sequences of the ITS and the D1/D2 regions showed that the three strains represented a single Candida species that was distinct from the other related species in the Lodderomyces/Candida albicans clade. The three strains form a subclade with the other Candida species including Candida sanyaensis, Candida tropicalis and Candida sojae. C. sanyaensis was the most closely related species, with 2.1-2.4 % nucleotide substitutions in the D1/D2 region of the LSU rRNA gene, and 3.8-4.0 % nucleotide substitutions in the ITS region. The three strains (DMKU-XE11 T , DMKU-XE15 and DMKU-XE20) were assigned as a single novel species, which was named Candida kantuleensis sp. nov. The type strain is DMKU-XE11 T (=CBS 15219 T =TBRC 7764 T ). The MycoBank number for C. kantuleensis sp. nov. is MB 824179.
IL-TIF/IL-22: genomic organization and mapping of the human and mouse genes.
Dumoutier, L; Van Roost, E; Ameye, G; Michaux, L; Renauld, J C
2000-12-01
IL-TIF is a new cytokine originally identified as a gene induced by IL-9 in murine T lymphocytes, and showing 22% amino acid identity with IL-10. Here, we report the sequence and organization of the mouse and human IL-TIF genes, which both consist of 6 exons spreading over approximately 6 Kb. The IL-TIF gene is a single copy gene in humans, and is located on chromosome 12q15, at 90 Kb from the IFN gamma gene, and at 27 Kb from the AK155 gene, which codes for another IL-10-related cytokine. In the mouse, the IL-TIF gene is located on chromosome 10, also in the same region as the IFN gamma gene. Although it is a single copy gene in BALB/c and DBA/2 mice, the IL-TIF gene is duplicated in other strains such as C57Bl/6, FVB and 129. The two copies, which show 98% nucleotide identity in the coding region, were named IL-TIF alpha and IL-TIF beta. Beside single nucleotide variations, they differ by a 658 nucleotide deletion in IL-TIF beta, including the first non-coding exon and 603 nucleotides from the promoter. A DNA fragment corresponding to this deletion was sufficient to confer IL-9-regulated expression of a luciferase reporter plasmid, suggesting that the IL-TIF beta gene is either differentially regulated, or not expressed at all.
Yan, Liying; Huang, Lei; Xu, Liya; Huang, Jin; Ma, Fei; Zhu, Xiaohui; Tang, Yaqiong; Liu, Mingshan; Lian, Ying; Liu, Ping; Li, Rong; Lu, Sijia; Tang, Fuchou; Qiao, Jie; Xie, X Sunney
2015-12-29
In vitro fertilization (IVF), preimplantation genetic diagnosis (PGD), and preimplantation genetic screening (PGS) help patients to select embryos free of monogenic diseases and aneuploidy (chromosome abnormality). Next-generation sequencing (NGS) methods, while experiencing a rapid cost reduction, have improved the precision of PGD/PGS. However, the precision of PGD has been limited by the false-positive and false-negative single-nucleotide variations (SNVs), which are not acceptable in IVF and can be circumvented by linkage analyses, such as short tandem repeats or karyomapping. It is noteworthy that existing methods of detecting SNV/copy number variation (CNV) and linkage analysis often require separate procedures for the same embryo. Here we report an NGS-based PGD/PGS procedure that can simultaneously detect a single-gene disorder and aneuploidy and is capable of linkage analysis in a cost-effective way. This method, called "mutated allele revealed by sequencing with aneuploidy and linkage analyses" (MARSALA), involves multiple annealing and looping-based amplification cycles (MALBAC) for single-cell whole-genome amplification. Aneuploidy is determined by CNVs, whereas SNVs associated with the monogenic diseases are detected by PCR amplification of the MALBAC product. The false-positive and -negative SNVs are avoided by an NGS-based linkage analysis. Two healthy babies, free of the monogenic diseases of their parents, were born after such embryo selection. The monogenic diseases originated from a single base mutation on the autosome and the X-chromosome of the disease-carrying father and mother, respectively.
Wang, Longxin; Wang, Bowen; Du, Qingzhang; Chen, Jinhui; Tian, Jiaxing; Yang, Xiaohui; Zhang, Deqiang
2017-02-01
Photosynthesis is one of the most important reactions on earth. PsbW, a nuclear-encoded subunit of photosystem II (PSII), stabilizes PSII structure and plays an important role in photosynthesis. Here, we used candidate gene-based linkage disequilibrium (LD) mapping to detect significant associations between allelic variations of PtoPsbW and traits related to photosynthesis, growth, and wood properties in Populus tomentosa. PtoPsbW showed the highest expression in leaves and it increased during the development of these leaves, suggesting that PtoPsbW may play an important role in plant growth and development. Analysis of nucleotide diversity and LD revealed that PtoPsbW has low single-nucleotide polymorphism (SNP) diversity (π tot = 0.0048 and θ w = 0.0050) and relatively low average value of LD (0.1500), indicating that PtoPsbW is conserved due to its indispensable function. Using single-SNP associations in an association population of 435 individuals, we identified five significant associations at the threshold of P ≤ 0.05, explaining 3.28-15.98 % of the phenotypic variation. Haplotype-based association analyses indicated that 13 haplotypes (P ≤ 0.05) from six blocks were associated with photosynthesis, growth, and wood properties. Our work shows that identifying allelic variation and LD can help to decipher the genetic basis of photosynthesis and could potentially be applied for molecular marker-assisted selection in Populus.
Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design
Goonetilleke, Shashi N.; March, Timothy J.; Wirthensohn, Michelle G.; Arús, Pere; Walker, Amanda R.; Mather, Diane E.
2017-01-01
In crop plant genetics, linkage maps provide the basis for the mapping of loci that affect important traits and for the selection of markers to be applied in crop improvement. In outcrossing species such as almond (Prunus dulcis Mill. D. A. Webb), application of a double pseudotestcross mapping approach to the F1 progeny of a biparental cross leads to the construction of a linkage map for each parent. Here, we report on the application of genotyping by sequencing to discover and map single nucleotide polymorphisms in the almond cultivars “Nonpareil” and “Lauranne.” Allele-specific marker assays were developed for 309 tag pairs. Application of these assays to 231 Nonpareil × Lauranne F1 progeny provided robust linkage maps for each parent. Analysis of phenotypic data for shell hardness demonstrated the utility of these maps for quantitative trait locus mapping. Comparison of these maps to the peach genome assembly confirmed high synteny and collinearity between the peach and almond genomes. The marker assays were applied to progeny from several other Nonpareil crosses, providing the basis for a composite linkage map of Nonpareil. Applications of the assays to a panel of almond clones and a panel of rootstocks used for almond production demonstrated the broad applicability of the markers and provide subsets of markers that could be used to discriminate among accessions. The sequence-based linkage maps and single nucleotide polymorphism assays presented here could be useful resources for the genetic analysis and genetic improvement of almond. PMID:29141988
Development of solution-gated graphene transistor model for biosensors
NASA Astrophysics Data System (ADS)
Karimi, Hediyeh; Yusof, Rubiyah; Rahmani, Rasoul; Hosseinpour, Hoda; Ahmadi, Mohammad T.
2014-02-01
The distinctive properties of graphene, characterized by its high carrier mobility and biocompatibility, have stimulated extreme scientific interest as a promising nanomaterial for future nanoelectronic applications. In particular, graphene-based transistors have been developed rapidly and are considered as an option for DNA sensing applications. Recent findings in the field of DNA biosensors have led to a renewed interest in the identification of genetic risk factors associated with complex human diseases for diagnosis of cancers or hereditary diseases. In this paper, an analytical model of graphene-based solution gated field effect transistors (SGFET) is proposed to constitute an important step towards development of DNA biosensors with high sensitivity and selectivity. Inspired by this fact, a novel strategy for a DNA sensor model with capability of single-nucleotide polymorphism detection is proposed and extensively explained. First of all, graphene-based DNA sensor model is optimized using particle swarm optimization algorithm. Based on the sensing mechanism of DNA sensors, detective parameters ( I ds and V gmin) are suggested to facilitate the decision making process. Finally, the behaviour of graphene-based SGFET is predicted in the presence of single-nucleotide polymorphism with an accuracy of more than 98% which guarantees the reliability of the optimized model for any application of the graphene-based DNA sensor. It is expected to achieve the rapid, quick and economical detection of DNA hybridization which could speed up the realization of the next generation of the homecare sensor system.
Development of solution-gated graphene transistor model for biosensors
2014-01-01
The distinctive properties of graphene, characterized by its high carrier mobility and biocompatibility, have stimulated extreme scientific interest as a promising nanomaterial for future nanoelectronic applications. In particular, graphene-based transistors have been developed rapidly and are considered as an option for DNA sensing applications. Recent findings in the field of DNA biosensors have led to a renewed interest in the identification of genetic risk factors associated with complex human diseases for diagnosis of cancers or hereditary diseases. In this paper, an analytical model of graphene-based solution gated field effect transistors (SGFET) is proposed to constitute an important step towards development of DNA biosensors with high sensitivity and selectivity. Inspired by this fact, a novel strategy for a DNA sensor model with capability of single-nucleotide polymorphism detection is proposed and extensively explained. First of all, graphene-based DNA sensor model is optimized using particle swarm optimization algorithm. Based on the sensing mechanism of DNA sensors, detective parameters (Ids and Vgmin) are suggested to facilitate the decision making process. Finally, the behaviour of graphene-based SGFET is predicted in the presence of single-nucleotide polymorphism with an accuracy of more than 98% which guarantees the reliability of the optimized model for any application of the graphene-based DNA sensor. It is expected to achieve the rapid, quick and economical detection of DNA hybridization which could speed up the realization of the next generation of the homecare sensor system. PMID:24517158
Petit, Sarah J; Wise, Emma L; Chambers, John C; Sehmi, Jobanpreet; Chayen, Naomi E; Kooner, Jaspal S; Pease, James E
2011-04-01
The chemokine CXCL16 serves as a scavenger receptor for oxidized low-density lipoprotein and as an adhesion molecule and chemoattractant for cells expressing the receptor CXCR6. A commonly occurring CXCL16 allele has been described containing 2 nonsynonymous single-nucleotide polymorphisms in complete linkage disequilibrium, although the effects on CXCL16 function are unknown. Here, we examined the effect of the single-nucleotide polymorphisms on CXCL16 function and assessed the association of the mutant allele with coronary heart disease (CHD). Both wild-type and mutant T123V181-CXCL16 were readily expressed in vitro and were similarly functional in assays of oxidized low-density lipoprotein scavenging and chemotaxis. However, unlike wild-type CXCL16, T123V181-CXCL16 was unable to promote adhesion of CXCR6(+) cells. Findings were confirmed ex vivo, with monocytes from donors homozygous for the T123V181 allele unable to facilitate adhesion of CXCR6 transfectants. In the London Life Sciences Prospective Population cohort (n = 2797), we found that the T123V181 allele was not associated with protection or susceptibility to CHD (adjusted odds ratio, 1.01; 95% CI, 0.95 to 1.10; P = 0.74). CXCL16-mediated cell adhesion plays at best a modest role in CHD, and the scavenging and chemotactic properties of the chemokine are more likely to be more important in disease pathogenesis.
Bandarian, Fatemeh; Daneshpour, Maryam Sadat; Hedayati, Mehdi; Naseri, Mohsen; Azizi, Fereidoun
2016-01-01
Apolipoprotein A2 (APOA2) is the second major apolipoprotein of the high-density lipoprotein cholesterol (HDL-C). The study aim was to identify APOA2 gene variation in individuals within two extreme tails of HDL-C levels and its relationship with HDL-C level. This cross-sectional survey was conducted on participants from Tehran Glucose and Lipid Study (TLGS) at Research Institute for Endocrine Sciences, Tehran, Iran from April 2012 to February 2013. In total, 79 individuals with extreme low HDL-C levels (≤5th percentile for age and gender) and 63 individuals with extreme high HDL-C levels (≥95th percentile for age and gender) were selected. Variants were identified using DNA amplification and direct sequencing. Screen of all exons and the core promoter region of APOA2 gene identified nine single nucleotide substitutions and one microsatellite; five of which were known and four were new variants. Of these nine variants, two were common tag single nucleotide polymorphisms (SNPs) and seven were rare SNPs. Both exonic substitutions were missense mutations and caused an amino acid change. There was a significant association between the new missense mutation (variant Chr.1:16119226, Ala98Pro) and HDL-C level. None of two common tag SNPs of rs6413453 and rs5082 contributes to the HDL-C trait in Iranian population, but a new missense mutation in APOA2 in our population has a significant association with HDL-C.
Schmidt, Liesbeth M; Mouton, Laurence; Nong, Guang; Ebert, Dieter; Preston, James F
2008-01-01
Pasteuria penetrans, an obligate endospore-forming parasite of Meloidogyne spp. (root knot nematodes), has been identified as a promising agent for biocontrol of these destructive agricultural crop pests. Pasteuria ramosa, an obligate parasite of water fleas (Daphnia spp.), has been shown to modulate cladoceran populations in natural ecosystems. Selected sporulation genes and an epitope associated with the spore envelope of these related species were compared. The sigE and spoIIAA/spoIIAB genes differentiate the two species to a greater extent than 16S rRNA and may serve as probes to differentiate the species. Single-nucleotide variations were observed in several conserved genes of five distinct populations of P. ramosa, and while most of these variations are silent single-nucleotide polymorphisms, a few result in conservative amino acid substitutions. A monoclonal antibody directed against an adhesin epitope present on P. penetrans P20 endospores, previously determined to be specific for Pasteuria spp. associated with several phytopathogenic nematodes, also detects an epitope associated with P. ramosa endospores. Immunoblotting provided patterns that differentiate P. ramosa from other Pasteuria spp. This monoclonal antibody thus provides a probe with which to detect and discriminate endospores of different Pasteuria spp. The presence of a shared adhesin epitope in two species with such ecologically distant hosts suggests that there is an ancient and ecologically significant recognition process in these endospore-forming bacilli that contributes to the virulence of both species in their respective hosts.
Schmidt, Liesbeth M.; Mouton, Laurence; Nong, Guang; Ebert, Dieter; Preston, James F.
2008-01-01
Pasteuria penetrans, an obligate endospore-forming parasite of Meloidogyne spp. (root knot nematodes), has been identified as a promising agent for biocontrol of these destructive agricultural crop pests. Pasteuria ramosa, an obligate parasite of water fleas (Daphnia spp.), has been shown to modulate cladoceran populations in natural ecosystems. Selected sporulation genes and an epitope associated with the spore envelope of these related species were compared. The sigE and spoIIAA/spoIIAB genes differentiate the two species to a greater extent than 16S rRNA and may serve as probes to differentiate the species. Single-nucleotide variations were observed in several conserved genes of five distinct populations of P. ramosa, and while most of these variations are silent single-nucleotide polymorphisms, a few result in conservative amino acid substitutions. A monoclonal antibody directed against an adhesin epitope present on P. penetrans P20 endospores, previously determined to be specific for Pasteuria spp. associated with several phytopathogenic nematodes, also detects an epitope associated with P. ramosa endospores. Immunoblotting provided patterns that differentiate P. ramosa from other Pasteuria spp. This monoclonal antibody thus provides a probe with which to detect and discriminate endospores of different Pasteuria spp. The presence of a shared adhesin epitope in two species with such ecologically distant hosts suggests that there is an ancient and ecologically significant recognition process in these endospore-forming bacilli that contributes to the virulence of both species in their respective hosts. PMID:17933927
Yuhui, Zhang; Ping, Huang; Jing, Lin; Jin, Zhao
2017-10-01
This study aims to investigate the association between interleukin (IL)-10-597 (C/A) single-nucleotide polymorphisms and chronic periodontitis of Moyu Uygur population in Xinjiang Uygur Autonomous Region of China. In accordance with the inclusion and exclusion criteria, the buccal swabs of 300 subjects were randomly selected from the epidemiological investigation of Uygur adults in Moyu county on April and May 2013. The study was conducted on a healthy control group, a mild chronic periodontitis group, and a moderate-to-severe chronic periodontitis group, with each comprising 100 samples. The IL-10-597(C/A) site in the promoter sequences was analyzed using the tetra-primer amplification refractory mutation system-polymerase chain reaction method to test the genotype and allele distributions. Statistical analysis was performed using Chi-squared test and ordinal classification Logistic regression analysis. The genotype and allele frequencies of the IL-10-597(C/A) site in the healthy control group, mild chronic periodontitis group, and moderate-to-severe chronic periodontitis group exhibited no significant difference (P>0.05). The age of all the samples was associated with chronic periodontitis. The risk of chronic periodontitis in the people of 55-65 years old was 25 times in the people under the age of 35 (OR=25.56, P<0.001). The IL-10-597 (C/A) single-nucleotide polymorphisms in the gene promoter are not associated with chronic periodontitis in Uygur adult population.
Single nucleotide-level mapping of DNA double-strand breaks in human HEK293T cells.
Pope, Bernard J; Mahmood, Khalid; Jung, Chol-Hee; Georgeson, Peter; Park, Daniel J
2017-03-01
Constitutional biological processes involve the generation of DNA double-strand breaks (DSBs). The production of such breaks and their subsequent resolution are also highly relevant to neurodegenerative diseases and cancer, in which extensive DNA fragmentation has been described Stephens et al. (2011), Blondet et al. (2001). Tchurikov et al. Tchurikov et al. (2011, 2013) have reported previously that frequent sites of DSBs occur in chromosomal domains involved in the co-ordinated expression of genes. This group report that hot spots of DSBs in human HEK293T cells often coincide with H3K4me3 marks, associated with active transcription Kravatsky et al. (2015) and that frequent sites of DNA double-strand breakage are likely to be relevant to cancer genomics Tchurikov et al. (2013, 2016) . Recently, they applied a RAFT (rapid amplification of forum termini) protocol that selects for blunt-ended DSB sites and mapped these to the human genome within defined co-ordinate 'windows'. In this paper, we re-analyse public RAFT data to derive sites of DSBs at the single-nucleotide level across the built genome for human HEK293T cells (https://figshare.com/s/35220b2b79eaaaf64ed8). This refined mapping, combined with accessory ENCODE data tracks and ribosomal DNA-related sequence annotations, will likely be of value for the design of clinically relevant targeted assays such as those for cancer susceptibility, diagnosis, treatment-matching and prognostication.
Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A
2017-04-01
Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
Evaluating the performance of selection scans to detect selective sweeps in domestic dogs
Schlamp, Florencia; van der Made, Julian; Stambler, Rebecca; Chesebrough, Lewis; Boyko, Adam R.; Messer, Philipp W.
2015-01-01
Selective breeding of dogs has resulted in repeated artificial selection on breed-specific morphological phenotypes. A number of quantitative trait loci associated with these phenotypes have been identified in genetic mapping studies. We analyzed the population genomic signatures observed around the causal mutations for 12 of these loci in 25 dog breeds, for which we genotyped 25 individuals in each breed. By measuring the population frequencies of the causal mutations in each breed, we identified those breeds in which specific mutations most likely experienced positive selection. These instances were then used as positive controls for assessing the performance of popular statistics to detect selection from population genomic data. We found that artificial selection during dog domestication has left characteristic signatures in the haplotype and nucleotide polymorphism patterns around selected loci that can be detected in the genotype data from a single population sample. However, the sensitivity and accuracy at which such signatures were detected varied widely between loci, the particular statistic used, and the choice of analysis parameters. We observed examples of both hard and soft selective sweeps and detected strong selective events that removed genetic diversity almost entirely over regions >10 Mbp. Our study demonstrates the power and limitations of selection scans in populations with high levels of linkage disequilibrium due to severe founder effects and recent population bottlenecks. PMID:26589239
Evaluating the performance of selection scans to detect selective sweeps in domestic dogs.
Schlamp, Florencia; van der Made, Julian; Stambler, Rebecca; Chesebrough, Lewis; Boyko, Adam R; Messer, Philipp W
2016-01-01
Selective breeding of dogs has resulted in repeated artificial selection on breed-specific morphological phenotypes. A number of quantitative trait loci associated with these phenotypes have been identified in genetic mapping studies. We analysed the population genomic signatures observed around the causal mutations for 12 of these loci in 25 dog breeds, for which we genotyped 25 individuals in each breed. By measuring the population frequencies of the causal mutations in each breed, we identified those breeds in which specific mutations most likely experienced positive selection. These instances were then used as positive controls for assessing the performance of popular statistics to detect selection from population genomic data. We found that artificial selection during dog domestication has left characteristic signatures in the haplotype and nucleotide polymorphism patterns around selected loci that can be detected in the genotype data from a single population sample. However, the sensitivity and accuracy at which such signatures were detected varied widely between loci, the particular statistic used and the choice of analysis parameters. We observed examples of both hard and soft selective sweeps and detected strong selective events that removed genetic diversity almost entirely over regions >10 Mbp. Our study demonstrates the power and limitations of selection scans in populations with high levels of linkage disequilibrium due to severe founder effects and recent population bottlenecks. © 2015 John Wiley & Sons Ltd.
Culture adaptation of malaria parasites selects for convergent loss-of-function mutants.
Claessens, Antoine; Affara, Muna; Assefa, Samuel A; Kwiatkowski, Dominic P; Conway, David J
2017-01-24
Cultured human pathogens may differ significantly from source populations. To investigate the genetic basis of laboratory adaptation in malaria parasites, clinical Plasmodium falciparum isolates were sampled from patients and cultured in vitro for up to three months. Genome sequence analysis was performed on multiple culture time point samples from six monoclonal isolates, and single nucleotide polymorphism (SNP) variants emerging over time were detected. Out of a total of five positively selected SNPs, four represented nonsense mutations resulting in stop codons, three of these in a single ApiAP2 transcription factor gene, and one in SRPK1. To survey further for nonsense mutants associated with culture, genome sequences of eleven long-term laboratory-adapted parasite strains were examined, revealing four independently acquired nonsense mutations in two other ApiAP2 genes, and five in Epac. No mutants of these genes exist in a large database of parasite sequences from uncultured clinical samples. This implicates putative master regulator genes in which multiple independent stop codon mutations have convergently led to culture adaptation, affecting most laboratory lines of P. falciparum. Understanding the adaptive processes should guide development of experimental models, which could include targeted gene disruption to adapt fastidious malaria parasite species to culture.
Onuki, Ritsuko; Yamaguchi, Rui; Shibuya, Tetsuo; Kanehisa, Minoru; Goto, Susumu
2017-01-01
Genome-wide scans for positive selection have become important for genomic medicine, and many studies aim to find genomic regions affected by positive selection that are associated with risk allele variations among populations. Most such studies are designed to detect recent positive selection. However, we hypothesize that ancient positive selection is also important for adaptation to pathogens, and has affected current immune-mediated common diseases. Based on this hypothesis, we developed a novel linkage disequilibrium-based pipeline, which aims to detect regions associated with ancient positive selection across populations from single nucleotide polymorphism (SNP) data. By applying this pipeline to the genotypes in the International HapMap project database, we show that genes in the detected regions are enriched in pathways related to the immune system and infectious diseases. The detected regions also contain SNPs reported to be associated with cancers and metabolic diseases, obesity-related traits, type 2 diabetes, and allergic sensitization. These SNPs were further mapped to biological pathways to determine the associations between phenotypes and molecular functions. Assessments of candidate regions to identify functions associated with variations in incidence rates of these diseases are needed in the future. PMID:28445522
Lê Cao, Kim-Anh; Boitard, Simon; Besse, Philippe
2011-06-22
Variable selection on high throughput biological data, such as gene expression or single nucleotide polymorphisms (SNPs), becomes inevitable to select relevant information and, therefore, to better characterize diseases or assess genetic structure. There are different ways to perform variable selection in large data sets. Statistical tests are commonly used to identify differentially expressed features for explanatory purposes, whereas Machine Learning wrapper approaches can be used for predictive purposes. In the case of multiple highly correlated variables, another option is to use multivariate exploratory approaches to give more insight into cell biology, biological pathways or complex traits. A simple extension of a sparse PLS exploratory approach is proposed to perform variable selection in a multiclass classification framework. sPLS-DA has a classification performance similar to other wrapper or sparse discriminant analysis approaches on public microarray and SNP data sets. More importantly, sPLS-DA is clearly competitive in terms of computational efficiency and superior in terms of interpretability of the results via valuable graphical outputs. sPLS-DA is available in the R package mixOmics, which is dedicated to the analysis of large biological data sets.