Science.gov

Sample records for nucleotide sequence polymorphism

  1. Single Nucleotide Polymorphism Mapping Using Genome-Wide Unique Sequences

    PubMed Central

    Chen, Leslie Y.Y.; Lu, Szu-Hsien; Shih, Edward S.C.; Hwang, Ming-Jing

    2002-01-01

    As more and more genomic DNAs are sequenced to characterize human genetic variations, the demand for a very fast and accurate method to genomically position these DNA sequences is high. We have developed a new mapping method that does not require sequence alignment. In this method, we first identified DNA fragments of 15 bp in length that are unique in the human genome and then used them to position single nucleotide polymorphism (SNP) sequences. By use of four desktop personal computers with AMD K7 (1 GHz) processors, our new method mapped more than 1.6 million SNP sequences in 20 hr and achieved a very good agreement with mapping results from alignment-based methods. PMID:12097348

  2. Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library

    Technology Transfer Automated Retrieval System (TEKTRAN)

    BACKGROUND: To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (...

  3. Single nucleotide polymorphism mining and nucleotide sequence analysis of Mx1 gene in exonic regions of Japanese quail

    PubMed Central

    Niraj, Diwesh Kumar; Kumar, Pushpendra; Mishra, Chinmoy; Narayan, Raj; Bhattacharya, Tarun Kumar; Shrivastava, Kush; Bhushan, Bharat; Tiwari, Ashok Kumar; Saxena, Vishesh; Sahoo, Nihar Ranjan; Sharma, Deepak

    2015-01-01

    Aim: An attempt has been made to study the Myxovirus resistant (Mx1) gene polymorphism in Japanese quail. Materials and Methods: In the present, investigation four fragments viz. Fragment I of 185 bp (Exon 3 region), Fragment II of 148 bp (Exon 5 region), Fragment III of 161 bp (Exon 7 region), and Fragment IV of 176 bp (Exon 13 region) of Mx1 gene were amplified and screened for polymorphism by polymerase chain reaction-single-strand conformation polymorphism technique in 170 Japanese quail birds. Results: Out of the four fragments, one fragment (Fragment II) was found to be polymorphic. Remaining three fragments (Fragment I, III, and IV) were found to be monomorphic which was confirmed by custom sequencing. Overall nucleotide sequence analysis of Mx1 gene of Japanese quail showed 100% homology with common quail and more than 80% homology with reported sequence of chicken breeds. Conclusion: The Mx1 gene is mostly conserved in Japanese quail. There is an urgent need of comprehensive analysis of other regions of Mx1 gene along with its possible association with the traits of economic importance in Japanese quail. PMID:27047057

  4. Developing Single Nucleotide Polymorphism (SNP) markers from transcriptome sequences for the identification of longan (Dimocarpus longan) germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...

  5. Large-scale detection and application of expressed sequence tag single nucleotide polymorphisms in Nicotiana.

    PubMed

    Wang, Y; Zhou, D; Wang, S; Yang, L

    2015-01-01

    Single nucleotide polymorphisms (SNPs) are widespread in the Nicotiana genome. Using an alignment and variation detection method, we developed 20,607,973 SNPs, based on the expressed sequence tag sequences of 10 Nicotiana species. The replacement rate was much higher than the transversion rate in the SNPs, and SNPs widely exist in the Nicotiana. In vitro verification indicated that all of the SNPs were high quality and accurate. Evolutionary relationships between 15 varieties were investigated by polymerase chain reaction with a special primer; the specific 302 locus of these sequence results clearly indicated the origin of Zhongyan 100. A database of Nicotiana SNPs (NSNP) was developed to store and search for SNPs in Nicotiana. NSNP is a tool for researchers to develop SNP markers of sequence data. PMID:26214460

  6. Mining for single nucleotide polymorphisms and insertions / deletions in expressed sequence tag libraries of oil palm.

    PubMed

    Riju, Aykkal; Chandrasekar, Arumugam; Arunachalam, Vadivel

    2007-01-01

    The oil palm is a tropical oil bearing tree. Recently EST-derived SNPs and SSRs are a free by-product of the currently expanding EST (Expressed Sequence Tag) data bases. The development of high-throughput methods for the detection of SNPs (Single Nucleotide Polymorphism) and small indels (insertion / deletion) has led to a revolution in their use as molecular markers. Available (5452) Oil palm EST sequences were mined from dbEST of NCBI. CAP3 program was used to assemble EST sequences into contigs. Candidate SNPs and Indel polymorphisms were detected using the perl script auto_snip version 1.0 which has used 576 ESTs for detecting SNPs and Indel sites. We found 1180 SNP sites and 137 indel polymorphisms with frequency 1.36 SNPs / 100 bp. Among the six tissues from which the EST libraries had been generated, mesocarp had high frequency of 2.91 SNPs and indels per 100 bp whereas the zygotic embryos had lowest frequency of 0.15 per 100 bp. We also used the Shannon index to analyze the proportion of ten possible types of SNP/indels. ESTs from tissues of normal apex showed highest values of Shannon index (0.60) whereas abnormal apex had least value (0.02). The present report deals the use of Shannon index for comparing SNP/ indel frequencies mined from ESTlibraries and also confirm that the frequency of SNP occurrence in oil palm to use them as markers for genetic studies. PMID:21670789

  7. Species diagnostic single-nucleotide polymorphism and sequence-tagged site markers for the parasitic WASP Genus Nasonia (Hymenoptera: Ptermalidae)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We developed, identified and evaluated eight single nucleotide polymorphism (SNP) and three sequence-tagged site (STS) markers in nuclear gene sequences of the wasp genus Nasonia (Hymenoptera). We studied variation of these markers in natural populations of the closely related and regionally sympatr...

  8. Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

    DOEpatents

    McCutchen-Maloney, Sandra L.

    2002-01-01

    DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dips sticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such at TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera which gives a decrease in signal.

  9. Developing single nucleotide polymorphism (SNP) markers from transcriptome sequences for identification of longan (Dimocarpus longan) germplasm

    PubMed Central

    Wang, Boyi; Tan, Hua-Wei; Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Matsumoto, Tracie; Zhang, Dapeng

    2015-01-01

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in 50 longan germplasm accessions, including cultivated varieties and wild germplasm; and designated 25 SNP markers that unambiguously identified all tested longan varieties with high statistical rigor (P<0.0001). Multiple trees from the same clone were verified and off-type trees were identified. Diversity analysis revealed genetic relationships among analyzed accessions. Cultivated varieties differed significantly from wild populations (Fst=0.300; P<0.001), demonstrating untapped genetic diversity for germplasm conservation and utilization. Within cultivated varieties, apparent differences between varieties from China and those from Thailand and Hawaii indicated geographic patterns of genetic differentiation. These SNP markers provide a powerful tool to manage longan genetic resources and breeding, with accurate and efficient genotype identification. PMID:26504559

  10. Whole genome sequencing of a single Bos taurus animal for single nucleotide polymorphism discovery

    PubMed Central

    Eck, Sebastian H; Benet-Pagès, Anna; Flisikowski, Krzysztof; Meitinger, Thomas; Fries, Ruedi; Strom, Tim M

    2009-01-01

    Background The majority of the 2 million bovine single nucleotide polymorphisms (SNPs) currently available in dbSNP have been identified in a single breed, Hereford cattle, during the bovine genome project. In an attempt to evaluate the variance of a second breed, we have produced a whole genome sequence at low coverage of a single Fleckvieh bull. Results We generated 24 gigabases of sequence, mainly using 36-bp paired-end reads, resulting in an average 7.4-fold sequence depth. This coverage was sufficient to identify 2.44 million SNPs, 82% of which were previously unknown, and 115,000 small indels. A comparison with the genotypes of the same animal, generated on a 50 k oligonucleotide chip, revealed a detection rate of 74% and 30% for homozygous and heterozygous SNPs, respectively. The false positive rate, as determined by comparison with genotypes determined for 196 randomly selected SNPs, was approximately 1.1%. We further determined the allele frequencies of the 196 SNPs in 48 Fleckvieh and 48 Braunvieh bulls. 95% of the SNPs were polymorphic with an average minor allele frequency of 24.5% and with 83% of the SNPs having a minor allele frequency larger than 5%. Conclusions This work provides the first single cattle genome by next-generation sequencing. The chosen approach - low to medium coverage re-sequencing - added more than 2 million novel SNPs to the currently publicly available SNP resource, providing a valuable resource for the construction of high density oligonucleotide arrays in the context of genome-wide association studies. PMID:19660108

  11. Empirical Comparison of Simple Sequence Repeats and Single Nucleotide Polymorphisms in Assessment of Maize Diversity and Relatedness

    Technology Transfer Automated Retrieval System (TEKTRAN)

    While Simple Sequence Repeats (SSRs) are extremely useful genetic markers, recent advances in technology have produced a shift toward use of single nucleotide polymorphisms (SNPs). The different mutational properties of these two classes of markers result in differences in heterozygosities and allel...

  12. A high-density simple sequence repeat and single nucleotide polymorphism genetic map of the tetraploid cotton genome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Cotton genome complexity was investigated with a saturated molecular genetic map that combined several sets of microsatellites or simple sequence repeats (SSR) and the first major public set of single nucleotide polymorphism (SNP) markers in cotton genomes (Gossypium spp.), and that was constructed ...

  13. Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library

    PubMed Central

    2009-01-01

    Background To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population. Results The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme HaeIII; sequencing produced 2,000,000 reads providing an average 6 fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends). Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183) of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In addition, 2% of the

  14. Gene-based single nucleotide polymorphism discovery in bovine muscle using next-generation transcriptomic sequencing

    PubMed Central

    2013-01-01

    Background Genetic information based on molecular markers has increasingly being used in cattle breeding improvement programmes, as a mean to improve conventionally phenotypic selection. Advances in molecular genetics have led to the identification of several genetic markers associated with genes affecting economic traits. Until recently, the identification of the causative genetic variants involved in the phenotypes of interest has remained a difficult task. The advent of novel sequencing technologies now offers a new opportunity for the identification of such variants. Despite sequencing costs plummeting, sequencing whole-genomes or large targeted regions is still too expensive for most laboratories. A transcriptomic-based sequencing approach offers a cheaper alternative to identify a large number of polymorphisms and possibly to discover causative variants. In the present study, we performed a gene-based single nucleotide polymorphism (SNP) discovery analysis in bovine Longissimus thoraci, using RNA-Seq. To our knowledge, this represents the first study done in bovine muscle. Results Messenger RNAs from Longissimus thoraci from three Limousin bull calves were subjected to high-throughput sequencing. Approximately 36–46 million paired-end reads were obtained per library. A total of 19,752 transcripts were identified and 34,376 different SNPs were detected. Fifty-five percent of the SNPs were found in coding regions and ~22% resulted in an amino acid change. Applying a very stringent SNP quality threshold, we detected 8,407 different high-confidence SNPs, 18% of which are non synonymous coding SNPs. To analyse the accuracy of RNA-Seq technology for SNP detection, 48 SNPs were selected for validation by genotyping. No discrepancies were observed when using the highest SNP probability threshold. To test the usefulness of the identified SNPs, the 48 selected SNPs were assessed by genotyping 93 bovine samples, representing mostly the nine major breeds used in France

  15. Development of Single Nucleotide Polymorphism Markers via Sequence-based Genotyping in Cotton (Gossypium spp)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    High-throughput single nucleotide polymorphism (SNP) genotyping has become the dominant approach to genomic analysis and genetic manipulation in many crop plants. In cotton (Gossypium spp), however, only a very limited number of loci and a dearth of information have been generated from SNP genotypi...

  16. A comparative genomics strategy for targeted discovery of single-nucleotide polymorphisms and conserved-noncoding sequences in orphan crops.

    PubMed

    Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H

    2006-04-01

    Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031

  17. SNUFER: A software for localization and presentation of single nucleotide polymorphisms using a Clustal multiple sequence alignment output file

    PubMed Central

    Mansur, Marco A B; Cardozo, Giovana P; Santos, Elaine V; Marins, Mozart

    2008-01-01

    SNUFER is a software for the automatic localization and generation of tables used for the presentation of single nucleotide polymorphisms (SNPs). After input of a fasta file containing the sequences to be analyzed, a multiple sequence alignment is generated using ClustalW ran inside SNUFER. The ClustalW output file is then used to generate a table which displays the SNPs detected in the aligned sequences and their degree of similarity. This table can be exported to Microsoft Word, Microsoft Excel or as a single text file, permitting further editing for publication. The software was written using Delphi 7 for programming and FireBird 2.0 for sequence database management. It is freely available for noncommercial use and can be downloaded from http://www.heranza.com.br/bioinformatica2.htm. PMID:19238196

  18. Phylogenetic discovery bias in Bacillus anthracis using single-nucleotide polymorphisms from whole-genome sequencing

    PubMed Central

    Pearson, Talima; Busch, Joseph D.; Ravel, Jacques; Read, Timothy D.; Rhoton, Shane D.; U'Ren, Jana M.; Simonson, Tatum S.; Kachur, Sergey M.; Leadem, Rebecca R.; Cardon, Michelle L.; Van Ert, Matthew N.; Huynh, Lynn Y.; Fraser, Claire M.; Keim, Paul

    2004-01-01

    Phylogenetic reconstruction using molecular data is often subject to homoplasy, leading to inaccurate conclusions about phylogenetic relationships among operational taxonomic units. Compared with other molecular markers, single-nucleotide polymorphisms (SNPs) exhibit extremely low mutation rates, making them rare in recently emerged pathogens, but they are less prone to homoplasy and thus extremely valuable for phylogenetic analyses. Despite their phylogenetic potential, ascertainment bias occurs when SNP characters are discovered through biased taxonomic sampling; by using whole-genome comparisons of five diverse strains of Bacillus anthracis to facilitate SNP discovery, we show that only polymorphisms lying along the evolutionary pathway between reference strains will be observed. We illustrate this in theoretical and simulated data sets in which complex phylogenetic topologies are reduced to linear evolutionary models. Using a set of 990 SNP markers, we also show how divergent branches in our topologies collapse to single points but provide accurate information on internodal distances and points of origin for ancestral clades. These data allowed us to determine the ancestral root of B. anthracis, showing that it lies closer to a newly described “C” branch than to either of two previously described “A” or “B” branches. In addition, subclade rooting of the C branch revealed unequal evolutionary rates that seem to be correlated with ecological parameters and strain attributes. Our use of nonhomoplastic whole-genome SNP characters allows branch points and clade membership to be estimated with great precision, providing greater insight into epidemiological, ecological, and forensic questions. PMID:15347815

  19. Development and characterization of new single nucleotide polymorphism markers from expressed sequence tags in common carp (Cyprinus carpio).

    PubMed

    Zhu, Chuankun; Cheng, Lei; Tong, Jingou; Yu, Xiaomu

    2012-01-01

    The common carp (Cyprinus carpio) is an important aquaculture fish worldwide but only limited single nucleotide polymorphism (SNP) markers are characterized from expressed sequence tags (ESTs) in this species. In this study, 1487 putative SNPs were bioinformatically mined from 14,066 online ESTs mainly from the European common carp, with the occurrence rate of about one SNP every 173 bp. One hundred and twenty-one of these SNPs were selected for validation using PCR fragment sequencing, and 48 out of 81 primers could amplify the expected fragments in the Chinese common carp genome. Only 26 (21.5%) putative SNPs were validated, however, 508 new SNPs and 68 indels were identified. The ratios of transitions to transversions were 1.77 for exon SNPs and 1.05 for intron SNPs. All the 23 SNPs selected for population tests were polymorphic, with the observed heterozygosity (Ho) ranging from 0.053 to 0.526 (mean 0.262), polymorphism information content (PIC) from 0.095 to 0.357 (mean 0.246), and 21 SNPs were in Hardy-Weinberg equilibrium. These results suggest that different common carp populations with geographic isolation have significant genetic variation at the SNP level, and these new EST-SNP markers are readily available for genetics and breeding studies in common carp. PMID:22837697

  20. A Simple Sequence Repeat- and Single-Nucleotide Polymorphism-Based Genetic Linkage Map of the Brown Planthopper, Nilaparvata lugens

    PubMed Central

    Jairin, Jirapong; Kobayashi, Tetsuya; Yamagata, Yoshiyuki; Sanada-Morimura, Sachiyo; Mori, Kazuki; Tashiro, Kosuke; Kuhara, Satoru; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Yamamoto, Kimiko; Matsumura, Masaya; Yasui, Hideshi

    2013-01-01

    In this study, we developed the first genetic linkage map for the major rice insect pest, the brown planthopper (BPH, Nilaparvata lugens). The linkage map was constructed by integrating linkage data from two backcross populations derived from three inbred BPH strains. The consensus map consists of 474 simple sequence repeats, 43 single-nucleotide polymorphisms, and 1 sequence-tagged site, for a total of 518 markers at 472 unique positions in 17 linkage groups. The linkage groups cover 1093.9 cM, with an average distance of 2.3 cM between loci. The average number of marker loci per linkage group was 27.8. The sex-linkage group was identified by exploiting X-linked and Y-specific markers. Our linkage map and the newly developed markers used to create it constitute an essential resource and a useful framework for future genetic analyses in BPH. PMID:23204257

  1. Species-wide genome sequence and nucleotide polymorphisms from the model allopolyploid plant Brassica napus

    PubMed Central

    Schmutzer, Thomas; Samans, Birgit; Dyrszka, Emmanuelle; Ulpinnis, Chris; Weise, Stephan; Stengel, Doreen; Colmsee, Christian; Lespinasse, Denis; Micic, Zeljko; Abel, Stefan; Duchscherer, Peter; Breuer, Frank; Abbadi, Amine; Leckband, Gunhild; Snowdon, Rod; Scholz, Uwe

    2015-01-01

    Brassica napus (oilseed rape, canola) is one of the world’s most important sources of vegetable oil for human nutrition and biofuel, and also a model species for studies investigating the evolutionary consequences of polyploidisation. Strong bottlenecks during its recent origin from interspecific hybridisation, and subsequently through intensive artificial selection, have severely depleted the genetic diversity available for breeding. On the other hand, high-throughput genome profiling technologies today provide unprecedented scope to identify, characterise and utilise genetic diversity in primary and secondary crop gene pools. Such methods also enable implementation of genomic selection strategies to accelerate breeding progress. The key prerequisite is availability of high-quality sequence data and identification of high-quality, genome-wide sequence polymorphisms representing relevant gene pools. We present comprehensive genome resequencing data from a panel of 52 highly diverse natural and synthetic B. napus accessions, along with a stringently selected panel of 4.3 million high-confidence, genome-wide SNPs. The data is of great interest for genomics-assisted breeding and for evolutionary studies on the origins and consequences in allopolyploidisation in plants. PMID:26647166

  2. Species-wide genome sequence and nucleotide polymorphisms from the model allopolyploid plant Brassica napus.

    PubMed

    Schmutzer, Thomas; Samans, Birgit; Dyrszka, Emmanuelle; Ulpinnis, Chris; Weise, Stephan; Stengel, Doreen; Colmsee, Christian; Lespinasse, Denis; Micic, Zeljko; Abel, Stefan; Duchscherer, Peter; Breuer, Frank; Abbadi, Amine; Leckband, Gunhild; Snowdon, Rod; Scholz, Uwe

    2015-01-01

    Brassica napus (oilseed rape, canola) is one of the world's most important sources of vegetable oil for human nutrition and biofuel, and also a model species for studies investigating the evolutionary consequences of polyploidisation. Strong bottlenecks during its recent origin from interspecific hybridisation, and subsequently through intensive artificial selection, have severely depleted the genetic diversity available for breeding. On the other hand, high-throughput genome profiling technologies today provide unprecedented scope to identify, characterise and utilise genetic diversity in primary and secondary crop gene pools. Such methods also enable implementation of genomic selection strategies to accelerate breeding progress. The key prerequisite is availability of high-quality sequence data and identification of high-quality, genome-wide sequence polymorphisms representing relevant gene pools. We present comprehensive genome resequencing data from a panel of 52 highly diverse natural and synthetic B. napus accessions, along with a stringently selected panel of 4.3 million high-confidence, genome-wide SNPs. The data is of great interest for genomics-assisted breeding and for evolutionary studies on the origins and consequences in allopolyploidisation in plants. PMID:26647166

  3. A resource of single-nucleotide polymorphisms for rainbow trout generated by restriction-site associated DNA sequencing of doubled haploids

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Salmonid genomes are considered to be in a pseudo-tetraploid state as a result of an evolutionarily recent genome duplication event. This situation complicates single nucleotide polymorphism (SNP) discovery in rainbow trout as many putative SNPs are actually paralogous sequence variants (PSVs) and ...

  4. Development of Single Nucleotide Polymorphism markers in Theobroma cacao and comparison to Simple Sequence Repeat markers for genotyping of Cameroon clones.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single Nucleotide Polymorphism (SNP) markers are increasingly being used in crop breeding programs, slowly replacing Simple Sequence Repeats (SSR) and other markers. SNPs provide many benefits over SSRs, including ease of analysis and unambiguous results across various platforms. We have identifie...

  5. Empirical Comparison of Simple Sequence Repeats and Single Nucleotide Polymorphisms in Assessment of Maize Diversity and Relatedness

    PubMed Central

    Hamblin, Martha T.; Warburton, Marilyn L.; Buckler, Edward S.

    2007-01-01

    While Simple Sequence Repeats (SSRs) are extremely useful genetic markers, recent advances in technology have produced a shift toward use of single nucleotide polymorphisms (SNPs). The different mutational properties of these two classes of markers result in differences in heterozygosities and allele frequencies that may have implications for their use in assessing relatedness and evaluation of genetic diversity. We compared analyses based on 89 SSRs (primarily dinucleotide repeats) to analyses based on 847 SNPs in individuals from the same 259 inbred maize lines, which had been chosen to represent the diversity available among current and historic lines used in breeding. The SSRs performed better at clustering germplasm into populations than did a set of 847 SNPs or 554 SNP haplotypes, and SSRs provided more resolution in measuring genetic distance based on allele-sharing. Except for closely related pairs of individuals, measures of distance based on SSRs were only weakly correlated with measures of distance based on SNPs. Our results suggest that 1) large numbers of SNP loci will be required to replace highly polymorphic SSRs in studies of diversity and relatedness and 2) relatedness among highly-diverged maize lines is difficult to measure accurately regardless of the marker system. PMID:18159250

  6. Next Generation Semiconductor Based Sequencing of the Donkey (Equus asinus) Genome Provided Comparative Sequence Data against the Horse Genome and a Few Millions of Single Nucleotide Polymorphisms.

    PubMed

    Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca

    2015-01-01

    Few studies investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared obtained sequence information with the available donkey draft genome (and its Illumina reads from which it was originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than those aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~ 0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionally important in equids. Comparing Y-chromosome regions we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million of single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources. PMID:26151450

  7. Next Generation Semiconductor Based Sequencing of the Donkey (Equus asinus) Genome Provided Comparative Sequence Data against the Horse Genome and a Few Millions of Single Nucleotide Polymorphisms

    PubMed Central

    Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca

    2015-01-01

    Few studies investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared obtained sequence information with the available donkey draft genome (and its Illumina reads from which it was originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than those aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~ 0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionally important in equids. Comparing Y-chromosome regions we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million of single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources. PMID:26151450

  8. Finding the right coverage: the impact of coverage and sequence quality on single nucleotide polymorphism genotyping error rates.

    PubMed

    Fountain, Emily D; Pauli, Jonathan N; Reid, Brendan N; Palsbøll, Per J; Peery, M Zachariah

    2016-07-01

    Restriction-enzyme-based sequencing methods enable the genotyping of thousands of single nucleotide polymorphism (SNP) loci in nonmodel organisms. However, in contrast to traditional genetic markers, genotyping error rates in SNPs derived from restriction-enzyme-based methods remain largely unknown. Here, we estimated genotyping error rates in SNPs genotyped with double digest RAD sequencing from Mendelian incompatibilities in known mother-offspring dyads of Hoffman's two-toed sloth (Choloepus hoffmanni) across a range of coverage and sequence quality criteria, for both reference-aligned and de novo-assembled data sets. Genotyping error rates were more sensitive to coverage than sequence quality and low coverage yielded high error rates, particularly in de novo-assembled data sets. For example, coverage ≥5 yielded median genotyping error rates of ≥0.03 and ≥0.11 in reference-aligned and de novo-assembled data sets, respectively. Genotyping error rates declined to ≤0.01 in reference-aligned data sets with a coverage ≥30, but remained ≥0.04 in the de novo-assembled data sets. We observed approximately 10- and 13-fold declines in the number of loci sampled in the reference-aligned and de novo-assembled data sets when coverage was increased from ≥5 to ≥30 at quality score ≥30, respectively. Finally, we assessed the effects of genotyping coverage on a common population genetic application, parentage assignments, and showed that the proportion of incorrectly assigned maternities was relatively high at low coverage. Overall, our results suggest that the trade-off between sample size and genotyping error rates be considered prior to building sequencing libraries, reporting genotyping error rates become standard practice, and that effects of genotyping errors on inference be evaluated in restriction-enzyme-based SNP studies. PMID:26946083

  9. Comparison of single nucleotide polymorphisms and simple sequence repeats in genotype identification and diversity assessment of cacao germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Accurate identification of individual genotypes in an efficient manner is especially important for cacao (Theobroma cacao L.) germplasm conservation and breeding. The development of single nucleotide polymorphism (SNP) markers in cacao offers the opportunity to use a high throughput genotyping syste...

  10. Sequences, Annotation and Single Nucleotide Polymorphism of the Major Histocompatibility Complex in the Domestic Cat

    PubMed Central

    Yuhki, Naoya; Mullikin, James C.; Beck, Thomas; Stephens, Robert; O'Brien, Stephen J.

    2008-01-01

    Two sequences of major histocompatibility complex (MHC) regions in the domestic cat, 2.976 and 0.362 Mbps, which were separated by an ancient chromosome break (55–80 MYA) and followed by a chromosomal inversion were annotated in detail. Gene annotation of this MHC was completed and identified 183 possible coding regions, 147 human homologues, possible functional genes and 36 pseudo/unidentified genes) by GENSCAN and BLASTN, BLASTP RepeatMasker programs. The first region spans 2.976 Mbp sequence, which encodes six classical class II antigens (three DRA and three DRB antigens) lacking the functional DP, DQ regions, nine antigen processing molecules (DOA/DOB, DMA/DMB, TAPASIN, and LMP2/LMP7,TAP1/TAP2), 52 class III genes, nineteen class I genes/gene fragments (FLAI-A to FLAI-S). Three class I genes (FLAI-H, I-K, I-E) may encode functional classical class I antigens based on deduced amino acid sequence and promoter structure. The second region spans 0.362 Mbp sequence encoding no class I genes and 18 cross-species conserved genes, excluding class I, II and their functionally related/associated genes, namely framework genes, including three olfactory receptor genes. One previously identified feline endogenous retrovirus, a baboon retrovirus derived sequence (ECE1) and two new endogenous retrovirus sequences, similar to brown bat endogenous retrovirus (FERVmlu1, FERVmlu2) were found within a 140 Kbp interval in the middle of class I region. MHC SNPs were examined based on comparisons of this BAC sequence and MHC homozygous 1.9× WGS sequences and found that 11,654 SNPs in 2.84 Mbp (0.00411 SNP per bp), which is 2.4 times higher rate than average heterozygous region in the WGS (0.0017 SNP per bp genome), and slightly higher than the SNP rate observed in human MHC (0.00337 SNP per bp). PMID:18629345

  11. A survey of endogenous retrovirus (ERV) sequences in the vicinity of multiple sclerosis (MS)-associated single nucleotide polymorphisms (SNPs).

    PubMed

    Brütting, Christine; Emmer, Alexander; Kornhuber, Malte; Staege, Martin S

    2016-08-01

    Although multiple sclerosis (MS) is one of the most common central nervous system diseases in young adults, little is known about its etiology. Several human endogenous retroviruses (ERVs) are considered to play a role in MS. We are interested in which ERVs can be identified in the vicinity of MS associated genetic marker to find potential initiators of MS. We analysed the chromosomal regions surrounding 58 single nucleotide polymorphisms (SNPs) that are associated with MS identified in one of the last major genome wide association studies. We scanned these regions for putative endogenous retrovirus sequences with large open reading frames (ORFs). We observed that more retrovirus-related putative ORFs exist in the relatively close vicinity of SNP marker indices in multiple sclerosis compared to control SNPs. We found very high homologies to HERV-K, HCML-ARV, XMRV, Galidia ERV, HERV-H/env62 and XMRV-like mouse endogenous retrovirus mERV-XL. The associated genes (CYP27B1, CD6, CD58, MPV17L2, IL12RB1, CXCR5, PTGER4, TAGAP, TYK2, ICAM3, CD86, GALC, GPR65 as well as the HLA DRB1*1501) are mainly involved in the immune system, but also in vitamin D regulation. The most frequently detected ERV sequences are related to the multiple sclerosis-associated retrovirus, the human immunodeficiency virus 1, HERV-K, and the Simian foamy virus. Our data shows that there is a relation between MS associated SNPs and the number of retroviral elements compared to control. Our data identifies new ERV sequences that have not been associated with MS, so far. PMID:27169423

  12. BRDT gene sequence in human testicular pathologies and the implication of its single nucleotide polymorphism (rs3088232) on fertility.

    PubMed

    Barda, S; Yogev, L; Paz, G; Yavetz, H; Lehavi, O; Hauser, R; Doniger, T; Breitbart, H; Kleiman, S E

    2014-07-01

    Bromodomain testis-specific (BRDT) protein is essential for the normal process of spermatogenesis. Mutant mice that expressed truncated BRDT had impaired testicular histology with severely reduced sperm concentration and abnormal sperm morphology, while a model of knockout Brdt mice with no BRDT protein had complete meiotic arrest. A BRDT single nucleotide polymorphism (SNP) (rs3088232) was reported as being associated with infertility in men. We assessed testicular specimens of 276 azoospermic men who underwent testicular sperm extraction to search for specimens that showed spermatogenic impairments similar to those of mutant BRDT mice. Ten similar specimens were selected for BRDT gene sequencing and they revealed three NCBI-reported SNPs (rs10783071, rs3088232 and rs10747493) variously distributed among them. Bioinformatics analysis predicted that they would not affect protein activity. Further assessment of rs3088232 frequency in a large group of non-obstructive azoospermia men and fertile controls demonstrated no significant difference between them (27.2 and 21.7% respectively; p = 0.122, Fisher's exact test). We conclude that the testicular impairments observed in the 10 specimens were not a consequence of BRDT gene mutation. The association between BRDT rs3088232 and infertility that had been reported in other studies was not supported. PMID:24865796

  13. Mining and comparison of haplotype-based expressed sequence tag single nucleotide polymorphisms among citrus cultivars

    Technology Transfer Automated Retrieval System (TEKTRAN)

    In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for comparison. There were a total of 567,297 ESTs belonging to 27 cultivars in varying numbers and consequentially...

  14. Phylogenetic analysis of Rutaceous plants based on single nucleotide polymorphism in chloroplast and nuclear gene sequences

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The family Rutaceae encompasses several genera including the economically important genus Citrus. In this study, we selected 22 citrus relatives belonging to the various sub groups of Rutaceae and compared the sequences of three gene fragments. The accessions selected belong to the subfamily Rutoide...

  15. The Single Nucleotide Polymorphism Consortium

    NASA Technical Reports Server (NTRS)

    Morgan, Michael

    2003-01-01

    I want to discuss both the Single Nucleotide Polymorphism (SNP) Consortium and the Human Genome Project. I am afraid most of my presentation will be thin on law and possibly too high on rhetoric. Having been engaged in a personal and direct way with these issues as a trained scientist, I find it quite difficult to be always as objective as I ought to be.

  16. Genome-wide association study for endocrine fertility traits using single nucleotide polymorphism arrays and sequence variants in dairy cattle.

    PubMed

    Tenghe, A M M; Bouwman, A C; Berglund, B; Strandberg, E; de Koning, D J; Veerkamp, R F

    2016-07-01

    Endocrine fertility traits, which are defined from progesterone concentration levels in milk, are interesting indicators of dairy cow fertility because they more directly reflect the cows own reproductive physiology than classical fertility traits, which are more biased by farm management decisions. The aim of this study was to detect quantitative trait loci (QTL) for 7 endocrine fertility traits in dairy cows by performing a genome-wide association study with 85k single nucleotide polymorphisms (SNP), and then fine-map targeted QTL regions, using imputed sequence variants. Two classical fertility traits were also analyzed for QTL with 85k SNP. The association between a SNP and a phenotype was assessed by single-locus regression for each SNP, using a linear mixed model that included a random polygenic effect. A total of 2,447 Holstein Friesian cows with 5,339 lactations with both phenotypes and genotypes were used for association analysis. Heritability estimates ranged from 0.09 to 0.15 for endocrine fertility traits and 0.03 to 0.10 for classical fertility traits. The genome-wide association study identified 17 QTL regions for endocrine fertility traits on Bos taurus autosomes (BTA) 2, 3, 8, 12, 15, 17, 23, and 25. The highest number (5) of QTL regions from the genome-wide association study was identified for the endocrine trait "proportion of samples with luteal activity." Overlapping QTL regions were found between endocrine traits on BTA 2, 3, and 17. For the classical trait calving to first service, 3 QTL regions were identified on BTA 3, 15, and 23, and an overlapping region was identified on BTA 23 with endocrine traits. Fine-mapping target regions for the endocrine traits on BTA 2 and 3 using imputed sequence variants confirmed the QTL from the genome-wide association study, and identified several associated variants that can contribute to an index of markers for genetic improvement of fertility. Several potential candidate genes underlying endocrine

  17. Insertion Sequence Element Single Nucleotide Polymorphism Typing Provides Insights into the Population Structure and Evolution of Mycobacterium ulcerans across Africa

    PubMed Central

    Jordaens, Kurt; Bomans, Pieter; Leirs, Herwig; Durnez, Lies; Affolabi, Dissou; Sopoh, Ghislain; Aguiar, Julia; Phanzu, Delphin Mavinga; Kibadi, Kapay; Eyangoh, Sara; Manou, Louis Bayonne; Phillips, Richard Odame; Adjei, Ohene; Ablordey, Anthony; Rigouts, Leen; Portaels, Françoise; Eddyani, Miriam; de Jong, Bouke C.

    2014-01-01

    Buruli ulcer is an indolent, slowly progressing necrotizing disease of the skin caused by infection with Mycobacterium ulcerans. In the present study, we applied a redesigned technique to a vast panel of M. ulcerans disease isolates and clinical samples originating from multiple African disease foci in order to (i) gain fundamental insights into the population structure and evolutionary history of the pathogen and (ii) disentangle the phylogeographic relationships within the genetically conserved cluster of African M. ulcerans. Our analyses identified 23 different African insertion sequence element single nucleotide polymorphism (ISE-SNP) types that dominate in different areas where Buruli ulcer is endemic. These ISE-SNP types appear to be the initial stages of clonal diversification from a common, possibly ancestral ISE-SNP type. ISE-SNP types were found unevenly distributed over the greater West African hydrological drainage basins. Our findings suggest that geographical barriers bordering the basins to some extent prevented bacterial gene flow between basins and that this resulted in independent focal transmission clusters associated with the hydrological drainage areas. Different phylogenetic methods yielded two well-supported sister clades within the African ISE-SNP types. The ISE-SNP types from the “pan-African clade” were found to be widespread throughout Africa, while the ISE-SNP types of the “Gabonese/Cameroonian clade” were much rarer and found in a more restricted area, which suggested that the latter clade evolved more recently. Additionally, the Gabonese/Cameroonian clade was found to form a strongly supported monophyletic group with Papua New Guinean ISE-SNP type 8, which is unrelated to other Southeast Asian ISE-SNP types. PMID:24296504

  18. Single nucleotide polymorphism discovery in cutthroat trout subspecies using genome reduction, barcoding, and 454 pyro-sequencing

    PubMed Central

    2012-01-01

    Background Salmonids are popular sport fishes, and as such have been subjected to widespread stocking throughout western North America. Historically, stocking was done with little regard for genetic variation among populations and has resulted in genetic mixing among species and subspecies in many areas, thus putting the genetic integrity of native salmonid populations at risk and creating a need to assess the genetic constitution of native salmonid populations. Cutthroat trout is a salmonid species with pronounced geographic structure (there are 10 extant subspecies) and a recent history of hybridization with introduced rainbow trout in many populations. Genetic admixture has also occurred among cutthroat trout subspecies in areas where introductions have brought two or more subspecies into contact. Consequently, management agencies have increased their efforts to evaluate the genetic composition of cutthroat trout populations to identify populations that remain uncompromised and manage them accordingly, but additional genetic markers are needed to do so effectively. Here we used genome reduction, MID-barcoding, and 454-pyrosequencing to discover single nucleotide polymorphisms that differentiate cutthroat trout subspecies and can be used as a rapid, cost-effective method to characterize the genetic composition of cutthroat trout populations. Results Thirty cutthroat and six rainbow trout individuals were subjected to genome reduction and next-generation sequencing. A total of 1,499,670 reads averaging 379 base pairs in length were generated by 454-pyrosequencing, resulting in 569,060,077 total base pairs sequenced. A total of 43,558 putative SNPs were identified, and of those, 125 SNP primers were developed that successfully amplified 96 cutthroat trout and rainbow trout individuals. These SNP loci were able to differentiate most cutthroat trout subspecies using distance methods and Structure analyses. Conclusions Genomic and bioinformatic protocols were

  19. Identification and Evaluation of Single-Nucleotide Polymorphisms in Allotetraploid Peanut (Arachis hypogaea L.) Based on Amplicon Sequencing Combined with High Resolution Melting (HRM) Analysis.

    PubMed

    Hong, Yanbin; Pandey, Manish K; Liu, Ying; Chen, Xiaoping; Liu, Hong; Varshney, Rajeev K; Liang, Xuanqiang; Huang, Shangzhi

    2015-01-01

    The cultivated peanut (Arachis hypogaea L.) is an allotetraploid (AABB) species derived from the A-genome (Arachis duranensis) and B-genome (Arachis ipaensis) progenitors. Presence of two versions of a DNA sequence based on the two progenitor genomes poses a serious technical and analytical problem during single nucleotide polymorphism (SNP) marker identification and analysis. In this context, we have analyzed 200 amplicons derived from expressed sequence tags (ESTs) and genome survey sequences (GSS) to identify SNPs in a panel of genotypes consisting of 12 cultivated peanut varieties and two diploid progenitors representing the ancestral genomes. A total of 18 EST-SNPs and 44 genomic-SNPs were identified in 12 peanut varieties by aligning the sequence of A. hypogaea with diploid progenitors. The average frequency of sequence polymorphism was higher for genomic-SNPs than the EST-SNPs with one genomic-SNP every 1011 bp as compared to one EST-SNP every 2557 bp. In order to estimate the potential and further applicability of these identified SNPs, 96 peanut varieties were genotyped using high resolution melting (HRM) method. Polymorphism information content (PIC) values for EST-SNPs ranged between 0.021 and 0.413 with a mean of 0.172 in the set of peanut varieties, while genomic-SNPs ranged between 0.080 and 0.478 with a mean of 0.249. Total 33 SNPs were used for polymorphism detection among the parents and 10 selected lines from mapping population Y13Zh (Zhenzhuhei × Yueyou13). Of the total 33 SNPs, nine SNPs showed polymorphism in the mapping population Y13Zh, and seven SNPs were successfully mapped into five linkage groups. Our results showed that SNPs can be identified in allotetraploid peanut with high accuracy through amplicon sequencing and HRM assay. The identified SNPs were very informative and can be used for different genetic and breeding applications in peanut. PMID:26697032

  20. Identification and Evaluation of Single-Nucleotide Polymorphisms in Allotetraploid Peanut (Arachis hypogaea L.) Based on Amplicon Sequencing Combined with High Resolution Melting (HRM) Analysis

    PubMed Central

    Hong, Yanbin; Pandey, Manish K.; Liu, Ying; Chen, Xiaoping; Liu, Hong; Varshney, Rajeev K.; Liang, Xuanqiang; Huang, Shangzhi

    2015-01-01

    The cultivated peanut (Arachis hypogaea L.) is an allotetraploid (AABB) species derived from the A-genome (Arachis duranensis) and B-genome (Arachis ipaensis) progenitors. Presence of two versions of a DNA sequence based on the two progenitor genomes poses a serious technical and analytical problem during single nucleotide polymorphism (SNP) marker identification and analysis. In this context, we have analyzed 200 amplicons derived from expressed sequence tags (ESTs) and genome survey sequences (GSS) to identify SNPs in a panel of genotypes consisting of 12 cultivated peanut varieties and two diploid progenitors representing the ancestral genomes. A total of 18 EST-SNPs and 44 genomic-SNPs were identified in 12 peanut varieties by aligning the sequence of A. hypogaea with diploid progenitors. The average frequency of sequence polymorphism was higher for genomic-SNPs than the EST-SNPs with one genomic-SNP every 1011 bp as compared to one EST-SNP every 2557 bp. In order to estimate the potential and further applicability of these identified SNPs, 96 peanut varieties were genotyped using high resolution melting (HRM) method. Polymorphism information content (PIC) values for EST-SNPs ranged between 0.021 and 0.413 with a mean of 0.172 in the set of peanut varieties, while genomic-SNPs ranged between 0.080 and 0.478 with a mean of 0.249. Total 33 SNPs were used for polymorphism detection among the parents and 10 selected lines from mapping population Y13Zh (Zhenzhuhei × Yueyou13). Of the total 33 SNPs, nine SNPs showed polymorphism in the mapping population Y13Zh, and seven SNPs were successfully mapped into five linkage groups. Our results showed that SNPs can be identified in allotetraploid peanut with high accuracy through amplicon sequencing and HRM assay. The identified SNPs were very informative and can be used for different genetic and breeding applications in peanut. PMID:26697032

  1. Detection of a G>C single nucleotide polymorphism within a repetitive DNA sequence by high-resolution DNA melting.

    PubMed

    Schmidt, Ulrike; Hulkkonen, Johannes; Naue, Jana

    2016-09-01

    In standard forensic DNA analysis, single base mutations within short tandem repeats (STR) mostly escape detection. In this study, high-resolution DNA melting (HRM) is compared to minisequencing and Sanger sequencing as to determine the most suitable method for detection of a G to C mutation within a repetitive DNA sequence, the STR system DXS10161. It shows an ATG/ATC polymorphism surrounded by a variable number of (TATC) and (ATCT) motifs. Neutral base changes like G:C to C:G result in very low differences in the melting temperature (T m) of the PCR amplicons. By enhanced resolution of fluorescence vs. temperature in HRM, the technique showed to be suitable for detecting a G to C transversion in this repetitive DNA sequence context. Compared to minisequencing, HRM is more time- and cost-effective. Results were confirmed by Sanger sequencing. PMID:26972692

  2. Development and Validation of Single Nucleotide Polymorphism (SNP) Markers from an Expressed Sequence Tag (EST) Database in Olive Flounder (Paralichthys olivaceus)

    PubMed Central

    Kim, Jung Eun; Lee, Young Mee; Lee, Jeong-Ho; Noh, Jae Koo; Kim, Hyun Chul; Park, Choul-Ji; Park, Jong-Won; Kim, Kyung-Kil

    2014-01-01

    To successful molecular breeding, identification and functional characterization of breeding related genes and development of molecular breeding techniques using DNA markers are essential. Although the development of a useful marker is difficult in the aspect of time, cost and effort, many markers are being developed to be used in molecular breeding and developed markers have been used in many fields. Single nucleotide polymorphisms (SNPs) markers were widely used for genomic research and breeding, but has hardly been validated for screening functional genes in olive flounder. We identified single nucleotide polymorphisms (SNPs) from expressed sequence tag (EST) database in olive flounder; out of a total 4,327 ESTs, 693 contigs and 514 SNPs were detected in total EST, and these substitutions include 297 transitions and 217 transversions. As a result, 144 SNP markers were developed on the basis of 514 SNP to selection of useful gene region, and then applied to each of eight wild and culture olive flounder (total 16 samples). In our experimental result, only 32 markers had detected polymorphism in sample, also identified 21 transitions and 11 transversions, whereas indel was not detected in polymorphic SNPs. Heterozygosity of wild and cultured olive flounder using the 32 SNP markers is 0.34 and 0.29, respectively. In conclusion, we identified SNP and polymorphism in olive flounder using newly designed marker, it supports that developed markers are suitable for SNP detection and diversity analysis in olive flounder. The outcome of this study can be basic data for researches for immunity gene and characteristic with SNP. PMID:25949198

  3. Genome-Wide Single-Nucleotide Polymorphisms Discovery and High-Density Genetic Map Construction in Cauliflower Using Specific-Locus Amplified Fragment Sequencing

    PubMed Central

    Zhao, Zhenqing; Gu, Honghui; Sheng, Xiaoguang; Yu, Huifang; Wang, Jiansheng; Huang, Long; Wang, Dan

    2016-01-01

    Molecular markers and genetic maps play an important role in plant genomics and breeding studies. Cauliflower is an important and distinctive vegetable; however, very few molecular resources have been reported for this species. In this study, a novel, specific-locus amplified fragment (SLAF) sequencing strategy was employed for large-scale single nucleotide polymorphism (SNP) discovery and high-density genetic map construction in a double-haploid, segregating population of cauliflower. A total of 12.47 Gb raw data containing 77.92 M pair-end reads were obtained after processing and 6815 polymorphic SLAFs between the two parents were detected. The average sequencing depths reached 52.66-fold for the female parent and 49.35-fold for the male parent. Subsequently, these polymorphic SLAFs were used to genotype the population and further filtered based on several criteria to construct a genetic linkage map of cauliflower. Finally, 1776 high-quality SLAF markers, including 2741 SNPs, constituted the linkage map with average data integrity of 95.68%. The final map spanned a total genetic length of 890.01 cM with an average marker interval of 0.50 cM, and covered 364.9 Mb of the reference genome. The markers and genetic map developed in this study could provide an important foundation not only for comparative genomics studies within Brassica oleracea species but also for quantitative trait loci identification and molecular breeding of cauliflower. PMID:27047515

  4. Analysis of single nucleotide polymorphisms based on RNA sequencing data of diverse bio-geographical accessions in barley.

    PubMed

    Takahagi, Kotaro; Uehara-Yamaguchi, Yukiko; Yoshida, Takuhiro; Sakurai, Tetsuya; Shinozaki, Kazuo; Mochida, Keiichi; Saisho, Daisuke

    2016-01-01

    Barley is one of the founder crops of Old world agriculture and has become the fourth most important cereal worldwide. Information on genome-scale DNA polymorphisms allows elucidating the evolutionary history behind domestication, as well as discovering and isolating useful genes for molecular breeding. Deep transcriptome sequencing enables the exploration of sequence variations in transcribed sequences; such analysis is particularly useful for species with large and complex genomes, such as barley. In this study, we performed RNA sequencing of 20 barley accessions, comprising representatives of several biogeographic regions and a wild ancestor. We identified 38,729 to 79,949 SNPs in the 19 domesticated accessions and 55,403 SNPs in the wild barley and revealed their genome-wide distribution using a reference genome. Genome-scale comparisons among accessions showed a clear differentiation between oriental and occidental barley populations. The results based on population structure analyses provide genome-scale properties of sub-populations grouped to oriental, occidental and marginal groups in barley. Our findings suggest that the oriental population of domesticated barley has genomic variations distinct from those in occidental groups, which might have contributed to barley's domestication. PMID:27616653

  5. A Laboratory Exercise for Genotyping Two Human Single Nucleotide Polymorphisms

    ERIC Educational Resources Information Center

    Fernando, James; Carlson, Bradley; LeBard, Timothy; McCarthy, Michael; Umali, Finianne; Ashton, Bryce; Rose, Ferrill F., Jr.

    2016-01-01

    The dramatic decrease in the cost of sequencing a human genome is leading to an era in which a wide range of students will benefit from having an understanding of human genetic variation. Since over 90% of sequence variation between humans is in the form of single nucleotide polymorphisms (SNPs), a laboratory exercise has been devised in order to…

  6. InPhaDel: integrative shotgun and proximity-ligation sequencing to phase deletions with single nucleotide polymorphisms

    PubMed Central

    Patel, Anand; Edge, Peter; Selvaraj, Siddarth; Bansal, Vikas; Bafna, Vineet

    2016-01-01

    Phasing of single nucleotide (SNV), and structural variations into chromosome-wide haplotypes in humans has been challenging, and required either trio sequencing or restricting phasing to population-based haplotypes. Selvaraj et al. demonstrated single individual SNV phasing is possible with proximity ligated (HiC) sequencing. Here, we demonstrate HiC can phase structural variants into phased scaffolds of SNVs. Since HiC data is noisy, and SV calling is challenging, we applied a range of supervised classification techniques, including Support Vector Machines and Random Forest, to phase deletions. Our approach was demonstrated on deletion calls and phasings on the NA12878 human genome. We used three NA12878 chromosomes and simulated chromosomes to train model parameters. The remaining NA12878 chromosomes withheld from training were used to evaluate phasing accuracy. Random Forest had the highest accuracy and correctly phased 86% of the deletions with allele-specific read evidence. Allele-specific read evidence was found for 76% of the deletions. HiC provides significant read evidence for accurately phasing 33% of the deletions. Also, eight of eight top ranked deletions phased by only HiC were validated using long range polymerase chain reaction and Sanger. Thus, deletions from a single individual can be accurately phased using a combination of shotgun and proximity ligation sequencing. InPhaDel software is available at: http://l337x911.github.io/inphadel/. PMID:27105843

  7. InPhaDel: integrative shotgun and proximity-ligation sequencing to phase deletions with single nucleotide polymorphisms.

    PubMed

    Patel, Anand; Edge, Peter; Selvaraj, Siddarth; Bansal, Vikas; Bafna, Vineet

    2016-07-01

    Phasing of single nucleotide (SNV), and structural variations into chromosome-wide haplotypes in humans has been challenging, and required either trio sequencing or restricting phasing to population-based haplotypes. Selvaraj et al demonstrated single individual SNV phasing is possible with proximity ligated (HiC) sequencing. Here, we demonstrate HiC can phase structural variants into phased scaffolds of SNVs. Since HiC data is noisy, and SV calling is challenging, we applied a range of supervised classification techniques, including Support Vector Machines and Random Forest, to phase deletions. Our approach was demonstrated on deletion calls and phasings on the NA12878 human genome. We used three NA12878 chromosomes and simulated chromosomes to train model parameters. The remaining NA12878 chromosomes withheld from training were used to evaluate phasing accuracy. Random Forest had the highest accuracy and correctly phased 86% of the deletions with allele-specific read evidence. Allele-specific read evidence was found for 76% of the deletions. HiC provides significant read evidence for accurately phasing 33% of the deletions. Also, eight of eight top ranked deletions phased by only HiC were validated using long range polymerase chain reaction and Sanger. Thus, deletions from a single individual can be accurately phased using a combination of shotgun and proximity ligation sequencing. InPhaDel software is available at: http://l337x911.github.io/inphadel/. PMID:27105843

  8. Single Nucleotide Polymorphisms and Osteoarthritis

    PubMed Central

    Wang, Ting; Liang, Yuting; Li, Hong; Li, Haibo; He, Quanze; Xue, Ying; Shen, Cong; Zhang, Chunhua; Xiang, Jingjing; Ding, Jie; Qiao, Longwei; Zheng, Qiping

    2016-01-01

    Abstract Osteoarthritis (OA) is a complex disorder characterized by degenerative articular cartilage and is largely attributed to genetic risk factors. Single nucleotide polymorphisms (SNPs) are common DNA variants that have shown promising and efficiency, compared with positional cloning, to map candidate genes of complex diseases, including OA. In this study, we aim to provide an overview of multiple SNPs from a number of genes that have recently been linked to OA susceptibility. We also performed a comprehensive meta-analysis to evaluate the association of SNP rs7639618 of double von Willebrand factor A domains (DVWA) gene with OA susceptibility. A systematic search of studies on the association of SNPs with susceptibility to OA was conducted in PubMed and Google scholar. Studies subjected to meta-analysis include human and case-control studies that met the Hardy–Weinberg equilibrium model and provide sufficient data to calculate an odds ratio (OR). A total of 9500 OA cases and 9365 controls in 7 case-control studies relating to SNP rs7639618 were included in this study and the ORs with 95% confidence intervals (CIs) were calculated. Over 50 SNPs from different genes have been shown to be associated with either hip (23), or knee (20), or both (13) OA. The ORs of these SNPs for OA and the subtypes are not consistent. As to SNP rs7639618 of DVWA, increased knee OA risk was observed in all genetic models analyzed. Specifically, people from Asian with G-allele showed significantly increased risk of knee OA (A versus G: OR = 1.28, 95% CI 1.13–1.46; AA versus GG: OR = 1.60, 95% CI 1.25–2.05; GA versus GG: OR = 1.31, 95% CI 1.18–1.44; AA versus GA+GG: OR = 1.34, 95% CI 1.12–1.61; AA+GA versus GG: OR = 1.40, 95% CI 1.19–1.64), but not in Caucasians or with hip OA. Our results suggest that multiple SNPs play different roles in the pathogenesis of OA and its subtypes; SNP rs7639618 of DVWA gene is associated with a significantly increased

  9. Deep sequencing revealed genome-wide single-nucleotide polymorphism and plasmid content of Erwinia amylovora strains isolated in Middle Atlas, Morocco.

    PubMed

    Hannou, Najat; Mondy, Samuel; Planamente, Sara; Moumni, Mohieddine; Llop, Pablo; López, María; Manceau, Charles; Barny, Marie-Anne; Faure, Denis

    2013-10-01

    Erwinia amylovora causes economic losses that affect pear and apple production in Morocco. Here, we report comparative genomics of four Moroccan E. amylovora strains with the European strain CFBP1430 and North-American strain ATCC49946. Analysis of single nucleotide polymorphisms (SNPs) revealed genetic homogeneity of Moroccan's strains and their proximity to the European strain CFBP1430. Moreover, the collected sequences allowed the assembly of a 65 kpb plasmid, which is highly similar to the plasmid pEI70 harbored by several European E. amylovora isolates. This plasmid was found in 33% of the 40 E. amylovora strains collected from several host plants in 2009 and 2010 in Morocco. PMID:23770248

  10. The EMBL Nucleotide Sequence Database.

    PubMed

    Stoesser, G; Tuli, M A; Lopez, R; Sterk, P

    1999-01-01

    The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl.html) constitutes Europe's primary nucleotide sequence resource. Main sources for DNA and RNA sequences are direct submissions from individual researchers, genome sequencing projects and patent applications. While automatic procedures allow incorporation of sequence data from large-scale genome sequencing centres and from the European Patent Office (EPO), the preferred submission tool for individual submitters is Webin (WWW). Through all stages, dataflow is monitored by EBI biologists communicating with the sequencing groups. In collaboration with DDBJ and GenBank the database is produced, maintained and distributed at the European Bioinformatics Institute (EBI). Database releases are produced quarterly and are distributed on CD-ROM. Network services allow access to the most up-to-date data collection via Internet and World Wide Web interface. EBI's Sequence Retrieval System (SRS) is a Network Browser for Databanks in Molecular Biology, integrating and linking the main nucleotide and protein databases, plus many specialised databases. For sequence similarity searching a variety of tools (e.g. Blitz, Fasta, Blast etc) are available for external users to compare their own sequences against the most currently available data in the EMBL Nucleotide Sequence Database and SWISS-PROT. PMID:9847133

  11. MIG-seq: an effective PCR-based method for genome-wide single-nucleotide polymorphism genotyping using the next-generation sequencing platform

    PubMed Central

    Suyama, Yoshihisa; Matsuki, Yu

    2015-01-01

    Restriction-enzyme (RE)-based next-generation sequencing methods have revolutionized marker-assisted genetic studies; however, the use of REs has limited their widespread adoption, especially in field samples with low-quality DNA and/or small quantities of DNA. Here, we developed a PCR-based procedure to construct reduced representation libraries without RE digestion steps, representing de novo single-nucleotide polymorphism discovery, and its genotyping using next-generation sequencing. Using multiplexed inter-simple sequence repeat (ISSR) primers, thousands of genome-wide regions were amplified effectively from a wide variety of genomes, without prior genetic information. We demonstrated: 1) Mendelian gametic segregation of the discovered variants; 2) reproducibility of genotyping by checking its applicability for individual identification; and 3) applicability in a wide variety of species by checking standard population genetic analysis. This approach, called multiplexed ISSR genotyping by sequencing, should be applicable to many marker-assisted genetic studies with a wide range of DNA qualities and quantities. PMID:26593239

  12. Nucleotide polymorphism and copy number variant detection using exome capture and next-generation sequencing in the polyploid grass Panicum virgatum

    PubMed Central

    Evans, Joseph; Kim, Jeongwoon; Childs, Kevin L; Vaillancourt, Brieanne; Crisovan, Emily; Nandety, Aruna; Gerhardt, Daniel J; Richmond, Todd A; Jeddeloh, Jeffrey A; Kaeppler, Shawn M; Casler, Michael D; Buell, C Robin

    2014-01-01

    Switchgrass (Panicum virgatum) is a polyploid, outcrossing grass species native to North America and has recently been recognized as a potential biofuel feedstock crop. Significant phenotypic variation including ploidy is present across the two primary ecotypes of switchgrass, referred to as upland and lowland switchgrass. The tetraploid switchgrass genome is approximately 1400 Mbp, split between two subgenomes, with significant repetitive sequence content limiting the efficiency of re-sequencing approaches for determining genome diversity. To characterize genetic diversity in upland and lowland switchgrass as a first step in linking genotype to phenotype, we designed an exome capture probe set based on transcript assemblies that represent approximately 50 Mb of annotated switchgrass exome sequences. We then evaluated and optimized the probe set using solid phase comparative genome hybridization and liquid phase exome capture followed by next-generation sequencing. Using the optimized probe set, we assessed variation in the exomes of eight switchgrass genotypes representing tetraploid lowland and octoploid upland cultivars to benchmark our exome capture probe set design. We identified ample variation in the switchgrass genome including 1 395 501 single nucleotide polymorphisms (SNPs), 8173 putative copy number variants and 3336 presence/absence variants. While the majority of the SNPs (84%) detected was bi-allelic, a substantial number was tri-allelic with limited occurrence of tetra-allelic polymorphisms consistent with the heterozygous and polyploid nature of the switchgrass genome. Collectively, these data demonstrate the efficacy of exome capture for discovery of genome variation in a polyploid species with a large, repetitive and heterozygous genome. PMID:24947485

  13. Discovery, Validation and Characterization of 1039 Cattle Single Nucleotide Polymorphisms

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We identified approximately 13000 putative single nucleotide polymorphisms (SNPs) by comparison of repeat-masked BAC-end sequences from the cattle RPCI-42 BAC library with whole-genome shotgun contigs of cattle genome assembly Btau 1.0. Genotyping of a subset of these SNPs was performed on a panel ...

  14. Nucleotide sequences 1986/1987

    SciTech Connect

    Not Available

    1987-01-01

    These eight volumes are the third annual published compendium of nucleic acid sequences included in the European Molecular Biology Laboratory Nucleotide Sequence Data Library and the GenBank Genetic Sequences Data Bank. Each volume surveys one or more subdivisions of the database. The volume subtitles are: Primates; Rodents; Other Vertebrates and Invertebrates, Plants and Organelles, Bacteria and Bacteriophage, Viruses, Structural RNA, Synthetic and Unannotated Sequences, and Database Directory and Master Indices.

  15. Characterization of Foodborne Outbreaks of Salmonella enterica Serovar Enteritidis with Whole-Genome Sequencing Single Nucleotide Polymorphism-Based Analysis for Surveillance and Outbreak Detection.

    PubMed

    Taylor, Angela J; Lappi, Victoria; Wolfgang, William J; Lapierre, Pascal; Palumbo, Michael J; Medus, Carlota; Boxrud, David

    2015-10-01

    Salmonella enterica serovar Enteritidis is a significant cause of gastrointestinal illness in the United States; however, current molecular subtyping methods lack resolution for this highly clonal serovar. Advances in next-generation sequencing technologies have made it possible to examine whole-genome sequencing (WGS) as a potential molecular subtyping tool for outbreak detection and source trace back. Here, we conducted a retrospective analysis of S. Enteritidis isolates from seven epidemiologically confirmed foodborne outbreaks and sporadic isolates (not epidemiologically linked) to determine the utility of WGS to identify outbreaks. A collection of 55 epidemiologically characterized clinical and environmental S. Enteritidis isolates were sequenced. Single nucleotide polymorphism (SNP)-based cluster analysis of the S. Enteritidis genomes revealed well supported clades, with less than four-SNP pairwise diversity, that were concordant with epidemiologically defined outbreaks. Sporadic isolates were an average of 42.5 SNPs distant from the outbreak clusters. Isolates collected from the same patient over several weeks differed by only two SNPs. Our findings show that WGS provided greater resolution between outbreak, sporadic, and suspect isolates than the current gold standard subtyping method, pulsed-field gel electrophoresis (PFGE). Furthermore, results could be obtained in a time frame suitable for surveillance activities, supporting the use of WGS as an outbreak detection and characterization method for S. Enteritidis. PMID:26269623

  16. Characterization of Foodborne Outbreaks of Salmonella enterica Serovar Enteritidis with Whole-Genome Sequencing Single Nucleotide Polymorphism-Based Analysis for Surveillance and Outbreak Detection

    PubMed Central

    Lappi, Victoria; Wolfgang, William J.; Lapierre, Pascal; Palumbo, Michael J.; Medus, Carlota; Boxrud, David

    2015-01-01

    Salmonella enterica serovar Enteritidis is a significant cause of gastrointestinal illness in the United States; however, current molecular subtyping methods lack resolution for this highly clonal serovar. Advances in next-generation sequencing technologies have made it possible to examine whole-genome sequencing (WGS) as a potential molecular subtyping tool for outbreak detection and source trace back. Here, we conducted a retrospective analysis of S. Enteritidis isolates from seven epidemiologically confirmed foodborne outbreaks and sporadic isolates (not epidemiologically linked) to determine the utility of WGS to identify outbreaks. A collection of 55 epidemiologically characterized clinical and environmental S. Enteritidis isolates were sequenced. Single nucleotide polymorphism (SNP)-based cluster analysis of the S. Enteritidis genomes revealed well supported clades, with less than four-SNP pairwise diversity, that were concordant with epidemiologically defined outbreaks. Sporadic isolates were an average of 42.5 SNPs distant from the outbreak clusters. Isolates collected from the same patient over several weeks differed by only two SNPs. Our findings show that WGS provided greater resolution between outbreak, sporadic, and suspect isolates than the current gold standard subtyping method, pulsed-field gel electrophoresis (PFGE). Furthermore, results could be obtained in a time frame suitable for surveillance activities, supporting the use of WGS as an outbreak detection and characterization method for S. Enteritidis. PMID:26269623

  17. The genetic landscape of paediatric de novo acute myeloid leukaemia as defined by single nucleotide polymorphism array and exon sequencing of 100 candidate genes.

    PubMed

    Olsson, Linda; Zettermark, Sofia; Biloglav, Andrea; Castor, Anders; Behrendtz, Mikael; Forestier, Erik; Paulsson, Kajsa; Johansson, Bertil

    2016-07-01

    Cytogenetic analyses of a consecutive series of 67 paediatric (median age 8 years; range 0-17) de novo acute myeloid leukaemia (AML) patients revealed aberrations in 55 (82%) cases. The most common subgroups were KMT2A rearrangement (29%), normal karyotype (15%), RUNX1-RUNX1T1 (10%), deletions of 5q, 7q and/or 17p (9%), myeloid leukaemia associated with Down syndrome (7%), PML-RARA (7%) and CBFB-MYH11 (5%). Single nucleotide polymorphism array (SNP-A) analysis and exon sequencing of 100 genes, performed in 52 and 40 cases, respectively (39 overlapping), revealed ≥1 aberration in 89%; when adding cytogenetic data, this frequency increased to 98%. Uniparental isodisomies (UPIDs) were detected in 13% and copy number aberrations (CNAs) in 63% (median 2/case); three UPIDs and 22 CNAs were recurrent. Twenty-two genes were targeted by focal CNAs, including AEBP2 and PHF6 deletions and genes involved in AML-associated gene fusions. Deep sequencing identified mutations in 65% of cases (median 1/case). In total, 60 mutations were found in 30 genes, primarily those encoding signalling proteins (47%), transcription factors (25%), or epigenetic modifiers (13%). Twelve genes (BCOR, CEBPA, FLT3, GATA1, KIT, KRAS, NOTCH1, NPM1, NRAS, PTPN11, SMC3 and TP53) were recurrently mutated. We conclude that SNP-A and deep sequencing analyses complement the cytogenetic diagnosis of paediatric AML. PMID:27022003

  18. Characterization of pancreatic ductal adenocarcinoma using whole transcriptome sequencing and copy number analysis by single-nucleotide polymorphism array.

    PubMed

    Di Marco, Mariacristina; Astolfi, Annalisa; Grassi, Elisa; Vecchiarelli, Silvia; Macchini, Marina; Indio, Valentina; Casadei, Riccardo; Ricci, Claudio; D'Ambra, Marielda; Taffurelli, Giovanni; Serra, Carla; Ercolani, Giorgio; Santini, Donatella; D'Errico, Antonia; Pinna, Antonio Daniele; Minni, Francesco; Durante, Sandra; Martella, Laura Raffaella; Biasco, Guido

    2015-11-01

    The aim of the current study was to implement whole transcriptome massively parallel sequencing (RNASeq) and copy number analysis to investigate the molecular biology of pancreatic ductal adenocarcinoma (PDAC). Samples from 16 patients with PDAC were collected by ultrasound‑guided biopsy or from surgical specimens for DNA and RNA extraction. All samples were analyzed by RNASeq performed at 75x2 base pairs on a HiScanSQ Illumina platform. Single‑nucleotide variants (SNVs) were detected with SNVMix and filtered on dbSNP, 1000 Genomes and Cosmic. Non‑synonymous SNVs were analyzed with SNPs&GO and PROVEAN. A total of 13 samples were analyzed by high resolution copy number analysis on an Affymetrix SNP array 6.0. RNAseq resulted in an average of 264 coding non‑synonymous novel SNVs (ranging from 146‑374) and 16 novel insertions or deletions (In/Dels) (ranging from 6‑24) for each sample, of which a mean of 11.2% were disease‑associated and somatic events, while 34.7% were frameshift somatic In/Dels. From this analysis, alterations in the known oncogenes associated with PDAC were observed, including Kirsten rat sarcoma viral oncogene homolog (KRAS) mutations (93.7%) and inactivation of cyclin‑dependent kinase inhibitor 2A (CDKN2A) (50%), mothers against decapentaplegic homolog 4 (SMAD4) (50%), and tumor protein 53 (TP53) (56%). One case that was negative for KRAS exhibited a G13D neuroblastoma RAS viral oncogene homolog mutation. In addition, gene fusions were detected in 10 samples for a total of 23 different intra‑ or inter‑chromosomal rearrangements, however, a recurrent fusion transcript remains to be identified. SNP arrays identified macroscopic and cryptic cytogenetic alterations in 85% of patients. Gains were observed in the chromosome arms 6p, 12p, 18q and 19q which contain KRAS, GATA binding protein 6, protein kinase B and cyclin D3. Deletions were identified on chromosome arms 1p, 9p, 6p, 18q, 10q, 15q, 17p, 21q and 19q which involve TP53

  19. Detection, validation and application of genotyping-by-sequencing based single nucleotide polymorphisms in upland cotton (Gossypium hirsutum L.).

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The presence of two closely related sub-genomes in the allotetraploid Upland cotton (Gossypium hirsutum L.) combined with a narrow genetic base of the cultivated varieties has hindered the identification of polymorphic genetic markers and their utilization in improving this important crop. Genotypi...

  20. Phylogenetic diversity of bat trypanosomes of subgenus Schizotrypanum based on multilocus enzyme electrophoresis, random amplified polymorphic DNA, and cytochrome b nucleotide sequence analyses.

    PubMed

    Barnabe, C; Brisse, S; Tibayrenc, M

    2003-02-01

    Trypanosome stocks isolated from bats (Chiroptera) and belonging to the subgenus Schizotrypanum were analyzed by multilocus enzyme electrophoresis (MLEE) at 22 loci, random amplified polymorphic DNA (RAPD) with 14 primers and/or cytochrome b nucleotide sequence. Bat trypanosomes belonged to the species Trypanosoma cruzi marinkellei (10 stocks), Trypanosoma dionisii (four stocks) and Trypanosoma vespertilionis (three stocks). One T. rangeli stock and seven stocks of T. cruzi sensu stricto, the agent of Chagas disease, were included for comparison. The homology of several RAPD fragments shared by distinct species was verified by hybridization. The sequence of a 516-nucleotide portion of the maxicircle-encoded cytochrome b (CYb) coding region was determined in representative stocks of the species under study. Phylogenetic analysis of the data confirmed the previous taxonomic attribution of these bat trypanosomes based on biological, epidemiological and ecological features. However, a new finding was that within T. cruzi marinkellei two major subdivisions could be distinguished, T.c.m. I, found in the spear-nose bats Phyllostomus discolor and Phyllostomus hastatus, and T.c.m. II, from P. discolor. In addition, the T. c. marinkellei 'Z' stock from a short-tailed bat (Carollia perspicillata) was distantly related to these two subdivisions, and the monophyly of T. c. marinkellei is unclear based on the present data. Based on the present sample, the European species T. dionisii and T. vespertilionis appeared to be more homogeneous. RAPD and CYb data both suggested the monophyly of a group composed of T. cruzi and the two major subdivisions of T. cruzi marinkellei. This study shows that MLEE, RAPD and CYb can be used for taxonomic assignment and provide valuable phylogenetic information for strains and taxa within the subgenus Schizotrypanum. An evolutionary scenario in which the broad host-range parasite T. cruzi would be derived from a bat-restricted trypanosome ancestor

  1. Tracking a Tuberculosis Outbreak Over 21 Years: Strain-Specific Single-Nucleotide Polymorphism Typing Combined With Targeted Whole-Genome Sequencing

    PubMed Central

    Stucki, David; Ballif, Marie; Bodmer, Thomas; Coscolla, Mireia; Maurer, Anne-Marie; Droz, Sara; Butz, Christa; Borrell, Sonia; Längle, Christel; Feldmann, Julia; Furrer, Hansjakob; Mordasini, Carlo; Helbling, Peter; Rieder, Hans L.; Egger, Matthias; Gagneux, Sébastien; Fenner, Lukas

    2015-01-01

    Background. Whole-genome sequencing (WGS) is increasingly used in molecular-epidemiological investigations of bacterial pathogens, despite cost- and time-intensive analyses. We combined strain-specific single-nucleotide polymorphism (SNP) typing and targeted WGS to investigate a tuberculosis cluster spanning 21 years in Bern, Switzerland. Methods. On the basis of genome sequences of 3 historical outbreak Mycobacterium tuberculosis isolates, we developed a strain-specific SNP-typing assay to identify further cases. We screened 1642 patient isolates and performed WGS on all identified cluster isolates. We extracted SNPs to construct genomic networks. Clinical and social data were retrospectively collected. Results. We identified 68 patients associated with the outbreak strain. Most received a tuberculosis diagnosis in 1991–1995, but cases were observed until 2011. Two thirds were homeless and/or substance abusers. Targeted WGS revealed 133 variable SNP positions among outbreak isolates. Genomic network analyses suggested a single origin of the outbreak, with subsequent division into 3 subclusters. Isolates from patients with confirmed epidemiological links differed by 0–11 SNPs. Conclusions. Strain-specific SNP genotyping allowed rapid and inexpensive identification of M. tuberculosis outbreak isolates in a population-based strain collection. Subsequent targeted WGS provided detailed insights into transmission dynamics. This combined approach could be applied to track bacterial pathogens in real time and at high resolution. PMID:25362193

  2. Single nucleotide polymorphisms (SNPs) in a set of expressed-sequence tag (EST) and conserved ortholog set II (COSII) markers in cultivated tomato (Solanum lycopersicum L.)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single nucleotide polymorphisms (SNPs) are the fundamental unit of genetic variation and are applied as molecular tools for genetic mapping, breeding, germplasm characterization, taxonomy, and evaluation of distinctness, uniformity and stability (DUS). We report 29 novel SNPs in 10 EST and COSII ma...

  3. Single nucleotide polymorphisms generated by genotyping by sequencing to characterize genome-wide diversity, linkage disequilibrium, and selective sweeps in cultivated watermelon

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Large datasets containing single nucleotide polymorphisms (SNPs) are used to analyze genome-wide diversity in a robust collection of cultivars from representative accessions, across the world. The extent of linkage disequilibrium (LD) within a population determines the number of markers required fo...

  4. A high-throughput data mining of single nucleotide polymorphisms in Coffea species expressed sequence tags suggests differential homeologous gene expression in the allotetraploid Coffea arabica.

    PubMed

    Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

    2010-11-01

    Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed. PMID:20864545

  5. A High-Throughput Data Mining of Single Nucleotide Polymorphisms in Coffea Species Expressed Sequence Tags Suggests Differential Homeologous Gene Expression in the Allotetraploid Coffea arabica1[W

    PubMed Central

    Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães

    2010-01-01

    Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed. PMID:20864545

  6. Characterization of a mini core collection of Japanese wheat varieties using single-nucleotide polymorphisms generated by genotyping-by-sequencing

    PubMed Central

    Kobayashi, Fuminori; Tanaka, Tsuyoshi; Kanamori, Hiroyuki; Wu, Jianzhong; Katayose, Yuichi; Handa, Hirokazu

    2016-01-01

    A core collection of Japanese wheat varieties (JWC) consisting of 96 accessions was established based on their passport data and breeding pedigrees. To clarify the molecular basis of the JWC collection, genome-wide single-nucleotide polymorphism (SNP) genotyping was performed using the genotyping-by-sequencing (GBS) approach. Phylogenetic tree and population structure analyses using these SNP data revealed the genetic diversity and relationships among the JWC accessions, classifying them into four groups; “varieties in the Hokkaido area”, “modern varieties in the northeast part of Japan”, “modern varieties in the southwest part of Japan” and “classical varieties including landraces”. This clustering closely reflected the history of wheat breeding in Japan. Furthermore, to demonstrate the utility of the JWC collection, we performed a genome-wide association study (GWAS) for three traits, namely, “days to heading in autumn sowing”, “days to heading in spring sowing” and “culm length”. We found significantly associated SNP markers with each trait, and some of these were closely linked to known major genes for heading date or culm length on the genetic map. Our study indicates that this JWC collection is a useful set of germplasm for basic and applied research aimed at understanding and utilizing the genetic diversity among Japanese wheat varieties. PMID:27162493

  7. Automated Identification of Nucleotide Sequences

    NASA Technical Reports Server (NTRS)

    Osman, Shariff; Venkateswaran, Kasthuri; Fox, George; Zhu, Dian-Hui

    2007-01-01

    STITCH is a computer program that processes raw nucleotide-sequence data to automatically remove unwanted vector information, perform reverse-complement comparison, stitch shorter sequences together to make longer ones to which the shorter ones presumably belong, and search against the user s choice of private and Internet-accessible public 16S rRNA databases. ["16S rRNA" denotes a ribosomal ribonucleic acid (rRNA) sequence that is common to all organisms.] In STITCH, a template 16S rRNA sequence is used to position forward and reverse reads. STITCH then automatically searches known 16S rRNA sequences in the user s chosen database(s) to find the sequence most similar to (the sequence that lies at the smallest edit distance from) each spliced sequence. The result of processing by STITCH is the identification of the most similar well-described bacterium. Whereas previously commercially available software for analyzing genetic sequences operates on one sequence at a time, STITCH can manipulate multiple sequences simultaneously to perform the aforementioned operations. A typical analysis of several dozen sequences (length of the order of 103 base pairs) by use of STITCH is completed in a few minutes, whereas such an analysis performed by use of prior software takes hours or days.

  8. Mining an Ostrinia nubilalis Midgut Expressed Sequence Tag (EST) Library for Candidate Genes and Single Nucleotide Polymorphisms (SNPs)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    European corn borer, Ostrinia nubilalis, larvae feed upon many plant hosts and are a major target for genetically-engineered corn expressing Bacillus thuringiensis (Bt) toxins. DNA sequencing of a non-normalized O. nubilalis larval midgut cDNA library (ARS-CICGRU ONmgEST) identified 535 unique sequ...

  9. Restriction site-associated DNA sequencing generates high-quality single nucleotide polymorphisms for assessing hybridization between bighead and silver carp in the United States and China.

    PubMed

    Lamer, James T; Sass, Greg G; Boone, Jason Q; Arbieva, Zarema H; Green, Stefan J; Epifanio, John M

    2014-01-01

    Bighead carp (Hypophthalmichthys nobilis) and silver carp (H. molitrix) are invasive species and listed as US federally injurious species under the Lacy Act. They have established populations in much of the Mississippi River Basin (MRB; Mississippi, Illinois, and Missouri rivers) and are capable of producing fertile hybrids and complex introgression. Characterizing the composition of this admixture requires a large set of high-quality, evolutionarily conserved, diagnostic genetic markers to aid in the identification and management of these species in the midst of morphological ambiguity. Restriction site-associated DNA (RAD) sequencing of 45 barcoded bighead and silver carp from the United States and China produced reads that were aligned to the silver carp transcriptome yielded 261 candidate single nucleotide polymorphisms (SNPs) with fixed allelic differences between the two species. We selected the highest quality 112 SNP loci for validation using 194 putative pure-species and F1 hybrids from the MRB and putative bighead carp and silver carp pure species from China (Amur, Pearl and Yangtze rivers). Fifty SNPs were omitted due to design/amplification failure or lack of diagnostic utility. A total of 57 species-diagnostic SNPs conserved between carp species in US and Chinese rivers were identified; 32 were annotated to functional gene loci. Twenty-seven of the 181 (15%) putative pure species were identified as hybrid backcrosses after validation, including three backcrosses from the Amur River, where hybridization has not been documented previously. The 57 SNPs identified through RAD sequencing provide a diagnostic tool to detect population admixture and to identify hybrid and pure-species Asian carps in the United States and China. PMID:23957862

  10. Whole-Genome Sequencing of Erwinia amylovora Strains from Mexico Detects Single Nucleotide Polymorphisms in rpsL Conferring Streptomycin Resistance and in the avrRpt2 Effector Altering Host Interactions.

    PubMed

    Smits, Theo H M; Guerrero-Prieto, Víctor M; Hernández-Escarcega, Germán; Blom, Jochem; Goesmann, Alexander; Rezzonico, Fabio; Duffy, Brion; Stockwell, Virginia O

    2014-01-01

    We report draft genome sequences of three Mexican Erwinia amylovora strains. A novel plasmid, pEA78, was identified. Comparative genomics revealed an rpsL chromosomal mutation conferring high-level streptomycin resistance in two strains. In the effector gene avrRpt2, a single nucleotide polymorphism was detected that overcomes fire blight disease resistance in Malus × robusta 5. PMID:24459281

  11. DNA sequence polymorphisms within the bovine guanine nucleotide-binding protein Gs subunit alpha (Gsα)-encoding (GNAS) genomic imprinting domain are associated with performance traits

    PubMed Central

    2011-01-01

    Background Genes which are epigenetically regulated via genomic imprinting can be potential targets for artificial selection during animal breeding. Indeed, imprinted loci have been shown to underlie some important quantitative traits in domestic mammals, most notably muscle mass and fat deposition. In this candidate gene study, we have identified novel associations between six validated single nucleotide polymorphisms (SNPs) spanning a 97.6 kb region within the bovine guanine nucleotide-binding protein Gs subunit alpha gene (GNAS) domain on bovine chromosome 13 and genetic merit for a range of performance traits in 848 progeny-tested Holstein-Friesian sires. The mammalian GNAS domain consists of a number of reciprocally-imprinted, alternatively-spliced genes which can play a major role in growth, development and disease in mice and humans. Based on the current annotation of the bovine GNAS domain, four of the SNPs analysed (rs43101491, rs43101493, rs43101485 and rs43101486) were located upstream of the GNAS gene, while one SNP (rs41694646) was located in the second intron of the GNAS gene. The final SNP (rs41694656) was located in the first exon of transcripts encoding the putative bovine neuroendocrine-specific protein NESP55, resulting in an aspartic acid-to-asparagine amino acid substitution at amino acid position 192. Results SNP genotype-phenotype association analyses indicate that the single intronic GNAS SNP (rs41694646) is associated (P ≤ 0.05) with a range of performance traits including milk yield, milk protein yield, the content of fat and protein in milk, culled cow carcass weight and progeny carcass conformation, measures of animal body size, direct calving difficulty (i.e. difficulty in calving due to the size of the calf) and gestation length. Association (P ≤ 0.01) with direct calving difficulty (i.e. due to calf size) and maternal calving difficulty (i.e. due to the maternal pelvic width size) was also observed at the rs43101491 SNP. Following

  12. Population genetic structure in farm and feral American mink (Neovison vison) inferred from RAD sequencing-generated single nucleotide polymorphisms.

    PubMed

    Thirstrup, J P; Ruiz-Gonzalez, A; Pujolar, J M; Larsen, P F; Jensen, J; Randi, E; Zalewski, A; Pertoldi, C

    2015-08-01

    Feral American mink populations (), derived from mink farms, are widespread in Europe. In this study we investigated genetic diversity and genetic differentiation between feral and farm mink using a panel of genetic markers (194 SNP) generated from RAD sequencing data. Sampling included a total of 211 individuals from 14 populations, 4 feral and 10 from farms, the latter including a total of 7 color types (Brown, Black, Mahogany, Sapphire, White, Pearl, and Silver). Our study revealed similar low levels of genetic diversity in both farm and feral mink. Results are consistent with small effective population size as a consequence of line selection in the farms and founder effects of a few escapees from the farms in feral populations. Moderately high genetic differentiation was found between farm and feral animals, suggesting a scenario in which wild populations were founded from farm escapes a few decades ago. Currently, escapes and gene flow are probably limited. Genetic differentiation was higher among farm color types than among farms, consistent with line selection using few individuals to create the lines. Finally, no indications of inbreeding were found in either farm or feral samples, with significant negative values found in most farm samples, showing farms are successful in avoiding inbreeding. PMID:26440156

  13. Characterization of the transcriptome, nucleotide sequence polymorphism, and natural selection in the desert adapted mouse Peromyscus eremicus

    PubMed Central

    Eisen, Michael B.

    2014-01-01

    As a direct result of intense heat and aridity, deserts are thought to be among the most harsh of environments, particularly for their mammalian inhabitants. Given that osmoregulation can be challenging for these animals, with failure resulting in death, strong selection should be observed on genes related to the maintenance of water and solute balance. One such animal, Peromyscus eremicus, is native to the desert regions of the southwest United States and may live its entire life without oral fluid intake. As a first step toward understanding the genetics that underlie this phenotype, we present a characterization of the P. eremicus transcriptome. We assay four tissues (kidney, liver, brain, testes) from a single individual and supplement this with population level renal transcriptome sequencing from 15 additional animals. We identified a set of transcripts undergoing both purifying and balancing selection based on estimates of Tajima’s D. In addition, we used the branch-site test to identify a transcript—Slc2a9, likely related to desert osmoregulation—undergoing enhanced selection in P. eremicus relative to a set of related non-desert rodents. PMID:25374784

  14. Genome-Wide Patterns of Nucleotide Polymorphism in Domesticated Rice

    PubMed Central

    Hernandez, Ryan D; Boyko, Adam; Fledel-Alon, Adi; York, Thomas L; Polato, Nicholas R; Olsen, Kenneth M; Nielsen, Rasmus; McCouch, Susan R; Bustamante, Carlos D; Purugganan, Michael D

    2007-01-01

    Domesticated Asian rice (Oryza sativa) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models to explain contemporary patterns of polymorphisms in rice, including a (i) selectively neutral population bottleneck model, (ii) bottleneck plus migration model, (iii) multiple selective sweeps model, and (iv) bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been the dominant demographic model for domesticated species, cannot explain the derived nucleotide polymorphism site frequency spectrum in rice. Instead, a bottleneck model that incorporates selective sweeps, or a more complex demographic model that includes subdivision and gene flow, are more plausible explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection results only in a local signature of variation. PMID:17907810

  15. Single nucleotide polymorphisms in caprine calpastatin gene.

    PubMed

    Sharma, R; Maitra, A; Pandey, A K; Singh, L V; Mishra, B P

    2013-04-01

    The calpains and calpastatin (CAST) make up a major cytosolic proteolytic system, the calpain-calpastatin system, found in mammalian tissues. The relative levels of the components of the calpain-calpastatin system determine the extent of meat tenderization during postmortem storage. Calpastatin (CAST) is a protein inhibitor of the ubiquitous calcium-dependent proteases-micro-calpain and m-calpain. Polymorphisms in the bovine, ovine and pig CAST gene have been associated with meat tenderness but little is known about how caprine CAST gene may affect goat meat quality traits. In this study we selected different parts of the CAST gene: 1) that have been previously reported to be polymorphic, intron 5 and 12 and 3'UTR; 2) first time explored (exon 3, 7 and 8 and part of intron 7 and 8) to investigate polymorphic status of caprine CAST gene. Using comparative sequencing ten novel SN Ps located in exon 3 and intron 5, 7 and 8 were identified. Previously reported SNPs in intron 5, 3'UTR and intron 12 were absent. Sequence analysis revealed a non synonymous amino acid variation in exon 3, which would result in Lys/Arg substitution in the corresponding protein sequence. Considerable variation was detected in intronic regions. Twenty-four InDel were also recognized in intronic regions (15) and 3'UTR (9). All the sequences shared high homology with published bovine and ovine sequences. Three PCR-RFLP loci have been established for further analyzing genetic polymorphism in indigenous goats. PMID:23866627

  16. Single nucleotide polymorphisms in common bean: their discovery and genotyping using a multiplex detection system

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single-nucleotide Polymorphism (SNP) markers are by far the most common form of DNA polymorphism in a genome. The objectives of this study were to discover SNPs in common bean comparing sequences from coding and non-coding regions obtained from Genbank and genomic DNA and to compare sequencing resu...

  17. Discovery and mapping of a new expressed sequence tag-single nucleotide polymorphism and simple sequence repeat panel for large-scale genetic studies and breeding of Theobroma cacao L.

    PubMed Central

    Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire

    2012-01-01

    Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604

  18. Nucleotide sequences encoding a thermostable alkaline protease

    DOEpatents

    Wilson, David B.; Lao, Guifang

    1998-01-01

    Nucleotide sequences, derived from a thermophilic actinomycete microorganism, which encode a thermostable alkaline protease are disclosed. Also disclosed are variants of the nucleotide sequences which encode a polypeptide having thermostable alkaline proteolytic activity. Recombinant thermostable alkaline protease or recombinant polypeptide may be obtained by culturing in a medium a host cell genetically engineered to contain and express a nucleotide sequence according to the present invention, and recovering the recombinant thermostable alkaline protease or recombinant polypeptide from the culture medium.

  19. Nucleotide sequences encoding a thermostable alkaline protease

    DOEpatents

    Wilson, D.B.; Lao, G.

    1998-01-06

    Nucleotide sequences, derived from a thermophilic actinomycete microorganism, which encode a thermostable alkaline protease are disclosed. Also disclosed are variants of the nucleotide sequences which encode a polypeptide having thermostable alkaline proteolytic activity. Recombinant thermostable alkaline protease or recombinant polypeptide may be obtained by culturing in a medium a host cell genetically engineered to contain and express a nucleotide sequence according to the present invention, and recovering the recombinant thermostable alkaline protease or recombinant polypeptide from the culture medium. 3 figs.

  20. Pyrosequencing: an accurate detection platform for single nucleotide polymorphisms.

    PubMed

    Fakhrai-Rad, Hossein; Pourmand, Nader; Ronaghi, Mostafa

    2002-05-01

    Pyrosequencing, a non-electrophoretic method for DNA sequencing, is emerging as a popular platform for analysis of single nucleotide polymorphisms (SNPs). This technology has the advantage of accuracy, ease-of-use, and high flexibility for different applications. Here, we review the methodology and the use of this technique for SNP genotyping, SNP discovery, haplotyping, and allelic frequency studies. In addition, we describe new schemes for template preparation and multiplexing as an effort for cost reduction in large-scale studies. PMID:11968080

  1. Characterization of 22 novel single nucleotide polymorphism markers in steelhead and rainbow trout

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Thirty-two individuals representing coastal and inland populations of steelhead and rainbow trout (Oncorhynchus mykiss) were sequenced at 15 ESTs and 9 microsatellite loci to identify single nucleotide polymorphisms (SNPs). Sixty-two polymorphisms were discovered during the screen and 13 were devel...

  2. From Single Nucleotide Polymorphism to Transcriptional Mechanism

    PubMed Central

    Martini, Sebastian; Nair, Viji; Patel, Sanjeevkumar R.; Eichinger, Felix; Nelson, Robert G.; Weil, E. Jennifer; Pezzolesi, Marcus G.; Krolewski, Andrzej S.; Randolph, Ann; Keller, Benjamin J.; Werner, Thomas; Kretzler, Matthias

    2013-01-01

    Genome-wide association studies have proven to be highly effective at defining relationships between single nucleotide polymorphisms (SNPs) and clinical phenotypes in complex diseases. Establishing a mechanistic link between a noncoding SNP and the clinical outcome is a significant hurdle in translating associations into biological insight. We demonstrate an approach to assess the functional context of a diabetic nephropathy (DN)-associated SNP located in the promoter region of the gene FRMD3. The approach integrates pathway analyses with transcriptional regulatory pattern-based promoter modeling and allows the identification of a transcriptional framework affected by the DN-associated SNP in the FRMD3 promoter. This framework provides a testable hypothesis for mechanisms of genomic variation and transcriptional regulation in the context of DN. Our model proposes a possible transcriptional link through which the polymorphism in the FRMD3 promoter could influence transcriptional regulation within the bone morphogenetic protein (BMP)-signaling pathway. These findings provide the rationale to interrogate the biological link between FRMD3 and the BMP pathway and serve as an example of functional genomics-based hypothesis generation. PMID:23434934

  3. Y-Single Nucleotide Polymorphisms Diversity in Chinese Indigenous Horse

    PubMed Central

    Han, Haoyuan; Zhang, Qin; Gao, Kexin; Yue, Xiangpeng; Zhang, Tao; Dang, Ruihua; Lan, Xianyong; Chen, Hong; Lei, Chuzhao

    2015-01-01

    In contrast to high genetic diversity of mitochondrial DNA (mtDNA), equine Y chromosome shows extremely low variability, implying limited patrilines in the domesticated horse. In this study, we applied direct sequencing and restriction fragment length polymorphism (RFLP) methods to investigate the polymorphisms of 33 Y chromosome specific loci in 304 Chinese indigenous horses from 13 breeds. Consequently, two Y-single nucleotide polymorphisms (SNPs) (Y-45701/997 and Y-50869) and one Y-indel (Y-45288) were identified. Of those, the Y-50869 (T>A) revealed the highest variation frequency (24.67%), whereas it was only 3.29% and 1.97% in Y-45288 (T/-) and Y-45701/997 (G>T) locus, respectively. These three mutations accounted for 27.96% of the total samples and identified five Y-SNP haplotypes, demonstrating genetic diversity of Y chromosome in Chinese horses. In addition, all the five Y-SNP haplotypes were shared by different breeds. Among 13 horse breeds analyzed, Balikun horse displayed the highest nucleotide diversity (π = 5.6×10−4) and haplotype diversity (h = 0.527), while Ningqiang horse showed the lowest nucleotide diversity (π = 0.00000) and haplotype diversity (h = 0.000). The results also revealed that Chinese horses had a different polymorphic pattern of Y chromosome from European and American horses. In conclusion, Chinese horses revealed genetic diversity of Y chromosome, however more efforts should be made to better understand the domestication and paternal origin of Chinese indigenous horses. PMID:26104513

  4. Long-range correlations in nucleotide sequences

    NASA Astrophysics Data System (ADS)

    Peng, C.-K.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Sciortino, F.; Simons, M.; Stanley, H. E.

    1992-03-01

    DNA SEQUENCES have been analysed using models, such as an it-step Markov chain, that incorporate the possibility of short-range nucleotide correlations1. We propose here a method for studying the stochastic properties of nucleotide sequences by constructing a 1:1 map of the nucleotide sequence onto a walk, which we term a 'DNA walk'. We then use the mapping to provide a quantitative measure of the correlation between nucleotides over long distances along the DNA chain. Thus we uncover in the nucleotide sequence a remarkably long-range power law correlation that implies a new scale-invariant property of DNA. We find such long-range correlations in intron-containing genes and in nontranscribed regulatory DNA sequences, but not in complementary DNA sequences or intron-less genes.

  5. Single nucleotide polymorphism analysis using different colored dye dimer probes

    NASA Astrophysics Data System (ADS)

    Marmé, Nicole; Friedrich, Achim; Denapaite, Dalia; Hakenbeck, Regine; Knemeyer, Jens-Peter

    2006-09-01

    Fluorescence quenching by dye dimer formation has been utilized to develop hairpin-structured DNA probes for the detection of a single nucleotide polymorphism (SNP) in the penicillin target gene pbp2x, which is implicated in the penicillin resistance of Streptococcus pneumoniae. We designed two specific DNA probes for the identification of the pbp2x genes from a penicillin susceptible strain R6 and a resistant strain Streptococcus mitis 661 using green-fluorescent tetramethylrhodamine (TMR) and red-fluorescent DY-636, respectively. Hybridization of each of the probes to its respective target DNA sequence opened the DNA hairpin probes, consequently breaking the nonfluorescent dye dimers into fluorescent species. This hybridization of the target with the hairpin probe achieved single nucleotide specific detection at nanomolar concentrations via increased fluorescence.

  6. Comparative Performance of Single Nucleotide Polymorphism (SNP) and Microsatellite Markers for the Detection of Population Differentiation in Diabrotica virgifera virgifera (Coleoptera: Chrysomelidae)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Putative single nucleotide polymorphisms (SNPs) were identified from contiguous sequences assembled from Diabrotica virgifera virgifera midgut expressed sequence tags (ESTs). Polymerase chain reaction restriction fragment length polymorphism (PCR-RFLP)-based assays confirmed variation at 20 biallel...

  7. Transcriptomic analysis of the interaction between Helianthus annuus and its obligate parasite Plasmopara halstedii shows single nucleotide polymorphisms in CRN sequences

    PubMed Central

    2011-01-01

    Background Downy mildew in sunflowers (Helianthus annuus L.) is caused by the oomycete Plasmopara halstedii (Farl.) Berlese et de Toni. Despite efforts by the international community to breed mildew-resistant varieties, downy mildew remains a major threat to the sunflower crop. Very few genomic, genetic and molecular resources are currently available to study this pathogen. Using a 454 sequencing method, expressed sequence tags (EST) during the interaction between H. annuus and P. halstedii have been generated and a search was performed for sites in putative effectors to show polymorphisms between the different races of P. halstedii. Results A 454 pyrosequencing run of two infected sunflower samples (inbred lines XRQ and PSC8 infected with race 710 of P. halstedii, which exhibit incompatible and compatible interactions, respectively) generated 113,720 and 172,107 useable reads. From these reads, 44,948 contigs and singletons have been produced. A bioinformatic portal, HP, was specifically created for in-depth analysis of these clusters. Using in silico filtering, 405 clusters were defined as being specific to oomycetes, and 172 were defined as non-specific oomycete clusters. A subset of these two categories was checked using PCR amplification, and 86% of the tested clusters were validated. Twenty putative RXLR and CRN effectors were detected using PSI-BLAST. Using corresponding sequences from four races (100, 304, 703 and 710), 22 SNPs were detected, providing new information on pathogen polymorphisms. Conclusions This study identified a large number of genes that are expressed during H. annuus/P. halstedii compatible or incompatible interactions. It also reveals, for the first time, that an infection mechanism exists in P. halstedii similar to that in other oomycetes associated with the presence of putative RXLR and CRN effectors. SNPs discovered in CRN effector sequences were used to determine the genetic distances between the four races of P. halstedii. This

  8. [Application of single nucleotide polymorphism in crop genetics and improvement].

    PubMed

    Du, Chun-Fang; Liu, Hui-Min; Li, Run-Zhi; Li, Peng-Bo; Ren, Zhi-Qiang

    2003-11-01

    Single nucleotide polymorphism(SNP) is the most common type of sequence difference between alleles, which can be used as a kind of high-throughput genetic marker. Several different routes have been developed to discover and identify SNP. These include the direct sequencing of PCR amplicons, electronic SNP(eSNP) and so on. SNP assays have been made in many crop species such as maize and soybean. The elite germplasm of some crops have been narrowed in genetic diversity, increasing the amount of linkage disequilibrium (LD) present and facilitating the association of SNP haplotypes at candidate gene loci with phenotypes. SNP analysis has been broadly used in the field of plant gene mapping, integration of genetic and physical maps, DNA marker-assisted breeding and functional genomics. PMID:15639972

  9. Syndrome-based discrimination of single nucleotide polymorphism.

    PubMed

    May, E E; Dolan, P; Crozier, P; Brozik, S

    2006-01-01

    The ability to discriminate nucleic acid sequences is necessary for a wide variety of applications: high throughput screening, distinguishing genetically modified organisms (GMOs), molecular computing, differentiating biological markers, fingerprinting a specific sensor response for complex systems, etc. Hybridization-based target recognition and discrimination is central to the operation of nucleic acid sensor systems. Therefore developing a quantitative correlation between mishybridization events and sensor out put is critical to the accurate interpretation of results. In this work, using experimental data produced by introducing single mutations (single nucleotide polymorphisms, SNPs) in the probe sequence of computational catalytic molecular beacons (deoxyribozyme gates) [1], we investigate coding theory algorithms for uniquely categorizing SNPs based on the calculation of syndromes. PMID:17947098

  10. Single Nucleotide Polymorphisms for Pig Identification and Parentage Exclusion

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single nucleotide polymorphisms have become an important type of marker for commercial diagnostic and parentage genotyping applications as automated genotyping systems have been developed that yield accurate genotypes. Unfortunately, a large number of highly informative public SNP markers tested in ...

  11. Complete nucleotide sequence of a maize chlorotic mottle virus isolate from Nebraska

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The complete genome of a maize chlorotic mottle virus isolate from Nebraska (MCMV-NE) was cloned and sequenced. The MCMV-NE genome consists of 4,436 nucleotides and shares 99.5% nucleotide sequence identity with an MCMV isolate from Kansas (MCMV-KS). Of 22 polymorphic sites, most resulted from t...

  12. Discovery of nucleotide polymorphisms in the Musa gene pool by Ecotilling.

    PubMed

    Till, Bradley J; Jankowicz-Cieslak, Joanna; Sági, László; Huynh, Owen A; Utsushi, Hiroe; Swennen, Rony; Terauchi, Ryohei; Mba, Chikelu

    2010-11-01

    Musa (banana and plantain) is an important genus for the global export market and in local markets where it provides staple food for approximately 400 million people. Hybridization and polyploidization of several (sub)species, combined with vegetative propagation and human selection have produced a complex genetic history. We describe the application of the Ecotilling method for the discovery and characterization of nucleotide polymorphisms in diploid and polyploid accessions of Musa. We discovered over 800 novel alleles in 80 accessions. Sequencing and band evaluation shows Ecotilling to be a robust and accurate platform for the discovery of polymorphisms in homologous and homeologous gene targets. In the process of validating the method, we identified two single nucleotide polymorphisms that may be deleterious for the function of a gene putatively important for phototropism. Evaluation of heterozygous polymorphism and haplotype blocks revealed a high level of nucleotide diversity in Musa accessions. We further applied a strategy for the simultaneous discovery of heterozygous and homozygous polymorphisms in diploid accessions to rapidly evaluate nucleotide diversity in accessions of the same genome type. This strategy can be used to develop hypotheses for inheritance patterns of nucleotide polymorphisms within and between genome types. We conclude that Ecotilling is suitable for diversity studies in Musa, that it can be considered for functional genomics studies and as tool in selecting germplasm for traditional and mutation breeding approaches. PMID:20589365

  13. Statistical analysis of nucleotide sequences.

    PubMed Central

    Stückle, E E; Emmrich, C; Grob, U; Nielsen, P J

    1990-01-01

    In order to scan nucleic acid databases for potentially relevant but as yet unknown signals, we have developed an improved statistical model for pattern analysis of nucleic acid sequences by modifying previous methods based on Markov chains. We demonstrate the importance of selecting the appropriate parameters in order for the method to function at all. The model allows the simultaneous analysis of several short sequences with unequal base frequencies and Markov order k not equal to 0 as is usually the case in databases. As a test of these modifications, we show that in E. coli sequences there is a bias against palindromic hexamers which correspond to known restriction enzyme recognition sites. PMID:2251125

  14. Lineage and genogroup-defining single nucleotide polymorphisms of Escherichia coli 0157:H7

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Escherichia coli O157:H7 is a zoonotic human pathogen for which cattle are an important reservoir host. Using both previously published and new sequencing data, a 48-locus single nucleotide polymorphism (SNP) based typing panel was developed that redundantly identified eleven genogroups that span ...

  15. Subtyping of Salmonella enterica subspecies I using single nucleotide polymorphisms in adenylate cyclase (cyaA)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Methods to rapidly identify serotypes of Salmonella enterica subspecies I are of vital importance for protecting the safety of food. To supplement the serotyping method dkgB-linked intergenic sequence ribotyping (ISR), single nucleotide polymorphisms (SNPs) were characterized within adenylate cyclas...

  16. Development of Single Nucleotide Polymorphism (SNP) Markers for Use in Commercial Maize (Zea Mays L.) Germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The development of single nucleotide polymorphism (SNP) markers in maize offer the opportunity to utilize DNA markers in many new areas of population genetics, gene discovery, plant breeding, and germplasm identification. However, the steps from sequencing and SNP discovery to SNP marker design and ...

  17. The International Nucleotide Sequence Database Collaboration.

    PubMed

    Cochrane, Guy; Karsch-Mizrachi, Ilene; Takagi, Toshihisa

    2016-01-01

    The International Nucleotide Sequence Database Collaboration (INSDC; http://www.insdc.org) comprises three global partners committed to capturing, preserving and providing comprehensive public-domain nucleotide sequence information. The INSDC establishes standards, formats and protocols for data and metadata to make it easier for individuals and organisations to submit their nucleotide data reliably to public archives. This work enables the continuous, global exchange of information about living things. Here we present an update of the INSDC in 2015, including data growth and diversification, new standards and requirements by publishers for authors to submit their data to the public archives. The INSDC serves as a model for data sharing in the life sciences. PMID:26657633

  18. Nucleotide sequence of bacteriophage fd DNA.

    PubMed Central

    Beck, E; Sommer, R; Auerswald, E A; Kurz, C; Zink, B; Osterburg, G; Schaller, H; Sugimoto, K; Sugisaki, H; Okamoto, T; Takanami, M

    1978-01-01

    The sequence of the 6,408 nucleotides of bacteriophage fd DNA has been determined. This allows to deduce the exact organisation of the filamentous phage genome and provides easy access to DNA segments of known structure and function. PMID:745987

  19. Single nucleotide polymorphism discovery in rainbow trout

    Technology Transfer Automated Retrieval System (TEKTRAN)

    To enhance capabilities for genetic analyses in rainbow trout, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be developed. However, the evolutionarily recent whole genome duplication event complicates the use of standard approaches in the discove...

  20. Single Nucleotide Polymorphisms Predict Symptom Severity of Autism Spectrum Disorder

    ERIC Educational Resources Information Center

    Jiao, Yun; Chen, Rong; Ke, Xiaoyan; Cheng, Lu; Chu, Kangkang; Lu, Zuhong; Herskovits, Edward H.

    2012-01-01

    Autism is widely believed to be a heterogeneous disorder; diagnosis is currently based solely on clinical criteria, although genetic, as well as environmental, influences are thought to be prominent factors in the etiology of most forms of autism. Our goal is to determine whether a predictive model based on single-nucleotide polymorphisms (SNPs)…

  1. Complete Nucleotide Sequence of Tn10

    PubMed Central

    Chalmers, Ronald; Sewitz, Sven; Lipkow, Karen; Crellin, Paul

    2000-01-01

    The complete nucleotide sequence of Tn10 has been determined. The dinucleotide signature and percent G+C of the sequence had no discontinuities, indicating that Tn10 constitutes a homogeneous unit. The new sequence contained three new open reading frames corresponding to a glutamate permease, repressors of heavy metal resistance operons, and a hypothetical protein in Bacillus subtilis. The glutamate permease was fully functional when expressed, but Tn10 did not protect Escherichia coli from the toxic effects of various metals. PMID:10781570

  2. The multiple codes of nucleotide sequences.

    PubMed

    Trifonov, E N

    1989-01-01

    Nucleotide sequences carry genetic information of many different kinds, not just instructions for protein synthesis (triplet code). Several codes of nucleotide sequences are discussed including: (1) the translation framing code, responsible for correct triplet counting by the ribosome during protein synthesis; (2) the chromatin code, which provides instructions on appropriate placement of nucleosomes along the DNA molecules and their spatial arrangement; (3) a putative loop code for single-stranded RNA-protein interactions. The codes are degenerate and corresponding messages are not only interspersed but actually overlap, so that some nucleotides belong to several messages simultaneously. Tandemly repeated sequences frequently considered as functionless "junk" are found to be grouped into certain classes of repeat unit lengths. This indicates some functional involvement of these sequences. A hypothesis is formulated according to which the tandem repeats are given the role of weak enhancer-silencers that modulate, in a copy number-dependent way, the expression of proximal genes. Fast amplification and elimination of the repeats provides an attractive mechanism of species adaptation to a rapidly changing environment. PMID:2673451

  3. Targeted Amplicon Sequencing for Single-Nucleotide-Polymorphism Genotyping of Attaching and Effacing Escherichia coli O26:H11 Cattle Strains via a High-Throughput Library Preparation Technique

    PubMed Central

    Delannoy, Sabine; Bugarel, Marie; Nagaraja, Tiruvoor G.; Renter, David G.; den Bakker, Henk C.; Nightingale, Kendra K.; Fach, Patrick; Loneragan, Guy H.

    2015-01-01

    Enterohemorrhagic Escherichia coli (EHEC) O26:H11, a serotype within Shiga toxin-producing E. coli (STEC) that causes severe human disease, has been considered to have evolved from attaching and effacing E. coli (AEEC) O26:H11 through the acquisition of a Shiga toxin-encoding gene. Targeted amplicon sequencing using next-generation sequencing technology of 48 phylogenetically informative single-nucleotide polymorphisms (SNPs) and three SNPs differentiating Shiga toxin-positive (stx-positive) strains from Shiga toxin-negative (stx-negative) strains were used to infer the phylogenetic relationships of 178 E. coli O26:H11 strains (6 stx-positive strains and 172 stx-negative AEEC strains) from cattle feces to 7 publically available genomes of human clinical strains. The AEEC cattle strains displayed synonymous SNP genotypes with stx2-positive sequence type 29 (ST29) human O26:H11 strains, while stx1 ST21 human and cattle strains clustered separately, demonstrating the close phylogenetic relatedness of these Shiga toxin-negative AEEC cattle strains and human clinical strains. With the exception of seven stx-negative strains, five of which contained espK, three stx-related SNPs differentiated the STEC strains from non-STEC strains, supporting the hypothesis that these AEEC cattle strains could serve as a potential reservoir for new or existing pathogenic human strains. Our results support the idea that targeted amplicon sequencing for SNP genotyping expedites strain identification and genetic characterization of E. coli O26:H11, which is important for food safety and public health. PMID:26567298

  4. Single nucleotide polymorphism markers for genetic mapping in Drosophila melanogaster

    SciTech Connect

    Hoskins, Roger A.; Phan, Alexander C.; Naeemuddin, Mohammed; Mapa, Felipa A.; Ruddy, David A.; Ryan, Jessica J.; Young, Lynn M.; Wells, Trent; Kopczynski, Casey; Ellis, Michael C.

    2001-04-16

    For nearly a century, genetic analysis in Drosophila melanogaster has been a powerful tool for analyzing gene function, yet Drosophila lacks the molecular genetic mapping tools that have recently revolutionized human, mouse and plant genetics. Here, we describe the systematic characterization of a dense set of molecular markers in Drosophila using an STS-based physical map of the genome. We identify 474 biallelic markers in standard laboratory strains of Drosophila that the genome. The majority of these markers are single nucleotide polymorphisms (SNPs) and sequences for these variants are provided in an accessible format. The average density of the new markers is 1 marker per 225 kb on the autosomes and 1 marker per 1 Mb on the X chromosome. We include in this survey a set of P-element strains that provide additional utility for high-resolution mapping. We demonstrate one application of the new markers in a simple set of crosses to map a mutation in the hedgehog gene to an interval of <1 Mb. This new map resource significantly increases the efficiency and resolution of recombination mapping and will be of immediate value to the Drosophila research community.

  5. Single nucleotide polymorphism genotyping using BeadChip microarrays.

    PubMed

    Lambert, Gilliam; Tsinajinnie, Darwin; Duggan, David

    2013-07-01

    The genotyping of single nucleotide polymorphisms (SNPs) has successfully contributed to the study of complex diseases more than any other technology to date. Genome-wide association studies (GWAS) using 10,000s to >1,000,000 SNPs have identified 1000s of statistically significant SNPs pertaining to 17 different human disease and trait categories. Post-GWAS fine-mapping studies using 10,000s to 100,000s SNPs on a microarray have narrowed the region of interest for many of these GWAS findings; in addition, independent signals within the original GWAS region have been identified. Focused content, SNP-based microarrays such as the human exome, for example, have too been used successfully to identify novel disease associations. Success has come to studies where 100s to 10,000s (mostly) to >100,000 samples were genotyped. For the time being, SNP-based microarrays remain cost-effective especially when studying large numbers of samples compared to other "genotyping" technologies including next generation sequencing. In this unit, protocols for manual (LIMS-free), semi-manual, and automated processing of BeadChip microarrays are presented. Lower throughput studies will find value in the manual and semi-manual protocols, while all types of studies--low-, medium-, and high-throughput--will find value in the semi-manual and automated protocols. PMID:23853082

  6. Single nucleotide polymorphisms in clinics: Fantasy or reality for cancer?

    PubMed

    Srinivasan, Srilakshmi; Clements, Judith A; Batra, Jyotsna

    2016-01-01

    Single nucleotide polymorphisms (SNPs) have been classically used for dissecting various human complex disorders using candidate gene studies. During the last decade, large scale SNP analysis, i.e. genome-wide association studies (GWAS) have provided an agnostic approach to identify possible genetic loci associated with heterogeneous disease such as cancer susceptibility, prognosis of survival or drug response. Further, the advent of new technologies, including microarray-based genotyping as well as high throughput next generation sequencing has opened new avenues for SNPs to be used in clinical practice. It is speculated that the utility of SNPs to understand the mechanisms, biology of variable drug response and ultimately treatment individualization based on the individual's genome composition will be indispensable in the near future. In the current review, we discuss the advantages and disadvantages of the clinical utility of genetic variants in disease risk-prediction, prognosis, clinical outcome and pharmacogenomics. The lessons and challenges for the utility of SNP-based biomarkers are also discussed, including the need for additional functional validation studies. PMID:26398894

  7. Single nucleotide polymorphisms of Kit gene in Chinese indigenous horses.

    PubMed

    Han, Haoyuan; Mao, Chunchun; Chen, Ningbo; Lan, Xianyong; Chen, Hong; Lei, Chuzhao; Dang, Ruihua

    2016-02-01

    Kit gene is a genetic determinant of horse white coat color which has been a highly valued trait in horses for at least 2,000 years. Single nucleotide polymorphisms (SNPs) in Kit are of importance due to their strong associations with melanoblast survival during embryonic development. In this study, a mutation analysis of all 21 Kit exons in 14 Chinese domestic horse breeds revealed six SNPs (g.91214T>G, g.143245T>G, g.164297C>T, g.170189C>T, g.171356C>G, and g.171471G>A), which located in 5'-UTR region, intron 6, exon 15, exon 20, intron 20, and exon 21 of the equine Kit gene, respectively. Subsequently, these six SNPs loci were genotyped in 632 Chinese horses by PCR-RFLP or direct sequencing. The six SNPs together defined 18 haplotypes, demonstrating abundant haplotype diversities in Chinese horses. All the mutant alleles and haplotypes were shared among different breeds. But fewer mutations were detected in horses from China than that from abroad, indicating that Chinese horses belong to a more ancient genetic pool. This study will provide fundamental genetic information for evaluating the genetic diversity of Kit gene in Chinese indigenous horse breeds. PMID:27348891

  8. Discovery of single nucleotide polymorphisms and mutations by pyrosequencing.

    PubMed

    Ronaghi, Mostafa; Elahi, Elahe

    2002-01-01

    Comparative genomics, analyzing variation among individual genomes, is an area of intense investigation. DNA sequencing is usually employed to look for polymorphisms and mutations. Pyrosequencing, a real-time DNA sequencing method, is emerging as a popular platform for comparative genomics. Here we review the use of this technology for mutation scanning, polymorphism discovery and chemical haplotyping. We describe the methodology and accuracy of this technique and discuss how to reduce the cost for large-scale analysis. PMID:18628881

  9. A Sequence-Ready Physical Map of Barley Anchored Genetically by Two Million Single-Nucleotide Polymorphisms1[W][OPEN

    PubMed Central

    Ariyadasa, Ruvini; Mascher, Martin; Nussbaumer, Thomas; Schulte, Daniela; Frenkel, Zeev; Poursarebani, Naser; Zhou, Ruonan; Steuernagel, Burkhard; Gundlach, Heidrun; Taudien, Stefan; Felder, Marius; Platzer, Matthias; Himmelbach, Axel; Schmutzer, Thomas; Hedley, Pete E.; Muehlbauer, Gary J.; Scholz, Uwe; Korol, Abraham; Mayer, Klaus F.X.; Waugh, Robbie; Langridge, Peter; Graner, Andreas; Stein, Nils

    2014-01-01

    Barley (Hordeum vulgare) is an important cereal crop and a model species for Triticeae genomics. To lay the foundation for hierarchical map-based sequencing, a genome-wide physical map of its large and complex 5.1 billion-bp genome was constructed by high-information content fingerprinting of almost 600,000 bacterial artificial chromosomes representing 14-fold haploid genome coverage. The resultant physical map comprises 9,265 contigs with a cumulative size of 4.9 Gb representing 96% of the physical length of the barley genome. The reliability of the map was verified through extensive genetic marker information and the analysis of topological networks of clone overlaps. A minimum tiling path of 66,772 minimally overlapping clones was defined that will serve as a template for hierarchical clone-by-clone map-based shotgun sequencing. We integrated whole-genome shotgun sequence data from the individuals of two mapping populations with published bacterial artificial chromosome survey sequence information to genetically anchor the physical map. This novel approach in combination with the comprehensive whole-genome shotgun sequence data sets allowed us to independently validate and improve a previously reported physical and genetic framework. The resources developed in this study will underpin fine-mapping and cloning of agronomically important genes and the assembly of a draft genome sequence. PMID:24243933

  10. Single nucleotide polymorphism in transcriptional regulatory regions and expression of environmentally responsive genes

    SciTech Connect

    Wang, Xuting; Tomso, Daniel J.; Liu Xuemei; Bell, Douglas A. . E-mail: BELL1@niehs.nih.gov

    2005-09-01

    Single nucleotide polymorphisms (SNPs) in the human genome are DNA sequence variations that can alter an individual's response to environmental exposure. SNPs in gene coding regions can lead to changes in the biological properties of the encoded protein. In contrast, SNPs in non-coding gene regulatory regions may affect gene expression levels in an allele-specific manner, and these functional polymorphisms represent an important but relatively unexplored class of genetic variation. The main challenge in analyzing these SNPs is a lack of robust computational and experimental methods. Here, we first outline mechanisms by which genetic variation can impact gene regulation, and review recent findings in this area; then, we describe a methodology for bioinformatic discovery and functional analysis of regulatory SNPs in cis-regulatory regions using the assembled human genome sequence and databases on sequence polymorphism and gene expression. Our method integrates SNP and gene databases and uses a set of computer programs that allow us to: (1) select SNPs, from among the >9 million human SNPs in the NCBI dbSNP database, that are similar to cis-regulatory element (RE) consensus sequences; (2) map the selected dbSNP entries to the human genome assembly in order to identify polymorphic REs near gene start sites; (3) prioritize the candidate polymorphic RE containing genes by searching the existing genotype and gene expression data sets. The applicability of this system has been demonstrated through studies on p53 responsive elements and is being extended to additional pathways and environmentally responsive genes.

  11. Multifactor dimensionality reduction analysis identifies specific nucleotide patterns promoting genetic polymorphisms

    PubMed Central

    Arehart, Eric; Gleim, Scott; White, Bill; Hwa, John; Moore, Jason H

    2009-01-01

    Background The fidelity of DNA replication serves as the nidus for both genetic evolution and genomic instability fostering disease. Single nucleotide polymorphisms (SNPs) constitute greater than 80% of the genetic variation between individuals. A new theory regarding DNA replication fidelity has emerged in which selectivity is governed by base-pair geometry through interactions between the selected nucleotide, the complementary strand, and the polymerase active site. We hypothesize that specific nucleotide combinations in the flanking regions of SNP fragments are associated with mutation. Results We modeled the relationship between DNA sequence and observed polymorphisms using the novel multifactor dimensionality reduction (MDR) approach. MDR was originally developed to detect synergistic interactions between multiple SNPs that are predictive of disease susceptibility. We initially assembled data from the Broad Institute as a pilot test for the hypothesis that flanking region patterns associate with mutagenesis (n = 2194). We then confirmed and expanded our inquiry with human SNPs within coding regions and their flanking sequences collected from the National Center for Biotechnology Information (NCBI) database (n = 29967) and a control set of sequences (coding region) not associated with SNP sites randomly selected from the NCBI database (n = 29967). We discovered seven flanking region pattern associations in the Broad dataset which reached a minimum significance level of p ≤ 0.05. Significant models (p << 0.001) were detected for each SNP type examined in the larger NCBI dataset. Importantly, the flanking region models were elongated or truncated depending on the nucleotide change. Additionally, nucleotide distributions differed significantly at motif sites relative to the type of variation observed. The MDR approach effectively discerned specific sites within the flanking regions of observed SNPs and their respective identities, supporting the collective

  12. Evaluation of anonymous and expressed sequence tag derived polymorphic microsatellite markers in the tobacco budworm Heliothis virescens (Lepidoptera: noctuidae)

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...

  13. Genotyping single nucleotide polymorphisms in barley by tetra-primer ARMS-PCR.

    PubMed

    Chiapparino, E; Lee, D; Donini, P

    2004-04-01

    Single nucleotide polymorphisms (SNPs) are the most abundant form of DNA polymorphism. These polymorphisms can be used in plants as simple genetic markers for many breeding applications, for population studies, and for germplasm fingerprinting. The great increase in the available DNA sequences in the databases has made it possible to identify SNPs by "database mining", and the single most important factor preventing their widespread use appears to be the genotyping cost. Many genotyping platforms rely on the use of sophisticated, automated equipment coupled to costly chemistry and detection systems. A simple and economical method involving a single PCR is reported here for barley SNP genotyping. Using the tetra-primer ARMS-PCR procedure, we have been able to assay unambiguously five SNPs in a set of 132 varieties of cultivated barley. The results show the reliability of this technique and its potential for use in low- to moderate-throughput situations; the association of agronomically important traits is discussed. PMID:15060595

  14. Single Nucleotide Polymorphism Analysis of European Archaeological M. leprae DNA

    PubMed Central

    Watson, Claire L.; Lockwood, Diana N. J.

    2009-01-01

    Background Leprosy was common in Europe eight to twelve centuries ago but molecular confirmation of this has been lacking. We have extracted M. leprae ancient DNA (aDNA) from medieval bones and single nucleotide polymorphism (SNP) typed the DNA, this provides insight into the pattern of leprosy transmission in Europe and may assist in the understanding of M. leprae evolution. Methods and Findings Skeletons have been exhumed from 3 European countries (the United Kingdom, Denmark and Croatia) and are dated around the medieval period (476 to 1350 A.D.). we tested for the presence of 3 previously identified single nucleotide polymorphisms (SNPs) in 10 aDNA extractions. M. leprae aDNA was extracted from 6 of the 10 bone samples. SNP analysis of these 6 extractions were compared to previously analysed European SNP data using the same PCR assays and were found to be the same. Testing for the presence of SNPs in M. leprae DNA extracted from ancient bone samples is a novel approach to analysing European M. leprae DNA and the findings concur with the previously published data that European M. leprae strains fall in to one group (SNP group 3). Conclusions These findings support the suggestion that the M. leprae genome is extremely stable and show that archaeological M. leprae DNA can be analysed to gain detailed information about the genotypic make-up of European leprosy, which may assist in the understanding of leprosy transmission worldwide. PMID:19847306

  15. Evaluation of published single nucleotide polymorphisms associated with acute GVHD.

    PubMed

    Chien, Jason W; Zhang, Xinyi Cindy; Fan, Wenhong; Wang, Hongwei; Zhao, Lue Ping; Martin, Paul J; Storer, Barry E; Boeckh, Michael; Warren, Edus H; Hansen, John A

    2012-05-31

    Candidate genetic associations with acute GVHD (aGVHD) were evaluated with the use of genotyped and imputed single-nucleotide polymorphism data from genome-wide scans of 1298 allogeneic hematopoietic cell transplantation (HCT) donors and recipients. Of 40 previously reported candidate SNPs, 6 were successfully genotyped, and 10 were imputed and passed criteria for analysis. Patient and donor genotypes were assessed for association with grades IIb-IV and III-IV aGVHD, stratified by donor type, in univariate and multivariate allelic, recessive and dominant models. Use of imputed genotypes to replicate previous IL10 associations was validated. Similar to previous publications, the IL6 donor genotype for rs1800795 was associated with a 20%-50% increased risk for grade IIb-IV aGVHD after unrelated HCT in the allelic (adjusted P = .011) and recessive (adjusted P = .0013) models. The donor genotype was associated with a 60% increase in risk for grade III-IV aGVHD after related HCT (adjusted P = .028). Other associations were found for IL2, CTLA4, HPSE, and MTHFR but were inconsistent with original publications. These results illustrate the advantages of using imputed single-nucleotide polymorphism data in genetic analyses and demonstrate the importance of validation in genetic association studies. PMID:22282500

  16. Characterization of single-nucleotide-polymorphism markers for Plasmopara viticola, the causal agent of grapevine downy mildew.

    PubMed

    Delmotte, F; Machefer, V; Giresse, X; Richard-Cervera, S; Latorse, M P; Beffa, R

    2011-11-01

    We report 34 new nuclear single-nucleotide-polymorphism (SNP) markers that have been developed from an expressed sequence tag library of Plasmopara viticola, the causal agent of grapevine downy mildew. This newly developed battery of markers will provide useful additional genetic tools for population genetic studies of this important agronomic species. PMID:21926208

  17. Computational Analysis of Single Nucleotide Polymorphisms Associated with Altered Drug Responsiveness in Type 2 Diabetes

    PubMed Central

    Costa, Valerio; Federico, Antonio; Pollastro, Carla; Ziviello, Carmela; Cataldi, Simona; Formisano, Pietro; Ciccodicola, Alfredo

    2016-01-01

    Type 2 diabetes (T2D) is one of the most frequent mortality causes in western countries, with rapidly increasing prevalence. Anti-diabetic drugs are the first therapeutic approach, although many patients develop drug resistance. Most drug responsiveness variability can be explained by genetic causes. Inter-individual variability is principally due to single nucleotide polymorphisms, and differential drug responsiveness has been correlated to alteration in genes involved in drug metabolism (CYP2C9) or insulin signaling (IRS1, ABCC8, KCNJ11 and PPARG). However, most genome-wide association studies did not provide clues about the contribution of DNA variations to impaired drug responsiveness. Thus, characterizing T2D drug responsiveness variants is needed to guide clinicians toward tailored therapeutic approaches. Here, we extensively investigated polymorphisms associated with altered drug response in T2D, predicting their effects in silico. Combining different computational approaches, we focused on the expression pattern of genes correlated to drug resistance and inferred evolutionary conservation of polymorphic residues, computationally predicting the biochemical properties of polymorphic proteins. Using RNA-Sequencing followed by targeted validation, we identified and experimentally confirmed that two nucleotide variations in the CAPN10 gene—currently annotated as intronic—fall within two new transcripts in this locus. Additionally, we found that a Single Nucleotide Polymorphism (SNP), currently reported as intergenic, maps to the intron of a new transcript, harboring CAPN10 and GPR35 genes, which undergoes non-sense mediated decay. Finally, we analyzed variants that fall into non-coding regulatory regions of yet underestimated functional significance, predicting that some of them can potentially affect gene expression and/or post-transcriptional regulation of mRNAs affecting the splicing. PMID:27347941

  18. Single nucleotide polymorphism analysis reveals heterogeneity within a seedling tree population of a polyembryonic mango cultivar.

    PubMed

    Winterhagen, Patrick; Wünsche, Jens-Norbert

    2016-05-01

    Within a polyembryonic mango seedling tree population, the genetic background of individuals should be identical because vigorous plants for cultivation are expected to develop from nucellar embryos representing maternal clones. Due to the fact that the mango cultivar 'Hôi' is assigned to the polyembryonic ecotype, an intra-cultivar variability of ethylene receptor genes was unexpected. Ethylene receptors in plants are conserved, but the number of receptors or receptor isoforms is variable regarding different plant species. However, it is shown here that the ethylene receptor MiETR1 is present in various isoforms within the mango cultivar 'Hôi'. The investigation of single nucleotide polymorphisms revealed that different MiETR1 isoforms can not be discriminated simply by individual single nucleotide exchanges but by the specific arrangement of single nucleotide polymorphisms at certain positions in the exons of MiETR1. Furthermore, an MiETR1 isoform devoid of introns in the genomic sequence was identified. The investigation demonstrates some limitations of high resolution melting and ScreenClust analysis and points out the necessity of sequencing to identify individual isoforms and to determine the variability within the tree population. PMID:27093244

  19. Gallium plasmonic nanoparticles for label-free DNA and single nucleotide polymorphism sensing.

    PubMed

    Marín, Antonio García; García-Mendiola, Tania; Bernabeu, Cristina Navio; Hernández, María Jesús; Piqueras, Juan; Pau, Jose Luis; Pariente, Félix; Lorenzo, Encarnación

    2016-05-01

    A label-free DNA and single nucleotide polymorphism (SNP) sensing method is described. It is based on the use of the pseudodielectric function of gallium plasmonic nanoparticles (GaNPs) deposited on Si (100) substrates under reversal of the polarization handedness condition. Under this condition, the pseudodielectric function is extremely sensitive to changes in the surrounding medium of the nanoparticle surface providing an excellent sensing platform competitive to conventional surface plasmon resonance. DNA sensing has been carried out by immobilizing a thiolated capture probe sequence from Helicobacter pylori onto GaNP/Si substrates; complementary target sequences of Helicobacter pylori can be quantified over the range of 10 pM to 3.0 nM with a detection limit of 6.0 pM and a linear correlation coefficient of R(2) = 0.990. The selectivity of the device allows the detection of a single nucleotide polymorphism (SNP) in a specific sequence of Helicobacter pylori, without the need for a hybridization suppressor in solution such as formamide. Furthermore, it also allows the detection of this sequence in the presence of other pathogens, such as Escherichia coli in the sample. The broad applicability of the system was demonstrated by the detection of a specific gene mutation directly associated with cystic fibrosis in large genomic DNA isolated from blood cells. PMID:27120517

  20. Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms

    PubMed Central

    Zhang, Wei; Qi, Weihong; Albert, Thomas J.; Motiwala, Alifiya S.; Alland, David; Hyytia-Trees, Eija K.; Ribot, Efrain M.; Fields, Patricia I.; Whittam, Thomas S.; Swaminathan, Bala

    2006-01-01

    Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous protein-coding genes, limiting the use of nucleotide sequences to study the evolution and epidemiology of this bacterial pathogen. To systematically examine single nucleotide polymorphisms (SNPs) at a genome scale, we designed comparative genome sequencing microarrays and analyzed 1199 chromosomal genes (a total of 1,167,948 bp) and 92,721 bp of the large virulence plasmid (pO157) of eleven outbreak-associated STEC O157 strains. We discovered 906 SNPs in 523 chromosomal genes and observed a high level of DNA polymorphisms among the pO157 plasmids. Based on a uniform rate of synonymous substitution for Escherichia coli and Salmonella enterica (4.7 × 10−9 per site per year), we estimate that the most recent common ancestor of the contemporary β-glucuronidase-negative, non-sorbitolfermenting STEC O157 strains existed ca. 40 thousand years ago. The phylogeny of the STEC O157 strains based on the informative synonymous SNPs was compared to the maximum parsimony trees inferred from pulsed-field gel electrophoresis and multilocus variable numbers of tandem repeats analysis. The topological discrepancies indicate that, in contrast to the synonymous mutations, parts of STEC O157 genomes have evolved through different mechanisms with highly variable divergence rates. The SNP loci reported here will provide useful genetic markers for developing high-throughput methods for fine-resolution genotyping of STEC O157. Functional characterization of nucleotide polymorphisms should shed new insights on the evolution, epidemiology, and pathogenesis of STEC O157 and related pathogens. PMID:16606700

  1. Current research status, databases and application of single nucleotide polymorphism.

    PubMed

    Javed, R; Mukesh

    2010-07-01

    Single Nucleotide Polymorphisms (SNPs) are the most frequent form of DNA variation in the genome. SNPs are genetic markers which are bi-allelic in nature and grow at a very fast rate. Current genomic databases contain information on several million SNPs. More than 6 million SNPs have been identified and the information is publicly available through the efforts of the SNP Consortium and others data bases. The NCBI plays a major role in facillating the identification and cataloging of SNPs through creation and maintenance of the public SNP database (dbSNP) by the biomedical community worldwide and stimulate many areas of biological research including the identification of the genetic components of disease. In this review article, we are compiling the existing SNP databases, research status and their application. PMID:21717869

  2. Development of 101 novel EST-derived single nucleotide polymorphism markers for Zhikong scallop ( Chlamys farreri)

    NASA Astrophysics Data System (ADS)

    Li, Jiqin; Bao, Zhenmin; Li, Ling; Wang, Xiaojian; Wang, Shi; Hu, Xiaoli

    2013-09-01

    Zhikong scallop ( Chlamys farreri) is an important maricultured species in China. Many researches on this species, such as population genetics and QTL fine-mapping, need a large number of molecular markers. In this study, based on the expressed sequence tags (EST), a total of 300 putative single nucleotide polymorphisms (SNPs) were selected and validated using high resolution melting (HRM) technology with unlabeled probe. Of them, 101 (33.7%) were found to be polymorphic in 48 individuals from 4 populations. Further evaluation with 48 individuals from Qingdao population showed that all the polymorphic loci had two alleles with the minor allele frequency ranged from 0.046 to 0.500. The observed and expected heterozygosities ranged from 0.000 to 0.925 and from 0.089 to 0.505, respectively. Fifteen loci deviated significantly from Hardy-Weinberg equilibrium and significant linkage disequilibrate was detected in one pair of markers. BLASTx gave significant hits for 72 of the 101 polymorphic SNP-containing ESTs. Thirty four polymorphic SNP loci were predicted to be non-synonymous substitutions as they caused either the change of codons (33 SNPs) or pretermination of translation (1 SNP). The markers developed can be used for the population studies and genetic improvement on Zhikong scallop.

  3. A genetic variation map for chicken with 2.8 million single nucleotide polymorphisms

    SciTech Connect

    Wong, G K; Hillier, L; Brandstrom, M; Croojmans, R; Ovcharenko, I; Gordon, L; Stubbs, L; Lucas, S; Glavina, T; Kaiser, P; Gunnarsson, U; Webber, C; Overton, I

    2005-02-20

    We describe a genetic variation map for the chicken genome containing 2.8 million single nucleotide polymorphisms (SNPs), based on a comparison of the sequences of 3 domestic chickens (broiler, layer, Silkie) to their wild ancestor Red Jungle Fowl (RJF). Subsequent experiments indicate that at least 90% are true SNPs, and at least 70% are common SNPs that segregate in many domestic breeds. Mean nucleotide diversity is about 5 SNP/kb for almost every possible comparison between RJF and domestic lines, between two different domestic lines, and within domestic lines--contrary to the idea that domestic animals are highly inbred relative to their wild ancestors. In fact, most of the SNPs originated prior to domestication, and there is little to no evidence of selective sweeps for adaptive alleles on length scales of greater than 100 kb.

  4. Nucleotide sequence of 3' untranslated portion of human alpha globin mRNA.

    PubMed Central

    Wilson, J T; deRiel, J K; Forget, B G; Marotta, C A; Weissman, S M

    1977-01-01

    We have determined the nucleotide sequence of 75 nucleotides of the 3'-untranslated portion of normal human alpha globin mRNA which corresponds to the elongated amino acid sequence of the chain termination mutant Hb Constant Spring. This was accomplished by sequence analysis of cDNA fragments obtained by restriction endonuclease or T4 endonuclease IV cleavage of human globin cDNA synthesized from globin mRNA by use of viral reverse transcriptase. Analysis of cRNA synthesized from cDNA by use of RNA polymerase provided additional confirmatory sequence information. Possible polymorphism has been identified at one site of the sequence. Our sequence overlaps with, and extends the sequence of 43 nucleotides determined by Proudfood and coworkers for the very 3'-terminal portion of human alpha globin mRNA. The complete 3'-untranslated sequence of human alpha globin mRNA (112 nucleotides including termination codon) shows little homology to that of the human or rabbit beta globin mRNAs except for the presence of the hexanucleotide sequence AAUAAA which is found in most eukaryotic mRNAs near the 3'-terminal poly (A). Images PMID:909779

  5. Nucleotide Sequence Diversity and Linkage Disequilibrium of Four Nuclear Loci in Foxtail Millet (Setaria italica)

    PubMed Central

    He, Shui-lian; Yang, Yang; Morrell, Peter L.; Yi, Ting-shuang

    2015-01-01

    Foxtail millet (Setaria italica (L.) Beauv) is one of the earliest domesticated grains, which has been cultivated in northern China by 8,700 years before present (YBP) and across Eurasia by 4,000 YBP. Owing to a small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1) in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less. PMID:26325578

  6. Subtyping of Salmonella enterica Subspecies I Using Single-Nucleotide Polymorphisms in Adenylate Cyclase.

    PubMed

    Guard, Jean; Abdo, Zaid; Byers, Sara Overstreet; Kriebel, Patrick; Rothrock, Michael J

    2016-07-01

    Methods to rapidly identify serotypes of Salmonella enterica subspecies I are of vital importance for protecting the safety of food. To supplement the serotyping method dkgB-linked intergenic sequence ribotyping (ISR), single-nucleotide polymorphisms were characterized within adenylate cyclase (cyaA). The National Center for Biotechnology Information (NCBI) database had 378 cyaA sequences from S. enterica subspecies I, which included 42 unique DNA sequences and 19 different amino acid sequences. Five representative isolates, namely serotypes Typhimurium, Kentucky, Enteritidis phage type PT4, and two variants of Enteritidis phage type PT13a, were differentiated within a microsphere-based fluidics system in cyaA by allele-specific primer extension. Validation against 25 poultry-related environmental Salmonella isolates representing 11 serotypes yielded a ∼89% success rate at identifying the serotype of the isolate, and a different region could be targeted to achieve 100%. When coupled with ISR, all serotypes were differentiated. Phage lineages of serotype Enteritidis 13a and 4 were identified, and a biofilm-forming strain of PT13a was differentiated from a smooth phenotype within phage type. Comparative ranking of mutation indices to genes such as the tRNA transferases, the diguanylate cyclases, and genes used for multilocus sequence typing indicated that cyaA is an appropriate gene for assessing epidemiological trends of Salmonella because of its relative stability in nucleotide composition. PMID:27035032

  7. Subtyping of Salmonella enterica Subspecies I Using Single-Nucleotide Polymorphisms in Adenylate Cyclase

    PubMed Central

    Abdo, Zaid; Byers, Sara Overstreet; Kriebel, Patrick; Rothrock, Michael J.

    2016-01-01

    Abstract Methods to rapidly identify serotypes of Salmonella enterica subspecies I are of vital importance for protecting the safety of food. To supplement the serotyping method dkgB-linked intergenic sequence ribotyping (ISR), single-nucleotide polymorphisms were characterized within adenylate cyclase (cyaA). The National Center for Biotechnology Information (NCBI) database had 378 cyaA sequences from S. enterica subspecies I, which included 42 unique DNA sequences and 19 different amino acid sequences. Five representative isolates, namely serotypes Typhimurium, Kentucky, Enteritidis phage type PT4, and two variants of Enteritidis phage type PT13a, were differentiated within a microsphere-based fluidics system in cyaA by allele-specific primer extension. Validation against 25 poultry-related environmental Salmonella isolates representing 11 serotypes yielded a ∼89% success rate at identifying the serotype of the isolate, and a different region could be targeted to achieve 100%. When coupled with ISR, all serotypes were differentiated. Phage lineages of serotype Enteritidis 13a and 4 were identified, and a biofilm-forming strain of PT13a was differentiated from a smooth phenotype within phage type. Comparative ranking of mutation indices to genes such as the tRNA transferases, the diguanylate cyclases, and genes used for multilocus sequence typing indicated that cyaA is an appropriate gene for assessing epidemiological trends of Salmonella because of its relative stability in nucleotide composition. PMID:27035032

  8. Assessment of the Geographic Origins of Pinewood Nematode Isolates via Single Nucleotide Polymorphism in Effector Genes

    PubMed Central

    Figueiredo, Joana; Simões, Maria José; Gomes, Paula; Barroso, Cristina; Pinho, Diogo; Conceição, Luci; Fonseca, Luís; Abrantes, Isabel; Pinheiro, Miguel; Egas, Conceição

    2013-01-01

    The pinewood nematode, Bursaphelenchus xylophilus, is native to North America but it only causes damaging pine wilt disease in those regions of the world where it has been introduced. The accurate detection of the species and its dispersal routes are thus essential to define effective control measures. The main goals of this study were to analyse the genetic diversity among B. xylophilus isolates from different geographic locations and identify single nucleotide polymorphism (SNPs) markers for geographic origin, through a comparative transcriptomic approach. The transcriptomes of seven B. xylophilus isolates, from Continental Portugal (4), China (1), Japan (1) and USA (1), were sequenced in the next generation platform Roche 454. Analysis of effector gene transcripts revealed inter-isolate nucleotide diversity that was validated by Sanger sequencing in the genomic DNA of the seven isolates and eight additional isolates from different geographic locations: Madeira Island (2), China (1), USA (1), Japan (2) and South Korea (2). The analysis identified 136 polymorphic positions in 10 effector transcripts. Pairwise comparison of the 136 SNPs through Neighbor-Joining and the Maximum Likelihood methods and 5-mer frequency analysis with the alignment-independent bilinear multivariate modelling approach correlated the SNPs with the isolates geographic origin. Furthermore, the SNP analysis indicated a closer proximity of the Portuguese isolates to the Korean and Chinese isolates than to the Japanese or American isolates. Each geographic cluster carried exclusive alleles that can be used as SNP markers for B. xylophilus isolate identification. PMID:24391785

  9. Naked-eye fingerprinting of single nucleotide polymorphisms on psoriasis patients

    NASA Astrophysics Data System (ADS)

    Valentini, Paola; Marsella, Alessandra; Tarantino, Paolo; Mauro, Salvatore; Baglietto, Silvia; Congedo, Maurizio; Paolo Pompa, Pier

    2016-05-01

    We report a low-cost test, based on gold nanoparticles, for the colorimetric (naked-eye) fingerprinting of a panel of single nucleotide polymorphisms (SNPs), relevant for the personalized therapy of psoriasis. Such pharmacogenomic tests are not routinely performed on psoriasis patients, due to the high cost of standard technologies. We demonstrated high sensitivity and specificity of our colorimetric test by validating it on a cohort of 30 patients, through a double-blind comparison with two state-of-the-art instrumental techniques, namely reverse dot blotting and sequencing, finding 100% agreement. This test offers high parallelization capabilities and can be easily generalized to other SNPs of clinical relevance, finding broad utility in diagnostics and pharmacogenomics.We report a low-cost test, based on gold nanoparticles, for the colorimetric (naked-eye) fingerprinting of a panel of single nucleotide polymorphisms (SNPs), relevant for the personalized therapy of psoriasis. Such pharmacogenomic tests are not routinely performed on psoriasis patients, due to the high cost of standard technologies. We demonstrated high sensitivity and specificity of our colorimetric test by validating it on a cohort of 30 patients, through a double-blind comparison with two state-of-the-art instrumental techniques, namely reverse dot blotting and sequencing, finding 100% agreement. This test offers high parallelization capabilities and can be easily generalized to other SNPs of clinical relevance, finding broad utility in diagnostics and pharmacogenomics. Electronic supplementary information (ESI) available. See DOI: 10.1039/c6nr02200f

  10. Assessment of the geographic origins of pinewood nematode isolates via single nucleotide polymorphism in effector genes.

    PubMed

    Figueiredo, Joana; Simões, Maria José; Gomes, Paula; Barroso, Cristina; Pinho, Diogo; Conceição, Luci; Fonseca, Luís; Abrantes, Isabel; Pinheiro, Miguel; Egas, Conceição

    2013-01-01

    The pinewood nematode, Bursaphelenchus xylophilus, is native to North America but it only causes damaging pine wilt disease in those regions of the world where it has been introduced. The accurate detection of the species and its dispersal routes are thus essential to define effective control measures. The main goals of this study were to analyse the genetic diversity among B. xylophilus isolates from different geographic locations and identify single nucleotide polymorphism (SNPs) markers for geographic origin, through a comparative transcriptomic approach. The transcriptomes of seven B. xylophilus isolates, from Continental Portugal (4), China (1), Japan (1) and USA (1), were sequenced in the next generation platform Roche 454. Analysis of effector gene transcripts revealed inter-isolate nucleotide diversity that was validated by Sanger sequencing in the genomic DNA of the seven isolates and eight additional isolates from different geographic locations: Madeira Island (2), China (1), USA (1), Japan (2) and South Korea (2). The analysis identified 136 polymorphic positions in 10 effector transcripts. Pairwise comparison of the 136 SNPs through Neighbor-Joining and the Maximum Likelihood methods and 5-mer frequency analysis with the alignment-independent bilinear multivariate modelling approach correlated the SNPs with the isolates geographic origin. Furthermore, the SNP analysis indicated a closer proximity of the Portuguese isolates to the Korean and Chinese isolates than to the Japanese or American isolates. Each geographic cluster carried exclusive alleles that can be used as SNP markers for B. xylophilus isolate identification. PMID:24391785

  11. Polymorphisms of nucleotide excision repair genes predict melanoma survival.

    PubMed

    Li, Chunying; Yin, Ming; Wang, Li-E; Amos, Christopher I; Zhu, Dakai; Lee, Jeffrey E; Gershenwald, Jeffrey E; Grimm, Elizabeth A; Wei, Qingyi

    2013-07-01

    Melanoma is the most highly malignant skin cancer, and nucleotide excision repair (NER) is involved in melanoma susceptibility. In this analysis of 1,042 melanoma patients, we evaluated whether genetic variants of NER genes may predict survival outcome of melanoma patients. We used genotyping data of 74 tagging single-nucleotide polymorphisms (tagSNPs) in eight core NER genes from our genome-wide association study (including two in XPA, 14 in XPC, three in XPE, four in ERCC1, 10 in ERCC2, eight in ERCC3, 14 in ERCC4, and 19 in ERCC5) and evaluated their associations with prognosis of melanoma patients. Using the Cox proportional hazards model and Kaplan-Meier analysis, we found a predictive role of XPE rs28720291, ERCC5 rs4150314, XPC rs2470458, and ERCC2 rs50871 SNPs in the prognosis of melanoma patients (rs28720291: AG vs. GG, adjusted hazard ratio (adjHR)=11.2, 95% confidence interval (CI) 3.04-40.9, P=0.0003; rs4150314: AG vs. GG, adjHR=4.76, 95% CI 1.09-20.8, P=0.038; rs2470458: AA vs. AG/GG, adjHR=2.11, 95% CI 1.03-4.33, P=0.040; and rs50871: AA vs. AC/CC adjHR=2.27, 95% CI 1.18-4.35, P=0.015). Patients with an increasing number of unfavorable genotypes had markedly increased death risk. Genetic variants of NER genes, particularly XPE rs28720291, ERCC5 rs4150314, XPC rs2470458, and ERCC2 rs50871, may independently or jointly modulate survival outcome of melanoma patients. Because our results were based on a median follow-up of 3 years without multiple test corrections, additional large prospective studies are needed to confirm our findings. PMID:23407396

  12. Research on Single Nucleotide Polymorphisms Interaction Detection from Network Perspective

    PubMed Central

    Su, Lingtao; Liu, Guixia; Wang, Han; Tian, Yuan; Zhou, Zhihui; Han, Liang; Yan, Lun

    2015-01-01

    Single Nucleotide Polymorphisms (SNPs) found in Genome-Wide Association Study (GWAS) mainly influence the susceptibility of complex diseases, but they still could not comprehensively explain the relationships between mutations and diseases. Interactions between SNPs are considered so important for deeply understanding of those relationships that several strategies have been proposed to explore such interactions. However, part of those methods perform poorly when marginal effects of disease loci are weak or absent, others may lack of considering high-order SNPs interactions, few methods have achieved the requirements in both performance and accuracy. Considering the above reasons, not only low-order, but also high-order SNP interactions as well as main-effect SNPs, should be taken into account in detection methods under an acceptable computational complexity. In this paper, a new pairwise (or low-order) interaction detection method IG (Interaction Gain) is introduced, in which disease models are not required and parallel computing is utilized. Furthermore, high-order SNP interactions were proposed to be detected by finding closely connected function modules of the network constructed from IG detection results. Tested by a wide range of simulated datasets and four WTCCC real datasets, the proposed methods accurately detected both low-order and high-order SNP interactions as well as disease-associated main-effect SNPS and it surpasses all competitors in performances. The research will advance complex diseases research by providing more reliable SNP interactions. PMID:25763929

  13. Research on single nucleotide polymorphisms interaction detection from network perspective.

    PubMed

    Su, Lingtao; Liu, Guixia; Wang, Han; Tian, Yuan; Zhou, Zhihui; Han, Liang; Yan, Lun

    2015-01-01

    Single Nucleotide Polymorphisms (SNPs) found in Genome-Wide Association Study (GWAS) mainly influence the susceptibility of complex diseases, but they still could not comprehensively explain the relationships between mutations and diseases. Interactions between SNPs are considered so important for deeply understanding of those relationships that several strategies have been proposed to explore such interactions. However, part of those methods perform poorly when marginal effects of disease loci are weak or absent, others may lack of considering high-order SNPs interactions, few methods have achieved the requirements in both performance and accuracy. Considering the above reasons, not only low-order, but also high-order SNP interactions as well as main-effect SNPs, should be taken into account in detection methods under an acceptable computational complexity. In this paper, a new pairwise (or low-order) interaction detection method IG (Interaction Gain) is introduced, in which disease models are not required and parallel computing is utilized. Furthermore, high-order SNP interactions were proposed to be detected by finding closely connected function modules of the network constructed from IG detection results. Tested by a wide range of simulated datasets and four WTCCC real datasets, the proposed methods accurately detected both low-order and high-order SNP interactions as well as disease-associated main-effect SNPS and it surpasses all competitors in performances. The research will advance complex diseases research by providing more reliable SNP interactions. PMID:25763929

  14. Single nucleotide polymorphism-based dispersal estimates using noninvasive sampling.

    PubMed

    Norman, Anita J; Spong, Göran

    2015-08-01

    Quantifying dispersal within wild populations is an important but challenging task. Here we present a method to estimate contemporary, individual-based dispersal distance from noninvasively collected samples using a specialized panel of 96 SNPs (single nucleotide polymorphisms). One main issue in conducting dispersal studies is the requirement for a high sampling resolution at a geographic scale appropriate for capturing the majority of dispersal events. In this study, fecal samples of brown bear (Ursus arctos) were collected by volunteer citizens, resulting in a high sampling resolution spanning over 45,000 km(2) in Gävleborg and Dalarna counties in Sweden. SNP genotypes were obtained for unique individuals sampled (n = 433) and subsequently used to reconstruct pedigrees. A Mantel test for isolation by distance suggests that the sampling scale was appropriate for females but not for males, which are known to disperse long distances. Euclidean distance was estimated between mother and offspring pairs identified through the reconstructed pedigrees. The mean dispersal distance was 12.9 km (SE 3.2) and 33.8 km (SE 6.8) for females and males, respectively. These results were significantly different (Wilcoxon's rank-sum test: P-value = 0.02) and are in agreement with the previously identified pattern of male-biased dispersal. Our results illustrate the potential of using a combination of noninvasively collected samples at high resolution and specialized SNPs for pedigree-based dispersal models. PMID:26357536

  15. Single nucleotide polymorphisms predict symptom severity of autism spectrum disorder

    PubMed Central

    Jiao, Yun; Chen, Rong; Ke, Xiaoyan; Cheng, Lu; Chu, Kangkang; Lu, Zuhong; Herskovits, Edward H

    2011-01-01

    Autism is widely believed to be a heterogeneous disorder; diagnosis is currently based solely on clinical criteria, although genetic, as well as environmental, influences are thought to be prominent factors in the etiology of most forms of autism. Our goal is to determine whether a predictive model based on single-nucleotide polymorphisms (SNPs) can predict symptom severity of autism spectrum disorder (ASD). We divided 118 ASD children into a mild/moderate autism group (n = 65) and a severe autism group (n = 53), based on the Childhood Autism Rating Scale (CARS). For each child, we obtained 29 SNPs of 9 ASD-related genes. To generate predictive models, we employed three machine-learning techniques: decision stumps (DSs), alternating decision trees (ADTrees), and FlexTrees. DS and FlexTree generated modestly better classifiers, with accuracy = 67%, sensitivity = 0.88 and specificity = 0.42. The SNP rs878960 in GABRB3 was selected by all models, and was related associated with CARS assessment. Our results suggest that SNPs have the potential to offer accurate classification of ASD symptom severity. PMID:21786105

  16. Single nucleotide polymorphism-based dispersal estimates using noninvasive sampling

    PubMed Central

    Norman, Anita J; Spong, Göran

    2015-01-01

    Quantifying dispersal within wild populations is an important but challenging task. Here we present a method to estimate contemporary, individual-based dispersal distance from noninvasively collected samples using a specialized panel of 96 SNPs (single nucleotide polymorphisms). One main issue in conducting dispersal studies is the requirement for a high sampling resolution at a geographic scale appropriate for capturing the majority of dispersal events. In this study, fecal samples of brown bear (Ursus arctos) were collected by volunteer citizens, resulting in a high sampling resolution spanning over 45,000 km2 in Gävleborg and Dalarna counties in Sweden. SNP genotypes were obtained for unique individuals sampled (n = 433) and subsequently used to reconstruct pedigrees. A Mantel test for isolation by distance suggests that the sampling scale was appropriate for females but not for males, which are known to disperse long distances. Euclidean distance was estimated between mother and offspring pairs identified through the reconstructed pedigrees. The mean dispersal distance was 12.9 km (SE 3.2) and 33.8 km (SE 6.8) for females and males, respectively. These results were significantly different (Wilcoxon’s rank-sum test: P-value = 0.02) and are in agreement with the previously identified pattern of male-biased dispersal. Our results illustrate the potential of using a combination of noninvasively collected samples at high resolution and specialized SNPs for pedigree-based dispersal models. PMID:26357536

  17. A MEMS-Based Approach to Single Nucleotide Polymorphism Genotyping

    PubMed Central

    Zhu, Jing; Palla, Mirkó; Ronca, Stefano; Warpner, Ronald; Ju, Jingyue; Lin, Qiao

    2014-01-01

    Genotyping of single nucleotide polymorphisms (SNPs) allows diagnosis of human genetic disorders associated with single base mutations. Conventional SNP genotyping methods are capable of providing either accurate or high-throughput detection, but are still labor-, time-, and resource-intensive. Microfluidics has been applied to SNP detection to provide fast, low-cost, and automated alternatives, although these applications are still limited by either accuracy or throughput issues. To address this challenge, we present a MEMS-based SNP genotyping approach that uses solid-phase-based reactions in a single microchamber on a temperature control chip. Polymerase chain reaction (PCR), allele specific single base extension (SBE), and desalting on microbeads are performed in the microchamber, which is coupled with matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to analyze the SBE product. Experimental results from genotyping of the SNP on exon 1 of the HBB gene, which causes sickle cell anemia, demonstrate the potential of the device for rapid, accurate, multiplexed and high-throughput detection of SNPs. PMID:24729659

  18. A Microfluidic Device for Multiplex Single-Nucleotide Polymorphism Genotyping

    PubMed Central

    Zhu, Jing; Qiu, Chunmei; Palla, Mirkó; Nguyen, ThaiHuu; Russo, James J.; Ju, Jingyue; Lin, Qiao

    2015-01-01

    Single-nucleotide polymorphisms (SNPs) are the most abundant type of genetic variations; they provide the genetic fingerprint of individuals and are essential for genetic biomarker discoveries. Accurate detection of SNPs is of great significance for disease prevention, diagnosis and prognosis, and for prediction of drug response and clinical outcomes in patients. Nevertheless, conventional SNP genotyping methods are still limited by insufficient accuracy or labor-, time-, and resource-intensive procedures. Microfluidics has been increasingly utilized to improve efficiency; however, the currently available microfluidic genotyping systems still have shortcomings in accuracy, sensitivity, throughput and multiplexing capability. To address these challenges, we developed a multi-step SNP genotyping microfluidic device, which performs single-base extension of SNP specific primers and solid-phase purification of the extension products on a temperature-controlled chip. The products are ready for immediate detection by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS), providing identification of the alleles at the target loci. The integrated device enables efficient and automated operation, while maintaining the high accuracy and sensitivity provided by MS. The multiplex genotyping capability was validated by performing rapid, accurate and simultaneous detection of 4 loci on a synthetic template. The microfluidic device has the potential to perform automatic, accurate, quantitative and high-throughput assays covering a broad spectrum of applications in biological and clinical research, drug development and forensics. PMID:26594354

  19. Single Nucleotide Polymorphism in Patients with Moyamoya Disease

    PubMed Central

    2015-01-01

    Moyamoya disease (MMD) is a chronic, progressive, cerebrovascular occlusive disorder that displays various clinical features and results in cerebral infarct or hemorrhagic stroke. Specific genes associated with the disease have not yet been identified, making identification of at-risk patients difficult before clinical manifestation. Familial MMD is not uncommon, with as many as 15% of MMD patients having a family history of the disease, suggesting a genetic etiology. Studies of single nucleotide polymorphisms (SNPs) in MMD have mostly focused on mechanical stress on vessels, endothelium, and the relationship to atherosclerosis. In this review, we discuss SNPs studies targeting the genetic etiology of MMD. Genetic analyses in familial MMD and genome-wide association studies represent promising strategies for elucidating the pathophysiology of this condition. This review also discusses future research directions, not only to offer new insights into the origin of MMD, but also to enhance our understanding of the genetic aspects of MMD. There have been several SNP studies of MMD. Current SNP studies suggest a genetic contribution to MMD, but further reliable and replicable data are needed. A large cohort or family-based design would be important. Modern SNP studies of MMD depend on novel genetic, experimental, and database methods that will hopefully hasten the arrival of a consensus conclusion. PMID:26180609

  20. Single Nucleotide Polymorphism Clustering in Systemic Autoimmune Diseases

    PubMed Central

    Charlon, Thomas; Bossini-Castillo, Lara; Carmona, F. David; Di Cara, Alessandro; Wojcik, Jérôme; Voloshynovskiy, Sviatoslav

    2016-01-01

    Systemic Autoimmune Diseases, a group of chronic inflammatory conditions, have variable symptoms and difficult diagnosis. In order to reclassify them based on genetic markers rather than clinical criteria, we performed clustering of Single Nucleotide Polymorphisms. However naive approaches tend to group patients primarily by their geographic origin. To reduce this “ancestry signal”, we developed SNPClust, a method to select large sources of ancestry-independent genetic variations from all variations detected by Principal Component Analysis. Applied to a Systemic Lupus Erythematosus case control dataset, SNPClust successfully reduced the ancestry signal. Results were compared with association studies between the cases and controls without or with reference population stratification correction methods. SNPClust amplified the disease discriminating signal and the ratio of significant associations outside the HLA locus was greater compared to population stratification correction methods. SNPClust will enable the use of ancestry-independent genetic information in the reclassification of Systemic Autoimmune Diseases. SNPClust is available as an R package and demonstrated on the public Human Genome Diversity Project dataset at https://github.com/ThomasChln/snpclust. PMID:27490238

  1. Single Nucleotide Polymorphism Clustering in Systemic Autoimmune Diseases.

    PubMed

    Charlon, Thomas; Martínez-Bueno, Manuel; Bossini-Castillo, Lara; Carmona, F David; Di Cara, Alessandro; Wojcik, Jérôme; Voloshynovskiy, Sviatoslav; Martín, Javier; Alarcón-Riquelme, Marta E

    2016-01-01

    Systemic Autoimmune Diseases, a group of chronic inflammatory conditions, have variable symptoms and difficult diagnosis. In order to reclassify them based on genetic markers rather than clinical criteria, we performed clustering of Single Nucleotide Polymorphisms. However naive approaches tend to group patients primarily by their geographic origin. To reduce this "ancestry signal", we developed SNPClust, a method to select large sources of ancestry-independent genetic variations from all variations detected by Principal Component Analysis. Applied to a Systemic Lupus Erythematosus case control dataset, SNPClust successfully reduced the ancestry signal. Results were compared with association studies between the cases and controls without or with reference population stratification correction methods. SNPClust amplified the disease discriminating signal and the ratio of significant associations outside the HLA locus was greater compared to population stratification correction methods. SNPClust will enable the use of ancestry-independent genetic information in the reclassification of Systemic Autoimmune Diseases. SNPClust is available as an R package and demonstrated on the public Human Genome Diversity Project dataset at https://github.com/ThomasChln/snpclust. PMID:27490238

  2. Simplified computer programs for search of homology within nucleotide sequences.

    PubMed Central

    Kröger, M; Kröger-Block, A

    1984-01-01

    Four new computer programs for search of homology within nucleotide sequences are presented. The main scope of the program design is flexibility, independence of sequence length and the capability to be used by any molecular biologist without any prior computer experience. The programs offer a linear search, a search for maximal identity, an alignment along a given sequence and a search based on homology within the amino acid coding capacity of nucleotide sequences. The language is Fortran V. Copies are available on request. PMID:6546417

  3. Single nucleotide polymorphisms and haplotype diversity in rice sucrose synthase 3.

    PubMed

    Lestari, Puji; Lee, Gian; Ham, Tae-Ho; Reflinur; Woo, Mi-Ok; Piao, Rihua; Jiang, Wenzhu; Chu, Sang Ho; Lee, Joohyun; Koh, Hee-Jong

    2011-01-01

    Rice sucrose synthase 3 (RSUS3) is expressed predominantly in rice seed endosperm and is thought to play an important role in starch filling during the milky stage of rice seed ripening. Because the genetic diversity of this locus is not known yet, the full sequence of RSUS3 from 43 rice varieties was amplified to examine the distribution of DNA polymorphisms. A total of 254 sequence variants, including SNPs and insertion/deletions, were successfully identified in the 7733 bp sequence that comprises the promoter, exons and introns, and 3' downstream nontranscribed region (NTR). Eleven haplotypes were distinguished among the 43 rice varieties based on nucleotide variation in the 3 defined regions (5' NTR, transcript, and 3' NTR). The promoter region showed evidence of a base change on a cis-element that might influence the functional role of the motif in seed-specific expression. The genetic diversity of the RSUS3 gene sequences in the rice germplasm used in this study appears to be the result of nonrandom processes. Analysis of polymorphism sites indicated that at least 11 recombinations have occurred, primarily in the transcribed region. This finding provides insight into the development of a cladistic approach for establishing future genetic association studies of the RSUS3 locus. PMID:21914668

  4. Novel single nucleotide polymorphism of UGT1A9 gene in Japanese.

    PubMed

    Fujita, Ken-ichi; Ando, Yuichi; Nagashima, Fumio; Yamamoto, Wataru; Endo, Hisashi; Kodama, Keiji; Araki, Kazuhiro; Miya, Toshimichi; Narabayashi, Masaru; Sasaki, Yasutsuna

    2006-02-01

    We sequenced from 5'-franking region to intron 1 (to 337 bp downstream from exon 1) of the UDP-glucuronosyltransferase (UGT) 1A9 gene prepared from 55 Japanese cancer patients. Seven single nucleotide polymorphisms (SNPs) were found. Two of them were UGT1A9 -118(T)n (n=10) and UGT1A9*5, and four were reported SNPs in intron 1 of UGT1A9 gene (89540C>T, 89549G>A, 89616T>A and 89710A>C). A novel SNP (89587T>C) was found. The sequence is as follows: SNP, 050824FujitaK001; Gene Name, UGT1A9; Accession Number, AF297093; Length, 25 bases; 5'-CCTTCTTGAAGAT/CATGTATTTATAA-3'. Two patients were heterozygous for the mutant allele, resulting in the allele frequency of 1.82%. PMID:16547398

  5. Gallium plasmonic nanoparticles for label-free DNA and single nucleotide polymorphism sensing

    NASA Astrophysics Data System (ADS)

    Marín, Antonio García; García-Mendiola, Tania; Bernabeu, Cristina Navio; Hernández, María Jesús; Piqueras, Juan; Pau, Jose Luis; Pariente, Félix; Lorenzo, Encarnación

    2016-05-01

    A label-free DNA and single nucleotide polymorphism (SNP) sensing method is described. It is based on the use of the pseudodielectric function of gallium plasmonic nanoparticles (GaNPs) deposited on Si (100) substrates under reversal of the polarization handedness condition. Under this condition, the pseudodielectric function is extremely sensitive to changes in the surrounding medium of the nanoparticle surface providing an excellent sensing platform competitive to conventional surface plasmon resonance. DNA sensing has been carried out by immobilizing a thiolated capture probe sequence from Helicobacter pylori onto GaNP/Si substrates; complementary target sequences of Helicobacter pylori can be quantified over the range of 10 pM to 3.0 nM with a detection limit of 6.0 pM and a linear correlation coefficient of R2 = 0.990. The selectivity of the device allows the detection of a single nucleotide polymorphism (SNP) in a specific sequence of Helicobacter pylori, without the need for a hybridization suppressor in solution such as formamide. Furthermore, it also allows the detection of this sequence in the presence of other pathogens, such as Escherichia coli in the sample. The broad applicability of the system was demonstrated by the detection of a specific gene mutation directly associated with cystic fibrosis in large genomic DNA isolated from blood cells.A label-free DNA and single nucleotide polymorphism (SNP) sensing method is described. It is based on the use of the pseudodielectric function of gallium plasmonic nanoparticles (GaNPs) deposited on Si (100) substrates under reversal of the polarization handedness condition. Under this condition, the pseudodielectric function is extremely sensitive to changes in the surrounding medium of the nanoparticle surface providing an excellent sensing platform competitive to conventional surface plasmon resonance. DNA sensing has been carried out by immobilizing a thiolated capture probe sequence from Helicobacter pylori

  6. Nucleotide sequence of SHV-2 beta-lactamase gene

    SciTech Connect

    Garbarg-Chenon, A.; Godard, V.; Labia, R.; Nicolas, J.C. )

    1990-07-01

    The nucleotide sequence of plasmid-mediated beta-lactamase SHV-2 from Salmonella typhimurium (SHV-2pHT1) was determined. The gene was very similar to chromosomally encoded beta-lactamase LEN-1 of Klebsiella pneumoniae. Compared with the sequence of the Escherichia coli SHV-2 enzyme (SHV-2E.coli) obtained by protein sequencing, the deduced amino acid sequence of SHV-2pHT1 differed by three amino acid substitutions.

  7. Two bi-allelic single nucleotide polymorphisms within the promoter region of the horse tumour necrosis factor alpha gene.

    PubMed

    Matiasovic, J; Lukeszová, L; Horín, P

    2002-08-01

    Primers based on GenBank sequences within the 5' untranslated region (UTR) of the human and horse tumour necrosis factor alpha (TNF-alpha) genes were designed and used to amplify a 522-bp product. Sequencing of five clones derived from five independent PCRs obtained from three different animals of three different breeds (Old Kladruber, Akhal-Teke and Shetland Pony) revealed a high level of sequence identity to the TNF-alpha promoter regions of other species. The existing GenBank horse sequences were confirmed and extended upstream by 230 nucleotides. Based on the sequence obtained, a new horse-specific forward primer was designed to amplify a 213-bp PCR product, which was screened for polymorphism using single-strand conformation polymorphism (SSCP). Three allelic variants of the horse TNF-alpha gene were identified and sequenced (GenBank accession numbers ADF 349558-60). Two single nucleotide polymorphisms explained the existence of the three SSCP alleles detected: C/T and T/C single base pair substitutions at positions 137 and 147, respectively. Differences in allelic frequencies between Old Kladruber and Akhal-Teke breeds were observed. PMID:12121271

  8. Morpholino-functionalized nanochannel array for label-free single nucleotide polymorphisms detection.

    PubMed

    Gao, Hong-Li; Wang, Min; Wu, Zeng-Qiang; Wang, Chen; Wang, Kang; Xia, Xing-Hua

    2015-04-01

    The sensitive identification of single nucleotide polymorphisms becomes increasingly important for disease diagnosis, prevention, and practical applicability of pharmacogenomics. Herein, we propose a simple, highly selective, label-free single nucleotide polymorphisms (SNPs) sensing device by electrochemically monitoring the diffusion flux of ferricyanide probe across probe DNA/morpholino duplex functionalized nanochannels of porous anodic alumina. When perfectly matched or mismatched target DNA flows through the nanochannels modified with probe DNA/morpholino duplex, it competes for the probe DNA from morpholino, resulting in a change of the surface charges. Thus, the diffusion flux of negatively charged electroactive probe ferricyanide is modulated since it is sensitive to the surface charge due to the electrostatic interactions in electric double layer-merged nanochannels. Monitoring of the change in diffusion flux of probe enables us to detect not only a single base or two base mismatched sequence but also the specific location of the mismatched base. As is demonstrated, SNPs in the PML/RARα fusion gene, known as a biomarker of acute promyelocytic leukemia (APL), have been successfully detected. PMID:25734499

  9. Reading biological processes from nucleotide sequences

    NASA Astrophysics Data System (ADS)

    Murugan, Anand

    Cellular processes have traditionally been investigated by techniques of imaging and biochemical analysis of the molecules involved. The recent rapid progress in our ability to manipulate and read nucleic acid sequences gives us direct access to the genetic information that directs and constrains biological processes. While sequence data is being used widely to investigate genotype-phenotype relationships and population structure, here we use sequencing to understand biophysical mechanisms. We present work on two different systems. First, in chapter 2, we characterize the stochastic genetic editing mechanism that produces diverse T-cell receptors in the human immune system. We do this by inferring statistical distributions of the underlying biochemical events that generate T-cell receptor coding sequences from the statistics of the observed sequences. This inferred model quantitatively describes the potential repertoire of T-cell receptors that can be produced by an individual, providing insight into its potential diversity and the probability of generation of any specific T-cell receptor. Then in chapter 3, we present work on understanding the functioning of regulatory DNA sequences in both prokaryotes and eukaryotes. Here we use experiments that measure the transcriptional activity of large libraries of mutagenized promoters and enhancers and infer models of the sequence-function relationship from this data. For the bacterial promoter, we infer a physically motivated 'thermodynamic' model of the interaction of DNA-binding proteins and RNA polymerase determining the transcription rate of the downstream gene. For the eukaryotic enhancers, we infer heuristic models of the sequence-function relationship and use these models to find synthetic enhancer sequences that optimize inducibility of expression. Both projects demonstrate the utility of sequence information in conjunction with sophisticated statistical inference techniques for dissecting underlying biophysical

  10. Single nucleotide polymorphism discovery in rainbow trout using reduced representation libraries

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single Nucleotide Polymorphisms (SNPs) are highly abundant, widespread and evenly distributed markers, which can be easily genotyped using high-throughput assays. These characteristics explain their increasing popularity in genome analyses such as quantitative trait loci mapping, linkage disequilibr...

  11. A high-density single nucleotide polymorphism map for Neurospora crassa.

    PubMed

    Lambreghts, Randy; Shi, Mi; Belden, William J; Decaprio, David; Park, Danny; Henn, Matthew R; Galagan, James E; Bastürkmen, Meray; Birren, Bruce W; Sachs, Matthew S; Dunlap, Jay C; Loros, Jennifer J

    2009-02-01

    We report the discovery and validation of a set of single nucleotide polymorphisms (SNPs) between the reference Neurospora crassa strain Oak Ridge and the Mauriceville strain (FGSC 2555), of sufficient density to allow fine mapping of most loci. Sequencing of Mauriceville cDNAs and alignment to the completed genomic sequence of the Oak Ridge strain identified 19,087 putative SNPs. Of these, a subset was validated by cleaved amplified polymorphic sequence (CAPS), a simple and robust PCR-based assay that reliably distinguishes between SNP alleles. Experimental confirmation resulted in the development of 250 CAPS markers distributed evenly over the genome. To demonstrate the applicability of this map, we used bulked segregant analysis followed by interval mapping to locate the csp-1 mutation to a narrow region on LGI. Subsequently, we refined mapping resolution to 74 kbp by developing additional markers, resequenced the candidate gene, NCU02713.3, in the mutant background, and phenocopied the mutation by gene replacement in the WT strain. Together, these techniques demonstrate a generally applicable and straightforward approach for the isolation of novel genes from existing mutants. Data on both putative and validated SNPs are deposited in a customized public database at the Broad Institute, which encourages augmentation by community users. PMID:19015548

  12. Identification, validation and survey of a single nucleotide polymorphism (SNP) associated with pungency in Capsicum spp.

    PubMed

    Garcés-Claver, Ana; Fellman, Shanna Moore; Gil-Ortega, Ramiro; Jahn, Molly; Arnedo-Andrés, María S

    2007-11-01

    A single nucleotide polymorphism (SNP) associated with pungency was detected within an expressed sequence tag (EST) of 307 bp. This fragment was identified after expression analysis of the EST clone SB2-66 in placenta tissue of Capsicum fruits. Sequence alignments corresponding to this new fragment allowed us to identify an SNP between pungent and non-pungent accessions. Two methods were chosen for the development of the SNP marker linked to pungency: tetra-primer amplification refractory mutation system-PCR (tetra-primer ARMS-PCR) and cleaved amplified polymorphic sequence. Results showed that both methods were successful in distinguishing genotypes. Nevertheless, tetra-primer ARMS-PCR was chosen for SNP genotyping because it was more rapid, reliable and less cost-effective. The utility of this SNP marker for pungency was demonstrated by the ability to distinguish between 29 pungent and non-pungent cultivars of Capsicum annuum. In addition, the SNP was also associated with phenotypic pungent character in the tested genotypes of C. chinense, C. baccatum, C. frutescens, C. galapagoense, C. eximium, C. tovarii and C. cardenasi. This SNP marker is a faster, cheaper and more reproducible method for identifying pungent peppers than other techniques such as panel tasting, and allows rapid screening of the trait in early growth stages. PMID:17882396

  13. Are Immune Modulating Single Nucleotide Polymorphisms Associated with Necrotizing Enterocolitis?

    PubMed Central

    Franklin, Ashanti L.; Said, Mariam; Cappiello, Clint D.; Gordish-Dressman, Heather; Tatari-Calderone, Zohreh; Vukmanovic, Stanislav; Rais-Bahrami, Khodayar; Luban, Naomi L. C.; Devaney, Joseph M.; Sandler, Anthony D.

    2015-01-01

    Necrotizing enterocolitis (NEC) is a devastating gastrointestinal emergency. The purpose of this study is to determine if functional single nucleotide polymorphisms (SNPs) in immune-modulating genes pre-dispose infants to NEC. After Institutional Review Board approval and parental consent, buccal swabs were collected for DNA extraction. TaqMan allelic discrimination assays and BglII endonuclease digestion were used to genotype specific inflammatory cytokines and TRIM21. Statistical analysis was completed using logistic regression. 184 neonates were analyzed in the study. Caucasian neonates with IL-6 (rs1800795) were over 6 times more likely to have NEC (p = 0.013; OR = 6.61, 95% CI 1.48–29.39), and over 7 times more likely to have Stage III disease (p = 0.011; OR = 7.13, (95% CI 1.56–32.52). Neonates with TGFβ-1 (rs2241712) had a decreased incidence of NEC-related perforation (p = 0.044; OR = 0.28, 95% CI: 0.08–0.97) and an increased incidence of mortality (p = 0.049; OR = 2.99, 95% CI: 1.01 – 8.86). TRIM21 (rs660) was associated with NEC-related intestinal perforation (p = 0.038; OR = 4.65, 95% CI 1.09–19.78). In premature Caucasian neonates, the functional SNP IL-6 (rs1800795) is associated with both the development and increased severity of NEC. TRIM21 (rs660) and TGFβ-1 (rs2241712) were associated with NEC- related perforation in all neonates in the cohort. These findings suggest a possible genetic role in the development of NEC. PMID:26670709

  14. Discovery and validation of genic single nucleotide polymorphisms in the Pacific oyster Crassostrea gigas.

    PubMed

    Wang, Jiafeng; Qi, Haigang; Li, Li; Que, Huayong; Wang, Di; Zhang, Guofan

    2015-01-01

    The economic and ecological importance of the oyster necessitates further research on the molecular mechanisms, which both regulate the commercially important traits of the oyster and help it to survive in the variable marine environment. Single nucleotide polymorphisms (SNPs) have been widely used to assess genetic variation and identify genes underlying target traits. In addition, high-resolution melting (HRM) analysis is a potentially powerful method for validating candidate SNPs. In this study, we adopted a rapid and efficient pipeline for the screening and validation of SNPs in the genic region of Crassostrea gigas based on transcriptome sequencing and HRM analysis. Transcriptomes of three wild oyster populations were sequenced using Illumina sequencing technology. In total, 50-60 million short reads, corresponding to 4.5-5.4 Gbp, from each population were aligned to the oyster genome, and 5.8 × 10(5) SNPs were putatively identified, resulting in a predicted SNP every 47 nucleotides on average. The putative SNPs were unevenly distributed in the genome and high-density (≥2%), nonsynonymous coding SNPs were enriched in genes related to apoptosis and responses to biotic stimuli. Subsequently, 1,671 loci were detected by HRM analysis, accounting for 64.7% of the total selected candidate primers, and finally, 1,301 polymorphic SNP markers were developed based on HRM analysis. All of the validated SNPs were distributed into 897 genes and located in 672 scaffolds, and 275 of these genes were stress inducible under unfavourable salinity, temperature, and exposure to air and heavy metals. The validated SNPs in this study provide valuable molecular markers for genetic mapping and characterization of important traits in oysters. PMID:24823694

  15. Single nucleotide polymorphisms in DKK3 gene are associated with prostate cancer risk and progression

    PubMed Central

    Kim, Min Su; Lee, Ha Na; Kim, Hae Jong; Myung, Soon Chul

    2015-01-01

    ABSTRACT We had investigated whether sequence variants within DKK3 gene are associated with the development of prostate cancer in a Korean study cohort. We evaluated the association between 53 single nucleotide polymorphisms (SNPs) in the DKK3 gene and prostate cancer risk as well as clinical characteristics (PSA, clinical stage, pathological stage and Gleason score) in Korean men (272 prostate cancer subjects and 173 benign prostate hyperplasia subjects) using unconditional logistic regression analysis. Of the 53 SNPs and 25 common haplotypes, 5 SNPs and 4 haplotypes were associated with prostate cancer risk (P=0.02–0.04); 3 SNPs and 2 haplotypes were significantly associated with susceptibility to prostate cancer, however 2 SNPs and 2 haplotypes exhibited a significant protective effect on prostate cancer. Logistic analyses of the DKK3 gene polymorphisms with several prostate cancer related factors showed that several SNPs were significant; three SNPs and two haplotypes to PSA level, three SNPs and two haplotypes to clinical stage, nine SNPs and two haplotype to pathological stage, one SNP and one haplotypes to Gleason score. To the author's knowledge, this is the first report documenting that DKK3 polymorphisms are not only associated with prostate cancer but also related to prostate cancer-related factors. PMID:26689513

  16. The application and performance of single nucleotide polymorphism markers for population genetic analyses of Lepidoptera

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single nucleotide polymorphisms (SNPs) are nucleotide substitution mutations that tend to be at high densities within eukaryotic genomes. The development of assays that detect allelic variation at SNP loci is attractive for genome mapping, population genetics, and phylogeographic applications. A p...

  17. The nucleotide sequence of cowpea mosaic virus B RNA

    PubMed Central

    Lomonossoff, G.P.; Shanks, M.

    1983-01-01

    The complete sequence of the bottom component RNA (B RNA) of cowpea mosaic virus (CPMV) has been determined. Restriction enzyme fragments of double-stranded cDNA were cloned in M13 and the sequence of the inserts was determined by a combination of enzymatic and chemical sequencing techniques. Additional sequence information was obtained by primed synthesis on first strand cDNA. The complete sequence deduced is 5889 nucleotides long excluding the 3' poly(A), and contains an open reading frame sufficient to code for a polypeptide of mol. wt. 207 760. The coding region is flanked by a 5' leader sequence of 206 nucleotides and a 3' non-coding region of 82 residues which does not contain a polyadenylation signal. PMID:16453487

  18. Identification of a Novel Single Nucleotide Polymorphism in Porcine Beta-Defensin-1 Gene.

    PubMed

    Pruthviraj, D R; Usha, A P; Venkatachalapathy, R T

    2016-03-01

    Porcine beta-defensin-1 (PBD-1) gene plays an important role in the innate immunity of pigs. The peptide encoded by this gene is an antimicrobial peptide that has direct activity against a wide range of microbes. This peptide is involved in the co-creation of an antimicrobial barrier in the oral cavity of pigs. The objective of the present study was to detect polymorphisms, if any, in exon-1 and exon-2 regions of PBD-1 gene in Large White Yorkshire (LWY) and native Ankamali pigs of Kerala, India. Blood samples were collected from 100 pigs and genomic DNA was isolated using phenol chloroform method. The quantity of DNA was assessed in a spectrophotometer and quality by gel electrophoresis. Exon-1 and exon-2 regions of PBD-1 gene were amplified by polymerase chain reaction (PCR) and the products were subjected to single strand conformation polymorphism (SSCP) analysis. Subsequent silver staining of the polyacrylamide gels revealed three unique SSCP banding patterns in each of the two exons. The presence of single nucleotide polymorphisms (SNPs) was confirmed by nucleotide sequencing of the PCR products. A novel SNP was found in the 5'-UTR region of exon-1 and a SNP was detected in the mature peptide coding region of exon-2. In exon-1, the pooled population frequencies of GG, GT, and TT genotypes were 0.67, 0.30, and 0.03, respectively. GG genotype was predominant in both the breeds whereas TT genotype was not detected in LWY breed. Similarly, in exon-2, the pooled population frequencies of AA, AG, and GG genotypes were 0.50, 0.27, and 0.23, respectively. AA genotype was predominant in LWY pigs whereas GG genotype was predominant in native pigs. These results suggest that there exists a considerable genetic variation at PBD-1 locus and further association studies may help in development of a PCR based genotyping test to select pigs with better immunity. PMID:26950860

  19. Identification of a Novel Single Nucleotide Polymorphism in Porcine Beta-Defensin-1 Gene

    PubMed Central

    Pruthviraj, D. R.; Usha, A. P.; Venkatachalapathy, R. T.

    2016-01-01

    Porcine beta-defensin-1 (PBD-1) gene plays an important role in the innate immunity of pigs. The peptide encoded by this gene is an antimicrobial peptide that has direct activity against a wide range of microbes. This peptide is involved in the co-creation of an antimicrobial barrier in the oral cavity of pigs. The objective of the present study was to detect polymorphisms, if any, in exon-1 and exon-2 regions of PBD-1 gene in Large White Yorkshire (LWY) and native Ankamali pigs of Kerala, India. Blood samples were collected from 100 pigs and genomic DNA was isolated using phenol chloroform method. The quantity of DNA was assessed in a spectrophotometer and quality by gel electrophoresis. Exon-1 and exon-2 regions of PBD-1 gene were amplified by polymerase chain reaction (PCR) and the products were subjected to single strand conformation polymorphism (SSCP) analysis. Subsequent silver staining of the polyacrylamide gels revealed three unique SSCP banding patterns in each of the two exons. The presence of single nucleotide polymorphisms (SNPs) was confirmed by nucleotide sequencing of the PCR products. A novel SNP was found in the 5′-UTR region of exon-1 and a SNP was detected in the mature peptide coding region of exon-2. In exon-1, the pooled population frequencies of GG, GT, and TT genotypes were 0.67, 0.30, and 0.03, respectively. GG genotype was predominant in both the breeds whereas TT genotype was not detected in LWY breed. Similarly, in exon-2, the pooled population frequencies of AA, AG, and GG genotypes were 0.50, 0.27, and 0.23, respectively. AA genotype was predominant in LWY pigs whereas GG genotype was predominant in native pigs. These results suggest that there exists a considerable genetic variation at PBD-1 locus and further association studies may help in development of a PCR based genotyping test to select pigs with better immunity. PMID:26950860

  20. Single nucleotide polymorphism isolated from a novel EST dataset in garden asparagus (Asparagus officinalis L.).

    PubMed

    Mercati, Francesco; Riccardi, Paolo; Leebens-Mack, Jim; Abenavoli, Maria Rosa; Falavigna, Agostino; Sunseri, Francesco

    2013-04-01

    Single nucleotide polymorphisms (SNPs) and simple sequence repeats (SSR) are abundant and evenly distributed co-dominant molecular markers in plant genomes. SSRs are valuable for marker assisted breeding and positional cloning of genes associated traits of interest. Although several high throughput platforms have been developed to identify SNP and SSR markers for analysis of segregant plant populations, breeding in garden asparagus (Asparagus officinalis L.) has been limited by a low content of such markers. In this study massively parallel GS-FLX pyro-sequencing technology (454 Life Sciences) has been used to sequence and compare transcriptome from two genotypes: a rust tolerant male (1770) and a susceptible female (G190). A total of 122,963 and 99,368 sequence reads, with an average length of 245.7bp, have been recovered from accessions 1770 and 190 respectively. A computational pipeline has been used to predict and visually inspect putative SNPs and SSR sequences. Analysis of Gene Ontology (GO) slim annotation assignments for all assembled uniscripts indicated that the 24,403 assemblies represent genes from a broad array of functions. Further, over 1800 putative SNPs and 1000 SSRs were detected. One hundred forty-four SNPs together with 60 selected SSRs were validated and used to develop a preliminary genetic map by using a large BC(1) population, derived from 1770 and G190. The abundance of SNPs and SSRs provides a foundation for the development of saturated genetic maps and their utilization in assisted asparagus breeding programs. PMID:23415335

  1. A Locked Nucleic Acid Probe Based on Selective Salt-Induced Effect Detects Single Nucleotide Polymorphisms

    PubMed Central

    Zhang, Jing; Wu, Huizhe; Chen, Qiuchen; Zhao, Pengfei; Zhao, Haishan; Yao, Weifan; Wei, Minjie

    2015-01-01

    Detection of single based genetic mutation by using oligonucleotide probes is one of the common methods of detecting single nucleotide polymorphisms at known loci. In this paper, we demonstrated a hybridization system which included a buffer solution that produced selective salt-induced effect and a locked nucleic acid modified 12 nt oligonucleotide probe. The hybridization system is suitable for hybridization under room temperature. By using magnetic nanoparticles as carriers for PCR products, the SNPs (MDR1 C3435T/A) from 45 volunteers were analyzed, and the results were consistent with the results from pyrophosphoric acid sequencing. The method presented in this paper differs from the traditional method of using molecular beacons to detect SNPs in that it is suitable for research institutions lacking real-time quantitative PCR detecting systems, to detect PCR products at room temperature. PMID:26347880

  2. Epidemic population structure of extraintestinal pathogenic Escherichia coli determined by single nucleotide polymorphism pyrosequencing.

    PubMed

    Fernández-Romero, Natalia; Romero-Gómez, María Pilar; Gómez-Gil, María Rosa; Mingorance, Jesús

    2011-10-01

    We have developed an MLST-based scheme for typing Escherichia coli isolates using pyrosequencing of single nucleotide polymorphic positions (SNP). The SNP sequences are converted into allelic patterns and analyzed using the same approach used for MLST analyses. We have tested the method in two unselected collections of clinical isolates of E. coli obtained from blood and urine cultures. The two collections had a similar structure, 25% of the profiles (representing 68% of the isolates) were common to both, and 62% of the profiles (nearly 20% of the isolates) were unique. The four major profiles accounted for 44% of the isolates, and among these the most frequent one was related to the pandemic ST131 clone. The method is easy to implement and might be useful for typing large microbial collections. PMID:21723423

  3. Naked-eye fingerprinting of single nucleotide polymorphisms on psoriasis patients.

    PubMed

    Valentini, Paola; Marsella, Alessandra; Tarantino, Paolo; Mauro, Salvatore; Baglietto, Silvia; Congedo, Maurizio; Paolo Pompa, Pier

    2016-06-01

    We report a low-cost test, based on gold nanoparticles, for the colorimetric (naked-eye) fingerprinting of a panel of single nucleotide polymorphisms (SNPs), relevant for the personalized therapy of psoriasis. Such pharmacogenomic tests are not routinely performed on psoriasis patients, due to the high cost of standard technologies. We demonstrated high sensitivity and specificity of our colorimetric test by validating it on a cohort of 30 patients, through a double-blind comparison with two state-of-the-art instrumental techniques, namely reverse dot blotting and sequencing, finding 100% agreement. This test offers high parallelization capabilities and can be easily generalized to other SNPs of clinical relevance, finding broad utility in diagnostics and pharmacogenomics. PMID:27174795

  4. Pyrosequencing with di-base addition for single nucleotide polymorphism genotyping.

    PubMed

    Pu, Dan; Mao, Chengguang; Cui, Lunbiao; Shi, Zhiyang; Xiao, Pengfeng

    2016-05-01

    We develop color code-based pyrosequencing with di-base addition for analysis of single nucleotide polymorphisms (SNPs). When a di-base is added into the polymerization, one or several two-color code(s) containing the type and the number of incorporated nucleotides will be produced. The code information obtained in a single run is useful to genotype SNPs as each allelic variant will give a specific pattern compared to the two other variants. Special care has to be taken while designing the di-base dispensation order. Here, we present a detailed protocol for establishing sequence-specific di-base addition to avoid nonsynchronous extension at the SNP sites. By using this technology, as few as 50 copies of DNA templates were accurately sequenced. Higher signals were produced and thus a relatively lower sample amount was required. Furthermore, the read length of per flow was increased, making simultaneous identification of multiple SNPs in a single sequencing run possible. Validation of the method was performed by using templates with two SNPs covering 37 bp and with three SNPs covering 58 bp as well as 82 bp. These SNPs were successfully genotyped by using only a sequencing primer in a single PCR/sequencing run. Our results demonstrated that this technology could be potentially developed into a powerful methodology to accurately determine SNPs so as to diagnose clinical settings. Graphical Abstract Conventional pyrosequencing adds one base (A, G, C, or T) at a time to determine the SNP site (left). Pyrosequencing with di-base addition adds di-base AG, AC, AT, CT, GC or GT at a time to determine the SNP site (right). Higher signals at SNP site will be produced due to the addition of di-bases. PMID:26935928

  5. Single Nucleotide Polymorphisms in Pediatric Idiopathic Nephrotic Syndrome

    PubMed Central

    Suvanto, Maija; Jahnukainen, Timo; Kestilä, Marjo; Jalanko, Hannu

    2016-01-01

    Polymorphic variants in several molecules involved in the glomerular function and drug metabolism have been implicated in the pathophysiology of pediatric idiopathic nephrotic syndrome (INS), but the results remain inconsistent. We analyzed the association of eleven allelic variants in eight genes (angiopoietin-like 4 (ANGPTL4), glypican 5 (GPC5), interleukin-13 (IL-13), macrophage migration inhibitory factor (MIF), neural nitric oxide synthetase (nNOS), multidrug resistance-1 (MDR1), glucocorticoid-induced transcript-1 (GLCCI1), and nuclear receptor subfamily-3 (NR3C1)) in 100 INS patients followed up till adulthood. We genotyped variants using PCR and direct sequencing and evaluated estimated haplotypes of MDR1 variants. The analysis revealed few differences in SNP genotype frequencies between patients and controls, or in clinical parameters among the patients. Genotype distribution of MDR1 SNPs rs1236, rs2677, and rs3435 showed significant (p < 0.05) association with different medication regimes (glucocorticoids only versus glucocorticoids plus additional immunosuppressives). Some marginal association was detected between ANGPTL4, GPC5, GLCCI1, and NR3C1 variants and different medication regimes, number of relapses, and age of onset. Conclusion. While MDR1 variant genotype distribution associated with different medication regimes, the other analyzed gene variants showed only little or marginal clinical relevance in INS. PMID:27247801

  6. Developing single nucleotide polymorphism markers for the identification of pineapple (Ananas comosus) germplasm

    PubMed Central

    Zhou, Lin; Matsumoto, Tracie; Tan, Hua-Wei; Meinhardt, Lyndel W; Mischke, Sue; Wang, Boyi; Zhang, Dapeng

    2015-01-01

    Pineapple (Ananas comosus [L.] Merr.) is the third most important tropical fruit in the world after banana and mango. As a crop with vegetative propagation, genetic redundancy is a major challenge for efficient genebank management and in breeding. Using expressed sequence tag and nucleotide sequences from public databases, we developed 213 single nucleotide polymorphism (SNP) markers and validated 96 SNPs by genotyping the United States Department of Agriculture - Agricultural Research Service pineapple germplasm collection, maintained in Hilo, Hawaii. The validation resulted in designation of a set of 57 polymorphic SNP markers that revealed a high rate of duplicates in this pineapple collection. Twenty-four groups of duplicates were detected, encompassing 130 of the total 170 A cosmos accessions. The results show that somatic mutation has been the main source of intra-cultivar variations in pineapple. Multivariate clustering and a model-based population stratification suggest that the modern pineapple cultivars are comprised of progenies that are derived from different wild Ananas botanical varieties. Parentage analysis further revealed that both A. comosus var. bracteatus and A. comosus var. ananassoides are likely progenitors of pineapple cultivars. However, the traditional classification of cultivated pineapple into horticultural groups (e.g. ‘Cayenne’, ‘Spanish’, ‘Queen’) was not well supported by the present study. These SNP markers provide robust and universally comparable DNA fingerprints; thus, they can serve as an efficient genotyping tool to assist pineapple germplasm management, propagation of planting material, and pineapple cultivar protection. The high rate of genetic redundancy detected in this pineapple collection suggests the potential impact of applying this technology on other clonally propagated perennial crops. PMID:26640697

  7. Developing single nucleotide polymorphism markers for the identification of pineapple (Ananas comosus) germplasm.

    PubMed

    Zhou, Lin; Matsumoto, Tracie; Tan, Hua-Wei; Meinhardt, Lyndel W; Mischke, Sue; Wang, Boyi; Zhang, Dapeng

    2015-01-01

    Pineapple (Ananas comosus [L.] Merr.) is the third most important tropical fruit in the world after banana and mango. As a crop with vegetative propagation, genetic redundancy is a major challenge for efficient genebank management and in breeding. Using expressed sequence tag and nucleotide sequences from public databases, we developed 213 single nucleotide polymorphism (SNP) markers and validated 96 SNPs by genotyping the United States Department of Agriculture - Agricultural Research Service pineapple germplasm collection, maintained in Hilo, Hawaii. The validation resulted in designation of a set of 57 polymorphic SNP markers that revealed a high rate of duplicates in this pineapple collection. Twenty-four groups of duplicates were detected, encompassing 130 of the total 170 A cosmos accessions. The results show that somatic mutation has been the main source of intra-cultivar variations in pineapple. Multivariate clustering and a model-based population stratification suggest that the modern pineapple cultivars are comprised of progenies that are derived from different wild Ananas botanical varieties. Parentage analysis further revealed that both A. comosus var. bracteatus and A. comosus var. ananassoides are likely progenitors of pineapple cultivars. However, the traditional classification of cultivated pineapple into horticultural groups (e.g. 'Cayenne', 'Spanish', 'Queen') was not well supported by the present study. These SNP markers provide robust and universally comparable DNA fingerprints; thus, they can serve as an efficient genotyping tool to assist pineapple germplasm management, propagation of planting material, and pineapple cultivar protection. The high rate of genetic redundancy detected in this pineapple collection suggests the potential impact of applying this technology on other clonally propagated perennial crops. PMID:26640697

  8. Estimating a nucleotide substitution rate for maize from polymorphism at a major domestication locus.

    PubMed

    Clark, Richard M; Tavaré, Simon; Doebley, John

    2005-11-01

    To estimate a rate for single nucleotide substitutions for maize (Zea mays ssp. mays), we have taken advantage of data from genetic and archaeological studies of the domestication of maize from its wild ancestor, teosinte (Z. mays ssp. parviglumis). Genetic studies have shown that the teosinte branched1 (tb1) gene was a major target of human selection during maize domestication, and sequence diversity in the intergenic region 5' to the tb1-coding sequence is extraordinarily low. We show that polymorphism in this region is consistent with new mutation following fixation for a small number of tb1 haplotypes during domestication. Archeological studies suggest that maize was domesticated approximately 6,250-10,000 years ago and subsequently the size of the maize population is thought to have expanded rapidly. Using the observed number of mutations within the region of selection at tb1, the approximate age of maize domestication, and approximations for the maize genealogy, we have derived estimates for the nucleotide substitution rate for the tb1 intergenic region. Using two approaches, one of which is a coalescent approach, we obtain rate estimates of approximately 2.9 x 10(-8) and 3.3 x 10(-8) substitutions per site per year. We also show that the pattern of polymorphism in the tb1 intergenic region appears to have been strongly affected by the mutagenic effect of DNA methylation. Excluding target sites of symmetric DNA methylation (CG and CNG sites) from analysis, the mutation rate estimates are reduced by approximately 50%-60%, while the rates for CG and CNG sites are nearly an order of magnitude higher. We use rate estimates from the tb1 region to estimate the timing of expansion of transposable elements in the maize genome and suggest that this expansion occurred primarily within the last million years. PMID:16079248

  9. DNA sequence representation by trianders and determinative degree of nucleotides

    PubMed Central

    Duplij, Diana; Duplij, Steven

    2005-01-01

    A new version of DNA walks, where nucleotides are regarded unequal in their contribution to a walk is introduced, which allows us to study thoroughly the “fine structure” of nucleotide sequences. The approach is based on the assumption that nucleotides have an inner abstract characteristic, the determinative degree, which reflects genetic code phenomenological properties and is adjusted to nucleotides physical properties. We consider each codon position independently, which gives three separate walks characterized by different angles and lengths, and that such an object is called triander which reflects the “strength” of branch. A general method for identifying DNA sequence “by triander” which can be treated as a unique “genogram” (or “gene passport”) is proposed. The two- and three-dimensional trianders are considered. The difference of sequences fine structure in genes and the intergenic space is shown. A clear triplet signal in coding sequences was found which is absent in the intergenic space and is independent from the sequence length. This paper presents the topological classification of trianders which can allow us to provide a detailed working out signatures of functionally different genomic regions. PMID:16052707

  10. Moss Phylogeny Reconstruction Using Nucleotide Pangenome of Complete Mitogenome Sequences.

    PubMed

    Goryunov, D V; Nagaev, B E; Nikolaev, M Yu; Alexeevski, A V; Troitsky, A V

    2015-11-01

    Stability of composition and sequence of genes was shown earlier in 13 mitochondrial genomes of mosses (Rensing, S. A., et al. (2008) Science, 319, 64-69). It is of interest to study the evolution of mitochondrial genomes not only at the gene level, but also on the level of nucleotide sequences. To do this, we have constructed a "nucleotide pangenome" for mitochondrial genomes of 24 moss species. The nucleotide pangenome is a set of aligned nucleotide sequences of orthologous genome fragments covering the totality of all genomes. The nucleotide pangenome was constructed using specially developed new software, NPG-explorer (NPGe). The stable part of the mitochondrial genome (232 stable blocks) is shown to be, on average, 45% of its length. In the joint alignment of stable blocks, 82% of positions are conserved. The phylogenetic tree constructed with the NPGe program is in good correlation with other phylogenetic reconstructions. With the NPGe program, 30 blocks have been identified with repeats no shorter than 50 bp. The maximal length of a block with repeats is 140 bp. Duplications in the mitochondrial genomes of mosses are rare. On average, the genome contains about 500 bp in large duplications. The total length of insertions and deletions was determined in each genome. The losses and gains of DNA regions are rather active in mitochondrial genomes of mosses, and such rearrangements presumably can be used as additional markers in the reconstruction of phylogeny. PMID:26615445

  11. High volume molecular genetic identification of single nucleotide polymorphisms using Genetic Bit Analysis Application to human genetic diagnosis

    SciTech Connect

    Boyce-Jacino, M.T.; Reynolds, J.; Nikiforov, T.

    1994-09-01

    The most common type of genetic disease-associated mutation is the single nucleotide polymorphism (SNP). Because most genetic diseases can be caused by multiple SNPs in the same gene, effective routine diagnosis of complex genetic diseases is dependent on a simple and reliable method of interrogating SNP sites. Molecular Tool`s solid phase assay capable of direct genotyping (single base sequencing) of SNP sites, Genetic Bit Analysis (GBA), involves hybridization-capture of a single-stranded PCR product to a sequence-specific, microtiter plate-bound oligonucleotide primer. The captured PCR product then acts as template for single-base extension of the capture primer across the polymorphic site, enabling direct determination of the base composition of the polymorphism through a simple colormetric assay. Genotyping in a high volume, semi-automated, processing system with a current capacity of 100 SNP interrogations per technician per day enables the screening of candidate mutations rapidly and cost-effectively, critically important to comprehensive genetic diagnosis. Using this gel-free technology, we have developed prototype diagnostic tests for CFTR and ApoE polymorphisms which enable direct sequencing of the polymorphic base at each site of interest. Routine clinical diagnosis of genetically complex diseases such as cystic fibrosis is dependent on this combination of robust biochemistry and simple format. Additionally, the ability to transfer the format and biochemistry to any disease gene of interest enables the broad application of this technology to clinical diagnostics, especially for genetically complex diseases.

  12. The human XPG gene: gene architecture, alternative splicing and single nucleotide polymorphisms

    PubMed Central

    Emmert, Steffen; Schneider, Thomas D.; Khan, Sikandar G.; Kraemer, Kenneth H.

    2001-01-01

    Defects in the XPG DNA repair endonuclease gene can result in the cancer-prone disorders xeroderma pigmentosum (XP) or the XP–Cockayne syndrome complex. While the XPG cDNA sequence was known, determination of the genomic sequence was required to understand its different functions. In cells from normal donors, we found that the genomic sequence of the human XPG gene spans 30 kb, contains 15 exons that range from 61 to 1074 bp and 14 introns that range from 250 to 5763 bp. Analysis of the splice donor and acceptor sites using an information theory-based approach revealed three splice sites with low information content, which are components of the minor (U12) spliceosome. We identified six alternatively spliced XPG mRNA isoforms in cells from normal donors and from XPG patients: partial deletion of exon 8, partial retention of intron 8, two with alternative exons (in introns 1 and 6) and two that retained complete introns (introns 3 and 9). The amount of alternatively spliced XPG mRNA isoforms varied in different tissues. Most alternative splice donor and acceptor sites had a relatively high information content, but one has the U12 spliceosome sequence. A single nucleotide polymorphism has allele frequencies of 0.74 for 3507G and 0.26 for 3507C in 91 donors. The human XPG gene contains multiple splice sites with low information content in association with multiple alternatively spliced isoforms of XPG mRNA. PMID:11266544

  13. Identification of single nucleotide polymorphisms from the transcriptome of an organism with a whole genome duplication

    PubMed Central

    2013-01-01

    Background The common ancestor of salmonid fishes, including rainbow trout (Oncorhynchus mykiss), experienced a whole genome duplication between 20 and 100 million years ago, and many of the duplicated genes have been retained in the trout genome. This retention complicates efforts to detect allelic variation in salmonid fishes. Specifically, single nucleotide polymorphism (SNP) detection is problematic because nucleotide variation can be found between the duplicate copies (paralogs) of a gene as well as between alleles. Results We present a method of differentiating between allelic and paralogous (gene copy) sequence variants, allowing identification of SNPs in organisms with multiple copies of a gene or set of genes. The basic strategy is to: 1) identify windows of unique cDNA sequences with homology to each other, 2) compare these unique cDNAs if they are not shared between individuals (i.e. the cDNA is homozygous in one individual and homozygous for another cDNA in the other individual), and 3) give a “SNP score” value between zero and one to each candidate sequence variant based on six criteria. Using this strategy we were able to detect about seven thousand potential SNPs from the transcriptomes of several clonal lines of rainbow trout. When directly compared to a pre-validated set of SNPs in polyploid wheat, we were also able to estimate the false-positive rate of this strategy as 0 to 28% depending on parameters used. Conclusions This strategy has an advantage over traditional techniques of SNP identification because another dimension of sequencing information is utilized. This method is especially well suited for identifying SNPs in polyploids, both outbred and inbred, but would tend to be conservative for diploid organisms. PMID:24237905

  14. Development of a Single Nucleotide Polymorphism Barcode to Genotype Plasmodium vivax Infections

    PubMed Central

    Baniecki, Mary Lynn; Faust, Aubrey L.; Schaffner, Stephen F.; Park, Daniel J.; Galinsky, Kevin; Daniels, Rachel F.; Hamilton, Elizabeth; Ferreira, Marcelo U.; Karunaweera, Nadira D.; Serre, David; Zimmerman, Peter A.; Sá, Juliana M.; Wellems, Thomas E.; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E.; Volkman, Sarah K.; Wirth, Dyann F.; Sabeti, Pardis C.

    2015-01-01

    Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25–40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890

  15. Development of a single nucleotide polymorphism barcode to genotype Plasmodium vivax infections.

    PubMed

    Baniecki, Mary Lynn; Faust, Aubrey L; Schaffner, Stephen F; Park, Daniel J; Galinsky, Kevin; Daniels, Rachel F; Hamilton, Elizabeth; Ferreira, Marcelo U; Karunaweera, Nadira D; Serre, David; Zimmerman, Peter A; Sá, Juliana M; Wellems, Thomas E; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E; Volkman, Sarah K; Wirth, Dyann F; Sabeti, Pardis C

    2015-03-01

    Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25-40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890

  16. Complete nucleotide sequence of Nootka lupine vein-clearing virus

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The complete genome sequence of Nootka lupine vein-clearing virus (NLVCV) was determined to be 4,172 nucleotides in length containing four open reading frames ORFs with a similar genetic organization and conceptual translations of virus species in the genus Carmovirus, family Tombusviridae. The orde...

  17. Alu-associated enhancement of single nucleotide polymorphisms in the human genome.

    PubMed

    Ng, Siu-Kin; Xue, Hong

    2006-03-01

    Identifying features shaping the architecture of sequence variations is important for understanding genome evolution and mapping disease loci. In this study, high-resolution scanning of Alu-centered alignments of the human genome sequences has revealed a striking elevation of the frequency of single nucleotide polymorphisms (SNP) in the body and tail of Alu sequences compared to flanking regions. This enhancement in SNP density is evident for all twenty-four chromosomes, and in both the Alu-body and Alu-tail, which together may be referred to as the Alu-SNPs. Reduced levels of Alu-SNPs in the sex chromosomes, especially in the non-recombining NRY region of the Y chromosome, are consistent with recombination events playing an important role in the enhancement. The Alu elements are unstable recombination-mutation hotspots in the human genome, and it is suggested that the Alu-SNPs represent a key manifestation of this instability. Variations in Alu-SNPs among the HapMap populations of northern and western European ancestry (CEU), Han Chinese from Beijing (CHB), Japanese from Tokyo (JPT), and Yoruba from Ibadan, Nigeria (YRI) indicate that the Alu-SNPs provide useful sequence markers, in addition to the Alu-insertion polymorphisms themselves, for the delineation of human genome evolution. That Alu-SNP levels are highest in the youngest Alu-Y, intermediate in the Alu-S of intermediate age, and lowest in the oldest Alu-J is consistent with the occurrence of not only genetic drift but also natural selection on the Alu-SNPs. Such evolutionary selection in turn suggests that Alu-SNPs might include potential sites of disease association, and therefore deserve detailed investigation. PMID:16380220

  18. Identification of novel single nucleotide polymorphisms in the DGAT1 gene of buffaloes by PCR-SSCP

    PubMed Central

    Raut, Ashwin A.; Kumar, Anil; Kala, Sheo N.; Chhokar, Vinod; Rana, Neeraj; Beniwal, Vikas; Jaglan, Sundeep; Samuchiwal, Sachin K.; Singh, Jitender K.; Mishra, Anamika

    2012-01-01

    Diacylglycerol O-acyltransferase 1 (DGAT1) is a microsomal enzyme that catalyzes the final step of triglyceride synthesis. The DGAT1 gene is a strong functional candidate for determining milk fat content in cattle. In this work, we used PCR-SSCP (polymerase chain reaction-single-strand conformation polymorphism) and DNA sequencing to examine polymorphism in the region spanning exon 7 to exon 9 of the DGAT1 gene in Murrah and Pandharpuri buffaloes. Three alleles (A, B and C) and four novel single-nucleotide polymorphisms were identified in the buffalo DGAT1 gene. The frequencies of the alleles differed between the two buffalo breeds, with allele C being present in Murrah but not in Pandharpuri buffalo. The allele variation detected in this work may influence DGAT1 expression and function. The results described here could be useful in examining the association between the DGAT1 gene and milk traits in buffalo. PMID:23055800

  19. Association between Single Nucleotide Polymorphisms of the Major Histocompatibility Complex Class II Gene and Newcastle Disease Virus Titre and Body Weight in Leung Hang Khao Chickens

    PubMed Central

    Molee, A.; Kongroi, K.; Kuadsantia, P.; Poompramun, C.; Likitdecharote, B.

    2016-01-01

    The aim of the present study was to investigate the effect of single nucleotide polymorphisms in the major histocompatibility complex (MHC) class II gene on resistance to Newcastle disease virus and body weight of the Thai indigenous chicken, Leung Hang Khao (Gallus gallus domesticus). Blood samples were collected for single nucleotide polymorphism analysis from 485 chickens. Polymerase chain reaction sequencing was used to classify single nucleotide polymorphisms of class II MHC. Body weights were measured at the ages of 3, 4, 5, and 7 months. Titres of Newcastle disease virus at 2 weeks to 7 months were determined and the correlation between body weight and titre was analysed. The association between single nucleotide polymorphisms and body weight and titre were analysed by a generalized linear model. Seven single nucleotide polymorphisms were identified: C125T, A126T, C209G, C242T, A243T, C244T, and A254T. Significant correlations between log titre and body weight were found at 2 and 4 weeks. Associations between single nucleotide polymorphisms and titre were found for C209G and A254T, and between all single nucleotide polymorphisms (except A243T) and body weight. The results showed that class II MHC is associated with both titre of Newcastle disease virus and body weight in Leung Hang Khao chickens. This is of concern because improved growth traits are the main goal of breeding selection. Moreover, the results suggested that MHC has a pleiotropic effect on the titre and growth performance. This mechanism should be investigated in a future study. PMID:26732325

  20. Gene-Based Single Nucleotide Polymorphism Markers for Genetic and Association Mapping in Common Bean

    PubMed Central

    2012-01-01

    Background In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. Results In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. Conclusions In short, this study illustrates the power of intron

  1. Method for the detection of specific nucleic acid sequences by polymerase nucleotide incorporation

    DOEpatents

    Castro, Alonso

    2004-06-01

    A method for rapid and efficient detection of a target DNA or RNA sequence is provided. A primer having a 3'-hydroxyl group at one end and having a sequence of nucleotides sufficiently homologous with an identifying sequence of nucleotides in the target DNA is selected. The primer is hybridized to the identifying sequence of nucleotides on the DNA or RNA sequence and a reporter molecule is synthesized on the target sequence by progressively binding complementary nucleotides to the primer, where the complementary nucleotides include nucleotides labeled with a fluorophore. Fluorescence emitted by fluorophores on single reporter molecules is detected to identify the target DNA or RNA sequence.

  2. Single nucleotide polymorphism of FSHβ gene associated with reproductive traits in Japanese flounder ( Paralichthys olivaceus)

    NASA Astrophysics Data System (ADS)

    He, Feng; Wen, Haishen; Yu, Dahui; Li, Jifang; Shi, Bao; Chen, Caifang; Zhang, Jiaren; Jin, Guoxiong; Chen, Xiaoyan; Shi, Dan; Yang, Yanping

    2010-12-01

    Follicle stimulating hormone β (FSHβ) of Japanese flounder ( Paralichthys olivaceus) plays a key role in the regulation of gonadal development. This study aimed to investigate molecular genetic characteristics of the FSHβ gene and elucidate the effects of single nucleotide polymorphisms (SNPs) of FSHβ on reproductive traits in Japanese flounder. We used polymerase chain reaction single-strand conformation polymorphism (PCR-SSCP) and sequencing of the FSHβ gene in 60 individuals. We identified only an SNP (T/C) in the coding region of exon3 of FSHβ. The SNP (T/C) did not lead to amino acid changes at the position 340 bp of FSHβ gene. Statistical analysis showed that the SNP was significantly associated with testosterone (T) level and gonadosomatic index (GSI) ( P < 0.05). Individuals with genotype TC of the SNP had significantly higher serum T levels and GSI ( P < 0.05) than that of genotype CC. Therefore, FSHβ gene could be a useful molecular marker in selection for prominent reproductive trait in Japanese Flounder.

  3. Role of six single nucleotide polymorphisms, risk factors in coronary disease, in OLR1 alternative splicing.

    PubMed

    Tejedor, J Ramón; Tilgner, Hagen; Iannone, Camilla; Guigó, Roderic; Valcárcel, Juan

    2015-06-01

    The OLR1 gene encodes the oxidized low-density lipoprotein receptor (LOX-1), which is responsible for the cellular uptake of oxidized LDL (Ox-LDL), foam cell formation in atheroma plaques and atherosclerotic plaque rupture. Alternative splicing (AS) of OLR1 exon 5 generates two protein isoforms with antagonistic functions in Ox-LDL uptake. Previous work identified six single nucleotide polymorphisms (SNPs) in linkage disequilibrium that influence the inclusion levels of OLR1 exon 5 and correlate with the risk of cardiovascular disease. Here we use minigenes to recapitulate the effects of two allelic series (Low- and High-Risk) on OLR1 AS and identify one SNP in intron 4 (rs3736234) as the main contributor to the differences in exon 5 inclusion, while the other SNPs in the allelic series attenuate the drastic effects of this key SNP. Bioinformatic, proteomic, mutational and functional high-throughput analyses allowed us to define regulatory sequence motifs and identify SR protein family members (SRSF1, SRSF2) and HMGA1 as factors involved in the regulation of OLR1 AS. Our results suggest that antagonism between SRSF1 and SRSF2/HMGA1, and differential recognition of their regulatory motifs depending on the identity of the rs3736234 polymorphism, influence OLR1 exon 5 inclusion and the efficiency of Ox-LDL uptake, with potential implications for atherosclerosis and coronary disease. PMID:25904137

  4. [Correlation analysis between single nucleotide polymorphism of FGF5 gene and wool yield in rabbits].

    PubMed

    Li, Chun-Xiao; Jiang, Mei-Shan; Chen, Shi-Yi; Lai, Song-Jia

    2008-07-01

    Single nucleotide polymorphism (SNP) in exon 1 and 3 of fibroblast growth factor (FGF5) gene was studied by DNA sequencing in Yingjing angora rabbit, Tianfu black rabbit and California rabbit. A frameshift mutation (TCT insert) at base position 217 (site A) of exon 1 and a T/C missense mutation at base position 59 (site B) of exon 3 were found in Yingjing angora rabbit with a high frequency; a T/C same-sense mutation at base position 3 (site C) of exon 3 was found with similar frequency in three rabbit breeds. Least square analysis showed that different genotypes had no significant association with wool yield in site A, and had high significant association with wool yield in site B (P<0.01) and significant association with wool yield in site C (P<0.05). It was concluded from the results that FGF5 gene could be the potential major gene affecting wool yield or link with the major gene, and polymorphic loci B and C may be used as molecular markers for im-proving wool yield in angora rabbits. PMID:18779133

  5. Detection of mandarin in orange juice by single-nucleotide polymorphism qPCR assay.

    PubMed

    Aldeguer, Miriam; López-Andreo, María; Gabaldón, José A; Puyet, Antonio

    2014-02-15

    A dual-probe real time PCR (qPCR) DNA-based analysis was devised for the identification of mandarin in orange juice. A single nucleotide polymorphism at the trnL-trnF intergenic region of the chloroplast chromosome was confirmed in nine orange (Citrus sinensis) and thirteen commercial varieties of mandarin, including Citrus reticulata and Citrus unshiu species and a mandarin × tangelo hybrid. Two short minor-groove binding fluorescent probes targeting the polymorphic sequence were used in the dual-probe qPCR, which allowed the detection of both species in single-tube reactions. The similarity of PCR efficiencies allowed a simple estimation of the ratio mandarin/orange in the juice samples, which correlated to the measured difference of threshold cycle values for both probes. The limit of detection of the assay was 5% of mandarin in orange juice, both when the juice was freshly prepared (not from concentrate) or reconstituted from concentrate, which would allow the detection of fraudulently added mandarin juice. The possible use of the dual-probe system for quantitative measurements was also tested on fruit juice mixtures. qPCR data obtained from samples containing equal amounts of mandarin and orange juice revealed that the mandarin target copy number was approximately 2.6-fold higher than in orange juice. The use of a matrix-adapted control as calibrator to compensate the resulting C(T) bias allowed accurate quantitative measurements to be obtained. PMID:24128588

  6. Nucleotide Sequencing and Identification of Some Wild Mushrooms

    PubMed Central

    Das, Sudip Kumar; Mandal, Aninda; Datta, Animesh K.; Gupta, Sudha; Paul, Rita; Saha, Aditi; Sengupta, Sonali; Dubey, Priyanka Kumari

    2013-01-01

    The rDNA-ITS (Ribosomal DNA Internal Transcribed Spacers) fragment of the genomic DNA of 8 wild edible mushrooms (collected from Eastern Chota Nagpur Plateau of West Bengal, India) was amplified using ITS1 (Internal Transcribed Spacers 1) and ITS2 primers and subjected to nucleotide sequence determination for identification of mushrooms as mentioned. The sequences were aligned using ClustalW software program. The aligned sequences revealed identity (homology percentage from GenBank data base) of Amanita hemibapha [CN (Chota Nagpur) 1, % identity 99 (JX844716.1)], Amanita sp. [CN 2, % identity 98 (JX844763.1)], Astraeus hygrometricus [CN 3, % identity 87 (FJ536664.1)], Termitomyces sp. [CN 4, % identity 90 (JF746992.1)], Termitomyces sp. [CN 5, % identity 99 (GU001667.1)], T. microcarpus [CN 6, % identity 82 (EF421077.1)], Termitomyces sp. [CN 7, % identity 76 (JF746993.1)], and Volvariella volvacea [CN 8, % identity 100 (JN086680.1)]. Although out of 8 mushrooms 4 could be identified up to species level, the nucleotide sequences of the rest may be relevant to further characterization. A phylogenetic tree is constructed using Neighbor-Joining method showing interrelationship between/among the mushrooms. The determined nucleotide sequences of the mushrooms may provide additional information enriching GenBank database aiding to molecular taxonomy and facilitating its domestication and characterization for human benefits. PMID:24489501

  7. Nucleotide sequencing and identification of some wild mushrooms.

    PubMed

    Das, Sudip Kumar; Mandal, Aninda; Datta, Animesh K; Gupta, Sudha; Paul, Rita; Saha, Aditi; Sengupta, Sonali; Dubey, Priyanka Kumari

    2013-01-01

    The rDNA-ITS (Ribosomal DNA Internal Transcribed Spacers) fragment of the genomic DNA of 8 wild edible mushrooms (collected from Eastern Chota Nagpur Plateau of West Bengal, India) was amplified using ITS1 (Internal Transcribed Spacers 1) and ITS2 primers and subjected to nucleotide sequence determination for identification of mushrooms as mentioned. The sequences were aligned using ClustalW software program. The aligned sequences revealed identity (homology percentage from GenBank data base) of Amanita hemibapha [CN (Chota Nagpur) 1, % identity 99 (JX844716.1)], Amanita sp. [CN 2, % identity 98 (JX844763.1)], Astraeus hygrometricus [CN 3, % identity 87 (FJ536664.1)], Termitomyces sp. [CN 4, % identity 90 (JF746992.1)], Termitomyces sp. [CN 5, % identity 99 (GU001667.1)], T. microcarpus [CN 6, % identity 82 (EF421077.1)], Termitomyces sp. [CN 7, % identity 76 (JF746993.1)], and Volvariella volvacea [CN 8, % identity 100 (JN086680.1)]. Although out of 8 mushrooms 4 could be identified up to species level, the nucleotide sequences of the rest may be relevant to further characterization. A phylogenetic tree is constructed using Neighbor-Joining method showing interrelationship between/among the mushrooms. The determined nucleotide sequences of the mushrooms may provide additional information enriching GenBank database aiding to molecular taxonomy and facilitating its domestication and characterization for human benefits. PMID:24489501

  8. Nucleotide sequence and genome organization of tomato leaf curl geminivirus.

    PubMed

    Dry, I B; Rigden, J E; Krake, L R; Mullineaux, P M; Rezaian, M A

    1993-01-01

    The genome of tomato leaf curl virus (TLCV) from Australia was cloned and its complete nucleotide sequence determined. It is a single circular ssDNA of 2766 nucleotides containing the consensus nonanucleotide sequence present in all geminiviruses. It has six open reading frames with an organization resembling that of certain other dicotyledonous plant-infecting monopartite geminiviruses, i.e. tomato yellow leaf curl and beet curly top viruses. The regulatory sequences present indicate a bidirectional mode of transcription. A dimeric TLCV DNA clone was constructed in a binary vector and used to agroinoculate three different host species. Typical virus infections were produced, confirming that the single DNA component is sufficient for infectivity. PMID:8423446

  9. High-throughput polymorphism detection and genotyping in Brassica napus using next-generation RAD sequencing

    PubMed Central

    2012-01-01

    Background The complex genome of rapeseed (Brassica napus) is not well understood despite the economic importance of the species. Good knowledge of sequence variation is needed for genetics approaches and breeding purposes. We used a diversity set of B. napus representing eight different germplasm types to sequence genome-wide distributed restriction-site associated DNA (RAD) fragments for polymorphism detection and genotyping. Results More than 113,000 RAD clusters with more than 20,000 single nucleotide polymorphisms (SNPs) and 125 insertions/deletions were detected and characterized. About one third of the RAD clusters and polymorphisms mapped to the Brassica rapa reference sequence. An even distribution of RAD clusters and polymorphisms was observed across the B. rapa chromosomes, which suggests that there might be an equal distribution over the Brassica oleracea chromosomes, too. The representation of Gene Ontology (GO) terms for unigenes with RAD clusters and polymorphisms revealed no signature of selection with respect to the distribution of polymorphisms within genes belonging to a specific GO category. Conclusions Considering the decreasing costs for next-generation sequencing, the results of our study suggest that RAD sequencing is not only a simple and cost-effective method for high-density polymorphism detection but also an alternative to SNP genotyping from transcriptome sequencing or SNP arrays, even for species with complex genomes such as B. napus. PMID:22726880

  10. A novel MALDI-TOF based methodology for genotyping single nucleotide polymorphisms.

    PubMed

    Blondal, Thorarinn; Waage, Benedikt G; Smarason, Sigurdur V; Jonsson, Frosti; Fjalldal, Sigridur B; Stefansson, Kari; Gulcher, Jeffery; Smith, Albert V

    2003-12-15

    A new MALDI-TOF based detection assay was developed for analysis of single nucleotide polymorphisms (SNPs). It is a significant modification on the classic three-step minisequencing method, which includes a polymerase chain reaction (PCR), removal of excess nucleotides and primers, followed by primer extension in the presence of dideoxynucleotides using modified thermostable DNA polymerase. The key feature of this novel assay is reliance upon deoxynucleotide mixes, lacking one of the nucleotides at the polymorphic position. During primer extension in the presence of depleted nucleotide mixes, standard thermostable DNA polymerases dissociate from the template at positions requiring a depleted nucleotide; this principal was harnessed to create a genotyping assay. The assay design requires a primer- extension primer having its 3'-end one nucleotide upstream from the interrogated site. The assay further utilizes the same DNA polymerase in both PCR and the primer extension step. This not only simplifies the assay but also greatly reduces the cost per genotype compared to minisequencing methodology. We demonstrate accurate genotyping using this methodology for two SNPs run in both singleplex and duplex reactions. We term this assay nucleotide depletion genotyping (NUDGE). Nucleotide depletion genotyping could be extended to other genotyping assays based on primer extension such as detection by gel or capillary electrophoresis. PMID:14654708

  11. PERB11 (MIC): a polymorphic MHC gene is expressed in skin and single nucleotide polymorphisms are associated with psoriasis

    PubMed Central

    Tay, G K; Hui, J; Gaudieri, S; Schmitt-Egenolf, M; Martinez, O P; Leelayuwat, C; Williamson, J F; Eiermann, T H; Dawkins, R L

    2000-01-01

    The susceptibility genes for psoriasis remain to be identified. At least one of these must be in the major histocompatibility complex (MHC) to explain associations with alleles at human leucocyte antigen (HLA)-A, -B, -C, -DR, -DQ and C4. In fact, most of these alleles are components of just two ancestral haplotypes (AHs) designated 13.1 and 57.1. Although relevant MHC gene(s) could be within a region of at least 4 Mb, most studies have favoured the area near HLA-B and -C. This region contains a large number of non-HLA genes, many of which are duplicated and polymorphic. Members of one such gene family, PERB11.1 and PERB11.2, are expressed in the skin and are encoded in the region between tumour necrosis factor and HLA-B. To investigate the relationship of PERB11.1 alleles to psoriasis, sequence based typing was performed on 97 patients classified according to age of onset and family history. The frequency of the PERB11.1*06 allele is 44% in type I psoriasis but only 7% in controls (Pc = 0.003 by Fisher's exact test, two-tailed). The major determinant of this association is a single nucleotide polymorphism (SNP) within intron 4. In normal and affected skin, expression of PERB11 is mainly in the basal layer of the epidermis including ducts and follicles. PERB11 is also present in the upper keratin layers but there is relative deficiency in the intermediate layers. These findings suggest a possible role for PERB11 and other MHC genes in the pathogenesis of psoriasis. PMID:10691930

  12. The primary nucleotide sequence of U4 RNA.

    PubMed

    Reddy, R; Henning, D; Busch, H

    1981-04-10

    U4 RNA is one of the "capped" nuclear snRNAs recently found to be precipitable by anti-Sm antibodies as ribonucleoprotein particles. U4 RNA, along with other snRNAs, has been implicated in hnRNA processing, mRNA transport, or both (Lerner, M. R., Boyle, J., Mount, S., Wolin, S., and Steitz, J. A. (1980) Nature 283, 220-224). Since the proteins bound to different snRNAs appear to be the same, the functions of different snRNPs might be dependent on the RNA components. To help understand the function of U4 RNP, the nucleotide sequence of U4 RNA was determined. The sequence is (formula see text) In addition to the modified nucleotides in the "cap," U4 RNA contains Am at position 63 and m6A at position 98. It also exhibited A-C microheterogeneity at position 97. PMID:6162848

  13. Nucleotide-Specific Contrast for DNA Sequencing by Electron Spectroscopy

    PubMed Central

    Schmid, Andreas K.; Davis, Ronald W.

    2016-01-01

    DNA sequencing by imaging in an electron microscope is an approach that holds promise to deliver long reads with low error rates and without the need for amplification. Earlier work using transmission electron microscopes, which use high electron energies on the order of 100 keV, has shown that low contrast and radiation damage necessitates the use of heavy atom labeling of individual nucleotides, which increases the read error rates. Other prior work using scattering electrons with much lower energy has shown to suppress beam damage on DNA. Here we explore possibilities to increase contrast by employing two methods, X-ray photoelectron and Auger electron spectroscopy. Using bulk DNA samples with monomers of each base, both methods are shown to provide contrast mechanisms that can distinguish individual nucleotides without labels. Both spectroscopic techniques can be readily implemented in a low energy electron microscope, which may enable label-free DNA sequencing by direct imaging. PMID:27149617

  14. Bulk segregant analysis using single nucleotide polymorphism microarrays

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Bulk segregant analysis using microarrays, and extreme array mapping have recently been used to rapidly identify genomic regions associated with phenotypes in multiple species. These experiments, however require the identification of single feature polymorphisms between the cross parents for each ne...

  15. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request. SUMMARY: The United States....'' SUPPLEMENTARY INFORMATION: I. Abstract Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of the sequence listing in accordance with the requirements in 37 CFR...

  16. Discovery and verification of functional single nucleotide polymorphisms in regulatory genomic regions: Current and developing technologies

    PubMed Central

    Chorley, Brian N.; Wang, Xuting; Campbell, Michelle R.; Pittman, Gary S.; Noureddine, Maher A.; Bell, Douglas A.

    2008-01-01

    The most common form of genetic variation, single nucleotide polymorphisms or SNPs, can affect the way an individual responds to the environment and modify disease risk. Although most of the millions of SNPs have little or no effect on gene regulation and protein activity, there are many circumstances where base changes can have deleterious effects. Non-synonymous SNPs that result in amino acid changes in proteins have been studied because of their obvious impact on protein activity. It is well known that SNPs within regulatory regions of the genome can result in disregulation of gene transcription. However, the impact of SNPs located in putative regulatory regions, or rSNPs, is harder to predict for two primary reasons. First, the mechanistic roles of non-coding genomic sequence remain poorly defined. Second, experimental validation of the functional consequences of rSNPs is often slow and laborious. In this review, we summarize traditional and novel methodologies for candidate rSNPs selection, in particular in silico techniques that aid in candidate rSNP selection. Additionally we will discuss molecular biological techniques that assess the impact of rSNPs on binding of regulatory machinery, as well as functional consequences on transcription. Standard techniques such as EMSA and luciferase reporter constructs are still widely used to assess effects of rSNPs on binding and gene transcription; however, these protocols are often bottlenecks in the discovery process. Therefore, we highlight novel and developing high-throughput protocols that promise to aid in shortening the process of rSNP validation. Given the large amount of genomic information generated from a multitude of re-sequencing and genome-wide SNP array efforts, future focus should be to develop validation techniques that will allow greater understanding of the impact these polymorphisms have on human health and disease. PMID:18565787

  17. Complete nucleotide sequence of Saccharomyces cerevisiae chromosome X.

    PubMed Central

    Galibert, F; Alexandraki, D; Baur, A; Boles, E; Chalwatzis, N; Chuat, J C; Coster, F; Cziepluch, C; De Haan, M; Domdey, H; Durand, P; Entian, K D; Gatius, M; Goffeau, A; Grivell, L A; Hennemann, A; Herbert, C J; Heumann, K; Hilger, F; Hollenberg, C P; Huang, M E; Jacq, C; Jauniaux, J C; Katsoulou, C; Karpfinger-Hartl, L

    1996-01-01

    The complete nucleotide sequence of Saccharomyces cerevisiae chromosome X (745 442 bp) reveals a total of 379 open reading frames (ORFs), the coding region covering approximately 75% of the entire sequence. One hundred and eighteen ORFs (31%) correspond to genes previously identified in S. cerevisiae. All other ORFs represent novel putative yeast genes, whose function will have to be determined experimentally. However, 57 of the latter subset (another 15% of the total) encode proteins that show significant analogy to proteins of known function from yeast or other organisms. The remaining ORFs, exhibiting no significant similarity to any known sequence, amount to 54% of the total. General features of chromosome X are also reported, with emphasis on the nucleotide frequency distribution in the environment of the ATG and stop codons, the possible coding capacity of at least some of the small ORFs (<100 codons) and the significance of 46 non-canonical or unpaired nucleotides in the stems of some of the 24 tRNA genes recognized on this chromosome. Images PMID:8641269

  18. Haplotype of single nucleotide polymorphisms in exon 6 of the MZF-1 gene and Alzheimer's disease.

    PubMed

    Porcellini, Elisa; Carbone, Ilaria; Martelli, Pier Luigi; Ianni, Manuela; Casadio, Rita; Pession, Annalisa; Licastro, Federico

    2013-01-01

    Our previous works showed that single nucleotide polymorphisms (SNPs) in genes with regulatory function upon inflammatory response and cholesterol metabolism were associated with Alzheimer's disease (AD) risk. The list comprises SNPs located on the promoters of alpha 1 antichymotrypsin (rs1884082), hydroxy methyl glutaryl coenzime A reductase (rs376140), tumor necrosis factor alpha (rs1800629), and interleukin 10 (rs1800869). Here we investigated the effect of these SNPs on the binding for transcription factors. We computationally detected putative binding sites for transcription factors located in the SNP regions. To this aim, the TESS program for scanning the promoter sequences against the binding-site models available at TRANSFACT and JASPAR databases was adopted. All the analyzed SNPs appeared to affect the binding of myeloid zinc finger protein 1 (MZF-1) to the promoter sequence of the above reported genes. Therefore 16 SNPs in MZF-1 gene were tested in 120 AD cases and 88 controls to asses a possible association between MZF-1 and AD. 14 SNPs showed no variability in AD and control populations, while two SNPs rs4756 and rs2228162 showed the three genotypes. Genotype distributions and allele frequencies of these two SNPs were comparable between AD and controls. On the other hand, the haplotype distribution of rs4756 and rs2228162 was different between AD and controls; being the AG haplotype associated with a decreased AD risk. In conclusion, selected SNPs in MZF-1 gene exert a minor effect on AD risk. PMID:23241556

  19. Investigating single nucleotide polymorphism (SNP) density in the human genome and its implications for molecular evolution.

    PubMed

    Zhao, Zhongming; Fu, Yun-Xin; Hewett-Emmett, David; Boerwinkle, Eric

    2003-07-17

    We investigated the single nucleotide polymorphism (SNP) density across the human genome and in different genic categories using two SNP databases: Celera's CgsSNP, which includes SNPs identified by comparing genomic sequences, and Celera's RefSNP, which includes SNPs from a variety of sources and is biased toward disease-associated genes. Based on CgsSNP, the average numbers of SNPs per 10 kb was 8.33, 8.44, and 8.09 in the human genome, in intergenic regions, and in genic regions, respectively. In genic regions, the SNP density in intronic, exonic and adjoining untranslated regions was 8.21, 5.28, and 7.51 SNPs per 10 kb, respectively. The pattern of SNP density based on RefSNP was different from that based on CgsSNP, emphasizing its utility for genotype-phenotype association studies but not for most population genetic studies. The number of SNPs per chromosome was correlated with chromosome length, but the density of SNPs estimated by CgsSNP was not significantly correlated with the GC content of the chromosome. Based on CgsSNP, the ratio of nonsense to missense mutations (0.027), the ratio of missense to silent mutations (1.15), and the ratio of non-synonymous to synonymous mutations (1.18) was less than half of that expected in a human protein coding sequence under the neutral mutation theory, reflecting a role for natural selection, especially purifying selection. PMID:12909357

  20. Identification of new aquaporin genes and single nucleotide polymorphism in bread wheat.

    PubMed

    Pandey, B; Sharma, P; Pandey, D M; Sharma, I; Chatrath, R

    2013-01-01

    Major facilitators of water movement through plant cell membranes include aquaporin proteins. Wheat is among the largest and most important cereal crops worldwide; however, unlike other model plants such as rice, maize and Arabidopsis, little has been reported on wheat major intrinsic proteins (MIPs). This study presents a comprehensive computational identification of 349 new wheat expressed sequence tags (ESTs), encoding 13 wheat aquaporin genes. Identified aquaporins consist of 6 plasma membrane intrinsic proteins (PIP) and 1 TIP showing high sequence similarity with rice aquaporins. We also identified 4 NOD26-like intrinsic proteins (NIP) and 2 SIP members that showed more divergence. Further, expression analysis of the aquaporin genes using the available EST information in UniGene revealed their transcripts were differentially regulated in various stress- and tissue-specific libraries. Allele specific Polymerase chain reaction (PCR) primers based on single nucleotide polymorphism (SNP) were designed using PIP as the target gene and validated on a core set of Indian wheat genotypes. A 3D theoretical model of the wheat aquaporin protein was built by homology modeling and could prove to be useful in the further functional characterization of this protein. Collectively with expression and bioinformatics analysis, our results support the idea that the genes identified in this study signify an important genetic resource providing potential targets to modify the water use properties of wheat. PMID:24250219

  1. Single nucleotide polymorphisms in the bovine Histophilus somni genome; a comparison of new and old isolates.

    PubMed

    Madampage, Claudia Avis; Rawlyk, Neil; Crockford, Gordon; Van Donkersgoed, Joyce; Dorin, Craig; Potter, Andrew

    2015-07-01

    Histophilus somni, a causative agent of the bovine respiratory disease complex, can also cause a variety of systemic disorders, including bronchopneumonia, myocarditis, pericarditis, arthritis, pleuritis, and infectious thrombotic meningoencephalitis. The purpose of this study was to determine if currently circulating strains differ from those of the 1980s by identifying genomic changes. Single nucleotide polymorphisms (SNPs) and insertion and deletion (INDEL) sites were examined by whole-genome sequencing in 12 samples, 6 old and 6 new. The 31 028 SNP/INDELs recorded were compared against the reference genome sequence of the pathogenic H. somni strain 2336. The distribution of about 75% of these SNPs within a specified gene differed between old and new isolates and did not follow any particular pattern. The other 25% clustered into 2 groups containing the same SNPs in various genes: group I included 5 old isolates and 1 new isolate; group II included 5 new isolates and 1 old isolate. For putative virulence genes there were more SNPs in group I compared with strain 2336, itself an older isolate, than in group II. Although only 25% of all the SNPs formed 2 clusters, the results suggest some genetic difference in various genes between old and new strains. PMID:26130851

  2. Bioinformatics of varicella-zoster virus: Single nucleotide polymorphisms define clades and attenuated vaccine genotypes

    PubMed Central

    Chow, Vincent T.; Tipples, Graham A.; Grose, Charles

    2012-01-01

    Varicella zoster virus (VZV) is one of the human herpesviruses. To date, over 40 complete VZV genomes have been sequenced and analyzed. The VZV genome contains around 125,000 base pairs including 70 open reading frames (ORFs). Enumeration of single nucleotide polymorphisms (SNPs) has determined that the following ORFs are the most variable (in descending order): 62, 22, 29, 28, 37, 21, 54, 31, 1 and 55. ORF 62 is the major immediate early regulatory VZV gene. Further SNP analysis across the entire genome has led to the observation that VZV strains can be broadly grouped into clades within a phylogenetic tree. VZV strains collected in Singapore provided important sequence data for construction of the phylogenetic tree. Currently 5 VZV clades are recognized; they have been designated clades 1 through 5. Clades 1 and 3 include European/North American strains; clade 2 includes Asian strains, especially from Japan; and clade 5 includes strains from India. Clade 4 includes some strains from Europe, but its geographic origins need further documentation.. Within clade 1, five variant viruses have been isolated with a missense mutation in the gE (ORF 68) glycoprotein; these strains have an altered increased cell spread phenotype. Bioinformatics analyses of the attenuated vaccine strains have also been performed, with a subsequent discovery of a stop-codon SNP in ORFO as a likely attenuation determinant. Taken together, these VZV bioinformatics analyses have provided enormous insights into VZV phylogenetics as well as VZV SNPs associated with attenuation. PMID:23183312

  3. The Application and Performance of Single Nucleotide Polymorphism Markers for Population Genetic Analyses of Lepidoptera

    PubMed Central

    Coates, Brad Steven; Bayles, Darrell O.; Wanner, Kevin W.; Robertson, Hugh M.; Hellmich, Richard L.; Sappington, Thomas W.

    2011-01-01

    Microsatellite markers are difficult to apply within lepidopteran studies due to the lack of locus-specific PCR amplification and the high proportion of “null” alleles, such that erroneous estimations of population genetic parameters often result. Herein single nucleotide polymorphism (SNP) markers are developed from Ostrinia nubilalis (Lepidoptera: Crambidae) using next generation expressed sequence tag (EST) data. A total of 2742 SNPs were predicted within a reference assembly of 7414 EST contigs, and a subset of 763 were incorporated into 24 multiplex PCR reactions. To validate this pipeline, 5 European and North American sample sites were genotyped at 178 SNP loci, which indicated 84 (47.2%) were in Hardy–Weinberg equilibrium. Locus-by-locus FST, analysis of molecular variance, and STRUCTURE analyses indicate significant genetic differentiation may exist between European and North American O. nubilalis. The observed genetic diversity was significantly lower among European sites, which may result from genetic drift, natural selection, a genetic bottleneck, or ascertainment bias due to North American origin of EST sequence data. SNPs are an abundant source of mutation data for molecular genetic marker development in non-model species, with shared ancestral SNPs showing application within closely related species. These markers offer advantages over microsatellite markers for genetic and genomic analyses of Lepidoptera, but the source of mutation data may affect the estimation of population parameters and likely need to be considered in the interpretation of empirical data. PMID:22303334

  4. Nucleotide sequence of the hypervariable region of the human C2 gene

    SciTech Connect

    Zhu, Z.B.; Volanakis, J.V. )

    1991-03-15

    It has been previously suggested that the multiallelic Bam H1/Sst I RFLPs of the human C2 gene arose through deletion/insertion of a tandemly-repeated minisatellite region. In this study the authors subcloned and sequenced the Sst I polymorphic fragment of the b haplotype of the C2 gene. This restriction fragment is 2,450 bp long and maps 1,550 bp 3{prime} of exon 3. Its nucleotide sequence is characterized by the presence of at least 4 different repeated regions varying in size from 18 to 58 bp. One of these regions starting at position 1,413 is 48 bp long and is repeated five times. The first 3 repeats are in tandem and are separated by 72 bp from two additional tandem repeats. Sequence homology among the 5 repeats ranges between 93 and 98%. Eighty three percent of the nucleotides of the repeated-region are G or C. It seems likely that this nucleotide repeat resulted in the multiallelic RFLPs through a mechanism of unequal recombination or replication slippage.

  5. The complete nucleotide sequence of pelargonium leaf curl virus.

    PubMed

    McGavin, Wendy J; MacFarlane, Stuart A

    2016-05-01

    Investigation of a tombusvirus isolated from tulip plants in Scotland revealed that it was pelargonium leaf curl virus (PLCV) rather than the originally suggested tomato bushy stunt virus. The complete sequence of the PLCV genome was determined for the first time, revealing it to be 4789 nucleotides in size and to have an organization similar to that of the other, previously described tombusviruses. Primers derived from the sequence were used to construct a full-length infectious clone of PLCV that recapitulates the disease symptoms of leaf curling in systemically infected pelargonium plants. PMID:26906694

  6. Nucleotide sequences of five anti-lysozyme monoclonal antibodies.

    PubMed Central

    Darsley, M J; Rees, A R

    1985-01-01

    The nucleotide sequences of the heavy and light chain immunoglobulin mRNAs derived from five hybridomas (Gloop 1-5) secreting IgGs specific for the loop region of hen egg lysozyme were determined. These monoclonal antibodies recognise three distinct but overlapping epitopes within the loop region. The sequences of two pairs of antibodies with indistinguishable fine specificities were similar in both chains whereas the sequences of antibodies of non-identical specificities were very different. It is proposed that the D-segments expressed in two of the antibodies (Gloop3 and Gloop4) are the products of one, or perhaps two, previously unidentified germ line D-genes. Gloop1 and Gloop2 use a D-segment previously identified in antibodies specific for the hapten 2-phenyloxazolone; however it is recombined in a different reading frame in the anti-lysozyme antibodies, producing a different amino acid sequence. PMID:2410256

  7. Short communication: Relationship of call rate and accuracy of single nucleotide polymorphism genotypes in dairy cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Call rate has been used as a measure of quality on both a single nucleotide polymorphism (SNP) and animal basis since SNP genotypes were first used in genomic evaluation of dairy cattle. The genotyping laboratories perform initial quality control screening and genotypes that fail are usually exclude...

  8. Using 90,113 single nucleotide polymorphisms in genomic evaluation of dairy cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Accuracy of genomic evaluation is expected to increase when more markers are used because of better tracking of causative genetic variants. However, Illumina BovineHD genotypes based on 777,962 single nucleotide polymorphisms (SNP) have not been used for US genomic evaluation because the small relia...

  9. ASSOCIATION OF RESISTANCE TO AVIAN COCCIDIOSIS WITH SINGLE NUCLEOTIDE POLYMORPHISMS IN THE ZYXIN GENE

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Our previous genetic studies demonstrated that resistance to avian coccidiosis was linked with microsatellite markers LEI0071 and LEI0101 on chromosome 1. In this study, the associations between parameters of resistance to coccidiosis and single nucleotide polymorphisms (SNPs) in 3 candidate genes ...

  10. The effects of single nucleotide polymorphisms (SNPs) of calpastatin (CAST) gene on meat tenderness of yak.

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The association of single nucleotide polymorphisms (SNPs) of calpastatin (CAST) gene with shear force of 2.54 cm steaks from M. longissimus dorsi from Gannan yaks (Bos grunniens, n=181) was studied. Yaks were harvested at 2, 3, and 4 yr of age (n=51, 59, and 71, respectively), and samples of each ya...

  11. Development of a web services based system for dissemination of single nucleotide polymorphism data

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Single nucleotide polymorphisms (SNPs) can be used to generate DNA-based fingerprints for individual identification. The efficiency of DNA fingerprinting is greatest when the frequency of both SNP alleles is near 0.50. A number of SNPs have been identified in cattle populations with minor allele f...

  12. Relationships among calpastatin single nucleotide polymorphisms, calpastatin expression and tenderness in pork longissimus

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome scans in the pig have identified a region on chromosome 2 (SSC2) associated with tenderness. Calpastatin is a likely positional candidate gene in this region because of its inhibitory role in the calpain system that is involved in postmortem tenderization. Novel single nucleotide polymorphism...

  13. Performance of single nucleotide polymorphisms versus haplotypes for genome-wide association analysis in barley

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide association studies (GWAS) may benefit from using haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on the genetic architecture of traits, patter...

  14. Increasing the number of single nucleotide polymorphisms used in genomic evaluation of dairy cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    GeneSeek designed a new version of the GeneSeek Genomic Profiler HD BeadChip for Dairy Cattle, which had >77,000 single nucleotide polymorphisms (SNPs). A set of >140,000 SNPs was selected that included all SNPs on the existing GeneSeek chip, all SNPs used in U.S. national genomic evaluations, SNPs ...

  15. Association of a single nucleotide polymorphism of calpain 1 gene with meat tenderness of the yak

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The association of a single nucleotide polymorphism (SNP) of calpain 1 (CAPN1) gene with shear force of 2.54 cm steaks from M. longissimus dorsi from Gannan yaks (Bos grunniens, n = 181) was studied. The experimental design was a repeated measures with the main unit in a completely randomized design...

  16. Association of Single Nucleotide Polymorphisms in the CAST Gene Associated with Longissimus Tenderness in Beef Cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The objective was to assess the association of single nucleotide polymorphisms (SNP) developed on the CAST gene, with longissimus tenderness. Forty one SNP were identified in the CAST gene and assays were developed. Markers were scattered throughout the gene. These markers, in conjunction with a com...

  17. Association of single nucleotide polymorphisms in candidate genes residing under quantitative trait loci in beef cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The objective was to assess the association of single nucleotide polymorphisms (SNP) developed on candidate genes residing under previously identified quantitative trait loci for marbling score and meat tenderness. Two hundred five SNP were identified on twenty candidate genes. Genes selected under ...

  18. Single nucleotide polymorphisms in sheep varying in tolerance to elevated dietary nitrate

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Discovery of single nucleotide polymorphisms (SNPs) may lead to development of marker panels predictive of tolerance to high dietary nitrate (NO3-). The aims of this research were to identify SNPs in Arginiosuccinate Lyase (ASL), determine the relationship of ASL SNP genotypes on NO3- tolerance, an...

  19. Single nucleotide polymorphism in wheat chromosome region harboring Fhb1 for Fusarium head blight resistance

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Fusarium head blight (FHB) is a destructive disease that reduces wheat grain yield and quality. To date, the quantitative trait locus on 3BS (Fhb1) from Sumai 3 has shown the largest effect on FHB resistance. Single nucleotide polymorphism (SNP) is the most common form of genetic variation and suita...

  20. Effect of inversion polymorphism on the neutral nucleotide variability of linked chromosomal regions in Drosophila.

    PubMed Central

    Navarro, A; Barbadilla, A; Ruiz, A

    2000-01-01

    Recombination is a main factor determining nucleotide variability in different regions of the genome. Chromosomal inversions, which are ubiquitous in the genus Drosophila, are known to reduce and redistribute recombination, and thus their specific effect on nucleotide variation may be of major importance as an explanatory factor for levels of DNA variation. Here, we use the coalescent approach to study this effect. First, we develop analytical expressions to predict nucleotide variability in old inversion polymorphisms that have reached mutation-drift-flux equilibrium. The effects on nucleotide variability of a new arrangement appearing in the population and reaching a stable polymorphism are then studied by computer simulation. We show that inversions modulate nucleotide variability in a complex way. The establishment of an inversion polymorphism involves a partial selective sweep that eliminates part of the variability in the population. This is followed by a slow convergence to the equilibrium values. During this convergence, regions close to the breakpoints exhibit much lower variability than central regions. However, at equilibrium, regions close to the breakpoints have higher levels of variability and differentiation between arrangements than regions in the middle of the inverted segment. The implications of these findings for overall variability levels during the evolution of Drosophila species are discussed. PMID:10835391

  1. Evidence for Balancing Selection from Nucleotide Sequence Analyses of Human G6PD

    PubMed Central

    Verrelli, Brian C.; McDonald, John H.; Argyropoulos, George; Destro-Bisol, Giovanni; Froment, Alain; Drousiotou, Anthi; Lefranc, Gerard; Helal, Ahmed N.; Loiselet, Jacques; Tishkoff, Sarah A.

    2002-01-01

    Glucose-6-phosphate dehydrogenase (G6PD) mutations that result in reduced enzyme activity have been implicated in malarial resistance and constitute one of the best examples of selection in the human genome. In the present study, we characterize the nucleotide diversity across a 5.2-kb region of G6PD in a sample of 160 Africans and 56 non-Africans, to determine how selection has shaped patterns of DNA variation at this gene. Our global sample of enzymatically normal B alleles and A, A−, and Med alleles with reduced enzyme activities reveals many previously uncharacterized silent-site polymorphisms. In comparison with the absence of amino acid divergence between human and chimpanzee G6PD sequences, we find that the number of G6PD amino acid polymorphisms in human populations is significantly high. Unlike many other G6PD-activity alleles with reduced activity, we find that the age of the A variant, which is common in Africa, may not be consistent with the recent emergence of severe malaria and therefore may have originally had a historically different adaptive function. Overall, our observations strongly support previous genotype-phenotype association studies that proposed that balancing selection maintains G6PD deficiencies within human populations. The present study demonstrates that nucleotide sequence analyses can reveal signatures of both historical and recent selection in the genome and may elucidate the impact that infectious disease has had during human evolution. PMID:12378426

  2. Association of Nitric Oxide Synthase and Matrix Metalloprotease Single Nucleotide Polymorphisms with Preeclampsia and Its Complications

    PubMed Central

    Leonardo, Daniela P.; Albuquerque, Dulcinéia M.; Lanaro, Carolina; Baptista, Letícia C.; Cecatti, José G.; Surita, Fernanda G.; Parpinelli, Mary A.; Costa, Fernando F.; Franco-Penteado, Carla F.; Fertrin, Kleber Y.; Costa, Maria Laura

    2015-01-01

    Background Preeclampsia is one of the leading causes of maternal and neonatal morbidity and mortality in the world, but its appearance is still unpredictable and its pathophysiology has not been entirely elucidated. Genetic studies have associated single nucleotide polymorphisms in genes encoding nitric oxide synthase and matrix metalloproteases with preeclampsia, but the results are largely inconclusive across different populations. Objectives To investigate the association of single nucleotide polymorphisms (SNPs) in NOS3 (G894T, T-786C, and a variable number of tandem repetitions VNTR in intron 4), MMP2 (C-1306T), and MMP9 (C-1562T) genes with preeclampsia in patients from Southeastern Brazil. Methods This prospective case-control study enrolled 77 women with preeclampsia and 266 control pregnant women. Clinical data were collected to assess risk factors and the presence of severe complications, such as eclampsia and HELLP (hemolysis, elevated liver enzymes, and low platelets) syndrome. Results We found a significant association between the single nucleotide polymorphism NOS3 T-786C and preeclampsia, independently from age, height, weight, or the other SNPs studied, and no association was found with the other polymorphisms. Age and history of preeclampsia were also identified as risk factors. The presence of at least one polymorphic allele for NOS3 T-786C was also associated with the occurrence of eclampsia or HELLP syndrome among preeclamptic women. Conclusions Our data support that the NOS3 T-786C SNP is associated with preeclampsia and the severity of its complications. PMID:26317342

  3. Comparing compressed sequences for faster nucleotide BLAST searches.

    PubMed

    Cameron, Michael; Williams, Hugh E

    2007-01-01

    Molecular biologists, geneticists, and other life scientists use the BLAST homology search package as their first step for discovery of information about unknown or poorly annotated genomic sequences. There are two main variants of BLAST: BLASTP for searching protein collections and BLASTN for nucleotide collections. Surprisingly, BLASTN has had very little attention; for example, the algorithms it uses do not follow those described in the 1997 BLAST paper and no exact description has been published. It is important that BLASTN is state-of-the-art: Nucleotide collections such as GenBank dwarf the protein collections in size, they double in size almost yearly, and they take many minutes to search on modern general purpose workstations. This paper proposes significant improvements to the BLASTN algorithms. Each of our schemes is based on compressed bytepacked formats that allow queries and collection sequences to be compared four bases at a time, permitting very fast query evaluation using lookup tables and numeric comparisons. Our most significant innovations are two new, fast gapped alignment schemes that allow accurate sequence alignment without decompression of the collection sequences. Overall, our innovations more than double the speed of BLASTN with no effect on accuracy and have been integrated into our new version of BLAST that is freely available for download from http://www.fsa-blast.org/. PMID:17666756

  4. Bioinformatics comparison of sulfate-reducing metabolism nucleotide sequences

    NASA Astrophysics Data System (ADS)

    Tremberger, G.; Dehipawala, Sunil; Nguyen, A.; Cheung, E.; Sullivan, R.; Holden, T.; Lieberman, D.; Cheung, T.

    2015-09-01

    The sulfate-reducing bacteria can be traced back to 3.5 billion years ago. The thermodynamics details of the sulfur cycle have been well documented. A recent sulfate-reducing bacteria report (Robator, Jungbluth, et al , 2015 Jan, Front. Microbiol) with Genbank nucleotide data has been analyzed in terms of the sulfite reductase (dsrAB) via fractal dimension and entropy values. Comparison to oil field sulfate-reducing sequences was included. The AUCG translational mass fractal dimension versus ATCG transcriptional mass fractal dimension for the low temperature dsrB and dsrA sequences reported in Reference Thirteen shows correlation R-sq ~ 0.79 , with a probably of about 3% in simulation. A recent report of using Cystathionine gamma-lyase sequence to produce CdS quantum dot in a biological method, where the sulfur is reduced just like in the H2S production process, was included for comparison. The AUCG mass fractal dimension versus ATCG mass fractal dimension for the Cystathionine gamma-lyase sequences was found to have R-sq of 0.72, similar to the low temperature dissimilatory sulfite reductase dsr group with 3% probability, in contrary to the oil field group having R-sq ~ 0.94, a high probable outcome in the simulation. The other two simulation histograms, namely, fractal dimension versus entropy R-sq outcome values, and di-nucleotide entropy versus mono-nucleotide entropy R-sq outcome values are also discussed in the data analysis focusing on low probability outcomes.

  5. Tetra Primer ARMS PCR Optimization to Detect Single Nucleotide Polymorphisms of the CYP2E1 Gene.

    PubMed

    Suhda, Saihas; Paramita, Dewi Kartikawati; Fachiroh, Jajah

    2016-01-01

    Single nucleotide polymorphism (SNP) detection has been used extensively for genetic association studies of diseases including cancer. For mass, yet accurate and more economic SNP detection we have optimized tetra primer amplification refractory mutation system polymerase chain reaction (ARMS PCR) to detect three SNPs in the cytochrome P450 2E1 (CYP2E1) gene locus; i.e. rs3813865, rs2070672 and rs3813867. The optimization system strategies used were (1) designing inner and outer primers; (2) determining of their optimum primer concentration ratios; and (3) determining of the optimum PCR annealing temperature. The tetra primer ARMS PCR result could be directly observed using agarose gel electrophoresis. The method succesfully determined three SNPs in CYP2E1 locus, the results being consistent with validation using DNA sequencing and restriction fragment length polymorphisms (RFLP). PMID:27509930

  6. Single nucleotide polymorphisms in genes associated with isoniazid resistance in Mycobacterium tuberculosis.

    PubMed

    Ramaswamy, Srinivas V; Reich, Robert; Dou, Shu-Jun; Jasperse, Linda; Pan, Xi; Wanger, Audrey; Quitugua, Teresa; Graviss, Edward A

    2003-04-01

    Isoniazid (INH) is a central component of drug regimens used worldwide to treat tuberculosis. Previous studies have identified resistance-associated mutations in katG, inhA, kasA, ndh, and the oxyR-ahpC intergenic region. DNA microarray-based experiments have shown that INH induces several genes in Mycobacterium tuberculosis that encode proteins physiologically relevant to the drug's mode of action. To gain further insight into the molecular genetic basis of INH resistance, 20 genes implicated in INH resistance were sequenced for INH resistance-associated mutations. Thirty-eight INH-monoresistant clinical isolates and 86 INH-susceptible isolates of M. tuberculosis were obtained from the Texas Department of Health and the Houston Tuberculosis Initiative. Epidemiologic independence was established for all isolates by IS6110 restriction fragment length polymorphism analysis. Susceptible isolates were matched with resistant isolates by molecular genetic group and IS6110 profiles. Spoligotyping was done with isolates with five or fewer IS6110 copies. A major genetic group was established on the basis of the polymorphisms in katG codon 463 and gyrA codon 95. MICs were determined by the E-test. Semiquantitative catalase assays were performed with isolates with mutations in the katG gene. When the 20 genes were sequenced, it was found that 17 (44.7%) INH-resistant isolates had a single-locus, resistance-associated mutation in the katG, mabA, or Rv1772 gene. Seventeen (44.7%) INH-resistant isolates had resistance-associated mutations in two or more genes, and 76% of all INH-resistant isolates had a mutation in the katG gene. Mutations were also identified in the fadE24, Rv1592c, Rv1772, Rv0340, and iniBAC genes, recently shown by DNA-based microarray experiments to be upregulated in response to INH. In general, the MICs were higher for isolates with mutations in katG and the isolates had reduced catalase activities. The results show that a variety of single nucleotide

  7. On-chip detection of a single nucleotide polymorphism without polymerase amplification

    PubMed Central

    Han, Jinhee; Tan, Matthew; Sudheendra, Lakshmana; Weiss, Robert H.; Kennedy, Ian M.

    2014-01-01

    A nanoparticle-assembled photonic crystal (PC) array was used to detect single nucleotide polymorphism (SNP). The assay platform with PC nanostructure enhanced the fluorescent signal from nanoparticle-hybridized DNA complexes due to phase matching of excitation and emission. Nanoparticles coupled with probe DNA were trapped into nanowells in an array by using an electrophoretic particle entrapment system. The PC/DNA assay platform was able to identify a 1 base pair (bp) difference in synthesized nucleotide sequences that mimicked the mutation seen in a feline model of human autosomal dominant polycystic kidney disease (PKD) with a sensitivity of 0.9 fg/mL (50 aM)-sensitivity, which corresponds to 30 oligos/array. The reliability of the PC/DNA assay platform to detect SNP in a real sample was demonstrated by using genomic DNA (gDNA) extracted from the urine and blood of two PKD− wild type and three PKD positive cats. The standard curves for PKD positive (PKD+) and negative (PKD−) DNA were created using two feline-urine samples. An additional three urine samples were analyzed in a similar fashion and showed satisfactory agreement with the standard curve, confirming the presence of the mutation in affected urine. The limit of detection (LOD) was 0.005 ng/mL which corresponds to 6 fg per array for gDNA in urine and blood. The PC system demonstrated the ability to detect a number of genome equivalents for the PKD SNP that was very similar to the results reported with real time polymerase chain reaction (PCR). The favorable comparison with quantitative PCR suggests that the PC technology may find application well beyond the detection of the PKD SNP, into areas where a simple, cheap and portable nucleic acid analysis is desirable. PMID:25580203

  8. Cytochrome b nucleotide sequence variation among the Atlantic Alcidae.

    PubMed

    Friesen, V L; Montevecchi, W A; Davidson, W S

    1993-01-01

    Analysis of cytochrome b nucleotide sequences of the six extant species of Atlantic alcids and a gull revealed an excess of adenines and cytosines and a deficit of guanines at silent sites on the coding strand. Phylogenetic analyses grouped the sequences of the common (Uria aalge) and Brünnich's (U. lomvia) guillemots, followed by the razorbill (Alca torda) and little auk (Alle alle). The black guillemot (Cepphus grylle) sequence formed a sister taxon, and the puffin (Fratercula arctica) fell outside the other alcids. Phylogenetic comparisons of substitutions indicated that mutabilities of bases did not differ, but that C was much more likely to be incorporated than was G. Imbalances in base composition appear to result from a strand bias in replication errors, which may result from selection on secondary RNA structure and/or the energetics of codon-anticodon interactions. PMID:7916741

  9. The nucleotide sequence of the bacteriophage T5 ltf gene.

    PubMed

    Kaliman, A V; Kulshin, V E; Shlyapnikov, M G; Ksenzenko, V N; Kryukov, V M

    1995-06-01

    The nucleotide sequence of the bacteriophage T5 Bg/II-BamHI fragment (4,835 bp in length) known to carry a gene encoding the LTF protein which forms the phage L-shaped tail fibers was determined. It was shown to contain an open reading frame for 1,396 amino acid residues that corresponds to a protein of 147.8 kDa. The coding region of ltf gene is preceded by a typical Shine-Dalgarno sequence. Downstream from the ltf gene there is a strong transcription terminator. Data bank analysis of the LTF protein sequence reveals 55.1% identity to the hypothetical protein ORF 401 of bacteriophage lambda in a segment of 118 amino acids overlap. PMID:7789514

  10. Single nucleotide polymorphisms concordant with the horned/polled trait in Holsteins

    PubMed Central

    Cargill, Edward J; Nissing, Nick J; Grosz, Michael D

    2008-01-01

    Background Cattle that naturally do not grow horns are referred to as polled, a trait inherited in a dominant Mendelian fashion. Previous studies have localized the polled mutation (which is unknown) to the proximal end of bovine chromosome 1 in a region approximately 3 Mb in size. While a polled genetic test, Tru-Polled™, is commercially available from MetaMorphix Inc., Holsteins are not a validated breed for this test. Findings Approximately 160 kb were sequenced within the known polled region from 12 polled and 12 horned Holsteins. Analysis of the polymorphisms identified 13 novel single nucleotide polymorphisms (SNPs) that are concordant with the horned/polled trait. Three of the 13 SNPs are located in gene coding or regulatory regions (e.g., the untranslated region, or UTR) where one is located in the 3'UTR of a gene and the other two are located in the 5'UTR and coding region (synonymous SNP) of another gene. The 3'UTR of genes have been shown to be targets of microRNAs regulating gene expression. In silico analysis indicates the 3'UTR SNP may disrupt a microRNA target site. Conclusion These 13 novel SNPs concordant with the horned/polled trait in Holsteins represent a test panel for the breed and this is the first report to the authors' knowledge of SNPs within gene coding or regulatory regions concordant with the horned/polled trait in cattle. These SNPs will require further testing for verification and further study to determine if the 3'UTR SNP may have a functional effect on the polled trait in Holsteins. PMID:19063733

  11. A single nucleotide polymorphism in NEUROD1 is associated with production traits in Nelore beef cattle.

    PubMed

    de Oliveira, P S N; Tizioto, P C; Malago, W; do Nascimento, M L; Cesar, A S M; Diniz, W J S; de Souza, M M; Lanna, D P D; Tullio, R R; Mourão, G B; de A Mudadu, M; Coutinho, L L; de A Regitano, L C

    2016-01-01

    Feed efficiency and carcass characteristics are late-measured traits. The detection of molecular markers associated with them can help breeding programs to select animals early in life, and to predict breeding values with high accuracy. The objective of this study was to identify polymorphisms in the functional and positional candidate gene NEUROD1 (neurogenic differentiation 1), and investigate their associations with production traits in reference families of Nelore cattle. A total of 585 steers were used, from 34 sires chosen to represent the variability of this breed. By sequencing 14 animals with extreme residual feed intake (RFI) values, seven single nucleotide polymorphisms (SNPs) in NEUROD1 were identified. The investigation of marker effects on the target traits RFI, backfat thickness (BFT), ribeye area (REA), average body weight (ABW), and metabolic body weight (MBW) was performed with a mixed model using the restricted maximum likelihood method. SNP1062, which changes cytosine for guanine, had no significant association with RFI or REA. However, we found an additive effect on ABW (P ≤ 0.05) and MBW (P ≤ 0.05), with an estimated allele substitution effect of -1.59 and -0.93 kg0.75, respectively. A dominant effect of this SNP for BFT was also found (P ≤ 0.010). Our results are the first that identify NEUROD1 as a candidate that affects BFT, ABW, and MBW. Once confirmed, the inclusion of this SNP in dense panels may improve the accuracy of genomic selection for these traits in Nelore beef cattle as this SNP is not currently represented on SNP chips. PMID:27420997

  12. Prospecting for pig single nucleotide polymorphisms in the human genome: have we struck gold?

    PubMed

    Grapes, L; Rudd, S; Fernando, R L; Megy, K; Rocha, D; Rothschild, M F

    2006-06-01

    Gene-to-gene variation in the frequency of single nucleotide polymorphisms (SNPs) has been observed in humans, mice, rats, primates and pigs, but a relationship across species in this variation has not been described. Here, the frequency of porcine coding SNPs (cSNPs) identified by in silico methods, and the frequency of murine cSNPs, were compared with the frequency of human cSNPs across homologous genes. From 150,000 porcine expressed sequence tag (EST) sequences, a total of 452 SNP-containing sequence clusters were found, totalling 1394 putative SNPs. All the clustered porcine EST annotations and SNP data have been made publicly available at http://sputnik.btk.fi/project?name=swine. Human and murine cSNPs were identified from dbSNP and were characterized as either validated or total number of cSNPs (validated plus non-validated) for comparison purposes. The correlation between in silico pig cSNP and validated human cSNP densities was found to be 0.77 (p < 0.00001) for a set of 25 homologous genes, while a correlation of 0.48 (p < 0.0005) was found for a primarily random sample of 50 homologous human and mouse genes. This is the first evidence of conserved gene-to-gene variability in cSNP frequency across species and indicates that site-directed screening of porcine genes that are homologous to cSNP-rich human genes may rapidly advance cSNP discovery in pigs. PMID:16706918

  13. Nucleotide sequence corresponding to five chemotaxis genes in Escherichia coli.

    PubMed Central

    Mutoh, N; Simon, M I

    1986-01-01

    The nucleotide sequence of DNA which contains five chemotaxis-related genes of Escherichia coli, cheW, cheR, cheB, cheY, and cheZ, and part of the cheA gene was determined. Molecular weights of the polypeptides encoded by these genes were calculated from translated amino acid sequences, and they were 18,100 for cheW, 32,700 for cheR, 37,500 for cheB, 14,100 for cheY, and 24,000 for cheZ. Nucleotide sequences which could act as ribosome-binding sites were found in the upstream region of each gene. After the termination codon of the cheW gene, a typical rho-independent transcription termination signal was observed. There are no other open reading frames long enough to encode polypeptides in this region except those which code for the two previously reported genes tar and tap. PMID:3510184

  14. Nucleotide sequence and expression of a Drosophila metallothionein.

    PubMed

    Lastowski-Perry, D; Otto, E; Maroni, G

    1985-02-10

    A Drosophila melanogaster cDNA clone was isolated based on its more intense hybridization to RNA sequences from copper-fed larvae than from control larval RNA. This clone showed strong hybridization to mouse metallothionein I cDNA at reduced stringency. Its nucleotide sequence includes an open reading segment which codes for a 40-amino acid protein; this protein is identified as metallothionein based on its similarity to the amino-terminal portion of mammalian and crab metalloproteins. The 10 cysteine residues present occur in five pairs of near vicinal cysteines (Cys-X-Cys). This cDNA sequence hybridized to a 400-nucleotide polyadenylated RNA whose presence in the cells of the alimentary canal of larvae was stimulated by ingestion of cadmium or copper; in other tissues this RNA was present at much lower levels. Mercury, silver, and zinc induced metallothionein to a lesser extent. The level of metallothionein RNA increased very soon after the initiation of metal treatment and reached a maximum after approximately 36 h. PMID:2578462

  15. Mouse SNP Miner: an annotated database of mouse functional single nucleotide polymorphisms

    PubMed Central

    Reuveni, Eli; Ramensky, Vasily E; Gross, Cornelius

    2007-01-01

    Background The mapping of quantitative trait loci in rat and mouse has been extremely successful in identifying chromosomal regions associated with human disease-related phenotypes. However, identifying the specific phenotype-causing DNA sequence variations within a quantitative trait locus has been much more difficult. The recent availability of genomic sequence from several mouse inbred strains (including C57BL/6J, 129X1/SvJ, 129S1/SvImJ, A/J, and DBA/2J) has made it possible to catalog DNA sequence differences within a quantitative trait locus derived from crosses between these strains. However, even for well-defined quantitative trait loci (<10 Mb) the identification of candidate functional DNA sequence changes remains challenging due to the high density of sequence variation between strains. Description To help identify functional DNA sequence variations within quantitative trait loci we have used the Ensembl annotated genome sequence to compile a database of mouse single nucleotide polymorphisms (SNPs) that are predicted to cause missense, nonsense, frameshift, or splice site mutations (available at ). For missense mutations we have used the PolyPhen and PANTHER algorithms to predict whether amino acid changes are likely to disrupt protein function. Conclusion We have developed a database of mouse SNPs predicted to cause missense, nonsense, frameshift, and splice-site mutations. Our analysis revealed that 20% and 14% of missense SNPs are likely to be deleterious according to PolyPhen and PANTHER, respectively, and 6% are considered deleterious by both algorithms. The database also provides gene expression and functional annotations from the Symatlas, Gene Ontology, and OMIM databases to further assess candidate phenotype-causing mutations. To demonstrate its utility, we show that Mouse SNP Miner successfully finds a previously identified candidate SNP in the taste receptor, Tas1r3, that underlies sucrose preference in the C57BL/6J strain. We also use Mouse

  16. Varietal identification of tea (Camellia sinensis) using nanofluidic array of single nucleotide polymorphism (SNP) markers

    PubMed Central

    Fang, Wan-Ping; Meinhardt, Lyndel W; Tan, Hua-Wei; Zhou, Lin; Mischke, Sue; Zhang, Dapeng

    2014-01-01

    Apart from water, tea is the world’s most widely consumed beverage. Tea is produced in more than 50 countries with an annual production of approximately 4.7 million tons. The market segment for specialty tea has been expanding rapidly owing to increased demand, resulting in higher revenues and profits for tea growers and the industry. Accurate varietal identification is critically important to ensure traceability and authentication of premium tea products, which in turn contribute to on-farm conservation of tea genetic diversity. Using a set of single nucleotide polymorphism (SNP) markers developed from the expressed sequence tag (EST) database of Camilla senensis, we genotyped deoxyribonucleic acid (DNA) samples extracted from a diverse group of tea varieties, including both fresh and processed commercial loose-leaf teas. The validation led to the designation of 60 SNPs that unambiguously identified all 40 tested tea varieties with high statistical rigor (p<0.0001). Varietal authenticity and genetic relationships among the analyzed cultivars were further characterized by ordination and Bayesian clustering analysis. These SNP markers, in combination with a high-throughput genotyping protocol, effectively established and verified specific DNA fingerprints for all tested tea varieties. This method provides a powerful tool for variety authentication and quality control for the tea industry. It is also highly useful for the management of tea genetic resources and breeding, where accurate and efficient genotype identification is essential. PMID:26504544

  17. A molecular beacon microarray based on a quantum dot label for detecting single nucleotide polymorphisms.

    PubMed

    Guo, Qingsheng; Bai, Zhixiong; Liu, Yuqian; Sun, Qingjiang

    2016-03-15

    In this work, we report the application of streptavidin-coated quantum dot (strAV-QD) in molecular beacon (MB) microarray assays by using the strAV-QD to label the immobilized MB, avoiding target labeling and meanwhile obviating the use of amplification. The MBs are stem-loop structured oligodeoxynucleotides, modified with a thiol and a biotin at two terminals of the stem. With the strAV-QD labeling an "opened" MB rather than a "closed" MB via streptavidin-biotin reaction, a sensitive and specific detection of label-free target DNA sequence is demonstrated by the MB microarray, with a signal-to-background ratio of 8. The immobilized MBs can be perfectly regenerated, allowing the reuse of the microarray. The MB microarray also is able to detect single nucleotide polymorphisms, exhibiting genotype-dependent fluorescence signals. It is demonstrated that the MB microarray can perform as a 4-to-2 encoder, compressing the genotype information into two outputs. PMID:26397421

  18. Quadruplex-single nucleotide polymorphisms (Quad-SNP) influence gene expression difference among individuals

    PubMed Central

    Baral, Aradhita; Kumar, Pankaj; Halder, Rashi; Mani, Prithvi; Yadav, Vinod Kumar; Singh, Ankita; Das, Swapan K.; Chowdhury, Shantanu

    2012-01-01

    Non-canonical guanine quadruplex structures are not only predominant but also conserved among bacterial and mammalian promoters. Moreover recent findings directly implicate quadruplex structures in transcription. These argue for an intrinsic role of the structural motif and thereby posit that single nucleotide polymorphisms (SNP) that compromise the quadruplex architecture could influence function. To test this, we analysed SNPs within quadruplex motifs (Quad-SNP) and gene expression in 270 individuals across four populations (HapMap) representing more than 14 500 genotypes. Findings reveal significant association between quadruplex-SNPs and expression of the corresponding gene in individuals (P < 0.0001). Furthermore, analysis of Quad-SNPs obtained from population-scale sequencing of 1000 human genomes showed relative selection bias against alteration of the structural motif. To directly test the quadruplex-SNP-transcription connection, we constructed a reporter system using the RPS3 promoter—remarkable difference in promoter activity in the ‘quadruplex-destabilized’ versus ‘quadruplex-intact’ promoter was noticed. As a further test, we incorporated a quadruplex motif or its disrupted counterpart within a synthetic promoter reporter construct. The quadruplex motif, and not the disrupted-motif, enhanced transcription in human cell lines of different origin. Together, these findings build direct support for quadruplex-mediated transcription and suggest quadruplex-SNPs may play significant role in mechanistically understanding variations in gene expression among individuals. PMID:22238381

  19. Melting analysis on microbeads in rapid temperature-gradient inside microchannels for single nucleotide polymorphisms detectiona)

    PubMed Central

    Li, Kan-Chien; Ding, Shih-Torng; Lin, En-Chung; Wang, Lon (Alex); Lu, Yen-Wen

    2014-01-01

    A continuous-flow microchip with a temperature gradient in microchannels was utilized to demonstrate spatial melting analysis on microbeads for clinical Single Nucleotide Polymorphisms (SNPs) genotyping on animal genomic DNA. The chip had embedded heaters and thermometers, which created a rapid and yet stable temperature gradient between 60 °C and 85 °C in a short distance as the detection region. The microbeads, which served as mobile supports carrying the target DNA and fluorescent dye, were transported across the temperature gradient. As the surrounding temperature increased, the fluorescence signals of the microbeads decayed with this relationship being acquired as the melting curve. Fast DNA denaturation, as a result of the improved heat transfer and thermal stability due to scaling, was also confirmed. Further, each individual microbead could potentially bear different sequences and pass through the detection region, one by one, for a series of melting analysis, with multiplex, high-throughput capability being possible. A prototype was tested with target DNA samples in different genotypes (i.e., wild and mutant types) with a SNP location from Landrace sows. The melting temperatures were obtained and compared to the ones using a traditional tube-based approach. The results showed similar levels of SNP discrimination, validating our proposed technique for scanning homozygotes and heterozygotes to distinguish single base changes for disease research, drug development, medical diagnostics, agriculture, and animal production. PMID:25553186

  20. Genetic Diversity of Eurycoma longifolia Inferred from Single Nucleotide Polymorphisms1[w

    PubMed Central

    Osman, Asiah; Jordan, Barbara; Lessard, Philip A.; Muhammad, Norwati; Haron, M. Rosli; Riffin, Norifiza Mat; Sinskey, Anthony J.; Rha, ChoKyun; Housman, David E.

    2003-01-01

    Eurycoma longifolia Jack. is a treelet that grows in the forests of Southeast Asia and is widely used throughout the region because of its reported medicinal properties. Widespread harvesting of wild-grown trees has led to rapid thinning of natural populations, causing a potential decrease in genetic diversity among E. longifolia. Suitable genetic markers would be very useful for propagation and breeding programs to support conservation of this species, although no such markers currently exist. To meet this need, we have applied a genome complexity reduction strategy to identify a series of single nucleotide polymorphisms (SNPs) within the genomes of several E. longifolia accessions. We have found that the occurrence of these SNPs reflects the geographic origins of individual plants and can distinguish different natural populations. This work demonstrates the rapid development of molecular genetic markers in species for which little or no genomic sequence information is available. The SNP markers that we have developed in this study will also be useful for identifying genetic fingerprints that correlate with other properties of E. longifolia, such as high regenerability or the appearance of bioactive metabolites. PMID:12644679

  1. Mining the transcriptomes of four commercially important shellfish species for single nucleotide polymorphisms within biomineralization genes.

    PubMed

    Vendrami, David L J; Shah, Abhijeet; Telesca, Luca; Hoffman, Joseph I

    2016-06-01

    Transcriptional profiling not only provides insights into patterns of gene expression, but also generates sequences that can be mined for molecular markers, which in turn can be used for population genetic studies. As part of a large-scale effort to better understand how commercially important European shellfish species may respond to ocean acidification, we therefore mined the transcriptomes of four species (the Pacific oyster Crassostrea gigas, the blue mussel Mytilus edulis, the great scallop Pecten maximus and the blunt gaper Mya truncata) for single nucleotide polymorphisms (SNPs). Illumina data for C. gigas, M. edulis and P. maximus and 454 data for M. truncata were interrogated using GATK and SWAP454 respectively to identify between 8267 and 47,159 high quality SNPs per species (total=121,053 SNPs residing within 34,716 different contigs). We then annotated the transcripts containing SNPs to reveal homology to diverse genes. Finally, as oceanic pH affects the ability of organisms to incorporate calcium carbonate, we honed in on genes implicated in the biomineralization process to identify a total of 1899 SNPs in 157 genes. These provide good candidates for biomarkers with which to study patterns of selection in natural or experimental populations. PMID:26806806

  2. SNPer: An R Library for Quantitative Variant Analysis on Single Nucleotide Polymorphisms among Influenza Virus Populations

    PubMed Central

    Sangket, Unitsa; Vijasika, Sukanya; Noh, Hasnee; Chantratita, Wasun; Klungthong, Chonticha; Yoon, In Kyu; Fernandez, Stefan; Rutvisuttinunt, Wiriya

    2015-01-01

    Influenza virus (IFV) can evolve rapidly leading to genetic drifts and shifts resulting in human and animal influenza epidemics and pandemics. The genetic shift that gave rise to the 2009 influenza A/H1N1 pandemic originated from a triple gene reassortment of avian, swine and human IFVs. More minor genetic alterations in genetic drift can lead to influenza drug resistance such as the H274Y mutation associated with oseltamivir resistance. Hence, a rapid tool to detect IFV mutations and the potential emergence of new virulent strains can better prepare us for seasonal influenza outbreaks as well as potential pandemics. Furthermore, identification of specific mutations by closely examining single nucleotide polymorphisms (SNPs) in IFV sequences is essential to classify potential genetic markers associated with potentially dangerous IFV phenotypes. In this study, we developed a novel R library called “SNPer” to analyze quantitative variants in SNPs among IFV subpopulations. The computational SNPer program was applied to three different subpopulations of published IFV genomic information. SNPer queried SNPs data and grouped the SNPs into (1) universal SNPs, (2) likely common SNPs, and (3) unique SNPs. SNPer outperformed manual visualization in terms of time and labor. SNPer took only three seconds with no errors in SNP comparison events compared with 40 hours with errors using manual visualization. The SNPer tool can accelerate the capacity to capture new and potentially dangerous IFV strains to mitigate future influenza outbreaks. PMID:25876137

  3. Single nucleotide polymorphism discovery in albacore and Atlantic bluefin tuna provides insights into worldwide population structure.

    PubMed

    Albaina, A; Iriondo, M; Velado, I; Laconcha, U; Zarraonaindia, I; Arrizabalaga, H; Pardo, M A; Lutcavage, M; Grant, W S; Estonba, A

    2013-12-01

    The optimal management of the commercially important, but mostly over-exploited, pelagic tunas, albacore (Thunnus alalunga Bonn., 1788) and Atlantic bluefin tuna (BFT; Thunnus thynnus L., 1758), requires a better understanding of population structure than has been provided by previous molecular methods. Despite numerous studies of both species, their population structures remain controversial. This study reports the development of single nucleotide polymorphisms (SNPs) in albacore and BFT and the application of these SNPs to survey genetic variability across the geographic ranges of these tunas. A total of 616 SNPs were discovered in 35 albacore tuna by comparing sequences of 54 nuclear DNA fragments. A panel of 53 SNPs yielded FST values ranging from 0.0 to 0.050 between samples after genotyping 460 albacore collected throughout the distribution of this species. No significant heterogeneity was detected within oceans, but between-ocean comparisons (Atlantic, Pacific and Indian oceans along with Mediterranean Sea) were significant. Additionally, a 17-SNP panel was developed in Atlantic BFT by cross-species amplification in 107 fish. This limited number of SNPs discriminated between samples from the two major spawning areas of Atlantic BFT (FST  = 0.116). The SNP markers developed in this study can be used to genotype large numbers of fish without the need for standardizing alleles among laboratories. PMID:23668670

  4. Discovering All Transcriptome Single-Nucleotide Polymorphisms and Scanning for Selection Signatures in Ducks (Anas platyrhynchos)

    PubMed Central

    Lin, Ruiyi; Du, Xiaoyong; Peng, Sixue; Yang, Liubin; Ma, Yunlong; Gong, Yanzhang; Li, Shijun

    2015-01-01

    The duck is one of the most economically important waterfowl as a source of meat, eggs, and feathers. Characterizing the genetic variation in duck species is an important step toward linking genes or genomic regions with phenotypes. Human-driven selection during duck domestication and subsequent breed formation has likely left detectable signatures in duck genome. In this study, we employed a panel of >1.4 million single-nucleotide polymorphisms (SNPs) identified from the RNA sequencing (RNA-seq) data of 15 duck individuals. The density of the resulting SNPs is significantly positively correlated with the density of genes across the duck genome, which demonstrates that the usage of the RNA-seq data allowed us to enrich variant functional categories, such as coding exons, untranslated regions (UTRs), introns, and downstream/upstream. We performed a complete scan of selection signatures in the ducks using the composite likelihood ratio (CLR) and found 76 candidate regions of selection, many of which harbor genes related to phenotypes relevant to the function of the digestive system and fat metabolism, including TCF7L2, EIF2AK3, ELOVL2, and fatty acid-binding protein family. This study illustrates the potential of population genetic approaches for identifying genomic regions affecting domestication-related phenotypes and further helps to increase the known genetic information about this economically important animal. PMID:26819540

  5. Reverse random amplified microsatellite polymorphism reveals enhanced polymorphisms in the 3' end of simple sequence repeats in the pepper genome.

    PubMed

    Min, Woong-Ki; Han, Jung-Heon; Kang, Won-Hee; Lee, Heung-Ryul; Kim, Byung-Dong

    2008-09-30

    Microsatellites or simple sequence repeats (SSR) are widely distributed in eukaryotic genomes and are informative genetic markers. Despite many advantages of SSR markers such as a high degree of allelic polymorphisms, co-dominant inheritance, multi-allelism, and genome-wide coverage in various plant species, they also have shortcomings such as low polymorphic rates between genetically close lines, especially in Capsicum annuum. We developed an alternative technique to SSR by normalizing and alternating anchored primers in random amplified microsatellite polymorphisms (RAMP). This technique, designated reverse random amplified microsatellite polymorphism (rRAMP), allows the detection of nucleotide variation in the 3' region flanking an SSR using normalized anchored and random primer combinations. The reproducibility and frequency of polymorphic loci in rRAMP was vigorously enhanced by translocation of the 5' anchor of repeat sequences to the 3' end position and selective use of moderate arbitrary primers. In our study, the PCR banding pattern of rRAMP was highly dependent on the frequency of repeat motifs and primer combinations with random primers. Linkage analysis showed that rRAMP markers were well scattered on an intra-specific pepper map. Based on these results, we suggest that this technique is useful for studying genetic diversity, molecular fingerprinting, and rapidly constructing molecular maps for diverse plant species. PMID:18483466

  6. Nucleotide polymorphisms and protein structure changes in the Fg16 gene of Fusarium graminearum sensu stricto

    PubMed Central

    Abedi-Tizaki, Mostafa; Zafari, Doustmorad

    2016-01-01

    Fusarium graminearum is one of the most important causes of wheat scab in different parts of the world. This fungus is able to produce widespread trichothecene mycotoxins such as nivalenol (NIV) and deoxynivalenol (DON) which are harmful for both human and animals. The Fg16 target is located in chromosome 1 of the F. graminearum genome coding for a hypothetical protein whose function is not yet known. The Fg16 gene is involved in lipid biosynthesis and leads to sexual development during colonization in wheat stalks. This gene is used to detect F. graminearum and determine the lineage of F. graminearum complex species. In the present study, polymerase chain reaction–single strand conformational polymorphism (PCR–SSCP) and DNA sequencing methods were employed in screening for genetic variation in 172 F.graminearum s.s. isolates. The PCR reaction forced the amplification of 410-bp fragments of Fg16. Two single nucleotide polymorphisms (T82C and A352T) and one amino acid exchange (C65S) with three patterns (TA/TA, CT/CT and TA/CT genotypes) were found in the Fg16 gene fragment. Two haplotypes, 1A and 1B, were identified within F. graminearum s.s. populations in northern and western regions of Iran. Two different secondary structures of protein were predicted for CT/CT and TA/CT genotypes of Fg16 gene. The average diversity levels detected were relatively high (He: 0.3238; Heu: 0.334; Ho: 0.2894; mean PIC: 0.514; mean Shannon's information index: 0.4132; mean number of alleles per locus: 1.473). On the basis of the obtained results, it was revealed that the Fg16 gene had a high degree of polymorphism that can be considered for future control programming strategies and thus the associations between the SSCP patterns with different traits of F. graminearum such as wheat colonization, perithecium formation on stalk tissues and lineage discrimination should be investigated. PMID:27222818

  7. Nucleotide polymorphisms and protein structure changes in the Fg16 gene of Fusarium graminearum sensu stricto.

    PubMed

    Abedi-Tizaki, Mostafa; Zafari, Doustmorad

    2016-09-01

    Fusarium graminearum is one of the most important causes of wheat scab in different parts of the world. This fungus is able to produce widespread trichothecene mycotoxins such as nivalenol (NIV) and deoxynivalenol (DON) which are harmful for both human and animals. The Fg16 target is located in chromosome 1 of the F. graminearum genome coding for a hypothetical protein whose function is not yet known. The Fg16 gene is involved in lipid biosynthesis and leads to sexual development during colonization in wheat stalks. This gene is used to detect F. graminearum and determine the lineage of F. graminearum complex species. In the present study, polymerase chain reaction-single strand conformational polymorphism (PCR-SSCP) and DNA sequencing methods were employed in screening for genetic variation in 172 F. graminearum s.s. isolates. The PCR reaction forced the amplification of 410-bp fragments of Fg16. Two single nucleotide polymorphisms (T82C and A352T) and one amino acid exchange (C65S) with three patterns (TA/TA, CT/CT and TA/CT genotypes) were found in the Fg16 gene fragment. Two haplotypes, 1A and 1B, were identified within F. graminearum s.s. populations in northern and western regions of Iran. Two different secondary structures of protein were predicted for CT/CT and TA/CT genotypes of Fg16 gene. The average diversity levels detected were relatively high (He: 0.3238; Heu: 0.334; Ho: 0.2894; mean PIC: 0.514; mean Shannon's information index: 0.4132; mean number of alleles per locus: 1.473). On the basis of the obtained results, it was revealed that the Fg16 gene had a high degree of polymorphism that can be considered for future control programming strategies and thus the associations between the SSCP patterns with different traits of F. graminearum such as wheat colonization, perithecium formation on stalk tissues and lineage discrimination should be investigated. PMID:27222818

  8. Nucleotide sequence of Bacillus phage Nf terminal protein gene.

    PubMed Central

    Leavitt, M C; Ito, J

    1987-01-01

    The nucleotide sequence of Bacillus phage Nf gene E has been determined. Gene E codes for phage terminal protein which is the primer necessary for the initiation of DNA replication. The deduced amino acid sequence of Nf terminal protein is approximately 66% homologous with the terminal proteins of Bacillus phages PZA and luminal diameter 29, and shows similar hydropathy and secondary structure predictions. A serine which has been identified as the residue which covalently links the protein to the 5' end of the genome in luminal diameter 29, is conserved in all three phages. The hydropathic and secondary structural environment of this serine is similar in these phage terminal proteins and also similar to the linking serine of adenovirus terminal protein. PMID:3601672

  9. Allele-specific polymerase chain reaction for the detection of Alzheimer’s disease-related single nucleotide polymorphisms

    PubMed Central

    2013-01-01

    Background The incidence of Alzheimer’s disease, particularly in developing countries, is expected to increase exponentially as the population ages. Continuing research in this area is essential in order to better understand this disease and develop strategies for treatment and prevention. Genome-wide association studies have identified several loci as genetic risk factors of AD aside from apolipoprotein E such as bridging integrator (BIN1), clusterin (CLU), ATP-binding cassette sub-family A member 7 (ABCA7), complement receptor 1 (CR1) and phosphatidylinositol binding clathrin assembly protein (PICALM). However genetic research in developing countries is often limited by lack of funding and expertise. This study therefore developed and validated a simple, cost effective polymerase chain reaction based technique to determine these single nucleotide polymorphisms. Methods An allele-specific PCR method was developed to detect single nucleotide polymorphisms of BIN1 rs744373, CLU rs11136000, ABCA7 rs3764650, CR1 rs3818361 and PICALM rs3851179 in human DNA samples. Allele-specific primers were designed by using appropriate software to permit the PCR amplification only if the nucleotide at the 3’-end of the primer complemented the base at the wild-type or variant-type DNA sample. The primers were then searched for uniqueness using the Basic Local Alignment Search Tool search engine. Results The assay was tested on a hundred samples and accurately detected the homozygous wild-type, homozygous variant-type and heterozygous of each SNP. Validation was by direct DNA sequencing. Conclusion This method will enable researchers to carry out genetic polymorphism studies for genetic risk factors associated with late-onset Alzheimer’s disease (BIN1, CLU, ABCA7, CR1 and PICALM) without the use of expensive instrumentation and reagents. PMID:23419238

  10. Nucleotide sequences specific to Brucella and methods for the detection of Brucella

    DOEpatents

    McCready, Paula M.; Radnedge, Lyndsay; Andersen, Gary L.; Ott, Linda L.; Slezak, Thomas R.; Kuczmarski, Thomas A.

    2009-02-24

    Nucleotide sequences specific to Brucella that serves as a marker or signature for identification of this bacterium were identified. In addition, forward and reverse primers and hybridization probes derived from these nucleotide sequences that are used in nucleotide detection methods to detect the presence of the bacterium are disclosed.