Science.gov

Sample records for developing snp markers

  1. SNP2CAPS: a SNP and INDEL analysis tool for CAPS marker development.

    PubMed

    Thiel, Thomas; Kota, Raja; Grosse, Ivo; Stein, Nils; Graner, Andreas

    2004-01-02

    With the influx of various SNP genotyping assays in recent years, there has been a need for an assay that is robust, yet cost effective, and could be performed using standard gel-based procedures. In this context, CAPS markers have been shown to meet these criteria. However, converting SNPs to CAPS markers can be a difficult process if done manually. In order to address this problem, we describe a computer program, SNP2CAPS, that facilitates the computational conversion of SNP markers into CAPS markers. 413 multiple aligned sequences derived from barley ESTs were analysed for the presence of polymorphisms in 235 distinct restriction sites. 282 (90%) of 314 alignments that contain sequence variation due to SNPs and InDels revealed at least one polymorphic restriction site. After reducing the number of restriction enzymes from 235 to 10, 31% of the polymorphic sites could still be detected. In order to demonstrate the usefulness of this tool for marker development, we experimentally validated some of the results predicted by SNP2CAPS.

  2. Development of Single Nucleotide Polymorphism (SNP) Markers for Use in Commercial Maize (Zea Mays L.) Germplasm

    USDA-ARS?s Scientific Manuscript database

    The development of single nucleotide polymorphism (SNP) markers in maize offer the opportunity to utilize DNA markers in many new areas of population genetics, gene discovery, plant breeding, and germplasm identification. However, the steps from sequencing and SNP discovery to SNP marker design and ...

  3. SKM-SNP: SNP markers detection method.

    PubMed

    Liu, Yang; Li, Mark; Cheung, Yiu M; Sham, Pak C; Ng, Michael K

    2010-04-01

    SKM-SNP, SNP markers detection program, is proposed to identify a set of relevant SNPs for the association between a disease and multiple marker genotypes. We employ a subspace categorical clustering algorithm to compute a weight for each SNP in the group of patient samples and the group of normal samples, and use the weights to identify the subsets of relevant SNPs that categorize these two groups. The experiments on both Schizophrenia and Parkinson Disease data sets containing genome-wide SNPs are reported to demonstrate the program. Results indicate that our method can find some relevant SNPs that categorize the disease samples. The online SKM-SNP program is available at http://www.math.hkbu.edu.hk/~mng/SKM-SNP/SKM-SNP.html.

  4. Development of discrimination SNP markers for Hanwoo (Korean native cattle).

    PubMed

    Cheong, H S; Kim, L H; Namgoong, S; Shin, H D

    2013-07-01

    In the Korean meat market, the native cattle, Hanwoo beef, are preferred over imported beef and domestic Holstein beef despite its relatively high price. In order to hold the beef industry accountable and support consumers' right to know, correct beef-origin labeling is required. For this purpose, we developed 90 single-nucleotide polymorphism markers to discriminate between Hanwoo and other breeds including Holstein using 1602 cattle DNAs. The probability of discrimination was found to be 100% in a subsequent validation set consisting of 632 DNAs. Our study suggests that improved beef-origin discrimination can be achieved by using a combined genetic model that takes into account small genetic differences among a large number of markers. These markers could be useful for discriminating between Hanwoo and imported breeds including domestic Holsteins, and would contribute to the prevention of falsified beef origin.

  5. Developing single nucleotide polymorphism (SNP) markers from transcriptome sequences for identification of longan (Dimocarpus longan) germplasm

    PubMed Central

    Wang, Boyi; Tan, Hua-Wei; Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Matsumoto, Tracie; Zhang, Dapeng

    2015-01-01

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in 50 longan germplasm accessions, including cultivated varieties and wild germplasm; and designated 25 SNP markers that unambiguously identified all tested longan varieties with high statistical rigor (P<0.0001). Multiple trees from the same clone were verified and off-type trees were identified. Diversity analysis revealed genetic relationships among analyzed accessions. Cultivated varieties differed significantly from wild populations (Fst=0.300; P<0.001), demonstrating untapped genetic diversity for germplasm conservation and utilization. Within cultivated varieties, apparent differences between varieties from China and those from Thailand and Hawaii indicated geographic patterns of genetic differentiation. These SNP markers provide a powerful tool to manage longan genetic resources and breeding, with accurate and efficient genotype identification. PMID:26504559

  6. Report on the development of putative functional SSR and SNP markers in passion fruits.

    PubMed

    da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro

    2017-09-06

    Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis, (the passion fruit's main bacterial pathogen that attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.

  7. Developing Single Nucleotide Polymorphism (SNP) markers from transcriptome sequences for the identification of longan (Dimocarpus longan) germplasm

    USDA-ARS?s Scientific Manuscript database

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...

  8. SNP identification and SNAP marker development for a GmNARK gene controlling supernodulation in soybean.

    PubMed

    Kim, M Y; Van, K; Lestari, P; Moon, J-K; Lee, S-H

    2005-04-01

    Supernodulation in soybean (Glycine max L. Merr.) is an important source of nitrogen supply to subterranean ecological systems. Single nucleotide-amplified polymorphism (SNAP) markers for supernodulation should allow rapid screening of the trait in early growth stages, without the need for inoculation and phenotyping. The gene GmNARK (Glycine max nodule autoregulation receptor kinase), controlling autoregulation of nodulation, was found to have a single nucleotide polymorphism (SNP) between the wild-type cultivar Sinpaldalkong 2 and its supernodulating mutant, SS2-2. Transversion of A to T at the 959-bp position of the GmNARK sequence results in a change of lysine (AAG) to a stop codon (TAG), thus terminating its translation in SS2-2. Based on the identified SNP in GmNARK, five primer pairs specific to each allele were designed using the WebSnaper program to develop a SNAP marker for supernodulation. One A-specific primer pair produced a band present in only Sinpaldalkong 2, while two T-specific pairs showed a band in only SS2-2. Both complementary PCRs, using each allele-specific primer pair were performed to genotype supernodulation against F2 progeny of Sinpaldalkong 2 x SS2-2. Among 28 individuals with the normal phenotype, eight individuals having only the A-allele-specific band were homozygous and normal, while 20 individuals were found to be heterozygous at the SNP having both A and T bands. Twelve supernodulating individuals showed only the band specific to the T allele. This SNAP marker for supernodulation could easily be analyzed through simple PCR and agarose gel electrophoresis. Therefore, use of this SNAP marker might be faster, cheaper, and more reproducible than using other genotyping methods, such as a cleaved amplified polymorphic sequence marker, which demand of restriction enzymes.

  9. Development of SNP markers identifying European wildcats, domestic cats, and their admixed progeny.

    PubMed

    Nussberger, B; Greminger, M P; Grossen, C; Keller, L F; Wandeler, P

    2013-05-01

    Introgression can be an important evolutionary force but it can also lead to species extinction and as such is a crucial issue for species conservation. However, introgression is difficult to detect, morphologically as well as genetically. Hybridization with domestic cats (Felis silvestris catus) is a major concern for the conservation of European wildcats (Felis s. silvestris). The available morphologic and genetic markers for the two Felis subspecies are not sufficient to reliably detect hybrids beyond first generation. Here we present a single nucleotide polymorphism (SNP) based approach that allows the identification of introgressed individuals. Using high-throughput sequencing of reduced representation libraries we developed a diagnostic marker set containing 48 SNPs (Fst > 0.8) which allows the identification of wildcats, domestic cats, their hybrids and backcrosses. This allows assessing introgression rate in natural wildcat populations and is key for a better understanding of hybridization processes.

  10. SNP marker development for linkage map construction, anchoring of the common bean whole genome sequence and genetic research

    USDA-ARS?s Scientific Manuscript database

    Our objectives were to identify SNP DNA markers based on a diverse set of common bean cultivars via next generation sequencing technologies; to develop Illumina Infinium BeadChip assays containing SNPs with high polymorphism within and between common bean market classes, to create high density genet...

  11. Development of EST-based SNP and InDel markers and their utilization in tetraploid cotton genetic mapping

    USDA-ARS?s Scientific Manuscript database

    Expressed sequence tags (ESTs) were analyzed in silico in order to identify single nucleotide polymorphisms (SNPs) and insertion-deletion polymorphisms (InDels) in cotton. A total of 1349 EST-based SNP and InDel markers were developed by comparing ESTs between Gossypium hirsutum and G. barbadense, m...

  12. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication

    PubMed Central

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-01-01

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to “Gopoong” and “K-1” were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information. PMID:27271615

  13. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

    PubMed

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-06-04

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.

  14. Development of SNP markers for genes of the phenylpropanoid pathway and their association to kernel and malting traits in barley

    PubMed Central

    2013-01-01

    Background Flavonoids are an important class of secondary compounds in angiosperms. Next to certain biological functions in plants, they play a role in the brewing process and have an effect on taste, color and aroma of beer. The aim of this study was to reveal the haplotype diversity of candidate genes involved in the phenylpropanoid biosynthesis pathway in cultivated barley varieties (Hordeum vulgare L.) and to determine associations to kernel and malting quality parameters. Results Five genes encoding phenylalanine ammonia-lyase (PAL), cinnamate 4-hydroxylase (C4H), chalcone synthase (CHS), flavanone 3-hydroxylase (F3H) and dihydroflavonol reductase (DFR) of the phenylpropanoid biosynthesis pathway were partially resequenced in 16 diverse barley reference genotypes. Their localization in the barley genome, their genetic structure, and their genetic variation e.g. single nucleotide polymorphism (SNP) and Insertion/Deletion (InDel) patterns were revealed. In total, 130 SNPs and seven InDels were detected. Of these, 21 polymorphisms were converted into high-throughput pyrosequencing markers. The resulting SNP and haplotype patterns were used to calculate associations with kernel and malting quality parameters. Conclusions SNP patterns were found to be highly variable for the investigated genes. The developed high-throughput markers are applicable for assessing the genetic variability and for the determination of haplotype patterns in a set of barley accessions. The candidate genes PAL, C4H and F3H were shown to be associated to several malting properties like glassiness (PAL), viscosity (C4H) or to final attenuation (F3H). PMID:24088365

  15. De novo assembly and transcriptome analysis of the rubber tree (Hevea brasiliensis) and SNP markers development for rubber biosynthesis pathways.

    PubMed

    Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira

    2014-01-01

    Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection.

  16. De Novo Assembly and Transcriptome Analysis of the Rubber Tree (Hevea brasiliensis) and SNP Markers Development for Rubber Biosynthesis Pathways

    PubMed Central

    Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira

    2014-01-01

    Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection. PMID:25048025

  17. Development of new SNP derived cleaved amplified polymorphic sequence marker set and its successful utilization in the genetic analysis of seed color variation in barley.

    PubMed

    Bungartz, Annemarie; Klaus, Marius; Mathew, Boby; Léon, Jens; Naz, Ali Ahmad

    2016-03-01

    The aim of the present study was to develop a new cost effective PCR based CAPS marker set using advantages of high-throughput SNP genotyping. Initially, SNP survey was made using 20 diverse barley genotypes via 9k iSelect array genotyping that resulted in 6334 polymorphic SNP markers. Principle component analysis using this marker data showed fine differentiation of barley diverse gene pool. Till this end, we developed 200 SNP derived CAPS markers distributed across the genome covering around 991cM with an average marker density of 5.09cM. Further, we genotyped 68 CAPS markers in an F2 population (Cheri×ICB181160) segregating for seed color variation in barley. Genetic mapping of seed color revealed putative linkage of single nuclear gene on chromosome 1H. These findings showed the proof of concept for the development and utility of a newer cost effective genomic tool kit to analyze broader genetic resources of barley worldwide.

  18. Fine Mapping for Identification of Citrus Alternaria Brown Spot Candidate Resistance Genes and Development of New SNP Markers for Marker-Assisted Selection.

    PubMed

    Cuenca, Jose; Aleza, Pablo; Garcia-Lor, Andres; Ollitrault, Patrick; Navarro, Luis

    2016-01-01

    Alternaria brown spot (ABS) is a serious disease affecting susceptible citrus genotypes, which is a strong concern regarding citrus breeding programs. Resistance is conferred by a recessive locus (ABSr) previously located by our group within a 3.3 Mb genome region near the centromere in chromosome III. This work addresses fine-linkage mapping of this region for identifying candidate resistance genes and develops new molecular markers for ABS-resistance effective marker-assisted selection (MAS). Markers closely linked to ABSr locus were used for fine mapping using a 268-segregating diploid progeny derived from a heterozygous susceptible × resistant cross. Fine mapping limited the genomic region containing the ABSr resistance gene to 366 kb, flanked by markers at 0.4 and 0.7 cM. This region contains nine genes related to pathogen resistance. Among them, eight are resistance (R) gene homologs, with two of them harboring a serine/threonine protein kinase domain. These two genes along with a gene encoding a S-adenosyl-L-methionine-dependent-methyltransferase protein, should be considered as strong candidates for ABS-resistance. Moreover, the closest SNP was genotyped in 40 citrus varieties, revealing very high association with the resistant/susceptible phenotype. This new marker is currently used in our citrus breeding program for ABS-resistant parent and cultivar selection, at diploid, triploid and tetraploid level.

  19. Fine Mapping for Identification of Citrus Alternaria Brown Spot Candidate Resistance Genes and Development of New SNP Markers for Marker-Assisted Selection

    PubMed Central

    Cuenca, Jose; Aleza, Pablo; Garcia-Lor, Andres; Ollitrault, Patrick; Navarro, Luis

    2016-01-01

    Alternaria brown spot (ABS) is a serious disease affecting susceptible citrus genotypes, which is a strong concern regarding citrus breeding programs. Resistance is conferred by a recessive locus (ABSr) previously located by our group within a 3.3 Mb genome region near the centromere in chromosome III. This work addresses fine-linkage mapping of this region for identifying candidate resistance genes and develops new molecular markers for ABS-resistance effective marker-assisted selection (MAS). Markers closely linked to ABSr locus were used for fine mapping using a 268-segregating diploid progeny derived from a heterozygous susceptible × resistant cross. Fine mapping limited the genomic region containing the ABSr resistance gene to 366 kb, flanked by markers at 0.4 and 0.7 cM. This region contains nine genes related to pathogen resistance. Among them, eight are resistance (R) gene homologs, with two of them harboring a serine/threonine protein kinase domain. These two genes along with a gene encoding a S-adenosyl-L-methionine-dependent-methyltransferase protein, should be considered as strong candidates for ABS-resistance. Moreover, the closest SNP was genotyped in 40 citrus varieties, revealing very high association with the resistant/susceptible phenotype. This new marker is currently used in our citrus breeding program for ABS-resistant parent and cultivar selection, at diploid, triploid and tetraploid level. PMID:28066498

  20. SNP discovery and marker development for disease resistance candidate genes in common carp (Cyprinus carpio)

    USDA-ARS?s Scientific Manuscript database

    Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers of susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpes virus 3 (CyHV-3) is highly contagious and virulent in common carp. With the aim to investigate the gene...

  1. SNP Detection from De Novo Transcriptome Sequencing in the Bivalve Macoma balthica: Marker Development for Evolutionary Studies

    PubMed Central

    Becquet, Vanessa; Belkhir, Khalid; Bierne, Nicolas; Garcia, Pascale

    2012-01-01

    Hybrid zones are noteworthy systems for the study of environmental adaptation to fast-changing environments, as they constitute reservoirs of polymorphism and are key to the maintenance of biodiversity. They can move in relation to climate fluctuations, as temperature can affect both selection and migration, or remain trapped by environmental and physical barriers. There is therefore a very strong incentive to study the dynamics of hybrid zones subjected to climate variations. The infaunal bivalve Macoma balthica emerges as a noteworthy model species, as divergent lineages hybridize, and its native NE Atlantic range is currently contracting to the North. To investigate the dynamics and functioning of hybrid zones in M. balthica, we developed new molecular markers by sequencing the collective transcriptome of 30 individuals. Ten individuals were pooled for each of the three populations sampled at the margins of two hybrid zones. A single 454 run generated 277 Mb from which 17K SNPs were detected. SNP density averaged 1 polymorphic site every 14 to 19 bases, for mitochondrial and nuclear loci, respectively. An scan detected high genetic divergence among several hundred SNPs, some of them involved in energetic metabolism, cellular respiration and physiological stress. The high population differentiation, recorded for nuclear-encoded ATP synthase and NADH dehydrogenase as well as most mitochondrial loci, suggests cytonuclear genetic incompatibilities. Results from this study will help pave the way to a high-resolution study of hybrid zone dynamics in M. balthica, and the relative importance of endogenous and exogenous barriers to gene flow in this system. PMID:23300636

  2. SNP marker detection and genotyping in tilapia.

    PubMed

    Van Bers, N E M; Crooijmans, R P M A; Groenen, M A M; Dibbits, B W; Komen, J

    2012-09-01

    We have generated a unique resource consisting of nearly 175 000 short contig sequences and 3569 SNP markers from the widely cultured GIFT (Genetically Improved Farmed Tilapia) strain of Nile tilapia (Oreochromis niloticus). In total, 384 SNPs were selected to monitor the wider applicability of the SNPs by genotyping tilapia individuals from different strains and different geographical locations. In all strains and species tested (O. niloticus, O. aureus and O. mossambicus), the genotyping assay was working for a similar number of SNPs (288-305 SNPs). The actual number of polymorphic SNPs was, as expected, highest for individuals from the GIFT population (255 SNPs). In the individuals from an Egyptian strain and in individuals caught in the wild in the basin of the river Volta, 197 and 163 SNPs were polymorphic, respectively. A pairwise calculation of Nei's genetic distance allowed the discrimination of the individual strains and species based on the genotypes determined with the SNP set. We expect that this set will be widely applicable for use in tilapia aquaculture, e.g. for pedigree reconstruction. In addition, this set is currently used for assaying the genetic diversity of native Nile tilapia in areas where tilapia is, or will be, introduced in aquaculture projects. This allows the tracing of escapees from aquaculture and the monitoring of effects of introgression and hybridization.

  3. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple

    USDA-ARS?s Scientific Manuscript database

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide...

  4. Development of SNP markers and their application for genetic diversity analysis in the oil palm (Elaeis guineensis).

    PubMed

    Ong, P W; Maizura, I; Abdullah, N A P; Rafii, M Y; Ooi, L C L; Low, E T L; Singh, R

    2015-10-09

    The genetic evaluation of oil palm germplasm collections is required for insight into the variability among populations. The information obtained is also useful for incorporating new genetic materials into current breeding programs. Single nucleotide polymorphisms (SNPs) have been widely used in many plant genetic studies due to the availability of large numbers of genomic sequences and expressed sequence tags. The present study examined 219 oil palms collected from two natural Angolan populations, a few hundred kilometers apart. A total of 62 SNPs were designed from oil palm genomic sequences and converted to cleaved amplified polymorphic sequence (CAPS). Of these, nine were found to be informative across the two populations. The nine informative SNPs revealed mean major allele frequency of 0.693. The average expected and observed heterozygosities were 0.398 and 0.400, respectively. The mean polymorphism information content was 0.315 (ranging between 0.223 and 0.375). None of the loci deviated from Hardy-Weinberg equilibrium and no rare alleles were detected. In cluster analysis using unweighted pair group method with arithmetic, the 219 oil palms fell into two clusters. This was further supported by the population structure analysis result (K = 2), suggesting that the samples were divided into two main genetic groups. However, the two groups did not coincide with the geographic populations. Analysis of molecular variance indicated that within-population variation contributed 93% of the total genetic variation. This study showed that SNP-based CAPS markers are useful for studying the genetic diversity of oil palm and have potential application for marker-trait association studies.

  5. Development of loop-mediated isothermal amplification (LAMP)-based SNP markers for shelf-life in melon (Cucumis melo L.).

    PubMed

    Fukuta, Shiro; Mizukami, Yuko; Ishida, Akira; Kanbe, Michio

    2006-01-01

    In this study, LAMP markers linked to shelf-life in melon (Cucumis melo L.) were developed by converting a cleaved amplified polymorphic sequences (CAPS) marker (C2). The CAPS-PCR fragments from the long-shelf-life melon (O-3) and short-shelf-life melon (Nat-2) were cloned and sequenced to construct LAMP primers. A single nucleotide polymorphism (SNP) was identified between O-3 and Nat-2. LAMP primers were designed to detect the SNP. In the LAMP reaction to detect long-shelf-life melon, the turbidity of the templates using O-3, F1, homozygous long-shelf-life F2 lines and heterozygous long-shelf-life F2 lines started to increase after 40 min. In contrast, the turbidity of Nat-2 and homozygous short-shelf-life F2 lines did not increase even after 90 min. In the LAMP reaction to detect short-shelf-life melon, the turbidity of the templates using Nat-2, F1, homozygous short-shelf-life F2 lines and heterozygous long-shelf-life F2 lines started to increase after 40 min. But the turbidity of O-3 and homozygous long-shelf-life F2 lines did not increase after 90 min. This attests to the high reliability and usefulness of LAMP for marker-assisted selection.

  6. Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers

    PubMed Central

    Atwood, Tressa S.; Currey, Mark C.; Shiver, Anthony L.; Lewis, Zachary A.; Selker, Eric U.; Cresko, William A.; Johnson, Eric A.

    2008-01-01

    Single nucleotide polymorphism (SNP) discovery and genotyping are essential to genetic mapping. There remains a need for a simple, inexpensive platform that allows high-density SNP discovery and genotyping in large populations. Here we describe the sequencing of restriction-site associated DNA (RAD) tags, which identified more than 13,000 SNPs, and mapped three traits in two model organisms, using less than half the capacity of one Illumina sequencing run. We demonstrated that different marker densities can be attained by choice of restriction enzyme. Furthermore, we developed a barcoding system for sample multiplexing and fine mapped the genetic basis of lateral plate armor loss in threespine stickleback by identifying recombinant breakpoints in F2 individuals. Barcoding also facilitated mapping of a second trait, a reduction of pelvic structure, by in silico re-sorting of individuals. To further demonstrate the ease of the RAD sequencing approach we identified polymorphic markers and mapped an induced mutation in Neurospora crassa. Sequencing of RAD markers is an integrated platform for SNP discovery and genotyping. This approach should be widely applicable to genetic mapping in a variety of organisms. PMID:18852878

  7. SNP identification and allelic-specific PCR markers development for TaGW2, a gene linked to wheat kernel weight.

    PubMed

    Yang, Zibo; Bai, Zhiyuan; Li, Xiaolin; Wang, Pei; Wu, Qingxia; Yang, Lin; Li, Liqun; Li, Xuejun

    2012-09-01

    TaGW2, an orthologous gene of rice OsGW2, has been associated with kernel width and weight of bread wheat (Triticum aestivum). Difference in TaGW2 coding sequence was not found among different wheat varieties in previous researches. In this study, we found eight exons and seven introns in TaGW2 with a full-length cDNA sequence of 1,275 bp, which contains a conserved function domain and seven splice sites that shared homology with rice OsGW2. A single T-base insertion in the eighth exon of TaGW2 on chromosome 6A was detected in a large-kernel wheat variety, Lankaodali. This insertion mutation reduces the coding protein sequence from normal 424 amino acids (~47.2 kDa) to 328 amino acids (~37.1 kDa) by truncating 96 amino acids. The result was validated by identifying histidine-tagged TaGW2 proteins encoded by both alleles of the mutant and the wild types in SDS-PAGE. Allele-specific PCR markers were developed based on the single nucleotide polymorphism (SNP) site. The SNP markers were genotyped for an F(2) segregation population from the cross of Lankaodali × Chinese Spring. Seed traits of F(2:3) families were evaluated in three different environments. The association analysis indicated that F(2:3) families with the mutated TaGW2 allele significantly increased kernel width (KW) and thousand-kernel weight (TKW), and slightly improved kernel length (KL). Using the SNP markers, another two varieties harbored the mutated TaGW2 allele were successfully identified from 22 additional wheat varieties, and they both have large KW and TKW. Cloning and sequencing of the gene further confirmed the functions of the mutated allele of TaGW2 in the two large kernel varieties. The results suggested that TaGW2 may negatively regulate kernel size variation, which shares the same function as OsGW2 in rice. The successful development of SNP markers provides a useful tool for improving kernel yield in wheat.

  8. Development of single nucleotide polymorphism (SNP) markers from the mango (Mangiferaindica) transcriptome for mapping and estimation of genetic diversity

    USDA-ARS?s Scientific Manuscript database

    The development of resources for genomic studies in Mangifera indica (mango) will allow marker-assisted selection and identification of genetically diverse germplasm, greatly aiding mango breeding programs. We report here a first step in developing such resources, our identification of thousands una...

  9. Development of COS-SNP and HRM markers for high-throughput and reliable haplotype-based detection of Lr14a in durum wheat (Triticum durum Desf.).

    PubMed

    Terracciano, Irma; Maccaferri, Marco; Bassi, Filippo; Mantovani, Paola; Sanguineti, Maria C; Salvi, Silvio; Simková, Hana; Doležel, Jaroslav; Massi, Andrea; Ammar, Karim; Kolmer, James; Tuberosa, Roberto

    2013-04-01

    Leaf rust (Puccinia triticina Eriks. & Henn.) is a major disease affecting durum wheat production. The Lr14a-resistant gene present in the durum wheat cv. Creso and its derivative cv. Colosseo is one of the best characterized leaf-rust resistance sources deployed in durum wheat breeding. Lr14a has been mapped close to the simple sequence repeat markers gwm146, gwm344 and wmc10 in the distal portion of the chromosome arm 7BL, a gene-dense region. The objectives of this study were: (1) to enrich the Lr14a region with single nucleotide polymorphisms (SNPs) and high-resolution melting (HRM)-based markers developed from conserved ortholog set (COS) genes and from sequenced Diversity Array Technology (DArT(®)) markers; (2) to further investigate the gene content and colinearity of this region with the Brachypodium and rice genomes. Ten new COS-SNP and five HRM markers were mapped within an 8.0 cM interval spanning Lr14a. Two HRM markers pinpointed the locus in an interval of <1.0 cM and eight COS-SNPs were mapped 2.1-4.1 cM distal to Lr14a. Each marker was tested for its capacity to predict the state of Lr14a alleles (in particular, Lr14-Creso associated to resistance) in a panel of durum wheat elite germplasm including 164 accessions. Two of the most informative markers were converted into KASPar(®) markers. Single assay markers ubw14 and wPt-4038-HRM designed for agarose gel electrophoresis/KASPar(®) assays and high-resolution melting analysis, respectively, as well as the double-marker combinations ubw14/ubw18, ubw14/ubw35 and wPt-4038-HRM-ubw35 will be useful for germplasm haplotyping and for molecular-assisted breeding.

  10. Development and dissection of diagnostic SNP markers for the downy mildew resistance genes Pl Arg and Pl 8 and maker-assisted gene pyramiding in sunflower (Helianthus annuus L.).

    PubMed

    Qi, L L; Talukder, Z I; Hulke, B S; Foley, M E

    2017-02-03

    Diagnostic DNA markers are an invaluable resource in breeding programs for successful introgression and pyramiding of disease resistance genes. Resistance to downy mildew (DM) disease in sunflower is mediated by Pl genes which are known to be effective against the causal fungus, Plasmopara halstedii. Two DM resistance genes, Pl Arg and Pl 8 , are highly effective against P. halstedii races in the USA, and have been previously mapped to the sunflower linkage groups (LGs) 1 and 13, respectively, using simple sequence repeat (SSR) markers. In this study, we developed high-density single nucleotide polymorphism (SNP) maps encompassing the Pl arg and Pl 8 genes and identified diagnostic SNP markers closely linked to these genes. The specificity of the diagnostic markers was validated in a highly diverse panel of 548 sunflower lines. Dissection of a large marker cluster co-segregated with Pl Arg revealed that the closest SNP markers NSA_007595 and NSA_001835 delimited Pl Arg to an interval of 2.83 Mb on the LG1 physical map. The SNP markers SFW01497 and SFW06597 delimited Pl 8 to an interval of 2.85 Mb on the LG13 physical map. We also developed sunflower lines with homozygous, three gene pyramids carrying Pl Arg , Pl 8 , and the sunflower rust resistance gene R 12 using the linked SNP markers from a segregating F2 population of RHA 340 (carrying Pl 8 )/RHA 464 (carrying Pl Arg and R 12 ). The high-throughput diagnostic SNP markers developed in this study will facilitate marker-assisted selection breeding, and the pyramided sunflower lines will provide durable resistance to downy mildew and rust diseases.

  11. SNP Markers and Their Impact on Plant Breeding

    PubMed Central

    Mammadov, Jafar; Aggarwal, Rajat; Buyyarapu, Ramesh; Kumpatla, Siva

    2012-01-01

    The use of molecular markers has revolutionized the pace and precision of plant genetic analysis which in turn facilitated the implementation of molecular breeding of crops. The last three decades have seen tremendous advances in the evolution of marker systems and the respective detection platforms. Markers based on single nucleotide polymorphisms (SNPs) have rapidly gained the center stage of molecular genetics during the recent years due to their abundance in the genomes and their amenability for high-throughput detection formats and platforms. Computational approaches dominate SNP discovery methods due to the ever-increasing sequence information in public databases; however, complex genomes pose special challenges in the identification of informative SNPs warranting alternative strategies in those crops. Many genotyping platforms and chemistries have become available making the use of SNPs even more attractive and efficient. This paper provides a review of historical and current efforts in the development, validation, and application of SNP markers in QTL/gene discovery and plant breeding by discussing key experimental strategies and cases exemplifying their impact. PMID:23316221

  12. Large-scale development of cost-effective SNP marker assays for diversity assessment and genetic mapping in chickpea and comparative mapping in legumes

    PubMed Central

    Hiremath, Pavana J; Kumar, Ashish; Penmetsa, Ramachandra Varma; Farmer, Andrew; Schlueter, Jessica A; Chamarthi, Siva K; Whaley, Adam M; Carrasquilla-Garcia, Noelia; Gaur, Pooran M; Upadhyaya, Hari D; Kavi Kishor, Polavarapu B; Shah, Trushar M; Cook, Douglas R; Varshney, Rajeev K

    2012-01-01

    A set of 2486 single nucleotide polymorphisms (SNPs) were compiled in chickpea using four approaches, namely (i) Solexa/Illumina sequencing (1409), (ii) amplicon sequencing of tentative orthologous genes (TOGs) (604), (iii) mining of expressed sequence tags (ESTs) (286) and (iv) sequencing of candidate genes (187). Conversion of these SNPs to the cost-effective and flexible throughput Competitive Allele Specific PCR (KASPar) assays generated successful assays for 2005 SNPs. These marker assays have been designated as Chickpea KASPar Assay Markers (CKAMs). Screening of 70 genotypes including 58 diverse chickpea accessions and 12 BC3F2 lines showed 1341 CKAMs as being polymorphic. Genetic analysis of these data clustered chickpea accessions based on geographical origin. Genotyping data generated for 671 CKAMs on the reference mapping population (Cicer arietinum ICC 4958 × Cicer reticulatum PI 489777) were compiled with 317 unpublished TOG-SNPs and 396 published markers for developing the genetic map. As a result, a second-generation genetic map comprising 1328 marker loci including novel 625 CKAMs, 314 TOG-SNPs and 389 published marker loci with an average inter-marker distance of 0.59 cM was constructed. Detailed analyses of 1064 mapped loci of this second-generation chickpea genetic map showed a higher degree of synteny with genome of Medicago truncatula, followed by Glycine max, Lotus japonicus and least with Vigna unguiculata. Development of these cost-effective CKAMs for SNP genotyping will be useful not only for genetics research and breeding applications in chickpea, but also for utilizing genome information from other sequenced or model legumes. PMID:22703242

  13. Development of genotyping by sequencing (GBS) and array derived SNP markers for stem rust resistance gene Sr42

    USDA-ARS?s Scientific Manuscript database

    The stem rust fungus, particularly race TTKSK (Ug99), poses a serious threat to world wheat production. Gene Sr42 or SrCad (which could be the same gene or an allele of Sr42) is effective against race TTKSK. However, known genetic markers for Sr42 are mostly SSR markers which are generally labor i...

  14. SNP marker discovery in koala TLR genes.

    PubMed

    Cui, Jian; Frankham, Greta J; Johnson, Rebecca N; Polkinghorne, Adam; Timms, Peter; O'Meally, Denis; Cheng, Yuanyuan; Belov, Katherine

    2015-01-01

    Toll-like receptors (TLRs) play a crucial role in the early defence against invading pathogens, yet our understanding of TLRs in marsupial immunity is limited. Here, we describe the characterisation of nine TLRs from a koala immune tissue transcriptome and one TLR from a draft sequence of the koala genome and the subsequent development of an assay to study genetic diversity in these genes. We surveyed genetic diversity in 20 koalas from New South Wales, Australia and showed that one gene, TLR10 is monomorphic, while the other nine TLR genes have between two and 12 alleles. 40 SNPs (16 non-synonymous) were identified across the ten TLR genes. These markers provide a springboard to future studies on innate immunity in the koala, a species under threat from two major infectious diseases.

  15. SNP Marker Discovery in Koala TLR Genes

    PubMed Central

    Cui, Jian; Frankham, Greta J.; Johnson, Rebecca N.; Polkinghorne, Adam; Timms, Peter; O’Meally, Denis; Cheng, Yuanyuan; Belov, Katherine

    2015-01-01

    Toll-like receptors (TLRs) play a crucial role in the early defence against invading pathogens, yet our understanding of TLRs in marsupial immunity is limited. Here, we describe the characterisation of nine TLRs from a koala immune tissue transcriptome and one TLR from a draft sequence of the koala genome and the subsequent development of an assay to study genetic diversity in these genes. We surveyed genetic diversity in 20 koalas from New South Wales, Australia and showed that one gene, TLR10 is monomorphic, while the other nine TLR genes have between two and 12 alleles. 40 SNPs (16 non-synonymous) were identified across the ten TLR genes. These markers provide a springboard to future studies on innate immunity in the koala, a species under threat from two major infectious diseases. PMID:25799012

  16. Genomic dissection of a 'Fuji' apple cultivar: re-sequencing, SNP marker development, definition of haplotypes, and QTL detection.

    PubMed

    Kunihisa, Miyuki; Moriya, Shigeki; Abe, Kazuyuki; Okada, Kazuma; Haji, Takashi; Hayashi, Takeshi; Kawahara, Yoshihiro; Itoh, Ryutaro; Itoh, Takeshi; Katayose, Yuichi; Kanamori, Hiroyuki; Matsumoto, Toshimi; Mori, Satomi; Sasaki, Harumi; Matsumoto, Takashi; Nishitani, Chikako; Terakami, Shingo; Yamamoto, Toshiya

    2016-09-01

    'Fuji' is one of the most popular and highly-produced apple cultivars worldwide, and has been frequently used in breeding programs. The development of genotypic markers for the preferable phenotypes of 'Fuji' is required. Here, we aimed to define the haplotypes of 'Fuji' and find associations between haplotypes and phenotypes of five traits (harvest day, fruit weight, acidity, degree of watercore, and flesh mealiness) by using 115 accessions related to 'Fuji'. Through the re-sequencing of 'Fuji' genome, total of 2,820,759 variants, including single nucleotide polymorphisms (SNPs) and insertions or deletions (indels) were detected between 'Fuji' and 'Golden Delicious' reference genome. We selected mapping-validated 1,014 SNPs, most of which were heterozygous in 'Fuji' and capable of distinguishing alleles inherited from the parents of 'Fuji' (i.e., 'Ralls Janet' and 'Delicious'). We used these SNPs to define the haplotypes of 'Fuji' and trace their inheritance in relatives, which were shown to have an average of 27% of 'Fuji' genome. Analysis of variance (ANOVA) based on 'Fuji' haplotypes identified one quantitative trait loci (QTL) each for harvest time, acidity, degree of watercore, and mealiness. A haplotype from 'Delicious' chr14 was considered to dominantly cause watercore, and one from 'Ralls Janet' chr1 was related to low-mealiness.

  17. Identification of Immune-Related Genes and Development of SSR/SNP Markers from the Spleen Transcriptome of Schizothorax prenanti

    PubMed Central

    Zhang, Zhengshi; Lv, Changhuan; Zheng, Shuming; Wang, Zhiyong; Wang, Xiaoqing

    2016-01-01

    Schizothorax prenanti (S. prenanti) is mainly distributed in the upstream regions of the Yangtze River and its tributaries in China. This species is indigenous and commercially important. However, in recent years, wild populations and aquacultures have faced the serious challenges of germplasm variation loss and an increased susceptibility to a range of pathogens. Currently, the genetics and immune mechanisms of S. prenanti are unknown, partly due to a lack of genome and transcriptome information. Here, we sought to identify genes related to immune functions and to identify molecular markers to study the function of these genes and for trait mapping. To this end, the transcriptome from spleen tissues of S. prenanti was analyzed and sequenced. Using paired-end reads from the Illumina Hiseq2500 platform, 48,517 transcripts were isolated from the spleen transcriptome. These transcripts could be clustered into 37,785 unigenes with an N50 length of 2,539 bp. The majority of the unigenes (35,653, 94.4%) were successfully annotated using non-redundant nucleotide sequence analysis (nt), and the non-redundant protein (nr), Swiss-Prot, Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. KEGG pathway assignment identified more than 500 immune-related genes. Furthermore, 7,545 putative simple sequence repeats (SSRs), 857,535 single nucleotide polymorphisms (SNPs), and 53,481 insertion/deletion (InDels) were detected from the transcriptome. This is the first reported high-throughput transcriptome analysis of S. prenanti, and it provides valuable genetic resources for the investigation of immune mechanisms, conservation of germplasm, and molecular marker-assisted breeding of S. prenanti. PMID:27019203

  18. Identification of Immune-Related Genes and Development of SSR/SNP Markers from the Spleen Transcriptome of Schizothorax prenanti.

    PubMed

    Luo, Hui; Xiao, Shijun; Ye, Hua; Zhang, Zhengshi; Lv, Changhuan; Zheng, Shuming; Wang, Zhiyong; Wang, Xiaoqing

    2016-01-01

    Schizothorax prenanti (S. prenanti) is mainly distributed in the upstream regions of the Yangtze River and its tributaries in China. This species is indigenous and commercially important. However, in recent years, wild populations and aquacultures have faced the serious challenges of germplasm variation loss and an increased susceptibility to a range of pathogens. Currently, the genetics and immune mechanisms of S. prenanti are unknown, partly due to a lack of genome and transcriptome information. Here, we sought to identify genes related to immune functions and to identify molecular markers to study the function of these genes and for trait mapping. To this end, the transcriptome from spleen tissues of S. prenanti was analyzed and sequenced. Using paired-end reads from the Illumina Hiseq2500 platform, 48,517 transcripts were isolated from the spleen transcriptome. These transcripts could be clustered into 37,785 unigenes with an N50 length of 2,539 bp. The majority of the unigenes (35,653, 94.4%) were successfully annotated using non-redundant nucleotide sequence analysis (nt), and the non-redundant protein (nr), Swiss-Prot, Gene Ontology (GO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. KEGG pathway assignment identified more than 500 immune-related genes. Furthermore, 7,545 putative simple sequence repeats (SSRs), 857,535 single nucleotide polymorphisms (SNPs), and 53,481 insertion/deletion (InDels) were detected from the transcriptome. This is the first reported high-throughput transcriptome analysis of S. prenanti, and it provides valuable genetic resources for the investigation of immune mechanisms, conservation of germplasm, and molecular marker-assisted breeding of S. prenanti.

  19. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    PubMed Central

    Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718

  20. Marker development

    SciTech Connect

    Adams, M.R.

    1987-05-01

    This report is to discuss the marker development for radioactive waste disposal sites. The markers must be designed to last 10,000 years, and place no undue burdens on the future generations. Barriers cannot be constructed that preclude human intrusion. Design specifications for surface markers will be discussed, also marker pictograms will also be covered.

  1. Generation of SNP markers for short straw in oat (Avena sativa L.).

    PubMed

    Tanhuanpää, Pirjo; Kalendar, Ruslan; Laurila, Jaana; Schulman, Alan H; Manninen, Outi; Kiviharju, Elina

    2006-03-01

    Short straw is a desired trait in oat germplasm (Avena sativa L.). Marker-assisted selection, a key tool for achieving this objective, is limited by the presence and number of available markers. Here, we have attempted to develop markers sufficiently linked to a gene specifying short straw so that marker-assisted selection could be applied. Bulked-segregant analysis was used to identify anonymous PCR-based markers associated with the dwarfing gene Dw6 in an F2 population from the cross between A. sativa "Aslak" and A. sativa "Kontant". One random amplified polymorphic DNA (RAPD) and 1 retrotransposon-microsatellite amplified polymorphism (REMAP) marker were found to be associated with height. These were converted into codominant single-nucleotide polymorphism (SNP) markers. The SNP-REMAP and the SNP-RAPD markers were located 5.2 and 12.6 cM from Dw6, respectively. They can be used in future efforts both to enhance oat germplasm by application of molecular markers and to determine the nature of the gene through positional cloning.

  2. Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing.

    PubMed

    Yu, ShiGang; Chu, WeiWei; Zhang, LiFan; Han, HouMing; Zhao, RongXue; Wu, Wei; Zhu, JiangNing; Dodson, Michael V; Wei, Wei; Liu, HongLin; Chen, Jie

    2015-01-01

    Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the candidate laying

  3. Identification of Laying-Related SNP Markers in Geese Using RAD Sequencing

    PubMed Central

    Yu, ShiGang; Chu, WeiWei; Zhang, LiFan; Han, HouMing; Zhao, RongXue; Wu, Wei; Zhu, JiangNing; Dodson, Michael V.; Wei, Wei; Liu, HongLin; Chen, Jie

    2015-01-01

    Laying performance is an important economical trait of goose production. As laying performance is of low heritability, it is of significance to develop a marker-assisted selection (MAS) strategy for this trait. Definition of sequence variation related to the target trait is a prerequisite of quantitating MAS, but little is presently known about the goose genome, which greatly hinders the identification of genetic markers for the laying traits of geese. Recently developed restriction site-associated DNA (RAD) sequencing is a possible approach for discerning large-scale single nucleotide polymorphism (SNP) and reducing the complexity of a genome without having reference genomic information available. In the present study, we developed a pooled RAD sequencing strategy for detecting geese laying-related SNP. Two DNA pools were constructed, each consisting of equal amounts of genomic DNA from 10 individuals with either high estimated breeding value (HEBV) or low estimated breeding value (LEBV). A total of 139,013 SNP were obtained from 42,291,356 sequences, of which 18,771,943 were for LEBV and 23,519,413 were for HEBV cohorts. Fifty-five SNP which had different allelic frequencies in the two DNA pools were further validated by individual-based AS-PCR genotyping in the LEBV and HEBV cohorts. Ten out of 55 SNP exhibited distinct allele distributions in these two cohorts. These 10 SNP were further genotyped in a goose population of 492 geese to verify the association with egg numbers. The result showed that 8 of 10 SNP were associated with egg numbers. Additionally, liner regression analysis revealed that SNP Record-111407, 106975 and 112359 were involved in a multiplegene network affecting laying performance. We used IPCR to extend the unknown regions flanking the candidate RAD tags. The obtained sequences were subjected to BLAST to retrieve the orthologous genes in either ducks or chickens. Five novel genes were cloned for geese which harbored the candidate laying

  4. An improved consensus linkage map of barley based on flow-sorted chromosomes and SNP markers

    USDA-ARS?s Scientific Manuscript database

    Recent advances in high-throughput genotyping have made it easier to combine information from different mapping populations into consensus genetic maps, which provide increased marker density and genome coverage compared to individual maps. Previously, a SNP-based genotyping platform was developed a...

  5. Use of microsatellite and SNP markers to characterize biotypes in Hessian fly

    USDA-ARS?s Scientific Manuscript database

    Exploration of the biotype structure of Hessian fly, Mayetiola destructor (Say), would improve our knowledge regarding variation in virulence phenotypes and difference in genetic background. The objective of this study was to develop and test a panel of 18 microsatellite and 22 SNP markers to reveal...

  6. Verification of genetic identity of introduced cacao germplasm in Ghana using single nucleotide polymorphism (SNP) markers

    USDA-ARS?s Scientific Manuscript database

    Accurate identification of individual genotypes is important for cacao (Theobroma cacao L.) breeding, germplasm conservation and seed propagation. The development of single nucleotide polymorphism (SNP) markers in cacao offers an effective way to use a high-throughput genotyping system for cacao gen...

  7. Identification of a SNP marker associated with WB242 nematode resistance in sugar beet

    USDA-ARS?s Scientific Manuscript database

    The beet-cyst nematode (Heterodera schachtii Schmidt) is one of the major diseases of sugar beet. The identification of molecular markers associated to the nematode resistance would be helpful for developing resistant varieties. The aim of this study was the identification of SNP (Single Nucleotide ...

  8. Citrus (Rutaceae) SNP markers based on Competitive Allele-Specific PCR; transferability across the Aurantioideae subfamily1

    PubMed Central

    Garcia-Lor, Andres; Ancillo, Gema; Navarro, Luis; Ollitrault, Patrick

    2013-01-01

    • Premise of the study: Single nucleotide polymorphism (SNP) markers based on Competitive Allele-Specific PCR (KASPar) were developed from sequences of three Citrus species. Their transferability was tested in 63 Citrus genotypes and 19 relative genera of the subfamily Aurantioideae to estimate the potential of SNP markers, selected from a limited intrageneric discovery panel, for ongoing broader diversity analysis at the intra- and intergeneric levels and systematic germplasm bank characterization. • Methods and Results: Forty-two SNP markers were developed using KASPar technology. Forty-one were successfully genotyped in all of the Citrus germplasm, where intra- and interspecific polymorphisms were observed. The transferability and diversity decreased with increasing taxonomic distance. • Conclusions: SNP markers based on the KASPar method developed from sequence data of a limited intrageneric discovery panel provide a valuable molecular resource for genetic diversity analysis of germplasm within a genus and should be useful for germplasm fingerprinting at a much broader diversity level. PMID:25202535

  9. DHOEM: a statistical simulation software for simulating new markers in real SNP marker data.

    PubMed

    Jacquin, Laval; Cao, Tuong-Vi; Grenier, Cécile; Ahmadi, Nourollah

    2015-12-03

    Numerous simulation tools based on specific assumptions have been proposed to simulate populations. Here we present a simulation tool named DHOEM (densification of haplotypes by loess regression and maximum likelihood) which is free from population assumptions and simulates new markers in real SNP marker data. The main objective of DHOEM is to generate a new population, which incorporates real and simulated SNP by statistical learning from an initial population, which match the realized features of the latter. To demonstrate DHOEM's abilities, we used a sample of 704 haplotypes for 12 chromosomes with 8336 SNP from a synthetic population, used for breeding upland rice in Latin America. The distributions of allele frequencies, pairwise SNP LD coefficients and data structures, before and after marker densification of the associated marker data set, were shown to be in relatively good agreement at moderate degrees of marker densification. DHOEM is a user-friendly tool that allows the user to specify the level of marker density desired, with a user defined minor allele frequency (MAF) limit, which is produced in a reasonable computation time. DHOEM is a user-friendly and useful tool for simulation and methodological studies in quantitative genetics and breeding.

  10. SNP-based markers for discriminating olive (Olea europaea L.) cultivars.

    PubMed

    Reale, S; Doveri, S; Díaz, A; Angiolillo, A; Lucentini, L; Pilla, F; Martín, A; Donini, P; Lee, D

    2006-09-01

    A set of 11 polymorphic markers (1 cleaved amplified polymorphic sequence (CAPS), 2 sequence-characterized amplified regions (SCARs), and 8 single-nucleotide polymorphism (SNP)-derived markers) was obtained for olive cultivar identification by comparing DNA sequences from different accessions. Marker development was more efficient, using sequences from the database rather than cloning arbitrary DNA fragments. Analyses of the sequences of 3 genes from 11 diverse cultivars revealed an SNP frequency of 1 per 190 base pairs in exons and 1 per 149 base pairs in introns. Most mutations were silent or had little perceptible effect on the polypeptide encoded. The higher incidence of transversions (55%) suggests that methylation is not the major driving force for DNA base changes. Evidence of linkage disequilibrium in 2 pairs of markers has been detected. The set of predominantly SNP-based markers was used to genotype 65 olive samples obtained from Europe and Australia, and was able clearly to discriminate 77% of the cultivars. Samples, putatively of the same cultivar but derived from different sources, were revealed as identical, demonstrating the utility of these markers as tools for resolving nomenclature issues. Genotyping data were used for constructing a dendrogram by UPGMA cluster analysis using the simple matching similarity coefficient. Relationships between cultivars are discussed in relation to the route of olive's spread.

  11. QTL Analysis Using SNP Markers Developed by Next-Generation Sequencing for Identification of Candidate Genes Controlling 4-Methylthio-3-Butenyl Glucosinolate Contents in Roots of Radish, Raphanus sativus L

    PubMed Central

    Zou, Zhongwei; Ishida, Masahiko; Li, Feng; Kakizaki, Tomohiro; Suzuki, Sho; Kitashiba, Hiroyasu; Nishio, Takeshi

    2013-01-01

    SNP markers for QTL analysis of 4-MTB-GSL contents in radish roots were developed by determining nucleotide sequences of bulked PCR products using a next-generation sequencer. DNA fragments were amplified from two radish lines by multiplex PCR with six primer pairs, and those amplified by 2,880 primer pairs were mixed and sequenced. By assembling sequence data, 1,953 SNPs in 750 DNA fragments, 437 of which have been previously mapped in a linkage map, were identified. A linkage map of nine linkage groups was constructed with 188 markers, and five QTLs were detected in two F2 populations, three of them accounting for more than 50% of the total phenotypic variance being repeatedly detected. In the identified QTL regions, nine SNP markers were newly produced. By synteny analysis of the QTLs regions with Arabidopsis thaliana and Brassica rapa genome sequences, three candidate genes were selected, i.e., RsMAM3 for production of aliphatic glucosinolates linked to GSL-QTL-4, RsIPMDH1 for leucine biosynthesis showing strong co-expression with glucosinolate biosynthesis genes linked to GSL-QTL-2, and RsBCAT4 for branched-chain amino acid aminotransferase linked to GSL-QTL-1. Nucleotide sequences and expression of these genes suggested their possible function in 4MTB-GSL biosynthesis in radish roots. PMID:23308250

  12. QTL analysis using SNP markers developed by next-generation sequencing for identification of candidate genes controlling 4-methylthio-3-butenyl glucosinolate contents in roots of radish, Raphanus sativus L.

    PubMed

    Zou, Zhongwei; Ishida, Masahiko; Li, Feng; Kakizaki, Tomohiro; Suzuki, Sho; Kitashiba, Hiroyasu; Nishio, Takeshi

    2013-01-01

    SNP markers for QTL analysis of 4-MTB-GSL contents in radish roots were developed by determining nucleotide sequences of bulked PCR products using a next-generation sequencer. DNA fragments were amplified from two radish lines by multiplex PCR with six primer pairs, and those amplified by 2,880 primer pairs were mixed and sequenced. By assembling sequence data, 1,953 SNPs in 750 DNA fragments, 437 of which have been previously mapped in a linkage map, were identified. A linkage map of nine linkage groups was constructed with 188 markers, and five QTLs were detected in two F(2) populations, three of them accounting for more than 50% of the total phenotypic variance being repeatedly detected. In the identified QTL regions, nine SNP markers were newly produced. By synteny analysis of the QTLs regions with Arabidopsis thaliana and Brassica rapa genome sequences, three candidate genes were selected, i.e., RsMAM3 for production of aliphatic glucosinolates linked to GSL-QTL-4, RsIPMDH1 for leucine biosynthesis showing strong co-expression with glucosinolate biosynthesis genes linked to GSL-QTL-2, and RsBCAT4 for branched-chain amino acid aminotransferase linked to GSL-QTL-1. Nucleotide sequences and expression of these genes suggested their possible function in 4MTB-GSL biosynthesis in radish roots.

  13. The use of SNP markers for linkage mapping in diploid and tetraploid peanuts.

    PubMed

    Bertioli, David J; Ozias-Akins, Peggy; Chu, Ye; Dantas, Karinne M; Santos, Silvio P; Gouvea, Ediene; Guimarães, Patricia M; Leal-Bertioli, Soraya C M; Knapp, Steven J; Moretzsohn, Marcio C

    2014-01-10

    Single nucleotide polymorphic markers (SNPs) are attractive for use in genetic mapping and marker-assisted breeding because they can be scored in parallel assays at favorable costs. However, scoring SNP markers in polyploid plants like the peanut is problematic because of interfering signal generated from the DNA bases that are homeologous to those being assayed. The present study used a previously constructed 1536 GoldenGate SNP assay developed using SNPs identified between two A. duranensis accessions. In this study, the performance of this assay was tested on two RIL mapping populations, one diploid (A. duranensis × A. stenosperma) and one tetraploid [A. hypogaea cv. Runner IAC 886 × synthetic tetraploid (A. ipaënsis × A. duranensis)(4×)]. The scoring was performed using the software GenomeStudio version 2011.1. For the diploid, polymorphic markers provided excellent genotyping scores with default software parameters. In the tetraploid, as expected, most of the polymorphic markers provided signal intensity plots that were distorted compared to diploid patterns and that were incorrectly scored using default parameters. However, these scorings were easily corrected using the GenomeStudio software. The degree of distortion was highly variable. Of the polymorphic markers, approximately 10% showed no distortion at all behaving as expected for single-dose markers, and another 30% showed low distortion and could be considered high-quality. The genotyped markers were incorporated into diploid and tetraploid genetic maps of Arachis and, in the latter case, were located almost entirely on A genome linkage groups.

  14. Evaluation of approaches for identifying population informative markers from high density SNP Chips

    PubMed Central

    2011-01-01

    Background Genetic markers can be used to identify and verify the origin of individuals. Motivation for the inference of ancestry ranges from conservation genetics to forensic analysis. High density assays featuring Single Nucleotide Polymorphism (SNP) markers can be exploited to create a reduced panel containing the most informative markers for these purposes. The objectives of this study were to evaluate methods of marker selection and determine the minimum number of markers from the BovineSNP50 BeadChip required to verify the origin of individuals in European cattle breeds. Delta, Wright's FST, Weir & Cockerham's FST and PCA methods for population differentiation were compared. The level of informativeness of each SNP was estimated from the breed specific allele frequencies. Individual assignment analysis was performed using the ranked informative markers. Stringency levels were applied by log-likelihood ratio to assess the confidence of the assignment test. Results A 95% assignment success rate for the 384 individually genotyped animals was achieved with < 80, < 100, < 140 and < 200 SNP markers (with increasing stringency threshold levels) across all the examined methods for marker selection. No further gain in power of assignment was achieved by sampling in excess of 200 SNP markers. The marker selection method that required the lowest number of SNP markers to verify the animal's breed origin was Wright's FST (60 to 140 SNPs depending on the chosen degree of confidence). Certain breeds required fewer markers (< 100) to achieve 100% assignment success. In contrast, closely related breeds require more markers (~200) to achieve > 95% assignment success. The power of assignment success, and therefore the number of SNP markers required, is dependent on the levels of genetic heterogeneity and pool of samples considered. Conclusions While all SNP selection methods produced marker panels capable of breed identification, the power of assignment varied markedly among

  15. Fine Mapping of a Clubroot Resistance Gene in Chinese Cabbage Using SNP Markers Identified from Bulked Segregant RNA Sequencing

    PubMed Central

    Huang, Zhen; Peng, Gary; Liu, Xunjia; Deora, Abhinandan; Falk, Kevin C.; Gossen, Bruce D.; McDonald, Mary R.; Yu, Fengqun

    2017-01-01

    Clubroot, caused by Plasmodiophora brassicae, is an important disease of canola (Brassica napus) in western Canada and worldwide. In this study, a clubroot resistance gene (Rcr2) was identified and fine mapped in Chinese cabbage cv. “Jazz” using single-nucleotide polymorphisms (SNP) markers identified from bulked segregant RNA sequencing (BSR-Seq) and molecular markers were developed for use in marker assisted selection. In total, 203.9 million raw reads were generated from one pooled resistant (R) and one pooled susceptible (S) sample, and >173,000 polymorphic SNP sites were identified between the R and S samples. One significant peak was observed between 22 and 26 Mb of chromosome A03, which had been predicted by BSR-Seq to contain the causal gene Rcr2. There were 490 polymorphic SNP sites identified in the region. A segregating population consisting of 675 plants was analyzed with 15 SNP sites in the region using the Kompetitive Allele Specific PCR method, and Rcr2 was fine mapped between two SNP markers, SNP_A03_32 and SNP_A03_67 with 0.1 and 0.3 cM from Rcr2, respectively. Five SNP markers co-segregated with Rcr2 in this region. Variants were identified in 14 of 36 genes annotated in the Rcr2 target region. The numbers of poly variants differed among the genes. Four genes encode TIR-NBS-LRR proteins and two of them Bra019410 and Bra019413, had high numbers of polymorphic variants and so are the most likely candidates of Rcr2. PMID:28894454

  16. Identification of SNP and SSR markers in eggplant using RAD tag sequencing

    PubMed Central

    2011-01-01

    Background The eggplant (Solanum melongena L.) genome is relatively unexplored, especially compared to those of the other major Solanaceae crops tomato and potato. In particular, no SNP markers are publicly available; on the other hand, over 1,000 SSR markers were developed and publicly available. We have combined the recently developed Restriction-site Associated DNA (RAD) approach with Illumina DNA sequencing for rapid and mass discovery of both SNP and SSR markers for eggplant. Results RAD tags were generated from the genomic DNA of a pair of eggplant mapping parents, and sequenced to produce ~17.5 Mb of sequences arrangeable into ~78,000 contigs. The resulting non-redundant genomic sequence dataset consisted of ~45,000 sequences, of which ~29% were putative coding sequences and ~70% were in common between the mapping parents. The shared sequences allowed the discovery of ~10,000 SNPs and nearly 1,000 indels, equivalent to a SNP frequency of 0.8 per Kb and an indel frequency of 0.07 per Kb. Over 2,000 of the SNPs are likely to be mappable via the Illumina GoldenGate assay. A subset of 384 SNPs was used to successfully fingerprint a panel of eggplant germplasm, producing a set of informative diversity data. The RAD sequences also included nearly 2,000 putative SSRs, and primer pairs were designed to amplify 1,155 loci. Conclusion The high throughput sequencing of the RAD tags allowed the discovery of a large number of DNA markers, which will prove useful for extending our current knowledge of the genome organization of eggplant, for assisting in marker-aided selection and for carrying out comparative genomic analyses within the Solanaceae family. PMID:21663628

  17. SNP-markers in Allium species to facilitate introgression breeding in onion.

    PubMed

    Scholten, Olga E; van Kaauwen, Martijn P W; Shahin, Arwa; Hendrickx, Patrick M; Keizer, L C Paul; Burger, Karin; van Heusden, Adriaan W; van der Linden, C Gerard; Vosman, Ben

    2016-08-31

    Within onion, Allium cepa L., the availability of disease resistance is limited. The identification of sources of resistance in related species, such as Allium roylei and Allium fistulosum, was a first step towards the improvement of onion cultivars by breeding. SNP markers linked to resistance and polymorphic between these related species and onion cultivars are a valuable tool to efficiently introgress disease resistance genes. In this paper we describe the identification and validation of SNP markers valuable for onion breeding. Transcriptome sequencing resulted in 192 million RNA seq reads from the interspecific F1 hybrid between A. roylei and A. fistulosum (RF) and nine onion cultivars. After assembly, reliable SNPs were discovered in about 36 % of the contigs. For genotyping of the interspecific three-way cross population, derived from a cross between an onion cultivar and the RF (CCxRF), 1100 SNPs that are polymorphic in RF and monomorphic in the onion cultivars (RF SNPs) were selected for the development of KASP assays. A molecular linkage map based on 667 RF-SNP markers was constructed for CCxRF. In addition, KASP assays were developed for 1600 onion-SNPs (SNPs polymorphic among onion cultivars). A second linkage map was constructed for an F2 of onion x A. roylei (F2(CxR)) that consisted of 182 onion-SNPs and 119 RF-SNPs, and 76 previously mapped markers. Markers co-segregating in both the F2(CxR) and the CCxRF population were used to assign the linkage groups of RF to onion chromosomes. To validate usefulness of these SNP markers, QTL mapping was applied in the CCxRF population that segregates for resistance to Botrytis squamosa and resulted in a QTL for resistance on chromosome 6 of A. roylei. Our research has more than doubled the publicly available marker sequences of expressed onion genes and two onion-related species. It resulted in a detailed genetic map for the interspecific CCxRF population. This is the first paper that reports the detection of

  18. Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies

    PubMed Central

    Gimode, Davis; Odeny, Damaris A.; de Villiers, Etienne P.; Wanyonyi, Solomon; Dida, Mathews M.; Mneney, Emmarold E.; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M.

    2016-01-01

    Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. Our results also reveal an unexploited finger millet genetic resource that can be included in the regional

  19. Development of SNP-based dCAPS markers for identifying male sterile gene tms5 in two-line hybrid rice.

    PubMed

    Song, F S; Ni, J L; Qian, Y L; Li, L; Ni, D H; Yang, J B

    2016-08-29

    Molecular markers can increase both the efficiency and speed of breeding programs. Functional markers that detect the functional mutations causing phenotypic changes offer a precise method for genetic identification. In this study, we used newly derived cleaved amplified polymorphic sequence markers to detect the functional mutations of tms5, which is a male sterile gene that is widely used in rice production in China. In addition, restriction cutting sites were designed to specifically digest amplicons of tms5 but not wild type (TMS5), in order to avoid the risk of false positive results. By optimizing the condition of the polymerase chain reaction amplifications and restriction enzyme digestions, the newly designed markers could accurately distinguish between tms5 and TMS5. These markers can be applied in marker-assisted selection for breeding novel thermo-sensitive genic male sterile (TGMS) lines, as well as to rapidly identify the TGMS hybrid seed purity.

  20. Genomic dissection of a ‘Fuji’ apple cultivar: re-sequencing, SNP marker development, definition of haplotypes, and QTL detection

    PubMed Central

    Kunihisa, Miyuki; Moriya, Shigeki; Abe, Kazuyuki; Okada, Kazuma; Haji, Takashi; Hayashi, Takeshi; Kawahara, Yoshihiro; Itoh, Ryutaro; Itoh, Takeshi; Katayose, Yuichi; Kanamori, Hiroyuki; Matsumoto, Toshimi; Mori, Satomi; Sasaki, Harumi; Matsumoto, Takashi; Nishitani, Chikako; Terakami, Shingo; Yamamoto, Toshiya

    2016-01-01

    ‘Fuji’ is one of the most popular and highly-produced apple cultivars worldwide, and has been frequently used in breeding programs. The development of genotypic markers for the preferable phenotypes of ‘Fuji’ is required. Here, we aimed to define the haplotypes of ‘Fuji’ and find associations between haplotypes and phenotypes of five traits (harvest day, fruit weight, acidity, degree of watercore, and flesh mealiness) by using 115 accessions related to ‘Fuji’. Through the re-sequencing of ‘Fuji’ genome, total of 2,820,759 variants, including single nucleotide polymorphisms (SNPs) and insertions or deletions (indels) were detected between ‘Fuji’ and ‘Golden Delicious’ reference genome. We selected mapping-validated 1,014 SNPs, most of which were heterozygous in ‘Fuji’ and capable of distinguishing alleles inherited from the parents of ‘Fuji’ (i.e., ‘Ralls Janet’ and ‘Delicious’). We used these SNPs to define the haplotypes of ‘Fuji’ and trace their inheritance in relatives, which were shown to have an average of 27% of ‘Fuji’ genome. Analysis of variance (ANOVA) based on ‘Fuji’ haplotypes identified one quantitative trait loci (QTL) each for harvest time, acidity, degree of watercore, and mealiness. A haplotype from ‘Delicious’ chr14 was considered to dominantly cause watercore, and one from ‘Ralls Janet’ chr1 was related to low-mealiness. PMID:27795675

  1. SNP Discovery by Illumina-Based Transcriptome Sequencing of the Olive and the Genetic Characterization of Turkish Olive Genotypes Revealed by AFLP, SSR and SNP Markers

    PubMed Central

    Kaya, Hilal Betul; Cetin, Oznur; Kaya, Hulya; Sahin, Mustafa; Sefer, Filiz; Kahraman, Abdullah; Tanyolac, Bahattin

    2013-01-01

    Background The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. Methodology/Principal Findings Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. Conclusions/Significance This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. High levels of

  2. Association of Agronomic Traits with SNP Markers in Durum Wheat (Triticum turgidum L. durum (Desf.)).

    PubMed

    Hu, Xin; Ren, Jing; Ren, Xifeng; Huang, Sisi; Sabiel, Salih A I; Luo, Mingcheng; Nevo, Eviatar; Fu, Chunjie; Peng, Junhua; Sun, Dongfa

    2015-01-01

    Association mapping is a powerful approach to detect associations between traits of interest and genetic markers based on linkage disequilibrium (LD) in molecular plant breeding. In this study, 150 accessions of worldwide originated durum wheat germplasm (Triticum turgidum spp. durum) were genotyped using 1,366 SNP markers. The extent of LD on each chromosome was evaluated. Association of single nucleotide polymorphisms (SNP) markers with ten agronomic traits measured in four consecutive years was analyzed under a mix linear model (MLM). Two hundred and one significant association pairs were detected in the four years. Several markers were associated with one trait, and also some markers were associated with multiple traits. Some of the associated markers were in agreement with previous quantitative trait loci (QTL) analyses. The function and homology analyses of the corresponding ESTs of some SNP markers could explain many of the associations for plant height, length of main spike, number of spikelets on main spike, grain number per plant, and 1000-grain weight, etc. The SNP associations for the observed traits are generally clustered in specific chromosome regions of the wheat genome, mainly in 2A, 5A, 6A, 7A, 1B, and 6B chromosomes. This study demonstrates that association mapping can complement and enhance previous QTL analyses and provide additional information for marker-assisted selection.

  3. Association of Agronomic Traits with SNP Markers in Durum Wheat (Triticum turgidum L. durum (Desf.))

    PubMed Central

    Hu, Xin; Ren, Jing; Ren, Xifeng; Huang, Sisi; Sabiel, Salih A. I.; Luo, Mingcheng; Nevo, Eviatar; Fu, Chunjie; Peng, Junhua; Sun, Dongfa

    2015-01-01

    Association mapping is a powerful approach to detect associations between traits of interest and genetic markers based on linkage disequilibrium (LD) in molecular plant breeding. In this study, 150 accessions of worldwide originated durum wheat germplasm (Triticum turgidum spp. durum) were genotyped using 1,366 SNP markers. The extent of LD on each chromosome was evaluated. Association of single nucleotide polymorphisms (SNP) markers with ten agronomic traits measured in four consecutive years was analyzed under a mix linear model (MLM). Two hundred and one significant association pairs were detected in the four years. Several markers were associated with one trait, and also some markers were associated with multiple traits. Some of the associated markers were in agreement with previous quantitative trait loci (QTL) analyses. The function and homology analyses of the corresponding ESTs of some SNP markers could explain many of the associations for plant height, length of main spike, number of spikelets on main spike, grain number per plant, and 1000-grain weight, etc. The SNP associations for the observed traits are generally clustered in specific chromosome regions of the wheat genome, mainly in 2A, 5A, 6A, 7A, 1B, and 6B chromosomes. This study demonstrates that association mapping can complement and enhance previous QTL analyses and provide additional information for marker-assisted selection. PMID:26110423

  4. Association mapping of maturity and plant height using SNP markers with the sorghum mini core collection.

    PubMed

    Upadhyaya, Hari D; Wang, Yi-Hong; Gowda, C L L; Sharma, Shivali

    2013-08-01

    Plant height and maturity are two critical traits in sorghum breeding. To develop molecular tools and to identify genes underlying the traits for molecular breeding, we developed 14,739 SNP markers used to genotype the complete sorghum [Sorghum bicolor (L.) Moench] mini core collection. The collection was evaluated in four rainy and three post-rainy season environments for plant height and maturity. Association analysis identified six marker loci linked to height and ten to maturity in at least two environments with at least two SNPs in each locus. Of these, 14 were in close proximity to previously mapped height/maturity QTL in sorghum. Candidate genes for maturity or plant height close to the marker loci include a sugar transporter (SbSUC9), an auxin response factor (SbARF3), an FLC and FT regulator (SbMED12), and a photoperiod response gene (SbPPR1) for maturity and peroxidase 53, and an auxin transporter (SbLAX4) for plant height. Linkage disequilibrium analysis showed that SbPPR1 and SbARF3 were in regions with reduced sequence variation among early-maturing accessions, suggestive of past purifying selection. We also found a linkage disequilibrium block that existed only among the accessions with short plant height in rainy season environments. The block contains a gene homologous to the Arabidopsis flowering time gene, LUMINIDEPENDENS (LD). Functional LD promotes early maturity while mutation delays maturity, affecting plant height. Previous studies also found reduced sequence variations within this gene. These newly-mapped SNP markers will facilitate further efforts to identify plant height or maturity genes in sorghum.

  5. Objective evaluation measures of genetic marker selection in large-scale SNP genotyping.

    PubMed

    Kaminuma, Eli; Masuya, Hiroshi; Miura, Ikuo; Motegi, Hiromi; Takahasi, Kenzi R; Nakazawa, Miki; Matsui, Minami; Gondo, Yoichi; Noda, Tetsuo; Shiroishi, Toshihiko; Wakana, Shigeharu; Toyoda, Tetsuro

    2008-10-01

    High-throughput single nucleotide polymorphism (SNP) genotyping systems provide two kinds of fluorescent signals detected from different alleles. In current technologies, the process of genotype discrimination requires subjective judgments by expert operators, even when using clustering algorithms. Here, we propose two evaluation measures to manage fluorescent scatter data with nonclear plot aggregation. The first is the marker ranking measure, which provides a ranking system for the SNP markers based on the distance between the scatter plot distribution and a user-defined ideal distribution. The second measure, called individual genotype membership, uses the membership probability of each genotype related to an individual plot in the scatter data. In verification experiments, the marker ranking measure determined the ranking of SNP markers correlated with the subjective order of SNP markers judged by an expert operator. The experiment using the individual genotype membership measure clarified that the total number of unclassified individuals was remarkably reduced compared to that of manually unclassified ones. These two evaluation measures were implemented as the GTAssist software. GTAssist provides objective standards and avoids subjective biases in SNP genotyping workflows.

  6. A Genome-Wide Association Study for Agronomic Traits in Soybean Using SNP Markers and SNP-Based Haplotype Analysis

    PubMed Central

    de Oliveira, Marco Antônio Rott; Higashi, Wilson; Scapim, Carlos Alberto; Schuster, Ivan

    2017-01-01

    Mapping quantitative trait loci through the use of linkage disequilibrium (LD) in populations of unrelated individuals provides a valuable approach for dissecting the genetic basis of complex traits in soybean (Glycine max). The haplotype-based genome-wide association study (GWAS) has now been proposed as a complementary approach to intensify benefits from LD, which enable to assess the genetic determinants of agronomic traits. In this study a GWAS was undertaken to identify genomic regions that control 100-seed weight (SW), plant height (PH) and seed yield (SY) in a soybean association mapping panel using single nucleotide polymorphism (SNP) markers and haplotype information. The soybean cultivars (N = 169) were field-evaluated across four locations of southern Brazil. The genome-wide haplotype association analysis (941 haplotypes) identified eleven, seventeen and fifty-nine SNP-based haplotypes significantly associated with SY, SW and PH, respectively. Although most marker-trait associations were environment and trait specific, stable haplotype associations were identified for SY and SW across environments (i.e., haplotypes Gm12_Hap12). The haplotype block 42 on Chr19 (Gm19_Hap42) was confirmed to be associated with PH in two environments. These findings enable us to refine the breeding strategy for tropical soybean, which confirm that haplotype-based GWAS can provide new insights on the genetic determinants that are not captured by the single-marker approach. PMID:28152092

  7. A Genome-Wide Association Study for Agronomic Traits in Soybean Using SNP Markers and SNP-Based Haplotype Analysis.

    PubMed

    Contreras-Soto, Rodrigo Iván; Mora, Freddy; de Oliveira, Marco Antônio Rott; Higashi, Wilson; Scapim, Carlos Alberto; Schuster, Ivan

    2017-01-01

    Mapping quantitative trait loci through the use of linkage disequilibrium (LD) in populations of unrelated individuals provides a valuable approach for dissecting the genetic basis of complex traits in soybean (Glycine max). The haplotype-based genome-wide association study (GWAS) has now been proposed as a complementary approach to intensify benefits from LD, which enable to assess the genetic determinants of agronomic traits. In this study a GWAS was undertaken to identify genomic regions that control 100-seed weight (SW), plant height (PH) and seed yield (SY) in a soybean association mapping panel using single nucleotide polymorphism (SNP) markers and haplotype information. The soybean cultivars (N = 169) were field-evaluated across four locations of southern Brazil. The genome-wide haplotype association analysis (941 haplotypes) identified eleven, seventeen and fifty-nine SNP-based haplotypes significantly associated with SY, SW and PH, respectively. Although most marker-trait associations were environment and trait specific, stable haplotype associations were identified for SY and SW across environments (i.e., haplotypes Gm12_Hap12). The haplotype block 42 on Chr19 (Gm19_Hap42) was confirmed to be associated with PH in two environments. These findings enable us to refine the breeding strategy for tropical soybean, which confirm that haplotype-based GWAS can provide new insights on the genetic determinants that are not captured by the single-marker approach.

  8. MDM2 SNP309 and SNP285 Act as Negative Prognostic Markers for Non-small Cell Lung Cancer Adenocarcinoma Patients

    PubMed Central

    Deben, Christophe; Op de Beeck, Ken; Van den Bossche, Jolien; Jacobs, Julie; Lardon, Filip; Wouters, An; Peeters, Marc; Van Camp, Guy; Rolfo, Christian; Deschoolmeester, Vanessa; Pauwels, Patrick

    2017-01-01

    Objectives: Two functional polymorphisms in the MDM2 promoter region, SNP309T>G and SNP285G>C, have been shown to impact MDM2 expression and cancer risk. Currently available data on the prognostic value of MDM2 SNP309 in non-small cell lung cancer (NSCLC) is contradictory and unavailable for SNP285. The goal of this study was to clarify the role of these MDM2 SNPs in the outcome of NSCLC patients. Materials and Methods: In this study we genotyped SNP309 and SNP285 in 98 NSCLC adenocarcinoma patients and determined MDM2 mRNA and protein levels. In addition, we assessed the prognostic value of these common SNPs on overall and progression free survival, taking into account the TP53 status of the tumor. Results and Conclusion: We found that the SNP285C allele, but not the SNP309G allele, was significantly associated with increased MDM2 mRNA expression levels (p = 0.025). However, we did not observe an association with MDM2 protein levels for SNP285. The SNP309G allele was significantly associated with the presence of wild type TP53 (p = 0.047) and showed a strong trend towards increased MDM2 protein levels (p = 0.068). In addition, patients harboring the SNP309G allele showed a worse overall survival, but only in the presence of wild type TP53. The SNP285C allele was significantly associated with an early age of diagnosis and metastasis. Additionally, the SNP285C allele acted as an independent predictor for worse progression free survival (HR = 3.97; 95% CI = 1.51 - 10.42; p = 0.005). Our data showed that both SNP309 (in the presence of wild type TP53) and SNP285 act as negative prognostic markers for NSCLC patients, implicating a prominent role for these variants in the outcome of these patients. PMID:28819417

  9. Whole-genome single-nucleotide polymorphism (SNP) marker discovery and association analysis with the eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content in Larimichthys crocea.

    PubMed

    Xiao, Shijun; Wang, Panpan; Dong, Linsong; Zhang, Yaguang; Han, Zhaofang; Wang, Qiurong; Wang, Zhiyong

    2016-01-01

    Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for the association and conservation studies. Genome-wide SNP development in many teleost species are still challenging because of the genome complexity and the cost of re-sequencing. Genotyping-By-Sequencing (GBS) provided an efficient reduced representative method to squeeze cost for SNP detection; however, most of recent GBS applications were reported on plant organisms. In this work, we used an EcoRI-NlaIII based GBS protocol to teleost large yellow croaker, an important commercial fish in China and East-Asia, and reported the first whole-genome SNP development for the species. 69,845 high quality SNP markers that evenly distributed along genome were detected in at least 80% of 500 individuals. Nearly 95% randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. The association studies with the muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, contributing as high up to ∼63% genetic variance that explained by all markers. Functional genes that involved in fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, PPT2 Gene, previously identified in the association study of the plasma n-3 and n-6 polyunsaturated fatty acid level in human, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS could produce quality SNP markers in a cost-efficient manner in teleost genome. The developed SNP markers and the EPA and DHA associated SNP loci provided invaluable resources for the population structure, conservation genetics and genomic selection of large yellow croaker and other fish organisms.

  10. Whole-genome single-nucleotide polymorphism (SNP) marker discovery and association analysis with the eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content in Larimichthys crocea

    PubMed Central

    Xiao, Shijun; Wang, Panpan; Dong, Linsong; Zhang, Yaguang; Han, Zhaofang; Wang, Qiurong

    2016-01-01

    Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for the association and conservation studies. Genome-wide SNP development in many teleost species are still challenging because of the genome complexity and the cost of re-sequencing. Genotyping-By-Sequencing (GBS) provided an efficient reduced representative method to squeeze cost for SNP detection; however, most of recent GBS applications were reported on plant organisms. In this work, we used an EcoRI-NlaIII based GBS protocol to teleost large yellow croaker, an important commercial fish in China and East-Asia, and reported the first whole-genome SNP development for the species. 69,845 high quality SNP markers that evenly distributed along genome were detected in at least 80% of 500 individuals. Nearly 95% randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. The association studies with the muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, contributing as high up to ∼63% genetic variance that explained by all markers. Functional genes that involved in fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, PPT2 Gene, previously identified in the association study of the plasma n-3 and n-6 polyunsaturated fatty acid level in human, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS could produce quality SNP markers in a cost-efficient manner in teleost genome. The developed SNP markers and the EPA and DHA associated SNP loci provided invaluable resources for the population structure, conservation genetics and genomic selection of large yellow croaker and other fish organisms. PMID:28028455

  11. Multi-marker-LD based genetic algorithm for tag SNP selection.

    PubMed

    Mouawad, Amer E; Mansour, Nashat

    2014-12-01

    Despite the advances in genotyping technologies which have led to large reduction in genotyping cost, the Tag SNP Selection problem remains an important problem for computational biologists and geneticists. Selecting the smallest subset of tag SNPs that can predict the other SNPs would considerably minimize the complexity of genome-wide or block-based SNP-disease association studies. These studies would lead to better diagnosis and treatment of diseases. In this work, we propose three variations of a genetic algorithm based on two-marker linkage disequilibrium, multi-marker linkage disequilibrium, and a third measure that we denote by prediction power. The performance of the three algorithms are compared with those of a recognized tag SNP selection algorithm using three different real data sets from the HapMap project. The results indicate that the multi-marker linkage disequilibrium based genetic algorithm yields better prediction accuracy.

  12. SNP markers-based map construction and genome-wide linkage analysis in Brassica napus.

    PubMed

    Raman, Harsh; Dalton-Morgan, Jessica; Diffey, Simon; Raman, Rosy; Alamery, Salman; Edwards, David; Batley, Jacqueline

    2014-09-01

    An Illumina Infinium array comprising 5306 single nucleotide polymorphism (SNP) markers was used to genotype 175 individuals of a doubled haploid population derived from a cross between Skipton and Ag-Spectrum, two Australian cultivars of rapeseed (Brassica napus L.). A genetic linkage map based on 613 SNP and 228 non-SNP (DArT, SSR, SRAP and candidate gene markers) covering 2514.8 cM was constructed and further utilized to identify loci associated with flowering time and resistance to blackleg, a disease caused by the fungus Leptosphaeria maculans. Comparison between genetic map positions of SNP markers and the sequenced Brassica rapa (A) and Brassica oleracea (C) genome scaffolds showed several genomic rearrangements in the B. napus genome. A major locus controlling resistance to L. maculans was identified at both seedling and adult plant stages on chromosome A07. QTL analyses revealed that up to 40.2% of genetic variation for flowering time was accounted for by loci having quantitative effects. Comparative mapping showed Arabidopsis and Brassica flowering genes such as Phytochrome A/D, Flowering Locus C and agamous-Like MADS box gene AGL1 map within marker intervals associated with flowering time in a DH population from Skipton/Ag-Spectrum. Genomic regions associated with flowering time and resistance to L. maculans had several SNP markers mapped within 10 cM. Our results suggest that SNP markers will be suitable for various applications such as trait introgression, comparative mapping and high-resolution mapping of loci in B. napus. © 2014 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.

  13. Identification of a sex-linked SNP marker in the salmon louse (Lepeophtheirus salmonis) using RAD sequencing.

    PubMed

    Carmichael, Stephen N; Bekaert, Michaël; Taggart, John B; Christie, Hayden R L; Bassett, David I; Bron, James E; Skuce, Philip J; Gharbi, Karim; Skern-Mauritzen, Rasmus; Sturm, Armin

    2013-01-01

    The salmon louse (Lepeophtheirus salmonis (Krøyer, 1837)) is a parasitic copepod that can, if untreated, cause considerable damage to Atlantic salmon (Salmo salar Linnaeus, 1758) and incurs significant costs to the Atlantic salmon mariculture industry. Salmon lice are gonochoristic and normally show sex ratios close to 1:1. While this observation suggests that sex determination in salmon lice is genetic, with only minor environmental influences, the mechanism of sex determination in the salmon louse is unknown. This paper describes the identification of a sex-linked Single Nucleotide Polymorphism (SNP) marker, providing the first evidence for a genetic mechanism of sex determination in the salmon louse. Restriction site-associated DNA sequencing (RAD-seq) was used to isolate SNP markers in a laboratory-maintained salmon louse strain. A total of 85 million raw Illumina 100 base paired-end reads produced 281,838 unique RAD-tags across 24 unrelated individuals. RAD marker Lsa101901 showed complete association with phenotypic sex for all individuals analysed, being heterozygous in females and homozygous in males. Using an allele-specific PCR assay for genotyping, this SNP association pattern was further confirmed for three unrelated salmon louse strains, displaying complete association with phenotypic sex in a total of 96 genotyped individuals. The marker Lsa101901 was located in the coding region of the prohibitin-2 gene, which showed a sex-dependent differential expression, with mRNA levels determined by RT-qPCR about 1.8-fold higher in adult female than adult male salmon lice. This study's observations of a novel sex-linked SNP marker are consistent with sex determination in the salmon louse being genetic and following a female heterozygous system. Marker Lsa101901 provides a tool to determine the genetic sex of salmon lice, and could be useful in the development of control strategies.

  14. Identification of a Sex-Linked SNP Marker in the Salmon Louse (Lepeophtheirus salmonis) Using RAD Sequencing

    PubMed Central

    Taggart, John B.; Christie, Hayden R. L.; Bassett, David I.; Bron, James E.; Skuce, Philip J.; Gharbi, Karim; Skern-Mauritzen, Rasmus; Sturm, Armin

    2013-01-01

    The salmon louse (Lepeophtheirus salmonis (Krøyer, 1837)) is a parasitic copepod that can, if untreated, cause considerable damage to Atlantic salmon (Salmo salar Linnaeus, 1758) and incurs significant costs to the Atlantic salmon mariculture industry. Salmon lice are gonochoristic and normally show sex ratios close to 1:1. While this observation suggests that sex determination in salmon lice is genetic, with only minor environmental influences, the mechanism of sex determination in the salmon louse is unknown. This paper describes the identification of a sex-linked Single Nucleotide Polymorphism (SNP) marker, providing the first evidence for a genetic mechanism of sex determination in the salmon louse. Restriction site-associated DNA sequencing (RAD-seq) was used to isolate SNP markers in a laboratory-maintained salmon louse strain. A total of 85 million raw Illumina 100 base paired-end reads produced 281,838 unique RAD-tags across 24 unrelated individuals. RAD marker Lsa101901 showed complete association with phenotypic sex for all individuals analysed, being heterozygous in females and homozygous in males. Using an allele-specific PCR assay for genotyping, this SNP association pattern was further confirmed for three unrelated salmon louse strains, displaying complete association with phenotypic sex in a total of 96 genotyped individuals. The marker Lsa101901 was located in the coding region of the prohibitin-2 gene, which showed a sex-dependent differential expression, with mRNA levels determined by RT-qPCR about 1.8-fold higher in adult female than adult male salmon lice. This study’s observations of a novel sex-linked SNP marker are consistent with sex determination in the salmon louse being genetic and following a female heterozygous system. Marker Lsa101901 provides a tool to determine the genetic sex of salmon lice, and could be useful in the development of control strategies. PMID:24147087

  15. Molecular authentication and quantitative analysis of Sarcandra glabra and adulterated chloranthus products using SNP markers.

    PubMed

    Wei, Yicong; Chen, Ying; Huang, Youkai; Liu, Jinping; Liang, Yichi

    2016-09-01

    Sarcandra glabra (Thunb.) Nakai is one of the most popular and valuable plant species in the oriental medicinal herb market. Chloranthus (Chloranthaceae) species are the most widely used adulterants, but they are known to have hepatotoxicity effects and different medicinal values. The aim of this study is to develop a robust and accurate DNA marker for the qualitative and quantitative analyses of their products. Four single nucleotide polymorphism (SNP) sites specific to Sarcandra glabra, Chloranthus spicatus, Chloranthus serratus and Chloranthus henryi were exploited from the trnL-F region in chloroplast DNA, which have a higher copy number in the products than the nuclear DNA. Based on the SNP sites, specific primers were designed to identify the products of Sarcandra glabra, Chloranthus spicatus, Chloranthus serratus and Chloranthus henryi in mixed solutions via multiplexed PCR. The primers were also used to quantitatively analyse the ratio of chloroplast DNA in the mixed products using real-time PCR. The established multiplexed-PCR and real-time PCR methods were determined to be effective for the authentication and relative quantitative assessments of the products of Sarcandra glabra, its adulterants, and their mixtures. We therefore present an effective method for monitoring the quality of these products.

  16. Development of maizeSNP3072, a high-throughput compatible SNP array, for DNA fingerprinting identification of Chinese maize varieties.

    PubMed

    Tian, Hong-Li; Wang, Feng-Ge; Zhao, Jiu-Ran; Yi, Hong-Mei; Wang, Lu; Wang, Rui; Yang, Yang; Song, Wei

    2015-01-01

    Single nucleotide polymorphisms (SNPs) are abundant and evenly distributed throughout the maize (Zea mays L.) genome. SNPs have several advantages over simple sequence repeats, such as ease of data comparison and integration, high-throughput processing of loci, and identification of associated phenotypes. SNPs are thus ideal for DNA fingerprinting, genetic diversity analysis, and marker-assisted breeding. Here, we developed a high-throughput and compatible SNP array, maizeSNP3072, containing 3072 SNPs developed from the maizeSNP50 array. To improve genotyping efficiency, a high-quality cluster file, maizeSNP3072_GT.egt, was constructed. All 3072 SNP loci were localized within different genes, where they were distributed in exons (43 %), promoters (21 %), 3' untranslated regions (UTRs; 22 %), 5' UTRs (9 %), and introns (5 %). The average genotyping failure rate using these SNPs was only 6 %, or 3 % using the cluster file to call genotypes. The genotype consistency of repeat sample analysis on Illumina GoldenGate versus Infinium platforms exceeded 96.4 %. The minor allele frequency (MAF) of the SNPs averaged 0.37 based on data from 309 inbred lines. The 3072 SNPs were highly effective for distinguishing among 276 examined hybrids. Comparative analysis using Chinese varieties revealed that the 3072SNP array showed a better marker success rate and higher average MAF values, evaluation scores, and variety-distinguishing efficiency than the maizeSNP50K array. The maizeSNP3072 array thus can be successfully used in DNA fingerprinting identification of Chinese maize varieties and shows potential as a useful tool for germplasm resource evaluation and molecular marker-assisted breeding.

  17. Applying SNP marker technology in the cacao breeding program at the Cocoa Research Institute of Ghana

    USDA-ARS?s Scientific Manuscript database

    In this investigation 45 parental cacao plants and five progeny derived from the parental stock studied were genotyped using six SNP markers to determine off-types or mislabeled clones and to authenticate crosses made in the Cocoa Research Institute of Ghana (CRIG) breeding program. Investigation wa...

  18. Association mapping of resistance to leaf rust in emmer wheat using high throughput SNP markers

    USDA-ARS?s Scientific Manuscript database

    Emmer wheat (Triticum turgidum L. subsp. dicoccum) is known to be a useful source of genes for many desirable characters for improvement of modern cultivated wheat. Recently, a panel of 181 emmer wheat accessions has been genotyped with wheat 9K SNP (single nucleotide polymorphism) markers and exte...

  19. Analysis of gene-derived SNP marker polymorphism in wheat (Triticum aestivum L.)

    USDA-ARS?s Scientific Manuscript database

    In this study, we analyzed 359 single nucleotide polymorphisms (SNPs) previously discovered in intron sequences of wheat genes to evaluate SNP marker polymorphism in common wheat (Triticum aestivum L.). These SNPs showed an average polymorphism information content (PIC) of 0.181 among 20 US wheat c...

  20. SNP markers identify widely distributed clonal lineages of Phytophthora colocasiae in Vietnam, Hawaii and Hainan Island, China.

    PubMed

    Shrestha, Sandesh; Hu, Jian; Fryxell, Rebecca Trout; Mudge, Joann; Lamour, Kurt

    2014-01-01

    Taro (Colocasia esculenta) is an important food crop, and taro leaf blight caused by Phytophthora colocasiae can significantly affect production. Our objectives were to develop single nucleotide polymorphism (SNP) markers for P. colocasiae and characterize populations in Hawaii (HI), Vietnam (VN) and Hainan Island, China (HIC). In total, 379 isolates were analyzed for mating type and multilocus SNP profiles including 214 from HI, 97 from VN and 68 from HIC. A total of 1152 single nucleotide variant (SNV) sites were identified via restriction site-associated DNA (RAD) sequencing of two field isolates. Genotyping with 27 SNPs revealed 41 multilocus SNP genotypes grouped into seven clonal lineages containing 2-232 members. Three clonal lineages were shared among countries. In addition, five SNP markers had a low incidence of loss of heterozygosity (LOH) during asexual laboratory growth. For HI and VN, >95% of isolates were the A2 mating type. On HIC, isolates within single clonal lineages had A1, A2 and A0 (neuter) isolates. The implications for the wide dispersal of clonal lineages are discussed.

  1. Forensic SNP genotyping with SNaPshot: Technical considerations for the development and optimization of multiplexed SNP assays.

    PubMed

    Fondevila, M; Børsting, C; Phillips, C; de la Puente, M; Consortium, Euroforen-NoE; Carracedo, A; Morling, N; Lareu, M V

    2017-01-01

    This review explores the key factors that influence the optimization, routine use, and profile interpretation of the SNaPshot single-base extension (SBE) system applied to forensic single-nucleotide polymorphism (SNP) genotyping. Despite being a mainly complimentary DNA genotyping technique to routine STR profiling, use of SNaPshot is an important part of the development of SNP sets for a wide range of forensic applications with these markers, from genotyping highly degraded DNA with very short amplicons to the introduction of SNPs to ascertain the ancestry and physical characteristics of an unidentified contact trace donor. However, this technology, as resourceful as it is, displays several features that depart from the usual STR genotyping far enough to demand a certain degree of expertise from the forensic analyst before tackling the complex casework on which SNaPshot application provides an advantage. In order to provide the basis for developing such expertise, we cover in this paper the most challenging aspects of the SNaPshot technology, focusing on the steps taken to design primer sets, optimize the PCR and single-base extension chemistries, and the important features of the peak patterns observed in typical forensic SNP profiles using SNaPshot. With that purpose in mind, we provide guidelines and troubleshooting for multiplex-SNaPshot-oriented primer design and the resulting capillary electrophoresis (CE) profile interpretation (covering the most commonly observed artifacts and expected departures from the ideal conditions).

  2. WIPP marker development

    SciTech Connect

    1994-04-01

    This article discusses the development of permanent, passive markers for the Waste Isolation Pilot Plant (WIPP) and presents some preliminary concepts in drawings and a table of components for the markers. The panel, convened by Sandia National Laboratories, was charged with developing design characteristics for permanent markers and judging the efficacy of markers in deterring inadvertent human intrusion. 6 figs., 2 tabs.

  3. Rice chromosome segment substitution line selection utilizing SNP markers

    USDA-ARS?s Scientific Manuscript database

    Chromosome segment substitution lines (CSSLs) are a powerful tool for identifying naturally occurring, favorable alleles in unadapted germplasm. Six CSSL libraries in rice (Oryza sativa) are being developed from crosses between three different accessions of the rice progenitor species, O. rufipogon...

  4. SNP marker discovery, linkage map construction and identification of QTLs for enhanced salinity tolerance in field pea (Pisum sativum L.)

    PubMed Central

    2013-01-01

    Background Field pea (Pisum sativum L.) is a self-pollinating, diploid, cool-season food legume. Crop production is constrained by multiple biotic and abiotic stress factors, including salinity, that cause reduced growth and yield. Recent advances in genomics have permitted the development of low-cost high-throughput genotyping systems, allowing the construction of saturated genetic linkage maps for identification of quantitative trait loci (QTLs) associated with traits of interest. Genetic markers in close linkage with the relevant genomic regions may then be implemented in varietal improvement programs. Results In this study, single nucleotide polymorphism (SNP) markers associated with expressed sequence tags (ESTs) were developed and used to generate comprehensive linkage maps for field pea. From a set of 36,188 variant nucleotide positions detected through in silico analysis, 768 were selected for genotyping of a recombinant inbred line (RIL) population. A total of 705 SNPs (91.7%) successfully detected segregating polymorphisms. In addition to SNPs, genomic and EST-derived simple sequence repeats (SSRs) were assigned to the genetic map in order to obtain an evenly distributed genome-wide coverage. Sequences associated with the mapped molecular markers were used for comparative genomic analysis with other legume species. Higher levels of conserved synteny were observed with the genomes of Medicago truncatula Gaertn. and chickpea (Cicer arietinum L.) than with soybean (Glycine max [L.] Merr.), Lotus japonicus L. and pigeon pea (Cajanus cajan [L.] Millsp.). Parents and RIL progeny were screened at the seedling growth stage for responses to salinity stress, imposed by addition of NaCl in the watering solution at a concentration of 18 dS m-1. Salinity-induced symptoms showed normal distribution, and the severity of the symptoms increased over time. QTLs for salinity tolerance were identified on linkage groups Ps III and VII, with flanking SNP markers suitable for

  5. High-density SNP-based genetic map development and linkage disequilibrium assessment in Brassica napus L

    PubMed Central

    2013-01-01

    Background High density genetic maps built with SNP markers that are polymorphic in various genetic backgrounds are very useful for studying the genetics of agronomical traits as well as genome organization and evolution. Simultaneous dense SNP genotyping of segregating populations and variety collections was applied to oilseed rape (Brassica napus L.) to obtain a high density genetic map for this species and to study the linkage disequilibrium pattern. Results We developed an integrated genetic map for oilseed rape by high throughput SNP genotyping of four segregating doubled haploid populations. A very high level of collinearity was observed between the four individual maps and a large number of markers (>59%) was common to more than two maps. The precise integrated map comprises 5764 SNP and 1603 PCR markers. With a total genetic length of 2250 cM, the integrated map contains a density of 3.27 markers (2.56 SNP) per cM. Genotyping of these mapped SNP markers in oilseed rape collections allowed polymorphism level and linkage disequilibrium (LD) to be studied across the different collections (winter vs spring, different seed quality types) and along the linkage groups. Overall, polymorphism level was higher and LD decayed faster in spring than in “00” winter oilseed rape types but this was shown to vary greatly along the linkage groups. Conclusions Our study provides a valuable resource for further genetic studies using linkage or association mapping, for marker assisted breeding and for Brassica napus sequence assembly and genome organization analyses. PMID:23432809

  6. Development and Evaluation of a 9K SNP Array for Peach by Internationally Coordinated SNP Detection and Validation in Breeding Germplasm

    PubMed Central

    Scalabrin, Simone; Gilmore, Barbara; Lawley, Cynthia T.; Gasic, Ksenija; Micheletti, Diego; Rosyara, Umesh R.; Cattonaro, Federica; Vendramin, Elisa; Main, Dorrie; Aramini, Valeria; Blas, Andrea L.; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Troggio, Michela; Sosinski, Bryon; Aranzana, Maria José; Arús, Pere; Iezzoni, Amy; Morgante, Michele; Peace, Cameron

    2012-01-01

    Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs. The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species. PMID:22536421

  7. SNP Marker Discovery in Pima Cotton (Gossypium barbadense L.) Leaf Transcriptomes

    PubMed Central

    Kottapalli, Pratibha; Ulloa, Mauricio; Kottapalli, Kameswara Rao; Payton, Paxton; Burke, John

    2016-01-01

    The objective of this study was to explore the known narrow genetic diversity and discover single-nucleotide polymorphic (SNP) markers for marker-assisted breeding within Pima cotton (Gossypium barbadense L.) leaf transcriptomes. cDNA from 25-day plants of three diverse cotton genotypes [Pima S6 (PS6), Pima S7 (PS7), and Pima 3-79 (P3-79)] was sequenced on Illumina sequencing platform. A total of 28.9 million reads (average read length of 138 bp) were generated by sequencing cDNA libraries of these three genotypes. The de novo assembly of reads generated transcriptome sets of 26,369 contigs for PS6, 25,870 contigs for PS7, and 24,796 contigs for P3-79. A Pima leaf reference transcriptome was generated consisting of 42,695 contigs. More than 10,000 single-nucleotide polymorphisms (SNPs) were identified between the genotypes, with 100% SNP frequency and a minimum of eight sequencing reads. The most prevalent SNP substitutions were C—T and A—G in these cotton genotypes. The putative SNPs identified can be utilized for characterizing genetic diversity, genotyping, and eventually in Pima cotton breeding through marker-assisted selection. PMID:27721653

  8. [New SNP markers of the honeybee vitellogenin gene (Vg) used for identification of subspecies Apis mellifera mellifera L].

    PubMed

    Ilyasov, R A; Poskryakov, A V; Nikolenko, A G

    2015-02-01

    Preservation of the gene pool of honeybee subspecies Apis mellifera mellifera is of vital importance for successful beekeeping development in the northern regions of Eurasia. An effective method of genotyping honeybee colonies used in modern science is the mapping of sites of single nucleotide polymorphism (SNP). The honeybee vitellogenin gene (Vg) encodes a protein that affects reproductive function, behavior, immunity, longevity, and social organization in the honeybee Apis mellifera and is therefore a topical research subject. The results of comparative analysis of honeybee Vg sequences show that there are 26 SNP sites that differentiate M and C evolutionary branches and can be used as markers in selective breeding, DNA-barcoding, and the creation of genetic passports for A. m. mellifera colonies.

  9. Nuclear Species-Diagnostic SNP Markers Mined from 454 Amplicon Sequencing Reveal Admixture Genomic Structure of Modern Citrus Varieties

    PubMed Central

    Curk, Franck; Ancillo, Gema; Ollitrault, Frédérique; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Garcia-Lor, Andres; Navarro, Luis; Ollitrault, Patrick

    2015-01-01

    Most cultivated Citrus species originated from interspecific hybridisation between four ancestral taxa (C. reticulata, C. maxima, C. medica, and C. micrantha) with limited further interspecific recombination due to vegetative propagation. This evolution resulted in admixture genomes with frequent interspecific heterozygosity. Moreover, a major part of the phenotypic diversity of edible citrus results from the initial differentiation between these taxa. Deciphering the phylogenomic structure of citrus germplasm is therefore essential for an efficient utilization of citrus biodiversity in breeding schemes. The objective of this work was to develop a set of species-diagnostic single nucleotide polymorphism (SNP) markers for the four Citrus ancestral taxa covering the nine chromosomes, and to use these markers to infer the phylogenomic structure of secondary species and modern cultivars. Species-diagnostic SNPs were mined from 454 amplicon sequencing of 57 gene fragments from 26 genotypes of the four basic taxa. Of the 1,053 SNPs mined from 28,507 kb sequence, 273 were found to be highly diagnostic for a single basic taxon. Species-diagnostic SNP markers (105) were used to analyse the admixture structure of varieties and rootstocks. This revealed C. maxima introgressions in most of the old and in all recent selections of mandarins, and suggested that C. reticulata × C. maxima reticulation and introgression processes were important in edible mandarin domestication. The large range of phylogenomic constitutions between C. reticulata and C. maxima revealed in mandarins, tangelos, tangors, sweet oranges, sour oranges, grapefruits, and orangelos is favourable for genetic association studies based on phylogenomic structures of the germplasm. Inferred admixture structures were in agreement with previous hypotheses regarding the origin of several secondary species and also revealed the probable origin of several acid citrus varieties. The developed species-diagnostic SNP

  10. Nuclear species-diagnostic SNP markers mined from 454 amplicon sequencing reveal admixture genomic structure of modern citrus varieties.

    PubMed

    Curk, Franck; Ancillo, Gema; Ollitrault, Frédérique; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Garcia-Lor, Andres; Navarro, Luis; Ollitrault, Patrick

    2015-01-01

    Most cultivated Citrus species originated from interspecific hybridisation between four ancestral taxa (C. reticulata, C. maxima, C. medica, and C. micrantha) with limited further interspecific recombination due to vegetative propagation. This evolution resulted in admixture genomes with frequent interspecific heterozygosity. Moreover, a major part of the phenotypic diversity of edible citrus results from the initial differentiation between these taxa. Deciphering the phylogenomic structure of citrus germplasm is therefore essential for an efficient utilization of citrus biodiversity in breeding schemes. The objective of this work was to develop a set of species-diagnostic single nucleotide polymorphism (SNP) markers for the four Citrus ancestral taxa covering the nine chromosomes, and to use these markers to infer the phylogenomic structure of secondary species and modern cultivars. Species-diagnostic SNPs were mined from 454 amplicon sequencing of 57 gene fragments from 26 genotypes of the four basic taxa. Of the 1,053 SNPs mined from 28,507 kb sequence, 273 were found to be highly diagnostic for a single basic taxon. Species-diagnostic SNP markers (105) were used to analyse the admixture structure of varieties and rootstocks. This revealed C. maxima introgressions in most of the old and in all recent selections of mandarins, and suggested that C. reticulata × C. maxima reticulation and introgression processes were important in edible mandarin domestication. The large range of phylogenomic constitutions between C. reticulata and C. maxima revealed in mandarins, tangelos, tangors, sweet oranges, sour oranges, grapefruits, and orangelos is favourable for genetic association studies based on phylogenomic structures of the germplasm. Inferred admixture structures were in agreement with previous hypotheses regarding the origin of several secondary species and also revealed the probable origin of several acid citrus varieties. The developed species-diagnostic SNP

  11. FastTagger: an efficient algorithm for genome-wide tag SNP selection using multi-marker linkage disequilibrium

    PubMed Central

    2010-01-01

    Background Human genome contains millions of common single nucleotide polymorphisms (SNPs) and these SNPs play an important role in understanding the association between genetic variations and human diseases. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), thus it is not necessary to genotype all SNPs for association study. Many algorithms have been developed to find a small subset of SNPs called tag SNPs that are sufficient to infer all the other SNPs. Algorithms based on the r2 LD statistic have gained popularity because r2 is directly related to statistical power to detect disease associations. Most of existing r2 based algorithms use pairwise LD. Recent studies show that multi-marker LD can help further reduce the number of tag SNPs. However, existing tag SNP selection algorithms based on multi-marker LD are both time-consuming and memory-consuming. They cannot work on chromosomes containing more than 100 k SNPs using length-3 tagging rules. Results We propose an efficient algorithm called FastTagger to calculate multi-marker tagging rules and select tag SNPs based on multi-marker LD. FastTagger uses several techniques to reduce running time and memory consumption. Our experiment results show that FastTagger is several times faster than existing multi-marker based tag SNP selection algorithms, and it consumes much less memory at the same time. As a result, FastTagger can work on chromosomes containing more than 100 k SNPs using length-3 tagging rules. FastTagger also produces smaller sets of tag SNPs than existing multi-marker based algorithms, and the reduction ratio ranges from 3%-9% when length-3 tagging rules are used. The generated tagging rules can also be used for genotype imputation. We studied the prediction accuracy of individual rules, and the average accuracy is above 96% when r2 ≥ 0.9. Conclusions Generating multi-marker tagging rules is a computation intensive task, and it is the bottleneck of existing multi-marker based tag

  12. Minimal SNP overlap among multiple panels of ancestry informative markers argues for more international collaboration.

    PubMed

    Soundararajan, Usha; Yun, Libing; Shi, Meisen; Kidd, Kenneth K

    2016-07-01

    The century-old use of genetic markers to determine population relationships has morphed in modern forensics into use of markers to determine the ancestry of an individual from a DNA sample. Researchers have identified sets of SNPs that have frequency differences among populations and many sets of SNPs have been published for the purpose of inferring ancestry. Such inference also requires reference datasets for the particular set of SNPs selected. We have identified 21 largely independent published panels of ancestry informative SNPs (AISNPs) and examined their union of 1397 SNPs. No SNP occurs in more than 6 panels. The 1397 SNPs in 21 panels yield a largely empty matrix that is inhibiting progress on more refined ability to infer ancestry for a forensic sample. The most common set of reference populations is the HGDP set of 52 small population samples totaling a thousand individuals. Only 46 (3%) of the 1397 SNPs occur in three or more panels. We assembled a new dataset for 44 of those SNPs involving 4,559 individuals from 73 populations. Analyses of this dataset provided clear differentiation of only five biogeographic regions: sub-Saharan Africa, Europe and SW Asia, South Asia, East Asia, and the Americas. This is an inadequate level of biogeographic resolution already exceeded by other panels. We conclude that more such AISNP panels are not needed and that the forensic community must collaborate to develop a common set of highly differentiating AISNPs typed on a very large number of population samples. How that can be accomplished will be the subject of future discussion. Copyright © 2016 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.

  13. Identification and validation of a SNP marker linked to the gene HsBvm-1 for nematode resistance in sugar beet

    USDA-ARS?s Scientific Manuscript database

    The beet-cyst nematode (Heterodera schachtii Schmidt) is one of the major pests of sugar beet. The identification of molecular markers associated with nematode resistance would be helpful for developing resistant varieties. The aim of this study was the identification of SNP (Single Nucleotide Polym...

  14. Development of a forensic identity SNP panel for Indonesia.

    PubMed

    Augustinus, Daniel; Gahan, Michelle E; McNevin, Dennis

    2015-07-01

    Genetic markers included in forensic identity panels must exhibit Hardy-Weinberg and linkage equilibrium (HWE and LE). "Universal" panels designed for global use can fail these tests in regional jurisdictions exhibiting high levels of genetic differentiation such as the Indonesian archipelago. This is especially the case where a single DNA database is required for allele frequency estimates to calculate random match probabilities (RMPs) and associated likelihood ratios (LRs). A panel of 65 single nucleotide polymorphisms (SNPs) and a reduced set of 52 SNPs have been selected from 15 Indonesian subpopulations in the HUGO Pan Asian SNP database using a SNP selection strategy that could be applied to any panel of forensic identity markers. The strategy consists of four screening steps: (1) application of a G test for HWE; (2) ranking for high heterozygosity; (3) selection for LE; and (4) selection for low inbreeding depression. SNPs in our Indonesian panel perform well in comparison to some other universal SNP and short tandem repeat (STR) panels as measured by Fisher's exact test for HWE and LE and Wright's F statistics.

  15. Development of SNP-genotyping arrays in two shellfish species.

    PubMed

    Lapègue, S; Harrang, E; Heurtebise, S; Flahauw, E; Donnadieu, C; Gayral, P; Ballenghien, M; Genestout, L; Barbotte, L; Mahla, R; Haffray, P; Klopp, C

    2014-07-01

    Use of SNPs has been favoured due to their abundance in plant and animal genomes, accompanied by the falling cost and rising throughput capacity for detection and genotyping. Here, we present in vitro (obtained from targeted sequencing) and in silico discovery of SNPs, and the design of medium-throughput genotyping arrays for two oyster species, the Pacific oyster, Crassostrea gigas, and European flat oyster, Ostrea edulis. Two sets of 384 SNP markers were designed for two Illumina GoldenGate arrays and genotyped on more than 1000 samples for each species. In each case, oyster samples were obtained from wild and selected populations and from three-generation families segregating for traits of interest in aquaculture. The rate of successfully genotyped polymorphic SNPs was about 60% for each species. Effects of SNP origin and quality on genotyping success (Illumina functionality Score) were analysed and compared with other model and nonmodel species. Furthermore, a simulation was made based on a subset of the C. gigas SNP array with a minor allele frequency of 0.3 and typical crosses used in shellfish hatcheries. This simulation indicated that at least 150 markers were needed to perform an accurate parental assignment. Such panels might provide valuable tools to improve our understanding of the connectivity between wild (and selected) populations and could contribute to future selective breeding programmes.

  16. Model SNP development for complex genomes based on hexaploid oat using high-throughput 454 sequencing technology

    PubMed Central

    2011-01-01

    Background Genetic markers are pivotal to modern genomics research; however, discovery and genotyping of molecular markers in oat has been hindered by the size and complexity of the genome, and by a scarcity of sequence data. The purpose of this study was to generate oat expressed sequence tag (EST) information, develop a bioinformatics pipeline for SNP discovery, and establish a method for rapid, cost-effective, and straightforward genotyping of SNP markers in complex polyploid genomes such as oat. Results Based on cDNA libraries of four cultivated oat genotypes, approximately 127,000 contigs were assembled from approximately one million Roche 454 sequence reads. Contigs were filtered through a novel bioinformatics pipeline to eliminate ambiguous polymorphism caused by subgenome homology, and 96 in silico SNPs were selected from 9,448 candidate loci for validation using high-resolution melting (HRM) analysis. Of these, 52 (54%) were polymorphic between parents of the Ogle1040 × TAM O-301 (OT) mapping population, with 48 segregating as single Mendelian loci, and 44 being placed on the existing OT linkage map. Ogle and TAM amplicons from 12 primers were sequenced for SNP validation, revealing complex polymorphism in seven amplicons but general sequence conservation within SNP loci. Whole-amplicon interrogation with HRM revealed insertions, deletions, and heterozygotes in secondary oat germplasm pools, generating multiple alleles at some primer targets. To validate marker utility, 36 SNP assays were used to evaluate the genetic diversity of 34 diverse oat genotypes. Dendrogram clusters corresponded generally to known genome composition and genetic ancestry. Conclusions The high-throughput SNP discovery pipeline presented here is a rapid and effective method for identification of polymorphic SNP alleles in the oat genome. The current-generation HRM system is a simple and highly-informative platform for SNP genotyping. These techniques provide a model for SNP

  17. Developing a new nonbinary SNP fluorescent multiplex detection system for forensic application in China.

    PubMed

    Liu, Yanfang; Liao, Huidan; Liu, Ying; Guo, Juanjuan; Sun, Yi; Fu, Xiaoliang; Xiao, Ding; Cai, Jifeng; Lan, Lingmei; Xie, Pingli; Zha, Lagabaiyila

    2017-02-06

    Nonbinary single-nucleotide polymorphisms (SNPs) are potential forensic genetic markers because their discrimination power is greater than that of normal binary SNPs, and that they can detect highly degraded samples. We previously developed a nonbinary SNP multiplex typing assay. In this study, we selected additional 20 nonbinary SNPs from the NCBI SNP database and verified them through pyrosequencing. These 20 nonbinary SNPs were analyzed using the fluorescent-labeled SNaPshot multiplex SNP typing method. The allele frequencies and genetic parameters of these 20 nonbinary SNPs were determined among 314 unrelated individuals from Han populations from China. The total power of discrimination was 0.9999999999994, and the cumulative probability of exclusion was 0.9986. Moreover, the result of the combination of this 20 nonbinary SNP assay with the 20 nonbinary SNP assay we previously developed demonstrated that the cumulative probability of exclusion of the 40 nonbinary SNPs was 0.999991 and that no significant linkage disequilibrium was observed in all 40 nonbinary SNPs. Thus, we concluded that this new system consisting of new 20 nonbinary SNPs could provide highly informative polymorphic data which would be further used in forensic application and would serve as a potentially valuable supplement to forensic DNA analysis.

  18. Diversity in 113 cowpea [Vigna unguiculata (L) Walp] accessions assessed with 458 SNP markers.

    PubMed

    Egbadzor, Kenneth F; Ofori, Kwadwo; Yeboah, Martin; Aboagye, Lawrence M; Opoku-Agyeman, Michael O; Danquah, Eric Y; Offei, Samuel K

    2014-01-01

    Single Nucleotide Polymorphism (SNP) markers were used in characterization of 113 cowpea accessions comprising of 108 from Ghana and 5 from abroad. Leaf tissues from plants cultivated at the University of Ghana were genotyped at KBioscience in the United Kingdom. Data was generated for 477 SNPs, out of which 458 revealed polymorphism. The results were used to analyze genetic dissimilarity among the accessions using Darwin 5 software. The markers discriminated among all of the cowpea accessions and the dissimilarity values which ranged from 0.006 to 0.63 were used for factorial plot. Unexpected high levels of heterozygosity were observed on some of the accessions. Accessions known to be closely related clustered together in a dendrogram drawn with WPGMA method. A maximum length sub-tree which comprised of 48 core accessions was constructed. The software package structure was used to separate accessions into three groups, and the programme correctly identified varieties that were known hybrids. The hybrids were those accessions with numerous heterozygous loci. The structure plot showed closely related accessions with similar genome patterns. The SNP markers were more efficient in discriminating among the cowpea germplasm than morphological, seed protein polymorphism and simple sequence repeat studies reported earlier on the same collection.

  19. Determination of cytoplasmic male sterile factors in onion plants (Allium cepa L.) using PCR-RFLP and SNP markers.

    PubMed

    Cho, Kwang-Soo; Yang, Tae-Jin; Hong, Su-Young; Kwon, Young-Seok; Woo, Jong-Gyu; Park, Hyo-Guen

    2006-06-30

    We have developed a polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) marker that can distinguish male-fertile (N) and male-sterile (S) cytoplasm in onions. The PCR-RFLP marker was located in a chloroplast psbA gene amplicon. Digesting the amplicons from different cytoplasm-containing varieties with the restriction enzyme MspI revealed that N-cytoplasm plants have a functional MspI site (CCGG), whereas the S-cytoplasm plants has a substitution in that site (CTGG), and thus no MspI target. The results obtained using this PCR-RFLP marker to distinguish between cytoplasmic male sterile factors in 35 onion varieties corresponded with those using a CMS-specific sequence-characterized amplified region (SCAR) marker. Moreover, the PCR-RFLP marker can identify N- ot S-cytoplasms in DNA sample mixtures in which they are in up to a 10-fold minority, indicating that use of the marker has high diagnostic precision. We also demonstrated the usefulness of the SNP detected in the psbA gene for high-throughput discrimination of CMS factors using Real-time PCR and a TaqMan probe assay.

  20. Outlier SNP markers reveal fine-scale genetic structuring across European hake populations (Merluccius merluccius).

    PubMed

    Milano, Ilaria; Babbucci, Massimiliano; Cariani, Alessia; Atanassova, Miroslava; Bekkevold, Dorte; Carvalho, Gary R; Espiñeira, Montserrat; Fiorentino, Fabio; Garofalo, Germana; Geffen, Audrey J; Hansen, Jakob H; Helyar, Sarah J; Nielsen, Einar E; Ogden, Rob; Patarnello, Tomaso; Stagioni, Marco; Tinti, Fausto; Bargelloni, Luca

    2014-01-01

    Shallow population structure is generally reported for most marine fish and explained as a consequence of high dispersal, connectivity and large population size. Targeted gene analyses and more recently genome-wide studies have challenged such view, suggesting that adaptive divergence might occur even when neutral markers provide genetic homogeneity across populations. Here, 381 SNPs located in transcribed regions were used to assess large- and fine-scale population structure in the European hake (Merluccius merluccius), a widely distributed demersal species of high priority for the European fishery. Analysis of 850 individuals from 19 locations across the entire distribution range showed evidence for several outlier loci, with significantly higher resolving power. While 299 putatively neutral SNPs confirmed the genetic break between basins (F(CT) = 0.016) and weak differentiation within basins, outlier loci revealed a dramatic divergence between Atlantic and Mediterranean populations (F(CT) range 0.275-0.705) and fine-scale significant population structure. Outlier loci separated North Sea and Northern Portugal populations from all other Atlantic samples and revealed a strong differentiation among Western, Central and Eastern Mediterranean geographical samples. Significant correlation of allele frequencies at outlier loci with seawater surface temperature and salinity supported the hypothesis that populations might be adapted to local conditions. Such evidence highlights the importance of integrating information from neutral and adaptive evolutionary patterns towards a better assessment of genetic diversity. Accordingly, the generated outlier SNP data could be used for tackling illegal practices in hake fishing and commercialization as well as to develop explicit spatial models for defining management units and stock boundaries. © 2013 John Wiley & Sons Ltd.

  1. Genetic diversity and structure in Bos taurus and Bos indicus populations analyzed by SNP markers.

    PubMed

    Lin, Bang Zhong; Sasazaki, Shinji; Mannen, Hideyuki

    2010-06-01

    The purpose of this study was to assess genetic diversity, phylogenetic relationship and population structure among nine Eurasian cattle populations using 58 single nucleotide polymorphism (SNP) markers. The calculated distribution of minor allele frequencies and heterozygosities suggested that the genetic diversity of Bos indicus populations was lower than that of Bos taurus populations. Phylogenetic analyses revealed the main divergence between the Bos taurus and Bos indicus populations, and subsequently between Asian and European populations. By principal components analysis, the Bos taurus and Bos indicus populations were clearly distinguished with PC1 (61.1%); however, six Bos taurus populations clustered loosely and the partial separation between European and Asian groups was observed by PC2 (12.5%). The structure analysis was performed using the STRUCTURE program. Distinct separation between Bos taurus and Bos indicus was shown at K = 2, and that between European and Asian populations at K = 3. At K = 4, 5 and 6, Mongolian population showed an admixture pattern with different ancestry of Asian and European cattle. At K = 7, all Bos taurus populations showed each cluster with little proportion of admixture. In conclusion, 58 SNP markers in this study could sufficiently estimate the genetic diversity, relationship and structure for nine Eurasian cattle populations, especially by analyses of principal components and STRUCTURE.

  2. Radial basis function regression methods for predicting quantitative traits using SNP markers.

    PubMed

    Long, Nanye; Gianola, Daniel; Rosa, Guilherme J M; Weigel, Kent A; Kranis, Andreas; González-Recio, Oscar

    2010-06-01

    A challenge when predicting total genetic values for complex quantitative traits is that an unknown number of quantitative trait loci may affect phenotypes via cryptic interactions. If markers are available, assuming that their effects on phenotypes are additive may lead to poor predictive ability. Non-parametric radial basis function (RBF) regression, which does not assume a particular form of the genotype-phenotype relationship, was investigated here by simulation and analysis of body weight and food conversion rate data in broilers. The simulation included a toy example in which an arbitrary non-linear genotype-phenotype relationship was assumed, and five different scenarios representing different broad sense heritability levels (0.1, 0.25, 0.5, 0.75 and 0.9) were created. In addition, a whole genome simulation was carried out, in which three different gene action modes (pure additive, additive+dominance and pure epistasis) were considered. In all analyses, a training set was used to fit the model and a testing set was used to evaluate predictive performance. The latter was measured by correlation and predictive mean-squared error (PMSE) on the testing data. For comparison, a linear additive model known as Bayes A was used as benchmark. Two RBF models with single nucleotide polymorphism (SNP)-specific (RBF I) and common (RBF II) weights were examined. Results indicated that, in the presence of complex genotype-phenotype relationships (i.e. non-linearity and non-additivity), RBF outperformed Bayes A in predicting total genetic values using SNP markers. Extension of Bayes A to include all additive, dominance and epistatic effects could improve its prediction accuracy. RBF I was generally better than RBF II, and was able to identify relevant SNPs in the toy example.

  3. SNP Discovery and Development of a High-Density Genotyping Array for Sunflower

    PubMed Central

    Bachlava, Eleni; Taylor, Christopher A.; Tang, Shunxue; Bowers, John E.; Mandel, Jennifer R.; Burke, John M.; Knapp, Steven J.

    2012-01-01

    Recent advances in next-generation DNA sequencing technologies have made possible the development of high-throughput SNP genotyping platforms that allow for the simultaneous interrogation of thousands of single-nucleotide polymorphisms (SNPs). Such resources have the potential to facilitate the rapid development of high-density genetic maps, and to enable genome-wide association studies as well as molecular breeding approaches in a variety of taxa. Herein, we describe the development of a SNP genotyping resource for use in sunflower (Helianthus annuus L.). This work involved the development of a reference transcriptome assembly for sunflower, the discovery of thousands of high quality SNPs based on the generation and analysis of ca. 6 Gb of transcriptome re-sequencing data derived from multiple genotypes, the selection of 10,640 SNPs for inclusion in the genotyping array, and the use of the resulting array to screen a diverse panel of sunflower accessions as well as related wild species. The results of this work revealed a high frequency of polymorphic SNPs and relatively high level of cross-species transferability. Indeed, greater than 95% of successful SNP assays revealed polymorphism, and more than 90% of these assays could be successfully transferred to related wild species. Analysis of the polymorphism data revealed patterns of genetic differentiation that were largely congruent with the evolutionary history of sunflower, though the large number of markers allowed for finer resolution than has previously been possible. PMID:22238659

  4. SNP discovery and development of a high-density genotyping array for sunflower.

    PubMed

    Bachlava, Eleni; Taylor, Christopher A; Tang, Shunxue; Bowers, John E; Mandel, Jennifer R; Burke, John M; Knapp, Steven J

    2012-01-01

    Recent advances in next-generation DNA sequencing technologies have made possible the development of high-throughput SNP genotyping platforms that allow for the simultaneous interrogation of thousands of single-nucleotide polymorphisms (SNPs). Such resources have the potential to facilitate the rapid development of high-density genetic maps, and to enable genome-wide association studies as well as molecular breeding approaches in a variety of taxa. Herein, we describe the development of a SNP genotyping resource for use in sunflower (Helianthus annuus L.). This work involved the development of a reference transcriptome assembly for sunflower, the discovery of thousands of high quality SNPs based on the generation and analysis of ca. 6 Gb of transcriptome re-sequencing data derived from multiple genotypes, the selection of 10,640 SNPs for inclusion in the genotyping array, and the use of the resulting array to screen a diverse panel of sunflower accessions as well as related wild species. The results of this work revealed a high frequency of polymorphic SNPs and relatively high level of cross-species transferability. Indeed, greater than 95% of successful SNP assays revealed polymorphism, and more than 90% of these assays could be successfully transferred to related wild species. Analysis of the polymorphism data revealed patterns of genetic differentiation that were largely congruent with the evolutionary history of sunflower, though the large number of markers allowed for finer resolution than has previously been possible.

  5. High throughput SNP discovery and validation in the pig: towards the development of a high density swine SNP chip

    USDA-ARS?s Scientific Manuscript database

    Recent developments in sequencing technology have allowed the generation of millions of short read sequences in a fast and inexpensive way. This enables the cost effective large scale identification of hundreds of thousands of SNPs needed for the development of high density SNP arrays. Currently, a ...

  6. CAGI4 Crohn's exome challenge: Marker SNP versus exome variant models for assigning risk of Crohn disease.

    PubMed

    Pal, Lipika R; Kundu, Kunal; Yin, Yizhou; Moult, John

    2017-09-01

    Understanding the basis of complex trait disease is a fundamental problem in human genetics. The CAGI Crohn's Exome challenges are providing insight into the adequacy of current disease models by requiring participants to identify which of a set of individuals has been diagnosed with the disease, given exome data. For the CAGI4 round, we developed a method that used the genotypes from exome sequencing data only to impute the status of genome wide association studies marker SNPs. We then used the imputed genotypes as input to several machine learning methods that had been trained to predict disease status from marker SNP information. We achieved the best performance using Naïve Bayes and with a consensus machine learning method, obtaining an area under the curve of 0.72, larger than other methods used in CAGI4. We also developed a model that incorporated the contribution from rare missense variants in the exome data, but this performed less well. Future progress is expected to come from the use of whole genome data rather than exomes. © 2017 Wiley Periodicals, Inc.

  7. Development of a SNP set for human identification: A set with high powers of discrimination which yields high genetic information from naturally degraded DNA samples in the Thai population.

    PubMed

    Boonyarit, Hathaichanoke; Mahasirimongkol, Surakameth; Chavalvechakul, Nuttama; Aoki, Masayuki; Amitani, Hanae; Hosono, Naoya; Kamatani, Naoyuki; Kubo, Michiaki; Lertrit, Patcharee

    2014-07-01

    This study describes the development of a SNP typing system for human identification in the Thai population, in particular for extremely degraded DNA samples. A highly informative SNP marker set for forensic identification was identified, and a multiplex PCR-based Invader assay was developed. Fifty-one highly informative autosomal SNP markers and three sex determination SNP markers were amplified in two multiplex PCR reactions and then detected using Invader assay reactions. The average PCR product size was 71 base pairs. The match probability of the 54-SNP marker set in 124 Thai individuals was 1.48×10(-21), higher than that of STR typing, suggesting that this 54-SNP marker set is beneficial for forensic identification in the Thai population. The selected SNP marker set was also evaluated in 90 artificially degraded samples, and in 128 naturally degraded DNA samples from real forensic casework which had shown no profiles or incomplete profiles when examined using a commercial STR typing system. A total of 56 degraded samples (44%) achieved the matching probability (PM) equivalent to STR gold standard analysis (successful genotyping of 44 SNP markers) for human identification. These data indicated that our novel 54-SNP marker set provides a very useful and valuable approach for forensic identification in the Thai population, especially in the case of highly to extremely degraded DNA. In summary, we have developed a set of 54 Thai-specific SNPs for human identification which have higher discrimination power than STR genotyping. The PCRs for these 54 SNP markers were successfully combined into two multiplex reactions and detected with an Invader assay. This novel SNP genotyping system also yields high levels of genetic information from naturally degraded samples, even though there are much more difficult to recover than artificially degraded samples.

  8. Accuracy of direct genomic values in Holstein bulls and cows using subsets of SNP markers

    PubMed Central

    2010-01-01

    Background At the current price, the use of high-density single nucleotide polymorphisms (SNP) genotyping assays in genomic selection of dairy cattle is limited to applications involving elite sires and dams. The objective of this study was to evaluate the use of low-density assays to predict direct genomic value (DGV) on five milk production traits, an overall conformation trait, a survival index, and two profit index traits (APR, ASI). Methods Dense SNP genotypes were available for 42,576 SNP for 2,114 Holstein bulls and 510 cows. A subset of 1,847 bulls born between 1955 and 2004 was used as a training set to fit models with various sets of pre-selected SNP. A group of 297 bulls born between 2001 and 2004 and all cows born between 1992 and 2004 were used to evaluate the accuracy of DGV prediction. Ridge regression (RR) and partial least squares regression (PLSR) were used to derive prediction equations and to rank SNP based on the absolute value of the regression coefficients. Four alternative strategies were applied to select subset of SNP, namely: subsets of the highest ranked SNP for each individual trait, or a single subset of evenly spaced SNP, where SNP were selected based on their rank for ASI, APR or minor allele frequency within intervals of approximately equal length. Results RR and PLSR performed very similarly to predict DGV, with PLSR performing better for low-density assays and RR for higher-density SNP sets. When using all SNP, DGV predictions for production traits, which have a higher heritability, were more accurate (0.52-0.64) than for survival (0.19-0.20), which has a low heritability. The gain in accuracy using subsets that included the highest ranked SNP for each trait was marginal (5-6%) over a common set of evenly spaced SNP when at least 3,000 SNP were used. Subsets containing 3,000 SNP provided more than 90% of the accuracy that could be achieved with a high-density assay for cows, and 80% of the high-density assay for young bulls

  9. Ancestry informative marker panels for African Americans based on subsets of commercially available SNP arrays.

    PubMed

    Tandon, Arti; Patterson, Nick; Reich, David

    2011-01-01

    Admixture mapping is a widely used method for localizing disease genes in African Americans. Most current methods for inferring ancestry at each locus in the genome use a few thousand single nucleotide polymorphisms (SNPs) that are very different in frequency between West Africans and European Americans, and that are required to not be in linkage disequilibrium in the ancestral populations. Modern SNP arrays provide data on hundreds of thousands of SNPs per sample, and to use these to infer ancestry, using many of the standard methods, it is necessary to choose subsets of the SNPs for analysis. Here we present panels of about 4,300 ancestry informative markers (AIMs) that are subsets respectively of SNPs on the Illumina 1 M, Illumina 650, Illumina 610, Affymetrix 6.0 and Affymetrix 5.0 arrays. To validate the usefulness of these panels, we applied them to samples that are different from the ones used to select the SNPs. The panels provide about 80% of the maximum information about African or European ancestry, even with up to 10% missing data. © 2010 Wiley-Liss, Inc.

  10. Integration of novel SSR and gene-based SNP marker loci in the chickpea genetic map and establishment of new anchor points with Medicago truncatula genome

    PubMed Central

    Nayak, Spurthi N.; Zhu, Hongyan; Varghese, Nicy; Datta, Subhojit; Choi, Hong-Kyu; Horres, Ralf; Jüngling, Ruth; Singh, Jagbir; Kavi Kishor, P. B.; Sivaramakrishnan, S.; Hoisington, Dave A.; Kahl, Günter; Winter, Peter; Cook, Douglas R.

    2010-01-01

    This study presents the development and mapping of simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers in chickpea. The mapping population is based on an inter-specific cross between domesticated and non-domesticated genotypes of chickpea (Cicer arietinum ICC 4958 × C. reticulatum PI 489777). This same population has been the focus of previous studies, permitting integration of new and legacy genetic markers into a single genetic map. We report a set of 311 novel SSR markers (designated ICCM—ICRISAT chickpea microsatellite), obtained from an SSR-enriched genomic library of ICC 4958. Screening of these SSR markers on a diverse panel of 48 chickpea accessions provided 147 polymorphic markers with 2–21 alleles and polymorphic information content value 0.04–0.92. Fifty-two of these markers were polymorphic between parental genotypes of the inter-specific population. We also analyzed 233 previously published (H-series) SSR markers that provided another set of 52 polymorphic markers. An additional 71 gene-based SNP markers were developed from transcript sequences that are highly conserved between chickpea and its near relative Medicago truncatula. By using these three approaches, 175 new marker loci along with 407 previously reported marker loci were integrated to yield an improved genetic map of chickpea. The integrated map contains 521 loci organized into eight linkage groups that span 2,602 cM, with an average inter-marker distance of 4.99 cM. Gene-based markers provide anchor points for comparing the genomes of Medicago and chickpea, and reveal extended synteny between these two species. The combined set of genetic markers and their integration into an improved genetic map should facilitate chickpea genetics and breeding, as well as translational studies between chickpea and Medicago. Electronic supplementary material The online version of this article (doi:10.1007/s00122-010-1265-1) contains supplementary material, which is

  11. Population structure and genetic diversity characterization of a sunflower association mapping population using SSR and SNP markers.

    PubMed

    Filippi, Carla V; Aguirre, Natalia; Rivas, Juan G; Zubrzycki, Jeremias; Puebla, Andrea; Cordes, Diego; Moreno, Maria V; Fusari, Corina M; Alvarez, Daniel; Heinz, Ruth A; Hopp, Horacio E; Paniego, Norma B; Lia, Veronica V

    2015-02-13

    Argentina has a long tradition of sunflower breeding, and its germplasm is a valuable genetic resource worldwide. However, knowledge of the genetic constitution and variability levels of the Argentinean germplasm is still scarce, rendering the global map of cultivated sunflower diversity incomplete. In this study, 42 microsatellite loci and 384 single nucleotide polymorphisms (SNPs) were used to characterize the first association mapping population used for quantitative trait loci mapping in sunflower, along with a selection of allied open-pollinated and composite populations from the germplasm bank of the National Institute of Agricultural Technology of Argentina. The ability of different kinds of markers to assess genetic diversity and population structure was also evaluated. The analysis of polymorphism in the set of sunflower accessions studied here showed that both the microsatellites and SNP markers were informative for germplasm characterization, although to different extents. In general, the estimates of genetic variability were moderate. The average genetic diversity, as quantified by the expected heterozygosity, was 0.52 for SSR loci and 0.29 for SNPs. Within SSR markers, those derived from non-coding regions were able to capture higher levels of diversity than EST-SSR. A significant correlation was found between SSR and SNP- based genetic distances among accessions. Bayesian and multivariate methods were used to infer population structure. Evidence for the existence of three different genetic groups was found consistently across data sets (i.e., SSR, SNP and SSR + SNP), with the maintainer/restorer status being the most prevalent characteristic associated with group delimitation. The present study constitutes the first report comparing the performance of SSR and SNP markers for population genetics analysis in cultivated sunflower. We show that the SSR and SNP panels examined here, either used separately or in conjunction, allowed consistent

  12. A high-density SNP genotyping array for Brassica napus and its ancestral diploid species based on optimised selection of single-locus markers in the allotetraploid genome.

    PubMed

    Clarke, Wayne E; Higgins, Erin E; Plieske, Joerg; Wieseke, Ralf; Sidebottom, Christine; Khedikar, Yogendra; Batley, Jacqueline; Edwards, Dave; Meng, Jinling; Li, Ruiyuan; Lawley, Cynthia Taylor; Pauquet, Jérôme; Laga, Benjamin; Cheung, Wing; Iniguez-Luy, Federico; Dyrszka, Emmanuelle; Rae, Stephen; Stich, Benjamin; Snowdon, Rod J; Sharpe, Andrew G; Ganal, Martin W; Parkin, Isobel A P

    2016-10-01

    The Brassica napus Illumina array provides genome-wide markers linked to the available genome sequence, a significant tool for genetic analyses of the allotetraploid B. napus and its progenitor diploid genomes. A high-density single nucleotide polymorphism (SNP) Illumina Infinium array, containing 52,157 markers, was developed for the allotetraploid Brassica napus. A stringent selection process employing the short probe sequence for each SNP assay was used to limit the majority of the selected markers to those represented a minimum number of times across the highly replicated genome. As a result approximately 60 % of the SNP assays display genome-specificity, resolving as three clearly separated clusters (AA, AB, and BB) when tested with a diverse range of B. napus material. This genome specificity was supported by the analysis of the diploid ancestors of B. napus, whereby 26,504 and 29,720 markers were scorable in B. oleracea and B. rapa, respectively. Forty-four percent of the assayed loci on the array were genetically mapped in a single doubled-haploid B. napus population allowing alignment of their physical and genetic coordinates. Although strong conservation of the two positions was shown, at least 3 % of the loci were genetically mapped to a homoeologous position compared to their presumed physical position in the respective genome, underlying the importance of genetic corroboration of locus identity. In addition, the alignments identified multiple rearrangements between the diploid and tetraploid Brassica genomes. Although mostly attributed to genome assembly errors, some are likely evidence of rearrangements that occurred since the hybridisation of the progenitor genomes in the B. napus nucleus. Based on estimates for linkage disequilibrium decay, the array is a valuable tool for genetic fine mapping and genome-wide association studies in B. napus and its progenitor genomes.

  13. Characterization of the Streptomyces sp. Strain C5 snp Locus and Development of snp-Derived Expression Vectors

    PubMed Central

    DeSanti, Charles L.; Strohl, William R.

    2003-01-01

    The Streptomyces sp. strain C5 snp locus is comprised of two divergently oriented genes: snpA, a metalloproteinase gene, and snpR, which encodes a LysR-like activator of snpA transcription. The transcriptional start point of snpR is immediately downstream of a strong T-N11-A inverted repeat motif likely to be the SnpR binding site, while the snpA transcriptional start site overlaps the ATG start codon, generating a leaderless snpA transcript. By using the aphII reporter gene of pIJ486 as a reporter, the plasmid-borne snpR-activated snpA promoter was ca. 60-fold more active than either the nonactivated snpA promoter or the melC1 promoter of pIJ702. The snpR-activated snpA promoter produced reporter protein levels comparable to those of the up-mutated ermE∗ promoter. The SnpR-activated snpA promoter was built into a set of transcriptional and translational fusion expression vectors which have been used for the intracellular expression of numerous daunomycin biosynthesis pathway genes from Streptomyces sp. strain C5 as well as the expression and secretion of soluble recombinant human endostatin. PMID:12620855

  14. Exploring germplasm diversity to understand the domestication process in Cicer spp. using SNP and DArT markers.

    PubMed

    Roorkiwal, Manish; von Wettberg, Eric J; Upadhyaya, Hari D; Warschefsky, Emily; Rathore, Abhishek; Varshney, Rajeev K

    2014-01-01

    To estimate genetic diversity within and between 10 interfertile Cicer species (94 genotypes) from the primary, secondary and tertiary gene pool, we analysed 5,257 DArT markers and 651 KASPar SNP markers. Based on successful allele calling in the tertiary gene pool, 2,763 DArT and 624 SNP markers that are polymorphic between genotypes from the gene pools were analyzed further. STRUCTURE analyses were consistent with 3 cultivated populations, representing kabuli, desi and pea-shaped seed types, with substantial admixture among these groups, while two wild populations were observed using DArT markers. AMOVA was used to partition variance among hierarchical sets of landraces and wild species at both the geographical and species level, with 61% of the variation found between species, and 39% within species. Molecular variance among the wild species was high (39%) compared to the variation present in cultivated material (10%). Observed heterozygosity was higher in wild species than the cultivated species for each linkage group. Our results support the Fertile Crescent both as the center of domestication and diversification of chickpea. The collection used in the present study covers all the three regions of historical chickpea cultivation, with the highest diversity in the Fertile Crescent region. Shared alleles between different gene pools suggest the possibility of gene flow among these species or incomplete lineage sorting and could indicate complicated patterns of divergence and fusion of wild chickpea taxa in the past.

  15. Exploring Germplasm Diversity to Understand the Domestication Process in Cicer spp. Using SNP and DArT Markers

    PubMed Central

    Roorkiwal, Manish; von Wettberg, Eric J.; Upadhyaya, Hari D.; Warschefsky, Emily; Rathore, Abhishek; Varshney, Rajeev K.

    2014-01-01

    To estimate genetic diversity within and between 10 interfertile Cicer species (94 genotypes) from the primary, secondary and tertiary gene pool, we analysed 5,257 DArT markers and 651 KASPar SNP markers. Based on successful allele calling in the tertiary gene pool, 2,763 DArT and 624 SNP markers that are polymorphic between genotypes from the gene pools were analyzed further. STRUCTURE analyses were consistent with 3 cultivated populations, representing kabuli, desi and pea-shaped seed types, with substantial admixture among these groups, while two wild populations were observed using DArT markers. AMOVA was used to partition variance among hierarchical sets of landraces and wild species at both the geographical and species level, with 61% of the variation found between species, and 39% within species. Molecular variance among the wild species was high (39%) compared to the variation present in cultivated material (10%). Observed heterozygosity was higher in wild species than the cultivated species for each linkage group. Our results support the Fertile Crescent both as the center of domestication and diversification of chickpea. The collection used in the present study covers all the three regions of historical chickpea cultivation, with the highest diversity in the Fertile Crescent region. Shared alleles between different gene pools suggest the possibility of gene flow among these species or incomplete lineage sorting and could indicate complicated patterns of divergence and fusion of wild chickpea taxa in the past. PMID:25010059

  16. Extensive Chromosome Homoeology among Brassiceae Species Were Revealed by Comparative Genetic Mapping with High-Density EST-Based SNP Markers in Radish (Raphanus sativus L.)‡

    PubMed Central

    Li, Feng; Hasegawa, Yoichi; Saito, Masako; Shirasawa, Sachiko; Fukushima, Aki; Ito, Toyoaki; Fujii, Hiroshi; Kishitani, Sachie; Kitashiba, Hiroyasu; Nishio, Takeshi

    2011-01-01

    A linkage map of expressed sequence tag (EST)-based markers in radish (Raphanus sativus L.) was constructed using a low-cost and high-efficiency single-nucleotide polymorphism (SNP) genotyping method named multiplex polymerase chain reaction–mixed probe dot-blot analysis developed in this study. Seven hundred and forty-six SNP markers derived from EST sequences of R. sativus were assigned to nine linkage groups with a total length of 806.7 cM. By BLASTN, 726 markers were found to have homologous genes in Arabidopsis thaliana, and 72 syntenic regions, which have great potential for utilizing genomic information of the model species A. thaliana in basic and applied genetics of R. sativus, were identified. By construction and analysis of the genome structures of R. sativus based on the 24 genomic blocks within the Brassicaceae ancestral karyotype, 23 of the 24 genomic blocks were detected in the genome of R. sativus, and half of them were found to be triplicated. Comparison of the genome structure of R. sativus with those of the A, B, and C genomes of Brassica species and that of Sinapis alba L. revealed extensive chromosome homoeology among Brassiceae species, which would facilitate transfer of the genomic information from one Brassiceae species to another. PMID:21816873

  17. Extensive chromosome homoeology among Brassiceae species were revealed by comparative genetic mapping with high-density EST-based SNP markers in radish (Raphanus sativus L.).

    PubMed

    Li, Feng; Hasegawa, Yoichi; Saito, Masako; Shirasawa, Sachiko; Fukushima, Aki; Ito, Toyoaki; Fujii, Hiroshi; Kishitani, Sachie; Kitashiba, Hiroyasu; Nishio, Takeshi

    2011-10-01

    A linkage map of expressed sequence tag (EST)-based markers in radish (Raphanus sativus L.) was constructed using a low-cost and high-efficiency single-nucleotide polymorphism (SNP) genotyping method named multiplex polymerase chain reaction-mixed probe dot-blot analysis developed in this study. Seven hundred and forty-six SNP markers derived from EST sequences of R. sativus were assigned to nine linkage groups with a total length of 806.7 cM. By BLASTN, 726 markers were found to have homologous genes in Arabidopsis thaliana, and 72 syntenic regions, which have great potential for utilizing genomic information of the model species A. thaliana in basic and applied genetics of R. sativus, were identified. By construction and analysis of the genome structures of R. sativus based on the 24 genomic blocks within the Brassicaceae ancestral karyotype, 23 of the 24 genomic blocks were detected in the genome of R. sativus, and half of them were found to be triplicated. Comparison of the genome structure of R. sativus with those of the A, B, and C genomes of Brassica species and that of Sinapis alba L. revealed extensive chromosome homoeology among Brassiceae species, which would facilitate transfer of the genomic information from one Brassiceae species to another.

  18. Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array

    PubMed Central

    2012-01-01

    Background A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and

  19. Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array.

    PubMed

    Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J

    2012-05-25

    A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been

  20. Assessment of microsatellite and SNP markers for parentage assignment in ex situ African Penguin (Spheniscus demersus) populations.

    PubMed

    Labuschagne, Christiaan; Nupen, Lisa; Kotzé, Antoinette; Grobler, Paul J; Dalton, Desiré L

    2015-10-01

    Captive management of ex situ populations of endangered species is traditionally based on pedigree information derived from studbook data. However, molecular methods could provide a powerful set of complementary tools to verify studbook records and also contribute to improving the understanding of the genetic status of captive populations. Here, we compare the utility of single nucleotide polymorphisms (SNPs) and microsatellites (MS) and two analytical methods for assigning parentage in ten families of captive African penguins held in South African facilities. We found that SNPs performed better than microsatellites under both analytical frameworks, but a combination of all markers was most informative. A subset of combined SNP (n = 14) and MS loci (n = 10) provided robust assessments of parentage. Captive or supportive breeding programs will play an important role in future African penguin conservation efforts as a source of individuals for reintroduction. Cooperation among these captive facilities is essential to facilitate this process and improve management. This study provided us with a useful set of SNP and MS markers for parentage and relatedness testing among these captive populations. Further assessment of the utility of these markers over multiple (>3) generations and the incorporation of a larger variety of relationships among individuals (e.g., half-siblings or cousins) is strongly suggested.

  1. Development and Applications of a Bovine 50,000 SNP Chip

    USDA-ARS?s Scientific Manuscript database

    To develop an Illumina iSelect high density single nucleotide polymorphism (SNP) assay for cattle, the collaborative iBMC (Illumina, USDA ARS Beltsville, University of Missouri, USDA ARS Clay Center) Consortium first performed a de novo SNP discovery project in which genomic reduced representation l...

  2. High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).

    PubMed

    Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C

    2016-03-01

    Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies.

  3. Development of an Alfalfa SNP Array and Its Use to Evaluate Patterns of Population Structure and Linkage Disequilibrium

    PubMed Central

    Li, Xuehui; Han, Yuanhong; Wei, Yanling; Acharya, Ananta; Farmer, Andrew D.; Ho, Julie; Monteros, Maria J.; Brummer, E. Charles

    2014-01-01

    A large set of genome-wide markers and a high-throughput genotyping platform can facilitate the genetic dissection of complex traits and accelerate molecular breeding applications. Previously, we identified about 0.9 million SNP markers by sequencing transcriptomes of 27 diverse alfalfa genotypes. From this SNP set, we developed an Illumina Infinium array containing 9,277 SNPs. Using this array, we genotyped 280 diverse alfalfa genotypes and several genotypes from related species. About 81% (7,476) of the SNPs met the criteria for quality control and showed polymorphisms. The alfalfa SNP array also showed a high level of transferability for several closely related Medicago species. Principal component analysis and model-based clustering showed clear population structure corresponding to subspecies and ploidy levels. Within cultivated tetraploid alfalfa, genotypes from dormant and nondormant cultivars were largely assigned to different clusters; genotypes from semidormant cultivars were split between the groups. The extent of linkage disequilibrium (LD) across all genotypes rapidly decayed to 26 Kbp at r2 = 0.2, but the rate varied across ploidy levels and subspecies. A high level of consistency in LD was found between and within the two subpopulations of cultivated dormant and nondormant alfalfa suggesting that genome-wide association studies (GWAS) and genomic selection (GS) could be conducted using alfalfa genotypes from throughout the fall dormancy spectrum. However, the relatively low LD levels would require a large number of markers to fully saturate the genome. PMID:24416217

  4. Genetic Variation and Breeding Signature in Mass Selection Lines of the Pacific Oyster (Crassostrea gigas) Assessed by SNP Markers

    PubMed Central

    Zhong, Xiaoxiao; Feng, Dandan; Yu, Hong; Kong, Lingfeng; Li, Qi

    2016-01-01

    In breeding industries, a challenging problem is how to keep genetic diversity over generations. To investigate genetic variation and identify breeding signatures in mass selected lines of Pacific oyster (Crassostrea gigas), three sixth-generation selected lines and four wild populations were assessed using 103 single nucleotide polymorphism (SNP) markers. The genetic diversity data indicated that the selected lines exhibited a significant reduction in the observed heterozygosity and observed number of alleles per locus compared with the wild populations (P≤0.05), indicating the selected lines tended to lose genetic diversity contrasted with the wild populations. The unweighted pair-group method with arithmetic mean (UPGMA) analysis showed that the wild populations and selected lines were not separated into two groups. Using four outlier tests, a total of 17 loci were found under selection at two levels. The global outlier detection suggested that 4 common outlier loci were subject to selection using both the hierarchical island model and Bayesian likelihood approaches. At regional level, 3 SNPs were detected as outlier using at least two outlier tests and one outlier SNP (CgSNP309) was overlapped in the two wild-selected population comparisons. The candidate outlier SNPs provide valuable resources for future association studies in C. gigas. PMID:26954577

  5. Identification and Validation of SNP Markers Linked to Dwarf Traits Using SLAF-Seq Technology in Lagerstroemia

    PubMed Central

    Ju, Yiqian; Jiao, Yao; Feng, Lu; Pan, Huitang; Cheng, Tangren; Zhang, Qixiang

    2016-01-01

    The genetic control of plant architecture is a promising approach to breed desirable cultivars, particularly in ornamental flowers. In this study, the F1 population (142 seedlings) derived from Lagerstroemia fauriei (non-dwarf) × L. indica ‘Pocomoke’ (dwarf) was phenotyped for six traits (plant height (PH), internode length (IL), internode number, primary lateral branch height (PLBH), secondary lateral branch height and primary branch number), and the IL and PLBH traits were positively correlated with the PH trait and considered representative indexes of PH. Fifty non-dwarf and dwarf seedlings were pooled and subjected to a specific-locus amplified fragment sequencing (SLAF-seq) method, which screened 1221 polymorphic markers. A total of 3 markers segregating between bulks were validated in the F1 population, with the M16337 and M38412 markers highly correlated with the IL trait and the M25207 marker highly correlated with the PLBH trait. These markers provide a predictability of approximately 80% using a single marker (M25207) and a predictability of 90% using marker combinations (M16337 + M25207) in the F1 population, which revealed that the IL and the PLBH traits, especially the PLBH, were the decisive elements for PH in terms of molecular regulation. Further validation was performed in the BC1 population and a set of 28 Lagerstroemia stocks using allele-specific PCR (AS-PCR) technology, and the results showed the stability and reliability of the SNP markers and the co-determination of PH by multiple genes. Our findings provide an important theoretical and practical basis for the early prediction and indirect selection of PH using the IL and the PLBH, and the detected SNPs may be useful for marker-assisted selection (MAS) in crape myrtle. PMID:27404662

  6. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

    USDA-ARS?s Scientific Manuscript database

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ~4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification pr...

  7. Development and validation of a low-density SNP panel related to prolificacy in sheep

    USDA-ARS?s Scientific Manuscript database

    High-density SNP panels (e.g., 50,000 and 600,000 markers) have been used in exploratory population genetic studies with commercial and minor breeds of sheep. However, routine genetic diversity evaluations of large numbers of samples with large panels are in general cost-prohibitive for gene banks. ...

  8. Identification of QTL and Qualitative Trait Loci for Agronomic Traits Using SNP Markers in the Adzuki Bean

    PubMed Central

    Li, Yuan; Yang, Kai; Yang, Wei; Chu, Liwei; Chen, Chunhai; Zhao, Bo; Li, Yisong; Jian, Jianbo; Yin, Zhichao; Wang, Tianqi; Wan, Ping

    2017-01-01

    The adzuki bean (Vigna angularis) is an important grain legume. Fine mapping of quantitative trait loci (QTL) and qualitative trait genes plays an important role in gene cloning, molecular-marker-assisted selection (MAS), and trait improvement. However, the genetic control of agronomic traits in the adzuki bean remains poorly understood. Single-nucleotide polymorphisms (SNPs) are invaluable in the construction of high-density genetic maps. We mapped 26 agronomic QTLs and five qualitative trait genes related to pigmentation using 1,571 polymorphic SNP markers from the adzuki bean genome via restriction-site-associated DNA sequencing of 150 members of an F2 population derived from a cross between cultivated and wild adzuki beans. We mapped 11 QTLs for flowering time and pod maturity on chromosomes 4, 7, and 10. Six 100-seed weight (SD100WT) QTLs were detected. Two major flowering time QTLs were located on chromosome 4, firstly VaFld4.1 (PEVs 71.3%), co-segregating with SNP marker s690-144110, and VaFld4.2 (PEVs 67.6%) at a 0.974 cM genetic distance from the SNP marker s165-116310. Three QTLs for seed number per pod (Snp3.1, Snp3.2, and Snp4.1) were mapped on chromosomes 3 and 4. One QTL VaSdt4.1 of seed thickness (SDT) and three QTLs for branch number on the main stem were detected on chromosome 4. QTLs for maximum leaf width (LFMW) and stem internode length were mapped to chromosomes 2 and 9, respectively. Trait genes controlling the color of the seed coat, pod, stem and flower were mapped to chromosomes 3 and 1. Three candidate genes, VaAGL, VaPhyE, and VaAP2, were identified for flowering time and pod maturity. VaAGL encodes an agamous-like MADS-box protein of 379 amino acids. VaPhyE encodes a phytochrome E protein of 1,121 amino acids. Four phytochrome genes (VaPhyA1, VaPhyA2, VaPhyB, and VaPhyE) were identified in the adzuki bean genome. We found candidate genes VaAP2/ERF.81 and VaAP2/ERF.82 of SD100WT, VaAP2-s4 of SDT, and VaAP2/ERF.86 of LFMW. A candidate gene

  9. [Artificial selection for cattle based on high-density SNP markers].

    PubMed

    Liu, Xi-Dong; Wang, Zhi-Peng; Fan, Hui-Zhong; Li, Jun-Ya; Gao, Hui-Jiang

    2012-10-01

    With the implementation of genetic improvement in recent years, artificial selection has greatly improved beef cattle production performance and its genetic basis has been dramatically changed. In this study, based on the Illumina BovineSNP50 (54K) and BovineHD (770K) BeadChip and the FST value, we analyzed the genetic differentiation of cattle and screened the imprints of selection in bovine genome. Finally, we found 47104 OUTLIER SNP loci and 3064 candidate genes, for example, CLIC5, TG, CACNA2D1, and FSHR etc. The biological processes and molecular functions of genes were analyzed through gene annotation.The results of this study established a genome-wide map of selection footprints in beef cattle genome and a clue for in-depth study of artificial selection and understanding of biological evolution.Our results indicate that artificial selection has played an important role in cattle breed genetic improvement.

  10. Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety 'Amrapali' (Mangifera indica L.).

    PubMed

    Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

    2016-01-01

    Mango (Mangifera indica L.) is called "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango.

  11. Transcriptome sequencing for high throughput SNP development and genetic mapping in Pea

    PubMed Central

    2014-01-01

    Background Pea has a complex genome of 4.3 Gb for which only limited genomic resources are available to date. Although SNP markers are now highly valuable for research and modern breeding, only a few are described and used in pea for genetic diversity and linkage analysis. Results We developed a large resource by cDNA sequencing of 8 genotypes representative of modern breeding material using the Roche 454 technology, combining both long reads (400 bp) and high coverage (3.8 million reads, reaching a total of 1,369 megabases). Sequencing data were assembled and generated a 68 K unigene set, from which 41 K were annotated from their best blast hit against the model species Medicago truncatula. Annotated contigs showed an even distribution along M. truncatula pseudochromosomes, suggesting a good representation of the pea genome. 10 K pea contigs were found to be polymorphic among the genetic material surveyed, corresponding to 35 K SNPs. We validated a subset of 1538 SNPs through the GoldenGate assay, proving their ability to structure a diversity panel of breeding germplasm. Among them, 1340 were genetically mapped and used to build a new consensus map comprising a total of 2070 markers. Based on blast analysis, we could establish 1252 bridges between our pea consensus map and the pseudochromosomes of M. truncatula, which provides new insight on synteny between the two species. Conclusions Our approach created significant new resources in pea, i.e. the most comprehensive genetic map to date tightly linked to the model species M. truncatula and a large SNP resource for both academic research and breeding. PMID:24521263

  12. Development and analysis of a 20K SNP array for potato (Solanum tuberosum): an insight into the breeding history.

    PubMed

    Vos, Peter G; Uitdewilligen, Jan G A M L; Voorrips, Roeland E; Visser, Richard G F; van Eck, Herman J

    2015-12-01

    A 20K SNP array was developed and a comprehensive set of tetraploid cultivar was genotyped. This allowed us to identify footprints of the breeding history in contemporary breeding material such as identification of introgression segments, selection and founder signatures. A non-redundant subset of 15,138 previously identified SNPs and 4454 SNPs originating from the SolCAP project were combined into a 20k Infinium SNP array for genotyping a total of 569 potato genotypes. In this study we describe how this SNP array (encoded SolSTW array) was designed and analysed with fitTetra, software designed for autotetraploids. Genotypes from different countries and market segments, complemented with historic cultivars and important progenitors, were genotyped. This comprehensive set of genotypes combined with the deliberate inclusion of a large proportion of SNPs with a low minor allele frequency allowed us to distinguish genetic variation contributed by introgression breeding. This "new" (post 1945) genetic variation is located on specific chromosomal regions and enables the identification of SNP markers linked to R-genes. In addition, when the genetic composition of modern cultivars was compared with cultivars released before 1945, it appears that 96% of the genetic variants present in those ancestral cultivars remains polymorphic in modern cultivars. Hence, genetic erosion is almost absent in potato. Finally, we studied population genetic processes shaping the genetic composition of the modern European potato including drift, selection and founder effects. This resulted in the identification of major founders contributing to contemporary germplasm.

  13. A Picea abies Linkage Map Based on SNP Markers Identifies QTLs for Four Aspects of Resistance to Heterobasidion parviporum Infection

    PubMed Central

    Lind, Mårten; Källman, Thomas; Chen, Jun; Ma, Xiao-Fei; Bousquet, Jean; Morgante, Michele; Zaina, Giusi; Karlsson, Bo; Elfstrand, Malin; Lascoux, Martin; Stenlid, Jan

    2014-01-01

    A consensus linkage map of Picea abies, an economically important conifer, was constructed based on the segregation of 686 SNP markers in a F1 progeny population consisting of 247 individuals. The total length of 1889.2 cM covered 96.5% of the estimated genome length and comprised 12 large linkage groups, corresponding to the number of haploid P. abies chromosomes. The sizes of the groups (from 5.9 to 9.9% of the total map length) correlated well with previous estimates of chromosome sizes (from 5.8 to 10.8% of total genome size). Any locus in the genome has a 97% probability to be within 10 cM from a mapped marker, which makes the map suited for QTL mapping. Infecting the progeny trees with the root rot pathogen Heterobasidion parviporum allowed for mapping of four different resistance traits: lesion length at the inoculation site, fungal spread within the sapwood, exclusion of the pathogen from the host after initial infection, and ability to prevent the infection from establishing at all. These four traits were associated with two, four, four and three QTL regions respectively of which none overlapped between the traits. Each QTL explained between 4.6 and 10.1% of the respective traits phenotypic variation. Although the QTL regions contain many more genes than the ones represented by the SNP markers, at least four markers within the confidence intervals originated from genes with known function in conifer defence; a leucoanthocyanidine reductase, which has previously been shown to upregulate during H. parviporum infection, and three intermediates of the lignification process; a hydroxycinnamoyl CoA shikimate/quinate hydroxycinnamoyltransferase, a 4-coumarate CoA ligase, and a R2R3-MYB transcription factor. PMID:25036209

  14. Using online databases for developing SNP markers of forensic interest.

    PubMed

    Phillips, Christopher

    2005-01-01

    In this chapter we review and compare the online single nucleotide polymorphism databases that are now available as research tools. We give an outline of the search strategies that can be used to ensure the most appropriate loci for forensic applications are chosen.

  15. Identification of Korean-specific SNP markers from whole-exome sequencing data.

    PubMed

    Kim, Sung Min; Yoo, Seong Yeon; Nam, Soo Hyun; Lee, Jae Moon; Chung, Ki Wha

    2016-05-01

    Analysis of large numbers of single-nucleotide polymorphisms (SNPs) can increase individual discrimination power, and, particularly, it can supply important evidence for kinship or ethnic identification. We identified 300 Korean-specific SNPs from 306 Korean whole-exome sequencing (WES) data. Functionally significant SNPs (variants in splicing site, missense, nonsense, and exonic indels) were filtered out from the variant pool, and SNPs with minor allele frequencies (MAFs) of <0.3 in the 1000 Genomes (1000G) database but >0.3 in the Korean population were selected. Genotypes obtained from WES were confirmed by the Sanger sequencing method. The identified markers were evenly distributed throughout the autosomal chromosomes. All the SNPs were in the Hardy-Weinberg equilibrium with a mean MAF of 0.415 (0.161 in 1000G). The mean heterozygosities were 0.476 (observed) and 0.470 (experimental). The combined power of discrimination was very high. Korean MAFs in most SNPs were similar to those for the Chinese and Japanese populations, but were significantly higher than those for several other ethnic populations. These selected SNPs will be used to develop forensic markers and are expected to be widely used for additional individual identification, ethnic discrimination, and linkage analysis for kinship tests.

  16. Development of a maize 55 K SNP array with improved genome coverage for molecular breeding.

    PubMed

    Xu, Cheng; Ren, Yonghong; Jian, Yinqiao; Guo, Zifeng; Zhang, Yan; Xie, Chuanxiao; Fu, Junjie; Wang, Hongwu; Wang, Guoying; Xu, Yunbi; Li, Ping; Zou, Cheng

    2017-01-01

    With the decrease of cost in genotyping, single nucleotide polymorphisms (SNPs) have gained wide acceptance because of their abundance, even distribution throughout the maize (Zea mays L.) genome, and suitability for high-throughput analysis. In this study, a maize 55 K SNP array with improved genome coverage for molecular breeding was developed on an Affymetrix® Axiom® platform with 55,229 SNPs evenly distributed across the genome, including 22,278 exonic and 19,425 intronic SNPs. This array contains 451 markers that are associated with 368 known genes and two traits of agronomic importance (drought tolerance and kernel oil biosynthesis), 4067 markers that are not covered by the current reference genome, 734 markers that are differentiated significantly between heterotic groups, and 132 markers that are tags for important transgenic events. To evaluate the performance of 55 K array, we genotyped 593 inbred lines with diverse genetic backgrounds. Compared with the widely-used Illumina® MaizeSNP50 BeadChip, our 55 K array has lower missing and heterozygous rates and more SNPs with lower minor allele frequency (MAF) in tropical maize, facilitating in-depth dissection of rare but possibly valuable variation in tropical germplasm resources. Population structure and genetic diversity analysis revealed that this 55 K array is also quite efficient in resolving heterotic groups and performing fine fingerprinting of germplasm. Therefore, this maize 55 K SNP array is a potentially powerful tool for germplasm evaluation (including germplasm fingerprinting, genetic diversity analysis, and heterotic grouping), marker-assisted breeding, and primary quantitative trait loci (QTL) mapping and genome-wide association study (GWAS) for both tropical and temperate maize.

  17. Development of genetic markers distinguishing two invasive fire ant species (Hymenoptera: Formicidae) and their hybrids

    USDA-ARS?s Scientific Manuscript database

    Three SNP markers were developed that are completely diagnostic in distinguishing the two fire ant species Solenopsis invicta and S. richteri. Although a fourth marker we developed is not fully diagnostic, it is still useful given one of the variants is confined to S. richteri. Joint use of these ma...

  18. Development of a SNP-based panel for human identification for Indian populations.

    PubMed

    Sarkar, Anujit; Nandineni, Madhusudan R

    2017-03-01

    The widely employed short tandem repeat (STR)-based panels for forensic human identification (HID) have limitations while dealing with challenging forensic samples involving DNA degradation, resulting in dropping-out of higher molecular weight alleles/loci. To address this issue, bialleic markers like single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), which can be scored even when the template DNA is heavily degraded (<100bp), have been suggested as alternative markers for HID testing. Recent studies have highlighted their utility in forensic HID and several panels based on biallelic markers have been described for worldwide populations. However, there has been very little information about the behavior of such DNA markers in Indian populations, which is known to possess great genetic diversity. This study describes a two-step approach for designing a SNP-based panel consisting of 70 SNPs for HID testing in Indian populations. In the first step, candidate SNPs were shortlisted from public databases by screening them for several criteria including allelic distribution, genomic location, potential phenotypic expression or functionality and species specificity. The second step involved genotyping the shortlisted SNPs in various Indian populations followed by shortlisting of the best performers for identity-testing. Starting with 592,652 SNPs listed in Human660W-Quad Beadchip (Illumina Inc.), we shortlisted 275 candidate SNPs for identity-testing and genotyped them in 462 unrelated individuals from different population groups in India. Post genotyping and statistical analyses based on biogeographic regions, 206 SNPs demonstrated desired allelic distribution (Heterozygosity≥0.4 and FST≤0.02), from which 2-4 widely separated (>20 Mb apart) SNPs from each chromosome were finally selected to construct a panel of 70 SNPs. This panel on average possessed match probability 10e-29 and probability of paternity of 0.99999997, which was orders of

  19. Characterisation of microsatellite and SNP markers from Miseq and genotyping-by-sequencing data among parapatric Urophora cardui (Tephritidae) populations.

    PubMed

    Johannesen, Jes; Fabritzek, Armin G; Ebner, Bettina; Bikar, Sven-Ernö

    2017-01-01

    Phylogeographic analyses of the gall fly Urophora cardui have in earlier studies based on allozymes and mtDNA identified small-scale, parapatrically diverged populations within an expanding Western Palearctic population. However, the low polymorphism of these markers prohibited an accurate delimitation of the evolutionary origin of the parapatric divergence. Urophora cardui from the Western Palearctic have been introduced into Canada as biological control agents of the host plant Cirsium arvense. Here, we characterise 12 microsatellite loci with hexa-, penta- and tetra-nucleotide repeat motifs and report a genotyping-by-sequencing SNP protocol. We test the markers for genetic variation among three parapatric U. cardui populations. Microsatellite variability (N = 59 individuals) was high: expected heterozygosity/locus/population (0.60-0.90), allele number/locus/population (5-21). One locus was alternatively sex-linked in males or females. Cross-species amplification in the sister species U. stylata was successful or partially successful for seven loci. For genotyping-by-sequencing (N = 18 individuals), different DNA extraction methods did not affect data quality. Depending on sequence sorting criteria, 1,177-2,347 unlinked SNPs and 1,750-4,469 parsimony informative sites were found in 3,514-5,767 loci recovered after paralog filtering. Both marker systems quantified the same population partitions with high probabilities. Many and highly differentiated loci in both marker systems indicate genome-wide diversification and genetically distinct populations.

  20. Association of single nucleotide polymorphism (SNP) markers in candidate genes and QTL regions with pork quality traits in commercial pigs.

    PubMed

    Rohrer, G A; Nonneman, D J; Miller, R K; Zerby, H; Moeller, S J

    2012-12-01

    Numerous reports have described genetic markers or genomic regions (QTL) associated with pork quality and/or palatability but few validation studies have been reported. Therefore, 156 SNP markers from 45 candidate genes and eight QTL regions were analyzed for association with pork quality and palatability traits from 888 pork loins. Loins were collected at three slaughter facilities and selected to represent a wide range of pork color, pH and marbling. Phenotypic data recorded included objective and subjective measures of color and marbling, purge loss, shear force, and cooking loss. Data were analyzed with SAS PROC MIXED where loin was fit as a random effect. Results indicated some of the markers tested should be useful in industry, while others are not segregating in all populations or linkage disequilibrium between markers and causative genetic variation fluctuates among populations limiting their universal utility. Genes with the largest effects on pork quality were MC4R, IGF2, CAST and PRKAG3. Published by Elsevier Ltd.

  1. Selection and use of SNP markers for animal identification and paternity analysis in U.S. beef cattle.

    PubMed

    Heaton, Michael P; Harhay, Gregory P; Bennett, Gary L; Stone, Roger T; Grosse, W Michael; Casas, Eduardo; Keele, John W; Smith, Timothy P L; Chitko-McKown, Carol G; Laegreid, William W

    2002-05-01

    DNA marker technology represents a promising means for determining the genetic identity and kinship of an animal. Compared with other types of DNA markers, single nucleotide polymorphisms (SNPs) are attractive because they are abundant, genetically stable, and amenable to high-throughput automated analysis. In cattle, the challenge has been to identify a minimal set of SNPs with sufficient power for use in a variety of popular breeds and crossbred populations. This report describes a set of 32 highly informative SNP markers distributed among 18 autosomes and both sex chromosomes. Informativity of these SNPs in U.S. beef cattle populations was estimated from the distribution of allele and genotype frequencies in two panels: one consisting of 96 purebred sires representing 17 popular breeds, and another with 154 purebred American Angus from six herds in four Midwestern states. Based on frequency data from these panels, the estimated probability that two randomly selected, unrelated individuals will possess identical genotypes for all 32 loci was 2.0 x 10(-13) for multi-breed composite populations and 1.9 x 10(-10) for purebred Angus populations. The probability that a randomly chosen candidate sire will be excluded from paternity was estimated to be 99.9% and 99.4% for the same respective populations. The DNA immediately surrounding the 32 target SNPs was sequenced in the 96 sires of the multi-breed panel and found to contain an additional 183 polymorphic sites. Knowledge of these additional sites, together with the 32 target SNPs, allows the design of robust, accurate genotype assays on a variety of high-throughput SNP genotyping platforms.

  2. Candidate SNP Markers of Chronopathologies Are Predicted by a Significant Change in the Affinity of TATA-Binding Protein for Human Gene Promoters

    PubMed Central

    Ponomarenko, Petr; Rasskazov, Dmitry; Suslov, Valentin; Sharypova, Ekaterina; Savinkova, Ludmila; Podkolodnaya, Olga; Podkolodny, Nikolay L.; Tverdokhleb, Natalya N.; Chadaeva, Irina; Kolchanov, Nikolay

    2016-01-01

    Variations in human genome (e.g., single nucleotide polymorphisms, SNPs) may be associated with hereditary diseases, their complications, comorbidities, and drug responses. Using Web service SNP_TATA_Comparator presented in our previous paper, here we analyzed immediate surroundings of known SNP markers of diseases and identified several candidate SNP markers that can significantly change the affinity of TATA-binding protein for human gene promoters, with circadian consequences. For example, rs572527200 may be related to asthma, where symptoms are circadian (worse at night), and rs367732974 may be associated with heart attacks that are characterized by a circadian preference (early morning). By the same method, we analyzed the 90 bp proximal promoter region of each protein-coding transcript of each human gene of the circadian clock core. This analysis yielded 53 candidate SNP markers, such as rs181985043 (susceptibility to acute Q fever in male patients), rs192518038 (higher risk of a heart attack in patients with diabetes), and rs374778785 (emphysema and lung cancer in smokers). If they are properly validated according to clinical standards, these candidate SNP markers may turn out to be useful for physicians (to select optimal treatment for each patient) and for the general population (to choose a lifestyle preventing possible circadian complications of diseases). PMID:27635400

  3. Development and validation of a novel single nucleotide polymorphism (SNP) panel for genetic analysis of Blastomyces spp. and association analysis.

    PubMed

    Frost, Holly M; Anderson, Jennifer L; Ivacic, Lynn; Sloss, Brian L; Embil, John; Meece, Jennifer K

    2016-09-23

    Single nucleotide polymorphism (SNP) genotyping is increasingly being utilized for molecular typing of pathogens and is cost-effective, especially for large numbers of isolates. The goals of this study were 1) to develop and validate a SNP assay panel for genetic analysis of Blastomyces spp., 2) ascertain whether microsatellite genotyping and the SNP genotyping with the developed panel resolve identical genetic groups, and 3) explore the utility of SNPs for examining phylogenetic and virulence questions in humans. Three hundred sixty unique Blastomyces spp. isolates previously genotyped with microsatellite markers were genotyped with the MassARRAY® SNP genotyping system (Agena Bioscience™, San Diego, CA), for a custom panel of 28 SNPs. Clinical presentation data was analyzed for association with SNP variants. Three hundred twenty-three Blastomyces spp. isolates (90 %) were successfully genotyped by SNP analysis, with results obtained for at least 27 of 28 assays. For 99.7 % of isolates tested by both genotyping methods, microsatellite genetic group assignment correlated with species assignment based on internal transcribed spacer 2 (ITS2) genotyping, with Group 1 (Gr 1) being equivalent to B. gilchristii and Group 2 (Gr 2) being equivalent to B. dermatitidis. Thirteen isolates were genetic hybrids by one or both methods of genotyping and were difficult to assign to a particular genetic group or species. Fifteen SNP loci showed significantly different alleles in cases of pulmonary vs disseminated disease, at a p-value of <0.01 or less. This study is the largest genotyping study of Blastomyces spp. isolates and presents a new method for genetic analysis with which to further explore the relationship between the genetic diversity in Blastomyces spp. and clinical disease presentation. We demonstrated that microsatellite Gr 1 is equivalent to B. gilchristii and Gr 2 is equivalent to B. dermatitidis. We also discovered potential evidence of infrequent recombination

  4. Genetic Contribution of Ningmai 9 Wheat to Its Derivatives Evaluated by Using SNP Markers

    PubMed Central

    Jiang, Peng; Zhang, Ping-Ping

    2016-01-01

    Founder parent usually plays an important role in wheat breeding. Ningmai 9 is a soft wheat variety with good performance in yield, quality, and resistance to wheat disease. Therefore it serves as an important commercial variety and founder parent in middle and lower Yangtze River of China. To date, 20 new cultivars have been developed from Ningmai 9 and released to wheat production in the last 10 years. In this study, the 90K iSELECT ILLUMINA chip was used to analyze the genotype of Ningmai 9 and its 17 derivatives. The genetic similarity coefficients between Ningmai 9 and its derivatives were more than 0.7 except for Yangfumai 4. Neighbor-Joining analysis showed that Yangfumai 4 had the largest genetic distance from Ningmai 9 in all derivatives. There was a great difference for the same allele ratio in either derivatives or chromosomes, though the average values of the same allele ratio in genomes A, B, and D were close to each other. The phenotypic difference in Ningmai 9, Ningmai 13, and Yangfumai 4 was consistent with their difference in genetic background by comparing previous reported QTLs. Some hot chromosome regions were found and might be used for marker assisted selection in wheat breeding. PMID:27652255

  5. Identifying Litchi (Litchi chinensis Sonn.) Cultivars and Their Genetic Relationships Using Single Nucleotide Polymorphism (SNP) Markers

    PubMed Central

    Liu, Wei; Xiao, Zhidan; Bao, Xiuli; Yang, Xiaoyan; Fang, Jing; Xiang, Xu

    2015-01-01

    Litchi is an important fruit tree in tropical and subtropical areas of the world. However, there is widespread confusion regarding litchi cultivar nomenclature and detailed information of genetic relationships among litchi germplasm is unclear. In the present study, the potential of single nucleotide polymorphism (SNP) for the identification of 96 representative litchi accessions and their genetic relationships in China was evaluated using 155 SNPs that were evenly spaced across litchi genome. Ninety SNPs with minor allele frequencies above 0.05 and a good genotyping success rate were used for further analysis. A relatively high level of genetic variation was observed among litchi accessions, as quantified by the expected heterozygosity (He = 0.305). The SNP based multilocus matching identified two synonymous groups, ‘Heiye’ and ‘Wuye’, and ‘Chengtuo’ and ‘Baitangli 1’. A subset of 14 SNPs was sufficient to distinguish all the non-redundant litchi genotypes, and these SNPs were proven to be highly stable by repeated analyses of a selected group of cultivars. Unweighted pair-group method of arithmetic averages (UPGMA) cluster analysis divided the litchi accessions analyzed into four main groups, which corresponded to the traits of extremely early-maturing, early-maturing, middle-maturing, and late-maturing, indicating that the fruit maturation period should be considered as the primary criterion for litchi taxonomy. Two subpopulations were detected among litchi accessions by STRUCTURE analysis, and accessions with extremely early- and late-maturing traits showed membership coefficients above 0.99 for Cluster 1 and Cluster 2, respectively. Accessions with early- and middle-maturing traits were identified as admixture forms with varying levels of membership shared between the two clusters, indicating their hybrid origin during litchi domestication. The results of this study will benefit litchi germplasm conservation programs and facilitate maximum

  6. The case of the unreliable SNP: recurrent back-mutation of Y-chromosomal marker P25 through gene conversion.

    PubMed

    Adams, Susan M; King, Turi E; Bosch, Elena; Jobling, Mark A

    2006-05-25

    The Y-chromosomal binary marker P25 is a paralogous sequence variant, rather than a SNP: three copies of the P25 sequence lie within the giant palindromic repeats on Yq, and one copy has undergone a C to A transversion to define haplogroup R1b (designated C/C/A). Since gene conversion is known to be active in the palindromic repeats, we reasoned that P25 might be liable to back-mutation by gene conversion, yielding the ancestral state C/C/C. Through analysis of a set of binary markers in Y-chromosomes in two large samples from Great Britain and the Iberian Peninsula we show that such conversion events have occurred at least twice, and provide preliminary evidence that the reverse conversion event (yielding C/A/A) has also occurred. Because of its inherent instability, we suggest that P25 be used with caution in forensic studies, and perhaps replaced with the more reliable binary marker M269.

  7. Development and implementation of high-throughput SNP genotyping in barley.

    PubMed

    Close, Timothy J; Bhat, Prasanna R; Lonardi, Stefano; Wu, Yonghui; Rostoks, Nils; Ramsay, Luke; Druka, Arnis; Stein, Nils; Svensson, Jan T; Wanamaker, Steve; Bozdag, Serdar; Roose, Mikeal L; Moscou, Matthew J; Chao, Shiaoman; Varshney, Rajeev K; Szucs, Péter; Sato, Kazuhiro; Hayes, Patrick M; Matthews, David E; Kleinhofs, Andris; Muehlbauer, Gary J; DeYoung, Joseph; Marshall, David F; Madishetty, Kavitha; Fenton, Raymond D; Condamine, Pascal; Graner, Andreas; Waugh, Robbie

    2009-12-04

    High density genetic maps of plants have, nearly without exception, made use of marker datasets containing missing or questionable genotype calls derived from a variety of genic and non-genic or anonymous markers, and been presented as a single linear order of genetic loci for each linkage group. The consequences of missing or erroneous data include falsely separated markers, expansion of cM distances and incorrect marker order. These imperfections are amplified in consensus maps and problematic when fine resolution is critical including comparative genome analyses and map-based cloning. Here we provide a new paradigm, a high-density consensus genetic map of barley based only on complete and error-free datasets and genic markers, represented accurately by graphs and approximately by a best-fit linear order, and supported by a readily available SNP genotyping resource. Approximately 22,000 SNPs were identified from barley ESTs and sequenced amplicons; 4,596 of them were tested for performance in three pilot phase Illumina GoldenGate assays. Data from three barley doubled haploid mapping populations supported the production of an initial consensus map. Over 200 germplasm selections, principally European and US breeding material, were used to estimate minor allele frequency (MAF) for each SNP. We selected 3,072 of these tested SNPs based on technical performance, map location, MAF and biological interest to fill two 1536-SNP "production" assays (BOPA1 and BOPA2), which were made available to the barley genetics community. Data were added using BOPA1 from a fourth mapping population to yield a consensus map containing 2,943 SNP loci in 975 marker bins covering a genetic distance of 1099 cM. The unprecedented density of genic markers and marker bins enabled a high resolution comparison of the genomes of barley and rice. Low recombination in pericentric regions is evident from bins containing many more than the average number of markers, meaning that a large number of

  8. Detection of single nucleotide polymorphism (SNP) controlling the waxy character in wheat by using a derived cleaved amplified polymorphic sequence (dCAPS) marker.

    PubMed

    Yanagisawa, T; Kiribuchi-Otobe, C; Hirano, H; Suzuki, Y; Fujita, M

    2003-06-01

    We investigated a single nucleotide polymorphism (SNP) in the Wx-D1 gene, which was found in a mutant waxy wheat, and which expressed the Wx-D1 protein (granule-bound starch synthase I) as shown by immunoblot analysis. We also assayed starch synthase activity of granule-bound proteins. Using 22 doubled-haploid (DH) lines and 172 F(5) lines derived from the wild type x the mutant, we detected SNP via a PCR-based (dCAPS) marker. Amplified PCR products from Wx-D1 gene-specific primers, followed by mismatched primers designed for dCAPS analysis, were digested with the appropriate restriction enzyme. The two alleles, and the heterozygote genotype were easily and rapidly discriminated by gel-electrophoresis resolution to reveal SNP. All progeny lines that have the SNP of the mutant allele were waxy. Integrating the results of dCAPS analysis, immunoblot analysis and assays of starch synthase activity of granule-bound proteins indicates that the SNP in the Wx-D1 gene was responsible for its waxy character. This dCAPS marker is therefore useful as a marker to introduce the mutant allele into elite breeding lines.

  9. Genetic Map of Triticale Integrating Microsatellite, DArT and SNP Markers

    PubMed Central

    Tyrka, Mirosław; Tyrka, Dorota; Wędzony, Maria

    2015-01-01

    Triticale (×Triticosecale Wittm) is an economically important crop for fodder and biomass production. To facilitate the identification of markers for agronomically important traits and for genetic and genomic characteristics of this species, a new high-density genetic linkage map of triticale was constructed using doubled haploid (DH) population derived from a cross between cultivars ‘Hewo’ and ‘Magnat’. The map consists of 1615 bin markers, that represent 50 simple sequence repeat (SSR), 842 diversity array technology (DArT), and 16888 DArTseq markers mapped onto 20 linkage groups assigned to the A, B, and R genomes of triticale. No markers specific to chromosome 7R were found, instead mosaic linkage group composed of 1880 highly distorted markers (116 bins) from 10 wheat chromosomes was identified. The genetic map covers 4907 cM with a mean distance between two bins of 3.0 cM. Comparative analysis in respect to published maps of wheat, rye and triticale revealed possible deletions in chromosomes 4B, 5A, and 6A, as well as inversion in chromosome 7B. The number of bin markers in each chromosome varied from 24 in chromosome 3R to 147 in chromosome 6R. The length of individual chromosomes ranged between 50.7 cM for chromosome 2R and 386.2 cM for chromosome 7B. A total of 512 (31.7%) bin markers showed significant (P < 0.05) segregation distortion across all chromosomes. The number of 8 the segregation distorted regions (SDRs) were identified on 1A, 7A, 1B, 2B, 7B (2 SDRs), 5R and 6R chromosomes. The high-density genetic map of triticale will facilitate fine mapping of quantitative trait loci, the identification of candidate genes and map-based cloning. PMID:26717308

  10. Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp.

    PubMed

    Hulse-Kemp, Amanda M; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L; Kochan, Kelli J; Riggs, Penny K; Scheffler, Jodi A; Udall, Joshua A; Ulloa, Mauricio; Wang, Shirley S; Zhu, Qian-Hao; Bag, Sumit K; Bhardwaj, Archana; Burke, John J; Byers, Robert L; Claverie, Michel; Gore, Michael A; Harker, David B; Islam, Md S; Jenkins, Johnie N; Jones, Don C; Lacape, Jean-Marc; Llewellyn, Danny J; Percy, Richard G; Pepper, Alan E; Poland, Jesse A; Mohan Rai, Krishan; Sawant, Samir V; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M; Wang, Fei; Yourstone, Scott M; Zheng, Xiuting; Lawley, Cindy T; Ganal, Martin W; Van Deynze, Allen; Wilson, Iain W; Stelly, David M

    2015-04-22

    High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community.

  11. Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp.

    PubMed Central

    Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.

    2015-01-01

    High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569

  12. The first genetic linkage map of Primulina eburnea (Gesneriaceae) based on EST-derived SNP markers.

    PubMed

    Feng, Chen; Feng, Chao; Kang, Ming

    2016-06-01

    Primulina eburnea is a promising candidate for domestication and floriculture, since it is easy to culture and has beautiful flowers. An F₂ population of 189 individuals was established for the construction of first-generation linkage maps based on expressed sequence tags-derived single-nucleotide polymorphism markers using the massARRAY genotyping platform. Of the 232 screened markers, 215 were assigned to 18 LG according to the haploid number of chromosomes in the species. The linkage map spanned a total of 3774.7 cM with an average distance of 17.6 cM between adjacent markers. This linkage map provides a framework for identification of important genes in breeding programmes.

  13. Obesity-related known and candidate SNP markers can significantly change affinity of TATA-binding protein for human gene promoters

    PubMed Central

    2015-01-01

    Background Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e., significant differences of personal genomes of patients from the reference human genome) can help physicians to improve treatment. Clinical research can validate SNP markers via genotyping of patients and demonstration that SNP alleles are significantly more frequent in patients than in healthy people. The search for biomedical SNP markers of interest can be accelerated by computer-based analysis of hundreds of millions of SNPs in the 1000 Genomes project because of selection of the most meaningful candidate SNP markers and elimination of neutral SNPs. Results We cross-validated the output of two computer-based methods: DNA sequence analysis using Web service SNP_TATA_Comparator and keyword search for articles on comorbidities of obesity. Near the sites binding to TATA-binding protein (TBP) in human gene promoters, we found 22 obesity-related candidate SNP markers, including rs10895068 (male breast cancer in obesity); rs35036378 (reduced risk of obesity after ovariectomy); rs201739205 (reduced risk of obesity-related cancers due to weight loss by diet/exercise in obese postmenopausal women); rs183433761 (obesity resistance during a high-fat diet); rs367732974 and rs549591993 (both: cardiovascular complications in obese patients with type 2 diabetes mellitus); rs200487063 and rs34104384 (both: obesity-caused hypertension); rs35518301, rs72661131, and rs562962093 (all: obesity); and rs397509430, rs33980857, rs34598529, rs33931746, rs33981098, rs34500389, rs63750953, rs281864525, rs35518301, and rs34166473 (all: chronic inflammation in comorbidities of obesity). Using an electrophoretic mobility shift assay under nonequilibrium conditions, we

  14. Obesity-related known and candidate SNP markers can significantly change affinity of TATA-binding protein for human gene promoters.

    PubMed

    Arkova, Olga V; Ponomarenko, Mikhail P; Rasskazov, Dmitry A; Drachkova, Irina A; Arshinova, Tatjana V; Ponomarenko, Petr M; Savinkova, Ludmila K; Kolchanov, Nikolay A

    2015-01-01

    Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e., significant differences of personal genomes of patients from the reference human genome) can help physicians to improve treatment. Clinical research can validate SNP markers via genotyping of patients and demonstration that SNP alleles are significantly more frequent in patients than in healthy people. The search for biomedical SNP markers of interest can be accelerated by computer-based analysis of hundreds of millions of SNPs in the 1000 Genomes project because of selection of the most meaningful candidate SNP markers and elimination of neutral SNPs. We cross-validated the output of two computer-based methods: DNA sequence analysis using Web service SNP_TATA_Comparator and keyword search for articles on comorbidities of obesity. Near the sites binding to TATA-binding protein (TBP) in human gene promoters, we found 22 obesity-related candidate SNP markers, including rs10895068 (male breast cancer in obesity); rs35036378 (reduced risk of obesity after ovariectomy); rs201739205 (reduced risk of obesity-related cancers due to weight loss by diet/exercise in obese postmenopausal women); rs183433761 (obesity resistance during a high-fat diet); rs367732974 and rs549591993 (both: cardiovascular complications in obese patients with type 2 diabetes mellitus); rs200487063 and rs34104384 (both: obesity-caused hypertension); rs35518301, rs72661131, and rs562962093 (all: obesity); and rs397509430, rs33980857, rs34598529, rs33931746, rs33981098, rs34500389, rs63750953, rs281864525, rs35518301, and rs34166473 (all: chronic inflammation in comorbidities of obesity). Using an electrophoretic mobility shift assay under nonequilibrium conditions, we empirically validated the

  15. Fine QTL mapping of mandarin (Citrus reticulata) fruit characters using high-throughput SNP markers

    USDA-ARS?s Scientific Manuscript database

    Seedlessness, flavor, and color are top priorities for mandarin (Citrus reticulata Blanco) cultivar improvement. Given long juvenility, large tree size, and high breeding cost, marker-assisted selection (MAS) may be an expeditious and economical approach to these challenges. The objectives of this s...

  16. Genome-wide association of 10 horticultural traits with expressed sequence tag-derived SNP markers in a collection of lettuce lines

    USDA-ARS?s Scientific Manuscript database

    Genetic diversity, population structure, and genome-wide marker-trait association analyses were conducted on a special collection of 298 homozygous lettuce (Lactuca sativa L.) lines. Each of these lines was derived from a single plant that had been genotyped with 384 SNP makers using LSGermOPA. They...

  17. An integrated genetic linkage map of watermelon and genetic diversity based on single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers

    USDA-ARS?s Scientific Manuscript database

    Watermelon (Citrullus lanatus var. lanatus) is an important vegetable fruit throughout the world. A high number of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers should provide large coverage of the watermelon genome and high phylogenetic resolution of germplasm acces...

  18. Detection of QTLs for salt tolerance in Asian barley (Hordeum vulgare L.) by association analysis with SNP markers

    PubMed Central

    Sbei, Hanen; Sato, Kazuhiro; Shehzad, Tariq; Harrabi, Moncef; Okuno, Kazutoshi

    2014-01-01

    Two hundred ninety-six Asian barley (Hordeum vulgare L.) accessions were assessed to detect QTLs underlying salt tolerance by association analysis using a 384 single nucleotide polymorphism (SNP) marker system. The experiment was laid out at the seedling stage in a hydroponic solution under control and 250 mM NaCl solution with three replications of four plants each. Salt tolerance was assessed by leaf injury score (LIS) and salt tolerance indices (STIs) of the number of leaves (NL), shoot length (SL), root length (RL), shoot dry weight (SDW) and root dry weight (RDW). LIS was scored from 1 to 5 according to the severity of necrosis and chlorosis observed on leaves. There was a wide variation in salt tolerance among Asian barley accessions. LIS and STI (SDW) were the most suitable traits for screening salt tolerance. Association was estimated between markers and traits to detect QTLs for LIS and STI (SDW). Seven significant QTLs were located on chromosomes 1H (2 QTLs), 2H (2 QTLs), 3H (1 QTL), 4H (1 QTL) and 5H (1 QTL). Five QTLs were associated with LIS and 2 QTLs with STI (SDW). Two QTLs associated with LIS were newly identified on chromosomes 3H and 4H. PMID:25914593

  19. Genome-Wide SNP Markers Based on SLAF-Seq Uncover Breeding Traces in Rapeseed (Brassica napus L.)

    PubMed Central

    Zhou, Qinghong; Zhou, Can; Zheng, Wei; Mason, Annaliese S.; Fan, Shuying; Wu, Caijun; Fu, Donghui; Huang, Yingjin

    2017-01-01

    Single Nucleotide Polymorphisms (SNPs) are the most abundant and richest form of genomic polymorphism, and hence make highly favorable markers for genetic map construction and genome-wide association studies. In this study, a total of 300 rapeseed accessions (278 representative of Chinese germplasm, plus 22 outgroup accessions of different origins and ecotypes) were collected and sequenced using Specific-Locus Amplified Fragment Sequencing (SLAF-seq) technology, obtaining 660.25M reads with an average sequencing depth of 6.27 × and a mean Q30 of 85.96%. Based on the 238,711 polymorphic SLAF tags a total of 1,197,282 SNPs were discovered, and a subset of 201,817 SNPs with minor allele frequency >0.05 and integrity >0.8 were selected. Of these, 30,877 were designated SNP “hotspots,” and 41 SNP-rich genomic regions could be delineated, with 100 genes associated with plant resistance, vernalization response, and signal transduction detected in these regions. Subsequent analysis of genetic diversity, linkage disequilibrium (LD), and population structure in the 300 accessions was carried out based on the 201,817 SNPs. Nine subpopulations were observed based on the population structure analysis. Hierarchical clustering and principal component analysis divided the 300 varieties roughly in accordance with their ecotype origins. However, spring-type varieties were intermingled with semi-winter type varieties, indicating frequent hybridization between spring and semi-winter ecotypes in China. In addition, LD decay across the whole genome averaged 299 kb when r2 = 0.1, but the LD decay in the A genome (43 kb) was much shorter than in the C genome (1,455 kb), supporting the targeted introgression of the A genome from progenitor species B. rapa into Chinese rapeseed. This study also lays the foundation for genetic analysis of important agronomic traits using this rapeseed population. PMID:28503182

  20. Development of genome-wide SNP assays for rice

    USDA-ARS?s Scientific Manuscript database

    With the introduction of new sequencing technologies, single nucleotide polymorphisms (SNPs) are rapidly replacing simple sequence repeats (SSRs) as the DNA marker of choice for applications in plant breeding and genetics because they are more abundant, stable, amenable to automation, efficient, and...

  1. Single strand conformation polymorphism based SNP and Indel markers for genetic mapping and synteny analysis of common bean (Phaseolus vulgaris L.)

    PubMed Central

    2009-01-01

    Background Expressed sequence tags (ESTs) are an important source of gene-based markers such as those based on insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). Several gel based methods have been reported for the detection of sequence variants, however they have not been widely exploited in common bean, an important legume crop of the developing world. The objectives of this project were to develop and map EST based markers using analysis of single strand conformation polymorphisms (SSCPs), to create a transcript map for common bean and to compare synteny of the common bean map with sequenced chromosomes of other legumes. Results A set of 418 EST based amplicons were evaluated for parental polymorphisms using the SSCP technique and 26% of these presented a clear conformational or size polymorphism between Andean and Mesoamerican genotypes. The amplicon based markers were then used for genetic mapping with segregation analysis performed in the DOR364 × G19833 recombinant inbred line (RIL) population. A total of 118 new marker loci were placed into an integrated molecular map for common bean consisting of 288 markers. Of these, 218 were used for synteny analysis and 186 presented homology with segments of the soybean genome with an e-value lower than 7 × 10-12. The synteny analysis with soybean showed a mosaic pattern of syntenic blocks with most segments of any one common bean linkage group associated with two soybean chromosomes. The analysis with Medicago truncatula and Lotus japonicus presented fewer syntenic regions consistent with the more distant phylogenetic relationship between the galegoid and phaseoloid legumes. Conclusion The SSCP technique is a useful and inexpensive alternative to other SNP or Indel detection techniques for saturating the common bean genetic map with functional markers that may be useful in marker assisted selection. In addition, the genetic markers based on ESTs allowed the construction of a transcript map and

  2. SSR DNA markers linked with Broad-Spectrum rust resistance in common bean discovered by bulk segregant analysis using a large set of SNP markers

    USDA-ARS?s Scientific Manuscript database

    DNA markers are invaluable plant breeding tools that can be used in the development of new crop cultivars with disease resistance. We wanted to develop the capacity for marker-assisted selection using the broad-spectrum rust resistance trait present in Mesoamerican common bean PI 310762. This commo...

  3. Genetic Analysis Workshop 15: simulation of a complex genetic model for rheumatoid arthritis in nuclear families including a dense SNP map with linkage disequilibrium between marker loci and trait loci

    PubMed Central

    Miller, Michael B; Lind, Gregg R; Li, Na; Jang, Soon-Young

    2007-01-01

    Data for Problem 3 of the Genetic Analysis Workshop 15 were generated by computer simulation in an attempt to mimic some of the genetic and epidemiological features of rheumatoid arthritis (RA) such as its population prevalence, sex ratio, risk to siblings of affected individuals, association with cigarette smoking, the strong effect of genotype in the HLA region and other genetic effects. A complex genetic model including epistasis and genotype-by-environment interaction was applied to a population of 1.9 million nuclear families of size four from which we selected 1500 families with both offspring affected and 2000 unrelated, unaffected individuals all of whose first-degree relatives were unaffected. This process was repeated to produce 100 replicate data sets. In addition, we generated marker data for 22 autosomes consisting of a genome-wide set of 730 simulated STRP markers, 9187 SNP markers and an additional 17,820 SNP markers on chromosome 6. Appropriate linkage disequilibrium between markers and between trait loci and markers was modelled using HapMap Phase 1 data . The code base for this project was written primarily in the Octave programming language, but it is being ported to the R language and developed into a larger project for general genetic simulation called GenetSim . All of the source code that was used to generate the GAW 15 Problem 3 data is freely available for download at . PMID:18466538

  4. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao

    PubMed Central

    Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

    2015-01-01

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. PMID:26070980

  5. Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

    PubMed

    Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

    2015-08-01

    Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  6. Novel quantitative real-time LCR for the sensitive detection of SNP frequencies in pooled DNA: method development, evaluation and application.

    PubMed

    Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios

    2011-01-19

    Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food.

  7. Novel Quantitative Real-Time LCR for the Sensitive Detection of SNP Frequencies in Pooled DNA: Method Development, Evaluation and Application

    PubMed Central

    Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios

    2011-01-01

    Background Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. Methods The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. Conclusions The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. Significance The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food. PMID:21283808

  8. Assessment of genetic variation within a global collection of lentil (Lens culinaris Medik.) cultivars and landraces using SNP markers.

    PubMed

    Lombardi, Maria; Materne, Michael; Cogan, Noel O I; Rodda, Matthew; Daetwyler, Hans D; Slater, Anthony T; Forster, John W; Kaur, Sukhjiwan

    2014-12-24

    Lentil is a self-pollinated annual diploid (2n = 2× = 14) crop with a restricted history of genetic improvement through breeding, particularly when compared to cereal crops. This limited breeding has probably contributed to the narrow genetic base of local cultivars, and a corresponding potential to continue yield increases and stability. Therefore, knowledge of genetic variation and relationships between populations is important for understanding of available genetic variability and its potential for use in breeding programs. Single nucleotide polymorphism (SNP) markers provide a method for rapid automated genotyping and subsequent data analysis over large numbers of samples, allowing assessment of genetic relationships between genotypes. In order to investigate levels of genetic diversity within lentil germplasm, 505 cultivars and landraces were genotyped with 384 genome-wide distributed SNP markers, of which 266 (69.2%) obtained successful amplification and detected polymorphisms. Gene diversity and PIC values varied between 0.108-0.5 and 0.102-0.375, with averages of 0.419 and 0.328, respectively. On the basis of clarity and interest to lentil breeders, the genetic structure of the germplasm collection was analysed separately for cultivars and landraces. A neighbour-joining (NJ) dendrogram was constructed for commercial cultivars, in which lentil cultivars were sorted into three major groups (G-I, G-II and G-III). These results were further supported by principal coordinate analysis (PCoA) and STRUCTURE, from which three clear clusters were defined based on differences in geographical location. In the case of landraces, a weak correlation between geographical origin and genetic relationships was observed. The landraces from the Mediterranean region, predominantly Greece and Turkey, revealed very high levels of genetic diversity. Lentil cultivars revealed clear clustering based on geographical origin, but much more limited correlation between geographic origin

  9. Selection of highly informative SNP markers for population affiliation of major US populations.

    PubMed

    Zeng, Xiangpei; Chakraborty, Ranajit; King, Jonathan L; LaRue, Bobby; Moura-Neto, Rodrigo S; Budowle, Bruce

    2016-03-01

    Ancestry informative markers (AIMs) can be used to detect and adjust for population stratification and predict the ancestry of the source of an evidence sample. Autosomal single nucleotide polymorphisms (SNPs) are the best candidates for AIMs. It is essential to identify the most informative AIM SNPs across relevant populations. Several informativeness measures for ancestry estimation have been used for AIMs selection: absolute allele frequency differences (δ), F statistics (F ST), and informativeness for assignment measure (In). However, their efficacy has not been compared objectively, particularly for determining affiliations of major US populations. In this study, these three measures were directly compared for AIMs selection among four major US populations, i.e., African American, Caucasian, East Asian, and Hispanic American. The results showed that the F ST panel performed slightly better for population resolution based on principal component analysis (PCA) clustering than did the δ panel and both performed better than the In panel. Therefore, the 23 AIMs selected by the F ST measure were used to characterize the four major American populations. Genotype data of nine sample populations were used to evaluate the efficiency of the 23-AIMs panel. The results indicated that individuals could be correctly assigned to the major population categories. Our AIMs panel could contribute to the candidate pool of AIMs for potential forensic identification purposes.

  10. Identification of SNP markers for inferring phylogeny in temperate bamboos (Poaceae: Bambusoideae) using RAD sequencing.

    PubMed

    Wang, X Q; Zhao, L; Eaton, D A R; Li, D Z; Guo, Z H

    2013-09-01

    Phylogenetic relationships among temperate species of bamboo are difficult to resolve, owing to both the challenge of detecting sufficiently variable markers and their polyploid history. Here, we use restriction site-associated DNA sequencing to identify candidate loci with fixed allelic differences segregating between and within two temperate species of bamboos: Arundinaria faberi and Yushania brevipaniculata. Approximately 27 million paired-end sequencing reads were generated across four samples. From pooled data, we assembled 67 685 and 70 668 de novo contigs from partial overlap among paired-end reads, with an average length of 240 and 241 bp for the two species, respectively, which were used to investigate functional classification of RAD tags in a blastx search. Analysed separately by population, we recovered 29 443 putatively orthologous RAD tags shared across the four sampled populations, containing 28 023 sequence variants, of which c. 13 000 are segregating between species, and c. 3000 segregating between populations within each species. Analyses based on these RAD tags yielded robust phylogenetic inferences, even with data set constructed from surprisingly few loci. This study illustrates the potential for reduced-representation genome data to resolve difficult phylogenetic relationships in temperate bamboos. © 2013 John Wiley & Sons Ltd.

  11. Discovery and validation of gene-linked diagnostic SNP markers for assessing hybridization between Largemouth bass (Micropterus salmoides) and Florida bass (M. floridanus).

    PubMed

    Li, Chao; Gowan, Spencer; Anil, Ammu; Beck, Benjamin H; Thongda, Wilawan; Kucuktas, Huseyin; Kaltenboeck, Ludmilla; Peatman, Eric

    2015-03-01

    Efforts to improve recreational fisheries have included widespread stocking of Micropterus floridanus outside its native range of peninsular Florida. Hybridization of Florida bass (M. floridanus) with largemouth bass (Micropterus salmoides) has now dramatically expanded beyond a naturally occurring intergrade zone in the southeast U.S. In recent years, there has been growing interest in protecting the genetic integrity of native basses and assessing the impact and nature of M. salmoides/M. floridanus introgression from the standpoint of hatchery and sport-fishery managers, fish biologists, ecologists and evolutionary biologists. Here, we conducted RNA-seq-based sequencing of the transcriptomes of M. salmoides, M. floridanus and their F1 hybrid and identified a set of 3674 SNP markers with fixed-allelic differences from 2112 unique genes. We then developed a subset of 25 of these markers into a single diagnostic multiplex assay and validated its capacity for assessing integrity and hybridization in hatchery and wild populations of largemouth and Florida bass. The availability of this resource, high-quality transcriptomes and a large set of gene-linked SNPs, should greatly facilitate functional and population genomics studies in these key species and allow the identification of traits and processes under selection during introgressive hybridization.

  12. The construction of a high-density linkage map for identifying SNP markers that are tightly linked to a nuclear-recessive major gene for male sterility in Cryptomeria japonica D. Don.

    PubMed

    Moriguchi, Yoshinari; Ujino-Ihara, Tokuko; Uchiyama, Kentaro; Futamura, Norihiro; Saito, Maki; Ueno, Saneyoshi; Matsumoto, Asako; Tani, Naoki; Taira, Hideaki; Shinohara, Kenji; Tsumura, Yoshihiko

    2012-03-16

    High-density linkage maps facilitate the mapping of target genes and the construction of partial linkage maps around target loci to develop markers for marker-assisted selection (MAS). MAS is quite challenging in conifers because of their large, complex, and poorly-characterized genomes. Our goal was to construct a high-density linkage map to facilitate the identification of markers that are tightly linked to a major recessive male-sterile gene (ms1) for MAS in C. japonica, a species that is important in Japanese afforestation but which causes serious social pollinosis problems. We constructed a high-density saturated genetic linkage map for C. japonica using expressed sequence-derived co-dominant single nucleotide polymorphism (SNP) markers, most of which were genotyped using the GoldenGate genotyping assay. A total of 1261 markers were assigned to 11 linkage groups with an observed map length of 1405.2 cM and a mean distance between two adjacent markers of 1.1 cM; the number of linkage groups matched the basic chromosome number in C. japonica. Using this map, we located ms1 on the 9th linkage group and constructed a partial linkage map around the ms1 locus. This enabled us to identify a marker (hrmSNP970_sf) that is closely linked to the ms1 gene, being separated from it by only 0.5 cM. Using the high-density map, we located the ms1 gene on the 9th linkage group and constructed a partial linkage map around the ms1 locus. The map distance between the ms1 gene and the tightly linked marker was only 0.5 cM. The identification of markers that are tightly linked to the ms1 gene will facilitate the early selection of male-sterile trees, which should expedite C. japonica breeding programs aimed at alleviating pollinosis problems without harming productivity.

  13. Quantitative trait loci controlling aluminum tolerance in soybean: candidate gene and SNP marker discovery

    USDA-ARS?s Scientific Manuscript database

    Aluminum (Al) toxicity is an important abiotic stress that affects soybean production in acidic soils. Development of Al-tolerant cultivars is an efficient and environmentally friendly solution to the problem. Effective selection of Al-tolerant genotypes in applied breeding requires an understanding...

  14. SNP marker discovery in Pima cotton (Gossypium barbadense L.) leaf transcriptomes

    USDA-ARS?s Scientific Manuscript database

    The vast information generated by the next generation sequencing (NGS) technology will continue to benefit the development of new strategies to study and characterize genetic diversity, the improvement of existing tools for molecular breeding, and the discovery of genes underlying important traits i...

  15. De Novo Transcriptome Assembly of Pummelo and Molecular Marker Development

    PubMed Central

    Liang, Mei; Yang, Xiaoming; Li, Hang; Su, Shiying; Yi, Hualin; Chai, Lijun; Deng, Xiuxin

    2015-01-01

    Pummelo (Citrus grandis) is an important fruit crop worldwide because of its nutritional value. To accelerate the pummelo breeding program, it is essential to obtain extensive genetic information and develop relative molecular markers. Here, we obtained a 12-Gb transcriptome dataset of pummelo through a mixture of RNA from seven tissues using Illumina pair-end sequencing, assembled into 57,212 unigenes with an average length of 1010 bp. The annotation and classification results showed that a total of 39,584 unigenes had similar hits to the known proteins of four public databases, and 31,501 were classified into 55 Gene Ontology (GO) functional sub-categories. The search for putative molecular markers among 57,212 unigenes identified 10,276 simple sequence repeats (SSRs) and 64,720 single nucleotide polymorphisms (SNPs). High-quality primers of 1174 SSR loci were designed, of which 88.16% were localized to nine chromosomes of sweet orange. Of 100 SSR primers that were randomly selected for testing, 87 successfully amplified clear banding patterns. Of these primers, 29 with a mean PIC (polymorphic information content) value of 0.52 were effectively applied for phylogenetic analysis. Of the 20 SNP primers, 14 primers, including 54 potential SNPs, yielded target amplifications, and 46 loci were verified via Sanger sequencing. This new dataset will be a valuable resource for molecular biology studies of pummelo and provides reliable information regarding SNP and SSR marker development, thus expediting the breeding program of pummelo. PMID:25799271

  16. Morphological features of an endangered Japanese strain of Cyprinus carpio: reconstruction based on seven SNP markers.

    PubMed

    Atsumi, K; Song, H Y; Senou, H; Inoue, K; Mabuchi, K

    2017-03-01

    Morphological analyses of 183 specimens of Japanese common carp Cyprinus carpio (171 from Lake Biwa and 12 from nursery ponds) using genetic hybrid indices demonstrated that the typical native Japanese strain of C. carpio has a more elongate body, more branched dorsal-fin rays, fewer and shorter gill rakers, more developed pneumatic bulb, more coiled pneumatic duct, longer posterior swimbladder and shorter intestine than the typical introduced C. carpio. These results provide a basis for a better understanding of the ecological characteristics and taxonomic status of the endangered Japanese strain of C. carpio.

  17. High-density genetic linkage map construction and identification of fruit-related QTLs in pear using SNP and SSR markers

    PubMed Central

    Wu, Jun; Li, Lei-Ting; Li, Meng; Khan, M. Awais; Li, Xiu-Gen; Chen, Hui; Yin, Hao; Zhang, Shao-Ling

    2014-01-01

    Pear (Pyrus spp) is an important fruit crop, grown in all temperate regions of the world, with global production ranked after grape and apples among deciduous tree crops. A high-density linkage map is a valuable tool for fine mapping quantitative trait loci (QTL) and map-based gene cloning. In this study, we firstly constructed a high-density linkage map of pear using SNPs integrated with SSRs, developed by the rapid and robust technology of restriction-associated DNA sequencing (RADseq). The linkage map consists of 3143 SNP markers and 98 SSRs, 3241 markers in total, spanning 2243.4 cM, with an average marker distance of 0.70 cM. Anchoring SSRs were able to anchor seventeen linkage groups to their corresponding chromosomes. Based on this high-density integrated pear linkage map and two years of fruit phenotyping, a total of 32 potential QTLs for 11 traits, including length of pedicel (LFP), single fruit weight (SFW), soluble solid content (SSC), transverse diameter (TD), vertical diameter (VD), calyx status (CS), flesh colour (FC), juice content (JC), number of seeds (NS), skin colour (SC), and skin smooth (SS), were identified and positioned on the genetic map. Among them, some important fruit-related traits have for the first time been identified, such as calyx status, length of pedicel, and flesh colour, and reliable localization of QTLs were verified repeatable. This high-density linkage map of pear is a worthy reference for mapping important fruit traits, QTL identification, and comparison and combination of different genetic maps. PMID:25129128

  18. SNP development from RNA-seq data in a nonmodel fish: how many individuals are needed for accurate allele frequency prediction?

    PubMed

    Schunter, C; Garza, J C; Macpherson, E; Pascual, M

    2014-01-01

    Single nucleotide polymorphisms (SNPs) are rapidly becoming the marker of choice in population genetics due to a variety of advantages relative to other markers, including higher genomic density, data quality, reproducibility and genotyping efficiency, as well as ease of portability between laboratories. Advances in sequencing technology and methodologies to reduce genomic representation have made the isolation of SNPs feasible for nonmodel organisms. RNA-seq is one such technique for the discovery of SNPs and development of markers for large-scale genotyping. Here, we report the development of 192 validated SNP markers for parentage analysis in Tripterygion delaisi (the black-faced blenny), a small rocky-shore fish from the Mediterranean Sea. RNA-seq data for 15 individual samples were used for SNP discovery by applying a series of selection criteria. Genotypes were then collected from 1599 individuals from the same population with the resulting loci. Differences in heterozygosity and allele frequencies were found between the two data sets. Heterozygosity was lower, on average, in the population sample, and the mean difference between the frequencies of particular alleles in the two data sets was 0.135 ± 0.100. We used bootstrap resampling of the sequence data to predict appropriate sample sizes for SNP discovery. As cDNA library production is time-consuming and expensive, we suggest that using seven individuals for RNA sequencing reduces the probability of discarding highly informative SNP loci, due to lack of observed polymorphism, whereas use of more than 12 samples does not considerably improve prediction of true allele frequencies.

  19. Development and Validation of a High-Density SNP Genotyping Array for African Oil Palm.

    PubMed

    Kwong, Qi Bin; Teh, Chee Keng; Ong, Ai Ling; Heng, Huey Ying; Lee, Heng Leng; Mohamed, Mohaimi; Low, Joel Zi-Bin; Apparow, Sukganah; Chew, Fook Tim; Mayes, Sean; Kulaveerasingam, Harikrishna; Tammi, Martti; Appleton, David Ross

    2016-08-01

    High-density single nucleotide polymorphism (SNP) genotyping arrays are powerful tools that can measure the level of genetic polymorphism within a population. To develop a whole-genome SNP array for oil palms, SNP discovery was performed using deep resequencing of eight libraries derived from 132 Elaeis guineensis and Elaeis oleifera palms belonging to 59 origins, resulting in the discovery of >3 million putative SNPs. After SNP filtering, the Illumina OP200K custom array was built with 170 860 successful probes. Phenetic clustering analysis revealed that the array could distinguish between palms of different origins in a way consistent with pedigree records. Genome-wide linkage disequilibrium declined more slowly for the commercial populations (ranging from 120 kb at r(2) = 0.43 to 146 kb at r(2) = 0.50) when compared with the semi-wild populations (19.5 kb at r(2) = 0.22). Genetic fixation mapping comparing the semi-wild and commercial population identified 321 selective sweeps. A genome-wide association study (GWAS) detected a significant peak on chromosome 2 associated with the polygenic component of the shell thickness trait (based on the trait shell-to-fruit; S/F %) in tenera palms. Testing of a genomic selection model on the same trait resulted in good prediction accuracy (r = 0.65) with 42% of the S/F % variation explained. The first high-density SNP genotyping array for oil palm has been developed and shown to be robust for use in genetic studies and with potential for developing early trait prediction to shorten the oil palm breeding cycle. Copyright © 2016 The Author. Published by Elsevier Inc. All rights reserved.

  20. Expression Level of the DREB2-Type Gene, Identified with Amplifluor SNP Markers, Correlates with Performance, and Tolerance to Dehydration in Bread Wheat Cultivars from Northern Kazakhstan

    PubMed Central

    Shavrukov, Yuri; Zhumalin, Aibek; Serikbay, Dauren; Botayeva, Makpal; Otemisova, Ainur; Absattarova, Aiman; Sereda, Grigoriy; Sereda, Sergey; Shvidchenko, Vladimir; Turbekova, Arysgul; Jatayev, Satyvaldy; Lopato, Sergiy; Soole, Kathleen; Langridge, Peter

    2016-01-01

    A panel of 89 local commercial cultivars of bread wheat was tested in field trials in the dry conditions of Northern Kazakhstan. Two distinct groups of cultivars (six cultivars in each group), which had the highest and the lowest grain yield under drought were selected for further experiments. A dehydration test conducted on detached leaves indicated a strong association between rates of water loss in plants from the first group with highest grain yield production in the dry environment relative to the second group. Modern high-throughput Amplifluor Single Nucleotide Polymorphism (SNP) technology was applied to study allelic variations in a series of drought-responsive genes using 19 SNP markers. Genotyping of an SNP in the TaDREB5 (DREB2-type) gene using the Amplifluor SNP marker KATU48 revealed clear allele distribution across the entire panel of wheat accessions, and distinguished between the two groups of cultivars with high and low yield under drought. Significant differences in expression levels of TaDREB5 were revealed by qRT-PCR. Most wheat plants from the first group of cultivars with high grain yield showed slight up-regulation in the TaDREB5 transcript in dehydrated leaves. In contrast, expression of TaDREB5 in plants from the second group of cultivars with low grain yield was significantly down-regulated. It was found that SNPs did not alter the amino acid sequence of TaDREB5 protein. Thus, a possible explanation is that alternative splicing and up-stream regulation of TaDREB5 may be affected by SNP, but these hypotheses require additional analysis (and will be the focus of future studies). PMID:27917186

  1. Genetic diversity, population structure and relationships in indigenous cattle populations of Ethiopia and Korean Hanwoo breeds using SNP markers

    PubMed Central

    Edea, Zewdu; Dadi, Hailu; Kim, Sang-Wook; Dessie, Tadelle; Lee, Taeheon; Kim, Heebal; Kim, Jong-Joo; Kim, Kwan-Suk

    2013-01-01

    In total, 166 individuals from five indigenous Ethiopian cattle populations – Ambo (n = 27), Borana (n = 35), Arsi (n = 30), Horro (n = 36), and Danakil (n = 38) – were genotyped for 8773 single nucleotide polymorphism (SNP) markers to assess genetic diversity, population structure, and relationships. As a representative of taurine breeds, Hanwoo cattle (n = 40) were also included in the study for reference. Among Ethiopian cattle populations, the proportion of SNPs with minor allele frequencies (MAFs) ≥0.05 ranged from 81.63% in Borana to 85.30% in Ambo, with a mean of 83.96% across all populations. The Hanwoo breed showed the highest proportion of polymorphism, with MAFs ≥0.05, accounting for 95.21% of total SNPs. The mean expected heterozygosity varied from 0.370 in Danakil to 0.410 in Hanwoo. The mean genetic differentiation (FST; 1%) in Ethiopian cattle revealed that within individual variation accounted for approximately 99% of the total genetic variation. As expected, FST and Reynold genetic distance were greatest between Hanwoo and Ethiopian cattle populations, with average values of 17.62 and 18.50, respectively. The first and second principal components explained approximately 78.33% of the total variation and supported the clustering of the populations according to their historical origins. At K = 2 and 3, a considerable source of variation among cattle is the clustering of the populations into Hanwoo (taurine) and Ethiopian cattle populations. The low estimate of genetic differentiation (FST) among Ethiopian cattle populations indicated that differentiation among these populations is low, possibly owing to a common historical origin and high gene flow. Genetic distance, phylogenic tree, principal component analysis, and population structure analyses clearly differentiated the cattle population according to their historical origins, and confirmed that Ethiopian cattle populations are genetically distinct from the Hanwoo breed. PMID:23518904

  2. Development of single-nucleotide polymorphism markers for Bromus tectorum (Poaceae) from a partially sequenced transcriptome

    Treesearch

    Keith R. Merrill; Craig E. Coleman; Susan E. Meyer; Elizabeth A. Leger; Katherine A. Collins

    2016-01-01

    Premise of the study: Bromus tectorum (Poaceae) is an annual grass species that is invasive in many areas of the world but most especially in the U.S. Intermountain West. Single-nucleotide polymorphism (SNP) markers were developed for use in investigating the geospatial and ecological diversity of B. tectorum in the Intermountain West to better understand the...

  3. Population structure of Atlantic mackerel inferred from RAD-seq-derived SNP markers: effects of sequence clustering parameters and hierarchical SNP selection.

    PubMed

    Rodríguez-Ezpeleta, Naiara; Bradbury, Ian R; Mendibil, Iñaki; Álvarez, Paula; Cotano, Unai; Irigoien, Xabier

    2016-07-01

    Restriction-site-associated DNA sequencing (RAD-seq) and related methods are revolutionizing the field of population genomics in nonmodel organisms as they allow generating an unprecedented number of single nucleotide polymorphisms (SNPs) even when no genomic information is available. Yet, RAD-seq data analyses rely on assumptions on nature and number of nucleotide variants present in a single locus, the choice of which may lead to an under- or overestimated number of SNPs and/or to incorrectly called genotypes. Using the Atlantic mackerel (Scomber scombrus L.) and a close relative, the Atlantic chub mackerel (Scomber colias), as case study, here we explore the sensitivity of population structure inferences to two crucial aspects in RAD-seq data analysis: the maximum number of mismatches allowed to merge reads into a locus and the relatedness of the individuals used for genotype calling and SNP selection. Our study resolves the population structure of the Atlantic mackerel, but, most importantly, provides insights into the effects of alternative RAD-seq data analysis strategies on population structure inferences that are directly applicable to other species.

  4. High-Density Genetic Linkage Mapping in Turbot (Scophthalmus maximus L.) Based on SNP Markers and Major Sex- and Growth-Related Regions Detection

    PubMed Central

    Wang, Weiji; Hu, Yulong; Ma, Yu; Xu, Liyong; Guan, Jiantao; Kong, Jie

    2015-01-01

    This paper describes the development of a high density consensus genetic linkage map of a turbot (Scophthalmus maximus L.) family composed of 149 mapping individuals using Single Nucleotide Polymorphisms (SNP) developed using the restriction-site associated DNA (RAD) sequencing technique with the restriction enzyme, PstI. A total of 6,647 SNPs were assigned to 22 linkage groups, which is equal to the number of chromosome pairs in turbot. For the first time, the average marker interval reached 0.3958 cM, which is equal to approximately 0.1203 Mb of the turbot genome. The observed 99.34% genome coverage indicates that the linkage map was genome-wide. A total of 220 Quantitative Traits Locus (QTLs) associated with two body length traits, two body weight traits in different growth periods and sex determination were detected with an LOD > 5.0 in 12 linkage groups (LGs), which explained the corresponding phenotypic variance (R2), ranging from 14.4–100%. Among them, 175 overlapped with linked SNPs, and the remaining 45 were located in regions between contiguous SNPs. According to the QTLs related to growth trait distribution and the changing of LGs during different growth periods, the growth traits are likely controlled by multi-SNPs distributed on several LGs; the effect of these SNPs changed during different growth periods. Most sex-related QTLs were detected at LG 21 with a linkage span of 70.882 cM. Additionally, a small number of QTLs with high feasibility and a narrow R2 distribution were also observed on LG7 and LG14, suggesting that multi LGs or chromosomes might be involved in sex determination. High homology was recorded between LG21 in Cynoglossus semilaevis and turbot. This high-saturated turbot RAD-Seq linkage map is undoubtedly a promising platform for marker assisted selection (MAS) and flatfish genomics research. PMID:25775256

  5. RAD SNP markers as a tool for conservation of dolphinfish Coryphaena hippurus in the Mediterranean Sea: Identification of subtle genetic structure and assessment of populations sex-ratios.

    PubMed

    Maroso, Francesco; Franch, Rafaella; Dalla Rovere, Giulia; Arculeo, Marco; Bargelloni, Luca

    2016-08-01

    Dolphinfish is an important fish species for both commercial and sport fishing, but so far limited information is available on genetic variability and pattern of differentiation of dolphinfish populations in the Mediterranean basin. Recently developed techniques allow genome-wide identification of genetic markers for better understanding of population structure in species with limited genome information. Using restriction-site associated DNA analysis we successfully genotyped 140 individuals of dolphinfish from eight locations in the Mediterranean Sea at 3324 SNP loci. We identified 311 sex-related loci that were used to assess sex-ratio in dolphinfish populations. In addition, we identified a weak signature of genetic differentiation of the population closer to Gibraltar Strait in comparison to other Mediterranean populations, which might be related to introgression of individuals from Atlantic. No further genetic differentiation could be detected in the other populations sampled, as expected considering the known highly mobility of the species. The results obtained improve our knowledge of the species and can help managing dolphinfish stock in the future.

  6. Accuracy of Assignment of Atlantic Salmon (Salmo salar L.) to Rivers and Regions in Scotland and Northeast England Based on Single Nucleotide Polymorphism (SNP) Markers

    PubMed Central

    Gilbey, John; Cauwelier, Eef; Coulson, Mark W.; Stradmeyer, Lee; Sampayo, James N.; Armstrong, Anja; Verspoor, Eric; Corrigan, Laura; Shelley, Jonathan; Middlemas, Stuart

    2016-01-01

    Understanding the habitat use patterns of migratory fish, such as Atlantic salmon (Salmo salar L.), and the natural and anthropogenic impacts on them, is aided by the ability to identify individuals to their stock of origin. Presented here are the results of an analysis of informative single nucleotide polymorphic (SNP) markers for detecting genetic structuring in Atlantic salmon in Scotland and NE England and their ability to allow accurate genetic stock identification. 3,787 fish from 147 sites covering 27 rivers were screened at 5,568 SNP markers. In order to identify a cost-effective subset of SNPs, they were ranked according to their ability to differentiate between fish from different rivers. A panel of 288 SNPs was used to examine both individual assignments and mixed stock fisheries and eighteen assignment units were defined. The results improved greatly on previously available methods and, for the first time, fish caught in the marine environment can be confidently assigned to geographically coherent units within Scotland and NE England, including individual rivers. As such, this SNP panel has the potential to aid understanding of the various influences acting upon Atlantic salmon on their marine migrations, be they natural environmental variations and/or anthropogenic impacts, such as mixed stock fisheries and interactions with marine power generation installations. PMID:27723810

  7. Characterizing the population structure and genetic diversity of maize breeding germplasm in Southwest China using genome-wide SNP markers.

    PubMed

    Zhang, Xiao; Zhang, Hua; Li, Lujiang; Lan, Hai; Ren, Zhiyong; Liu, Dan; Wu, Ling; Liu, Hailan; Jaqueth, Jennifer; Li, Bailin; Pan, Guangtang; Gao, Shibin

    2016-08-31

    Maize breeding germplasm used in Southwest China has high complexity because of the diverse ecological features of this area. In this study, the population structure, genetic diversity, and linkage disequilibrium decay distance of 362 important inbred lines collected from the breeding program of Southwest China were characterized using the MaizeSNP50 BeadChip with 56,110 single nucleotide polymorphisms (SNPs). With respect to population structure, two (Tropical and Temperate), three (Tropical, Stiff Stalk and non-Stiff Stalk), four [Tropical, group A germplasm derived from modern U.S. hybrids (PA), group B germplasm derived from modern U.S. hybrids (PB) and Reid] and six (Tropical, PB, Reid, Iowa Stiff Stalk Synthetic, PA and North) subgroups were identified. With increasing K value, the Temperate group showed pronounced hierarchical structure with division into further subgroups. The Genetic Diversity of each group was also estimated, and the Tropical group was more diverse than the Temperate group. Seven low-genetic-diversity and one high-genetic-diversity regions were collectively identified in the Temperate, Tropical groups, and the entire panel. SNPs with significant variation in allele frequency between the Tropical and Temperate groups were also evaluated. Among them, a region located at 130 Mb on Chromosome 2 showed the highest genetic diversity, including both number of SNPs with significant variation and the ratio of significant SNPs to total SNPs. Linkage disequilibrium decay distance in the Temperate group was greater (2.5-3 Mb) than that in the entire panel (0.5-0.75 Mb) and the Tropical group (0.25-0.5 Mb). A large region at 30-120 Mb of Chromosome 7 was concluded to be a region conserved during the breeding process by comparison between S37, which was considered a representative tropical line in Southwest China, and its 30 most similar derived lines. For the panel covered most of widely used inbred lines in Southwest China, this work

  8. Transcriptome analysis and SNP development can resolve population differentiation of Streblospio benedicti, a developmentally dimorphic marine annelid.

    PubMed

    Zakas, Christina; Schult, Nancy; McHugh, Damhnait; Jones, Kenneth L; Wares, John P

    2012-01-01

    Next-generation sequencing technology is now frequently being used to develop genomic tools for non-model organisms, which are generally important for advancing studies of evolutionary ecology. One such species, the marine annelid Streblospio benedicti, is an ideal system to study the evolutionary consequences of larval life history mode because the species displays a rare offspring dimorphism termed poecilogony, where females can produce either many small offspring or a few large ones. To further develop S. benedicti as a model system for studies of life history evolution, we apply 454 sequencing to characterize the transcriptome for embryos, larvae, and juveniles of this species, for which no genomic resources are currently available. Here we performed a de novo alignment of 336,715 reads generated by a quarter GS-FLX (Roche 454) run, which produced 7,222 contigs. We developed a novel approach for evaluating the site frequency spectrum across the transcriptome to identify potential signatures of selection. We also developed 84 novel single nucleotide polymorphism (SNP) markers for this species that are used to distinguish coastal populations of S. benedicti. We validated the SNPs by genotyping individuals of different developmental modes using the BeadXPress Golden Gate assay (Illumina). This allowed us to evaluate markers that may be associated with life-history mode.

  9. Transcriptome Analysis and SNP Development Can Resolve Population Differentiation of Streblospio benedicti, a Developmentally Dimorphic Marine Annelid

    PubMed Central

    Zakas, Christina; Schult, Nancy; McHugh, Damhnait; Jones, Kenneth L.; Wares, John P.

    2012-01-01

    Next-generation sequencing technology is now frequently being used to develop genomic tools for non-model organisms, which are generally important for advancing studies of evolutionary ecology. One such species, the marine annelid Streblospio benedicti, is an ideal system to study the evolutionary consequences of larval life history mode because the species displays a rare offspring dimorphism termed poecilogony, where females can produce either many small offspring or a few large ones. To further develop S. benedicti as a model system for studies of life history evolution, we apply 454 sequencing to characterize the transcriptome for embryos, larvae, and juveniles of this species, for which no genomic resources are currently available. Here we performed a de novo alignment of 336,715 reads generated by a quarter GS-FLX (Roche 454) run, which produced 7,222 contigs. We developed a novel approach for evaluating the site frequency spectrum across the transcriptome to identify potential signatures of selection. We also developed 84 novel single nucleotide polymorphism (SNP) markers for this species that are used to distinguish coastal populations of S. benedicti. We validated the SNPs by genotyping individuals of different developmental modes using the BeadXPress Golden Gate assay (Illumina). This allowed us to evaluate markers that may be associated with life-history mode. PMID:22359608

  10. Development of a single-nucleotide polymorphism (SNP) assay for genotyping of Pandora neoaphidis.

    PubMed

    Fournier, A; Widmer, F; Enkerli, J

    2010-01-01

    Pandora neoaphidis (Entomophthoromycotina, Entomophthorales) is one of the most important fungal pathogens of aphids with great potential as a biological control agent. Development of tools that allow high-resolution monitoring of P. neoaphidis in the environment is a prerequisite for the successful implementation of biological control strategies. In this study, a single-nucleotide polymorphism (SNP) assay was developed. The assay targets 13 SNPs identified in 6 genomic regions including the largest subunit of nuclear RNA polymerase II (RPB1) gene, the second-largest subunit of nuclear RNA polymerase II (RPB2) gene, the β-tubulin (BTUB) gene, the elongation factor 1α-like (EFL) gene, the large subunit (LSU) rRNA gene, and the small subunit (SSU) rRNA gene together with the internal transcribed spacer (ITS). The assay allowed the discrimination of 15 different SNP profiles among 19 P. neoaphidis isolates and 4 P. neoaphidis-infected cadavers. Results showed that the assay is applicable to DNA extracted from infected aphids allowing genotyping of the fungus without cultivation. The SNP assay provides an efficient tool for investigation of population structures and dynamics of P. neoaphidis, as well as its persistence and epidemiology in agro-ecosystems. Furthermore, it constitutes a powerful approach for monitoring potential biological control strains of P. neoaphidis in the environment. Copyright © 2010 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  11. Development and Validation of a 20K Single Nucleotide Polymorphism (SNP) Whole Genome Genotyping Array for Apple (Malus × domestica Borkh)

    PubMed Central

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088

  12. Development and validation of a 20K single nucleotide polymorphism (SNP) whole genome genotyping array for apple (Malus × domestica Borkh).

    PubMed

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.

  13. OPRM1 SNP (A118G): Involvement in disease development, treatment response, and animal models

    PubMed Central

    Mague, Stephen D.; Blendy, Julie A.

    2010-01-01

    Endogenous opioids acting at μ-opioid receptors mediate many biological functions. Pharmacological intervention at these receptors has greatly aided in the treatment of acute and chronic pain, in addition to other uses. However, the development of tolerance and dependence has made it difficult to adequately prescribe these therapeutics. A common single nucleotide polymorphism (SNP), A118G, in the μ-opioid receptor gene can affect opioid function and, consequently, has been suggested to contribute to individual variability in pain management and drug addiction. Investigation into the role of A118G in human disease and treatment response has generated a large number of association studies across various disease states as well as physiological responses. However, characterizing the functional consequences of this SNP and establishing if it causes or contributes to disease phenotypes have been significant challenges. In this manuscript, we will review a number of association studies as well as investigations of the functional impact of this gene variant. In addition, we will describe a novel mouse model that was generated to recapitulate this SNP in mice. Evaluation of models that incorporate known human genetic variants into a tractable system, like the mouse, will facilitate the understanding of discrete contributions of SNPs to human disease. PMID:20074870

  14. AncestrySNPminer: A bioinformatics tool to retrieve and develop ancestry informative SNP panels

    PubMed Central

    Amirisetty, Sushil; Khurana Hershey, Gurjit K.; Baye, Tesfaye M.

    2012-01-01

    A wealth of genomic information is available in public and private databases. However, this information is underutilized for uncovering population specific and functionally relevant markers underlying complex human traits. Given the huge amount of SNP data available from the annotation of human genetic variation, data mining is a faster and cost effective approach for investigating the number of SNPs that are informative for ancestry. In this study, we present AncestrySNPminer, the first web-based bioinformatics tool specifically designed to retrieve Ancestry Informative Markers (AIMs) from genomic data sets and link these informative markers to genes and ontological annotation classes. The tool includes an automated and simple “scripting at the click of a button” functionality that enables researchers to perform various population genomics statistical analyses methods with user friendly querying and filtering of data sets across various populations through a single web interface. AncestrySNPminer can be freely accessed at https://research.cchmc.org/mershalab/AncestrySNPminer/login.php. PMID:22584067

  15. Translational genomics for abiotic stress in sorghum: transcriptional profiling and validation of SNP markers between germplasm with differential cold tolerance

    USDA-ARS?s Scientific Manuscript database

    One focus of the Sorghum Translational Genomics Lab (part of sorghum CRIS, PSGD, CSRL, USDA-ARS, Lubbock TX) is to utilize nucleotide variation between sorghum germplasm such as those derived from RNA seq for translation and validation of Single Nucleotide Polymorphism (SNP) into easy access DNA m...

  16. A large maize (Zea Mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome

    USDA-ARS?s Scientific Manuscript database

    SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection for accelerated breeding. We report the establishment of a large SNP array for maize and i...

  17. The Impact of Genotyping-by-Sequencing Pipelines on SNP Discovery and Identification of Markers Associated with Verticillium Wilt Resistance in Autotetraploid Alfalfa (Medicago sativa L.)

    PubMed Central

    Yu, Long-Xi; Zheng, Ping; Bhamidimarri, Suresh; Liu, Xiang-Ping; Main, Dorie

    2017-01-01

    Verticillium wilt (VW) of alfalfa is a soilborne disease causing severe yield loss in alfalfa. To identify molecular markers associated with VW resistance, we used an integrated framework of genome-wide association study (GWAS) with high-throughput genotyping by sequencing (GBS) to identify loci associated with VW resistance in an F1 full-sib alfalfa population. Phenotyping was performed using manual inoculation of the pathogen to cloned plants of each individual and disease severity was scored using a standard scale. Genotyping was done by GBS, followed by genotype calling using three bioinformatics pipelines including the TASSEL-GBS pipeline (TASSEL), the Universal Network Enabled Analysis Kit (UNEAK), and the haplotype-based FreeBayes pipeline (FreeBayes). The resulting numbers of SNPs, marker density, minor allele frequency (MAF) and heterozygosity were compared among the pipelines. The TASSEL pipeline generated more markers with the highest density and MAF, whereas the highest heterozygosity was obtained by the UNEAK pipeline. The FreeBayes pipeline generated tetraploid genotypes, with the least number of markers. SNP markers generated from each pipeline were used independently for marker-trait association. Markers significantly associated with VW resistance identified by each pipeline were compared. Similar marker loci were found on chromosomes 5, 6, and 7, whereas different loci on chromosome 1, 2, 3, and 4 were identified by different pipelines. Most significant markers were located on chromosome 6 and they were identified by all three pipelines. Of those identified, several loci were linked to known genes whose functions are involved in the plants’ resistance to pathogens. Further investigation on these loci and their linked genes would provide insight into understanding molecular mechanisms of VW resistance in alfalfa. Functional markers closely linked to the resistance loci would be useful for MAS to improve alfalfa cultivars with enhanced resistance

  18. The Impact of Genotyping-by-Sequencing Pipelines on SNP Discovery and Identification of Markers Associated with Verticillium Wilt Resistance in Autotetraploid Alfalfa (Medicago sativa L.).

    PubMed

    Yu, Long-Xi; Zheng, Ping; Bhamidimarri, Suresh; Liu, Xiang-Ping; Main, Dorie

    2017-01-01

    Verticillium wilt (VW) of alfalfa is a soilborne disease causing severe yield loss in alfalfa. To identify molecular markers associated with VW resistance, we used an integrated framework of genome-wide association study (GWAS) with high-throughput genotyping by sequencing (GBS) to identify loci associated with VW resistance in an F1 full-sib alfalfa population. Phenotyping was performed using manual inoculation of the pathogen to cloned plants of each individual and disease severity was scored using a standard scale. Genotyping was done by GBS, followed by genotype calling using three bioinformatics pipelines including the TASSEL-GBS pipeline (TASSEL), the Universal Network Enabled Analysis Kit (UNEAK), and the haplotype-based FreeBayes pipeline (FreeBayes). The resulting numbers of SNPs, marker density, minor allele frequency (MAF) and heterozygosity were compared among the pipelines. The TASSEL pipeline generated more markers with the highest density and MAF, whereas the highest heterozygosity was obtained by the UNEAK pipeline. The FreeBayes pipeline generated tetraploid genotypes, with the least number of markers. SNP markers generated from each pipeline were used independently for marker-trait association. Markers significantly associated with VW resistance identified by each pipeline were compared. Similar marker loci were found on chromosomes 5, 6, and 7, whereas different loci on chromosome 1, 2, 3, and 4 were identified by different pipelines. Most significant markers were located on chromosome 6 and they were identified by all three pipelines. Of those identified, several loci were linked to known genes whose functions are involved in the plants' resistance to pathogens. Further investigation on these loci and their linked genes would provide insight into understanding molecular mechanisms of VW resistance in alfalfa. Functional markers closely linked to the resistance loci would be useful for MAS to improve alfalfa cultivars with enhanced resistance to

  19. A Brassica rapa Linkage Map of EST-based SNP Markers for Identification of Candidate Genes Controlling Flowering Time and Leaf Morphological Traits

    PubMed Central

    Li, Feng; Kitashiba, Hiroyasu; Inaba, Kiyofumi; Nishio, Takeshi

    2009-01-01

    For identification of genes responsible for varietal differences in flowering time and leaf morphological traits, we constructed a linkage map of Brassica rapa DNA markers including 170 EST-based markers, 12 SSR markers, and 59 BAC sequence-based markers, of which 151 are single nucleotide polymorphism (SNP) markers. By BLASTN, 223 markers were shown to have homologous regions in Arabidopsis thaliana, and these homologous loci covered nearly the whole genome of A. thaliana. Synteny analysis between B. rapa and A. thaliana revealed 33 large syntenic regions. Three quantitative trait loci (QTLs) for flowering time were detected. BrFLC1 and BrFLC2 were linked to the QTLs for bolting time, budding time, and flowering time. Three SNPs in the promoter, which may be the cause of low expression of BrFLC2 in the early-flowering parental line, were identified. For leaf lobe depth and leaf hairiness, one major QTL corresponding to a syntenic region containing GIBBERELLIN 20 OXIDASE 3 and one major QTL containing BrGL1, respectively, were detected. Analysis of nucleotide sequences and expression of these genes suggested possible involvement of these genes in leaf morphological traits. PMID:19884167

  20. Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar)

    PubMed Central

    2014-01-01

    Background Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. Results SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. Conclusions This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in

  1. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.)

    PubMed Central

    Koning-Boucoiran, Carole F. S.; Esselink, G. Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P. C.; Gitonga, Virginia W.; Krens, Frans A.; Voorrips, Roeland E.; van de Weg, W. Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J. M.

    2015-01-01

    In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular. PMID:25954285

  2. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

    PubMed

    Koning-Boucoiran, Carole F S; Esselink, G Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P C; Gitonga, Virginia W; Krens, Frans A; Voorrips, Roeland E; van de Weg, W Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J M

    2015-01-01

    In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.

  3. Candidate SNP Markers of Familial and Sporadic Alzheimer's Diseases Are Predicted by a Significant Change in the Affinity of TATA-Binding Protein for Human Gene Promoters.

    PubMed

    Ponomarenko, Petr; Chadaeva, Irina; Rasskazov, Dmitry A; Sharypova, Ekaterina; Kashina, Elena V; Drachkova, Irina; Zhechev, Dmitry; Ponomarenko, Mikhail P; Savinkova, Ludmila K; Kolchanov, Nikolay

    2017-01-01

    While year after year, conditions, quality, and duration of human lives have been improving due to the progress in science, technology, education, and medicine, only eight diseases have been increasing in prevalence and shortening human lives because of premature deaths according to the retrospective official review on the state of US health, 1990-2010. These diseases are kidney cancer, chronic kidney diseases, liver cancer, diabetes, drug addiction, poisoning cases, consequences of falls, and Alzheimer's disease (AD) as one of the leading pathologies. There are familial AD of hereditary nature (~4% of cases) and sporadic AD of unclear etiology (remaining ~96% of cases; i.e., non-familial AD). Therefore, sporadic AD is no longer a purely medical problem, but rather a social challenge when someone asks oneself: "What can I do in my own adulthood to reduce the risk of sporadic AD at my old age to save the years of my lifespan from the destruction caused by it?" Here, we combine two computational approaches for regulatory SNPs: Web service SNP_TATA_Comparator for sequence analysis and a PubMed-based keyword search for articles on the biochemical markers of diseases. Our purpose was to try to find answers to the question: "What can be done in adulthood to reduce the risk of sporadic AD in old age to prevent the lifespan reduction caused by it?" As a result, we found 89 candidate SNP markers of familial and sporadic AD (e.g., rs562962093 is associated with sporadic AD in the elderly as a complication of stroke in adulthood, where natural marine diets can reduce risks of both diseases in case of the minor allele of this SNP). In addition, rs768454929, and rs761695685 correlate with sporadic AD as a comorbidity of short stature, where maximizing stature in childhood and adolescence as an integral indicator of health can minimize (or even eliminate) the risk of sporadic AD in the elderly. After validation by clinical protocols, these candidate SNP markers may become

  4. Candidate SNP Markers of Familial and Sporadic Alzheimer's Diseases Are Predicted by a Significant Change in the Affinity of TATA-Binding Protein for Human Gene Promoters

    PubMed Central

    Ponomarenko, Petr; Chadaeva, Irina; Rasskazov, Dmitry A.; Sharypova, Ekaterina; Kashina, Elena V.; Drachkova, Irina; Zhechev, Dmitry; Ponomarenko, Mikhail P.; Savinkova, Ludmila K.; Kolchanov, Nikolay

    2017-01-01

    While year after year, conditions, quality, and duration of human lives have been improving due to the progress in science, technology, education, and medicine, only eight diseases have been increasing in prevalence and shortening human lives because of premature deaths according to the retrospective official review on the state of US health, 1990-2010. These diseases are kidney cancer, chronic kidney diseases, liver cancer, diabetes, drug addiction, poisoning cases, consequences of falls, and Alzheimer's disease (AD) as one of the leading pathologies. There are familial AD of hereditary nature (~4% of cases) and sporadic AD of unclear etiology (remaining ~96% of cases; i.e., non-familial AD). Therefore, sporadic AD is no longer a purely medical problem, but rather a social challenge when someone asks oneself: “What can I do in my own adulthood to reduce the risk of sporadic AD at my old age to save the years of my lifespan from the destruction caused by it?” Here, we combine two computational approaches for regulatory SNPs: Web service SNP_TATA_Comparator for sequence analysis and a PubMed-based keyword search for articles on the biochemical markers of diseases. Our purpose was to try to find answers to the question: “What can be done in adulthood to reduce the risk of sporadic AD in old age to prevent the lifespan reduction caused by it?” As a result, we found 89 candidate SNP markers of familial and sporadic AD (e.g., rs562962093 is associated with sporadic AD in the elderly as a complication of stroke in adulthood, where natural marine diets can reduce risks of both diseases in case of the minor allele of this SNP). In addition, rs768454929, and rs761695685 correlate with sporadic AD as a comorbidity of short stature, where maximizing stature in childhood and adolescence as an integral indicator of health can minimize (or even eliminate) the risk of sporadic AD in the elderly. After validation by clinical protocols, these candidate SNP markers may

  5. The construction of a high-density linkage map for identifying SNP markers that are tightly linked to a nuclear-recessive major gene for male sterility in Cryptomeria japonica D. Don

    PubMed Central

    2012-01-01

    Background High-density linkage maps facilitate the mapping of target genes and the construction of partial linkage maps around target loci to develop markers for marker-assisted selection (MAS). MAS is quite challenging in conifers because of their large, complex, and poorly-characterized genomes. Our goal was to construct a high-density linkage map to facilitate the identification of markers that are tightly linked to a major recessive male-sterile gene (ms1) for MAS in C. japonica, a species that is important in Japanese afforestation but which causes serious social pollinosis problems. Results We constructed a high-density saturated genetic linkage map for C. japonica using expressed sequence-derived co-dominant single nucleotide polymorphism (SNP) markers, most of which were genotyped using the GoldenGate genotyping assay. A total of 1261 markers were assigned to 11 linkage groups with an observed map length of 1405.2 cM and a mean distance between two adjacent markers of 1.1 cM; the number of linkage groups matched the basic chromosome number in C. japonica. Using this map, we located ms1 on the 9th linkage group and constructed a partial linkage map around the ms1 locus. This enabled us to identify a marker (hrmSNP970_sf) that is closely linked to the ms1 gene, being separated from it by only 0.5 cM. Conclusions Using the high-density map, we located the ms1 gene on the 9th linkage group and constructed a partial linkage map around the ms1 locus. The map distance between the ms1 gene and the tightly linked marker was only 0.5 cM. The identification of markers that are tightly linked to the ms1 gene will facilitate the early selection of male-sterile trees, which should expedite C. japonica breeding programs aimed at alleviating pollinosis problems without harming productivity. PMID:22424262

  6. A method for selection of restriction enzymes for sdCAPS marker construction

    USDA-ARS?s Scientific Manuscript database

    Development of PCR-based markers for SNP detection is prerequisite for various genetic analyses. The use of restriction enzymes following PCR amplification is a common and relatively low cost method for SNP detection. Simple and cost-effective methodologies for SNP marker development that would en...

  7. Leaf Transcriptome Sequencing for Identifying Genic-SSR Markers and SNP Heterozygosity in Crossbred Mango Variety ‘Amrapali’ (Mangifera indica L.)

    PubMed Central

    Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar

    2016-01-01

    Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango. PMID:27736892

  8. Development and Evaluation of a Genome-Wide 6K SNP Array for Diploid Sweet Cherry and Tetraploid Sour Cherry

    PubMed Central

    Peace, Cameron; Bassil, Nahla; Main, Dorrie; Ficklin, Stephen; Rosyara, Umesh R.; Stegmeir, Travis; Sebolt, Audrey; Gilmore, Barbara; Lawley, Cindy; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Iezzoni, Amy

    2012-01-01

    High-throughput genome scans are important tools for genetic studies and breeding applications. Here, a 6K SNP array for use with the Illumina Infinium® system was developed for diploid sweet cherry (Prunus avium) and allotetraploid sour cherry (P. cerasus). This effort was led by RosBREED, a community initiative to enable marker-assisted breeding for rosaceous crops. Next-generation sequencing in diverse breeding germplasm provided 25 billion basepairs (Gb) of cherry DNA sequence from which were identified genome-wide SNPs for sweet cherry and for the two sour cherry subgenomes derived from sweet cherry (avium subgenome) and P. fruticosa (fruticosa subgenome). Anchoring to the peach genome sequence, recently released by the International Peach Genome Initiative, predicted relative physical locations of the 1.9 million putative SNPs detected, preliminarily filtered to 368,943 SNPs. Further filtering was guided by results of a 144-SNP subset examined with the Illumina GoldenGate® assay on 160 accessions. A 6K Infinium® II array was designed with SNPs evenly spaced genetically across the sweet and sour cherry genomes. SNPs were developed for each sour cherry subgenome by using minor allele frequency in the sour cherry detection panel to enrich for subgenome-specific SNPs followed by targeting to either subgenome according to alleles observed in sweet cherry. The array was evaluated using panels of sweet (n = 269) and sour (n = 330) cherry breeding germplasm. Approximately one third of array SNPs were informative for each crop. A total of 1825 polymorphic SNPs were verified in sweet cherry, 13% of these originally developed for sour cherry. Allele dosage was resolved for 2058 polymorphic SNPs in sour cherry, one third of these being originally developed for sweet cherry. This publicly available genomics resource represents a significant advance in cherry genome-scanning capability that will accelerate marker-locus-trait association discovery, genome

  9. Genomewide association study for seeding emergence and tiller number using SNP markers in an elite winter wheat population.

    PubMed

    Chen, Guang Feng; Wu, Ru Gang; Li, Dong Mei; Yu, Hai Xia; Deng, Zhiying; Tian, Ji Chun

    2017-03-01

    Seeding emergence and tiller number are the most important traits for wheat (Triticum aestivum L.) yield, but the inheritance of seeding emergence and tillering is poorly understood. We conducted a genomewide association study focussing on seeding emergence and tiller number at different growth stages with a panel of 205 elite winter wheat accessions. The population was genotyped with a high-density Illumina iSelect 90K SNPs assay. A total of 31 loci were found to be associated with seeding emergence rate (SER) and tiller number in different growth stages. Loci distributed among 12 chromosomes accounted for 5.35 to 11.33% of the observed phenotypic variation. With this information, 10 stable SNPs were identified for eventual development of cleaved amplified polymorphic sequence markers for SER and tiller number in different growth stages. Additionally, a set of elite alleles were identified, such as Ra_c14761_1348-T, which may increase SER by 13.35%, and Excalibur_c11045_236-A and BobWhite_c8436_391-T, which may increase the rate of available tillering by 14.78 and 8.47%, respectively. These results should provide valuable information for marker-assisted selection and parental selection in wheat breeding programmes.

  10. Mapping a Large Number of QTL for Durable Resistance to Stripe Rust in Winter Wheat Druchamp Using SSR and SNP Markers

    PubMed Central

    Hou, Lu; Chen, Xianming; Wang, Meinan; See, Deven R.; Chao, Shiaoman; Bulli, Peter; Jing, Jinxue

    2015-01-01

    Winter wheat Druchamp has both high-temperature adult-plant (HTAP) resistance and all-stage resistance to stripe rust caused by Puccinia striiformis f. sp. tritici (Pst). The HTAP resistance in Druchamp is durable as the variety has been resistant in adult-plant stage since it was introduced from France to the United States in late 1940s. To map the quantitative trait loci (QTL) for stripe rust resistance, an F8 recombinant inbred line (RIL) population from cross Druchamp × Michigan Amber was phenotyped for stripe rust response in multiple years in fields under natural infection and with selected Pst races under controlled greenhouse conditions, and genotyped with simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers. Composite interval mapping (CIM) identified eight HTAP resistance QTL and three all-stage resistance QTL. Among the eight HTAP resistance QTL, QYrdr.wgp-1BL.2 (explaining 2.36-31.04% variation), QYrdr.wgp-2BL (2.81–15.65%), QYrdr.wgp-5AL (2.27–17.22%) and QYrdr.wgp-5BL.2 (2.42–15.13%) were significant in all tests; and QYrdr.wgp-1BL.1 (1.94–10.19%), QYrdr.wgp-1DS (2.04–27.24%), QYrdr.wgp-3AL (1.78–13.85%) and QYrdr.wgp-6BL.2 (1.69–33.71%) were significant in some of the tests. The three all-stage resistance QTL, QYrdr.wgp-5BL.1 (5.47–36.04%), QYrdr.wgp-5DL (9.27–11.94%) and QYrdr.wgp-6BL.1 (13.07-20.36%), were detected based on reactions in the seedlings tested with certain Pst races. Among the eleven QTL detected in Druchamp, at least three (QYrdr.wgp-5DL for race-specific all-stage resistance and QYrdr.wgp-3AL and QYrdr.wgp-6BL.2 for race non-specific HTAP resistance) are new. All these QTL, especially those for durable HTAP resistance, and their closely linked molecular markers could be useful for developing wheat cultivars with durable resistance to stripe rust. PMID:25970329

  11. Development of a SNP-based assay for measuring genetic diversity in the Tasmanian devil insurance population.

    PubMed

    Wright, Belinda; Morris, Katrina; Grueber, Catherine E; Willet, Cali E; Gooley, Rebecca; Hogg, Carolyn J; O'Meally, Denis; Hamede, Rodrigo; Jones, Menna; Wade, Claire; Belov, Katherine

    2015-10-14

    The Tasmanian devil (Sarcophilus harrisii) has undergone a recent, drastic population decline due to the highly contagious devil facial tumor disease. The tumor is one of only two naturally occurring transmissible cancers and is almost inevitably fatal. In 2006 a disease-free insurance population was established to ensure that the Tasmanian devil is protected from extinction. The insurance program is dependent upon preserving as much wild genetic diversity as possible to maximize the success of subsequent reintroductions to the wild. Accurate genotypic data is vital to the success of the program to ensure that loss of genetic diversity does not occur in captivity. Until recently, microsatellite markers have been used to study devil population genetics, however as genetic diversity is low in the devil and potentially decreasing in the captive population, a more sensitive genotyping assay is required. Utilising the devil reference genome and whole genome re-sequencing data, we have identified polymorphic regions for use in a custom genotyping assay. These regions were amplified using PCR and sequenced on the Illumina MiSeq platform to refine a set a markers to genotype the Tasmanian devil insurance population. We have developed a set of single nucleotide polymorphic (SNP) markers, assayed by amplicon sequencing, that provide a high-throughput method for monitoring genetic diversity and assessing familial relationships among devils. To date we have used a total of 267 unique SNPs within both putatively neutral and functional loci to genotype 305 individuals in the Tasmanian devil insurance population. We have used these data to assess genetic diversity in the population as well as resolve the parentage of 21 offspring. Our molecular data has been incorporated with studbook management practices to provide more accurate pedigree information and to inform breeding recommendations. The assay will continue to be used to monitor the genetic diversity of the insurance

  12. Development of a 690 K SNP array in catfish and its application for genetic mapping and validation of the reference genome sequence

    PubMed Central

    Zeng, Qifan; Fu, Qiang; Li, Yun; Waldbieser, Geoff; Bosworth, Brian; Liu, Shikai; Yang, Yujia; Bao, Lisui; Yuan, Zihao; Li, Ning; Liu, Zhanjiang

    2017-01-01

    Single nucleotide polymorphisms (SNPs) are capable of providing the highest level of genome coverage for genomic and genetic analysis because of their abundance and relatively even distribution in the genome. Such a capacity, however, cannot be achieved without an efficient genotyping platform such as SNP arrays. In this work, we developed a high-density SNP array with 690,662 unique SNPs (herein 690 K array) that were relatively evenly distributed across the entire genome, and covered 98.6% of the reference genome sequence. Here we also report linkage mapping using the 690 K array, which allowed mapping of over 250,000 SNPs on the linkage map, the highest marker density among all the constructed linkage maps. These markers were mapped to 29 linkage groups (LGs) with 30,591 unique marker positions. This linkage map anchored 1,602 scaffolds of the reference genome sequence to LGs, accounting for over 97% of the total genome assembly. A total of 1,007 previously unmapped scaffolds were placed to LGs, allowing validation and in few instances correction of the reference genome sequence assembly. This linkage map should serve as a valuable resource for various genetic and genomic analyses, especially for GWAS and QTL mapping for genes associated with economically important traits. PMID:28079141

  13. Development of a 690 K SNP array in catfish and its application for genetic mapping and validation of the reference genome sequence.

    PubMed

    Zeng, Qifan; Fu, Qiang; Li, Yun; Waldbieser, Geoff; Bosworth, Brian; Liu, Shikai; Yang, Yujia; Bao, Lisui; Yuan, Zihao; Li, Ning; Liu, Zhanjiang

    2017-01-12

    Single nucleotide polymorphisms (SNPs) are capable of providing the highest level of genome coverage for genomic and genetic analysis because of their abundance and relatively even distribution in the genome. Such a capacity, however, cannot be achieved without an efficient genotyping platform such as SNP arrays. In this work, we developed a high-density SNP array with 690,662 unique SNPs (herein 690 K array) that were relatively evenly distributed across the entire genome, and covered 98.6% of the reference genome sequence. Here we also report linkage mapping using the 690 K array, which allowed mapping of over 250,000 SNPs on the linkage map, the highest marker density among all the constructed linkage maps. These markers were mapped to 29 linkage groups (LGs) with 30,591 unique marker positions. This linkage map anchored 1,602 scaffolds of the reference genome sequence to LGs, accounting for over 97% of the total genome assembly. A total of 1,007 previously unmapped scaffolds were placed to LGs, allowing validation and in few instances correction of the reference genome sequence assembly. This linkage map should serve as a valuable resource for various genetic and genomic analyses, especially for GWAS and QTL mapping for genes associated with economically important traits.

  14. Development of a SNP array and its application to genetic mapping and diversity assessment in pepper (Capsicum spp.)

    PubMed Central

    Cheng, Jiaowen; Qin, Cheng; Tang, Xin; Zhou, Huangkai; Hu, Yafei; Zhao, Zicheng; Cui, Junjie; Li, Bo; Wu, Zhiming; Yu, Jiping; Hu, Kailin

    2016-01-01

    The development and application of single nucleotide polymorphisms (SNPs) is in its infancy for pepper. Here, a set of 15,000 SNPs were chosen from the resequencing data to develop an array for pepper with 12,720 loci being ultimately synthesized. Of these, 8,199 (~64.46%) SNPs were found to be scorable and covered ~81.18% of the whole genome. With this array, a high-density interspecific genetic map with 5,569 SNPs was constructed using 297 F2 individuals, and genetic diversity of a panel of 399 pepper elite/landrace lines was successfully characterized. Based on the genetic map, one major QTL, named Up12.1, was detected for the fruit orientation trait. A total of 65 protein-coding genes were predicted within this QTL region based on the current annotation of the Zunla-1 genome. In summary, the thousands of well-validated SNP markers, high-density genetic map and genetic diversity information will be useful for molecular genetics and innovative breeding in pepper. Furthermore, the mapping results lay foundation for isolating the genes underlying variation in fruit orientation of Capsicum. PMID:27623541

  15. Using SNP markers to dissect linkage disequilibrium at a major quantitative trait locus for resistance to the potato cyst nematode Globodera pallida on potato chromosome V.

    PubMed

    Achenbach, Ute; Paulo, Joao; Ilarionova, Evgenyia; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Gebhardt, Christiane

    2009-02-01

    The damage caused by the parasitic root cyst nematode Globodera pallida is a major yield-limiting factor in potato cultivation . Breeding for resistance is facilitated by the PCR-based marker 'HC', which is diagnostic for an allele conferring high resistance against G. pallida pathotype Pa2/3 that has been introgressed from the wild potato species Solanum vernei into the Solanum tuberosum tetraploid breeding pool. The major quantitative trait locus (QTL) controlling this nematode resistance maps on potato chromosome V in a hot spot for resistance to various pathogens including nematodes and the oomycete Phytophthora infestans. An unstructured sample of 79 tetraploid, highly heterozygous varieties and breeding clones was selected based on presence (41 genotypes) or absence (38 genotypes) of the HC marker. Testing the clones for resistance to G. pallida confirmed the diagnostic power of the HC marker. The 79 individuals were genotyped for 100 single nucleotide polymorphisms (SNPs) at 10 loci distributed over 38 cM on chromosome V. Forty-five SNPs at six loci spanning 2 cM in the interval between markers GP21-GP179 were associated with resistance to G. pallida. Based on linkage disequilibrium (LD) between SNP markers, six LD groups comprising between 2 and 18 SNPs were identified. The LD groups indicated the existence of multiple alleles at a single resistance locus or at several, physically linked resistance loci. LD group C comprising 18 SNPs corresponded to the 'HC' marker. LD group E included 16 SNPs and showed an association peak, which positioned one nematode resistance locus physically close to the R1 gene family.

  16. Development of a RAD-Seq Based DNA Polymorphism Identification Software, AgroMarker Finder, and Its Application in Rice Marker-Assisted Breeding.

    PubMed

    Fan, Wei; Zong, Jie; Luo, Zhijing; Chen, Mingjiao; Zhao, Xiangxiang; Zhang, Dabing; Qi, Yiping; Yuan, Zheng

    2016-01-01

    Rapid and accurate genome-wide marker detection is essential to the marker-assisted breeding and functional genomics studies. In this work, we developed an integrated software, AgroMarker Finder (AMF: http://erp.novelbio.com/AMF), for providing graphical user interface (GUI) to facilitate the recently developed restriction-site associated DNA (RAD) sequencing data analysis in rice. By application of AMF, a total of 90,743 high-quality markers (82,878 SNPs and 7,865 InDels) were detected between rice varieties JP69 and Jiaoyuan5A. The density of the identified markers is 0.2 per Kb for SNP markers, and 0.02 per Kb for InDel markers. Sequencing validation revealed that the accuracy of genome-wide marker detection by AMF is 93%. In addition, a validated subset of 82 SNPs and 31 InDels were found to be closely linked to 117 important agronomic trait genes, providing a basis for subsequent marker-assisted selection (MAS) and variety identification. Furthermore, we selected 12 markers from 31 validated InDel markers to identify seed authenticity of variety Jiaoyuanyou69, and we also identified 10 markers closely linked to the fragrant gene BADH2 to minimize linkage drag for Wuxiang075 (BADH2 donor)/Jiachang1 recombinants selection. Therefore, this software provides an efficient approach for marker identification from RAD-seq data, and it would be a valuable tool for plant MAS and variety protection.

  17. Development of a RAD-Seq Based DNA Polymorphism Identification Software, AgroMarker Finder, and Its Application in Rice Marker-Assisted Breeding

    PubMed Central

    Luo, Zhijing; Chen, Mingjiao; Zhao, Xiangxiang; Zhang, Dabing; Qi, Yiping; Yuan, Zheng

    2016-01-01

    Rapid and accurate genome-wide marker detection is essential to the marker-assisted breeding and functional genomics studies. In this work, we developed an integrated software, AgroMarker Finder (AMF: http://erp.novelbio.com/AMF), for providing graphical user interface (GUI) to facilitate the recently developed restriction-site associated DNA (RAD) sequencing data analysis in rice. By application of AMF, a total of 90,743 high-quality markers (82,878 SNPs and 7,865 InDels) were detected between rice varieties JP69 and Jiaoyuan5A. The density of the identified markers is 0.2 per Kb for SNP markers, and 0.02 per Kb for InDel markers. Sequencing validation revealed that the accuracy of genome-wide marker detection by AMF is 93%. In addition, a validated subset of 82 SNPs and 31 InDels were found to be closely linked to 117 important agronomic trait genes, providing a basis for subsequent marker-assisted selection (MAS) and variety identification. Furthermore, we selected 12 markers from 31 validated InDel markers to identify seed authenticity of variety Jiaoyuanyou69, and we also identified 10 markers closely linked to the fragrant gene BADH2 to minimize linkage drag for Wuxiang075 (BADH2 donor)/Jiachang1 recombinants selection. Therefore, this software provides an efficient approach for marker identification from RAD-seq data, and it would be a valuable tool for plant MAS and variety protection. PMID:26799713

  18. Evaluation of inbreeding depression in Holstein cattle using whole-genome SNP markers and alternative measures of genomic inbreeding.

    PubMed

    Bjelland, D W; Weigel, K A; Vukasinovic, N; Nkrumah, J D

    2013-07-01

    The effects of increased pedigree inbreeding in dairy cattle populations have been well documented and result in a negative impact on profitability. Recent advances in genotyping technology have allowed researchers to move beyond pedigree analysis and study inbreeding at a molecular level. In this study, 5,853 animals were genotyped for 54,001 single nucleotide polymorphisms (SNP); 2,913 cows had phenotypic records including a single lactation for milk yield (from either lactation 1, 2, 3, or 4), reproductive performance, and linear type conformation. After removing SNP with poor call rates, low minor allele frequencies, and departure from Hardy-Weinberg equilibrium, 33,025 SNP remained for analyses. Three measures of genomic inbreeding were evaluated: percent homozygosity (FPH), inbreeding calculated from runs of homozygosity (FROH), and inbreeding derived from a genomic relationship matrix (FGRM). Average FPH was 60.5±1.1%, average FROH was 3.8±2.1%, and average FGRM was 20.8±2.3%, where animals with larger values for each of the genomic inbreeding indices were considered more inbred. Decreases in total milk yield to 205d postpartum of 53, 20, and 47kg per 1% increase in FPH, FROH, and FGRM, respectively, were observed. Increases in days open per 1% increase in FPH (1.76 d), FROH (1.72 d), and FGRM (1.06 d) were also noted, as well as increases in maternal calving difficulty (0.09, 0.03, and 0.04 on a 5-point scale for FPH, FROH, and FGRM, respectively). Several linear type traits, such as strength (-0.40, -0.11, and -0.19), rear legs rear view (-0.35, -0.16, and -0.14), front teat placement (0.35, 0.25, 0.18), and teat length (-0.24, -0.14, and -0.13) were also affected by increases in FPH, FROH, and FGRM, respectively. Overall, increases in each measure of genomic inbreeding in this study were associated with negative effects on production and reproductive ability in dairy cows.

  19. The importance of integrating SNP and cheminformatics resources to pharmacogenomics.

    PubMed

    Chang, Hsueh-Wei; Chuang, Li-Yeh; Tsai, Ming-Tz; Yang, Cheng-Hong

    2012-09-01

    Single nucleotide polymorphisms (SNPs) are the most frequent variants in many genes and are promising markers in relation to drug responses in pharmacogenomics studies. In this review, we emphasized the importance of the cheminformatic-related and SNP-related resources and tools and how they can improve pharmacogenomics studies. Currently, many cheminformatic resources are well developed and provide much information on drug metabolism and targeting. In parallel, there are also many well established SNP-related resources that are able to provide the information related to SNP genotyping, tag SNPs and functional classification. However, cheminformatic and SNP resources have not, as yet, been well-integrated to provide a user-friendly platform for pharmacogenomics studies. This paper presents a brief overview of the many available public resources for cheminformatics (DrugBank, PharmGKB and other drugrelated databases) and SNPs (dbSNP, HapMap, SNP500Cancer, SNP-RFLPing 2 and other SNP tools) and points out the importance of integrating cheminformatic and SNP resources for the future of pharmacogenomics.

  20. Development of single-nucleotide polymorphism markers for Bromus tectorum (Poaceae) from a partially sequenced transcriptome1

    PubMed Central

    Merrill, Keith R.; Coleman, Craig E.; Meyer, Susan E.; Leger, Elizabeth A.; Collins, Katherine A.

    2016-01-01

    Premise of the study: Bromus tectorum (Poaceae) is an annual grass species that is invasive in many areas of the world but most especially in the U.S. Intermountain West. Single-nucleotide polymorphism (SNP) markers were developed for use in investigating the geospatial and ecological diversity of B. tectorum in the Intermountain West to better understand the mechanisms behind its successful invasion. Methods and Results: Normalized cDNA libraries from six diverse B. tectorum individuals were pooled and sequenced using 454 sequencing. Ninety-five SNP assays were developed for use on 96.96 arrays with the Fluidigm EP1 genotyping platform. Verification of the 95 SNPs by genotyping 251 individuals from 12 populations is reported, along with amplification data from four related Bromus species. Conclusions: These SNP markers are polymorphic across populations of B. tectorum, are optimized for high-throughput applications, and may be applicable to other, related Bromus species. PMID:27843723

  1. Development of high-throughput SNP-based genotyping in Acacia auriculiformis x A. mangium hybrids using short-read transcriptome data

    PubMed Central

    2012-01-01

    Background Next Generation Sequencing has provided comprehensive, affordable and high-throughput DNA sequences for Single Nucleotide Polymorphism (SNP) discovery in Acacia auriculiformis and Acacia mangium. Like other non-model species, SNP detection and genotyping in Acacia are challenging due to lack of genome sequences. The main objective of this study is to develop the first high-throughput SNP genotyping assay for linkage map construction of A. auriculiformis x A. mangium hybrids. Results We identified a total of 37,786 putative SNPs by aligning short read transcriptome data from four parents of two Acacia hybrid mapping populations using Bowtie against 7,839 de novo transcriptome contigs. Given a set of 10 validated SNPs from two lignin genes, our in silico SNP detection approach is highly accurate (100%) compared to the traditional in vitro approach (44%). Further validation of 96 SNPs using Illumina GoldenGate Assay gave an overall assay success rate of 89.6% and conversion rate of 37.5%. We explored possible factors lowering assay success rate by predicting exon-intron boundaries and paralogous genes of Acacia contigs using Medicago truncatula genome as reference. This assessment revealed that presence of exon-intron boundary is the main cause (50%) of assay failure. Subsequent SNPs filtering and improved assay design resulted in assay success and conversion rate of 92.4% and 57.4%, respectively based on 768 SNPs genotyping. Analysis of clustering patterns revealed that 27.6% of the assays were not reproducible and flanking sequence might play a role in determining cluster compression. In addition, we identified a total of 258 and 319 polymorphic SNPs in A. auriculiformis and A. mangium natural germplasms, respectively. Conclusion We have successfully discovered a large number of SNP markers in A. auriculiformis x A. mangium hybrids using next generation transcriptome sequencing. By using a reference genome from the most closely related species, we

  2. Development of high-throughput SNP-based genotyping in Acacia auriculiformis x A. mangium hybrids using short-read transcriptome data.

    PubMed

    Wong, Melissa M L; Cannon, Charles H; Wickneswari, Ratnam

    2012-12-24

    Next Generation Sequencing has provided comprehensive, affordable and high-throughput DNA sequences for Single Nucleotide Polymorphism (SNP) discovery in Acacia auriculiformis and Acacia mangium. Like other non-model species, SNP detection and genotyping in Acacia are challenging due to lack of genome sequences. The main objective of this study is to develop the first high-throughput SNP genotyping assay for linkage map construction of A. auriculiformis x A. mangium hybrids. We identified a total of 37,786 putative SNPs by aligning short read transcriptome data from four parents of two Acacia hybrid mapping populations using Bowtie against 7,839 de novo transcriptome contigs. Given a set of 10 validated SNPs from two lignin genes, our in silico SNP detection approach is highly accurate (100%) compared to the traditional in vitro approach (44%). Further validation of 96 SNPs using Illumina GoldenGate Assay gave an overall assay success rate of 89.6% and conversion rate of 37.5%. We explored possible factors lowering assay success rate by predicting exon-intron boundaries and paralogous genes of Acacia contigs using Medicago truncatula genome as reference. This assessment revealed that presence of exon-intron boundary is the main cause (50%) of assay failure. Subsequent SNPs filtering and improved assay design resulted in assay success and conversion rate of 92.4% and 57.4%, respectively based on 768 SNPs genotyping. Analysis of clustering patterns revealed that 27.6% of the assays were not reproducible and flanking sequence might play a role in determining cluster compression. In addition, we identified a total of 258 and 319 polymorphic SNPs in A. auriculiformis and A. mangium natural germplasms, respectively. We have successfully discovered a large number of SNP markers in A. auriculiformis x A. mangium hybrids using next generation transcriptome sequencing. By using a reference genome from the most closely related species, we converted most SNPs to successful

  3. FBAT-SNP-PC: an approach for multiple markers and single trait in family-based association tests.

    PubMed

    Rakovski, Cyril S; Weiss, Scott T; Laird, Nan M; Lange, Christoph

    2008-01-01

    Develop a new test for family-based association studies and continuous traits that incorporates power- enhancing techniques from two existing testing strategies. The new procedure initiates with an extraction of the relevant information from the variability of the genotypes and an assessment of the approximate individual markers effects and their directions. This information is incorporated in the construction of the actual test statistic through a selection of a data-determined number of optimal linear combinations of the offspring genotypes which, in a power enhancing step, are consequently combined into a single degree of freedom test. We conduct a comparison simulation study in which the performance of the new test is contrasted with the test that is currently known to offer the highest overall power, FBAT-LC. The new test has an overall performance very similar to that of FBAT-LC but attains higher power in candidate genes with lower average pairwise correlations and moderate to high allele frequencies with large gains (up to 80%) for some of the analyzed genes possessing the above-mentioned characteristics. The new test is a promising tool for candidate gene studies with substantial power gains for genes that are characterized by SNPs with low mean pairwise correlation. (c) 2008 S. Karger AG, Basel

  4. Development and evaluation of 200 novel SNP assays for population genetic studies of westslope cutthroat trout and genetic identification of related taxa.

    PubMed

    Campbell, N R; Amish, S J; Pritchard, V L; McKelvey, K S; Young, M K; Schwartz, M K; Garza, J C; Luikart, G; Narum, S R

    2012-09-01

    DNA sequence data were collected and screened for single nucleotide polymorphisms (SNPs) in westslope cutthroat trout (Oncorhynchus clarki lewisi) and also for substitutions that could be used to genetically discriminate rainbow trout (O. mykiss) and cutthroat trout, as well as several cutthroat trout subspecies. In total, 260 expressed sequence tag-derived loci were sequenced and allelic discrimination genotyping assays developed from 217 of the variable sites. Another 50 putative SNPs in westslope cutthroat trout were identified by restriction-site-associated DNA sequencing, and seven of these were developed into assays. Twelve O. mykiss SNP assays that were variable within westslope cutthroat trout and 12 previously published SNP assays were also included in downstream testing. A total of 241 assays were tested on six westslope cutthroat trout populations (N = 32 per population), as well as collections of four other cutthroat trout subspecies and a population of rainbow trout. All assays were evaluated for reliability and deviation from Hardy-Weinberg and linkage equilibria. Poorly performing and duplicate assays were removed from the data set, and the remaining 200 assays were used in tests of population differentiation. The remaining markers easily distinguished the various subspecies tested, as evidenced by mean G(ST) of 0.74. A smaller subset of the markers (N = 86; average G(ST) = 0.40) was useful for distinguishing the six populations of westslope cutthroat trout. This study increases by an order of magnitude the number of genetic markers available for the study of westslope cutthroat trout and closely related taxa and includes many markers in genes (developed from ESTs).

  5. Identification of SNP-SNP interaction for chronic dialysis patients.

    PubMed

    Yang, Cheng-Hong; Weng, Zi-Jie; Chuang, Li-Yeh; Yang, Cheng-San

    2017-04-01

    Analyses of interactions between single nucleotide polymorphisms (SNPs) have reported significant associations between mitochondrial displacement loops (D-loops) and chronic dialysis diseases. However, the method used to detect potential SNP-SNP interaction still requires improvement. This study proposes an effective algorithm named dynamic center particle swarm optimization k-nearest neighbors (DCPSO-KNN) to detect the SNP-SNP interaction. DCPSO-KNN uses dynamic center particle swarm optimization (DCPSO) to generate SNP combinations with a fitness function designed using the KNN method and statistical verification. A total of 77 SNPs in the mitochondrial D-loop were used to detect the SNP-SNP interactions and the search ability was compared against that of other methods. The detected SNP-SNP interactions were statistically evaluated. Experimental results showed that DCPSO-KNN successfully detects SNP-SNP interactions in two-to-seven-order combinations (positive predictive value (PPV)+negative predictive value (NPV)=1.154 to 1.310; odds ratio (OR)=1.859 to 4.015; 95% confidence interval (95% CI)=1.151 to 4.265; p-value <0.001). DCPSO-KNN can improve the detection ability of SNP-SNP associations between mitochondrial D-loops and chronic dialysis diseases, thus facilitating the development of biomedical applications. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. Generation and analysis of expressed sequence tags(ESTs) for marker development in yam (Dioscores alata L.)

    USDA-ARS?s Scientific Manuscript database

    A total of 44,757 EST sequences , 1705 EST-SSR and 104 SNP markers were generated from the cDNA libraries of the resistant and susceptible genotypes. We have developed a comprehensive annotated transcriptome data set in yam to enrich the EST information in public databases. These EST resources prov...

  7. Development of a high density 600K SNP genotyping array for chicken

    PubMed Central

    2013-01-01

    Background High density (HD) SNP genotyping arrays are an important tool for genetic analyses of animals and plants. Although the chicken is one of the most important farm animals, no HD array is yet available for high resolution genetic analysis of this species. Results We report here the development of a 600 K Affymetrix® Axiom® HD genotyping array designed using SNPs segregating in a wide variety of chicken populations. In order to generate a large catalogue of segregating SNPs, we re-sequenced 243 chickens from 24 chicken lines derived from diverse sources (experimental, commercial broiler and layer lines) by pooling 10–15 samples within each line. About 139 million (M) putative SNPs were detected by mapping sequence reads to the new reference genome (Gallus_gallus_4.0) of which ~78 M appeared to be segregating in different lines. Using criteria such as high SNP-quality score, acceptable design scores predicting high conversion performance in the final array and uniformity of distribution across the genome, we selected ~1.8 M SNPs for validation through genotyping on an independent set of samples (n = 282). About 64% of the SNPs were polymorphic with high call rates (>98%), good cluster separation and stable Mendelian inheritance. Polymorphic SNPs were further analysed for their population characteristics and genomic effects. SNPs with extreme breach of Hardy-Weinberg equilibrium (P < 0.00001) were excluded from the panel. The final array, designed on the basis of these analyses, consists of 580,954 SNPs and includes 21,534 coding variants. SNPs were selected to achieve an essentially uniform distribution based on genetic map distance for both broiler and layer lines. Due to a lower extent of LD in broilers compared to layers, as reported in previous studies, the ratio of broiler and layer SNPs in the array was kept as 3:2. The final panel was shown to genotype a wide range of samples including broilers and layers with over 100 K to 450 K

  8. Development of a high density 600K SNP genotyping array for chicken.

    PubMed

    Kranis, Andreas; Gheyas, Almas A; Boschiero, Clarissa; Turner, Frances; Yu, Le; Smith, Sarah; Talbot, Richard; Pirani, Ali; Brew, Fiona; Kaiser, Pete; Hocking, Paul M; Fife, Mark; Salmon, Nigel; Fulton, Janet; Strom, Tim M; Haberer, Georg; Weigend, Steffen; Preisinger, Rudolf; Gholami, Mahmood; Qanbari, Saber; Simianer, Henner; Watson, Kellie A; Woolliams, John A; Burt, David W

    2013-01-28

    High density (HD) SNP genotyping arrays are an important tool for genetic analyses of animals and plants. Although the chicken is one of the most important farm animals, no HD array is yet available for high resolution genetic analysis of this species. We report here the development of a 600 K Affymetrix® Axiom® HD genotyping array designed using SNPs segregating in a wide variety of chicken populations. In order to generate a large catalogue of segregating SNPs, we re-sequenced 243 chickens from 24 chicken lines derived from diverse sources (experimental, commercial broiler and layer lines) by pooling 10-15 samples within each line. About 139 million (M) putative SNPs were detected by mapping sequence reads to the new reference genome (Gallus_gallus_4.0) of which ~78 M appeared to be segregating in different lines. Using criteria such as high SNP-quality score, acceptable design scores predicting high conversion performance in the final array and uniformity of distribution across the genome, we selected ~1.8 M SNPs for validation through genotyping on an independent set of samples (n = 282). About 64% of the SNPs were polymorphic with high call rates (>98%), good cluster separation and stable Mendelian inheritance. Polymorphic SNPs were further analysed for their population characteristics and genomic effects. SNPs with extreme breach of Hardy-Weinberg equilibrium (P < 0.00001) were excluded from the panel. The final array, designed on the basis of these analyses, consists of 580,954 SNPs and includes 21,534 coding variants. SNPs were selected to achieve an essentially uniform distribution based on genetic map distance for both broiler and layer lines. Due to a lower extent of LD in broilers compared to layers, as reported in previous studies, the ratio of broiler and layer SNPs in the array was kept as 3:2. The final panel was shown to genotype a wide range of samples including broilers and layers with over 100 K to 450 K informative SNPs per line. A principal

  9. Genetic diversity and population structure assessed by SSR and SNP markers in a large germplasm collection of grape

    PubMed Central

    2013-01-01

    Background The economic importance of grapevine has driven significant efforts in genomics to accelerate the exploitation of Vitis resources for development of new cultivars. However, although a large number of clonally propagated accessions are maintained in grape germplasm collections worldwide, their use for crop improvement is limited by the scarcity of information on genetic diversity, population structure and proper phenotypic assessment. The identification of representative and manageable subset of accessions would facilitate access to the diversity available in large collections. A genome-wide germplasm characterization using molecular markers can offer reliable tools for adjusting the quality and representativeness of such core samples. Results We investigated patterns of molecular diversity at 22 common microsatellite loci and 384 single nucleotide polymorphisms (SNPs) in 2273 accessions of domesticated grapevine V. vinifera ssp. sativa, its wild relative V. vinifera ssp. sylvestris, interspecific hybrid cultivars and rootstocks. Despite the large number of putative duplicates and extensive clonal relationships among the accessions, we observed high level of genetic variation. In the total germplasm collection the average genetic diversity, as quantified by the expected heterozygosity, was higher for SSR loci (0.81) than for SNPs (0.34). The analysis of the genetic structure in the grape germplasm collection revealed several levels of stratification. The primary division was between accessions of V. vinifera and non-vinifera, followed by the distinction between wild and domesticated grapevine. Intra-specific subgroups were detected within cultivated grapevine representing different eco-geographic groups. The comparison of a phenological core collection and genetic core collections showed that the latter retained more genetic diversity, while maintaining a similar phenotypic variability. Conclusions The comprehensive molecular characterization of our grape

  10. A comprehensive transcriptome provides candidate genes for sex determination/differentiation and SSR/SNP markers in yellow catfish.

    PubMed

    Chen, Xin; Mei, Jie; Wu, Junjie; Jing, Jing; Ma, Wenge; Zhang, Jin; Dan, Cheng; Wang, Weimin; Gui, Jian-Fang

    2015-04-01

    Sex dimorphic growth pattern has significant theory and application implications in fish. Recently, a Y- and X-specific allele marker-assisted sex control technique has been developed for mass production of all-male population in yellow catfish (Pelteobagrus fulvidraco), but the genetic information for sex determination and sex control breeding has remained unclear. Here, we attempted to provide the first insight into a comprehensive transcriptome covering multiple tissues from XX females, XY males, and YY super-males of yellow catfish by using 454 GS-FLX platform, for a better assembly and gene coverage. A total of 1,202,933 high quality reads (about 540 Mbp) were obtained and assembled into 28,297 contigs and 141,951 singletons. BLASTX searches against the NCBI non-redundant protein database (nr) led a total of 52,564 unique sequences including 18,748 contigs and 33,816 singletons to match 25,669 known or predicted unique proteins. All of them with annotated function were categorized by gene ontology (GO) analysis, and 712 were assigned to reproduction and reproductive process. Some potential genes relevant to reproductive system including steroid hormone biosynthesis and GnRH (gonadotropin-releasing hormone) signaling pathway were further identified by Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis; and at least 21 sex determination and differentiation-related genes, such as Dmrt1, Sox9a/b, Cyp19b, WT1, and AMH were identified and characterized. Additionally, a total of 82,794 simple sequence repeats (SSRs), 26,450 single nucleotide polymorphisms (SNPs), and 4,145 insertions and deletions (INDELs) were revealed from the transcriptome data. Therefore, the current transcriptome resources highlight further studies on sex-control breeding in yellow catfish and will benefit future studies on reproduction and sex determination in teleost fish.

  11. Development of a SNP panel dedicated to parentage assignment in French sheep populations.

    PubMed

    Tortereau, F; Moreno, C R; Tosser-Klopp, G; Servin, B; Raoul, J

    2017-05-26

    The efficiency of breeding programs partly relies on the accuracy of the estimated breeding values which decreases when pedigrees are incomplete. Two reproduction techniques are mainly used by sheep breeders to identify the sires of lambs: animal insemination and natural matings with a single ram per group of ewes. Both methods have major drawbacks, notably time-consuming tasks for breeders, and are thus used at varying levels in breeding programs. As a consequence, the percentage of known sires can be very low in some breeds and results in less accurate estimated breeding values. In order to address this issue and offer an alternative strategy for obtaining parentage information, we designed a set of 249 SNPs for parentage assignment in French sheep breeds and tested its efficiency in one breed. The set was derived from the 54 K SNP chip that was used to genotype the thirty main French sheep populations. Only SNPs in Hardy-Weinberg equilibrium, displaying the highest Minor Allele Frequency across all the thirty populations and not associated with Mendelian errors in verified family trios were selected. The panel of 249 SNPs was successfully used in an on-farm test in the BMC breed and resulted in more than 95% of lambs being assigned to a unique sire. In this study we developed a SNP panel for assignment that achieved good results in the on-farm testing. We also raised some conditions for optimal use of this panel: at least 180 SNPs should be used and a minute preparation of the list of candidate sires. Our panel also displays high levels of MAF in the SheepHapMap breeds, particularly in the South West European breeds.

  12. A Coordinated Approach to Peach SNP Discovery in RosBREED

    USDA-ARS?s Scientific Manuscript database

    In the USDA-funded multi-institutional and trans-disciplinary project, “RosBREED”, crop-specific SNP genome scan platforms are being developed for peach, apple, strawberry, and cherry at a resolution of at least one polymorphic SNP marker every 5 cM in any random cross, for use in Pedigree-Based Ana...

  13. Allele diversity for abiotic stress responsive candidate genes in chickpea reference set using gene based SNP markers

    PubMed Central

    Roorkiwal, Manish; Nayak, Spurthi N.; Thudi, Mahendar; Upadhyaya, Hari D.; Brunel, Dominique; Mournet, Pierre; This, Dominique; Sharma, Prakash C.; Varshney, Rajeev K.

    2014-01-01

    Chickpea is an important food legume crop for the semi-arid regions, however, its productivity is adversely affected by various biotic and abiotic stresses. Identification of candidate genes associated with abiotic stress response will help breeding efforts aiming to enhance its productivity. With this objective, 10 abiotic stress responsive candidate genes were selected on the basis of prior knowledge of this complex trait. These 10 genes were subjected to allele specific sequencing across a chickpea reference set comprising 300 genotypes including 211 genotypes of chickpea mini core collection. A total of 1.3 Mbp sequence data were generated. Multiple sequence alignment (MSA) revealed 79 SNPs and 41 indels in nine genes while the CAP2 gene was found to be conserved across all the genotypes. Among 10 candidate genes, the maximum number of SNPs (34) was observed in abscisic acid stress and ripening (ASR) gene including 22 transitions, 11 transversions and one tri-allelic SNP. Nucleotide diversity varied from 0.0004 to 0.0029 while polymorphism information content (PIC) values ranged from 0.01 (AKIN gene) to 0.43 (CAP2 promoter). Haplotype analysis revealed that alleles were represented by more than two haplotype blocks, except alleles of the CAP2 and sucrose synthase (SuSy) gene, where only one haplotype was identified. These genes can be used for association analysis and if validated, may be useful for enhancing abiotic stress, including drought tolerance, through molecular breeding. PMID:24926299

  14. Allele diversity for abiotic stress responsive candidate genes in chickpea reference set using gene based SNP markers.

    PubMed

    Roorkiwal, Manish; Nayak, Spurthi N; Thudi, Mahendar; Upadhyaya, Hari D; Brunel, Dominique; Mournet, Pierre; This, Dominique; Sharma, Prakash C; Varshney, Rajeev K

    2014-01-01

    Chickpea is an important food legume crop for the semi-arid regions, however, its productivity is adversely affected by various biotic and abiotic stresses. Identification of candidate genes associated with abiotic stress response will help breeding efforts aiming to enhance its productivity. With this objective, 10 abiotic stress responsive candidate genes were selected on the basis of prior knowledge of this complex trait. These 10 genes were subjected to allele specific sequencing across a chickpea reference set comprising 300 genotypes including 211 genotypes of chickpea mini core collection. A total of 1.3 Mbp sequence data were generated. Multiple sequence alignment (MSA) revealed 79 SNPs and 41 indels in nine genes while the CAP2 gene was found to be conserved across all the genotypes. Among 10 candidate genes, the maximum number of SNPs (34) was observed in abscisic acid stress and ripening (ASR) gene including 22 transitions, 11 transversions and one tri-allelic SNP. Nucleotide diversity varied from 0.0004 to 0.0029 while polymorphism information content (PIC) values ranged from 0.01 (AKIN gene) to 0.43 (CAP2 promoter). Haplotype analysis revealed that alleles were represented by more than two haplotype blocks, except alleles of the CAP2 and sucrose synthase (SuSy) gene, where only one haplotype was identified. These genes can be used for association analysis and if validated, may be useful for enhancing abiotic stress, including drought tolerance, through molecular breeding.

  15. Candidate Gene Identification with SNP Marker-Based Fine Mapping of Anthracnose Resistance Gene Co-4 in Common Bean.

    PubMed

    Burt, Andrew J; William, H Manilal; Perry, Gregory; Khanal, Raja; Pauls, K Peter; Kelly, James D; Navabi, Alireza

    2015-01-01

    Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris). Alleles at the Co-4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08) where Co-4 is localized. Three SCAR markers with known linkage to Co-4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK-4 loci found in previous studies. It is possible that the Co-4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases.

  16. Development of 101 novel EST-derived single nucleotide polymorphism markers for Zhikong scallop ( Chlamys farreri)

    NASA Astrophysics Data System (ADS)

    Li, Jiqin; Bao, Zhenmin; Li, Ling; Wang, Xiaojian; Wang, Shi; Hu, Xiaoli

    2013-09-01

    Zhikong scallop ( Chlamys farreri) is an important maricultured species in China. Many researches on this species, such as population genetics and QTL fine-mapping, need a large number of molecular markers. In this study, based on the expressed sequence tags (EST), a total of 300 putative single nucleotide polymorphisms (SNPs) were selected and validated using high resolution melting (HRM) technology with unlabeled probe. Of them, 101 (33.7%) were found to be polymorphic in 48 individuals from 4 populations. Further evaluation with 48 individuals from Qingdao population showed that all the polymorphic loci had two alleles with the minor allele frequency ranged from 0.046 to 0.500. The observed and expected heterozygosities ranged from 0.000 to 0.925 and from 0.089 to 0.505, respectively. Fifteen loci deviated significantly from Hardy-Weinberg equilibrium and significant linkage disequilibrate was detected in one pair of markers. BLASTx gave significant hits for 72 of the 101 polymorphic SNP-containing ESTs. Thirty four polymorphic SNP loci were predicted to be non-synonymous substitutions as they caused either the change of codons (33 SNPs) or pretermination of translation (1 SNP). The markers developed can be used for the population studies and genetic improvement on Zhikong scallop.

  17. SNP Arrays

    PubMed Central

    Louhelainen, Jari

    2016-01-01

    The papers published in this Special Issue “SNP arrays” (Single Nucleotide Polymorphism Arrays) focus on several perspectives associated with arrays of this type. The range of papers vary from a case report to reviews, thereby targeting wider audiences working in this field. The research focus of SNP arrays is often human cancers but this Issue expands that focus to include areas such as rare conditions, animal breeding and bioinformatics tools. Given the limited scope, the spectrum of papers is nothing short of remarkable and even from a technical point of view these papers will contribute to the field at a general level. Three of the papers published in this Special Issue focus on the use of various SNP array approaches in the analysis of three different cancer types. Two of the papers concentrate on two very different rare conditions, applying the SNP arrays slightly differently. Finally, two other papers evaluate the use of the SNP arrays in the context of genetic analysis of livestock. The findings reported in these papers help to close gaps in the current literature and also to give guidelines for future applications of SNP arrays. PMID:27792140

  18. Development and evaluation of a genome-wide 6K SNP array for diploid sweet cherry and tetraploid sour cherry

    USDA-ARS?s Scientific Manuscript database

    High-throughput genome scans are important tools for genetic studies and breeding applications. Here, a 6K SNP array for use with the Illumina Infinium® system was developed for diploid sweet cherry (Prunus avium) and allotetraploid sour cherry (P. cerasus). This effort was led by RosBREED, a commun...

  19. Candidate Gene Identification with SNP Marker-Based Fine Mapping of Anthracnose Resistance Gene Co-4 in Common Bean

    PubMed Central

    Burt, Andrew J.; William, H. Manilal; Perry, Gregory; Khanal, Raja; Pauls, K. Peter; Kelly, James D.; Navabi, Alireza

    2015-01-01

    Anthracnose, caused by Colletotrichum lindemuthianum, is an important fungal disease of common bean (Phaseolus vulgaris). Alleles at the Co–4 locus confer resistance to a number of races of C. lindemuthianum. A population of 94 F4:5 recombinant inbred lines of a cross between resistant black bean genotype B09197 and susceptible navy bean cultivar Nautica was used to identify markers associated with resistance in bean chromosome 8 (Pv08) where Co–4 is localized. Three SCAR markers with known linkage to Co–4 and a panel of single nucleotide markers were used for genotyping. A refined physical region on Pv08 with significant association with anthracnose resistance identified by markers was used in BLAST searches with the genomic sequence of common bean accession G19833. Thirty two unique annotated candidate genes were identified that spanned a physical region of 936.46 kb. A majority of the annotated genes identified had functional similarity to leucine rich repeats/receptor like kinase domains. Three annotated genes had similarity to 1, 3-β-glucanase domains. There were sequence similarities between some of the annotated genes found in the study and the genes associated with phosphoinositide-specific phosphilipases C associated with Co-x and the COK–4 loci found in previous studies. It is possible that the Co–4 locus is structured as a group of genes with functional domains dominated by protein tyrosine kinase along with leucine rich repeats/nucleotide binding site, phosphilipases C as well as β-glucanases. PMID:26431031

  20. Multiple SNP Markers Reveal Fine-Scale Population and Deep Phylogeographic Structure in European Anchovy (Engraulis encrasicolus L.)

    PubMed Central

    Zarraonaindia, Iratxe; Iriondo, Mikel; Albaina, Aitor; Pardo, Miguel Angel; Manzano, Carmen; Grant, W. Stewart; Irigoien, Xabier; Estonba, Andone

    2012-01-01

    Geographic surveys of allozymes, microsatellites, nuclear DNA (nDNA) and mitochondrial DNA (mtDNA) have detected several genetic subdivisions among European anchovy populations. However, these studies have been limited in their power to detect some aspects of population structure by the use of a single or a few molecular markers, or by limited geographic sampling. We use a multi-marker approach, 47 nDNA and 15 mtDNA single nucleotide polymorphisms (SNPs), to analyze 626 European anchovies from the whole range of the species to resolve shallow and deep levels of population structure. Nuclear SNPs define 10 genetic entities within two larger genetically distinctive groups associated with oceanic variables and different life-history traits. MtDNA SNPs define two deep phylogroups that reflect ancient dispersals and colonizations. These markers define two ecological groups. One major group of Iberian-Atlantic populations is associated with upwelling areas on narrow continental shelves and includes populations spawning and overwintering in coastal areas. A second major group includes northern populations in the North East (NE) Atlantic (including the Bay of Biscay) and the Mediterranean and is associated with wide continental shelves with local larval retention currents. This group tends to spawn and overwinter in oceanic areas. These two groups encompass ten populations that differ from previously defined management stocks in the Alboran Sea, Iberian-Atlantic and Bay of Biscay regions. In addition, a new North Sea-English Channel stock is defined. SNPs indicate that some populations in the Bay of Biscay are genetically closer to North Western (NW) Mediterranean populations than to other populations in the NE Atlantic, likely due to colonizations of the Bay of Biscay and NW Mediterranean by migrants from a common ancestral population. Northern NE Atlantic populations were subsequently established by migrants from the Bay of Biscay. Populations along the Iberian

  1. Multiple SNP markers reveal fine-scale population and deep phylogeographic structure in European anchovy (Engraulis encrasicolus L.).

    PubMed

    Zarraonaindia, Iratxe; Iriondo, Mikel; Albaina, Aitor; Pardo, Miguel Angel; Manzano, Carmen; Grant, W Stewart; Irigoien, Xabier; Estonba, Andone

    2012-01-01

    Geographic surveys of allozymes, microsatellites, nuclear DNA (nDNA) and mitochondrial DNA (mtDNA) have detected several genetic subdivisions among European anchovy populations. However, these studies have been limited in their power to detect some aspects of population structure by the use of a single or a few molecular markers, or by limited geographic sampling. We use a multi-marker approach, 47 nDNA and 15 mtDNA single nucleotide polymorphisms (SNPs), to analyze 626 European anchovies from the whole range of the species to resolve shallow and deep levels of population structure. Nuclear SNPs define 10 genetic entities within two larger genetically distinctive groups associated with oceanic variables and different life-history traits. MtDNA SNPs define two deep phylogroups that reflect ancient dispersals and colonizations. These markers define two ecological groups. One major group of Iberian-Atlantic populations is associated with upwelling areas on narrow continental shelves and includes populations spawning and overwintering in coastal areas. A second major group includes northern populations in the North East (NE) Atlantic (including the Bay of Biscay) and the Mediterranean and is associated with wide continental shelves with local larval retention currents. This group tends to spawn and overwinter in oceanic areas. These two groups encompass ten populations that differ from previously defined management stocks in the Alboran Sea, Iberian-Atlantic and Bay of Biscay regions. In addition, a new North Sea-English Channel stock is defined. SNPs indicate that some populations in the Bay of Biscay are genetically closer to North Western (NW) Mediterranean populations than to other populations in the NE Atlantic, likely due to colonizations of the Bay of Biscay and NW Mediterranean by migrants from a common ancestral population. Northern NE Atlantic populations were subsequently established by migrants from the Bay of Biscay. Populations along the Iberian

  2. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-02-09

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean.

  3. Development and application of a novel genome-wide SNP array reveals domestication history in soybean

    PubMed Central

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  4. Anchoring Linkage Groups of the Rosa Genetic Map to Physical Chromosomes with Tyramide-FISH and EST-SNP Markers

    PubMed Central

    Kirov, Ilya; Van Laere, Katrijn; De Riek, Jan; De Keyser, Ellen; Van Roy, Nadine; Khrustaleva, Ludmila

    2014-01-01

    In order to anchor Rosa linkage groups to physical chromosomes, a combination of the Tyramide-FISH technology and the modern molecular marker system based on High Resolution Melting (HRM) is an efficient approach. Although, Tyramide-FISH is a very promising technique for the visualization of short DNA probes, it is very challenging for plant species with small chromosomes such as Rosa. In this study, we successfully applied the Tyramide-FISH technique for Rosa and compared different detection systems. An indirect detection system exploiting biotinylated tyramides was shown to be the most suitable technique for reliable signal detection. Three gene fragments with a size of 1100 pb–1700 bp (Phenylalanine Ammonia Lyase, Pyrroline-5-Carboxylate Synthase and Orcinol O-Methyl Transferase) have been physically mapped on chromosomes 7, 4 and 1, respectively, of Rosa wichurana. The signal frequency was between 25% and 40%. HRM markers of these 3 gene fragments were used to include the gene fragments on the existing genetic linkage map of Rosa wichurana. As a result, three linkage groups could be anchored to their physical chromosomes. The information was used to check for synteny between the Rosa chromosomes and Fragaria. PMID:24755945

  5. Anchoring linkage groups of the Rosa genetic map to physical chromosomes with tyramide-FISH and EST-SNP markers.

    PubMed

    Kirov, Ilya; Van Laere, Katrijn; De Riek, Jan; De Keyser, Ellen; Van Roy, Nadine; Khrustaleva, Ludmila

    2014-01-01

    In order to anchor Rosa linkage groups to physical chromosomes, a combination of the Tyramide-FISH technology and the modern molecular marker system based on High Resolution Melting (HRM) is an efficient approach. Although, Tyramide-FISH is a very promising technique for the visualization of short DNA probes, it is very challenging for plant species with small chromosomes such as Rosa. In this study, we successfully applied the Tyramide-FISH technique for Rosa and compared different detection systems. An indirect detection system exploiting biotinylated tyramides was shown to be the most suitable technique for reliable signal detection. Three gene fragments with a size of 1100 pb-1700 bp (Phenylalanine Ammonia Lyase, Pyrroline-5-Carboxylate Synthase and Orcinol O-Methyl Transferase) have been physically mapped on chromosomes 7, 4 and 1, respectively, of Rosa wichurana. The signal frequency was between 25% and 40%. HRM markers of these 3 gene fragments were used to include the gene fragments on the existing genetic linkage map of Rosa wichurana. As a result, three linkage groups could be anchored to their physical chromosomes. The information was used to check for synteny between the Rosa chromosomes and Fragaria.

  6. Novel SNP markers in InvGE and SssI genes are associated with natural variation of sugar contents and frying color in Solanum tuberosum Group Phureja.

    PubMed

    Duarte-Delgado, Diana; Juyó, Deissy; Gebhardt, Christiane; Sarmiento, Felipe; Mosquera-Vásquez, Teresa

    2017-03-09

    Potato frying color is an agronomic trait influenced by the sugar content of tubers. The candidate gene approach was employed to elucidate the molecular basis of this trait in Solanum tuberosum Group Phureja, which is mainly diploid and represents an important genetic resource for potato breeding. The objective of this research was to identify novel genetic variants related with frying quality in loci with key functions in carbohydrate metabolism, with the purpose of discovering genetic variability useful in breeding programs. Therefore, an association analysis was implemented with 109 SNP markers identified in ten candidate genes. The analyses revealed four associations in the locus InvGE coding for an apoplastic invertase and one association in the locus SssI coding for a soluble starch synthase. The SNPs SssI-C 45711901 T and InvGE-C 2475454 T were associated with sucrose content and frying color, respectively, and were not found previously in tetraploid genotypes. The rare haplotype InvGE-A 2475187 C 2475295 A 2475344 was associated with higher fructose contents. Our study allowed a more detailed analysis of the sequence variation of exon 3 from InvGE, which was not possible in previous studies because of the high frequency of insertion-deletion polymorphisms in tetraploid potatoes. The association mapping strategy using a candidate gene approach in Group Phureja allowed the identification of novel SNP markers in InvGE and SssI associated with frying color and the tuber sugar content measured by High Performance Liquid Chromatography (HPLC). These novel associations might be useful in potato breeding programs for improving quality traits and to increase crop genetic variability. The results suggest that some genes involved in the natural variation of tuber sugar content and frying color are conserved in both Phureja and tetraploid germplasm. Nevertheless, the associated variants in both types of germplasm were present in different regions of these genes. This

  7. Molecular marker development from transcript sequences and germplasm evaluation for cultivated peanut (Arachis hypogaea L.).

    PubMed

    Peng, Ze; Gallo, Maria; Tillman, Barry L; Rowland, Diane; Wang, Jianping

    2016-02-01

    Molecular markers are important tools for genotyping in genetic studies and molecular breeding. The SSR and SNP are two commonly used marker systems developed from genomic or transcript sequences. The objectives of this study were to: (1) assemble and annotate the publicly available ESTs in Arachis and the in-house short reads, (2) develop and validate SSR and SNP markers, and (3) investigate the genetic diversity and population structure of the peanut breeding lines and the U.S. peanut mini core collection using developed SSR markers. An NCBI EST dataset with 252,951 sequences and an in-house 454 RNAseq dataset with 288,701 sequences were assembled separately after trimming. Transcript sequence comparison and phylogenetic analysis suggested that peanut is closer to cowpea and scarlet bean than to soybean, common bean and Medicago. From these two datasets, 6455 novel SSRs and 11,902 SNPs were identified. Of the discovered SSRs, 380 representing various SSR types were selected for PCR validation. The amplification rate was 89.2 %. Twenty-two (6.5 %) SSRs were polymorphic between at least one pair of four genotypes. Sanger sequencing of PCR products targeting 110 SNPs revealed 13 true SNPs between tetraploid genotypes and 193 homoeologous SNPs within genotypes. Eight out of the 22 polymorphic SSR markers were selected to evaluate the genetic diversity of Florida peanut breeding lines and the U.S. peanut mini core collection. This marker set demonstrated high discrimination power by displaying an average polymorphism information content value of 0.783, a combined probability of identity of 10(-11), and a combined power of exclusion of 0.99991. The structure analysis revealed four sub-populations among the peanut accessions and lines evaluated. The results of this study enriched the peanut genomic resources, provided over 6000 novel SSR markers and the credentials for true peanut SNP marker development, and demonstrated the power of newly developed SSR markers in

  8. Use of genotyping by sequencing data to develop a high-throughput and multifunctional SNP panel for conservation applications in Pacific lamprey.

    PubMed

    Hess, Jon E; Campbell, Nathan R; Docker, Margaret F; Baker, Cyndi; Jackson, Aaron; Lampman, Ralph; McIlraith, Brian; Moser, Mary L; Statler, David P; Young, William P; Wildbill, Andrew J; Narum, Shawn R

    2015-01-01

    Next-generation sequencing data can be mined for highly informative single nucleotide polymorphisms (SNPs) to develop high-throughput genomic assays for nonmodel organisms. However, choosing a set of SNPs to address a variety of objectives can be difficult because SNPs are often not equally informative. We developed an optimal combination of 96 high-throughput SNP assays from a total of 4439 SNPs identified in a previous study of Pacific lamprey (Entosphenus tridentatus) and used them to address four disparate objectives: parentage analysis, species identification and characterization of neutral and adaptive variation. Nine of these SNPs are FST outliers, and five of these outliers are localized within genes and significantly associated with geography, run-timing and dwarf life history. Two of the 96 SNPs were diagnostic for two other lamprey species that were morphologically indistinguishable at early larval stages and were sympatric in the Pacific Northwest. The majority (85) of SNPs in the panel were highly informative for parentage analysis, that is, putatively neutral with high minor allele frequency across the species' range. Results from three case studies are presented to demonstrate the broad utility of this panel of SNP markers in this species. As Pacific lamprey populations are undergoing rapid decline, these SNPs provide an important resource to address critical uncertainties associated with the conservation and recovery of this imperiled species. © 2014 John Wiley & Sons Ltd.

  9. SNP Discovery and Linkage Map Construction in Cultivated Tomato

    PubMed Central

    Shirasawa, Kenta; Isobe, Sachiko; Hirakawa, Hideki; Asamizu, Erika; Fukuoka, Hiroyuki; Just, Daniel; Rothan, Christophe; Sasamoto, Shigemi; Fujishiro, Tsunakazu; Kishida, Yoshie; Kohara, Mitsuyo; Tsuruoka, Hisano; Wada, Tsuyuko; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

    2010-01-01

    Few intraspecific genetic linkage maps have been reported for cultivated tomato, mainly because genetic diversity within Solanum lycopersicum is much less than that between tomato species. Single nucleotide polymorphisms (SNPs), the most abundant source of genomic variation, are the most promising source of polymorphisms for the construction of linkage maps for closely related intraspecific lines. In this study, we developed SNP markers based on expressed sequence tags for the construction of intraspecific linkage maps in tomato. Out of the 5607 SNP positions detected through in silico analysis, 1536 were selected for high-throughput genotyping of two mapping populations derived from crosses between ‘Micro-Tom’ and either ‘Ailsa Craig’ or ‘M82’. A total of 1137 markers, including 793 out of the 1338 successfully genotyped SNPs, along with 344 simple sequence repeat and intronic polymorphism markers, were mapped onto two linkage maps, which covered 1467.8 and 1422.7 cM, respectively. The SNP markers developed were then screened against cultivated tomato lines in order to estimate the transferability of these SNPs to other breeding materials. The molecular markers and linkage maps represent a milestone in the genomics and genetics, and are the first step toward molecular breeding of cultivated tomato. Information on the DNA markers, linkage maps, and SNP genotypes for these tomato lines is available at http://www.kazusa.or.jp/tomato/. PMID:21044984

  10. Comparative transcriptomics uncovers alternative splicing and molecular marker development in radish (Raphanus sativus L.).

    PubMed

    Luo, Xiaobo; Xu, Liang; Liang, Dongyi; Wang, Yan; Zhang, Wei; Zhu, Xianwen; Zhu, Yuelin; Jiang, Haiyan; Tang, Mingjia; Liu, Liwang

    2017-07-03

    Alternative splicing (AS) plays important roles in gene expression and proteome diversity. Single nucleotide polymorphism (SNP) and insertion/deletion (InDel) are abundant polymorphisms and co-dominant inheritance markers, which have been widely used in germplasm identification, genetic mapping and marker-assisted selection in plants. So far, however, little information is available on utilization of AS events and development of SNP and InDel markers from transcriptome in radish. In this study, three radish transcriptome datasets were collected and aligned to the reference radish genome. A total of 56,530 AS events were identified from three radish genotypes with intron retention (IR) being the most frequent AS type, which accounted for 59.4% of the total expressed genes in radish. In all, 22,412 SNPs and 9436 InDels were identified with an average frequency of 1 SNP/17.9 kb and 1 InDel/42.5 kb, respectively. A total of 43,680 potential SSRs were identified in 31,604 assembled unigenes with a density of 1 SSR/2.5 kb. The ratio of SNPs with nonsynonymous/synonymous mutations was 1.05:1. Moreover, 35 SNPs and 200 InDels were randomly selected and validated by Sanger sequencing, 83.9% of the SNPs and 70% of the InDels exhibited polymorphism among these three genotypes. In addition, the 15 SNPs and 125 InDels were found to be unevenly distributed on 9 linkage groups. Furthermore, 40 informative InDel markers were successfully used for the genetic diversity analysis on 32 radish accessions. These results would not only provide new insights into transcriptome complexity and AS regulation, but also furnish large amount of molecular marker resources for germplasm identification, genetic mapping and further genetic improvement of radish in breeding programs.

  11. Genomic-assisted phylogenetic analysis and marker development for next generation soybean cyst nematode resistance breeding.

    PubMed

    Kadam, Suhas; Vuong, Tri D; Qiu, Dan; Meinhardt, Clinton G; Song, Li; Deshmukh, Rupesh; Patil, Gunvant; Wan, Jinrong; Valliyodan, Babu; Scaboo, Andrew M; Shannon, J Grover; Nguyen, Henry T

    2016-01-01

    Soybean cyst nematode (SCN, Heterodera glycines Ichinohe) is a serious soybean pest. The use of resistant cultivars is an effective approach for preventing yield loss. In this study, 19,652 publicly available soybean accessions that were previously genotyped with the SoySNP50K iSelect BeadChip were used to evaluate the phylogenetic diversity of SCN resistance genes Rhg1 and Rhg4 in an attempt to identify novel sources of resistance. The sequence information of soybean lines was utilized to develop KASPar (KBioscience Competitive Allele-Specific PCR) assays from single nucleotide polymorphisms (SNPs) of Rhg1, Rhg4, and other novel quantitative trait loci (QTL). These markers were used to genotype a diverse set of 95 soybean germplasm lines and three recombinant inbred line (RIL) populations. SNP markers from the Rhg1 gene were able to differentiate copy number variation (CNV), such as resistant-high copy (PI 88788-type), low copy (Peking-type), and susceptible-single copy (Williams 82) numbers. Similarly, markers for the Rhg4 gene were able to detect Peking-type (resistance) genotypes. The phylogenetic information of SCN resistance loci from a large set of soybean accessions and the gene/QTL specific markers that were developed in this study will accelerate SCN resistance breeding programs.

  12. Differentiation of Populus species using chloroplast single nucleotide polymorphism (SNP) markers--essential for comprehensible and reliable poplar breeding.

    PubMed

    Schroeder, H; Hoeltken, A M; Fladung, M

    2012-03-01

    Within the genus Populus several species belonging to different sections are cross-compatible. Hence, high numbers of interspecies hybrids occur naturally and, additionally, have been artificially produced in huge breeding programmes during the last 100 years. Therefore, determination of a single poplar species, used for the production of 'multi-species hybrids' is often difficult, and represents a great challenge for the use of molecular markers in species identification. Within this study, over 20 chloroplast regions, both intergenic spacers and coding regions, have been tested for their ability to differentiate different poplar species using 23 already published barcoding primer combinations and 17 newly designed primer combinations. About half of the published barcoding primers yielded amplification products, whereas the new primers designed on the basis of the total sequenced cpDNA genome of Populus trichocarpa Torr. & Gray yielded much higher amplification success. Intergenic spacers were found to be more variable than coding regions within the genus Populus. The highest discrimination power of Populus species was found in the combination of two intergenic spacers (trnG-psbK, psbK-psbl) and the coding region rpoC. In barcoding projects, the coding regions matK and rbcL are often recommended, but within the genus Populus they only show moderate variability and are not efficient in species discrimination.

  13. SNP variation with latitude: Analysis of the SNPforID 52-plex markers in north, mid-region and south Chilean populations.

    PubMed

    Moreno, F; Freire-Aradas, A; Phillips, C; Fondevila, M; Carracedo, Á; Lareu, M V

    2014-05-01

    Chile is a disproportionately long and narrow country defined by the southern Andes and Pacific coastline where a level of genetic sub-structure resulting from distances of several thousand kilometers might be expected across the most distantly separated regions. Although STR databases created for the Chilean Legal Medical Service indicate an absence of sub-structure, such a characteristic requires further exploration when introducing additional forensic markers. Notably, Single Nucleotide Polymorphisms (SNPs) have a much lower mutation rate than STRs and can show more stable distributions of genetic variation if population movement is restricted. In this study we evaluated 451 Chilean urban samples from the North, North-Central, Central, South-Central and South regions of Chile for the 52 SNPs of the SNPforID forensic identification panel to explore the underlying genetic structure of Chilean populations. Results reveal similar genetic distances between groups suggesting a single SNP database for the whole of Chile is appropriate. To further understand the genetic composition of Chilean populations that comprise the bulk of individuals with both European and Native American ancestries, ancestral membership proportions were evaluated and pairwise comparisons to other American populations were made. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  14. Genome-wide association mapping and Identification of candidate genes for fatty acid composition in Brassica napus L. using SNP markers.

    PubMed

    Qu, Cunmin; Jia, Ledong; Fu, Fuyou; Zhao, Huiyan; Lu, Kun; Wei, Lijuan; Xu, Xinfu; Liang, Ying; Li, Shimeng; Wang, Rui; Li, Jiana

    2017-03-14

    B. napus (oilseed) is an important source of edible vegetable oil, and its nutritional and economic value is determined by its fatty acid composition and content. Using the Brassica 60 K SNP array, we performed a genome-wide association study of fatty acid composition in a population of 520 genetically diverse oilseed accessions. Using the PCA + K model in TASSEL 5.2.1, we identified 62 genomic regions that were significantly associated with the composition of seven fatty acids, and five consensus regions that mapped to the A2, A8, A9, C1, and C3 chromosomes, respectively, of the Brassica napus Darmor-bzh genome. We then identified 24 orthologs of the functional candidate genes involved in fatty acid biosynthesis, excluding BnaA.FAE1 and BnaC.FAE1 on the A8 and C3 homologous genome blocks, which are known to have critical roles in the fatty acid biosynthesis pathway, and potential orthologs of these genes (e.g., LACS9, KCR1, FAB1, LPAT4, KCS17, CER4, TT16, and ACBP5). Our results demonstrate the power of association mapping in identifying genes of interest in B. napus and provide insight into the genetic basis of fatty acid biosynthesis in B. napus. Furthermore, our findings may facilitate marker-based breeding efforts aimed at improving fatty acid composition and quality in B. napus.

  15. Genetic Diversity, Population Structure, and Linkage Disequilibrium of an Association-Mapping Panel Revealed by Genome-Wide SNP Markers in Sesame

    PubMed Central

    Cui, Chengqi; Mei, Hongxian; Liu, Yanyang; Zhang, Haiyang; Zheng, Yongzhan

    2017-01-01

    The characterization of genetic diversity and population structure can be used in tandem to detect reliable phenotype–genotype associations. In the present study, we genotyped a set of 366 sesame germplasm accessions by using 89,924 single-nucleotide polymorphisms (SNPs). The number of SNPs on each chromosome was consistent with the physical length of the respective chromosome, and the average marker density was approximately 2.67 kb/SNP. The genetic diversity analysis showed that the average nucleotide diversity of the panel was 1.1 × 10-3, with averages of 1.0 × 10-4, 2.7 × 10-4, and 3.6 × 10-4 obtained, respectively for three identified subgroups of the panel: Pop 1, Pop 2, and the Mixed. The genetic structure analysis revealed that these sesame germplasm accessions were structured primarily along the basis of their geographic collection, and that an extensive admixture occurred in the panel. The genome-wide linkage disequilibrium (LD) analysis showed that an average LD extended up to ∼99 kb. The genetic diversity and population structure revealed in this study should provide guidance to the future design of association studies and the systematic utilization of the genetic variation characterizing the sesame panel. PMID:28729877

  16. Genetic Diversity, Population Structure, and Linkage Disequilibrium of an Association-Mapping Panel Revealed by Genome-Wide SNP Markers in Sesame.

    PubMed

    Cui, Chengqi; Mei, Hongxian; Liu, Yanyang; Zhang, Haiyang; Zheng, Yongzhan

    2017-01-01

    The characterization of genetic diversity and population structure can be used in tandem to detect reliable phenotype-genotype associations. In the present study, we genotyped a set of 366 sesame germplasm accessions by using 89,924 single-nucleotide polymorphisms (SNPs). The number of SNPs on each chromosome was consistent with the physical length of the respective chromosome, and the average marker density was approximately 2.67 kb/SNP. The genetic diversity analysis showed that the average nucleotide diversity of the panel was 1.1 × 10(-3), with averages of 1.0 × 10(-4), 2.7 × 10(-4), and 3.6 × 10(-4) obtained, respectively for three identified subgroups of the panel: Pop 1, Pop 2, and the Mixed. The genetic structure analysis revealed that these sesame germplasm accessions were structured primarily along the basis of their geographic collection, and that an extensive admixture occurred in the panel. The genome-wide linkage disequilibrium (LD) analysis showed that an average LD extended up to ∼99 kb. The genetic diversity and population structure revealed in this study should provide guidance to the future design of association studies and the systematic utilization of the genetic variation characterizing the sesame panel.

  17. Mass production of SNP markers in a nonmodel passerine bird through RAD sequencing and contig mapping to the zebra finch genome.

    PubMed

    Bourgeois, Yann X C; Lhuillier, Emeline; Cézard, Timothée; Bertrand, Joris A M; Delahaie, Boris; Cornuault, Josselin; Duval, Thomas; Bouchez, Olivier; Milá, Borja; Thébaud, Christophe

    2013-09-01

    Here, we present an adaptation of restriction-site-associated DNA sequencing (RAD-seq) to the Illumina HiSeq2000 technology that we used to produce SNP markers in very large quantities at low cost per unit in the Réunion grey white-eye (Zosterops borbonicus), a nonmodel passerine bird species with no reference genome. We sequenced a set of six pools of 18-25 individuals using a single sequencing lane. This allowed us to build around 600 000 contigs, among which at least 386 000 could be mapped to the zebra finch (Taeniopygia guttata) genome. This yielded more than 80 000 SNPs that could be mapped unambiguously and are evenly distributed across the genome. Thus, our approach provides a good illustration of the high potential of paired-end RAD sequencing of pooled DNA samples combined with comparative assembly to the zebra finch genome to build large contigs and characterize vast numbers of informative SNPs in nonmodel passerine bird species in a very efficient and cost-effective way. © 2013 John Wiley & Sons Ltd.

  18. New softwares for automated microsatellite marker development

    PubMed Central

    Martins, Wellington; de Sousa, Daniel; Proite, Karina; Guimarães, Patrícia; Moretzsohn, Marcio; Bertioli, David

    2006-01-01

    Microsatellites are repeated small sequence motifs that are highly polymorphic and abundant in the genomes of eukaryotes. Often they are the molecular markers of choice. To aid the development of microsatellite markers we have developed a module that integrates a program for the detection of microsatellites (TROLL), with the sequence assembly and analysis software, the Staden Package. The module has easily adjustable parameters for microsatellite lengths and base pair quality control. Starting with large datasets of unassembled sequence data in the form of chromatograms and/or text data, it enables the creation of a compact database consisting of the processed and assembled microsatellite containing sequences. For the final phase of primer design, we developed a program that accepts the multi-sequence ‘experiment file’ format as input and produces a list of primer pairs for amplification of microsatellite markers. The program can take into account the quality values of consensus bases, improving success rate of primer pairs in PCR. The software is freely available and simple to install in both Windows and Unix-based operating systems. Here we demonstrate the software by developing primer pairs for 427 new candidate markers for peanut. PMID:16493138

  19. New softwares for automated microsatellite marker development.

    PubMed

    Martins, Wellington; de Sousa, Daniel; Proite, Karina; Guimarães, Patrícia; Moretzsohn, Marcio; Bertioli, David

    2006-02-21

    Microsatellites are repeated small sequence motifs that are highly polymorphic and abundant in the genomes of eukaryotes. Often they are the molecular markers of choice. To aid the development of microsatellite markers we have developed a module that integrates a program for the detection of microsatellites (TROLL), with the sequence assembly and analysis software, the Staden Package. The module has easily adjustable parameters for microsatellite lengths and base pair quality control. Starting with large datasets of unassembled sequence data in the form of chromatograms and/or text data, it enables the creation of a compact database consisting of the processed and assembled microsatellite containing sequences. For the final phase of primer design, we developed a program that accepts the multi-sequence 'experiment file' format as input and produces a list of primer pairs for amplification of microsatellite markers. The program can take into account the quality values of consensus bases, improving success rate of primer pairs in PCR. The software is freely available and simple to install in both Windows and Unix-based operating systems. Here we demonstrate the software by developing primer pairs for 427 new candidate markers for peanut.

  20. Prediction of heterosis using genome-wide SNP-marker data: application to egg production traits in white Leghorn crosses.

    PubMed

    Amuzu-Aweh, E N; Bijma, P; Kinghorn, B P; Vereijken, A; Visscher, J; van Arendonk, J Am; Bovenhuis, H

    2013-12-01

    Prediction of heterosis has a long history with mixed success, partly due to low numbers of genetic markers and/or small data sets. We investigated the prediction of heterosis for egg number, egg weight and survival days in domestic white Leghorns, using ∼400 000 individuals from 47 crosses and allele frequencies on ∼53 000 genome-wide single nucleotide polymorphisms (SNPs). When heterosis is due to dominance, and dominance effects are independent of allele frequencies, heterosis is proportional to the squared difference in allele frequency (SDAF) between parental pure lines (not necessarily homozygous). Under these assumptions, a linear model including regression on SDAF partitions crossbred phenotypes into pure-line values and heterosis, even without pure-line phenotypes. We therefore used models where phenotypes of crossbreds were regressed on the SDAF between parental lines. Accuracy of prediction was determined using leave-one-out cross-validation. SDAF predicted heterosis for egg number and weight with an accuracy of ∼0.5, but did not predict heterosis for survival days. Heterosis predictions allowed preselection of pure lines before field-testing, saving ∼50% of field-testing cost with only 4% loss in heterosis. Accuracies from cross-validation were lower than from the model-fit, suggesting that accuracies previously reported in literature are overestimated. Cross-validation also indicated that dominance cannot fully explain heterosis. Nevertheless, the dominance model had considerable accuracy, clearly greater than that of a general/specific combining ability model. This work also showed that heterosis can be modelled even when pure-line phenotypes are unavailable. We concluded that SDAF is a useful predictor of heterosis in commercial layer breeding.

  1. Candidate SNP Markers of Gender-Biased Autoimmune Complications of Monogenic Diseases Are Predicted by a Significant Change in the Affinity of TATA-Binding Protein for Human Gene Promoters

    PubMed Central

    Ponomarenko, Mikhail P.; Arkova, Olga; Rasskazov, Dmitry; Ponomarenko, Petr; Savinkova, Ludmila; Kolchanov, Nikolay

    2016-01-01

    Some variations of human genome [for example, single nucleotide polymorphisms (SNPs)] are markers of hereditary diseases and drug responses. Analysis of them can help to improve treatment. Computer-based analysis of millions of SNPs in the 1000 Genomes project makes a search for SNP markers more targeted. Here, we combined two computer-based approaches: DNA sequence analysis and keyword search in databases. In the binding sites for TATA-binding protein (TBP) in human gene promoters, we found candidate SNP markers of gender-biased autoimmune diseases, including rs1143627 [cachexia in rheumatoid arthritis (double prevalence among women)]; rs11557611 [demyelinating diseases (thrice more prevalent among young white women than among non-white individuals)]; rs17231520 and rs569033466 [both: atherosclerosis comorbid with related diseases (double prevalence among women)]; rs563763767 [Hughes syndrome-related thrombosis (lethal during pregnancy)]; rs2814778 [autoimmune diseases (excluding multiple sclerosis and rheumatoid arthritis) underlying hypergammaglobulinemia in women]; rs72661131 and rs562962093 (both: preterm delivery in pregnant diabetic women); and rs35518301, rs34166473, rs34500389, rs33981098, rs33980857, rs397509430, rs34598529, rs33931746, rs281864525, and rs63750953 (all: autoimmune diseases underlying hypergammaglobulinemia in women). Validation of these predicted candidate SNP markers using the clinical standards may advance personalized medicine. PMID:27092142

  2. Candidate SNP Markers of Gender-Biased Autoimmune Complications of Monogenic Diseases Are Predicted by a Significant Change in the Affinity of TATA-Binding Protein for Human Gene Promoters.

    PubMed

    Ponomarenko, Mikhail P; Arkova, Olga; Rasskazov, Dmitry; Ponomarenko, Petr; Savinkova, Ludmila; Kolchanov, Nikolay

    2016-01-01

    Some variations of human genome [for example, single nucleotide polymorphisms (SNPs)] are markers of hereditary diseases and drug responses. Analysis of them can help to improve treatment. Computer-based analysis of millions of SNPs in the 1000 Genomes project makes a search for SNP markers more targeted. Here, we combined two computer-based approaches: DNA sequence analysis and keyword search in databases. In the binding sites for TATA-binding protein (TBP) in human gene promoters, we found candidate SNP markers of gender-biased autoimmune diseases, including rs1143627 [cachexia in rheumatoid arthritis (double prevalence among women)]; rs11557611 [demyelinating diseases (thrice more prevalent among young white women than among non-white individuals)]; rs17231520 and rs569033466 [both: atherosclerosis comorbid with related diseases (double prevalence among women)]; rs563763767 [Hughes syndrome-related thrombosis (lethal during pregnancy)]; rs2814778 [autoimmune diseases (excluding multiple sclerosis and rheumatoid arthritis) underlying hypergammaglobulinemia in women]; rs72661131 and rs562962093 (both: preterm delivery in pregnant diabetic women); and rs35518301, rs34166473, rs34500389, rs33981098, rs33980857, rs397509430, rs34598529, rs33931746, rs281864525, and rs63750953 (all: autoimmune diseases underlying hypergammaglobulinemia in women). Validation of these predicted candidate SNP markers using the clinical standards may advance personalized medicine.

  3. Developing Exon-Primed Intron-Crossing (EPIC) markers for population genetic studies in three Aedes disease vectors.

    PubMed

    White, Vanessa Linley; Endersby, Nancy Margaret; Chan, Janice; Hoffmann, Ary Anthony; Weeks, Andrew Raymond

    2015-03-01

    Aedes aegypti, Aedes notoscriptus, and Aedes albopictus are important vectors of many arboviruses implicated in human disease such as dengue fever. Genetic markers applied across vector species can provide important information on population structure, gene flow, insecticide resistance, and taxonomy, however, robust microsatellite markers have proven difficult to develop in these species and mosquitoes generally. Here we consider the utility and transferability of 15 Ribosome protein (Rp) Exon-Primed Intron-Crossing (EPIC) markers for population genetic studies in these 3 Aedes species. Rp EPIC markers designed for Ae. aegypti also successfully amplified populations of the sister species, Ae. albopictus, as well as the distantly related species, Ae. notoscriptus. High SNP and good indel diversity in sequenced alleles plus support for amplification of the same regions across populations and species were additional benefits of these markers. These findings point to the general value of EPIC markers in mosquito population studies.

  4. 1 + 1 = 3: Development and validation of a SNP-based algorithm to identify genetic contributions from three distinct inbred mouse strains.

    PubMed

    Gorham, James D; Ranson, Matthew S; Smith, Janebeth C; Gorham, Beverly J; Muirhead, Kristen-Ashley

    2012-12-01

    State-of-the-art, genome-wide assessment of mouse genetic background uses single nucleotide polymorphism (SNP) PCR. As SNP analysis can use multiplex testing, it is amenable to high-throughput analysis and is the preferred method for shared resource facilities that offer genetic background assessment of mouse genomes. However, a typical individual SNP query yields only two alleles (A vs. B), limiting the application of this methodology to distinguishing contributions from no more than two inbred mouse strains. By contrast, simple sequence length polymorphism (SSLP) analysis yields multiple alleles but is not amenable to high-throughput testing. We sought to devise a SNP-based technique to identify donor strain origins when three distinct mouse strains potentially contribute to the genetic makeup of an individual mouse. A computational approach was used to devise a three-strain analysis (3SA) algorithm that would permit identification of three genetic backgrounds while still using a binary-output SNP platform. A panel of 15 mosaic mice with contributions from BALB/c, C57Bl/6, and DBA/2 genetic backgrounds was bred and analyzed using a genome-wide SNP panel using 1449 markers. The 3SA algorithm was applied and then validated using SSLP. The 3SA algorithm assigned 85% of 1449 SNPs as informative for the C57Bl/6, BALB/c, or DBA/2 backgrounds, respectively. Testing the panel of 15 F2 mice, the 3SA algorithm predicted donor strain origins genome-wide. Donor strain origins predicted by the 3SA algorithm correlated perfectly with results from individual SSLP markers located on five different chromosomes (n=70 tests). We have established and validated an analysis algorithm based on binary SNP data that can successfully identify the donor strain origins of chromosomal regions in mice that are bred from three distinct inbred mouse strains.

  5. Development of gene-based markers and construction of an integrated linkage map in eggplant by using Solanum orthologous (SOL) gene sets.

    PubMed

    Fukuoka, Hiroyuki; Miyatake, Koji; Nunome, Tsukasa; Negoro, Satomi; Shirasawa, Kenta; Isobe, Sachiko; Asamizu, Erika; Yamaguchi, Hirotaka; Ohyama, Akio

    2012-06-01

    We constructed an integrated DNA marker linkage map of eggplant (Solanum melongena L.) using DNA marker segregation data sets obtained from two independent intraspecific F(2) populations. The linkage map consisted of 12 linkage groups and encompassed 1,285.5 cM in total. We mapped 952 DNA markers, including 313 genomic SSR markers developed by random sequencing of simple sequence repeat (SSR)-enriched genomic libraries, and 623 single-nucleotide polymorphisms (SNP) and insertion/deletion polymorphisms (InDels) found in eggplant-expressed sequence tags (ESTs) and related genomic sequences [introns and untranslated regions (UTRs)]. Because of their co-dominant inheritance and their highly polymorphic and multi-allelic nature, the SSR markers may be more versatile than the SNP and InDel markers for map-based genetic analysis of any traits of interest using segregating populations derived from any intraspecific crosses of practical breeding materials. However, we found that the distribution of microsatellites in the genome was biased to some extent, and therefore a considerable part of the eggplant genome was first detected when gene-derived SNP and InDel markers were mapped. Of the 623 SNP and InDel markers mapped onto the eggplant integrated map, 469 were derived from eggplant unigenes contained within Solanum orthologous (SOL) gene sets (i.e., sets of orthologous unigenes from eggplant, tomato, and potato). Out of the 469 markers, 326 could also be mapped onto the tomato map. These common markers will be informative landmarks for the transfer of tomato's more saturated genomic information to eggplant and will also provide comparative information on the genome organization of the two solanaceous species. The data are available from the DNA marker database of vegetables, VegMarks (http://vegmarks.nivot.affrc.go.jp).

  6. Transferring automation for large-scale development and production of Invader SNP assays

    NASA Astrophysics Data System (ADS)

    Neri, Bruce P.; Ganske, R.; Isaczyszyn, W.; Beaty, Edward L.

    2000-03-01

    The Human Genome Project has led to the discovery of hundreds of thousands of single nucleotide polymorphisms (SNPs). SNPs can act as genetic markers to create high- density maps of the human genome for large-scale genetic analysis for evaluating links between genetic mutations and human diseases and for performing association studies. To create those maps, assays capable of detecting many different SNPs must be developed rapidly, as additional SNPs are discovered. When both the design of and the technology used in the assays can be partially or fully automated, the development process and the time to results can be accomplished quickly and efficiently. InvaderTM technology offers a highly sensitive signal amplification system that detects and quantifies mutations and SNPs from unamplified human genomic DNA in two sequential steps.

  7. Development and use of genic molecular markers (GMMs) for construction of a transcript map of chickpea (Cicer arietinum L.).

    PubMed

    Gujaria, Neha; Kumar, Ashish; Dauthal, Preeti; Dubey, Anuja; Hiremath, Pavana; Bhanu Prakash, A; Farmer, Andrew; Bhide, Mangla; Shah, Trushar; Gaur, Pooran M; Upadhyaya, Hari D; Bhatia, Sabhyata; Cook, Douglas R; May, Greg D; Varshney, Rajeev K

    2011-05-01

    A transcript map has been constructed by the development and integration of genic molecular markers (GMMs) including single nucleotide polymorphism (SNP), genic microsatellite or simple sequence repeat (SSR) and intron spanning region (ISR)-based markers, on an inter-specific mapping population of chickpea, the third food legume crop of the world and the first food legume crop of India. For SNP discovery through allele re-sequencing, primer pairs were designed for 688 genes/expressed sequence tags (ESTs) of chickpea and 657 genes/ESTs of closely related species of chickpea. High-quality sequence data obtained for 220 candidate genic regions on 2-20 genotypes representing 9 Cicer species provided 1,893 SNPs with an average frequency of 1/35.83 bp and 0.34 PIC (polymorphism information content) value. On an average 2.9 haplotypes were present in 220 candidate genic regions with an average haplotype diversity of 0.6326. SNP2CAPS analysis of 220 sequence alignments, as mentioned above, provided a total of 192 CAPS candidates. Experimental analysis of these 192 CAPS candidates together with 87 CAPS candidates identified earlier through in silico mining of ESTs provided scorable amplification in 173 (62.01%) cases of which predicted assays were validated in 143 (82.66%) cases (CGMM). Alignments of chickpea unigenes with Medicago truncatula genome were used to develop 121 intron spanning region (CISR) markers of which 87 yielded scorable products. In addition, optimization of 77 EST-derived SSR (ICCeM) markers provided 51 scorable markers. Screening of easily assayable 281 markers including 143 CGMMs, 87 CISRs and 51 ICCeMs on 5 parental genotypes of three mapping populations identified 104 polymorphic markers including 90 markers on the inter-specific mapping population. Sixty-two of these GMMs together with 218 earlier published markers (including 64 GMM loci) and 20 other unpublished markers could be integrated into this genetic map. A genetic map developed here

  8. Development of a rapid SNP-typing assay to differentiate Bifidobacterium animalis ssp. lactis strains used in probiotic-supplemented dairy products.

    PubMed

    Lomonaco, Sara; Furumoto, Emily J; Loquasto, Joseph R; Morra, Patrizia; Grassi, Ausilia; Roberts, Robert F

    2015-02-01

    Identification at the genus, species, and strain levels is desirable when a probiotic microorganism is added to foods. Strains of Bifidobacterium animalis ssp. lactis (BAL) are commonly used worldwide in dairy products supplemented with probiotic strains. However, strain discrimination is difficult because of the high degree of genome identity (99.975%) between different genomes of this subspecies. Typing of monomorphic species can be carried out efficiently by targeting informative single nucleotide polymorphisms (SNP). Findings from a previous study analyzing both reference and commercial strains of BAL identified SNP that could be used to discriminate common strains into 8 groups. This paper describes development of a minisequencing assay based on the primer extension reaction (PER) targeting multiple SNP that can allow strain differentiation of BAL. Based on previous data, 6 informative SNP were selected for further testing, and a multiplex preliminary PCR was optimized to amplify the DNA regions containing the selected SNP. Extension primers (EP) annealing immediately adjacent to the selected SNP were developed and tested in simplex and multiplex PER to evaluate their performance. Twenty-five strains belonging to 9 distinct genomic clusters of B. animalis ssp. lactis were selected and analyzed using the developed minisequencing assay, simultaneously targeting the 6 selected SNP. Fragment analysis was subsequently carried out in duplicate and demonstrated that the assay yielded 8 specific profiles separating the most commonly used commercial strains. This novel multiplex PER approach provides a simple, rapid, flexible SNP-based subtyping method for proper characterization and identification of commercial probiotic strains of BAL from fermented dairy products. To assess the usefulness of this method, DNA was extracted from yogurt manufactured with and without the addition of B. animalis ssp. lactis BB-12. Extracted DNA was then subjected to the minisequencing

  9. Development and Evaluation of SoySNP50K, a High-Density Genotyping Array for Soybean

    PubMed Central

    Song, Qijian; Hyten, David L.; Jia, Gaofeng; Quigley, Charles V.; Fickus, Edward W.; Nelson, Randall L.; Cregan, Perry B.

    2013-01-01

    The objective of this research was to identify single nucleotide polymorphisms (SNPs) and to develop an Illumina Infinium BeadChip that contained over 50,000 SNPs from soybean (Glycine max L. Merr.). A total of 498,921,777 reads 35–45bp in length were obtained from DNA sequence analysis of reduced representation libraries from several soybean accessions which included six cultivated and two wild soybean (G. soja Sieb. et Zucc.) genotypes. These reads were mapped to the soybean whole genome sequence and 209,903 SNPs were identified. After applying several filters, a total of 146,161 of the 209,903 SNPs were determined to be ideal candidates for Illumina Infinium II BeadChip design. To equalize the distance between selected SNPs, increase assay success rate, and minimize the number of SNPs with low minor allele frequency, an iteration algorithm based on a selection index was developed and used to select 60,800 SNPs for Infinium BeadChip design. Of the 60,800 SNPs, 50,701 were targeted to euchromatic regions and 10,000 to heterochromatic regions of the 20 soybean chromosomes. In addition, 99 SNPs were targeted to unanchored sequence scaffolds. Of the 60,800 SNPs, a total of 52,041 passed Illumina’s manufacturing phase to produce the SoySNP50K iSelect BeadChip. Validation of the SoySNP50K chip with 96 landrace genotypes, 96 elite cultivars and 96 wild soybean accessions showed that 47,337 SNPs were polymorphic and generated successful SNP allele calls. In addition, 40,841 of the 47,337 SNPs (86%) had minor allele frequencies ≥10% among the landraces, elite cultivars and the wild soybean accessions. A total of 620 and 42 candidate regions which may be associated with domestication and recent selection were identified, respectively. The SoySNP50K iSelect SNP beadchip will be a powerful tool for characterizing soybean genetic diversity and linkage disequilibrium, and for constructing high resolution linkage maps to improve the soybean whole genome sequence assembly

  10. Developing single nucleotide polymorphism markers for the identification of pineapple (Ananas comosus) germplasm

    PubMed Central

    Zhou, Lin; Matsumoto, Tracie; Tan, Hua-Wei; Meinhardt, Lyndel W; Mischke, Sue; Wang, Boyi; Zhang, Dapeng

    2015-01-01

    Pineapple (Ananas comosus [L.] Merr.) is the third most important tropical fruit in the world after banana and mango. As a crop with vegetative propagation, genetic redundancy is a major challenge for efficient genebank management and in breeding. Using expressed sequence tag and nucleotide sequences from public databases, we developed 213 single nucleotide polymorphism (SNP) markers and validated 96 SNPs by genotyping the United States Department of Agriculture - Agricultural Research Service pineapple germplasm collection, maintained in Hilo, Hawaii. The validation resulted in designation of a set of 57 polymorphic SNP markers that revealed a high rate of duplicates in this pineapple collection. Twenty-four groups of duplicates were detected, encompassing 130 of the total 170 A cosmos accessions. The results show that somatic mutation has been the main source of intra-cultivar variations in pineapple. Multivariate clustering and a model-based population stratification suggest that the modern pineapple cultivars are comprised of progenies that are derived from different wild Ananas botanical varieties. Parentage analysis further revealed that both A. comosus var. bracteatus and A. comosus var. ananassoides are likely progenitors of pineapple cultivars. However, the traditional classification of cultivated pineapple into horticultural groups (e.g. ‘Cayenne’, ‘Spanish’, ‘Queen’) was not well supported by the present study. These SNP markers provide robust and universally comparable DNA fingerprints; thus, they can serve as an efficient genotyping tool to assist pineapple germplasm management, propagation of planting material, and pineapple cultivar protection. The high rate of genetic redundancy detected in this pineapple collection suggests the potential impact of applying this technology on other clonally propagated perennial crops. PMID:26640697

  11. Developing single nucleotide polymorphism markers for the identification of pineapple (Ananas comosus) germplasm.

    PubMed

    Zhou, Lin; Matsumoto, Tracie; Tan, Hua-Wei; Meinhardt, Lyndel W; Mischke, Sue; Wang, Boyi; Zhang, Dapeng

    2015-01-01

    Pineapple (Ananas comosus [L.] Merr.) is the third most important tropical fruit in the world after banana and mango. As a crop with vegetative propagation, genetic redundancy is a major challenge for efficient genebank management and in breeding. Using expressed sequence tag and nucleotide sequences from public databases, we developed 213 single nucleotide polymorphism (SNP) markers and validated 96 SNPs by genotyping the United States Department of Agriculture - Agricultural Research Service pineapple germplasm collection, maintained in Hilo, Hawaii. The validation resulted in designation of a set of 57 polymorphic SNP markers that revealed a high rate of duplicates in this pineapple collection. Twenty-four groups of duplicates were detected, encompassing 130 of the total 170 A cosmos accessions. The results show that somatic mutation has been the main source of intra-cultivar variations in pineapple. Multivariate clustering and a model-based population stratification suggest that the modern pineapple cultivars are comprised of progenies that are derived from different wild Ananas botanical varieties. Parentage analysis further revealed that both A. comosus var. bracteatus and A. comosus var. ananassoides are likely progenitors of pineapple cultivars. However, the traditional classification of cultivated pineapple into horticultural groups (e.g. 'Cayenne', 'Spanish', 'Queen') was not well supported by the present study. These SNP markers provide robust and universally comparable DNA fingerprints; thus, they can serve as an efficient genotyping tool to assist pineapple germplasm management, propagation of planting material, and pineapple cultivar protection. The high rate of genetic redundancy detected in this pineapple collection suggests the potential impact of applying this technology on other clonally propagated perennial crops.

  12. Development of a Traceability System Based on a SNP Array for Large-Scale Production of High-Value White Spruce (Picea glauca)

    PubMed Central

    Godbout, Julie; Tremblay, Laurence; Levasseur, Caroline; Lavigne, Patricia; Rainville, André; Mackay, John; Bousquet, Jean; Isabel, Nathalie

    2017-01-01

    Biological material is at the forefront of research programs, as well as application fields such as breeding, aquaculture, and reforestation. While sophisticated techniques are used to produce this material, all too often, there is no strict monitoring during the “production” process to ensure that the specific varieties are the expected ones. Confidence rather than evidence is often applied when the time comes to start a new experiment or to deploy selected varieties in the field. During the last decade, genomics research has led to the development of important resources, which have created opportunities for easily developing tools to assess the conformity of the material along the production chains. In this study, we present a simple methodology that enables the development of a traceability system which, is in fact a by-product of previous genomic projects. The plant production system in white spruce (Picea glauca) is used to illustrate our purpose. In Quebec, one of the favored strategies to produce elite varieties is to use somatic embryogenesis (SE). In order to detect human errors both upstream and downstream of the white spruce production process, this project had two main objectives: (i) to develop methods that make it possible to trace the origin of plants produced, and (ii) to generate a unique genetic fingerprint that could be used to differentiate each embryogenic cell line and ensure its genetic monitoring. Such a system had to rely on a minimum number of low-cost DNA markers and be easy to use by non-specialists. An efficient marker selection process was operationalized by testing different classification methods on simulated datasets. These datasets were generated using in-house bioinformatics tools that simulated crosses involved in the breeding program for which genotypes from hundreds of SNP markers were already available. The rate of misidentification was estimated and various sources of mishandling or contamination were identified. The

  13. Developing Single Nucleotide Polymorphism (SNP) markers for the identification of pineapple (Ananas comosus) germplasm

    USDA-ARS?s Scientific Manuscript database

    Pineapple (Ananas comosus [L.] Merr.) is the third most important tropical fruit in the world after banana and mango and a major agricultural commodity in Hawaii. As a crop with vegetative propagation, genetic redundancy is a major challenge for efficient genebank management and in breeding. Using E...

  14. Putting the cacao genome to work: Development and utilization of Theobroma cacao SNP markers

    USDA-ARS?s Scientific Manuscript database

    Next Generation Sequencing technology is driving the sequencing and assembly of whole genomes at an ever increasing rate. With the release of the Theobroma cacao genome sequence, vast amounts of data are currently available to researchers worldwide, however mining this data to provide cacao breeder...

  15. Identification of mitochondrial DNA sequence variation and development of single nucleotide polymorphic markers for CMS-D8 in cotton.

    PubMed

    Suzuki, Hideaki; Yu, Jiwen; Wang, Fei; Zhang, Jinfa

    2013-06-01

    Cytoplasmic male sterility (CMS), which is a maternally inherited trait and controlled by novel chimeric genes in the mitochondrial genome, plays a pivotal role in the production of hybrid seed. In cotton, no PCR-based marker has been developed to discriminate CMS-D8 (from Gossypium trilobum) from its normal Upland cotton (AD1, Gossypium hirsutum) cytoplasm. The objective of the current study was to develop PCR-based single nucleotide polymorphic (SNP) markers from mitochondrial genes for the CMS-D8 cytoplasm. DNA sequence variation in mitochondrial genes involved in the oxidative phosphorylation chain including ATP synthase subunit 1, 4, 6, 8 and 9, and cytochrome c oxidase 1, 2 and 3 subunits were identified by comparing CMS-D8, its isogenic maintainer and restorer lines on the same nuclear genetic background. An allelic specific PCR (AS-PCR) was utilized for SNP typing by incorporating artificial mismatched nucleotides into the third or fourth base from the 3' terminus in both the specific and nonspecific primers. The result indicated that the method modifying allele-specific primers was successful in obtaining eight SNP markers out of eight SNPs using eight primer pairs to discriminate two alleles between AD1 and CMS-D8 cytoplasms. Two of the SNPs for atp1 and cox1 could also be used in combination to discriminate between CMS-D8 and CMS-D2 cytoplasms. Additionally, a PCR-based marker from a nine nucleotide insertion-deletion (InDel) sequence (AATTGTTTT) at the 59-67 bp positions from the start codon of atp6, which is present in the CMS and restorer lines with the D8 cytoplasm but absent in the maintainer line with the AD1 cytoplasm, was also developed. A SNP marker for two nucleotide substitutions (AA in AD1 cytoplasm to CT in CMS-D8 cytoplasm) in the intron (1,506 bp) of cox2 gene was also developed. These PCR-based SNP markers should be useful in discriminating CMS-D8 and AD1 cytoplasms, or those with CMS-D2 cytoplasm as a rapid, simple, inexpensive, and

  16. Development of two major resources for pea genomics: the GenoPea 13.2K SNP Array and a high-density, high-resolution consensus genetic map.

    PubMed

    Tayeh, Nadim; Aluome, Christelle; Falque, Matthieu; Jacquin, Françoise; Klein, Anthony; Chauveau, Aurélie; Bérard, Aurélie; Houtin, Hervé; Rond, Céline; Kreplak, Jonathan; Boucherot, Karen; Martin, Chantal; Baranger, Alain; Pilet-Nayel, Marie-Laure; Warkentin, Thomas D; Brunel, Dominique; Marget, Pascal; Le Paslier, Marie-Christine; Aubert, Grégoire; Burstin, Judith

    2015-12-01

    Single nucleotide polymorphism (SNP) arrays represent important genotyping tools for innovative strategies in both basic research and applied breeding. Pea is an important food, feed and sustainable crop with a large (about 4.45 Gbp) but not yet available genome sequence. In the present study, 12 pea recombinant inbred line populations were genotyped using the newly developed GenoPea 13.2K SNP Array. Individual and consensus genetic maps were built providing insights into the structure and organization of the pea genome. Largely collinear genetic maps of 3918-8503 SNPs were obtained from all mapping populations, and only two of these exhibited putative chromosomal rearrangement signatures. Similar distortion patterns in different populations were noted. A total of 12 802 transcript-derived SNP markers placed on a 15 079-marker high-density, high-resolution consensus map allowed the identification of ohnologue-rich regions within the pea genome and the localization of local duplicates. Dense syntenic networks with sequenced legume genomes were further established, paving the way for the identification of the molecular bases of important agronomic traits segregating in the mapping populations. The information gained on the structure and organization of the genome from this research will undoubtedly contribute to the understanding of the evolution of the pea genome and to its assembly. The GenoPea 13.2K SNP Array and individual and consensus genetic maps are valuable genomic tools for plant scientists to strengthen pea as a model for genetics and physiology and enhance breeding. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.

  17. SNP discovery in complex allotetraploid genomes (Gossypium spp., Malvaceae) using genotyping by sequencing

    USDA-ARS?s Scientific Manuscript database

    Dramatic decreases in the cost of DNA sequencing have enabled the development of very large numbers of markers based on single nucleotide polymorphism (SNP) for phylogenetic studies, population genetics, linkage mapping, marker-assisted breeding and other applications. Using Illumina next-generatio...

  18. The coding region of the UFGT gene is a source of diagnostic SNP markers that allow single-locus DNA genotyping for the assessment of cultivar identity and ancestry in grapevine (Vitis vinifera L.)

    PubMed Central

    2013-01-01

    Background Vitis vinifera L. is one of society’s most important agricultural crops with a broad genetic variability. The difficulty in recognizing grapevine genotypes based on ampelographic traits and secondary metabolites prompted the development of molecular markers suitable for achieving variety genetic identification. Findings Here, we propose a comparison between a multi-locus barcoding approach based on six chloroplast markers and a single-copy nuclear gene sequencing method using five coding regions combined with a character-based system with the aim of reconstructing cultivar-specific haplotypes and genotypes to be exploited for the molecular characterization of 157 V. vinifera accessions. The analysis of the chloroplast target regions proved the inadequacy of the DNA barcoding approach at the subspecies level, and hence further DNA genotyping analyses were targeted on the sequences of five nuclear single-copy genes amplified across all of the accessions. The sequencing of the coding region of the UFGT nuclear gene (UDP-glucose: flavonoid 3-0-glucosyltransferase, the key enzyme for the accumulation of anthocyanins in berry skins) enabled the discovery of discriminant SNPs (1/34 bp) and the reconstruction of 130 V. vinifera distinct genotypes. Most of the genotypes proved to be cultivar-specific, and only few genotypes were shared by more, although strictly related, cultivars. Conclusion On the whole, this technique was successful for inferring SNP-based genotypes of grapevine accessions suitable for assessing the genetic identity and ancestry of international cultivars and also useful for corroborating some hypotheses regarding the origin of local varieties, suggesting several issues of misidentification (synonymy/homonymy). PMID:24298902

  19. Development of a Fluidigm SNP panel for genetic analysis in rainbow trout

    USDA-ARS?s Scientific Manuscript database

    Although microsatellite markers have been widely used in aquaculture species for genetic analysis such as parentage assignment and genetic mapping, SNPs (single nucleotide polymorphism) are the marker of choice as they are highly abundant and are amenable for high throughput genotyping. Recently we ...

  20. Model SNP development based on the complex oat genome using high-throughput 454 sequencing technology

    USDA-ARS?s Scientific Manuscript database

    Genetic markers are pivotal to modern genomics research; however, discovery and genotyping of molecular markers in oat has been hindered by the size and complexity of the genome, and by a scarcity of sequence data. The purpose of this study was to generate oat expressed sequence tag (EST) informatio...

  1. Genic SNP markers and legume synteny reveal candidate genes underlying QTL for Macrophomina phaseolina resistance and maturity in cowpea [Vigna unguiculata (L) Walp.].

    PubMed

    Muchero, Wellington; Ehlers, Jeffrey D; Close, Timothy J; Roberts, Philip A

    2011-01-05

    Macrophomina phaseolina is an emerging and devastating fungal pathogen that causes significant losses in crop production under high temperatures and drought stress. An increasing number of disease incidence reports highlight the wide prevalence of the pathogen around the world and its contribution toward crop yield suppression. In cowpea [Vigna unguiculata (L) Walp.], limited sources of low-level host resistance have been identified, the genetic basis of which is unknown. In this study we report on the identification of strong sources of host resistance to M. phaseolina and the genetic mapping of putative resistance loci on a cowpea genetic map comprised of gene-derived single nucleotide polymorphisms (SNPs) and amplified fragment length polymorphisms (AFLPs). Nine quantitative trait loci (QTLs), accounting for between 6.1 and 40.0% of the phenotypic variance (R2), were identified using plant mortality data taken over three years in field experiments and disease severity scores taken from two greenhouse experiments. Based on annotated genic SNPs as well as synteny with soybean (Glycine max) and Medicago truncatula, candidate resistance genes were found within mapped QTL intervals. QTL Mac-2 explained the largest percent R2 and was identified in three field and one greenhouse experiments where the QTL peak co-located with a SNP marker derived from a pectin esterase inhibitor encoding gene. Maturity effects on the expression of resistance were indicated by the co-location of Mac-6 and Mac-7 QTLs with maturity-related senescence QTLs Mat-2 and Mat-1, respectively. Homologs of the ELF4 and FLK flowering genes were found in corresponding syntenic soybean regions. Only three Macrophomina resistance QTLs co-located with delayed drought-induced premature senescence QTLs previously mapped in the same population, suggesting that largely different genetic mechanisms mediate cowpea response to drought stress and Macrophomina infection. Effective sources of host resistance were

  2. Genic SNP markers and legume synteny reveal candidate genes underlying QTL for Macrophomina phaseolina resistance and maturity in cowpea [Vigna unguiculata (L) Walp.

    PubMed Central

    2011-01-01

    Background Macrophomina phaseolina is an emerging and devastating fungal pathogen that causes significant losses in crop production under high temperatures and drought stress. An increasing number of disease incidence reports highlight the wide prevalence of the pathogen around the world and its contribution toward crop yield suppression. In cowpea [Vigna unguiculata (L) Walp.], limited sources of low-level host resistance have been identified, the genetic basis of which is unknown. In this study we report on the identification of strong sources of host resistance to M. phaseolina and the genetic mapping of putative resistance loci on a cowpea genetic map comprised of gene-derived single nucleotide polymorphisms (SNPs) and amplified fragment length polymorphisms (AFLPs). Results Nine quantitative trait loci (QTLs), accounting for between 6.1 and 40.0% of the phenotypic variance (R2), were identified using plant mortality data taken over three years in field experiments and disease severity scores taken from two greenhouse experiments. Based on annotated genic SNPs as well as synteny with soybean (Glycine max) and Medicago truncatula, candidate resistance genes were found within mapped QTL intervals. QTL Mac-2 explained the largest percent R2 and was identified in three field and one greenhouse experiments where the QTL peak co-located with a SNP marker derived from a pectin esterase inhibitor encoding gene. Maturity effects on the expression of resistance were indicated by the co-location of Mac-6 and Mac-7 QTLs with maturity-related senescence QTLs Mat-2 and Mat-1, respectively. Homologs of the ELF4 and FLK flowering genes were found in corresponding syntenic soybean regions. Only three Macrophomina resistance QTLs co-located with delayed drought-induced premature senescence QTLs previously mapped in the same population, suggesting that largely different genetic mechanisms mediate cowpea response to drought stress and Macrophomina infection. Conclusion Effective

  3. Exploring of new Y-chromosome SNP loci using Pyrosequencing and the SNaPshot methods.

    PubMed

    Wei, Wei; Luo, Hai-Bo; Yan, Jing; Hou, Yi-Ping

    2012-11-01

    The single nucleotide polymorphisms on the Y chromosome (Y-SNP) have been considered to be important in forensic casework. However, Y-SNP loci were mostly population specific and lacked biallelic polymorphisms in the Asian population. In this study, we developed a strategy for seeking and genotyping new Y-SNP markers based on both Pyrosequencing and the SNaPshot methods. As results, 34 new biallelic markers were observed to be polymorphic in the Chinese Han population by estimation of allele frequencies of 103 candidate's Y-SNP loci in DNA pools using Pyrosequencing technology. Then, a multiplex system with 20 Y-SNP loci was genotyped using the SNaPshot™ multiplex kit. Twenty Y-SNP loci defined 56 different haplotypes, and the haplotype diversity was estimated to be 0.9539. Our result demonstrated that the strategy could be used as an efficient tool to search and genotype biallelic markers from a large amount of candidate loci. In addition, 20 Y-SNP loci constructed a multiplex system, which could provide supplementary information for forensic identification.

  4. Identification, validation and survey of a single nucleotide polymorphism (SNP) associated with pungency in Capsicum spp.

    PubMed

    Garcés-Claver, Ana; Fellman, Shanna Moore; Gil-Ortega, Ramiro; Jahn, Molly; Arnedo-Andrés, María S

    2007-11-01

    A single nucleotide polymorphism (SNP) associated with pungency was detected within an expressed sequence tag (EST) of 307 bp. This fragment was identified after expression analysis of the EST clone SB2-66 in placenta tissue of Capsicum fruits. Sequence alignments corresponding to this new fragment allowed us to identify an SNP between pungent and non-pungent accessions. Two methods were chosen for the development of the SNP marker linked to pungency: tetra-primer amplification refractory mutation system-PCR (tetra-primer ARMS-PCR) and cleaved amplified polymorphic sequence. Results showed that both methods were successful in distinguishing genotypes. Nevertheless, tetra-primer ARMS-PCR was chosen for SNP genotyping because it was more rapid, reliable and less cost-effective. The utility of this SNP marker for pungency was demonstrated by the ability to distinguish between 29 pungent and non-pungent cultivars of Capsicum annuum. In addition, the SNP was also associated with phenotypic pungent character in the tested genotypes of C. chinense, C. baccatum, C. frutescens, C. galapagoense, C. eximium, C. tovarii and C. cardenasi. This SNP marker is a faster, cheaper and more reproducible method for identifying pungent peppers than other techniques such as panel tasting, and allows rapid screening of the trait in early growth stages.

  5. Development of a cassava core collection based on single nucleotide polymorphism markers.

    PubMed

    Oliveira, E J; Ferreira, C F; Santos, V S; Oliveira, G A F

    2014-08-25

    Single nucleotide polymorphism (SNP) markers were used in the largest cassava (Manihot esculenta Crantz) germplasm collection from Brazil to develop core collections based on the maximization strategy. Subsets with 61, 64, 84, 128, 256, and 384 cassava accessions were selected and named PoHEU, MST64, PoRAN, MST128, MST256, and MST384, respectively. All the 798 alleles identified by 402 SNP markers in the entire collection were captured in all core collections. Only small alterations in the diversity parameters were observed for the different core collections compared with the complete collection. Because of the optimal adjustment of the validation parameters representative of the complete collection, the absence of genotypes with high genetic similarity and the maximization of the genetic distances between accessions of the PoHEU core collection, which contained 4.7% of the accessions of the complete collection, maximized the genetic conservation of this important cassava collection. Furthermore, the development of this core collection will allow concentrated efforts toward future characterization and agronomic evaluation of accessions to maximize the diversity and genetic gains in cassava breeding programs.

  6. Single-nucleotide polymorphism versus microsatellite markers in a combined linkage and segregation analysis of a quantitative trait

    PubMed Central

    Daw, E Warwick; Heath, Simon C; Lu, Yue

    2005-01-01

    Increasingly, single-nucleotide polymorphism (SNP) markers are being used in preference to microsatellite markers. However, methods developed for microsatellites may be problematic when applied to SNP markers. We evaluated the results of using SNPs vs. microsatellites in Monte Carlo Markov chain (MCMC) oligogenic combined segregation and linkage analysis methods. These methods were developed with microsatellite markers in mind. We selected chromosome 7 from the Collaborative Study on the Genetics of Alcoholism dataset for analysis because linkage to an electrophysiological trait had been reported there. We found linkage in the same region of chromosome 7 with the Affymetrix SNP data, the Illumina SNP data, and the microsatellite marker data. The MCMC sampler appears to mix with both types of data. The sampler implemented in this MCMC oligogenic combined segregation and linkage analysis appears to handle SNP data as well as microsatellite data and it is possible that the localizations with the SNP data are better. PMID:16451642

  7. Single-nucleotide polymorphism versus microsatellite markers in a combined linkage and segregation analysis of a quantitative trait.

    PubMed

    Daw, E Warwick; Heath, Simon C; Lu, Yue

    2005-12-30

    Increasingly, single-nucleotide polymorphism (SNP) markers are being used in preference to microsatellite markers. However, methods developed for microsatellites may be problematic when applied to SNP markers. We evaluated the results of using SNPs vs. microsatellites in Monte Carlo Markov chain (MCMC) oligogenic combined segregation and linkage analysis methods. These methods were developed with microsatellite markers in mind. We selected chromosome 7 from the Collaborative Study on the Genetics of Alcoholism dataset for analysis because linkage to an electrophysiological trait had been reported there. We found linkage in the same region of chromosome 7 with the Affymetrix SNP data, the Illumina SNP data, and the microsatellite marker data. The MCMC sampler appears to mix with both types of data. The sampler implemented in this MCMC oligogenic combined segregation and linkage analysis appears to handle SNP data as well as microsatellite data and it is possible that the localizations with the SNP data are better.

  8. Genome Wide Sampling Sequencing for SNP Genotyping: Methods, Challenges and Future Development

    PubMed Central

    Jiang, Zhihua; Wang, Hongyang; Michal, Jennifer J.; Zhou, Xiang; Liu, Bang; Woods, Leah C. Solberg; Fuchs, Rita A.

    2016-01-01

    Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals. PMID:26722221

  9. Genome Wide Sampling Sequencing for SNP Genotyping: Methods, Challenges and Future Development.

    PubMed

    Jiang, Zhihua; Wang, Hongyang; Michal, Jennifer J; Zhou, Xiang; Liu, Bang; Woods, Leah C Solberg; Fuchs, Rita A

    2016-01-01

    Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals.

  10. Association analysis and identification of SNP markers for Stemphylium leaf spot (Stemphylium botryosum f. sp. spinacia) resistance in spinach (Spinacia oleracea)

    USDA-ARS?s Scientific Manuscript database

    Stemphylium leaf spot, caused by Stemphylium botryosum f. sp. spinacia is an important disease in spinach. Use of genetic resistance is an efficient, economic and environment-friendly method to control this disease. The objective of this research was to conduct association analysis and identify SNP ...

  11. Allele specific CAPS marker development and characterization of chalcone synthase gene in Indian mulberry (Morus spp., family Moraceae).

    PubMed

    Arora, Vivek; Ghosh, M K; Pal, Soumili; Gangopadhyay, Gaurab

    2017-01-01

    Chalcone synthase (CHS) is an essential enzyme in the phenylpropanoid pathway that catalyzes the first step in flavonoid biosynthesis in plants under diverse environmental stress. We have used CHS as a candidate gene in mulberry and developed Single Nucleotide Polymorphism (SNP) based co-dominant Cleaved Amplified Polymorphic Sequence (CAPS) marker associated with the CHS locus. The segregation pattern of the marker was studied in an F1 population derived from a hybridization program between two mulberry genotypes showing polymorphism for the CHS locus. Differential CHS activity of the recombinants has been correlated with the segregation pattern of the marker. Homology modelling and docking studies are performed for both the identified CHS alleles and correlated with respective CHS activity. Phenotyping of Powdery Mildew infected F1 population indicated a probable association with the CAPS marker.

  12. [Advances in development of gene-gene interaction analysis methods based on SNP data: a review].

    PubMed

    Luan, Yi-Zhao; Zuo, Xiao-Yu; Liu, Ke; Li, Gu; Rao, Shao-Qi

    2013-12-01

    The SNP-based association analysis has become one of the most important approaches to interpret the underlying molecular mechanisms for human complex diseases. Nevertheless, the widely-used singe-locus analysis is only capable of capturing a small portion of susceptible SNPs with prominent marginal effects, leaving the important genetic component, epistasis or joint effects, to be undetectable. Identifying the complex interplays among multiple genes in the genome-wide context is an essential task for systematically unraveling the molecular mechanisms for complex diseases. Many approaches have been used to detect genome-wide gene-gene interactions and provided new insights into the genetic basis of complex diseases. This paper reviewed recent advances of the methods for detecting gene-gene interaction, categorized into three types, model-based and model-free statistical methods, and data mining methods, based on their characteristics in theory and numerical algorithm. In particular, the basic principle, numerical implementation and cautions for application for each method were elucidated. In addition, this paper briefly discussed the limitations and challenges associated with detecting genome-wide epistasis, in order to provide some methodological consultancies for scientists in the related fields.

  13. New generation pharmacogenomic tools: a SNP linkage disequilibrium Map, validated SNP assay resource, and high-throughput instrumentation system for large-scale genetic studies.

    PubMed

    De La Vega, Francisco M; Dailey, David; Ziegle, Janet; Williams, Julie; Madden, Dawn; Gilbert, Dennis A

    2002-06-01

    Since public and private efforts announced the first draft of the human genome last year, researchers have reported great numbers of single nucleotide polymorphisms (SNPs). We believe that the availability of well-mapped, quality SNP markers constitutes the gateway to a revolution in genetics and personalized medicine that will lead to better diagnosis and treatment of common complex disorders. A new generation of tools and public SNP resources for pharmacogenomic and genetic studies--specifically for candidate-gene, candidate-region, and whole-genome association studies--will form part of the new scientific landscape. This will only be possible through the greater accessibility of SNP resources and superior high-throughput instrumentation-assay systems that enable affordable, highly productive large-scale genetic studies. We are contributing to this effort by developing a high-quality linkage disequilibrium SNP marker map and an accompanying set of ready-to-use, validated SNP assays across every gene in the human genome. This effort incorporates both the public sequence and SNP data sources, and Celera Genomics' human genome assembly and enormous resource ofphysically mapped SNPs (approximately 4,000,000 unique records). This article discusses our approach and methodology for designing the map, choosing quality SNPs, designing and validating these assays, and obtaining population frequency ofthe polymorphisms. We also discuss an advanced, high-performance SNP assay chemisty--a new generation of the TaqMan probe-based, 5' nuclease assay-and high-throughput instrumentation-software system for large-scale genotyping. We provide the new SNP map and validation information, validated SNP assays and reagents, and instrumentation systems as a novel resource for genetic discoveries.

  14. SNP500Cancer: a public resource for sequence validation, assay development, and frequency analysis for genetic variation in candidate genes.

    PubMed

    Packer, Bernice R; Yeager, Meredith; Burdett, Laura; Welch, Robert; Beerman, Michael; Qi, Liqun; Sicotte, Hugues; Staats, Brian; Acharya, Mekhala; Crenshaw, Andrew; Eckert, Andrew; Puri, Vinita; Gerhard, Daniela S; Chanock, Stephen J

    2006-01-01

    The SNP500Cancer database provides sequence and genotype assay information for candidate SNPs useful in mapping complex diseases, such as cancer. The database is an integral component of the NCI Cancer Genome Anatomy Project (http://cgap.nci.nih.gov). SNP500Cancer reports sequence analysis of anonymized control DNA samples (n = 102 Coriell samples representing four self-described ethnic groups: African/African-American, Caucasian, Hispanic and Pacific Rim). The website is searchable by gene, chromosome, gene ontology pathway, dbSNP ID and SNP500Cancer SNP ID. As of October 2005, the database contains >13 400 SNPs, 9124 of which have been sequenced in the SNP500Cancer population. For each analysed SNP, gene location and >200 bp of surrounding annotated sequence (including nearby SNPs) are provided, with frequency information in total and per subpopulation as well as calculation of Hardy-Weinberg equilibrium for each subpopulation. The website provides the conditions for validated sequencing and genotyping assays, as well as genotype results for the 102 samples, in both viewable and downloadable formats. A subset of sequence validated SNPs with minor allele frequency >5% are entered into a high-throughput pipeline for genotyping analysis to determine concordance for the same 102 samples. In addition, the results of genotype analysis for select validated SNP assays (defined as 100% concordance between sequence analysis and genotype results) are posted for an additional 280 samples drawn from the Human Diversity Panel (HDP). SNP500Cancer provides an invaluable resource for investigators to select SNPs for analysis, design genotyping assays using validated sequence data, choose selected assays already validated on one or more genotyping platforms, and select reference standards for genotyping assays. The SNP500Cancer database is freely accessible via the web page at http://snp500cancer.nci.nih.gov.

  15. Development of Single Nucleotide Polymorphism markers in Theobroma cacao and comparison to Simple Sequence Repeat markers for genotyping of Cameroon clones.

    USDA-ARS?s Scientific Manuscript database

    Single Nucleotide Polymorphism (SNP) markers are increasingly being used in crop breeding programs, slowly replacing Simple Sequence Repeats (SSR) and other markers. SNPs provide many benefits over SSRs, including ease of analysis and unambiguous results across various platforms. We have identifie...

  16. SNP-based high density genetic map and mapping of btwd1 dwarfing gene in barley

    PubMed Central

    Ren, Xifeng; Wang, Jibin; Liu, Lipan; Sun, Genlou; Li, Chengdao; Luo, Hong; Sun, Dongfa

    2016-01-01

    A high-density linkage map is a valuable tool for functional genomics and breeding. A newly developed sequence-based marker technology, restriction site associated DNA (RAD) sequencing, has been proven to be powerful for the rapid discovery and genotyping of genome-wide single nucleotide polymorphism (SNP) markers and for the high-density genetic map construction. The objective of this research was to construct a high-density genetic map of barley using RAD sequencing. 1894 high-quality SNP markers were developed and mapped onto all seven chromosomes together with 68 SSR markers. These 1962 markers constituted a total genetic length of 1375.8 cM and an average of 0.7 cM between adjacent loci. The number of markers within each linkage group ranged from 209 to 396. The new recessive dwarfing gene btwd1 in Huaai 11 was mapped onto the high density linkage maps. The result showed that the btwd1 is positioned between SNP marks 7HL_6335336 and 7_249275418 with a genetic distance of 0.9 cM and 0.7 cM on chromosome 7H, respectively. The SNP-based high-density genetic map developed and the dwarfing gene btwd1 mapped in this study provide critical information for position cloning of the btwd1 gene and molecular breeding of barley. PMID:27530597

  17. A flexible multi-species genome-wide 60K SNP chip developed from pooled resequencing of 240 Eucalyptus tree genomes across 12 species.

    PubMed

    Silva-Junior, Orzenil B; Faria, Danielle A; Grattapaglia, Dario

    2015-06-01

    We used whole genome resequencing of pooled individuals to develop a high-density single-nucleotide polymorphism (SNP) chip for Eucalyptus. Genomes of 240 trees of 12 species were sequenced at 3.5× each, and 46 997 586 raw SNP variants were subject to multivariable filtering metrics toward a multispecies, genome-wide distributed chip content. Of the 60 904 SNPs on the chip, 59 222 were genotyped and 51 204 were polymorphic across 14 Eucalyptus species, providing a 96% genome-wide coverage with 1 SNP/12-20 kb, and 47 069 SNPs at ≤ 10 kb from 30 444 of the 33 917 genes in the Eucalyptus genome. Given the EUChip60K multi-species genotyping flexibility, we show that both the sample size and taxonomic composition of cluster files impact heterozygous call specificity and sensitivity by benchmarking against 'gold standard' genotypes derived from deeply sequenced individual tree genomes. Thousands of SNPs were shared across species, likely representing ancient variants arisen before the split of these taxa, hinting to a recent eucalypt radiation. We show that the variable SNP filtering constraints allowed coverage of the entire site frequency spectrum, mitigating SNP ascertainment bias. The EUChip60K represents an outstanding tool with which to address population genomics questions in Eucalyptus and to empower genomic selection, GWAS and the broader study of complex trait variation in eucalypts. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  18. De novo Transcriptome Analysis and Molecular Marker Development of Two Hemarthria Species

    PubMed Central

    Huang, Xiu; Yan, Hai-Dong; Zhang, Xin-Quan; Zhang, Jian; Frazier, Taylor P.; Huang, De-Jun; Lu, Lu; Huang, Lin-Kai; Liu, Wei; Peng, Yan; Ma, Xiao; Yan, Yan-Hong

    2016-01-01

    Hemarthria R. Br. is an important genus of perennial forage grasses that is widely used in subtropical and tropical regions. Hemarthria grasses have made remarkable contributions to the development of animal husbandry and agro-ecosystem maintenance; however, there is currently a lack of comprehensive genomic data available for these species. In this study, we used Illumina high-throughput deep sequencing to characterize of two agriculturally important Hemarthria materials, H. compressa “Yaan” and H. altissima “1110.” Sequencing runs that used each of four normalized RNA samples from the leaves or roots of the two materials yielded more than 24 million high-quality reads. After de novo assembly, 137,142 and 77,150 unigenes were obtained for “Yaan” and “1110,” respectively. In addition, a total of 86,731 “Yaan” and 48,645 “1110” unigenes were successfully annotated. After consolidating the unigenes for both materials, 42,646 high-quality SNPs were identified in 10,880 unigenes and 10,888 SSRs were identified in 8330 unigenes. To validate the identified markers, high quality PCR primers were designed for both SNPs and SSRs. We randomly tested 16 of the SNP primers and 54 of the SSR primers and found that the majority of these primers successfully amplified the desired PCR product. In addition, high cross-species transferability (61.11–87.04%) of SSR markers was achieved for four other Poaceae species. The amount of RNA sequencing data that was generated for these two Hemarthria species greatly increases the amount of genomic information available for Hemarthria and the SSR and SNP markers identified in this study will facilitate further advancements in genetic and molecular studies of the Hemarthria genus. PMID:27148320

  19. Development, genetic mapping and QTL association of cotton PHYA, PHYB, and HY5-specific CAPS and dCAPS markers

    USDA-ARS?s Scientific Manuscript database

    Among SNP markers that become increasingly valuable in molecular breeding of crop plants are the CAP and dCAP markers derived from the genes of interest. To date, the number of such gene-based markers is small in polyploid crop plants such as tetraploid cotton that has A and D subgenomes. The obje...

  20. Association and interaction of myopia with SNP markers rs13382811 and rs6469937 at ZFHX1B and SNTB1 in Han Chinese and European populations.

    PubMed

    Li, Jiali; Jiao, Xiaodong; Zhang, Qingjiong; Hejtmancik, J Fielding

    2017-01-01

    Previously, a genome-wide association study (GWAS) identified rs13382811 (near ZFHX1B) and rs6469937 (near SNTB1) to be associated with high myopia. The present study evaluates the association of these two single nucleotide polymorphisms (SNPs) with moderate to high myopia in two Chinese cohorts and two cohorts of European populations. Two Chinese university student cohorts, including one with 300 unrelated subjects with high myopia and 308 emmetropic controls from Guangzhou and a second with 96 unrelated individuals with moderate to high myopia and 96 emmetropic controls of Chaoshanese origin in Guangzhou, were enrolled in this study. Two SNPs, rs6469937 and rs13382811, were selected for genotyping based on their reported associations with severe myopia. The SNPs were genotyped via DNA sequencing. In addition, association analysis of both SNPs was performed using genotype data from the database of Genotypes and Phenotypes (dbGaP) involving a total of 2,423 samples in two independent cohorts of European-derived populations, as follows: Kooperative Gesundheitsforschung in der Region Augsburg (KORA) and TwinsUK. The allelic and genotypic distribution among cases and controls were analyzed using the Chi-square test. Logistic regression was used to evaluate the SNP-SNP interaction. Fisher's exact test was used for two-SNP comparisons. In the Guangzhou cohort, SNP rs13382811 near ZFHX1B showed significant association with high myopia (pallelic = 0.0001, pgenotypic = 4.07 × 10(-5)), with the minor T allele showing an increased risk of high myopia (odds ratio [OR] = 1.68, 95% confidence interval [CI] = 1.28-2.20). SNP rs6469937 near SNTB1 showed nominal evidence of association (pallelic = 0.0085, pgenotypic = 0.0166), which did not withstand correction for multiple testing. No significant association was detected in the smaller Chaoshan cohort alone. The association of SNPs rs13382811 and rs6469937 remained significant when both Han Chinese cohorts were combined

  1. Empirical evaluation of DArT, SNP, and SSR marker-systems for genotyping, clustering, and assigning sugar beet hybrid varieties into populations

    USDA-ARS?s Scientific Manuscript database

    Dominant and co-dominant molecular markers are routinely used in plant genetic diversity research. In the present study we assessed the success-rate of three marker-systems for estimating genotypic diversity, clustering varieties into populations, and assigning a single variety into the expected pop...

  2. Developing Temporal Markers to Profile Operational Errors

    DTIC Science & Technology

    2006-08-01

    ATC subject matter experts ( SMEs ) to assist in the devel- opment of a comprehensive list of TMs. Their collective experience included 87 years of...controllers involved. The SMEs were provided with our definition of a temporal marker and a list of some TM examples, such as the time when the aircraft...the first control instruction to the pilot. Procedure The SMEs convened as a group on several occasions to create an exhaustive list of TMs. An

  3. Development and validation of functional CAPS markers for the FAE genes in Brassica juncea and their use in marker-assisted selection

    PubMed Central

    Saini, Navinder; Singh, Naveen; Kumar, Anil; Vihan, Nitika; Yadav, Sangita; Vasudev, Sujata; Yadava, D.K.

    2016-01-01

    Low erucic acid is a major breeding target to improve the edible oil quality in Brassica juncea. The single nucleotide polymorphism (SNP) in fatty acid elongase 1 (FAE1.1 and FAE1.2) gene was exploited to expedite the breeding program. The paralogs of FAE1 gene were sequenced from low erucic acid genotype Pusa Mustard 30 and SNPs were identified through homologous alignment with sequence downloaded from NCBI GenBank. Two SNPs in FAE1.1 at position 591 and 1265 and one in FAE1.2 at 237 were found polymorphic among low and high erucic acid genotypes. These SNPs either create or change the recognition site of restriction enzymes. Transition of a single nucleotide at position 591 and 1265 in FAE1.1, and at position 237 in FAE1.2, leads to a change in the recognition site of Hpy99I, BglII and MnlI restriction enzymes, respectively. Two CAPS markers for FAE1.1 and one for FAE1.2 were developed to differentiate low and high erucic acid genotypes. The efficiency of these CAPS markers was found 100 per cent when validated in Brassica juncea, and B. nigra genotypes and used in back-cross breeding. These CAPS markers will facilitate in marker-assisted selection for improvement of oil quality in Brassica juncea. PMID:28163599

  4. Combined use of a new SNP-based assay and multilocus SSR markers to assess genetic diversity of Xylella fastidiosa subsp. pauca infecting citrus and coffee plants.

    PubMed

    Montes-Borrego, Miguel; Lopes, Joao R S; Jiménez-Díaz, Rafael M; Landa, Blanca B

    2015-03-01

    Two haplotypes of Xylella fastidiosa subsp. pauca (Xfp) that correlated with their host of origin were identified in a collection of 90 isolates infecting citrus and coffee plants in Brazil, based on a single-nucleotide polymorphism in the gyrB sequence. A new single-nucleotide primer extension (SNuPE) protocol was designed for rapid identification of Xfp according to the host source. The protocol proved to be robust for the prediction of the Xfp host source in blind tests using DNA from cultures of the bacterium, infected plants, and insect vectors allowed to feed on Xfp-infected citrus plants. AMOVA and STRUCTURE analyses of microsatellite data separated most Xfp populations on the basis of their host source, indicating that they were genetically distinct. The combined use of the SNaPshot protocol and three previously developed multilocus SSR markers showed that two haplotypes and distinct isolates of Xfp infect citrus and coffee in Brazil and that multiple, genetically different isolates can be present in a single orchard or infect a single tree. This combined approach will be very useful in studies of the epidemiology of Xfp-induced diseases, host specificity of bacterial genotypes, the occurrence of Xfp host jumping, vector feeding habits, etc., in economically important cultivated plants or weed host reservoirs of Xfp in Brazil and elsewhere.

  5. Development of core SSR markers for Gossypium germplasm characterization

    USDA-ARS?s Scientific Manuscript database

    A set of 105 portable DNA markers were carefully developed to provide a common basis for systematic characterization of cotton germplasm collections in the U.S. and throughout the world. The 105 PCR-based SSR markers of different origins were evenly distributed on each of the 26 cotton chromosomes ...

  6. A SNP-Based Molecular Barcode for Characterization of Common Wheat

    PubMed Central

    Gao, LiFeng; Jia, JiZeng; Kong, XiuYing

    2016-01-01

    Wheat is grown as a staple crop worldwide. It is important to develop an effective genotyping tool for this cereal grain both to identify germplasm diversity and to protect the rights of breeders. Single-nucleotide polymorphism (SNP) genotyping provides a means for developing a practical, rapid, inexpensive and high-throughput assay. Here, we investigated SNPs as robust markers of genetic variation for typing wheat cultivars. We identified SNPs from an array of 9000 across a collection of 429 well-known wheat cultivars grown in China, of which 43 SNP markers with high minor allele frequency and variations discriminated the selected wheat varieties and their wild ancestors. This SNP-based barcode will allow for the rapid and precise identification of wheat germplasm resources and newly released varieties and will further assist in the wheat breeding program. PMID:26985664

  7. Development Of Interspecific Cssls In Rice Using SNP-Based Selection

    USDA-ARS?s Scientific Manuscript database

    Six libraries of chromosome segment substitution lines (CSSLs) are being developed based on crosses between three diverse accessions of O. rufipogon (from China, Laos and Indonesia) and two O. sativa recurrent parents, IR64, an indica variety (from the Philippines), and Cybonnet, a tropical japonica...

  8. Development of high density SNP-based linkage map in pearl millet

    USDA-ARS?s Scientific Manuscript database

    Pearl millet (Cenchrus americanus (L.) Morrone) is a gluten free grain crop which is additionally gaining importance in the USA due to the increased demand for pearl millet flour by many ethnic groups. As a result, efforts are underway in the Southeast to develop high grain yielding adapted pearl mi...

  9. High-throughput RAD-SNP genotyping for characterization of sugar beet genotypes

    USDA-ARS?s Scientific Manuscript database

    High-throughput SNP genotyping provides a rapid way of developing resourceful set of markers for delineating the genetic architecture and for effective species discrimination. In the presented research, we demonstrate a set of 192 SNPs for effective genotyping in sugar beet using high-throughput mar...

  10. DEVELOPMENT OF CODOMINANT MARKERS FOR IDENTIFYING SPECIES HYBRIDS

    EPA Science Inventory

    Herein we describe a simple method for developing species-diagnostic markers that would permit the rapid identification of hybrid individuals. Our method relies on amplified length polymorphism (AFLP) and single strand conformation polymorphism (SSCP) technologies, both of which...

  11. DEVELOPMENT OF CODOMINANT MARKERS FOR IDENTIFYING SPECIES HYBRIDS

    EPA Science Inventory

    Herein we describe a simple method for developing species-diagnostic markers that would permit the rapid identification of hybrid individuals. Our method relies on amplified length polymorphism (AFLP) and single strand conformation polymorphism (SSCP) technologies, both of which...

  12. The impact of genotyping-by-sequencing pipelines on SNP discovery and identification of markers associated verticillium wilt resistance in autotetraploid alfalfa (sedicago sativa l.)

    USDA-ARS?s Scientific Manuscript database

    Verticillium wilt (VW) of alfalfa is a soilborne disease that causes severe yield loss in alfalfa. To identify molecular markers associated with VW resistance, an integrated framework of genome-wide association study (GWAS) with high-throughput genotyping by sequencing (GBS) was used for mapping lo...

  13. Cacao single-nucleotide polymorphism (SNP) markers: A discovery strategy to identify SNPs for genotyping, genetic mapping and genome wide association studies (GWAS)

    USDA-ARS?s Scientific Manuscript database

    Single-nucleotide polymorphisms (SNPs) are the most common genetic markers in Theobroma cacao, occurring approximately once in every 200 nucleotides. SNPs, like microsatellites, are co-dominant and PCR-based, but they have several advantages over microsatellites. They are unambiguous, so that a SN...

  14. Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine

    PubMed Central

    2011-01-01

    Background Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequenced Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map, made it possible to align the 12 linkage groups of both species. Conclusions Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using new generation

  15. New developments in biological markers of bone metabolism in osteoporosis.

    PubMed

    Garnero, Patrick

    2014-09-01

    Over the last 15 years several biological markers of bone turnover have been developed with increased specificity and sensitivity. In osteoporosis clinical studies, the IOF and IFCC organizations have recently recommended the measurements of serum type I collagen N-propeptide (PINP) and the crosslinked C-terminal telopeptide (serum CTX) as markers of bone formation and bone resorption, respectively. However these markers have some limitations including a lack of specificity for bone tissue, their inability to reflect osteocyte activity or periosteal apposition. In addition they do not allow the investigation of bone tissue quality an important determinant of skeletal fragility. To address these limitations, new developments in markers of bone metabolism have been recently achieved. These include assays for periostin, a matricellular protein preferentially localized in the periosteal tissue, sphingosine 1-phosphate, a lipid mediator which acts mainly on osteoclastogenesis and the osteocyte factors such as sclerostin and FGF-23. Recent studies have shown an association between the circulating levels of these biological markers and fracture risk in postmenopausal women or elderly men, although data require confirmation in additional prospective studies. Finally, recent studies suggest that the measurements of circulating microRNAs may represent a novel class of early biological markers in osteoporosis. It is foreseen that with the use of genomics and proteomics, new markers will be developed to ultimately improve the management of patients with osteoporosis.

  16. Development of a high-throughput SNP resource to advance genomic, genetic and breeding research in carrot (Daucus carota L.)

    USDA-ARS?s Scientific Manuscript database

    The rapid advancement in high-throughput SNP genotyping technologies along with next generation sequencing (NGS) platforms has decreased the cost, improved the quality of large-scale genome surveys, and allowed specialty crops with limited genomic resources such as carrot (Daucus carota) to access t...

  17. Markers

    ERIC Educational Resources Information Center

    Healthy Schools Network, Inc., 2011

    2011-01-01

    Dry erase whiteboards come with toxic dry erase markers and toxic cleaning products. Dry erase markers labeled "nontoxic" are not free of toxic chemicals and can cause health problems. Children are especially vulnerable to environmental health hazards; moreover, schools commonly have problems with indoor air pollution, as they are more densely…

  18. Chaotic particle swarm optimization for detecting SNP-SNP interactions for CXCL12-related genes in breast cancer prevention.

    PubMed

    Chuang, Li-Yeh; Chang, Hsueh-Wei; Lin, Ming-Cheng; Yang, Cheng-Hong

    2012-07-01

    Genome-wide association studies have revealed that many single nucleotide polymorphisms (SNPs) are associated with breast cancer, and yet the potential SNP-SNP interactions have not been well addressed to date. This study aims to develop a methodology for the selection of SNP-genotype combinations with a maximum difference between case and control groups. We propose a new chaotic particle swarm optimization (CPSO) algorithm that identifies the best SNP combinations for breast cancer association studies containing seven SNPs. Five scoring functions, that is, the percentage correct, sensitivity/specificity, positive predictive value/negative predictive value, risk ratio, and odds ratio, are provided for evaluating SNP interactions in different SNP combinations. The CPSO algorithm identified the best SNP combinations associated with breast cancer protection. Some SNP interactions in specific SNPs and their corresponding genotypes were revealed. These SNP combinations showed a significant association with breast cancer protection (P<0.05). The sensitivity and specificity of the respective best SNP combinations were all higher than 90%. In contrast to the corresponding non-SNP-SNP interaction combinations, the estimated odds ratio and risk ratio of the SNP-SNP interaction in SNP combinations for breast cancer were less than 100%. This suggests that CPSO can successfully identify the best SNP combinations for breast cancer protection. In conclusion, we focus on developing a methodology for the selection of SNP-genotype combinations with a maximum difference between case and control groups. The CPSO method can effectively identify SNP-SNP interactions in complex biological relationships underlying the progression of breast cancer.

  19. A Large Maize (Zea mays L.) SNP Genotyping Array: Development and Germplasm Genotyping, and Genetic Mapping to Compare with the B73 Reference Genome

    PubMed Central

    Ganal, Martin W.; Durstewitz, Gregor; Polley, Andreas; Bérard, Aurélie; Buckler, Edward S.; Charcosset, Alain; Clarke, Joseph D.; Graner, Eva-Maria; Hansen, Mark; Joets, Johann; Le Paslier, Marie-Christine; McMullen, Michael D.; Montalent, Pierre; Rose, Mark; Schön, Chris-Carolin; Sun, Qi; Walter, Hildrun; Martin, Olivier C.; Falque, Matthieu

    2011-01-01

    SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations – IBM (B73×Mo17) and LHRF (F2×F252) – were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding. PMID:22174790

  20. A large maize (Zea mays L.) SNP genotyping array: development and germplasm genotyping, and genetic mapping to compare with the B73 reference genome.

    PubMed

    Ganal, Martin W; Durstewitz, Gregor; Polley, Andreas; Bérard, Aurélie; Buckler, Edward S; Charcosset, Alain; Clarke, Joseph D; Graner, Eva-Maria; Hansen, Mark; Joets, Johann; Le Paslier, Marie-Christine; McMullen, Michael D; Montalent, Pierre; Rose, Mark; Schön, Chris-Carolin; Sun, Qi; Walter, Hildrun; Martin, Olivier C; Falque, Matthieu

    2011-01-01

    SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations - IBM (B73×Mo17) and LHRF (F2×F252) - were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding.

  1. Development of diagnostic markers from disease resistance QTLs for marker-assisted breeding in peanut

    USDA-ARS?s Scientific Manuscript database

    Breeding for disease resistance in peanut cultivars has been constrained due to both a narrow genetic base and a low degree of polymorphism. Earlier attempts have resulted in the development of a few hundreds of simple sequence repeat (SSR) markers in peanut that could define broad QTL on the physic...

  2. Enriching Genomic Resources and Marker Development from Transcript Sequences of Jatropha curcas for Microgravity Studies

    PubMed Central

    Tian, Wenlan; Paudel, Dev

    2017-01-01

    Jatropha (Jatropha curcas L.) is an economically important species with a great potential for biodiesel production. To enrich the jatropha genomic databases and resources for microgravity studies, we sequenced and annotated the transcriptome of jatropha and developed SSR and SNP markers from the transcriptome sequences. In total 1,714,433 raw reads with an average length of 441.2 nucleotides were generated. De novo assembling and clustering resulted in 115,611 uniquely assembled sequences (UASs) including 21,418 full-length cDNAs and 23,264 new jatropha transcript sequences. The whole set of UASs were fully annotated, out of which 59,903 (51.81%) were assigned with gene ontology (GO) term, 12,584 (10.88%) had orthologs in Eukaryotic Orthologous Groups (KOG), and 8,822 (7.63%) were mapped to 317 pathways in six different categories in Kyoto Encyclopedia of Genes and Genome (KEGG) database, and it contained 3,588 putative transcription factors. From the UASs, 9,798 SSRs were discovered with AG/CT as the most frequent (45.8%) SSR motif type. Further 38,693 SNPs were detected and 7,584 remained after filtering. This UAS set has enriched the current jatropha genomic databases and provided a large number of genetic markers, which can facilitate jatropha genetic improvement and many other genetic and biological studies. PMID:28154822

  3. Markers of tolerance development to food allergens.

    PubMed

    Ponce, M; Diesner, S C; Szépfalusi, Z; Eiwegger, T

    2016-10-01

    IgE-mediated reactions to food allergens are the most common cause of anaphylaxis in childhood. Although allergies to cow's milk, egg, or soy proteins, in contrast to peanut and tree nut allergens, resolve within the first 6 years of life in up to 60% due to natural tolerance development, this process is not well understood. At present, there is no cure or treatment for food allergy that would result in an induction of tolerance to the symptom-eliciting food. Avoidance, providing an emergency plan and education, is the standard of treatment. Oral immunotherapeutic approaches have been proven reasonable efficacy; however, they are associated with high rates of side-effects and low numbers of patients achieving tolerance. Nevertheless, mechanisms that take place during oral immunotherapy may help to understand tolerance development. On the basis of these therapeutic interventions, events like loss of basophil activation and induction of regulatory lymphocyte subsets and of blocking antibodies have been described. Their functional importance at a clinical level, however, remains to be investigated in detail. Consequently, there is eminent need to understand the process of tolerance development to food allergens and define biomarkers to develop and monitor new treatment strategies for food allergy.

  4. SNP-SNP Interaction Analysis on Soybean Oil Content under Multi-Environments

    PubMed Central

    Yin, Zhengong; Leng, Yue; Yu, Hongxiao; Jia, Huiying; Jiang, Shanshan; Ni, Zhongqiu; Jiang, Hongwei; Han, Xue; Liu, Chunyan; Hu, Zhenbang; Wu, Xiaoxia; Hu, Guohua; Xin, Dawei; Qi, Zhaoming

    2016-01-01

    Soybean oil content is one of main quality traits. In this study, we used the multifactor dimensionality reduction (MDR) method and a soybean high-density genetic map including 5,308 markers to identify stable single nucleotide polymorphism (SNP)—SNP interactions controlling oil content in soybean across 23 environments. In total, 36,442,756 SNP-SNP interaction pairs were detected, 1865 of all interaction pairs associated with soybean oil content were identified under multiple environments by the Bonferroni correction with p <3.55×10−11. Two and 1863 SNP-SNP interaction pairs detected stable across 12 and 11 environments, respectively, which account around 50% of total environments. Epistasis values and contribution rates of stable interaction (the SNP interaction pairs were detected in more than 2 environments) pairs were detected by the two way ANOVA test, the available interaction pairs were ranged 0.01 to 0.89 and from 0.01 to 0.85, respectively. Some of one side of the interaction pairs were identified with previously research as a major QTL without epistasis effects. The results of this study provide insights into the genetic architecture of soybean oil content and can serve as a basis for marker-assisted selection breeding. PMID:27668866

  5. Mining for SNPs and SSRs using SNPServer, dbSNP and SSR taxonomy tree.

    PubMed

    Batley, Jacqueline; Edwards, David

    2009-01-01

    Molecular genetic markers represent one of the most powerful tools for the analysis of genomes and the association of heritable traits with underlying genetic variation. The development of high-throughput methods for the detection of single nucleotide polymorphisms (SNPs) and simple sequence repeats (SSRs) has led to a revolution in their use as molecular markers. The availability of large sequence data sets permits mining for these molecular markers, which may then be used for applications such as genetic trait mapping, diversity analysis and marker assisted selection in agriculture. Here we describe web-based automated methods for the discovery of SSRs using SSR taxonomy tree, the discovery of SNPs from sequence data using SNPServer and the identification of validated SNPs from within the dbSNP database. SSR taxonomy tree identifies pre-determined SSR amplification primers for virtually all species represented within the GenBank database. SNPServer uses a redundancy based approach to identify SNPs within DNA sequences. Following submission of a sequence of interest, SNPServer uses BLAST to identify similar sequences, CAP3 to cluster and assemble these sequences and then the SNP discovery software autoSNP to detect SNPs and insertion/deletion (indel) polymorphisms. The NCBI dbSNP database is a catalogue of molecular variation, hosting validated SNPs for several species within a public-domain archive.

  6. Genomewide linkage analysis of bipolar disorder by use of a high-density single-nucleotide-polymorphism (SNP) genotyping assay: a comparison with microsatellite marker assays and finding of significant linkage to chromosome 6q22.

    PubMed

    Middleton, F A; Pato, M T; Gentile, K L; Morley, C P; Zhao, X; Eisener, A F; Brown, A; Petryshen, T L; Kirby, A N; Medeiros, H; Carvalho, C; Macedo, A; Dourado, A; Coelho, I; Valente, J; Soares, M J; Ferreira, C P; Lei, M; Azevedo, M H; Kennedy, J L; Daly, M J; Sklar, P; Pato, C N

    2004-05-01

    We performed a linkage analysis on 25 extended multiplex Portuguese families segregating for bipolar disorder, by use of a high-density single-nucleotide-polymorphism (SNP) genotyping assay, the GeneChip Human Mapping 10K Array (HMA10K). Of these families, 12 were used for a direct comparison of the HMA10K with the traditional 10-cM microsatellite marker set and the more dense 4-cM marker set. This comparative analysis indicated the presence of significant linkage peaks in the SNP assay in chromosomal regions characterized by poor coverage and low information content on the microsatellite assays. The HMA10K provided consistently high information and enhanced coverage throughout these regions. Across the entire genome, the HMA10K had an average information content of 0.842 with 0.21-Mb intermarker spacing. In the 12-family set, the HMA10K-based analysis detected two chromosomal regions with genomewide significant linkage on chromosomes 6q22 and 11p11; both regions had failed to meet this strict threshold with the microsatellite assays. The full 25-family collection further strengthened the findings on chromosome 6q22, achieving genomewide significance with a maximum nonparametric linkage (NPL) score of 4.20 and a maximum LOD score of 3.56 at position 125.8 Mb. In addition to this highly significant finding, several other regions of suggestive linkage have also been identified in the 25-family data set, including two regions on chromosome 2 (57 Mb, NPL = 2.98; 145 Mb, NPL = 3.09), as well as regions on chromosomes 4 (91 Mb, NPL = 2.97), 16 (20 Mb, NPL = 2.89), and 20 (60 Mb, NPL = 2.99). We conclude that at least some of the linkage peaks we have identified may have been largely undetected in previous whole-genome scans for bipolar disorder because of insufficient coverage or information content, particularly on chromosomes 6q22 and 11p11.

  7. Genomewide Linkage Analysis of Bipolar Disorder by Use of a High-Density Single-Nucleotide–Polymorphism (SNP) Genotyping Assay: A Comparison with Microsatellite Marker Assays and Finding of Significant Linkage to Chromosome 6q22

    PubMed Central

    Middleton, F. A.; Pato, M. T.; Gentile, K. L.; Morley, C. P.; Zhao, X.; Eisener, A. F.; Brown, A.; Petryshen, T. L.; Kirby, A. N.; Medeiros, H.; Carvalho, C.; Macedo, A.; Dourado, A.; Coelho, I.; Valente, J.; Soares, M. J.; Ferreira, C. P.; Lei, M.; Azevedo, M. H.; Kennedy, J. L.; Daly, M. J.; Sklar, P.; Pato, C. N.

    2004-01-01

    We performed a linkage analysis on 25 extended multiplex Portuguese families segregating for bipolar disorder, by use of a high-density single-nucleotide–polymorphism (SNP) genotyping assay, the GeneChip Human Mapping 10K Array (HMA10K). Of these families, 12 were used for a direct comparison of the HMA10K with the traditional 10-cM microsatellite marker set and the more dense 4-cM marker set. This comparative analysis indicated the presence of significant linkage peaks in the SNP assay in chromosomal regions characterized by poor coverage and low information content on the microsatellite assays. The HMA10K provided consistently high information and enhanced coverage throughout these regions. Across the entire genome, the HMA10K had an average information content of 0.842 with 0.21-Mb intermarker spacing. In the 12-family set, the HMA10K-based analysis detected two chromosomal regions with genomewide significant linkage on chromosomes 6q22 and 11p11; both regions had failed to meet this strict threshold with the microsatellite assays. The full 25-family collection further strengthened the findings on chromosome 6q22, achieving genomewide significance with a maximum nonparametric linkage (NPL) score of 4.20 and a maximum LOD score of 3.56 at position 125.8 Mb. In addition to this highly significant finding, several other regions of suggestive linkage have also been identified in the 25-family data set, including two regions on chromosome 2 (57 Mb, NPL = 2.98; 145 Mb, NPL = 3.09), as well as regions on chromosomes 4 (91 Mb, NPL = 2.97), 16 (20 Mb, NPL = 2.89), and 20 (60 Mb, NPL = 2.99). We conclude that at least some of the linkage peaks we have identified may have been largely undetected in previous whole-genome scans for bipolar disorder because of insufficient coverage or information content, particularly on chromosomes 6q22 and 11p11. PMID:15060841

  8. Development of high-density SNP genotyping arrays for white spruce (Picea glauca) and transferability to subtropical and nordic congeners.

    PubMed

    Pavy, Nathalie; Gagnon, France; Rigault, Philippe; Blais, Sylvie; Deschênes, Astrid; Boyle, Brian; Pelgas, Betty; Deslauriers, Marie; Clément, Sébastien; Lavigne, Patricia; Lamothe, Manuel; Cooke, Janice E K; Jaramillo-Correa, Juan P; Beaulieu, Jean; Isabel, Nathalie; Mackay, John; Bousquet, Jean

    2013-03-01

    High-density SNP genotyping arrays can be designed for any species given sufficient sequence information of high quality. Two high-density SNP arrays relying on the Infinium iSelect technology (Illumina) were designed for use in the conifer white spruce (Picea glauca). One array contained 7338 segregating SNPs representative of 2814 genes of various molecular functional classes for main uses in genetic association and population genetics studies. The other one contained 9559 segregating SNPs representative of 9543 genes for main uses in population genetics, linkage mapping of the genome and genomic prediction. The SNPs assayed were discovered from various sources of gene resequencing data. SNPs predicted from high-quality sequences derived from genomic DNA reached a genotyping success rate of 64.7%. Nonsingleton in silico SNPs (i.e. a sequence polymorphism present in at least two reads) predicted from expressed sequenced tags obtained with the Roche 454 technology and Illumina GAII analyser resulted in a similar genotyping success rate of 71.6% when the deepest alignment was used and the most favourable SNP probe per gene was selected. A variable proportion of these SNPs was shared by other nordic and subtropical spruce species from North America and Europe. The number of shared SNPs was inversely proportional to phylogenetic divergence and standing genetic variation in the recipient species, but positively related to allele frequency in P. glauca natural populations. These validated SNP resources should open up new avenues for population genetics and comparative genetic mapping at a genomic scale in spruce species.

  9. An unbiased resource of novel SNP markers provides a new chronology for the human Y chromosome and reveals a deep phylogenetic structure in Africa.

    PubMed

    Scozzari, Rosaria; Massaia, Andrea; Trombetta, Beniamino; Bellusci, Giovanna; Myres, Natalie M; Novelletto, Andrea; Cruciani, Fulvio

    2014-03-01

    Sequence diversity and the ages of the deepest nodes of the MSY phylogeny remain largely unexplored due to the severely biased collection of SNPs available for study. We characterized 68 worldwide Y chromosomes by high-coverage next-generation sequencing, including 18 deep-rooting ones, and identified 2386 SNPs, 80% of which were novel. Many aspects of this pool of variants resembled the pattern observed among genome-wide de novo events, suggesting that in the MSY, a large proportion of newly arisen alleles has survived in the phylogeny. Some degree of purifying selection emerged in the form of an excess of private missense variants. Our tree recapitulated the previously known topology, but the relative lengths of major branches were drastically modified and the associated node ages were remarkably older. We found significantly different branch lengths when comparing the rare deep-rooted A1b African lineage with the rest of the tree. Our dating results and phylogeography led to the following main conclusions: (1) Patrilineal lineages with ages approaching those of early AMH fossils survive today only in central-western Africa; (2) only a few evolutionarily successful MSY lineages survived between 160 and 115 kya; and (3) an early exit out of Africa (before 70 kya), which fits recent western Asian archaeological evidence, should be considered. Our experimental design produced an unbiased resource of new MSY markers informative for the initial formation of the anatomically modern human gene pool, i.e., a period of our evolution that had been previously considered to be poorly accessible with paternally inherited markers.

  10. Parentage Reconstruction in Eucalyptus nitens Using SNPs and Microsatellite Markers: A Comparative Analysis of Marker Data Power and Robustness

    PubMed Central

    Telfer, Emily J.; Stovold, Grahame T.; Li, Yongjun; Silva-Junior, Orzenil B.; Grattapaglia, Dario G.; Dungey, Heidi S.

    2015-01-01

    Pedigree reconstruction using molecular markers enables efficient management of inbreeding in open-pollinated breeding strategies, replacing expensive and time-consuming controlled pollination. This is particularly useful in preferentially outcrossed, insect pollinated Eucalypts known to suffer considerable inbreeding depression from related matings. A single nucleotide polymorphism (SNP) marker panel consisting of 106 markers was selected for pedigree reconstruction from the recently developed high-density Eucalyptus Infinium SNP chip (EuCHIP60K). The performance of this SNP panel for pedigree reconstruction in open-pollinated progenies of two Eucalyptus nitens seed orchards was compared with that of two microsatellite panels with 13 and 16 markers respectively. The SNP marker panel out-performed one of the microsatellite panels in the resolution power to reconstruct pedigrees and out-performed both panels with respect to data quality. Parentage of all but one offspring in each clonal seed orchard was correctly matched to the expected seed parent using the SNP marker panel, whereas parentage assignment to less than a third of the expected seed parents were supported using the 13-microsatellite panel. The 16-microsatellite panel supported all but one of the recorded seed parents, one better than the SNP panel, although there was still a considerable level of missing and inconsistent data. SNP marker data was considerably superior to microsatellite data in accuracy, reproducibility and robustness. Although microsatellites and SNPs data provide equivalent resolution for pedigree reconstruction, microsatellite analysis requires more time and experience to deal with the uncertainties of allele calling and faces challenges for data transferability across labs and over time. While microsatellite analysis will continue to be useful for some breeding tasks due to the high information content, existing infrastructure and low operating costs, the multi-species SNP resource

  11. Parentage Reconstruction in Eucalyptus nitens Using SNPs and Microsatellite Markers: A Comparative Analysis of Marker Data Power and Robustness.

    PubMed

    Telfer, Emily J; Stovold, Grahame T; Li, Yongjun; Silva-Junior, Orzenil B; Grattapaglia, Dario G; Dungey, Heidi S

    2015-01-01

    Pedigree reconstruction using molecular markers enables efficient management of inbreeding in open-pollinated breeding strategies, replacing expensive and time-consuming controlled pollination. This is particularly useful in preferentially outcrossed, insect pollinated Eucalypts known to suffer considerable inbreeding depression from related matings. A single nucleotide polymorphism (SNP) marker panel consisting of 106 markers was selected for pedigree reconstruction from the recently developed high-density Eucalyptus Infinium SNP chip (EuCHIP60K). The performance of this SNP panel for pedigree reconstruction in open-pollinated progenies of two Eucalyptus nitens seed orchards was compared with that of two microsatellite panels with 13 and 16 markers respectively. The SNP marker panel out-performed one of the microsatellite panels in the resolution power to reconstruct pedigrees and out-performed both panels with respect to data quality. Parentage of all but one offspring in each clonal seed orchard was correctly matched to the expected seed parent using the SNP marker panel, whereas parentage assignment to less than a third of the expected seed parents were supported using the 13-microsatellite panel. The 16-microsatellite panel supported all but one of the recorded seed parents, one better than the SNP panel, although there was still a considerable level of missing and inconsistent data. SNP marker data was considerably superior to microsatellite data in accuracy, reproducibility and robustness. Although microsatellites and SNPs data provide equivalent resolution for pedigree reconstruction, microsatellite analysis requires more time and experience to deal with the uncertainties of allele calling and faces challenges for data transferability across labs and over time. While microsatellite analysis will continue to be useful for some breeding tasks due to the high information content, existing infrastructure and low operating costs, the multi-species SNP resource

  12. Development of allele-specific primer PCR for a swine TLR2 SNP and comparison of the frequency among several pig breeds of Japan and the Czech Republic.

    PubMed

    Muneta, Yoshihiro; Minagawa, Yu; Kusumoto, Masahiro; Shinkai, Hiroki; Uenishi, Hirohide; Splichal, Igor

    2012-05-01

    In the present study, we have developed an allele-specific primer-polymerase chain reaction (ASP-PCR) for genotyping a single nucleotide polymorphism (SNP) of swine Toll-like receptor 2 (TLR2) (C406G), which is related to the prevalence of pneumonia caused by Mycoplasma hyopneumoniae. We also compared the allele frequency among several pig breeds of Japan and the Czech Republic. Allele-specific primers were constructed by introducing 1-base mismatch sequence before the SNP site. The swine TLR2 C406G mutation was successfully determined by the ASP-PCR using genomic DNA samples in Japan as previously genotyped by a sequencing method. Using the PCR condition determined, genomic DNA samples from pig blood obtained from 110 pigs from 7 different breeds in the Czech Republic were genotyped by the ASP-PCR. The genotyping results from the ASP-PCR were completely matched with the results from the sequencing method. The allele frequency of the swine TLR2 C406G mutation was 27.5% in the Czech Republic and 3.6% in Japan. The C406G mutation was only found in the Landrace breed in Japan, and was almost exclusively found in the Landrace breed in the Czech Republic as well. These results indicated the usefulness of ASP-PCR for detecting a specific SNP for swine TLR2.

  13. Early Markers of Vulnerable Language Skill Development in Galactosaemia

    ERIC Educational Resources Information Center

    Lewis, Fiona M.; Coman, David J.; Syrmis, Maryanne

    2014-01-01

    There are no known biomedical or genetic markers to identify which infants with galactosaemia (GAL) are most at risk of poor language skill development, yet pre-linguistic communicative "red flag" behaviours are recognised as early identifiers of heightened vulnerability to impaired language development. We report on pre-linguistic…

  14. Early Markers of Vulnerable Language Skill Development in Galactosaemia

    ERIC Educational Resources Information Center

    Lewis, Fiona M.; Coman, David J.; Syrmis, Maryanne

    2014-01-01

    There are no known biomedical or genetic markers to identify which infants with galactosaemia (GAL) are most at risk of poor language skill development, yet pre-linguistic communicative "red flag" behaviours are recognised as early identifiers of heightened vulnerability to impaired language development. We report on pre-linguistic…

  15. SNP-VISTA

    SciTech Connect

    Shah, Nameeta; Teplitsky, Michael; Minovitsky, Simon; Dubchak, Inna

    2005-11-07

    SNP-VISTA aids in analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) Mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNPs data.

  16. Allelic diversity of a beer haze active protein gene in cultivated and Tibetan wild barley and development of allelic specific markers.

    PubMed

    Ye, Lingzhen; Dai, Fei; Qiu, Long; Sun, Dongfa; Zhang, Guoping

    2011-07-13

    The formation of haze is a serious quality problem in beer production. It has been shown that the use of silica elute (SE)-ve malt (absence of molecular weight (MW) ∼14000 Da) for brewing can improve haze stability in the resultant beer, and the protein was identified as a barley trypsin inhibitor of the chloroform/methanol type (BTI-CMe). The objectives of this study were to determine (1) the allelic diversity of the gene controlling BTI-CMe in cultivated and Tibetan wild barley and (2) allele-specific (AS) markers for screening SE protein type. A survey of 172 Tibetan annual wild barley accessions and 71 cultivated barley genotypes was conducted, and 104 wild accessions and 35 cultivated genotypes were identified as SE+ve and 68 wild accessions and 36 cultivated genotypes as SE-ve. The allelic diversity of the gene controlling BTI-CMe was investigated by cloning, alignment, and association analysis. It was found that there were significant differences between the SE+ve and SE-ve types in single-nucleotide polymorphisms at 234 (SNP(234)), SNP(313), and SNP(385.) Furthermore, two sets of AS markers were developed to screen SE protein type based on SNP(313). AS-PCR had results very similar to those obtained by immunoblot method. Mapping analysis showed that the gene controlling the MW∼14 kDa band was located on the short arm of chromosome 3H, at the position of marker BPB-0527 (33.302 cM) in the Franklin/Yerong DH population.

  17. Inferring Loss-of-Heterozygosity from Unpaired Tumors Using High-Density Oligonucleotide SNP Arrays

    PubMed Central

    Park, Yuhyun; Hao, Ke; Zhao, Xiaojun; Garraway, Levi A; Fox, Edward A; Hochberg, Ephraim P; Mellinghoff, Ingo K; Hofer, Matthias D; Descazeaud, Aurelien; Rubin, Mark A; Meyerson, Matthew; Wong, Wing Hung; Sellers, William R; Li, Cheng

    2006-01-01

    Loss of heterozygosity (LOH) of chromosomal regions bearing tumor suppressors is a key event in the evolution of epithelial and mesenchymal tumors. Identification of these regions usually relies on genotyping tumor and counterpart normal DNA and noting regions where heterozygous alleles in the normal DNA become homozygous in the tumor. However, paired normal samples for tumors and cell lines are often not available. With the advent of oligonucleotide arrays that simultaneously assay thousands of single-nucleotide polymorphism (SNP) markers, genotyping can now be done at high enough resolution to allow identification of LOH events by the absence of heterozygous loci, without comparison to normal controls. Here we describe a hidden Markov model-based method to identify LOH from unpaired tumor samples, taking into account SNP intermarker distances, SNP-specific heterozygosity rates, and the haplotype structure of the human genome. When we applied the method to data genotyped on 100 K arrays, we correctly identified 99% of SNP markers as either retention or loss. We also correctly identified 81% of the regions of LOH, including 98% of regions greater than 3 megabases. By integrating copy number analysis into the method, we were able to distinguish LOH from allelic imbalance. Application of this method to data from a set of prostate samples without paired normals identified known regions of prevalent LOH. We have developed a method for analyzing high-density oligonucleotide SNP array data to accurately identify of regions of LOH and retention in tumors without the need for paired normal samples. PMID:16699594

  18. Application of next-generation sequencing for rapid marker development in molecular plant breeding: a case study on anthracnose disease resistance in Lupinus angustifolius L.

    PubMed

    Yang, Huaan; Tao, Ye; Zheng, Zequn; Li, Chengdao; Sweetingham, Mark W; Howieson, John G

    2012-07-17

    In the last 30 years, a number of DNA fingerprinting methods such as RFLP, RAPD, AFLP, SSR, DArT, have been extensively used in marker development for molecular plant breeding. However, it remains a daunting task to identify highly polymorphic and closely linked molecular markers for a target trait for molecular marker-assisted selection. The next-generation sequencing (NGS) technology is far more powerful than any existing generic DNA fingerprinting methods in generating DNA markers. In this study, we employed a grain legume crop Lupinus angustifolius (lupin) as a test case, and examined the utility of an NGS-based method of RAD (restriction-site associated DNA) sequencing as DNA fingerprinting for rapid, cost-effective marker development tagging a disease resistance gene for molecular breeding. Twenty informative plants from a cross of RxS (disease resistant x susceptible) in lupin were subjected to RAD single-end sequencing by multiplex identifiers. The entire RAD sequencing products were resolved in two lanes of the 16-lanes per run sequencing platform Solexa HiSeq2000. A total of 185 million raw reads, approximately 17 Gb of sequencing data, were collected. Sequence comparison among the 20 test plants discovered 8207 SNP markers. Filtration of DNA sequencing data with marker identification parameters resulted in the discovery of 38 molecular markers linked to the disease resistance gene Lanr1. Five randomly selected markers were converted into cost-effective, simple PCR-based markers. Linkage analysis using marker genotyping data and disease resistance phenotyping data on a F8 population consisting of 186 individual plants confirmed that all these five markers were linked to the R gene. Two of these newly developed sequence-specific PCR markers, AnSeq3 and AnSeq4, flanked the target R gene at a genetic distance of 0.9 centiMorgan (cM), and are now replacing the markers previously developed by a traditional DNA fingerprinting method for marker

  19. Application of next-generation sequencing for rapid marker development in molecular plant breeding: a case study on anthracnose disease resistance in Lupinus angustifolius L.

    PubMed Central

    2012-01-01

    Background In the last 30 years, a number of DNA fingerprinting methods such as RFLP, RAPD, AFLP, SSR, DArT, have been extensively used in marker development for molecular plant breeding. However, it remains a daunting task to identify highly polymorphic and closely linked molecular markers for a target trait for molecular marker-assisted selection. The next-generation sequencing (NGS) technology is far more powerful than any existing generic DNA fingerprinting methods in generating DNA markers. In this study, we employed a grain legume crop Lupinus angustifolius (lupin) as a test case, and examined the utility of an NGS-based method of RAD (restriction-site associated DNA) sequencing as DNA fingerprinting for rapid, cost-effective marker development tagging a disease resistance gene for molecular breeding. Results Twenty informative plants from a cross of RxS (disease resistant x susceptible) in lupin were subjected to RAD single-end sequencing by multiplex identifiers. The entire RAD sequencing products were resolved in two lanes of the 16-lanes per run sequencing platform Solexa HiSeq2000. A total of 185 million raw reads, approximately 17 Gb of sequencing data, were collected. Sequence comparison among the 20 test plants discovered 8207 SNP markers. Filtration of DNA sequencing data with marker identification parameters resulted in the discovery of 38 molecular markers linked to the disease resistance gene Lanr1. Five randomly selected markers were converted into cost-effective, simple PCR-based markers. Linkage analysis using marker genotyping data and disease resistance phenotyping data on a F8 population consisting of 186 individual plants confirmed that all these five markers were linked to the R gene. Two of these newly developed sequence-specific PCR markers, AnSeq3 and AnSeq4, flanked the target R gene at a genetic distance of 0.9 centiMorgan (cM), and are now replacing the markers previously developed by a traditional DNA fingerprinting method for

  20. Development of microsatellite markers for Carallia brachiata (Rhizophoraceae)1

    PubMed Central

    Qiang, Yinmeng; Xie, Hongxian; Qiao, Sitan; Yuan, Yang; Liu, Ying; Shi, Xianggang; Shu, Mi; Jin, Jianhua; Shi, Suhua; Tan, Fengxiao; Huang, Yelin

    2015-01-01

    Premise of the study: Microsatellite markers were developed for Carallia brachiata to assess the genetic diversity and structure of this terrestrial species of the Rhizophoraceae. Methods and Results: Based on transcriptome data for C. brachiata, 40 primer pairs were initially designed and tested, of which 18 were successfully amplified and 11 were polymorphic. For these microsatellites, one to three alleles per locus were identified. The observed and expected heterozygosities ranged from 0 to 0.727 and 0 to 0.520, respectively. In addition, all primers were successfully amplified in two congeners: C. pectinifolia and C. garciniifolia. Conclusions: The microsatellite markers described here will be useful in population genetic studies of C. brachiata and related species, suggesting that developing microsatellite markers from next-generation sequencing data can be efficient for genetic studies across this genus. PMID:25798345

  1. Development and characterization of SSR markers for Aster savatieri (Asteraceae).

    PubMed

    Ishikawa, Naoko; Sakaguchi, Shota; Ito, Motomi

    2016-06-01

    Simple sequence repeat (SSR) markers were developed for Aster savatieri (Asteraceae) and the serpentine variety A. savatieri var. pygmaeus to re-evaluate their taxonomic status. Using RNA-Seq data, 22 expressed sequence tag (EST)-SSR markers were developed. Polymorphisms were assessed in A. savatieri and in A. savatieri var. pygmaeus. The average number of alleles ranged from four to 15, and expected heterozygosity ranged from 0.417 to 0.870. Transferability was examined in six representative species of Japanese Aster and in Solidago virgaurea subsp. asiatica var. asiatica, a member of the tribe Astereae (Asteraceae); most of the loci were transferable to these examined species. These markers will be useful for genetic studies of variation in A. savatieri and other Aster species that occur in Japan.

  2. SNP discovery and genetic mapping using genotyping by sequencing of whole genome genomic DNA from a pea RIL population.

    PubMed

    Boutet, Gilles; Alves Carvalho, Susete; Falque, Matthieu; Peterlongo, Pierre; Lhuillier, Emeline; Bouchez, Olivier; Lavaud, Clément; Pilet-Nayel, Marie-Laure; Rivière, Nathalie; Baranger, Alain

    2016-02-18

    Progress in genetics and breeding in pea still suffers from the limited availability of molecular resources. SNP markers that can be identified through affordable sequencing processes, without the need for prior genome reduction or a reference genome to assemble sequencing data would allow the discovery and genetic mapping of thousands of molecular markers. Such an approach could significantly speed up genetic studies and marker assisted breeding for non-model species. A total of 419,024 SNPs were discovered using HiSeq whole genome sequencing of four pea lines, followed by direct identification of SNP markers without assembly using the discoSnp tool. Subsequent filtering led to the identification of 131,850 highly designable SNPs, polymorphic between at least two of the four pea lines. A subset of 64,754 SNPs was called and genotyped by short read sequencing on a subpopulation of 48 RILs from the cross 'Baccara' x 'PI180693'. This data was used to construct a WGGBS-derived pea genetic map comprising 64,263 markers. This map is collinear with previous pea consensus maps and therefore with the Medicago truncatula genome. Sequencing of four additional pea lines showed that 33 % to 64 % of the mapped SNPs, depending on the pairs of lines considered, are polymorphic and can therefore be useful in other crosses. The subsequent genotyping of a subset of 1000 SNPs, chosen for their mapping positions using a KASP™ assay, showed that almost all generated SNPs are highly designable and that most (95 %) deliver highly qualitative genotyping results. Using rather low sequencing coverages in SNP discovery and in SNP inferring did not hinder the identification of hundreds of thousands of high quality SNPs. The development and optimization of appropriate tools in SNP discovery and genetic mapping have allowed us to make available a massive new genomic resource in pea. It will be useful for both fine mapping within chosen QTL confidence intervals and marker assisted breeding for

  3. MarkerMiner 1.0: A new application for phylogenetic marker development using angiosperm transcriptomes.

    PubMed

    Chamala, Srikar; García, Nicolás; Godden, Grant T; Krishnakumar, Vivek; Jordon-Thaden, Ingrid E; De Smet, Riet; Barbazuk, W Brad; Soltis, Douglas E; Soltis, Pamela S

    2015-04-01

    Targeted sequencing using next-generation sequencing (NGS) platforms offers enormous potential for plant systematics by enabling economical acquisition of multilocus data sets that can resolve difficult phylogenetic problems. However, because discovery of single-copy nuclear (SCN) loci from NGS data requires both bioinformatics skills and access to high-performance computing resources, the application of NGS data has been limited. We developed MarkerMiner 1.0, a fully automated, open-access bioinformatic workflow and application for discovery of SCN loci in angiosperms. Our new tool identified as many as 1993 SCN loci from transcriptomic data sampled as part of four independent test cases representing marker development projects at different phylogenetic scales. MarkerMiner is an easy-to-use and effective tool for discovery of putative SCN loci. It can be run locally or via the Web, and its tabular and alignment outputs facilitate efficient downstream assessments of phylogenetic utility, locus selection, intron-exon boundary prediction, and primer or probe development.

  4. Design and validation of a 90K SNP genotyping assay for the water buffalo (Bubalus bubalis).

    PubMed

    Iamartino, Daniela; Nicolazzi, Ezequiel L; Van Tassell, Curtis P; Reecy, James M; Fritz-Waters, Eric R; Koltes, James E; Biffani, Stefano; Sonstegard, Tad S; Schroeder, Steven G; Ajmone-Marsan, Paolo; Negrini, Riccardo; Pasquariello, Rolando; Ramelli, Paola; Coletta, Angelo; Garcia, José F; Ali, Ahmad; Ramunno, Luigi; Cosenza, Gianfranco; de Oliveira, Denise A A; Drummond, Marcela G; Bastianetto, Eduardo; Davassi, Alessandro; Pirani, Ali; Brew, Fiona; Williams, John L

    2017-01-01

    The availability of the bovine genome sequence and SNP panels has improved various genomic analyses, from exploring genetic diversity to aiding genetic selection. However, few of the SNP on the bovine chips are polymorphic in buffalo, therefore a panel of single nucleotide DNA markers exclusive for buffalo was necessary for molecular genetic analyses and to develop genomic selection approaches for water buffalo. The creation of a 90K SNP panel for river buffalo and testing in a genome wide association study for milk production is described here. The genomes of 73 buffaloes of 4 different breeds were sequenced and aligned against the bovine genome, which facilitated the identification of 22 million of sequence variants among the buffalo genomes. Based on frequencies of variants within and among buffalo breeds, and their distribution across the genome, inferred from the bovine genome sequence, 90,000 putative single nucleotide polymorphisms were selected to create an Axiom® Buffalo Genotyping Array 90K. This 90K "SNP-Chip" was tested in several river buffalo populations and found to have ∼70% high quality and polymorphic SNPs. Of the 90K SNPs about 24K were also found to be polymorphic in swamp buffalo. The SNP chip was used to investigate the structure of buffalo populations, and could distinguish buffalo from different farms. A Genome Wide Association Study identified genomic regions on 5 chromosomes putatively involved in milk production. The 90K buffalo SNP chip described here is suitable for the analysis of the genomes of river buffalo breeds, and could be used for genetic diversity studies and potentially as a starting point for genome-assisted selection programmes. This SNP Chip could also be used to analyse swamp buffalo, but many loci are not informative and creation of a revised SNP set specific for swamp buffalo would be advised.

  5. Development of a Medium Density Combined-Species SNP Array for Pacific and European Oysters (Crassostrea gigas and Ostrea edulis)

    PubMed Central

    Gutierrez, Alejandro P.; Turner, Frances; Gharbi, Karim; Talbot, Richard; Lowe, Natalie R.; Peñaloza, Carolina; McCullough, Mark; Prodöhl, Paulo A.; Bean, Tim P.; Houston, Ross D.

    2017-01-01

    SNP arrays are enabling tools for high-resolution studies of the genetic basis of complex traits in farmed and wild animals. Oysters are of critical importance in many regions from both an ecological and economic perspective, and oyster aquaculture forms a key component of global food security. The aim of our study was to design a combined-species, medium density SNP array for Pacific oyster (Crassostrea gigas) and European flat oyster (Ostrea edulis), and to test the performance of this array on farmed and wild populations from multiple locations, with a focus on European populations. SNP discovery was carried out by whole-genome sequencing (WGS) of pooled genomic DNA samples from eight C. gigas populations, and restriction site-associated DNA sequencing (RAD-Seq) of 11 geographically diverse O. edulis populations. Nearly 12 million candidate SNPs were discovered and filtered based on several criteria, including preference for SNPs segregating in multiple populations and SNPs with monomorphic flanking regions. An Affymetrix Axiom Custom Array was created and tested on a diverse set of samples (n = 219) showing ∼27 K high quality SNPs for C. gigas and ∼11 K high quality SNPs for O. edulis segregating in these populations. A high proportion of SNPs were segregating in each of the populations, and the array was used to detect population structure and levels of linkage disequilibrium (LD). Further testing of the array on three C. gigas nuclear families (n = 165) revealed that the array can be used to clearly distinguish between both families based on identity-by-state (IBS) clustering parental assignment software. This medium density, combined-species array will be publicly available through Affymetrix, and will be applied for genome-wide association and evolutionary genetic studies, and for genomic selection in oyster breeding programs. PMID:28533337

  6. Development of a Medium Density Combined-Species SNP Array for Pacific and European Oysters (Crassostrea gigas and Ostrea edulis).

    PubMed

    Gutierrez, Alejandro P; Turner, Frances; Gharbi, Karim; Talbot, Richard; Lowe, Natalie R; Peñaloza, Carolina; McCullough, Mark; Prodöhl, Paulo A; Bean, Tim P; Houston, Ross D

    2017-07-05

    SNP arrays are enabling tools for high-resolution studies of the genetic basis of complex traits in farmed and wild animals. Oysters are of critical importance in many regions from both an ecological and economic perspective, and oyster aquaculture forms a key component of global food security. The aim of our study was to design a combined-species, medium density SNP array for Pacific oyster (Crassostrea gigas) and European flat oyster (Ostrea edulis), and to test the performance of this array on farmed and wild populations from multiple locations, with a focus on European populations. SNP discovery was carried out by whole-genome sequencing (WGS) of pooled genomic DNA samples from eight C. gigas populations, and restriction site-associated DNA sequencing (RAD-Seq) of 11 geographically diverse O. edulis populations. Nearly 12 million candidate SNPs were discovered and filtered based on several criteria, including preference for SNPs segregating in multiple populations and SNPs with monomorphic flanking regions. An Affymetrix Axiom Custom Array was created and tested on a diverse set of samples (n = 219) showing ∼27 K high quality SNPs for C. gigas and ∼11 K high quality SNPs for O. edulis segregating in these populations. A high proportion of SNPs were segregating in each of the populations, and the array was used to detect population structure and levels of linkage disequilibrium (LD). Further testing of the array on three C. gigas nuclear families (n = 165) revealed that the array can be used to clearly distinguish between both families based on identity-by-state (IBS) clustering parental assignment software. This medium density, combined-species array will be publicly available through Affymetrix, and will be applied for genome-wide association and evolutionary genetic studies, and for genomic selection in oyster breeding programs. Copyright © 2017 Gutierrez et al.

  7. Development of DArT Marker Platforms and Genetic Diversity Assessment of the U.S. Collection of the New Oilseed Crop Lesquerella and Related Species

    PubMed Central

    Cruz, Von Mark V.; Kilian, Andrzej; Dierig, David A.

    2013-01-01

    The advantages of using molecular markers in modern genebanks are well documented. They are commonly used to understand the distribution of genetic diversity in populations and among species which is crucial for efficient management and effective utilization of germplasm collections. We describe the development of two types of DArT molecular marker platforms for the new oilseed crop lesquerella (Physaria spp.), a member of the Brassicaceae family, to characterize a collection in the National Plant Germplasm System (NPGS) with relatively little known in regards to the genetic diversity and traits. The two types of platforms were developed using a subset of the germplasm conserved ex situ consisting of 87 Physaria and 2 Paysonia accessions. The microarray DArT revealed a total of 2,833 polymorphic markers with an average genotype call rate of 98.4% and a scoring reproducibility of 99.7%. On the other hand, the DArTseq platform developed for SNP and DArT markers from short sequence reads showed a total of 27,748 high quality markers. Cluster analysis and principal coordinate analysis indicated that the different accessions were successfully classified by both systems based on species, by geographical source, and breeding status. In the germplasm set analyzed, which represented more than 80% of the P. fendleri collection, we observed that a substantial amount of variation exists in the species collection. These markers will be valuable in germplasm management studies and lesquerella breeding, and augment the microsatellite markers previously developed on the taxa. PMID:23724020

  8. SNP discrimination through proofreading and OFF-switch of exo+ polymerase.

    PubMed

    Zhang, Jia; Li, Kai; Pardinas, Jose R; Liao, Duan F; Li, Hong J; Zhang, Xu

    2004-05-01

    Single nucleotide polymorphisms (SNPs) are useful physical markers for genetic studies as well as the cause of some genetic diseases. To develop more reliable SNP assays, we examined the underlying molecular mechanisms by which deoxyribonucleic acid (DNA) polymerases with 3' exonuclease activity maintain the high fidelity of DNA replication. In addition to mismatch removal by proofreading, we have discovered a premature termination of polymerization mediated by a novel OFF-switch mechanism. Two SNP assays were developed, one based on proofreading using 3' end-labeled primer extension and the other based on the newly identified OFF-switch, respectively. These two new assays are well suited for conventional techniques, such as electrophoresis and microplates detection systems as well as the sophisticated microchips. Application of these reliable SNP assays will greatly facilitate genetic and biomedical studies in the postgenome era.

  9. Gene-based SNP discovery and genetic mapping in pea.

    PubMed

    Sindhu, Anoop; Ramsay, Larissa; Sanderson, Lacey-Anne; Stonehouse, Robert; Li, Rong; Condie, Janet; Shunmugam, Arun S K; Liu, Yong; Jha, Ambuj B; Diapari, Marwan; Burstin, Judith; Aubert, Gregoire; Tar'an, Bunyamin; Bett, Kirstin E; Warkentin, Thomas D; Sharpe, Andrew G

    2014-10-01

    Gene-based SNPs were identified and mapped in pea using five recombinant inbred line populations segregating for traits of agronomic importance. Pea (Pisum sativum L.) is one of the world's oldest domesticated crops and has been a model system in plant biology and genetics since the work of Gregor Mendel. Pea is the second most widely grown pulse crop in the world following common bean. The importance of pea as a food crop is growing due to its combination of moderate protein concentration, slowly digestible starch, high dietary fiber concentration, and its richness in micronutrients; however, pea has lagged behind other major crops in harnessing recent advances in molecular biology, genomics and bioinformatics, partly due to its large genome size with a large proportion of repetitive sequence, and to the relatively limited investment in research in this crop globally. The objective of this research was the development of a genome-wide transcriptome-based pea single-nucleotide polymorphism (SNP) marker platform using next-generation sequencing technology. A total of 1,536 polymorphic SNP loci selected from over 20,000 non-redundant SNPs identified using deep transcriptome sequencing of eight diverse Pisum accessions were used for genotyping in five RIL populations using an Illumina GoldenGate assay. The first high-density pea SNP map defining all seven linkage groups was generated by integrating with previously published anchor markers. Syntenic relationships of this map with the model legume Medicago truncatula and lentil (Lens culinaris Medik.) maps were established. The genic SNP map establishes a foundation for future molecular breeding efforts by enabling both the identification and tracking of introgression of genomic regions harbouring QTLs related to agronomic and seed quality traits.

  10. Development of SSR markers for the genus Patellifolia (Chenopodiaceae)1

    PubMed Central

    Nachtigall, Marion; Bülow, Lorenz; Schubert, Jörg; Frese, Lothar

    2016-01-01

    Premise of the study: Microsatellite primers were developed to promote studies on the patterns of genetic diversity within Patellifolia patellaris (Chenopodiaceae) and the relationship between the three species of the genus Patellifolia. Methods and Results: The genomic sequence from P. procumbens was screened for simple sequence repeats (SSRs), and 3648 SSRs were identified. A subset of 53 SSR markers was validated, of which 25 proved to be polymorphic in the three species except for the P. webbiana–specific marker JKIPat16. The number of alleles ranged from 85 in P. patellaris, 187 in P. procumbens, and 202 in P. webbiana. Conclusions: The set of 25 new markers will facilitate studies of the relationships between the three Patellifolia species and of the spatial and temporal distribution of genetic diversity within the species. PMID:27610279

  11. Development of fluorescent markers using polycyclic aromatic hydrocarbons with vaseline.

    PubMed

    Kurata, Shoji; Hirano, Haruo; Nagai, Masatoshi

    2002-03-01

    Identifiable fluorescent markers were developed as tracers to tail suspects using phenanthrene, anthracene, fluoranthene, pyrene, perylene, and coronene in vaseline. Vaseline was used as a carrier of the marker. Of the six compounds in the vaseline, perylene and fluoranthene were readily observed under ultraviolet (UV) light at a wavelength of 365 nm. All six compounds were identified selectively and sensitively without interference of vaseline using a high performance liquid chromatograph (HPLC) with a fluorescence detector. The detection limit was much less than 1 ng, corresponding to that of the observation behavior under UV light. The results showed that each component with vaseline was more effective than the individual component for the delay in degradation. The case examples of the fluorescent markers are shown.

  12. An Improved Opposition-Based Learning Particle Swarm Optimization for the Detection of SNP-SNP Interactions.

    PubMed

    Shang, Junliang; Sun, Yan; Li, Shengjun; Liu, Jin-Xing; Zheng, Chun-Hou; Zhang, Junying

    2015-01-01

    SNP-SNP interactions have been receiving increasing attention in understanding the mechanism underlying susceptibility to complex diseases. Though many works have been done for the detection of SNP-SNP interactions, the algorithmic development is still ongoing. In this study, an improved opposition-based learning particle swarm optimization (IOBLPSO) is proposed for the detection of SNP-SNP interactions. Highlights of IOBLPSO are the introduction of three strategies, namely, opposition-based learning, dynamic inertia weight, and a postprocedure. Opposition-based learning not only enhances the global explorative ability, but also avoids premature convergence. Dynamic inertia weight allows particles to cover a wider search space when the considered SNP is likely to be a random one and converges on promising regions of the search space while capturing a highly suspected SNP. The postprocedure is used to carry out a deep search in highly suspected SNP sets. Experiments of IOBLPSO are performed on both simulation data sets and a real data set of age-related macular degeneration, results of which demonstrate that IOBLPSO is promising in detecting SNP-SNP interactions. IOBLPSO might be an alternative to existing methods for detecting SNP-SNP interactions.

  13. Development of molecular markers and preliminary investigation of the population structure and mating system in one lineage of black morel (Morchella elata) in the Pacific Northwestern USA.

    PubMed

    Pagliaccia, Deborah; Douhan, Greg W; Douhan, LeAnn; Peever, Tobin L; Carris, Lori M; Kerrigan, Julia L

    2011-01-01

    Phylogenetic analysis of LSU/ITS sequence data revealed two distinct lineages among 44 morphologically similar fruiting bodies of natural black morels (Morchella elata group) sampled at three non-burn locations in the St Joe and Kanisku National Forests in northern Idaho. Most of the sampled isolates (n = 34) represented a dominant LSU/ITS haplotype present at all three sites and identical to the Mel-12 phylogenetic lineage (GU551425) identified in a previous study. Variation at 1-3 nucleotide sites was detected among a small number of isolates (n = 6) within this well supported clade (94%). Four isolates sampled from a single location were in a well supported clade (97%) distinct from the dominant haplotypes and may represent a previously un-sampled, cryptic phylogenetic species. Species-specific SNP and SCAR markers were developed for Mel-12 lineage isolates by cloning and sequencing AFLP amplicons, and segregation of AFLP markers were studied from single ascospore isolates from individual fruiting bodies. Based on the segregation of AFLP markers within single fruiting bodies, split decomposition analyses of two SCAR markers, and population genetic analyses of SNP, SCAR, and AFLP markers, it appears that members of the Morchella sp. Mel-12 phylogenetic lineage are heterothallic and outcross in nature similar to yellow morels. This is the first set of locus-specific molecular markers that has been developed for any Morchella species, to our knowledge. These markers will prove to be valuable tools to study mating system, gene flow and genetic structure of black morels at various spatial scales with field-collected fruiting bodies and eliminate the need to culture samples in vitro.

  14. A second generation SNP and SSR integrated linkage map and QTL mapping for the Chinese mitten crab Eriocheir sinensis

    PubMed Central

    Qiu, Gao-Feng; Xiong, Liang-Wei; Han, Zhi-Ke; Liu, Zhi-Qiang; Feng, Jian-Bin; Wu, Xu-Gan; Yan, Yin-Long; Shen, Hong; Huang, Long; Chen, Li

    2017-01-01

    The Chinese mitten crab Eriocheir sinensis is the most economically important cultivated crab species in China, and its genome has a high number of chromosomes (2n = 146). To obtain sufficient markers for construction of a dense genetic map for this species, we employed the recently developed specific-locus amplified fragment sequencing (SLAF-seq) method for large-scale SNPs screening and genotyping in a F1 full-sib family of 149 individuals. SLAF-seq generated 127,677 polymorphic SNP markers, of which 20,803 valid markers were assigned into five segregation types and were used together with previous SSR markers for linkage map construction. The final integrated genetic map included 17,680 SNP and 629 SSR markers on the 73 linkage groups (LG), and spanned 14,894.9 cM with an average marker interval of 0.81 cM. QTL mapping localized three significant growth-related QTL to a 1.2 cM region in LG53 as well as 146 sex-linked markers in LG48. Genome-wide QTL-association analysis further identified four growth-related QTL genes named LNX2, PAK2, FMRFamide and octopamine receptors. These genes are involved in a variety of different signaling pathways including cell proliferation and growth. The map and SNP markers described here will be a valuable resource for the E. sinensis genome project and selective breeding programs. PMID:28045132

  15. DNA sequences of Pima (Gossypium barbadense L.) cotton leaf for examining transcriptome diversity and SNP biomarker discovery

    USDA-ARS?s Scientific Manuscript database

    As an initial step to explore the transcriptome genetic diversity and to discover single nucleotide polymorphic (SNP)-biomarkers for marker assisted breeding within Pima (Gossypium barbadense L.) cotton, leaves from 25 day plants of three diverse genotypes were used to develop cDNA libraries. Using ...

  16. A Novel Test for Detecting SNP-SNP Interactions in Case-Only Trio Studies.

    PubMed

    Balliu, Brunilda; Zaitlen, Noah

    2016-04-01

    Epistasis plays a significant role in the genetic architecture of many complex phenotypes in model organisms. To date, there have been very few interactions replicated in human studies due in part to the multiple-hypothesis burden implicit in genome-wide tests of epistasis. Therefore, it is of paramount importance to develop the most powerful tests possible for detecting interactions. In this work we develop a new SNP-SNP interaction test for use in case-only trio studies called the trio correlation (TC) test. The TC test computes the expected joint distribution of marker pairs in offspring conditional on parental genotypes. This distribution is then incorporated into a standard 1 d.f. correlation test of interaction. We show via extensive simulations under a variety of disease models that our test substantially outperforms existing tests of interaction in case-only trio studies. We also demonstrate a bias in a previous case-only trio interaction test and identify its origin. Finally, we show that a previously proposed permutation scheme in trio studies mitigates the known biases of case-only tests in the presence of population stratification. We conclude that the TC test shows improved power to identify interactions in existing, as well as emerging, trio association studies. The method is publicly available at www.github.com/BrunildaBalliu/TrioEpi.

  17. High-throughput SNP genotyping for breeding applications in rice using the BeadXpress platform

    USDA-ARS?s Scientific Manuscript database

    Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...

  18. Exploration of SNP variants affecting hair colour prediction in Europeans.

    PubMed

    Söchtig, Jens; Phillips, Chris; Maroñas, Olalla; Gómez-Tato, Antonio; Cruz, Raquel; Alvarez-Dios, Jose; de Cal, María-Ángeles Casares; Ruiz, Yarimar; Reich, Kristian; Fondevila, Manuel; Carracedo, Ángel; Lareu, María V

    2015-09-01

    DNA profiling is a key tool for forensic analysis; however, current methods identify a suspect either by direct comparison or from DNA database searches. In cases with unidentified suspects, prediction of visible physical traits e.g. pigmentation or hair distribution of the DNA donors can provide important probative information. This study aimed to explore single nucleotide polymorphism (SNP) variants for their effect on hair colour prediction. A discovery panel of 63 SNPs consisting of already established hair colour markers from the HIrisPlex hair colour phenotyping assay as well as additional markers for which associations to human pigmentation traits were previously identified was used to develop multiplex assays based on SNaPshot single-base extension technology. A genotyping study was performed on a range of European populations (n = 605). Hair colour phenotyping was accomplished by matching donor's hair to a graded colour category system of reference shades and photography. Since multiple SNPs in combination contribute in varying degrees to hair colour predictability in Europeans, we aimed to compile a compact marker set that could provide a reliable hair colour inference from the fewest SNPs. The predictive approach developed uses a naïve Bayes classifier to provide hair colour assignment probabilities for the SNP profiles of the key SNPs and was embedded into the Snipper online SNP classifier ( http://mathgene.usc.es/snipper/ ). Results indicate that red, blond, brown and black hair colours are predictable with informative probabilities in a high proportion of cases. Our study resulted in the identification of 12 most strongly associated SNPs to hair pigmentation variation in six genes.

  19. Developing biochemical and molecular markers for cyanobacterial inoculants.

    PubMed

    Prasanna, R; Madhan, K; Singh, R N; Chauhan, A K; Nain, L

    2010-09-01

    Markers for evaluating the establishment of cyanobacteria based on their sensitivity or resistance to antibiotics, saccharide utilization patterns and PCR generated fingerprints were developed. Four selected strains (isolates from rhizosphere soils of diverse agro-ecosystems) have shown potential as diazotrophs and exhibited plant growth promoting abilities. Different responses were obtained on screening against 40 antibiotics, which aided in developing selectable antibiotic markers for each strain. Biochemical profiles generated using standardized chromogenic identification system (including saccharide utilization tests) revealed that 53 % of the saccharides tested were not utilized by any strain, while some strains exhibited unique ability for utilization of saccharides such as melibiose, cellobiose, maltose and glucosamine. PCR based amplification profiles developed using a number of primers based on repeat sequences revealed the utility of 3 primers in providing unique fingerprints for the strains.

  20. SNPMeta: SNP annotation and SNP metadata collection without a reference genome

    USDA-ARS?s Scientific Manuscript database

    The increase in availability of resequencing data is greatly accelerating SNP discovery and has facilitated the development of SNP genotyping assays. This, in turn, is increasing interest in annotation of individual SNPs. Currently, these data are only available through curation, or comparison to a ...

  1. HapRice, an SNP haplotype database and a web tool for rice.

    PubMed

    Yonemaru, Jun-ichi; Ebana, Kaworu; Yano, Masahiro

    2014-01-01

    Genome-wide single nucleotide polymorphism (SNP) analysis is a promising tool to examine the genetic diversity of rice populations and genetic traits of scientific and economic importance. Next-generation sequencing technology has accelerated the re-sequencing of diverse rice varieties and the discovery of genome-wide SNPs. Notably, validation of these SNPs by a high-throughput genotyping system, such as an SNP array, could provide a manageable and highly accurate SNP set. To enhance the potential utility of genome-wide SNPs for geneticists and breeders, analysis tools need to be developed. Here, we constructed an SNP haplotype database, which allows visualization of the allele frequency of all SNPs in the genome browser. We calculated the allele frequencies of 3,334 SNPs in 76 accessions from the world rice collection and 3,252 SNPs in 177 Japanese rice accessions; all these SNPs have been validated in our previous studies. The SNP haplotypes were defined by the allele frequency in each cultivar group (aus, indica, tropical japonica and temperate japonica) for the world rice accessions, and in non-irrigated and three irrigated groups (three variety registration periods) for Japanese rice accessions. We also developed web tools for finding polymorphic SNPs between any two rice accessions and for the primer design to develop cleaved amplified polymorphic sequence markers at any SNP. The 'HapRice' database and the web tools can be accessed at http://qtaro.abr.affrc.go.jp/index.html. In addition, we established a core SNP set consisting of 768 SNPs uniformly distributed in the rice genome; this set is of a practically appropriate size for use in rice genetic analysis.

  2. Transcriptome sequencing, and rapid development and application of SNP markers for the legume pod borer Maruca vitrata (Lepidoptera: Crambidae)

    USDA-ARS?s Scientific Manuscript database

    The legume pod borer, Maruca vitrata (Lepidoptera: Crambidae), is an insect pest species that is destructive to crops grown by subsistence farmers in tropical regions of West Africa. We present the de novo assembly of 3729 contigs from 454- and Sanger-derived sequencing reads for midgut, salivary, ...

  3. Barley stripe rust resistance QTL: Development and validation of SNP markers for resistance to Puccinia striiformis f. sp. hordei

    USDA-ARS?s Scientific Manuscript database

    Quantitative trait loci (QTL) linked with seedling and field resistance to barley stripe rust were mapped in 156 recombinant inbred lines (RILs) derived from a Lenetah by Grannelose Zweizeilige (GZ) cross. A major QTL for seedling resistance on chromosome 4H (LOD = 15.94 at 97.19 cM) was identified,...

  4. Discovery and development of integrative biological markers for schizophrenia.

    PubMed

    Oertel-Knöchel, Viola; Bittner, Robert A; Knöchel, Christian; Prvulovic, David; Hampel, Harald

    2011-12-01

    Schizophrenia is one of the most disabling forms of mental illness. One of the most important challenges is to establish biological markers which can accurately identify at-risk individuals in preclinical stages and thus improve the effects of early intervention strategies. Here, we review recent findings in the field of molecular genetics, CSF (cerebrospinal fluid) based markers as well as structural and functional neuroimaging in the light of their relevance for schizophrenia biomarker research. We also examine evidence supporting the hypothesis that schizophrenia and neurodegenerative disorders such as Alzheimer's disease may share certain pathophysiological features, e.g. chronic inflammation and oxidative stress, and discuss their possible role in schizophrenia. The heterogeneous, multifaceted and multifactorial nature of the traditionally clinically operationalized entity "schizophrenia" presents an enormous challenge towards the identification of single diagnostic or surrogate markers. We propose that abnormal neural coordination is a major point of convergence of a number of crucial pathophysiological pathways. Therefore, functional markers reflecting disturbed neural coordination might be particularly attractive biomarker candidates, because of their ability to integrate the influence of diverse pathophysiological mechanisms. Similarly, combinatorial and multimodal approaches may be a promising way to more accurately capture the complex biological underpinnings schizophrenia. We consider the development of such integrative biomarkers to be essential in order to facilitate a timely diagnosis of schizophrenia. They should also advance our understanding of the subtle and intricate biological nature of schizophrenia. Copyright © 2011 Elsevier Ltd. All rights reserved.

  5. Characterization of the miiuy croaker (Miichthys miiuy) transcriptome and development of immune-relevant genes and molecular markers.

    PubMed

    Che, Rongbo; Sun, Yueyan; Sun, Dianqiao; Xu, Tianjun

    2014-01-01

    The miiuy croaker (Miichthys miiuy) is an important species of marine fish that supports capture fisheries and aquaculture. At present commercial scale aquaculture of this species is limited due to diseases caused by pathogens and parasites which restrict production and limit commercial value. The lack of transcriptomic and genomic information for the miiuy croaker limits the ability of researchers to study the pathogenesis and immune system of this species. In this study we constructed a cDNA library from liver, spleen and kidney which was sequenced using Illumina paired-end sequencing to enable gene discovery and molecular marker development. In our study, a total of 69,071 unigenes with an average length of 572 bp were obtained. Of these, 45,676 (66.13%) were successfully annotated in public databases. The unigenes were also annotated with Gene Ontology, Clusters of Orthologous Groups and KEGG pathways. Additionally, 498 immune-relevant genes were identified and classified. Furthermore, 14,885 putative simple sequence repeats (cSSRs) and 8,510 putative single nucleotide polymorphisms (SNPs) were identified from the 69,071 unigenes. The miiuy croaker (Miichthys miiuy) transcriptome data provides a large resource to identify new genes involved in many processes including those involved in the response to pathogens and diseases. Furthermore, the thousands of potential cSSR and SNP markers found in this study are important resources with respect to future development of molecular marker assisted breeding programs for the miiuy croaker.

  6. Characterization of the Miiuy Croaker (Miichthys miiuy) Transcriptome and Development of Immune-Relevant Genes and Molecular Markers

    PubMed Central

    Che, Rongbo; Sun, Yueyan; Sun, Dianqiao; Xu, Tianjun

    2014-01-01

    Background The miiuy croaker (Miichthys miiuy) is an important species of marine fish that supports capture fisheries and aquaculture. At present commercial scale aquaculture of this species is limited due to diseases caused by pathogens and parasites which restrict production and limit commercial value. The lack of transcriptomic and genomic information for the miiuy croaker limits the ability of researchers to study the pathogenesis and immune system of this species. In this study we constructed a cDNA library from liver, spleen and kidney which was sequenced using Illumina paired-end sequencing to enable gene discovery and molecular marker development. Principal Findings In our study, a total of 69,071 unigenes with an average length of 572 bp were obtained. Of these, 45,676 (66.13%) were successfully annotated in public databases. The unigenes were also annotated with Gene Ontology, Clusters of Orthologous Groups and KEGG pathways. Additionally, 498 immune-relevant genes were identified and classified. Furthermore, 14,885 putative simple sequence repeats (cSSRs) and 8,510 putative single nucleotide polymorphisms (SNPs) were identified from the 69,071 unigenes. Conclusion The miiuy croaker (Miichthys miiuy) transcriptome data provides a large resource to identify new genes involved in many processes including those involved in the response to pathogens and diseases. Furthermore, the thousands of potential cSSR and SNP markers found in this study are important resources with respect to future development of molecular marker assisted breeding programs for the miiuy croaker. PMID:24714210

  7. Medline search engine for finding genetic markers with biological significance.

    PubMed

    Xuan, Weijian; Wang, Pinglang; Watson, Stanley J; Meng, Fan

    2007-09-15

    Genome-wide high density SNP association studies are expected to identify various SNP alleles associated with different complex disorders. Understanding the biological significance of these SNP alleles in the context of existing literature is a major challenge since existing search engines are not designed to search literature for SNPs or other genetic markers. The literature mining of gene and protein functions has received significant attention and effort while similar work on genetic markers and their related diseases is still in its infancy. Our goal is to develop a web-based tool that facilitates the mining of Medline literature related to genetic studies and gene/protein function studies. Our solution consists of four main function modules for (1) identification of different types of genetic markers or genetic variations in Medline records (2) distinguishing positive versus negative linkage or association between genetic markers and diseases (3) integrating marker genomic location data from different databases to enable the retrieval of Medline records related to markers in the same linkage disequilibrium region (4) and a web interface called MarkerInfoFinder to search, display, sort and download Medline citation results. Tests using published data suggest MarkerInfoFinder can significantly increase the efficiency of finding genetic disorders and their underlying molecular mechanisms. The functions we developed will also be used to build a knowledge base for genetic markers and diseases. The MarkerInfoFinder is publicly available at: http://brainarray.mbni.med.umich.edu/brainarray/datamining/MarkerInfoFinder.

  8. Identification and authentication of Rosa species through development of species-specific SCAR marker(s).

    PubMed

    Bashir, K M I; Awan, F S; Khan, I A; Khan, A I; Usman, M

    2014-05-30

    Roses (Rosa indica) belong to one of the most crucial groups of plants in the floriculture industry. Rosa species have special fragrances of interest to the perfume and pharmaceutical industries. The genetic diversity of plants based on morphological characteristics is difficult to measure under natural conditions due to the influence of environmental factors, which is why a reliable fingerprinting method was developed to overcome this problem. The development of molecular markers will enable the identification of Rosa species. In the present study, randomly amplified polymorphic DNA (RAPD) analysis was done on four Rosa species, Rosa gruss-an-teplitz (Surkha), Rosa bourboniana, Rosa centifolia, and Rosa damascena. A polymorphic RAPD fragment of 391 bp was detected in R. bourboniana, which was cloned, purified, sequenced, and used to design a pair of species-specific sequence-characterized amplified region (SCAR) primers (forward and reverse). These SCAR primers were used to amplify the specific regions of the rose genome. These PCR amplifications with specific primers are less sensitive to reaction conditions, and due to their high reproducibility, these species-specific SCAR primers can be used for marker-assisted selection and identification of Rosa species.

  9. Analysis of SNP-SNP interactions and bone quantitative ultrasound parameter in early adulthood.

    PubMed

    Correa-Rodríguez, María; Viatte, Sebastien; Massey, Jonathan; Schmidt-RioValle, Jacqueline; Rueda-Medina, Blanca; Orozco, Gisela

    2017-10-03

    Osteoporosis individual susceptibility is determined by the interaction of multiple genetic variants and environmental factors. The aim of this study was to conduct SNP-SNP interaction analyses in candidate genes influencing heel quantitative ultrasound (QUS) parameter in early adulthood to identify novel insights into the mechanism of disease. The study population included 575 healthy subjects (mean age 20.41; SD 2.36). To assess bone mass QUS was performed to determine Broadband ultrasound attenuation (BUA, dB/MHz). A total of 32 SNPs mapping to loci that have been characterized as genetic markers for QUS and/or BMD parameters were selected as genetic markers in this study. The association of all possible SNP pairs with QUS was assessed by linear regression and a SNP-SNP interaction was defined as a significant departure from additive effects. The pairwise SNP-SNP analysis showed multiple interactions. The interaction comprising SNPs rs9340799 and rs3736228 that map in the ESR1 and LRP5 genes respectively, revealed the lowest p value after adjusting for confounding factors (p-value = 0.001, β (95% CI) = 14.289 (5.548, 23.029). In addition, our model reported others such as TMEM135-WNT16 (p = 0.007, β(95%CI) = 9.101 (2.498, 15.704), ESR1-DKK1 (p = 0.012, β(95%CI) = 13.641 (2.959, 24.322) or OPG-LRP5 (p = 0.012, β(95%CI) = 8.724 (1.936, 15.512). However, none of the detected interactions remain significant considering the Bonferroni significance threshold for multiple testing (p<0.0001). Our analysis of SNP-SNP interaction in candidate genes of QUS in Caucasian young adults reveal several interactions, especially between ESR1 and LRP5 genes, that did not reach statistical significance. Although our results do not support a relevant genetic contribution of SNP-SNP epistatic interactions to QUS in young adults, further studies in larger independent populations would be necessary to support these preliminary findings.

  10. A high-density, multi-parental SNP genetic map on apple validates a new mapping approach for outcrossing species

    PubMed Central

    Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma JJ; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco CAM; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric

    2016-01-01

    Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species. PMID:27917289

  11. Determinant molecular markers for peri-gastrulating bovine embryo development.

    PubMed

    Hue, Isabelle

    2016-01-01

    Peri-gastrulation defines the time frame between blastocyst formation and implantation that also corresponds in cattle to elongation, pregnancy recognition and uterine secretion. Optimally, this developmental window prepares the conceptus for implantation, placenta formation and fetal development. However, this is a highly sensitive period, as evidenced by the incidence of embryo loss or early post-implantation mortality after AI, embryo transfer or somatic cell nuclear transfer. Elongation markers have often been used within this time frame to assess developmental defects or delays, originating either from the embryo, the uterus or the dam. Comparatively, gastrulation markers have not received great attention, although elongation and gastrulation are linked by reciprocal interactions at the molecular and cellular levels. To make this clearer, this peri-gastrulating period is described herein with a focus on its main developmental landmarks, and the resilience of the landmarks in the face of biotechnologies is questioned.

  12. Development of Molecular Markers for Determining Continental Origin of Wood from White Oaks (Quercus L. sect. Quercus)

    PubMed Central

    Schroeder, Hilke; Cronn, Richard; Yanbaev, Yulai; Jennings, Tara; Mader, Malte; Degen, Bernd; Kersten, Birgit

    2016-01-01

    To detect and avoid illegal logging of valuable tree species, identification methods for the origin of timber are necessary. We used next-generation sequencing to identify chloroplast genome regions that differentiate the origin of white oaks from the three continents; Asia, Europe, and North America. By using the chloroplast genome of Asian Q. mongolica as a reference, we identified 861 variant sites (672 single nucleotide polymorphisms (SNPs); 189 insertion/deletion (indel) polymorphism) from representative species of three continents (Q. mongolica from Asia; Q. petraea and Q. robur from Europe; Q. alba from North America), and we identified additional chloroplast polymorphisms in pools of 20 individuals each from Q. mongolica (789 variant sites) and Q. robur (346 variant sites). Genome sequences were screened for indels to develop markers that identify continental origin of oak species, and that can be easily evaluated using a variety of detection methods. We identified five indels and one SNP that reliably identify continent-of-origin, based on evaluations of up to 1078 individuals representing 13 white oak species and three continents. Due to the size of length polymorphisms revealed, this marker set can be visualized using capillary electrophoresis or high resolution gel (acrylamide or agarose) electrophoresis. With these markers, we provide the wood trading market with an instrument to comply with the U.S. and European laws that require timber companies to avoid the trade of illegally harvested timber. PMID:27352242

  13. Development of Molecular Markers for Determining Continental Origin of Wood from White Oaks (Quercus L. sect. Quercus).

    PubMed

    Schroeder, Hilke; Cronn, Richard; Yanbaev, Yulai; Jennings, Tara; Mader, Malte; Degen, Bernd; Kersten, Birgit

    2016-01-01

    To detect and avoid illegal logging of valuable tree species, identification methods for the origin of timber are necessary. We used next-generation sequencing to identify chloroplast genome regions that differentiate the origin of white oaks from the three continents; Asia, Europe, and North America. By using the chloroplast genome of Asian Q. mongolica as a reference, we identified 861 variant sites (672 single nucleotide polymorphisms (SNPs); 189 insertion/deletion (indel) polymorphism) from representative species of three continents (Q. mongolica from Asia; Q. petraea and Q. robur from Europe; Q. alba from North America), and we identified additional chloroplast polymorphisms in pools of 20 individuals each from Q. mongolica (789 variant sites) and Q. robur (346 variant sites). Genome sequences were screened for indels to develop markers that identify continental origin of oak species, and that can be easily evaluated using a variety of detection methods. We identified five indels and one SNP that reliably identify continent-of-origin, based on evaluations of up to 1078 individuals representing 13 white oak species and three continents. Due to the size of length polymorphisms revealed, this marker set can be visualized using capillary electrophoresis or high resolution gel (acrylamide or agarose) electrophoresis. With these markers, we provide the wood trading market with an instrument to comply with the U.S. and European laws that require timber companies to avoid the trade of illegally harvested timber.

  14. Using Hamming Distance as Information for SNP-Sets Clustering and Testing in Disease Association Studies.

    PubMed

    Wang, Charlotte; Kao, Wen-Hsin; Hsiao, Chuhsing Kate

    2015-01-01

    The availability of high-throughput genomic data has led to several challenges in recent genetic association studies, including the large number of genetic variants that must be considered and the computational complexity in statistical analyses. Tackling these problems with a marker-set study such as SNP-set analysis can be an efficient solution. To construct SNP-sets, we first propose a clustering algorithm, which employs Hamming distance to measure the similarity between strings of SNP genotypes and evaluates whether the given SNPs or SNP-sets should be clustered. A dendrogram can then be constructed based on such distance measure, and the number of clusters can be determined. With the resulting SNP-sets, we next develop an association test HDAT to examine susceptibility to the disease of interest. This proposed test assesses, based on Hamming distance, whether the similarity between a diseased and a normal individual differs from the similarity between two individuals of the same disease status. In our proposed methodology, only genotype information is needed. No inference of haplotypes is required, and SNPs under consideration do not need to locate in nearby regions. The proposed clustering algorithm and association test are illustrated with applications and simulation studies. As compared with other existing methods, the clustering algorithm is faster and better at identifying sets containing SNPs exerting a similar effect. In addition, the simulation studies demonstrated that the proposed test works well for SNP-sets containing a large proportion of neutral SNPs. Furthermore, employing the clustering algorithm before testing a large set of data improves the knowledge in confining the genetic regions for susceptible genetic markers.

  15. A Diagnostic Marker to Discriminate Childhood Apraxia of Speech from Speech Delay: I. Development and Description of the Pause Marker

    ERIC Educational Resources Information Center

    Shriberg, Lawrence D.; Strand, Edythe A.; Fourakis, Marios; Jakielski, Kathy J.; Hall, Sheryl D.; Karlsson, Heather B.; Mabie, Heather L.; McSweeny, Jane L.; Tilkens, Christie M.; Wilson, David L.

    2017-01-01

    Purpose: The goal of this article (PM I) is to describe the rationale for and development of the Pause Marker (PM), a single-sign diagnostic marker proposed to discriminate early or persistent childhood apraxia of speech from speech delay. Method: The authors describe and prioritize 7 criteria with which to evaluate the research and clinical…

  16. Supervised learning-based tagSNP selection for genome-wide disease classifications

    PubMed Central

    Liu, Qingzhong; Yang, Jack; Chen, Zhongxue; Yang, Mary Qu; Sung, Andrew H; Huang, Xudong

    2008-01-01

    Background Comprehensive evaluation of common genetic variations through association of single nucleotide polymorphisms (SNPs) with complex human diseases on the genome-wide scale is an active area in human genome research. One of the fundamental questions in a SNP-disease association study is to find an optimal subset of SNPs with predicting power for disease status. To find that subset while reducing study burden in terms of time and costs, one can potentially reconcile information redundancy from associations between SNP markers. Results We have developed a feature selection method named Supervised Recursive Feature Addition (SRFA). This method combines supervised learning and statistical measures for the chosen candidate features/SNPs to reconcile the redundancy information and, in doing so, improve the classification performance in association studies. Additionally, we have proposed a Support Vector based Recursive Feature Addition (SVRFA) scheme in SNP-disease association analysis. Conclusions We have proposed using SRFA with different statistical learning classifiers and SVRFA for both SNP selection and disease classification and then applying them to two complex disease data sets. In general, our approaches outperform the well-known feature selection method of Support Vector Machine Recursive Feature Elimination and logic regression-based SNP selection for disease classification in genetic association studies. Our study further indicates that both genetic and environmental variables should be taken into account when doing disease predictions and classifications for the most complex human diseases that have gene-environment interactions. PMID:18366619

  17. Supervised learning-based tagSNP selection for genome-wide disease classifications.

    PubMed

    Liu, Qingzhong; Yang, Jack; Chen, Zhongxue; Yang, Mary Qu; Sung, Andrew H; Huang, Xudong

    2008-01-01

    Comprehensive evaluation of common genetic variations through association of single nucleotide polymorphisms (SNPs) with complex human diseases on the genome-wide scale is an active area in human genome research. One of the fundamental questions in a SNP-disease association study is to find an optimal subset of SNPs with predicting power for disease status. To find that subset while reducing study burden in terms of time and costs, one can potentially reconcile information redundancy from associations between SNP markers. We have developed a feature selection method named Supervised Recursive Feature Addition (SRFA). This method combines supervised learning and statistical measures for the chosen candidate features/SNPs to reconcile the redundancy information and, in doing so, improve the classification performance in association studies. Additionally, we have proposed a Support Vector based Recursive Feature Addition (SVRFA) scheme in SNP-disease association analysis. We have proposed using SRFA with different statistical learning classifiers and SVRFA for both SNP selection and disease classification and then applying them to two complex disease data sets. In general, our approaches outperform the well-known feature selection method of Support Vector Machine Recursive Feature Elimination and logic regression-based SNP selection for disease classification in genetic association studies. Our study further indicates that both genetic and environmental variables should be taken into account when doing disease predictions and classifications for the most complex human diseases that have gene-environment interactions.

  18. A High-Density Consensus Map of Common Wheat Integrating Four Mapping Populations Scanned by the 90K SNP Array

    PubMed Central

    Wen, Weie; He, Zhonghu; Gao, Fengmei; Liu, Jindong; Jin, Hui; Zhai, Shengnan; Qu, Yanying; Xia, Xianchun

    2017-01-01

    A high-density consensus map is a powerful tool for gene mapping, cloning and molecular marker-assisted selection in wheat breeding. The objective of this study was to construct a high-density, single nucleotide polymorphism (SNP)-based consensus map of common wheat (Triticum aestivum L.) by integrating genetic maps from four recombinant inbred line populations. The populations were each genotyped using the wheat 90K Infinium iSelect SNP assay. A total of 29,692 SNP markers were mapped on 21 linkage groups corresponding to 21 hexaploid wheat chromosomes, covering 2,906.86 cM, with an overall marker density of 10.21 markers/cM. Compared with the previous maps based on the wheat 90K SNP chip detected 22,736 (76.6%) of the SNPs with consistent chromosomal locations, whereas 1,974 (6.7%) showed different chromosomal locations, and 4,982 (16.8%) were newly mapped. Alignment of the present consensus map and the wheat expressed sequence tags (ESTs) Chromosome Bin Map enabled assignment of 1,221 SNP markers to specific chromosome bins and 819 ESTs were integrated into the consensus map. The marker orders of the consensus map were validated based on physical positions on the wheat genome with Spearman rank correlation coefficients ranging from 0.69 (4D) to 0.97 (1A, 4B, 5B, and 6A), and were also confirmed by comparison with genetic position on the previously 40K SNP consensus map with Spearman rank correlation coefficients ranging from 0.84 (6D) to 0.99 (6A). Chromosomal rearrangements reported previously were confirmed in the present consensus map and new putative rearrangements were identified. In addition, an integrated consensus map was developed through the combination of five published maps with ours, containing 52,607 molecular markers. The consensus map described here provided a high-density SNP marker map and a reliable order of SNPs, representing a step forward in mapping and validation of chromosomal locations of SNPs on the wheat 90K array. Moreover, it can be

  19. Transcriptome-based SNP discovery by GBS and the construction of a genetic map for olive.

    PubMed

    İpek, Ahmet; İpek, Meryem; Ercişli, Sezai; Tangu, Nesrin Aktepe

    2017-02-18

    Molecular markers located in the genic regions of plants are valuable tools for the identification of candidate genes of economically important traits and consequent use in marker-assisted selection (MAS). In the past, simple sequence repeat markers (SSRs) and single-nucleotide polymorphisms (SNPs) located in expressed sequence tags (ESTs) were developed by sequencing RNA derived from different plant tissues, which involves laborious RNA extraction, mRNA isolation, and cDNA synthesis. In order to develop SNP markers located in olive transcriptomes, we used the recently developed genotyping-by-sequencing (GBS) technique. An analysis was done for 125 olive DNA samples (123 DNA samples from a cross-pollinated F1 mapping population, and two samples from parents). From 45 to 66% of Illumina reads from GBS analysis were aligned to the olive transcriptome. A total of 22,033 transcriptome-based SNP markers were identified, and 3384 of these were mapped in the olive genome. The genetic linkage map constructed in this study consists of 1 cleaved amplified polymorphic sequence (CAPS), 19 SSR, and 3384 transcriptome-based SNP markers. The map covers 3340.8 cM of the olive genome in 23 linkage groups, with the length of the linkage groups ranging from 55.6 to 248.7 cM. Average map distance between flanking markers was 0.98 cM. This genetic linkage map is a saturated genetic map and will be a useful tool for the localization of quantitative trait loci (QTLs) and gene(s) of interest and for the identification of candidate genes for economically important traits.

  20. An Integrated SNP Mining and Utilization (ISMU) Pipeline for Next Generation Sequencing Data

    PubMed Central

    Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M.; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A. V. S. K.; Varshney, Rajeev K.

    2014-01-01

    Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly requires working knowledge of command line interface, massive computational resources and expertise which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for plant genetics and breeding community with no computational expertise in order to discover SNPs and utilize in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone

  1. An integrated SNP mining and utilization (ISMU) pipeline for next generation sequencing data.

    PubMed

    Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A V S K; Varshney, Rajeev K

    2014-01-01

    Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly requires working knowledge of command line interface, massive computational resources and expertise which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for plant genetics and breeding community with no computational expertise in order to discover SNPs and utilize in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone

  2. Construction of a high-density genetic map for sesame based on large scale marker development by specific length amplified fragment (SLAF) sequencing

    PubMed Central

    2013-01-01

    Background The genetics and molecular biology of sesame has only recently begun to be studied even though sesame is an important oil seed crop. A high-density genetic map for sesame has not been published yet due to a lack of sufficient molecular markers. Specific length amplified fragment sequencing (SLAF-seq) is a recently developed high-resolution strategy for large-scale de novo SNP discovery and genotyping. SLAF-seq was employed in this study to obtain sufficient markers to construct a high-density genetic map for sesame. Results In total, 28.21 Gb of data containing 201,488,285 pair-end reads was obtained after sequencing. The average coverage for each SLAF marker was 23.48-fold in the male parent, 23.38-fold in the female parent, and 14.46-fold average in each F2 individual. In total, 71,793 high-quality SLAFs were detected of which 3,673 SLAFs were polymorphic and 1,272 of the polymorphic markers met the requirements for use in the construction of a genetic map. The final map included 1,233 markers on the 15 linkage groups (LGs) and was 1,474.87 cM in length with an average distance of 1.20 cM between adjacent markers. To our knowledge, this map is the densest genetic linkage map to date for sesame. 'SNP_only’ markers accounted for 87.51% of the markers on the map. A total of 205 markers on the map showed significant (P < 0.05) segregation distortion. Conclusions We report here the first high-density genetic map for sesame. The map was constructed using an F2 population and the SLAF-seq approach, which allowed the efficient development of a large number of polymorphic markers in a short time. Results of this study will not only provide a platform for gene/QTL fine mapping, map-based gene isolation, and molecular breeding for sesame, but will also serve as a reference for positioning sequence scaffolds on a physical map, to assist in the process of assembling the sesame genome sequence. PMID:24060091

  3. Construction of a high-density genetic map for sesame based on large scale marker development by specific length amplified fragment (SLAF) sequencing.

    PubMed

    Zhang, Yanxin; Wang, Linhai; Xin, Huaigen; Li, Donghua; Ma, Chouxian; Ding, Xia; Hong, Weiguo; Zhang, Xiurong

    2013-09-24

    The genetics and molecular biology of sesame has only recently begun to be studied even though sesame is an important oil seed crop. A high-density genetic map for sesame has not been published yet due to a lack of sufficient molecular markers. Specific length amplified fragment sequencing (SLAF-seq) is a recently developed high-resolution strategy for large-scale de novo SNP discovery and genotyping. SLAF-seq was employed in this study to obtain sufficient markers to construct a high-density genetic map for sesame. In total, 28.21 Gb of data containing 201,488,285 pair-end reads was obtained after sequencing. The average coverage for each SLAF marker was 23.48-fold in the male parent, 23.38-fold in the female parent, and 14.46-fold average in each F2 individual. In total, 71,793 high-quality SLAFs were detected of which 3,673 SLAFs were polymorphic and 1,272 of the polymorphic markers met the requirements for use in the construction of a genetic map. The final map included 1,233 markers on the 15 linkage groups (LGs) and was 1,474.87 cM in length with an average distance of 1.20 cM between adjacent markers. To our knowledge, this map is the densest genetic linkage map to date for sesame. 'SNP_only' markers accounted for 87.51% of the markers on the map. A total of 205 markers on the map showed significant (P < 0.05) segregation distortion. We report here the first high-density genetic map for sesame. The map was constructed using an F2 population and the SLAF-seq approach, which allowed the efficient development of a large number of polymorphic markers in a short time. Results of this study will not only provide a platform for gene/QTL fine mapping, map-based gene isolation, and molecular breeding for sesame, but will also serve as a reference for positioning sequence scaffolds on a physical map, to assist in the process of assembling the sesame genome sequence.

  4. SNP Discovery and Chromosome Anchoring Provide the First Physically-Anchored Hexaploid Oat Map and Reveal Synteny with Model Species

    PubMed Central

    Chao, Shiaoman; Jellen, Eric N.; Carson, Martin L.; Rines, Howard W.; Obert, Donald E.; Lutz, Joseph D.; Shackelford, Irene; Korol, Abraham B.; Wight, Charlene P.; Gardner, Kyle M.; Hattori, Jiro; Beattie, Aaron D.; Bjørnstad, Åsmund; Bonman, J. Michael; Jannink, Jean-Luc; Sorrells, Mark E.; Brown-Guedira, Gina L.; Mitchell Fetch, Jennifer W.; Harrison, Stephen A.; Howarth, Catherine J.; Ibrahim, Amir; Kolb, Frederic L.; McMullen, Michael S.; Murphy, J. Paul; Ohm, Herbert W.; Rossnagel, Brian G.; Yan, Weikai; Miclaus, Kelci J.; Hiller, Jordan; Maughan, Peter J.; Redman Hulse, Rachel R.; Anderson, Joseph M.; Islamovic, Emir

    2013-01-01

    A physically anchored consensus map is foundational to modern genomics research; however, construction of such a map in oat (Avena sativa L., 2n = 6x = 42) has been hindered by the size and complexity of the genome, the scarcity of robust molecular markers, and the lack of aneuploid stocks. Resources developed in this study include a modified SNP discovery method for complex genomes, a diverse set of oat SNP markers, and a novel chromosome-deficient SNP anchoring strategy. These resources were applied to build the first complete, physically-anchored consensus map of hexaploid oat. Approximately 11,000 high-confidence in silico SNPs were discovered based on nine million inter-varietal sequence reads of genomic and cDNA origin. GoldenGate genotyping of 3,072 SNP assays yielded 1,311 robust markers, of which 985 were mapped in 390 recombinant-inbred lines from six bi-parental mapping populations ranging in size from 49 to 97 progeny. The consensus map included 985 SNPs and 68 previously-published markers, resolving 21 linkage groups with a total map distance of 1,838.8 cM. Consensus linkage groups were assigned to 21 chromosomes using SNP deletion analysis of chromosome-deficient monosomic hybrid stocks. Alignments with sequenced genomes of rice and Brachypodium provide evidence for extensive conservation of genomic regions, and renewed encouragement for orthology-based genomic discovery in this important hexaploid species. These results also provide a framework for high-resolution genetic analysis in oat, and a model for marker development and map construction in other species with complex genomes and limited resources. PMID:23533580

  5. SNP panels/Imputation

    USDA-ARS?s Scientific Manuscript database

    Participants from thirteen countries discussed services that Interbull can perform or recommendations that Interbull can make to promote harmonization and assist member countries in improving their genomic evaluations in regard to SNP panels and imputation. The panel recommended: A mechanism to shar...

  6. High-throughput development of genome-wide locus-specific informative SSR markers in wheat

    USDA-ARS?s Scientific Manuscript database

    Although simple sequence repeat (SSR) markers are not new, they are still useful and often used markers in molecular mapping and marker-assisted breeding, particularly in developing countries. However, locus-specific SSR markers could be more useful and informative in wheat breeding and genetic stud...

  7. A Dual Role for KRT81: A miR-SNP Associated with Recurrence in Non-Small-Cell Lung Cancer and a Novel Marker of Squamous Cell Lung Carcinoma

    PubMed Central

    Campayo, Marc; Navarro, Alfons; Viñolas, Nuria; Tejero, Rut; Muñoz, Carmen; Diaz, Tania; Marrades, Ramon; Cabanas, Maria L.; Gimferrer, Josep M.; Gascon, Pere; Ramirez, Jose; Monzo, Mariano

    2011-01-01

    MicroRNAs (miRNAs) play an important role in carcinogenesis through the regulation of their target genes. miRNA-related single nucleotide polymorphisms (miR-SNPs) can affect miRNA biogenesis and target sites and can alter microRNA expression and functions. We examined 11 miR-SNPs, including 5 in microRNA genes, 3 in microRNA binding sites and 3 in microRNA-processing machinery components, and evaluated time to recurrence (TTR) according to miR-SNP genotypes in 175 surgically resected non-small-cell lung cancer (NSCLC) patients. Significant differences in TTR were found according to KRT81 rs3660 (median TTR: 20.3 months for the CC genotype versus 86.8 months for the CG or GG genotype; P = 0.003) and XPO5 rs11077 (median TTR: 24.7 months for the AA genotype versus 73.1 months for the AC or CC genotypes; P = 0.029). Moreover, when patients were divided according to stage, these differences were maintained for stage I patients (P = 0.002 for KRT81 rs3660; P<0.001 for XPO5 rs11077). When patients were divided into sub-groups according to histology, the effect of the KRT81 rs3660 genotype on TTR was significant in patients with squamous cell carcinoma (P = 0.004) but not in those with adenocarcinoma. In the multivariate analyses, the KRT81 rs3660 CC genotype (OR = 1.8; P = 0.023) and the XPO5 rs11077 AA genotype (OR = 1.77; P = 0.026) emerged as independent variables influencing TTR. Immunohistochemical analyses in 80 lung specimens showed that 95% of squamous cell carcinomas were positive for KRT81, compared to only 19% of adenocarcinomas (P<0.0001). In conclusion, miR-SNPs are a novel class of SNPs that can add useful prognostic information on the clinical outcome of resected NSCLC patients and may be a potential key tool for selecting high-risk stage I patients. Moreover, KRT81 has emerged as a promising immunohistochemical marker for the identification of squamous cell lung carcinoma. PMID:21799879

  8. High-density SNP-based genetic maps for the parents of an outcrossed and a selfed tetraploid garden rose cross, inferred from admixed progeny using the 68k rose SNP array

    PubMed Central

    Vukosavljev, Mirjana; Arens, Paul; Voorrips, Roeland E; van ‘t Westende, Wendy PC; Esselink, GD; Bourke, Peter M; Cox, Peter; van de Weg, W Eric; Visser, Richard GF; Maliepaard, Chris; Smulders, Marinus JM

    2016-01-01

    Dense genetic maps create a base for QTL analysis of important traits and future implementation of marker-assisted breeding. In tetraploid rose, the existing linkage maps include <300 markers to cover 28 linkage groups (4 homologous sets of 7 chromosomes). Here we used the 68k WagRhSNP Axiom single-nucleotide polymorphism (SNP) array for rose, in combination with SNP dosage calling at the tetraploid level, to genotype offspring from the garden rose cultivar ‘Red New Dawn’. The offspring proved to be not from a single bi-parental cross. In rose breeding, crosses with unintended parents occur regularly. We developed a strategy to separate progeny into putative populations, even while one of the parents was unknown, using principle component analysis on pairwise genetic distances based on sets of selected SNP markers that were homozygous, and therefore uninformative for one parent. One of the inferred populations was consistent with self-fertilization of ‘Red New Dawn’. Subsequently, linkage maps were generated for a bi-parental and a self-pollinated population with ‘Red New Dawn’ as the common maternal parent. The densest map, for the selfed parent, had 1929 SNP markers on 25 linkage groups, covering 1765.5 cM at an average marker distance of 0.9 cM. Synteny with the strawberry (Fragaria vesca) genome was extensive. Rose ICM1 corresponded to F. vesca pseudochromosome 7 (Fv7), ICM4 to Fv4, ICM5 to Fv3, ICM6 to Fv2 and ICM7 to Fv5. Rose ICM2 corresponded to parts of F. vesca pseudochromosomes 1 and 6, whereas ICM3 is syntenic to the remainder of Fv6. PMID:27818777

  9. A 48 SNP set for grapevine cultivar identification

    PubMed Central

    2011-01-01

    Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough as to provide sufficient information content for genetic identification in grapevine allowing for good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker. Furthermore, because SNP

  10. Comparative analysis of disease-linked single nucleotide polymorphic markers from Brassica rapa for their applicability to Brassica oleracea.

    PubMed

    Cho, Young-Il; Ahn, Yul-Kyun; Tripathi, Swati; Kim, Jeong-Ho; Lee, Hye-Eun; Kim, Do-Sun

    2015-01-01

    Numerous studies using single nucleotide polymorphisms (SNPs) have been conducted in humans, and other animals, and in major crops, including rice, soybean, and Chinese cabbage. However, the number of SNP studies in cabbage is limited. In this present study, we evaluated whether 7,645 SNPs previously identified as molecular markers linked to disease resistance in the Brassica rapa genome could be applied to B. oleracea. In a BLAST analysis using the SNP sequences of B. rapa and B. oleracea genomic sequence data registered in the NCBI database, 256 genes for which SNPs had been identified in B. rapa were found in B. oleracea. These genes were classified into three functional groups: molecular function (64 genes), biological process (96 genes), and cellular component (96 genes). A total of 693 SNP markers, including 145 SNP markers [BRH--developed from the B. rapa genome for high-resolution melt (HRM) analysis], 425 SNP markers (BRP--based on the B. rapa genome that could be applied to B. oleracea), and 123 new SNP markers (BRS--derived from BRP and designed for HRM analysis), were investigated for their ability to amplify sequences from cabbage genomic DNA. In total, 425 of the SNP markers (BRP-based on B. rapa genome), selected from 7,645 SNPs, were successfully applied to B. oleracea. Using PCR, 108 of 145 BRH (74.5%), 415 of 425 BRP (97.6%), and 118 of 123 BRS (95.9%) showed amplification, suggesting that it is possible to apply SNP markers developed based on the B. rapa genome to B. oleracea. These results provide valuable information that can be utilized in cabbage genetics and breeding programs using molecular markers derived from other Brassica species.

  11. Comparative Analysis of Disease-Linked Single Nucleotide Polymorphic Markers from Brassica rapa for Their Applicability to Brassica oleracea

    PubMed Central

    Cho, Young-Il; Ahn, Yul-Kyun; Tripathi, Swati; Kim, Jeong-Ho; Lee, Hye-Eun; Kim, Do-Sun

    2015-01-01

    Numerous studies using single nucleotide polymorphisms (SNPs) have been conducted in humans, and other animals, and in major crops, including rice, soybean, and Chinese cabbage. However, the number of SNP studies in cabbage is limited. In this present study, we evaluated whether 7,645 SNPs previously identified as molecular markers linked to disease resistance in the Brassica rapa genome could be applied to B. oleracea. In a BLAST analysis using the SNP sequences of B. rapa and B. oleracea genomic sequence data registered in the NCBI database, 256 genes for which SNPs had been identified in B. rapa were found in B. oleracea. These genes were classified into three functional groups: molecular function (64 genes), biological process (96 genes), and cellular component (96 genes). A total of 693 SNP markers, including 145 SNP markers [BRH—developed from the B. rapa genome for high-resolution melt (HRM) analysis], 425 SNP markers (BRP—based on the B. rapa genome that could be applied to B. oleracea), and 123 new SNP markers (BRS—derived from BRP and designed for HRM analysis), were investigated for their ability to amplify sequences from cabbage genomic DNA. In total, 425 of the SNP markers (BRP-based on B. rapa genome), selected from 7,645 SNPs, were successfully applied to B. oleracea. Using PCR, 108 of 145 BRH (74.5%), 415 of 425 BRP (97.6%), and 118 of 123 BRS (95.9%) showed amplification, suggesting that it is possible to apply SNP markers developed based on the B. rapa genome to B. oleracea. These results provide valuable information that can be utilized in cabbage genetics and breeding programs using molecular markers derived from other Brassica species. PMID:25790283

  12. A SNP transferability survey within the genus Vitis

    PubMed Central

    Vezzulli, Silvia; Micheletti, Diego; Riaz, Summaira; Pindo, Massimo; Viola, Roberto; This, Patrice; Walker, M Andrew; Troggio, Michela; Velasco, Riccardo

    2008-01-01

    Background Efforts to sequence the genomes of different organisms continue to increase. The DNA sequence is usually decoded for one individual and its application is for the whole species. The recent sequencing of the highly heterozygous Vitis vinifera L. cultivar Pinot Noir (clone ENTAV 115) genome gave rise to several thousand polymorphisms and offers a good model to study the transferability of its degree of polymorphism to other individuals of the same species and within the genus. Results This study was performed by genotyping 137 SNPs through the SNPlex™ Genotyping System (Applied Biosystems Inc.) and by comparing the SNPlex sequencing results across 35 (of the 137) regions from 69 grape accessions. A heterozygous state transferability of 31.5% across the unrelated cultivars of V. vinifera, of 18.8% across the wild forms of V. vinifera, of 2.3% among non-vinifera Vitis species, and of 0% with Muscadinia rotundifolia was found. In addition, mean allele frequencies were used to evaluate SNP informativeness and develop useful subsets of markers. Conclusion Using SNPlex application and corroboration from the sequencing analysis, the informativeness of SNP markers from the heterozygous grape cultivar Pinot Noir was validated in V. vinifera (including cultivars and wild forms), but had a limited application for non-vinifera Vitis species where a resequencing strategy may be preferred, knowing that homology at priming sites is sufficient. This work will allow future applications such as mapping and diversity studies, accession identification and genomic-research assisted breeding within V. vinifera. PMID:19087337

  13. Development of ARMS-PCR assay for genotyping of Pro12Ala SNP of PPARG gene: a cost effective way for case-control studies of type 2 diabetes in developing countries.

    PubMed

    Islam, Mehboob; Awan, Fazli Rabbi; Baig, Shahid Mahmood

    2014-09-01

    Type 2 diabetes (T2D) is a prevalent metabolic disorder across the globe. Research is underway on various aspects including genetics to understand and control the global epidemic of diabetes. Recently, several SNPs in various genes have been associated with T2D. These association studies are mainly carried out in the developed countries through Genome Wide Association Scans, with follow-up replication/validation studies by high-throughput genotyping techniques (e.g. Taqman Technology). Although, similar studies could be conducted in developing countries, however, the limiting factors are the associated cost and expertise. These factors hamper research into the genetic association and replication studies from low-income countries to figure out the role of putatively associated SNPs in diabetes. Although, there are several SNP detection methods (e.g. Taqman assay, Dot-blot, PCR-RFLP, DGGE, SSCP) but these are either expensive or labor intensive or less sensitive. Hence, our aim was to develop a low-cost method for the validation of PPARG (Pro12Ala, CCA>GCA) SNP (rs1801282) for its association with T2D. Here, we developed a cost-effective and rapid amplification refractory mutation specific-PCR (ARMS-PCR) method for this SNP detection. We successfully genotyped PPARG SNPs (Pro12Ala) in human samples and the validity of this method was confirmed by DNA sequencing of a few representative samples for the three different genotypes. Furthermore, ARMS-PCR was applied to T2D patients and control samples for the screening of this SNP.

  14. Transcriptome Characterization and Functional Marker Development in Sorghum Sudanense.

    PubMed

    Li, Jieqin; Wang, Lihua; Zhan, Qiuwen; Liu, Yanlong; Yang, Xiaocui

    2016-01-01

    Sudangrass, Sorghum sudanense, is an important forage in warm regions. But little is known about its genome. In this study, the transcriptomes of sudangrass S722 and sorghum Tx623B were sequenced by Illumina sequencing. More than 4Gb bases were sequenced for each library. For Tx623B and S722, 88.79% and 83.88% reads, respectively were matched to the Sorghum bicolor genome. A total of 2,397 differentially expressed genes (DEGs) were detected by RNA-Seq between the two libraries, including 849 up-regulated genes and 1,548 down-regulated genes. These DEGs could be divided into three groups by annotation analysis. A total of 44,495 single nucleotide polymorphisms (SNPs) were discovered by aligning S722 reads to the sorghum reference genome. Of these SNPs, 61.37% were transition, and this value did not differ much between different chromosomes. In addition, 16,928 insertion and deletion (indel) loci were identified between the two genomes. A total of 5,344 indel markers were designed, 15 of which were selected to construct the genetic map derived from the cross of Tx623A and Sa. It was indicated that the indel markers were useful and versatile between sorghum and sudangrass. Comparison of synonymous base substitutions (Ks) and non-synonymous base substitutions (Ka) between the two libraries showed that 95% orthologous pairs exhibited Ka/Ks<1.0, indicating that these genes were influenced by purifying selection. The results from this study provide important information for molecular genetic research and a rich resource for marker development in sudangrass and other Sorghum species.

  15. A Multiple-SNP Approach for Genome-Wide Association Study of Milk Production Traits in Chinese Holstein Cattle

    PubMed Central

    Fang, Ming; Fu, Weixuan; Jiang, Dan; Zhang, Qin; Sun, Dongxiao; Ding, Xiangdong; Liu, Jianfeng

    2014-01-01

    The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation. PMID:25148050

  16. A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.

    PubMed

    Fang, Ming; Fu, Weixuan; Jiang, Dan; Zhang, Qin; Sun, Dongxiao; Ding, Xiangdong; Liu, Jianfeng

    2014-01-01

    The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation.

  17. Development of a 44K SNP assay focussing on the analysis of a varroa-specific defence behaviour in honey bees (Apis mellifera carnica).

    PubMed

    Spötter, A; Gupta, P; Nürnberg, G; Reinsch, N; Bienefeld, K

    2012-03-01

    Honey bees are exposed to a number of damaging pathogens and parasites. The most destructive among them, affecting mainly the brood, is Varroa destructor. A promising approach to prevent its spread is to breed for Varroa-tolerant honey bees. A trait that has been shown to provide significant resistance against the Varroa mite is hygienic behaviour, a behavioural response of honey bee workers to brood diseases in general. This study reports the development of a 44K SNP assay, specifically designed for the analysis of hygienic behaviour of individual worker bees (Apis mellifera carnica) directed against V. destructor. Initially, 70,000 SNPs chosen from a large set of SNPs published by the Honey Bee Genome Project were validated for their suitability in the analysis of the Varroa resistance trait 'uncapping of Varroa-infested brood'. This was achieved by genotyping of pooled DNA samples of trait bearers and two trait-negative controls using next-generation sequencing. Approximately 36,000 of these validated SNPs and another 8000 SNPs not validated in this study were selected for the construction of a SNP assay. This assay will be employed in following experiments to analyse individualized DNA samples in order to identify quantitative trait loci (QTL) involved in the control of the investigated trait and to evaluate and possibly confirm QTL found in other studies. However, this assay is not just suitable to study Varroa tolerance, it is as well applicable to analyse any other trait in honey bees. In addition, because of its high density, this assay provides access into genomic selection with respect to several traits considered in honey bee breeding. It will become publicly available via AROS Applied Biotechnology AS, Aarhus, Denmark, before the end of the year 2011.

  18. Mutations of C-reactive protein (CRP) -286 SNP, APC and p53 in colorectal cancer: implication for a CRP-Wnt crosstalk.

    PubMed

    Su, Hai-Xiang; Zhou, Hai-Hong; Wang, Ming-Yu; Cheng, Jin; Zhang, Shi-Chao; Hui, Feng; Chen, Xue-Zhong; Liu, Shan-Hui; Liu, Qin-Jiang; Zhu, Zi-Jiang; Hu, Qing-Rong; Wu, Yi; Ji, Shang-Rong

    2014-01-01

    C-reactive protein (CRP) is an established marker of inflammation with pattern-recognition receptor-like activities. Despite the close association of the serum level of CRP with the risk and prognosis of several types of cancer, it remains elusive whether CRP contributes directly to tumorigenesis or just represents a bystander marker. We have recently identified recurrent mutations at the SNP position -286 (rs3091244) in the promoter of CRP gene in several tumor types, instead suggesting that locally produced CRP is a potential driver of tumorigenesis. However, it is unknown whether the -286 site is the sole SNP position of CRP gene targeted for mutation and whether there is any association between CRP SNP mutations and other frequently mutated genes in tumors. Herein, we have examined the genotypes of three common CRP non-coding SNPs (rs7553007, rs1205, rs3093077) in tumor/normal sample pairs of 5 cancer types (n = 141). No recurrent somatic mutations are found at these SNP positions, indicating that the -286 SNP mutations are preferentially selected during the development of cancer. Further analysis reveals that the -286 SNP mutations of CRP tend to co-occur with mutated APC particularly in rectal cancer (p = 0.04; n = 67). By contrast, mutations of CRP and p53 or K-ras appear to be unrelated. There results thus underscore the functional importance of the -286 mutation of CRP in tumorigenesis and imply an interaction between CRP and Wnt signaling pathway.

  19. Large-Scale Development of Cost-Effective Single-Nucleotide Polymorphism Marker Assays for Genetic Mapping in Pigeonpea and Comparative Mapping in Legumes

    PubMed Central

    Saxena, Rachit K.; Varma Penmetsa, R.; Upadhyaya, Hari D.; Kumar, Ashish; Carrasquilla-Garcia, Noelia; Schlueter, Jessica A.; Farmer, Andrew; Whaley, Adam M.; Sarma, Birinchi K.; May, Gregory D.; Cook, Douglas R.; Varshney, Rajeev K.

    2012-01-01

    Single-nucleotide polymorphisms (SNPs, >2000) were discovered by using RNA-seq and allele-specific sequencing approaches in pigeonpea (Cajanus cajan). For making the SNP genotyping cost-effective, successful competitive allele-specific polymerase chain reaction (KASPar) assays were developed for 1616 SNPs and referred to as PKAMs (pigeonpea KASPar assay markers). Screening of PKAMs on 24 genotypes [23 from cultivated species and 1 wild species (Cajanus scarabaeoides)] defined a set of 1154 polymorphic markers (77.4%) with a polymorphism information content (PIC) value from 0.04 to 0.38. One thousand and ninety-four PKAMs showed polymorphisms between parental lines of the reference mapping population (C. cajan ICP 28 × C. scarabaeoides ICPW 94). By using high-quality marker genotyping data on 167 F2 lines from the population, a comprehensive genetic map comprising 875 PKAMs with an average inter-marker distance of 1.11 cM was developed. Previously mapped 35 simple sequence repeat markers were integrated into the PKAM map and an integrated genetic map of 996.21 cM was constructed. Mapped PKAMs showed a higher degree of synteny with the genome of Glycine max followed by Medicago truncatula and Lotus japonicus and least with Vigna unguiculata. These PKAMs will be useful for genetics research and breeding applications in pigeonpea and for utilizing genome information from other legume species. PMID:23103470

  20. EST, COSII, and arbitrary gene markers give similar estimates of nucleotide diversity in cultivated tomato (Solanum lycopersicum L.).

    PubMed

    Labate, Joanne A; Robertson, Larry D; Wu, Feinan; Tanksley, Steven D; Baldo, Angela M

    2009-03-01

    Because cultivated tomato (Solanum lycopersicum L.) is low in genetic diversity, public, verified single nucleotide polymorphism (SNP) markers within the species are in demand. To promote marker development we resequenced approximately 23 kb in a diverse set of 31 tomato lines including TA496. Three classes of markers were sampled: (1) 26 expressed-sequence tag (EST), all of which were predicted to be polymorphic based on TA496, (2) 14 conserved ortholog set II (COSII) or unigene, and (3) ten published sequences, composed of nine fruit quality genes and one anonymous RFLP marker. The latter two types contained mostly noncoding DNA. In total, 154 SNPs and 34 indels were observed. The distributions of nucleotide diversity estimates among marker types were not significantly different from each other. Ascertainment bias of SNPs was evaluated for the EST markers. Despite the fact that the EST markers were developed using SNP prediction within a sample consisting of only one TA496 allele and one additional allele, the majority of polymorphisms in the 26 EST markers were represented among the other 30 tomato lines. Fifteen EST markers with published SNPs were more closely examined for bias. Mean SNP diversity observations were not significantly different between the original discovery sample of two lines (53 SNPs) and the 31 line diversity panel (56 SNPs). Furthermore, TA496 shared its haplotype with at least one other line at 11 of the 15 markers. These data demonstrate that public EST databases and noncoding regions are a valuable source of unbiased SNP markers in tomato.

  1. Multiplex single nucleotide polymorphism (SNP) assay for detection of soybean mosaic virus resistance genes in soybean.

    PubMed

    Shi, Ainong; Chen, Pengyin; Vierling, Richard; Zheng, Cuming; Li, Dexiao; Dong, Dekun; Shakiba, Ehsan; Cervantez, Innan

    2011-02-01

    Soybean mosaic virus (SMV) is one of the most destructive viral diseases in soybean (Glycine max). Three independent loci for SMV resistance have been identified in soybean germplasm. The use of genetic resistance is the most effective method of controlling this disease. Marker assisted selection (MAS) has become very important and useful in the effort of selecting genes for SMV resistance. Single nucleotide polymorphism (SNP), because of its abundance and high-throughput potential, is a powerful tool in genome mapping, association studies, diversity analysis, and tagging of important genes in plant genomics. In this study, a 10 SNPs plus one insert/deletion (InDel) multiplex assay was developed for SMV resistance: two SNPs were developed from the candidate gene 3gG2 at Rsv1 locus, two SNPs selected from the clone N11PF linked to Rsv1, one 'BARC' SNP screened from soybean chromosome 13 [linkage group (LG) F] near Rsv1, two 'BARC' SNPs from probe A519 linked to Rsv3, one 'BARC' SNP from chromosome 14 (LG B2) near Rsv3, and two 'BARC' SNPs from chromosome 2 (LG D1b) near Rsv4, plus one InDel marker from expressed sequence tag (EST) AW307114 linked to Rsv4. This 11 SNP/InDel multiplex assay showed polymorphism among 47 diverse soybean germplasm, indicating this assay can be used to investigate the mode of inheritance in a SMV resistant soybean line carrying Rsv1, Rsv3, and/or Rsv4 through a segregating population with phenotypic data, and to select a specific gene or pyramid two or three genes for SMV resistance through MAS in soybean breeding program. The presence of two SMV resistance genes (Rsv1 and Rsv3) in J05 soybean was confirmed by the SNP assay.

  2. Development of Pineapple Microsatellite Markers and Germplasm Genetic Diversity Analysis

    PubMed Central

    Tong, Helin; Chen, You; Wang, Jingyi; Chen, Yeyuan; Sun, Guangming; He, Junhu; Wu, Yaoting

    2013-01-01

    Two methods were used to develop pineapple microsatellite markers. Genomic library-based SSR development: using selectively amplified microsatellite assay, 86 sequences were generated from pineapple genomic library. 91 (96.8%) of the 94 Simple Sequence Repeat (SSR) loci were dinucleotide repeats (39 AC/GT repeats and 52 GA/TC repeats, accounting for 42.9% and 57.1%, resp.), and the other three were mononucleotide repeats. Thirty-six pairs of SSR primers were designed; 24 of them generated clear bands of expected sizes, and 13 of them showed polymorphism. EST-based SSR development: 5659 pineapple EST sequences obtained from NCBI were analyzed; among 1397 nonredundant EST sequences, 843 were found containing 1110 SSR loci (217 of them contained more than one SSR locus). Frequency of SSRs in pineapple EST sequences is 1SSR/3.73 kb, and 44 types were found. Mononucleotide, dinucleotide, and trinucleotide repeats dominate, accounting for 95.6% in total. AG/CT and AGC/GCT were the dominant type of dinucleotide and trinucleotide repeats, accounting for 83.5% and 24.1%, respectively. Thirty pairs of primers were designed for each of randomly selected 30 sequences; 26 of them generated clear and reproducible bands, and 22 of them showed polymorphism. Eighteen pairs of primers obtained by the one or the other of the two methods above that showed polymorphism were selected to carry out germplasm genetic diversity analysis for 48 breeds of pineapple; similarity coefficients of these breeds were between 0.59 and 1.00, and they can be divided into four groups accordingly. Amplification products of five SSR markers were extracted and sequenced, corresponding repeat loci were found and locus mutations are mainly in copy number of repeats and base mutations in the flanking region. PMID:24024187

  3. Genome wide association and genomic prediction for growth traits in juvenile farmed Atlantic salmon using a high density SNP array.

    PubMed

    Tsai, Hsin-Yuan; Hamilton, Alastair; Tinch, Alan E; Guy, Derrick R; Gharbi, Karim; Stear, Michael J; Matika, Oswald; Bishop, Steve C; Houston, Ross D

    2015-11-18

    the additive genetic variation in complex traits. However, the traits of weight and length both appear to be very polygenic with only one SNP surpassing the chromosome-wide threshold. Genomic prediction using the array is effective, leading to an improvement in accuracy compared to pedigree methods, and this improvement can be achieved with only a small subset of the markers in this population. The results have practical relevance for genomic selection in salmon and may also provide insight into variation in the identified genes underpinning body growth and development in salmonid species.

  4. High-Throughput SNP Discovery through Deep Resequencing of a Reduced Representation Library to Anchor and Orient Scaffolds in the Soybean Whole Genome Sequence

    USDA-ARS?s Scientific Manuscript database

    The soybean Consensus Map 4.0 facilitated the anchoring of 95.6% of the soybean whole genome sequence developed by the Joint Genome Institute, Department of Energy but only properly oriented 66% of the sequence scaffolds. To find additional single nucleotide polymorphism (SNP) markers for additiona...

  5. Identification of Novel Markers of Mouse Fetal Ovary Development

    PubMed Central

    Thiagarajan, Rathi D.; Dinger, Marcel E.; Lesieur, Emmanuelle; Chiu, Hansheng; Schulz, Alexandra; Spiller, Cassy; Grimmond, Sean M.; Little, Melissa H.; Koopman, Peter; Wilhelm, Dagmar

    2012-01-01

    In contrast to the developing testis, molecular pathways driving fetal ovarian development have been difficult to characterise. To date no single master regulator of ovarian development has been identified that would be considered the female equivalent of Sry. Using a genomic approach we identified a number of novel protein-coding as well as non-coding genes that were detectable at higher levels in the ovary compared to testis during early mouse gonad development. We were able to cluster these ovarian genes into different temporal expression categories. Of note, Lrrc34 and AK015184 were detected in XX but not XY germ cells before the onset of sex-specific germ cell differentiation marked by entry into meiosis in an ovary and mitotic arrest in a testis. We also defined distinct spatial expression domains of somatic cell genes in the developing ovary. Our data expands the set of markers of early mouse ovary differentiation and identifies a classification of early ovarian genes, thus providing additional avenues with which to dissect this process. PMID:22844512

  6. The Usage of an SNP-SNP Relationship Matrix for Best Linear Unbiased Prediction (BLUP) Analysis Using a Community-Based Cohort Study

    PubMed Central

    Lee, Young-Sup; Kim, Hyeon-Jeong; Cho, Seoae

    2014-01-01

    Best linear unbiased prediction (BLUP) has been used to estimate the fixed effects and random effects of complex traits. Traditionally, genomic relationship matrix-based (GRM) and random marker-based BLUP analyses are prevalent to estimate the genetic values of complex traits. We used three methods: GRM-based prediction (G-BLUP), random marker-based prediction using an identity matrix (so-called single-nucleotide polymorphism [SNP]-BLUP), and SNP-SNP variance-covariance matrix (so-called SNP-GBLUP). We used 35,675 SNPs and R package "rrBLUP" for the BLUP analysis. The SNP-SNP relationship matrix was calculated using the GRM and Sherman-Morrison-Woodbury lemma. The SNP-GBLUP result was very similar to G-BLUP in the prediction of genetic values. However, there were many discrepancies between SNP-BLUP and the other two BLUPs. SNP-GBLUP has the merit to be able to predict genetic values through SNP effects. PMID:25705167

  7. PCR amplification of SNP loci from crude DNA for large-scale genotyping of oomycetes.

    PubMed

    Hu, Jian; Lyon, Rebecca; Zhou, Yuxin; Lamour, Kurt

    2014-01-01

    Similar to other eukaryotes, single nucleotide polymorphism (SNP) markers are abundant in many oomycete plant pathogen genomes. High resolution DNA melting analysis (HR-DMA) is a cost-effective method for SNP genotyping, but like many SNP marker technologies, is limited by the amount and quality of template DNA. We describe PCR preamplification of Phytophthora and Peronospora SNP loci from crude DNA extracted from a small amount of mycelium and/or infected plant tissue to produce sufficient template to genotype at least 10 000 SNPs. The approach is fast, inexpensive, requires minimal biological material and should be useful for many organisms in a variety of contexts.

  8. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography

    PubMed Central

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-01-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined ‘elimination’ status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of M. leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420

  9. Identification of Novel SNP in Promoter Sequence of TaGW2-6A Associated with Grain Weight and Other Agronomic Traits in Wheat (Triticum aestivum L.)

    PubMed Central

    Jaiswal, Vandana; Gahlaut, Vijay; Mathur, Saloni; Agarwal, Priyanka; Khandelwal, Manoj Kumar; Khurana, Jitendra Paul; Tyagi, Akhilesh Kumar; Balyan, Harindra Singh; Gupta, Pushpendra Kumar

    2015-01-01

    TaGW2 is an orthologue of rice gene OsGW2, which encodes E3 RING ubiquitin ligase and controls the grain size in rice. In wheat, three copies of TaGW2 have been identified and mapped on wheat homoeologous group 6 viz. TaGW2-6A, TaGW2-6B and TaGW2-6D. In the present study, using as many as 207 Indian wheat genotypes, we identified four SNPs including two novel SNPs (SNP-988 and SNP-494) in the promoter sequence of TaGW2-6A. All the four SNPs were G/A or A/G substitutions (transitions). Out of the four SNPs, SNP-494 was causal, since it was found associated with grain weight. The mean TGW (41.1 g) of genotypes with the allele SNP-494_A was significantly higher than mean TGW (38.6 g) of genotypes with the allele SNP-494_G. SNP-494 also regulates the expression of TaGW2-6A so that the wheat genotypes with SNP-494_G have higher expression and lower TGW and the genotypes with SNP-494_A have lower expression but higher TGW. Besides, SNP-494 was also found associated with grain length-width ratio, awn length, spike length, grain protein content, peduncle length and plant height. This suggested that gene TaGW2-6A not only controls grain size, but also controls other agronomic traits. In the promoter region, SNP-494 was present in ‘CGCG’ motif that plays an important role in Ca2+/calmodulin mediated regulation of genes. A user-friendly CAPS marker was also developed to identify the desirable allele of causal SNP (SNP-494) for use in marker-assisted selection for improvement of grain weight in wheat. Using four SNPs, five haplotypes were identified; of these, Hap_5 (G_A_G_A) was found to be a desirable haplotype having significantly higher grain weight (41.13g) relative to other four haplotypes (36.33-39.16 g). PMID:26076351

  10. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality

    PubMed Central

    Ali, Shahin S.; Shao, Jonathan; Strem, Mary D.; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W.; Bailey, Bryan A.

    2015-01-01

    Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri. PMID:26379633

  11. Homozygosity mapping with SNP arrays identifies TRIM32, an E3 ubiquitin ligase, as a Bardet–Biedl syndrome gene (BBS11)

    PubMed Central

    Chiang, Annie P.; Beck, John S.; Yen, Hsan-Jan; Tayeh, Marwan K.; Scheetz, Todd E.; Swiderski, Ruth E.; Nishimura, Darryl Y.; Braun, Terry A.; Kim, Kwang-Youn A.; Huang, Jian; Elbedour, Khalil; Carmi, Rivka; Slusarski, Diane C.; Casavant, Thomas L.; Stone, Edwin M.; Sheffield, Val C.

    2006-01-01

    The identification of mutations in genes that cause human diseases has largely been accomplished through the use of positional cloning, which relies on linkage mapping. In studies of rare diseases, the resolution of linkage mapping is limited by the number of available meioses and informative marker density. One recent advance is the development of high-density SNP microarrays for genotyping. The SNP arrays overcome low marker informativity by using a large number of markers to achieve greater coverage at finer resolution. We used SNP microarray genotyping for homozygosity mapping in a small consanguineous Israeli Bedouin family with autosomal recessive Bardet–Biedl syndrome (BBS; obesity, pigmentary retinopathy, polydactyly, hypogonadism, renal and cardiac abnormalities, and cognitive impairment) in which previous linkage studies using short tandem repeat polymorphisms failed to identify a disease locus. SNP genotyping revealed a homozygous candidate region. Mutation analysis in the region of homozygosity identified a conserved homozygous missense mutation in the TRIM32 gene, a gene coding for an E3 ubiquitin ligase. Functional analysis of this gene in zebrafish and expression correlation analyses among other BBS genes in an expression quantitative trait loci data set demonstrate that TRIM32 is a BBS gene. This study shows the value of high-density SNP genotyping for homozygosity mapping and the use of expression correlation data for evaluation of candidate genes and identifies the proteasome degradation pathway as a pathway involved in BBS. PMID:16606853

  12. Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality.

    PubMed

    Ali, Shahin S; Shao, Jonathan; Strem, Mary D; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W; Bailey, Bryan A

    2015-01-01

    Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri.

  13. Characterization of Single-Nucleotide-Polymorphism Markers for Plasmopara viticola, the Causal Agent of Grapevine Downy Mildew▿

    PubMed Central

    Delmotte, F.; Machefer, V.; Giresse, X.; Richard-Cervera, S.; Latorse, M. P.; Beffa, R.

    2011-01-01

    We report 34 new nuclear single-nucleotide-polymorphism (SNP) markers that have been developed from an expressed sequence tag library of Plasmopara viticola, the causal agent of grapevine downy mildew. This newly developed battery of markers will provide useful additional genetic tools for population genetic studies of this important agronomic species. PMID:21926208

  14. Towards the Development of a Molecular Map in Switchgrass: I. Microsatellite Marker Development

    SciTech Connect

    Gunter, L.E.

    2001-08-23

    The long-term goal of the switchgrass breeding program is to improve regionally adapted varieties and increase biomass yield and feedstock quality. Although, to some extent, biomass yields are dependent on environmental constraints, increased yield can be achieved through the development of genotypes with improved seasonal adaptation, tolerance to unfavorable environmental conditions, and improved resistance to pest and disease. To date, improvement in switchgrass has relied on recurrent breeding strategies based on phenotypic or genotypic selection. Yield improvements have been modest by this method. If we expect to make significant increase in yields, we need tools that will allow us to map complex traits and uncover the genes that influence them. A genetic linkage map could be a powerful tool for accelerating switchgrass development through marker-assisted selection, breeding and recombination. This type of mapping requires the development of markers that can be associated with phenotypic traits in a population of known pedigree. The most commonly used markers for mapping include restriction fragment length polymorphisms (RFLP) and simple sequence repeats (SSR). At ORNL, we have been concentrating on the development of SSR markers, while our colleagues at the University of Georgia are developing RFLP markers in order to select parents to produce a mapping population and from there to create a framework map from {approx}100 F1 progeny.

  15. Developing urinary metabolomic signatures as early bladder cancer diagnostic markers.

    PubMed

    Shen, Chong; Sun, Zeyu; Chen, Deying; Su, Xiaoling; Jiang, Jing; Li, Gonghui; Lin, Biaoyang; Yan, Jiajun

    2015-01-01

    Early detection is vital to improve the overall survival rate of bladder cancer (BCa) patients, yet there is a lack of a reliable urine-based assay for early detection of BCa. Urine metabolites represented a potential rich source of biomarkers for BCa. This study aimed to develop a metabolomics approach for high coverage discovery and identification of metabolites in urine samples. Urine samples from 23 early stage BCa patients and 21 healthy volunteers with minimum sample preparations were analyzed by a short 30 min UPLC-HRMS method. We detected and quantified over 9000 unique UPLC-HRMS features, which is more than four times than about 2000 features detected in previous urine metabolomic studies. Furthermore, multivariate OPLS-DA classification models were established to differentiate urine samples from bladder cancer cohort and normal health cohort. We identified three BCa-upregulated metabolites: nicotinuric acid, trehalose, AspAspGlyTrp, and three BCa-downregulated metabolites: inosinic acid, ureidosuccinic acid, GlyCysAlaLys. Finally, analysis of six post-surgery BCa urine samples showed that these BCa-metabolomic features reverted to normal state after tumor removal, suggesting that they reflected metabolomic features associated with BCa. ROC analyses using two linear regression models to combine the identified markers showed a high diagnostic performance for detecting BCa with AUC (area under the ROC curve) values of 0.919 to 0.934. In summary, we developed a high coverage metabolomic approach that has potential for biomarker discovery in cancers.

  16. Development of highly reliable in silico SNP resource and genotyping assay from exome capture and sequencing: an example from black spruce (Picea mariana).

    PubMed

    Pavy, Nathalie; Gagnon, France; Deschênes, Astrid; Boyle, Brian; Beaulieu, Jean; Bousquet, Jean

    2016-03-01

    Picea mariana is a widely distributed boreal conifer across Canada and the subject of advanced breeding programmes for which population genomics and genomic selection approaches are being developed. Targeted sequencing was achieved after capturing P. mariana exome with probes designed from the sequenced transcriptome of Picea glauca, a distant relative. A high capture efficiency of 75.9% was reached although spruce has a complex and large genome including gene sequences interspersed by some long introns. The results confirmed the relevance of using probes from congeneric species to perform successfully interspecific exome capture in the genus Picea. A bioinformatics pipeline was developed including stringent criteria that helped detect a set of 97,075 highly reliable in silico SNPs. These SNPs were distributed across 14,909 genes. Part of an Infinium iSelect array was used to estimate the rate of true positives by validating 4267 of the predicted in silico SNPs by genotyping trees from P. mariana populations. The true positive rate was 96.2% for in silico SNPs, compared to a genotyping success rate of 96.7% for a set 1115 P. mariana control SNPs recycled from previous genotyping arrays. These results indicate the high success rate of the genotyping array and the relevance of the selection criteria used to delineate the new P. mariana in silico SNP resource. Furthermore, in silico SNPs were generally of medium to high frequency in natural populations, thus providing high informative value for future population genomics applications. © 2015 John Wiley & Sons Ltd.

  17. Whole genome-wide association study using affymetrix SNP chip: a two-stage sequential selection method to identify genes that increase the risk of developing complex diseases.

    PubMed

    Yang, Howard H; Hu, Nan; Taylor, Philip R; Lee, Maxwell P

    2008-01-01

    Whole-genome association studies of complex diseases hold great promise to identify systematically genetic loci that influence one's risk of developing these diseases. However, the polygenic nature of the complex diseases and genetic interactions among the genes pose significant challenge in both experimental design and data analysis. High-density genotype data make it possible to identify most of the genetic loci that may be involved in the etiology. On the other hand, utilizing large number of statistic tests could lead to false positives if the tests are not adequately adjusted. In this paper, we discuss a two-stage method that sequentially applies a generalized linear model (GLM) and principal components analysis (PCA) to identify genetic loci that jointly determine the likelihood of developing disease. The method was applied to a pilot case-control study of esophageal squamous cell carcinoma (ESCC) that included 50 ESCC patients and 50 neighborhood-matched controls. Genotype data were determined by using the Affymetrix 10K SNP chip. We will discuss some of the special considerations that are important to the proper interpretation of whole genome-wide association studies, which include multiple comparisons, epistatic interaction among multiple genetic loci, and generalization of predictive models.

  18. Performance of the SNPforID 52 SNP-plex assay in paternity testing.

    PubMed

    Børsting, Claus; Sanchez, Juan J; Hansen, Hanna E; Hansen, Anders J; Bruun, Hanne Q; Morling, Niels

    2008-09-01

    The performance of a multiplex assay with 52 autosomal single nucleotide polymorphisms (SNPs) developed for human identification was tested on 124 mother-child-father trios. The typical paternity indices (PIs) were 10(5)-10(6) for the trios and 10(3)-10(4) for the child-father duos. Using the SNP profiles from the randomly selected trios and 700 previously typed individuals, a total of 83,096 comparisons between mother, child and an unrelated man were performed. On average, 9-10 mismatches per comparison were detected. Four mismatches were genetic inconsistencies and 5-6 mismatches were opposite homozygosities. In only two of the 83,096 comparisons did an unrelated man match perfectly to a mother-child duo, and in both cases the PI of the true father was much higher than the PI of the unrelated man. The trios were also typed for 15 short tandem repeats (STRs) and seven variable number of tandem repeats (VNTRs). The typical PIs based on 15 STRs or seven VNTRs were 5-50 times higher than the typical PIs based on 52 SNPs. Six mutations in tandem repeats were detected among the randomly selected trios. In contrast, there was not found any mutations in the SNP loci. The results showed that the 52 SNP-plex assay is a very useful alternative to currently used methods in relationship testing. The usefulness of SNP markers with low mutation rates in paternity and immigration casework is discussed.

  19. Utilization of a whole genome SNP panel for efficient genetic mapping in the mouse

    PubMed Central

    Moran, Jennifer L.; Bolton, Andrew D.; Tran, Pamela V.; Brown, Alison; Dwyer, Noelle D.; Manning, Danielle K.; Bjork, Bryan C.; Li, Cheng; Montgomery, Kate; Siepka, Sandra M.; Vitaterna, Martha Hotz; Takahashi, Joseph S.; Wiltshire, Tim; Kwiatkowski, David J.; Kucherlapati, Raju; Beier, David R.

    2006-01-01

    Phenotype-driven genetics can be used to create mouse models of human disease and birth defects. However, the utility of these mutant models is limited without identification of the causal gene. To facilitate genetic mapping, we developed a fixed single nucleotide polymorphism (SNP) panel of 394 SNPs as an alternative to analyses using simple sequence length polymorphism (SSLP) marker mapping. With the SNP panel, chromosomal locations for 22 monogenic mutants were identified. The average number of affected progeny genotyped for mapped monogenic mutations is nine. Map locations for several mutants have been obtained with as few as four affected progeny. The average size of genetic intervals obtained for these mutants is 43 Mb, with a range of 17–83 Mb. Thus, our SNP panel allows for identification of moderate resolution map position with small numbers of mice in a high-throughput manner. Importantly, the panel is suitable for mapping crosses from many inbred and wild-derived inbred strain combinations. The chromosomal localizations obtained with the SNP panel allow one to quickly distinguish between potentially novel loci or remutations in known genes, and facilitates fine mapping and positional cloning. By using this approach, we identified DNA sequence changes in two ethylnitrosourea-induced mutants. PMID:16461637

  20. Estimating genomic diversity and population differentiation - an empirical comparison of microsatellite and SNP variation in Arabidopsis halleri.

    PubMed

    Fischer, Martin C; Rellstab, Christian; Leuzinger, Marianne; Roumet, Marie; Gugerli, Felix; Shimizu, Kentaro K; Holderegger, Rolf; Widmer, Alex

    2017-01-11

    Microsatellite markers are widely used for estimating genetic diversity within and differentiation among populations. However, it has rarely been tested whether such estimates are useful proxies for genome-wide patterns of variation and differentiation. Here, we compared microsatellite variation with genome-wide single nucleotide polymorphisms (SNPs) to assess and quantify potential marker-specific biases and derive recommendations for future studies. Overall, we genotyped 180 Arabidopsis halleri individuals from nine populations using 20 microsatellite markers. Twelve of these markers were originally developed for Arabidopsis thaliana (cross-species markers) and eight for A. halleri (species-specific markers). We further characterized 2 million SNPs across the genome with a pooled whole-genome re-sequencing approach (Pool-Seq). Our analyses revealed that estimates of genetic diversity and differentiation derived from cross-species and species-specific microsatellites differed substantially and that expected microsatellite heterozygosity (SSR-H e) was not significantly correlated with genome-wide SNP diversity estimates (SNP-H e and θ Watterson) in A. halleri. Instead, microsatellite allelic richness (A r) was a better proxy for genome-wide SNP diversity. Estimates of genetic differentiation among populations (F ST) based on both marker types were correlated, but microsatellite-based estimates were significantly larger than those from SNPs. Possible causes include the limited number of microsatellite markers used, marker ascertainment bias, as well as the high variance in microsatellite-derived estimates. In contrast, genome-wide SNP data provided unbiased estimates of genetic diversity independent of whether genome- or only exome-wide SNPs were used. Further, we inferred that a few thousand random SNPs are sufficient to reliably estimate genome-wide diversity and to distinguish among populations differing in genetic variation. We recommend that future analyses of

  1. Development of novel InDel markers and genetic diversity in Chenopodium quinoa through whole-genome re-sequencing.

    PubMed

    Zhang, Tifu; Gu, Minfeng; Liu, Yuhe; Lv, Yuanda; Zhou, Ling; Lu, Haiyan; Liang, Shuaiqiang; Bao, Huabin; Zhao, Han

    2017-09-05

    Quinoa (Chenopodium quinoa Willd.) is a balanced nutritional crop, but its breeding improvement has been limited by the lack of information on its genetics and genomics. Therefore, it is necessary to obtain knowledge on genomic variation, population structure, and genetic diversity and to develop novel Insertion/Deletion (InDel) markers for quinoa by whole-genome re-sequencing. We re-sequenced 11 quinoa accessions and obtained a coverage depth between approximately 7× to 23× the quinoa genome. Based on the 1453-megabase (Mb) assembly from the reference accession Riobamba, 8,441,022 filtered bi-allelic single nucleotide polymorphisms (SNPs) and 842,783 filtered InDels were identified, with an estimated SNP and InDel density of 5.81 and 0.58 per kilobase (kb). From the genomic InDel variations, 85 dimorphic InDel markers were newly developed and validated. Together with the 62 simple sequence repeat (SSR) markers reported, a total of 147 markers were used for genotyping the 129 quinoa accessions. Molecular grouping analysis showed classification into two major groups, the Andean highland (composed of the northern and southern highland subgroups) and Chilean coastal, based on combined STRUCTURE, phylogenetic tree and PCA (Principle Component Analysis) analyses. Further analysis of the genetic diversity exhibited a decreasing tendency from the Chilean coast group to the Andean highland group, and the gene flow between subgroups was more frequent than that between the two subgroups and the Chilean coastal group. The majority of the variations (approximately 70%) were found through an analysis of molecular variation (AMOVA) due to the diversity between the groups. This was congruent with the observation of a highly significant FST value (0.705) between the groups, demonstrating significant genetic differentiation between the Andean highland type of quinoa and the Chilean coastal type. Moreover, a core set of 16 quinoa germplasms that capture all 362 alleles was

  2. SNP array and phenotype correlation shows that FLI1 deletion per se is not responsible for thrombocytopenia development in Jacobsen syndrome.

    PubMed

    Trkova, Marie; Becvarova, Vera; Hynek, Martin; Hnykova, Lenka; Hlavova, Eva; Kreckova, Gabriela; Kulovany, Eduard; Cutka, David; Zatloukalova, Jitka; Markova, Kristyna; Sukova, Martina; Horacek, Jiri; Stejskal, David

    2012-10-01

    Jacobsen syndrome (JBS) is a rare chromosomal disorder caused by terminal deletion of the long arm of chromosome 11. We report on four prenatally diagnosed patients with JBS with variable prenatal and postnatal phenotypes and 11q deletions of varying sizes. Precise characterization of the deleted region in three patients was performed by SNP arrays. The severity of both the prenatal and postnatal phenotypes did not correlate with the size of the haploinsufficient region. Despite the large difference in the deletion size (nearly 6 Mb), both of the live-born patients had similar phenotypes corresponding to JBS. However, one of the most prominent features of JBS, thrombocytopenia, was only present in the live-born boy. The girl, who had a significantly longer deletion spanning all four genes suspected of being causative of JBS-related thrombocytopenia (FLI1, ETS1, NFRKB, and JAM3), did not manifest a platelet phenotype. Therefore, our findings do not support the traditional view of deletion size correlation in JBS or the causative role of FLI1, ETS1, NFRKB, and JAM3 deletion per se for the development of disease-related thrombocytopenia.

  3. Construction of a genetic linkage map for cultivated peanut and development of QTLs/markers for marker-assisted breeding

    USDA-ARS?s Scientific Manuscript database

    Several genetic maps based on recombinant inbred line (RIL) and backcross (BC) populations have been developed for tetraploid peanut recently. The marker density, however, is still very low especially in context of large genome size (2,800Mb/1C) and 20 linkage groups (LGs). Therefore, improvement of...

  4. Development and preliminary evaluation of a 90 K Axiom® SNP array for the allo-octoploid cultivated strawberry Fragaria × ananassa.

    PubMed

    Bassil, Nahla V; Davis, Thomas M; Zhang, Hailong; Ficklin, Stephen; Mittmann, Mike; Webster, Teresa; Mahoney, Lise; Wood, David; Alperin, Elisabeth S; Rosyara, Umesh R; Koehorst-Vanc Putten, Herma; Monfort, Amparo; Sargent, Daniel J; Amaya, Iraida; Denoyes, Beatrice; Bianco, Luca; van Dijk, Thijs; Pirani, Ali; Iezzoni, Amy; Main, Dorrie; Peace, Cameron; Yang, Yilong; Whitaker, Vance; Verma, Sujeet; Bellon, Laurent; Brew, Fiona; Herrera, Raul; van de Weg, Eric

    2015-03-07

    A high-throughput genotyping platform is needed to enable marker-assisted breeding in the allo-octoploid cultivated strawberry Fragaria × ananassa. Short-read sequences from one diploid and 19 octoploid accessions were aligned to the diploid Fragaria vesca 'Hawaii 4' reference genome to identify single nucleotide polymorphisms (SNPs) and indels for incorporation into a 90 K Affymetrix® Axiom® array. We report the development and preliminary evaluation of this array. About 36 million sequence variants were identified in a 19 member, octoploid germplasm panel. Strategies and filtering pipelines were developed to identify and incorporate markers of several types: di-allelic SNPs (66.6%), multi-allelic SNPs (1.8%), indels (10.1%), and ploidy-reducing "haploSNPs" (11.7%). The remaining SNPs included those discovered in the diploid progenitor F. iinumae (3.9%), and speculative "codon-based" SNPs (5.9%). In genotyping 306 octoploid accessions, SNPs were assigned to six classes with Affymetrix's "SNPolisher" R package. The highest quality classes, PolyHigh Resolution (PHR), No Minor Homozygote (NMH), and Off-Target Variant (OTV) comprised 25%, 38%, and 1% of array markers, respectively. These markers were suitable for genetic studies as demonstrated in the full-sib family 'Holiday' × 'Korona' with the generation of a genetic linkage map consisting of 6,594 PHR SNPs evenly distributed across 28 chromosomes with an average density of approximately one marker per 0.5 cM, thus exceeding our goal of one marker per cM. The Affymetrix IStraw90 Axiom array is the first high-throughput genotyping platform for cultivated strawberry and is commercially available to the worldwide scientific community. The array's high success rate is likely driven by the presence of naturally occurring variation in ploidy level within the nominally octoploid genome, and by effectiveness of the employed array design and ploidy-reducing strategies. This array enables genetic analyses

  5. RAD sequencing yields a high success rate for westslope cutthroat and rainbow trout species-diagnostic SNP assays

    USGS Publications Warehouse

    Stephen J. Amish,; Paul A. Hohenlohe,; Sally Painter,; Robb F. Leary,; Muhlfeld, Clint C.; Fred W. Allendorf,; Luikart, Gordon

    2012-01-01

    Hybridization with introduced rainbow trout threatens most native westslope cutthroat trout populations. Understanding the genetic effects of hybridization and introgression requires a large set of high-throughput, diagnostic genetic markers to inform conservation and management. Recently, we identified several thousand candidate single-nucleotide polymorphism (SNP) markers based on RAD sequencing of 11 westslope cutthroat trout and 13 rainbow trout individuals. Here, we used flanking sequence for 56 of these candidate SNP markers to design high-throughput genotyping assays. We validated the assays on a total of 92 individuals from 22 populations and seven hatchery strains. Forty-six assays (82%) amplified consistently and allowed easy identification of westslope cutthroat and rainbow trout alleles as well as heterozygote controls. The 46 SNPs will provide high power for early detection of population admixture and improved identification of hybrid and nonhybridized individuals. This technique shows promise as a very low-cost, reliable and relatively rapid method for developing and testing SNP markers for nonmodel organisms with limited genomic resources.

  6. RAD sequencing yields a high success rate for westslope cutthroat and rainbow trout species-diagnostic SNP assays.

    PubMed

    Amish, Stephen J; Hohenlohe, Paul A; Painter, Sally; Leary, Robb F; Muhlfeld, Clint; Allendorf, Fred W; Luikart, Gordon

    2012-07-01

    Hybridization with introduced rainbow trout threatens most native westslope cutthroat trout populations. Understanding the genetic effects of hybridization and introgression requires a large set of high-throughput, diagnostic genetic markers to inform conservation and management. Recently, we identified several thousand candidate single-nucleotide polymorphism (SNP) markers based on RAD sequencing of 11 westslope cutthroat trout and 13 rainbow trout individuals. Here, we used flanking sequence for 56 of these candidate SNP markers to design high-throughput genotyping assays. We validated the assays on a total of 92 individuals from 22 populations and seven hatchery strains. Forty-six assays (82%) amplified consistently and allowed easy identification of westslope cutthroat and rainbow trout alleles as well as heterozygote controls. The 46 SNPs will provide high power for early detection of population admixture and improved identification of hybrid and nonhybridized individuals. This technique shows promise as a very low-cost, reliable and relatively rapid method for developing and testing SNP markers for nonmodel organisms with limited genomic resources.

  7. A Diagnostic Marker to Discriminate Childhood Apraxia of Speech From Speech Delay: I. Development and Description of the Pause Marker.

    PubMed

    Shriberg, Lawrence D; Strand, Edythe A; Fourakis, Marios; Jakielski, Kathy J; Hall, Sheryl D; Karlsson, Heather B; Mabie, Heather L; McSweeny, Jane L; Tilkens, Christie M; Wilson, David L

    2017-04-14

    The goal of this article (PM I) is to describe the rationale for and development of the Pause Marker (PM), a single-sign diagnostic marker proposed to discriminate early or persistent childhood apraxia of speech from speech delay. The authors describe and prioritize 7 criteria with which to evaluate the research and clinical utility of a diagnostic marker for childhood apraxia of speech, including evaluation of the present proposal. An overview is given of the Speech Disorders Classification System, including extensions completed in the same approximately 3-year period in which the PM was developed. The finalized Speech Disorders Classification System includes a nosology and cross-classification procedures for childhood and persistent speech disorders and motor speech disorders (Shriberg, Strand, & Mabie, 2017). A PM is developed that provides procedural and scoring information, and citations to papers and technical reports that include audio exemplars of the PM and reference data used to standardize PM scores are provided. The PM described here is an acoustic-aided perceptual sign that quantifies one aspect of speech precision in the linguistic domain of phrasing. This diagnostic marker can be used to discriminate early or persistent childhood apraxia of speech from speech delay.

  8. A single base substitution in BADH/AMADH is responsible for fragrance in cucumber (Cucumis sativus L.), and development of SNAP markers for the fragrance.

    PubMed

    Yundaeng, Chutintorn; Somta, Prakit; Tangphatsornruang, Sithichoke; Chankaew, Sompong; Srinives, Peerasak

    2015-09-01

    Sequence analysis revealed that an SNP (A1855G) in CsBADH of cucumber accession PK2011T202 causes amino acid change in a highly conserved motif, Y163C. Gene mapping showed association between the SNP and the fragrance. Pandan-like fragrance is a value-added trait in several food crops such as rice, vegetable soybean and sorghum. The fragrance is caused by the volatile chemical 2-acetyl-1-pyrroline (2AP). Mutation(s) in betaine aldehyde dehydrogenase 2 (BADH2; also known as aminoaldehyde dehydrogenase 2) gene causes defective BADH2 and results in biosynthesis of 2AP. Recently, cucumber cultivars possessing pandan-like fragrance were discovered in Thailand. In this study, we report an association between CsBADH and the fragrance in cucumber accession "PK2011T202". Gene expression analysis of CsBADH in leaves of PK2011T202 and "301176" (non-fragrant) at various growth stages revealed that CsBADH was expressed in both accessions. Sequence comparison of CsBADH showed that PK2011T202 possesses a single base substitution (A1855G) in exon 5 which causes an amino acid change in a highly conserved motif of BADH, Y163C. Single nucleotide-amplified polymorphism markers were developed to detect the SNP polymorphism between the wild-type and fragrance alleles. Since CsBADH is located on chromosome 1, quantitative trait locus (QTL) mapping was conducted for this chromosome using an F2 and a backcross populations developed from the cross between PK2011T202 and 301176. QTL analysis in both populations showed that the major QTL for fragrance, qFgr, was co-localized with the CsBADH. We concluded that the defect function of CsBADH is responsible for fragrance in cucumber PK2011T202.

  9. Developing diagnostic SNP panels for the identification of true fruit flies (Diptera: Tephritidae) within the limits of COI-based species delimitation

    PubMed Central

    2013-01-01

    Background Rapid and reliable identification of quarantine pests is essential for plant inspection services to prevent introduction of invasive species. For insects, this may be a serious problem when dealing with morphologically similar cryptic species complexes and early developmental stages that lack distinctive characters useful for taxonomic identification. DNA based barcoding could solve many of these problems. The standard barcode fragment, an approx. 650 base pairs long sequence of the 5′end of the mitochondrial cytochrome oxidase I (COI), enables differentiation of a very wide range of arthropods. However, problems remain in some taxa, such as Tephritidae, where recent genetic differentiation among some of the described species hinders accurate molecular discrimination. Results In order to explore the full species discrimination potential of COI, we sequenced the barcoding region of the COI gene of a range of economically important Tephritid species and complemented these data with all GenBank and BOLD entries for the systematic group available as of January 2012. We explored the limits of species delimitation of this barcode fragment among 193 putative Tephritid species and established operational taxonomic units (OTUs), between which discrimination is reliably possible. Furthermore, to enable future development of rapid diagnostic assays based on this sequence information, we characterized all single nucleotide polymorphisms (SNPs) and established “near-minimal” sets of SNPs that differentiate among all included OTUs with at least three and four SNPs, respectively. Conclusions We found that although several species cannot be differentiated based on the genetic diversity observed in COI and hence form composite OTUs, 85% of all OTUs correspond to described species. Because our SNP panels are developed based on all currently available sequence information and rely on a minimal pairwise difference of three SNPs, they are highly reliable and hence

  10. Developing diagnostic SNP panels for the identification of true fruit flies (Diptera: Tephritidae) within the limits of COI-based species delimitation.

    PubMed

    Frey, Juerg E; Guillén, Larissa; Frey, Beatrice; Samietz, Joerg; Rull, Juan; Aluja, Martín

    2013-05-29

    Rapid and reliable identification of quarantine pests is essential for plant inspection services to prevent introduction of invasive species. For insects, this may be a serious problem when dealing with morphologically similar cryptic species complexes and early developmental stages that lack distinctive characters useful for taxonomic identification. DNA based barcoding could solve many of these problems. The standard barcode fragment, an approx. 650 base pairs long sequence of the 5'end of the mitochondrial cytochrome oxidase I (COI), enables differentiation of a very wide range of arthropods. However, problems remain in some taxa, such as Tephritidae, where recent genetic differentiation among some of the described species hinders accurate molecular discrimination. In order to explore the full species discrimination potential of COI, we sequenced the barcoding region of the COI gene of a range of economically important Tephritid species and complemented these data with all GenBank and BOLD entries for the systematic group available as of January 2012. We explored the limits of species delimitation of this barcode fragment among 193 putative Tephritid species and established operational taxonomic units (OTUs), between which discrimination is reliably possible. Furthermore, to enable future development of rapid diagnostic assays based on this sequence information, we characterized all single nucleotide polymorphisms (SNPs) and established "near-minimal" sets of SNPs that differentiate among all included OTUs with at least three and four SNPs, respectively. We found that although several species cannot be differentiated based on the genetic diversity observed in COI and hence form composite OTUs, 85% of all OTUs correspond to described species. Because our SNP panels are developed based on all currently available sequence information and rely on a minimal pairwise difference of three SNPs, they are highly reliable and hence represent an important resource for

  11. High-throughput SNP-genotyping analysis of the relationships among Ponto-Caspian sturgeon species

    PubMed Central

    Rastorguev, Sergey M; Nedoluzhko, Artem V; Mazur, Alexander M; Gruzdeva, Natalia M; Volkov, Alexander A; Barmintseva, Anna E; Mugue, Nikolai S; Prokhortchouk, Egor B

    2013-01-01

    Abstract Legally certified sturgeon fisheries require population protection and conservation methods, including DNA tests to identify the source of valuable sturgeon roe. However, the available genetic data are insufficient to distinguish between different sturgeon populations, and are even unable to distinguish between some species. We performed high-throughput single-nucleotide polymorphism (SNP)-genotyping analysis on different populations of Russian (Acipenser gueldenstaedtii), Persian (A. persicus), and Siberian (A. baerii) sturgeon species from the Caspian Sea region (Volga and Ural Rivers), the Azov Sea, and two Siberian rivers. We found that Russian sturgeons from the Volga and Ural Rivers were essentially indistinguishable, but they differed from Russian sturgeons in the Azov Sea, and from Persian and Siberian sturgeons. We identified eight SNPs that were sufficient to distinguish these sturgeon populations with 80% confidence, and allowed the development of markers to distinguish sturgeon species. Finally, on the basis of our SNP data, we propose that the A. baerii-like mitochondrial DNA found in some Russian sturgeons from the Caspian Sea arose via an introgression event during the Pleistocene glaciation. In the present study, the high-throughput genotyping analysis of several sturgeon populations was performed. SNP markers for species identification were defined. The possible explanation of the baerii-like mitotype presence in some Russian sturgeons in the Caspian Sea was suggested. PMID:24567827

  12. High-throughput SNP-genotyping analysis of the relationships among Ponto-Caspian sturgeon species.

    PubMed

    Rastorguev, Sergey M; Nedoluzhko, Artem V; Mazur, Alexander M; Gruzdeva, Natalia M; Volkov, Alexander A; Barmintseva, Anna E; Mugue, Nikolai S; Prokhortchouk, Egor B

    2013-08-01

    Legally certified sturgeon fisheries require population protection and conservation methods, including DNA tests to identify the source of valuable sturgeon roe. However, the available genetic data are insufficient to distinguish between different sturgeon populations, and are even unable to distinguish between some species. We performed high-throughput single-nucleotide polymorphism (SNP)-genotyping analysis on different populations of Russian (Acipenser gueldenstaedtii), Persian (A. persicus), and Siberian (A. baerii) sturgeon species from the Caspian Sea region (Volga and Ural Rivers), the Azov Sea, and two Siberian rivers. We found that Russian sturgeons from the Volga and Ural Rivers were essentially indistinguishable, but they differed from Russian sturgeons in the Azov Sea, and from Persian and Siberian sturgeons. We identified eight SNPs that were sufficient to distinguish these sturgeon populations with 80% confidence, and allowed the development of markers to distinguish sturgeon species. Finally, on the basis of our SNP data, we propose that the A. baerii-like mitochondrial DNA found in some Russian sturgeons from the Caspian Sea arose via an introgression event during the Pleistocene glaciation. In the present study, the high-throughput genotyping analysis of several sturgeon populations was performed. SNP markers for species identification were defined. The possible explanation of the baerii-like mitotype presence in some Russian sturgeons in the Caspian Sea was suggested.

  13. SNP Discovery Using Next Generation Transcriptomic Sequencing in Atlantic Herring (Clupea harengus)

    PubMed Central

    Bekkevold, Dorte; Babbucci, Massimiliano; van Houdt, Jeroen; Maes, Gregory E.; Bargelloni, Luca; Nielsen, Rasmus O.; Taylor, Martin I.; Ogden, Rob; Cariani, Alessia; Carvalho, Gary R.; Consortium, FishPopTrace; Panitz, Frank

    2012-01-01

    The introduction of Next Generation Sequencing (NGS) has revolutionised population genetics, providing studies of non-model species with unprecedented genomic coverage, allowing evolutionar