Bungartz, Annemarie; Klaus, Marius; Mathew, Boby; Léon, Jens; Naz, Ali Ahmad
2016-03-01
The aim of the present study was to develop a new cost-effective, PCR-based CAPS marker set using the advantages of high-throughput SNP genotyping. Initially, an SNP survey of 20 diverse barley genotypes was made via 9k iSelect array genotyping, which yielded 6334 polymorphic SNP markers. Principal component analysis using these marker data showed fine differentiation of the diverse barley gene pool. To this end, we developed 200 SNP-derived CAPS markers distributed across the genome, covering around 991 cM with an average marker density of 5.09 cM. Further, we genotyped 68 CAPS markers in an F2 population (Cheri×ICB181160) segregating for seed color variation in barley. Genetic mapping of seed color revealed putative linkage of a single nuclear gene on chromosome 1H. These findings provide proof of concept for the development and utility of a newer cost-effective genomic tool kit to analyze broader genetic resources of barley worldwide. Copyright © 2016 Elsevier Inc. All rights reserved.
Qi, L L; Talukder, Z I; Hulke, B S; Foley, M E
2017-06-01
Diagnostic DNA markers are an invaluable resource in breeding programs for successful introgression and pyramiding of disease resistance genes. Resistance to downy mildew (DM) disease in sunflower is mediated by Pl genes which are known to be effective against the causal fungus, Plasmopara halstedii. Two DM resistance genes, PlArg and Pl8, are highly effective against P. halstedii races in the USA, and have been previously mapped to the sunflower linkage groups (LGs) 1 and 13, respectively, using simple sequence repeat (SSR) markers. In this study, we developed high-density single nucleotide polymorphism (SNP) maps encompassing the PlArg and Pl8 genes and identified diagnostic SNP markers closely linked to these genes. The specificity of the diagnostic markers was validated in a highly diverse panel of 548 sunflower lines. Dissection of a large marker cluster co-segregated with PlArg revealed that the closest SNP markers NSA_007595 and NSA_001835 delimited PlArg to an interval of 2.83 Mb on the LG1 physical map. The SNP markers SFW01497 and SFW06597 delimited Pl8 to an interval of 2.85 Mb on the LG13 physical map. We also developed sunflower lines with homozygous, three gene pyramids carrying PlArg, Pl8, and the sunflower rust resistance gene R12 using the linked SNP markers from a segregating F2 population of RHA 340 (carrying Pl8)/RHA 464 (carrying PlArg and R12). The high-throughput diagnostic SNP markers developed in this study will facilitate marker-assisted selection breeding, and the pyramided sunflower lines will provide durable resistance to downy mildew and rust diseases.
Xiao, Shijun; Wang, Panpan; Dong, Linsong; Zhang, Yaguang; Han, Zhaofang; Wang, Qiurong
2016-01-01
Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for association and conservation studies. Genome-wide SNP development in many teleost species is still challenging because of genome complexity and the cost of re-sequencing. Genotyping-by-sequencing (GBS) provides an efficient reduced-representation method that lowers the cost of SNP detection; however, most recent GBS applications have been reported in plants. In this work, we applied an EcoRI-NlaIII based GBS protocol to a teleost, the large yellow croaker, an important commercial fish in China and East Asia, and report the first whole-genome SNP development for the species. A total of 69,845 high-quality SNP markers, evenly distributed along the genome, were detected in at least 80% of 500 individuals. Nearly 95% of randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. Association studies with muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, accounting for up to ∼63% of the genetic variance explained by all markers. Functional genes involved in the fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, the PPT2 gene, previously identified in an association study of plasma n-3 and n-6 polyunsaturated fatty acid levels in humans, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS can produce quality SNP markers in a cost-efficient manner in a teleost genome. The developed SNP markers and the EPA- and DHA-associated SNP loci provide invaluable resources for studies of population structure, conservation genetics and genomic selection in large yellow croaker and other fish. PMID:28028455
Cho, Young-Il; Ahn, Yul-Kyun; Tripathi, Swati; Kim, Jeong-Ho; Lee, Hye-Eun; Kim, Do-Sun
2015-01-01
Numerous studies using single nucleotide polymorphisms (SNPs) have been conducted in humans and other animals, and in major crops, including rice, soybean, and Chinese cabbage. However, the number of SNP studies in cabbage is limited. In the present study, we evaluated whether 7,645 SNPs previously identified as molecular markers linked to disease resistance in the Brassica rapa genome could be applied to B. oleracea. In a BLAST analysis using the SNP sequences of B. rapa and B. oleracea genomic sequence data registered in the NCBI database, 256 genes for which SNPs had been identified in B. rapa were found in B. oleracea. These genes were classified into three functional groups: molecular function (64 genes), biological process (96 genes), and cellular component (96 genes). A total of 693 SNP markers, including 145 SNP markers [BRH: developed from the B. rapa genome for high-resolution melt (HRM) analysis], 425 SNP markers (BRP: based on the B. rapa genome that could be applied to B. oleracea), and 123 new SNP markers (BRS: derived from BRP and designed for HRM analysis), were investigated for their ability to amplify sequences from cabbage genomic DNA. In total, 425 of the SNP markers (BRP: based on the B. rapa genome), selected from 7,645 SNPs, were successfully applied to B. oleracea. Using PCR, 108 of 145 BRH (74.5%), 415 of 425 BRP (97.6%), and 118 of 123 BRS (95.9%) showed amplification, suggesting that it is possible to apply SNP markers developed based on the B. rapa genome to B. oleracea. These results provide valuable information that can be utilized in cabbage genetics and breeding programs using molecular markers derived from other Brassica species. PMID:25790283
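The amplification percentages quoted in the abstract above follow directly from the reported counts. A minimal Python sketch that recomputes them is given here as a consistency check; the names MARKER_SETS and amplification_rate are illustrative and do not come from the study.

```python
# Counts as reported in the abstract: (markers that amplified, markers tested).
MARKER_SETS = {
    "BRH": (108, 145),
    "BRP": (415, 425),
    "BRS": (118, 123),
}


def amplification_rate(amplified: int, tested: int) -> float:
    """Return the percentage of tested markers that amplified, to one decimal."""
    return round(100.0 * amplified / tested, 1)


# Percentage per marker set, plus the total number of markers tested.
rates = {name: amplification_rate(a, t) for name, (a, t) in MARKER_SETS.items()}
total_tested = sum(tested for _, tested in MARKER_SETS.values())

if __name__ == "__main__":
    for name, rate in rates.items():
        print(f"{name}: {rate}%")
    print(f"Total markers tested: {total_tested}")
```

The three set sizes sum to the 693 markers mentioned in the abstract, and the recomputed rates agree with the published 74.5%, 97.6% and 95.9%.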
Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J
2012-05-25
A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. 
The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the 'Golden Delicious' reference sequence will assist in the continued improvement of the genome sequence assembly for that variety. PMID:22631220
Fu, Yong-Bi; Yang, Mo-Hua; Zeng, Fangqin; Biligetu, Bill
2017-01-01
Molecular plant breeding with the aid of molecular markers has played an important role in modern plant breeding over the last two decades. Many marker-based predictions for quantitative traits have been made to enhance parental selection, but the trait prediction accuracy remains generally low, even with the aid of dense, genome-wide SNP markers. To search for more accurate trait-specific prediction with informative SNP markers, we conducted a literature review on the prediction issues in molecular plant breeding and on the applicability of an RNA-Seq technique for developing function-associated specific trait (FAST) SNP markers. To understand whether and how FAST SNP markers could enhance trait prediction, we also reasoned theoretically about the effectiveness of these markers in trait-specific prediction and verified the reasoning through computer simulation. In the end, the search yielded an alternative to regular genomic selection with FAST SNP markers that could be explored to achieve more accurate trait-specific prediction. A continued search for better alternatives is encouraged to enhance marker-based predictions for an individual quantitative trait in molecular plant breeding. PMID:28729875
Using Next Generation Sequencing for Multiplexed Trait-Linked Markers in Wheat
Bernardo, Amy; Wang, Shan; St. Amand, Paul; Bai, Guihua
2015-01-01
With the advent of next generation sequencing (NGS) technologies, single nucleotide polymorphisms (SNPs) have become the major type of marker for genotyping in many crops. However, the availability of SNP markers for important traits of bread wheat (Triticum aestivum L.) that can be effectively used in marker-assisted selection (MAS) is still limited, and SNP assays for MAS are usually uniplex. A shift from uniplex to multiplex assays will allow the simultaneous analysis of multiple markers and increase MAS efficiency. We designed 33 locus-specific markers from SNP or indel-based marker sequences that are linked to 20 different quantitative trait loci (QTL) or genes of agronomic importance in wheat and analyzed the amplicon sequences using an Ion Torrent Proton Sequencer and a custom allele detection pipeline to determine the genotypes of 24 selected germplasm accessions. Among the 33 markers, 27 were successfully multiplexed and 23 had 100% SNP call rates. Results from analysis of "kompetitive allele-specific PCR" (KASP) and sequence tagged site (STS) markers developed from the same loci fully verified the genotype calls of 23 markers. The NGS-based multiplexed assay developed in this study is suitable for rapid and high-throughput screening of SNPs and some indel-based markers in wheat. PMID:26625271
USDA-ARS's Scientific Manuscript database
Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in...
Kongchum, Pawapol; Palti, Yniv; Hallerman, Eric M; Hulata, Gideon; David, Lior
2010-08-01
Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers for susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpesvirus 3 (CyHV-3) is highly contagious and virulent in common carp (Cyprinus carpio). With the aim to develop molecular tools for breeding CyHV-3-resistant carp, we have amplified and sequenced 11 candidate genes for viral disease resistance including TLR2, TLR3, TLR4ba, TLR7, TLR9, TLR21, TLR22, MyD88, TRAF6, type I IFN and IL-1beta. For each gene, we initially cloned and sequenced PCR amplicons from 8 to 12 fish (2-3 fish per strain) from the SNP discovery panel. We then identified and evaluated putative SNPs for their polymorphisms in the SNP discovery panel and validated their usefulness for linkage analysis in a full-sib family using the SNaPshot method. Our sequencing results and phylogenetic analyses suggested that TLR3, TLR7 and MyD88 genes are duplicated in the common carp genome. We, therefore, developed locus-specific PCR primers and SNP genotyping assays for the duplicated loci. A total of 48 SNP markers were developed from PCR fragments of the 13 loci (7 single-locus and 3 duplicated genes). Thirty-nine markers were polymorphic with estimated minor allele frequencies of more than 0.1. The utility of the SNP markers was evaluated in one full-sib family and revealed that 20 markers from 9 loci segregated in a disomic and Mendelian pattern and would be useful for linkage analysis. Published by Elsevier Ltd.
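The abstract above filters markers on an estimated minor allele frequency (MAF) of more than 0.1. A minimal sketch of how such a value can be computed for one biallelic SNP is given below; the function name, the genotype encoding and the example calls are all hypothetical and are not data from the study.

```python
from collections import Counter


def minor_allele_frequency(genotypes):
    """Return the minor allele frequency of a biallelic SNP.

    genotypes: list of two-character diploid calls such as "AG";
    missing calls are given as None and are skipped.
    """
    alleles = Counter()
    for call in genotypes:
        if call is not None:
            alleles.update(call)  # counts each of the two alleles in the call
    if len(alleles) < 2:
        return 0.0  # monomorphic locus or no usable data
    return min(alleles.values()) / sum(alleles.values())


# Hypothetical calls for 10 fish at one SNP (illustrative only).
example_calls = ["AA", "AG", "AG", "AA", "GG", "AA", None, "AG", "AA", "AA"]
maf = minor_allele_frequency(example_calls)
passes_filter = maf > 0.1  # threshold quoted in the abstract

if __name__ == "__main__":
    print(f"MAF = {maf:.3f}, passes filter: {passes_filter}")
```

In this toy example the G allele occurs 5 times among 18 called alleles, so the marker would pass the 0.1 threshold.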
Report on the development of putative functional SSR and SNP markers in passion fruits.
da Costa, Zirlane Portugal; Munhoz, Carla de Freitas; Vieira, Maria Lucia Carneiro
2017-09-06
Passionflowers Passiflora edulis and Passiflora alata are diploid, outcrossing and understudied fruit bearing species. In Brazil, passion fruit cultivation began relatively recently and has earned the country an outstanding position as the world's top producer of passion fruit. The fruit's main economic value lies in the production of juice, an essential exotic ingredient in juice blends. Currently, crop improvement strategies, including those for underexploited tropical species, tend to incorporate molecular genetic approaches. In this study, we examined a set of P. edulis transcripts expressed in response to infection by Xanthomonas axonopodis (the passion fruit's main bacterial pathogen, which attacks the vines), aiming at the development of putative functional markers, i.e. SSRs (simple sequence repeats) and SNPs (single nucleotide polymorphisms). A total of 210 microsatellites were found in 998 sequences, and trinucleotide repeats were found to be the most frequent (31.4%). Of the sequences selected for designing primers, 80.9% could be used to develop SSR markers, and 60.6% SNP markers for P. alata. SNPs were all biallelic and found within 15 gene fragments of P. alata. Overall, gene fragments generated 10,003 bp. SNP frequency was estimated as one SNP every 294 bp. Polymorphism rates revealed by SSR and SNP loci were 29.4 and 53.6%, respectively. Passiflora edulis transcripts were useful for the development of putative functional markers for P. alata, suggesting a certain level of sequence conservation between these cultivated species. The markers developed herein could be used for genetic mapping purposes and also in diversity studies.
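The abstract above reports the total sequenced length (10,003 bp) and the SNP frequency (one SNP every 294 bp) but not the SNP count itself. The short sketch below derives the implied count from those two figures; the result is an inference from the abstract, not a number stated by the authors, and the variable names are illustrative.

```python
# Figures taken from the abstract.
total_bp = 10_003   # combined length of the sequenced gene fragments
bp_per_snp = 294    # reported frequency: one SNP every 294 bp

# Implied number of SNPs (an inference, not a value stated in the abstract).
implied_snp_count = round(total_bp / bp_per_snp)

# The same frequency expressed as SNPs per kilobase.
snps_per_kb = 1000 / bp_per_snp

if __name__ == "__main__":
    print(f"Implied SNP count: {implied_snp_count}")
    print(f"SNPs per kb: {snps_per_kb:.2f}")
```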
Huang, Zhen; Peng, Gary; Liu, Xunjia; Deora, Abhinandan; Falk, Kevin C.; Gossen, Bruce D.; McDonald, Mary R.; Yu, Fengqun
2017-01-01
Clubroot, caused by Plasmodiophora brassicae, is an important disease of canola (Brassica napus) in western Canada and worldwide. In this study, a clubroot resistance gene (Rcr2) was identified and fine mapped in Chinese cabbage cv. “Jazz” using single-nucleotide polymorphism (SNP) markers identified from bulked segregant RNA sequencing (BSR-Seq), and molecular markers were developed for use in marker assisted selection. In total, 203.9 million raw reads were generated from one pooled resistant (R) and one pooled susceptible (S) sample, and >173,000 polymorphic SNP sites were identified between the R and S samples. One significant peak was observed between 22 and 26 Mb of chromosome A03, which had been predicted by BSR-Seq to contain the causal gene Rcr2. There were 490 polymorphic SNP sites identified in the region. A segregating population consisting of 675 plants was analyzed with 15 SNP sites in the region using the Kompetitive Allele Specific PCR method, and Rcr2 was fine mapped between two SNP markers, SNP_A03_32 and SNP_A03_67, located 0.1 and 0.3 cM from Rcr2, respectively. Five SNP markers co-segregated with Rcr2 in this region. Variants were identified in 14 of 36 genes annotated in the Rcr2 target region. The numbers of polymorphic variants differed among the genes. Four genes encode TIR-NBS-LRR proteins, and two of them, Bra019410 and Bra019413, had high numbers of polymorphic variants and so are the most likely candidates for Rcr2. PMID:28894454
Jo, Jinkwan; Purushotham, Preethi M.; Han, Koeun; Lee, Heung-Ryul; Nah, Gyoungju; Kang, Byoung-Cheorl
2017-01-01
Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs. PMID:28959273
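The abstract above keeps only SNPs that satisfy segregation ratio criteria in the F2 population but does not name the test used. A chi-square goodness-of-fit test against the expected 1:2:1 ratio is a common choice and is assumed in the sketch below; the function names and the genotype counts are hypothetical.

```python
# Critical chi-square value for 2 degrees of freedom at alpha = 0.05.
CRITICAL_0_05_DF2 = 5.991


def chi_square_f2(n_aa: int, n_ab: int, n_bb: int) -> float:
    """Chi-square statistic for F2 genotype counts against a 1:2:1 ratio."""
    observed = [n_aa, n_ab, n_bb]
    total = sum(observed)
    expected = [0.25 * total, 0.5 * total, 0.25 * total]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def fits_mendelian(n_aa: int, n_ab: int, n_bb: int) -> bool:
    """True if the counts do not deviate significantly from 1:2:1."""
    return chi_square_f2(n_aa, n_ab, n_bb) <= CRITICAL_0_05_DF2


if __name__ == "__main__":
    # Hypothetical counts for two markers in an F2 population of 96 plants.
    for counts in [(24, 52, 20), (40, 46, 10)]:
        stat = chi_square_f2(*counts)
        print(counts, f"chi2 = {stat:.3f}", "keep" if fits_mendelian(*counts) else "drop")
```

Markers showing strong segregation distortion, like the second hypothetical marker, would be dropped before map construction under this assumed criterion.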
Wang, Boyi; Tan, Hua-Wei; Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Matsumoto, Tracie; Zhang, Dapeng
2015-01-01
Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in 50 longan germplasm accessions, including cultivated varieties and wild germplasm; and designated 25 SNP markers that unambiguously identified all tested longan varieties with high statistical rigor (P<0.0001). Multiple trees from the same clone were verified and off-type trees were identified. Diversity analysis revealed genetic relationships among analyzed accessions. Cultivated varieties differed significantly from wild populations (Fst=0.300; P<0.001), demonstrating untapped genetic diversity for germplasm conservation and utilization. Within cultivated varieties, apparent differences between varieties from China and those from Thailand and Hawaii indicated geographic patterns of genetic differentiation. These SNP markers provide a powerful tool to manage longan genetic resources and breeding, with accurate and efficient genotype identification. PMID:26504559
SNP Discovery and Linkage Map Construction in Cultivated Tomato
Shirasawa, Kenta; Isobe, Sachiko; Hirakawa, Hideki; Asamizu, Erika; Fukuoka, Hiroyuki; Just, Daniel; Rothan, Christophe; Sasamoto, Shigemi; Fujishiro, Tsunakazu; Kishida, Yoshie; Kohara, Mitsuyo; Tsuruoka, Hisano; Wada, Tsuyuko; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi
2010-01-01
Few intraspecific genetic linkage maps have been reported for cultivated tomato, mainly because genetic diversity within Solanum lycopersicum is much less than that between tomato species. Single nucleotide polymorphisms (SNPs), the most abundant source of genomic variation, are the most promising source of polymorphisms for the construction of linkage maps for closely related intraspecific lines. In this study, we developed SNP markers based on expressed sequence tags for the construction of intraspecific linkage maps in tomato. Out of the 5607 SNP positions detected through in silico analysis, 1536 were selected for high-throughput genotyping of two mapping populations derived from crosses between ‘Micro-Tom’ and either ‘Ailsa Craig’ or ‘M82’. A total of 1137 markers, including 793 out of the 1338 successfully genotyped SNPs, along with 344 simple sequence repeat and intronic polymorphism markers, were mapped onto two linkage maps, which covered 1467.8 and 1422.7 cM, respectively. The SNP markers developed were then screened against cultivated tomato lines in order to estimate the transferability of these SNPs to other breeding materials. The molecular markers and linkage maps represent a milestone in the genomics and genetics, and are the first step toward molecular breeding of cultivated tomato. Information on the DNA markers, linkage maps, and SNP genotypes for these tomato lines is available at http://www.kazusa.or.jp/tomato/. PMID:21044984
Telfer, Emily J; Stovold, Grahame T; Li, Yongjun; Silva-Junior, Orzenil B; Grattapaglia, Dario G; Dungey, Heidi S
2015-01-01
Pedigree reconstruction using molecular markers enables efficient management of inbreeding in open-pollinated breeding strategies, replacing expensive and time-consuming controlled pollination. This is particularly useful in preferentially outcrossed, insect pollinated Eucalypts known to suffer considerable inbreeding depression from related matings. A single nucleotide polymorphism (SNP) marker panel consisting of 106 markers was selected for pedigree reconstruction from the recently developed high-density Eucalyptus Infinium SNP chip (EuCHIP60K). The performance of this SNP panel for pedigree reconstruction in open-pollinated progenies of two Eucalyptus nitens seed orchards was compared with that of two microsatellite panels with 13 and 16 markers respectively. The SNP marker panel out-performed one of the microsatellite panels in the resolution power to reconstruct pedigrees and out-performed both panels with respect to data quality. Parentage of all but one offspring in each clonal seed orchard was correctly matched to the expected seed parent using the SNP marker panel, whereas parentage assignment to less than a third of the expected seed parents was supported using the 13-microsatellite panel. The 16-microsatellite panel supported all but one of the recorded seed parents, one better than the SNP panel, although there was still a considerable level of missing and inconsistent data. SNP marker data were considerably superior to microsatellite data in accuracy, reproducibility and robustness. Although microsatellite and SNP data provide equivalent resolution for pedigree reconstruction, microsatellite analysis requires more time and experience to deal with the uncertainties of allele calling and faces challenges for data transferability across labs and over time.
While microsatellite analysis will continue to be useful for some breeding tasks due to the high information content, existing infrastructure and low operating costs, the multi-species SNP resource available with the EuCHIP60K opens a whole new array of opportunities for high-throughput, genome-wide or targeted genotyping in species of Eucalyptus.
An improved consensus linkage map of barley based on flow-sorted chromosomes and SNP markers
USDA-ARS's Scientific Manuscript database
Recent advances in high-throughput genotyping have made it easier to combine information from different mapping populations into consensus genetic maps, which provide increased marker density and genome coverage compared to individual maps. Previously, a SNP-based genotyping platform was developed a...
Genome-wide SNP identification and QTL mapping for black rot resistance in cabbage.
Lee, Jonghoon; Izzah, Nur Kholilatul; Jayakodi, Murukarthick; Perumal, Sampath; Joh, Ho Jun; Lee, Hyeon Ju; Lee, Sang-Choon; Park, Jee Young; Yang, Ki-Woung; Nou, Il-Sup; Seo, Joodeok; Yoo, Jaeheung; Suh, Youngdeok; Ahn, Kyounggu; Lee, Ji Hyun; Choi, Gyung Ja; Yu, Yeisoo; Kim, Heebal; Yang, Tae-Jin
2015-02-03
Black rot is a destructive bacterial disease causing large yield and quality losses in Brassica oleracea. To detect quantitative trait loci (QTL) for black rot resistance, we performed whole-genome resequencing of two cabbage parental lines and genome-wide SNP identification using the recently published B. oleracea genome sequences as reference. Approximately 11.5 Gb of sequencing data was produced from each parental line. Reference genome-guided mapping and SNP calling revealed 674,521 SNPs between the two cabbage lines, with an average of one SNP per 662.5 bp. Among 167 dCAPS markers derived from candidate SNPs, 117 (70.1%) were validated as bona fide SNPs showing polymorphism between the parental lines. We then improved the resolution of a previous genetic map by adding 103 markers including 87 SNP-based dCAPS markers. The new map is composed of 368 markers and covers 1467.3 cM with an average interval of 3.88 cM between adjacent markers. We evaluated black rot resistance in the mapping population in three independent inoculation tests using F2:3 progenies and identified one major QTL and three minor QTLs. We report successful utilization of whole-genome resequencing for large-scale SNP identification and development of molecular markers for genetic map construction. In addition, we identified novel QTLs for black rot resistance. The high-density genetic map will promote QTL analysis for other important agricultural traits and marker-assisted breeding of B. oleracea.
Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.
2015-01-01
High-throughput genotyping arrays provide a standardized resource for plant breeding communities that is useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal ex Seemann, G. mustelinum Miers ex Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569
Yi, Liuxi; Gao, Fengyun; Siqin, Bateer; Zhou, Yu; Li, Qiang; Zhao, Xiaoqing; Jia, Xiaoyun; Zhang, Hui
2017-01-01
Flax is an important crop for oil and fiber; however, no high-density genetic maps have been reported for this species. Specific length amplified fragment sequencing (SLAF-seq) is a high-resolution strategy for large scale de novo discovery and genotyping of single nucleotide polymorphisms. In this study, SLAF-seq was employed to develop SNP markers in an F2 population to construct a high-density genetic map for flax. In total, 196.29 million paired-end reads were obtained. The average sequencing depth was 25.08 in the male parent, 32.17 in the female parent, and 9.64 in each F2 progeny. In total, 389,288 polymorphic SLAFs were detected, from which 260,380 polymorphic SNPs were developed. After filtering, 4,638 SNPs were found suitable for genetic map construction. The final genetic map included 4,145 SNP markers on 15 linkage groups and was 2,632.94 cM in length, with an average distance of 0.64 cM between adjacent markers. To our knowledge, this map is the densest SNP-based genetic map for flax. The SNP markers and genetic map reported here will serve as a foundation for the fine mapping of quantitative trait loci (QTLs), map-based gene cloning and marker assisted selection (MAS) for flax.
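The average marker distance quoted in the flax abstract above can be reproduced from the reported map length and marker count. The sketch below shows two common conventions; the abstract does not say which one the authors used, and the function names are illustrative.

```python
def spacing_per_marker(map_length_cm: float, n_markers: int) -> float:
    """Map length divided by the number of markers (simple convention)."""
    if n_markers <= 0:
        raise ValueError("n_markers must be positive")
    return map_length_cm / n_markers


def spacing_per_interval(map_length_cm: float, n_markers: int, n_groups: int) -> float:
    """Map length divided by the number of intervals between adjacent markers.

    Each linkage group with k markers has k - 1 intervals, so the total
    number of intervals is n_markers - n_groups.
    """
    n_intervals = n_markers - n_groups
    if n_intervals <= 0:
        raise ValueError("need more markers than linkage groups")
    return map_length_cm / n_intervals


# Figures taken from the abstract.
MAP_LENGTH_CM = 2632.94
MAPPED_SNPS = 4145
LINKAGE_GROUPS = 15
SNPS_AFTER_FILTERING = 4638

flax_spacing = spacing_per_marker(MAP_LENGTH_CM, MAPPED_SNPS)
flax_interval = spacing_per_interval(MAP_LENGTH_CM, MAPPED_SNPS, LINKAGE_GROUPS)
retained_fraction = MAPPED_SNPS / SNPS_AFTER_FILTERING

if __name__ == "__main__":
    print(f"cM per marker:   {flax_spacing:.2f}")
    print(f"cM per interval: {flax_interval:.2f}")
    print(f"Filtered SNPs placed on the map: {100 * retained_fraction:.1f}%")
```

Both conventions round to the published 0.64 cM for this map, so the choice does not affect the reported figure here.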
A genetic map and germplasm diversity estimation of Mangifera indica (mango) with SNPs
USDA-ARS's Scientific Manuscript database
Mango (Mangifera indica) is often referred to as the “King of Fruits”. As the first steps in developing a mango genomics project, we genotyped 582 individuals comprising six mapping populations with 1054 SNP markers. The resulting consensus map had 20 linkage groups defined by 726 SNP markers with...
Genome wide association study (GWAS) for grain yield in rice cultivated under water deficit.
Pantalião, Gabriel Feresin; Narciso, Marcelo; Guimarães, Cléber; Castro, Adriano; Colombari, José Manoel; Breseghello, Flavio; Rodrigues, Luana; Vianello, Rosana Pereira; Borba, Tereza Oliveira; Brondani, Claudio
2016-12-01
The identification of rice drought tolerant materials is crucial for the development of best performing cultivars for the upland cultivation system. This study aimed to identify markers and candidate genes associated with drought tolerance by Genome Wide Association Study analysis, in order to develop tools for use in rice breeding programs. This analysis was made with 175 upland rice accessions (Oryza sativa), evaluated in experiments with and without water restriction, and 150,325 SNPs. Thirteen SNP markers associated with yield under drought conditions were identified. Through stepwise regression analysis, eight SNP markers were selected and validated in silico, and when tested by PCR, two out of the eight SNP markers were able to identify a group of rice genotypes with higher productivity under drought. These results are encouraging for deriving markers for the routine analysis of marker assisted selection. From the drought experiment, including the genes inherited in linkage blocks, 50 genes were identified, from which 30 were annotated, and 10 were previously related to drought and/or abiotic stress tolerance, such as the transcription factors WRKY and Apetala2, and protein kinases.
A set of 14 DIP-SNP markers to detect unbalanced DNA mixtures.
Liu, Zhizhen; Liu, Jinding; Wang, Jiaqi; Chen, Deqing; Liu, Zidong; Shi, Jie; Li, Zeqin; Li, Wenyan; Zhang, Gengqian; Du, Bing
2018-03-04
Unbalanced DNA mixtures remain a difficult problem in forensic practice. DIP-STRs are useful markers for detecting minor DNA, but they are not widespread in the human genome and have long amplicons. In this study, we proposed a novel type of genetic marker, termed DIP-SNP. DIP-SNP refers to the combination of an INDEL and a SNP within less than 300 bp of the human genome. Multiplex PCR and SNaPshot assays were established for 14 DIP-SNP markers in a Chinese Han population from Shanxi, China. This novel compound marker allows detection of the minor DNA contributor with sensitivity from 1:50 to 1:1000 in a DNA mixture of any gender with 1 ng-10 ng DNA template. Most of the DIP-SNP markers had a relatively high probability of informative alleles, with an average I value of 0.33. In all, we proposed DIP-SNP as a novel kind of genetic marker for detecting the minor contributor in an unbalanced DNA mixture and established the detection method by combining multiplex PCR and the SNaPshot assay. DIP-SNP polymorphisms are promising markers for forensic or clinical mixture examination because they are shorter, widespread and more sensitive.
Genetic diversity and trait genomic prediction in a pea diversity panel.
Burstin, Judith; Salloignon, Pauline; Chabert-Martinello, Marianne; Magnin-Robert, Jean-Bernard; Siol, Mathieu; Jacquin, Françoise; Chauveau, Aurélie; Pont, Caroline; Aubert, Grégoire; Delaitre, Catherine; Truntzer, Caroline; Duc, Gérard
2015-02-21
Pea (Pisum sativum L.), a major pulse crop grown for its protein-rich seeds, is an important component of agroecological cropping systems in diverse regions of the world. New breeding challenges imposed by global climate change and new regulations urge pea breeders to undertake more efficient methods of selection and better take advantage of the large genetic diversity present in the Pisum sativum genepool. Diversity studies conducted so far in pea used Simple Sequence Repeat (SSR) and Retrotransposon Based Insertion Polymorphism (RBIP) markers. Recently, SNP marker panels have been developed that will be useful for genetic diversity assessment and marker-assisted selection. A collection of diverse pea accessions, including landraces and cultivars of garden, field or fodder peas as well as wild peas, was characterised at the molecular level using newly developed SNP markers, as well as SSR markers and RBIP markers. The three types of markers were used to describe the structure of the collection and revealed different pictures of the genetic diversity among the collection. SSR showed the fastest rate of evolution and RBIP the slowest rate of evolution, pointing to their contrasted mode of evolution. SNP markers were then used to predict phenotypes recorded for the collection: the date of flowering (BegFlo), the number of seeds per plant (Nseed) and thousand seed weight (TSW). Different statistical methods were tested, including the LASSO (Least Absolute Shrinkage and Selection Operator), PLS (Partial Least Squares), SPLS (Sparse Partial Least Squares), Bayes A, Bayes B and GBLUP (Genomic Best Linear Unbiased Prediction) methods, and the structure of the collection was taken into account in the prediction. Despite a limited number of 331 markers used for prediction, TSW was reliably predicted. The development of marker assisted selection has not reached its full potential in pea until now.
This paper shows that the high-throughput SNP arrays that are being developed will most probably allow for a more efficient selection in this species.
Amish, Stephen J.; Hohenlohe, Paul A.; Painter, Sally; Leary, Robb F.; Muhlfeld, Clint C.; Allendorf, Fred W.; Luikart, Gordon
2012-01-01
Hybridization with introduced rainbow trout threatens most native westslope cutthroat trout populations. Understanding the genetic effects of hybridization and introgression requires a large set of high-throughput, diagnostic genetic markers to inform conservation and management. Recently, we identified several thousand candidate single-nucleotide polymorphism (SNP) markers based on RAD sequencing of 11 westslope cutthroat trout and 13 rainbow trout individuals. Here, we used flanking sequence for 56 of these candidate SNP markers to design high-throughput genotyping assays. We validated the assays on a total of 92 individuals from 22 populations and seven hatchery strains. Forty-six assays (82%) amplified consistently and allowed easy identification of westslope cutthroat and rainbow trout alleles as well as heterozygote controls. The 46 SNPs will provide high power for early detection of population admixture and improved identification of hybrid and nonhybridized individuals. This technique shows promise as a very low-cost, reliable and relatively rapid method for developing and testing SNP markers for nonmodel organisms with limited genomic resources.
Ma, Chun-Lei; Jin, Ji-Qiang; Li, Chun-Fang; Wang, Rong-Kai; Zheng, Hong-Kun; Yao, Ming-Zhe; Chen, Liang
2015-01-01
Genetic maps are important tools in plant genomics and breeding. The present study reports the large-scale discovery of single nucleotide polymorphisms (SNPs) for genetic map construction in tea plant. We developed a total of 6,042 valid SNP markers using specific-locus amplified fragment sequencing (SLAF-seq), and subsequently mapped them into the previous framework map. The final map contained 6,448 molecular markers, distributing on fifteen linkage groups corresponding to the number of tea plant chromosomes. The total map length was 3,965 cM, with an average inter-locus distance of 1.0 cM. This map is the first SNP-based reference map of tea plant, as well as the most saturated one developed to date. The SNP markers and map resources generated in this study provide a wealth of genetic information that can serve as a foundation for downstream genetic analyses, such as the fine mapping of quantitative trait loci (QTL), map-based cloning, marker-assisted selection, and anchoring of scaffolds to facilitate the process of whole genome sequencing projects for tea plant. PMID:26035838
RAD tag sequencing as a source of SNP markers in Cynara cardunculus L
2012-01-01
Background The globe artichoke (Cynara cardunculus L. var. scolymus) genome is relatively poorly explored, especially compared to those of the other major Asteraceae crops sunflower and lettuce. No SNP markers are in the public domain. We have combined the recently developed restriction-site associated DNA (RAD) approach with the Illumina DNA sequencing platform to effect the rapid and mass discovery of SNP markers for C. cardunculus. Results RAD tags were sequenced from the genomic DNA of three C. cardunculus mapping population parents, generating 9.7 million reads, corresponding to ~1 Gbp of sequence. An assembly based on paired ends produced ~6.0 Mbp of genomic sequence, separated into ~19,000 contigs (mean length 312 bp), of which ~21% were fragments of putative coding sequence. The shared sequences allowed for the discovery of ~34,000 SNPs and nearly 800 indels, equivalent to a SNP frequency of 5.6 per 1,000 nt, and an indel frequency of 0.2 per 1,000 nt. A sample of heterozygous SNP loci was mapped by CAPS assays and this exercise provided validation of our mining criteria. The repetitive fraction of the genome had a high representation of retrotransposon sequence, followed by simple repeats, AT-low complexity regions and mobile DNA elements. The genomic k-mer distribution and CpG rate of C. cardunculus, compared with data derived from three whole genome-sequenced dicot species, provided further evidence of the random representation of the C. cardunculus genome generated by RAD sampling. Conclusion The RAD tag sequencing approach is a cost-effective and rapid method to develop SNP markers in a highly heterozygous species. Our approach generated a large and robust SNP dataset through the adoption of optimized filtering criteria. PMID:22214349
Suzuki, Hideaki; Yu, Jiwen; Wang, Fei; Zhang, Jinfa
2013-06-01
Cytoplasmic male sterility (CMS), which is a maternally inherited trait and controlled by novel chimeric genes in the mitochondrial genome, plays a pivotal role in the production of hybrid seed. In cotton, no PCR-based marker has been developed to discriminate CMS-D8 (from Gossypium trilobum) from its normal Upland cotton (AD1, Gossypium hirsutum) cytoplasm. The objective of the current study was to develop PCR-based single nucleotide polymorphic (SNP) markers from mitochondrial genes for the CMS-D8 cytoplasm. DNA sequence variation in mitochondrial genes involved in the oxidative phosphorylation chain including ATP synthase subunit 1, 4, 6, 8 and 9, and cytochrome c oxidase 1, 2 and 3 subunits were identified by comparing CMS-D8, its isogenic maintainer and restorer lines on the same nuclear genetic background. An allelic specific PCR (AS-PCR) was utilized for SNP typing by incorporating artificial mismatched nucleotides into the third or fourth base from the 3' terminus in both the specific and nonspecific primers. The result indicated that the method modifying allele-specific primers was successful in obtaining eight SNP markers out of eight SNPs using eight primer pairs to discriminate two alleles between AD1 and CMS-D8 cytoplasms. Two of the SNPs for atp1 and cox1 could also be used in combination to discriminate between CMS-D8 and CMS-D2 cytoplasms. Additionally, a PCR-based marker from a nine nucleotide insertion-deletion (InDel) sequence (AATTGTTTT) at the 59-67 bp positions from the start codon of atp6, which is present in the CMS and restorer lines with the D8 cytoplasm but absent in the maintainer line with the AD1 cytoplasm, was also developed. A SNP marker for two nucleotide substitutions (AA in AD1 cytoplasm to CT in CMS-D8 cytoplasm) in the intron (1,506 bp) of cox2 gene was also developed. 
These PCR-based SNP markers should be useful in discriminating CMS-D8 and AD1 cytoplasms, or those with CMS-D2 cytoplasm as a rapid, simple, inexpensive, and reliable genotyping tool to assist hybrid cotton breeding.
Henning, John A; Coggins, Jamie; Peterson, Matthew
2015-10-06
Hop is an economically important crop for the Pacific Northwest USA as well as other regions of the world. It is a perennial crop with a rhizomatous or clonal propagation system for varietal distribution. A big concern for growers as well as brewers is variety purity, and questions are regularly posed to public agencies concerning the availability of genotype testing. Current means for genotyping are based upon 25 microsatellites that provide relatively accurate genotyping but cannot always differentiate sister-lines. In addition, numerous PCR runs (25) are required to complete this process and only a few laboratories exist that perform this service. A genotyping protocol based upon SNPs would enable rapid, accurate genotyping that can be assayed at any laboratory facility set up for SNP-based genotyping. The results of this study arose from a larger project designed for whole genome association studies upon the USDA-ARS hop germplasm collection consisting of approximately 116 distinct hop varieties and germplasm (female lines) from around the world. The original dataset that arose from partial sequencing of 121 genotypes resulted in the identification of 374,829 SNPs using the TASSEL-UNEAK pipeline. After filtering out genotypes with more than 50% missing data (5 genotypes) and SNP markers with more than 20% missing data, 32,206 highly filtered SNP markers across 116 genotypes were identified and considered for this study. Minor allele frequency (MAF) was calculated for each SNP and ranked according to the most informative to least informative. Only those markers without missing data across genotypes as well as 60% or less heterozygous gamete calls were considered for further analysis. Genetic distances among individuals in the study were calculated using the marker with the highest MAF value, then by using a combination of the two markers with highest MAF values and so on.
This process was reiterated until a set of markers was identified that allowed for all genotypes in the study to be genetically differentiated from each other. Next, we compared genetic matrices calculated from the minimal marker sets (Table 2; 6-, 7-, 8-, 10- and 12-marker set matrices) with a matrix calculated from a set of markers with no missing data across all 116 samples (1006 SNP markers). The minimum number of markers required to meet both specifications was a set of 7 markers (Table 3). These seven SNPs were then aligned with a genome assembly, and DNA sequence both upstream and downstream was used to identify primer sequences that can be used to develop seven amplicons for high resolution melting curve PCR detection or other SNP-based PCR detection methods. This study identifies a set of 7 SNP markers that may prove useful for the identification and validation of hop varieties and accessions. Variety validation of unknown samples assumes that the variety under question has been included a priori in a discovery panel. These results are based upon in silico studies, and the markers need to be validated using different SNP marker technology upon a differential set of hop genotypes. The marker sequence data and suggested primer sets provide potential means to fingerprint hop varieties in most genetic laboratories utilizing SNP-marker technology.
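The incremental selection described in this abstract (rank SNPs by MAF, then add markers one at a time until every genotype is distinguishable) can be sketched as below. This is an illustrative simplification, not the authors' pipeline. It assumes biallelic calls coded as alternate-allele counts (0, 1, 2) with no missing data, and it uses unique multi-marker profiles as a stand-in for non-zero pairwise genetic distance. All names and the toy data are hypothetical.

```python
from typing import Dict, List, Sequence


def minor_allele_frequency(calls: Sequence[int]) -> float:
    """MAF for one biallelic SNP; calls are alternate-allele counts (0, 1, 2)."""
    p = sum(calls) / (2 * len(calls))
    return min(p, 1 - p)


def minimal_marker_set(genotypes: Dict[str, List[int]]) -> List[str]:
    """Add markers in order of decreasing MAF until all samples differ.

    genotypes maps marker name -> calls per sample, in the same sample order.
    """
    ranked = sorted(
        genotypes,
        key=lambda m: minor_allele_frequency(genotypes[m]),
        reverse=True,
    )
    n_samples = len(next(iter(genotypes.values())))
    chosen: List[str] = []
    for marker in ranked:
        chosen.append(marker)
        # A sample's profile is its tuple of calls at the chosen markers.
        profiles = {tuple(genotypes[m][i] for m in chosen) for i in range(n_samples)}
        if len(profiles) == n_samples:
            return chosen
    raise ValueError("the supplied markers cannot separate all samples")


# Toy data: four samples, three SNPs (made up for illustration).
toy = {
    "snpC": [0, 0, 0, 2],  # MAF 0.25
    "snpA": [0, 0, 2, 2],  # MAF 0.50
    "snpB": [0, 2, 0, 1],  # MAF 0.375
}
selected = minimal_marker_set(toy)
print(selected)
```

In the toy data the highest-MAF marker alone splits the samples into two groups, and adding the second-ranked marker makes all four profiles unique, so the third marker is never needed.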
Diversity analysis of cotton (Gossypium hirsutum L.) germplasm using the CottonSNP63K Array.
Hinze, Lori L; Hulse-Kemp, Amanda M; Wilson, Iain W; Zhu, Qian-Hao; Llewellyn, Danny J; Taylor, Jen M; Spriggs, Andrew; Fang, David D; Ulloa, Mauricio; Burke, John J; Giband, Marc; Lacape, Jean-Marc; Van Deynze, Allen; Udall, Joshua A; Scheffler, Jodi A; Hague, Steve; Wendel, Jonathan F; Pepper, Alan E; Frelichowski, James; Lawley, Cindy T; Jones, Don C; Percy, Richard G; Stelly, David M
2017-02-03
Cotton germplasm resources contain beneficial alleles that can be exploited to develop germplasm adapted to emerging environmental and climate conditions. Accessions and lines have traditionally been characterized based on phenotypes, but phenotypic profiles are limited by the cost, time, and space required to make visual observations and measurements. With advances in molecular genetic methods, genotypic profiles are increasingly able to identify differences among accessions due to the larger number of genetic markers that can be measured. A combination of both methods would greatly enhance our ability to characterize germplasm resources. Recent efforts have culminated in the identification of sufficient SNP markers to establish high-throughput genotyping systems, such as the CottonSNP63K array, which enables a researcher to efficiently analyze large numbers of SNP markers and obtain highly repeatable results. In the current investigation, we have utilized the SNP array for analyzing genetic diversity primarily among cotton cultivars, making comparisons to SSR-based phylogenetic analyses, and identifying loci associated with seed nutritional traits. The SNP markers distinctly separated G. hirsutum from other Gossypium species and distinguished the wild from cultivated types of G. hirsutum. The markers also efficiently discerned differences among cultivars, which was the primary goal when designing the CottonSNP63K array. Population structure within the genus compared favorably with previous results obtained using SSR markers, and an association study identified loci linked to factors that affect cottonseed protein content. Our results provide a large genome-wide variation data set for primarily cultivated cotton. Thousands of SNPs in representative cotton genotypes provide an opportunity to finely discriminate among cultivated cotton from around the world. 
The SNPs will be relevant as dense markers of genome variation for association mapping approaches aimed at correlating molecular polymorphisms with variation in phenotypic traits, as well as for molecular breeding approaches in cotton.
A Coordinated Approach to Peach SNP Discovery in RosBREED
USDA-ARS's Scientific Manuscript database
In the USDA-funded multi-institutional and trans-disciplinary project, “RosBREED”, crop-specific SNP genome scan platforms are being developed for peach, apple, strawberry, and cherry at a resolution of at least one polymorphic SNP marker every 5 cM in any random cross, for use in Pedigree-Based Ana...
Genome-wide association analysis of seedling root development in maize (Zea mays L.).
Pace, Jordon; Gardner, Candice; Romay, Cinta; Ganapathysubramanian, Baskar; Lübberstedt, Thomas
2015-02-05
Plants rely on the root system for anchorage to the ground and the acquisition and absorption of nutrients critical to sustaining productivity. A genome wide association analysis enables one to analyze allelic diversity of complex traits and identify superior alleles. A total of 384 inbred lines from the Ames panel were genotyped with 681,257 single nucleotide polymorphism markers using Genotyping-by-Sequencing technology, and 22 seedling root architecture traits were phenotyped. Utilizing both a general linear model and a mixed linear model, a GWAS was conducted, identifying 268 marker trait associations (p ≤ 5.3×10^-7). Analysis of significant SNP markers for multiple traits showed that several were located within gene models, with some SNP markers localized within regions of previously identified root quantitative trait loci. Gene model GRMZM2G153722, located on chromosome 4, contained nine significant markers. This predicted gene is expressed in roots and shoots. This study identifies SNP markers putatively associated with root traits at the seedling stage. Some SNPs were located within or near (<1 kb) gene models. These gene models identify possible candidate genes involved in root development at the seedling stage. These and respective linked or functional markers could be targets for breeders for marker assisted selection of seedling root traits.
Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma JJ; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco CAM; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric
2016-01-01
Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 is reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species. PMID:27917289
Scalabrin, Simone; Gilmore, Barbara; Lawley, Cynthia T.; Gasic, Ksenija; Micheletti, Diego; Rosyara, Umesh R.; Cattonaro, Federica; Vendramin, Elisa; Main, Dorrie; Aramini, Valeria; Blas, Andrea L.; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Troggio, Michela; Sosinski, Bryon; Aranzana, Maria José; Arús, Pere; Iezzoni, Amy; Morgante, Michele; Peace, Cameron
2012-01-01
Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs. The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species. PMID:22536421
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.
Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul
2012-11-20
Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, hardly any genetic studies have been performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. We developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of the lily and 39% of the tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice as abundant in UTRs (UnTranslated Regions) as in coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2- to 3-fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs, lily and tulip (12,017 groups), and among the three monocot species, lily, tulip, and rice (6,900 groups), were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSRs. The lily and tulip contigs generated were annotated and described according to Gene Ontology terminology.
Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies.
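The two SNP sets described above differ only in how many neighbouring SNPs are tolerated within 50 bp of the target. A minimal sketch of such a filter for a single contig follows. It assumes unique integer SNP positions and ignores contig ends; the function name and toy positions are hypothetical, not taken from the study.

```python
from bisect import bisect_left, bisect_right
from typing import List


def select_targets(positions: List[int], flank: int = 50, max_flanking: int = 0) -> List[int]:
    """Keep SNPs with at most max_flanking other SNPs within flank bp on either side."""
    ordered = sorted(positions)
    kept = []
    for pos in ordered:
        lo = bisect_left(ordered, pos - flank)
        hi = bisect_right(ordered, pos + flank)
        neighbours = hi - lo - 1  # subtract the target SNP itself
        if neighbours <= max_flanking:
            kept.append(pos)
    return kept


# Toy SNP positions on one contig (made up for illustration).
snps = [100, 120, 300, 500, 540, 560]
strict_set = select_targets(snps, flank=50, max_flanking=0)   # first set: clean flanks
relaxed_set = select_targets(snps, flank=50, max_flanking=1)  # second set: one flanking SNP
print(strict_set, relaxed_set)
```

A production filter would also drop targets lying closer than the flank length to either contig end, since the assay needs the full flanking sequence for primer or probe design.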
Wen, Weie; He, Zhonghu; Gao, Fengmei; Liu, Jindong; Jin, Hui; Zhai, Shengnan; Qu, Yanying; Xia, Xianchun
2017-01-01
A high-density consensus map is a powerful tool for gene mapping, cloning and molecular marker-assisted selection in wheat breeding. The objective of this study was to construct a high-density, single nucleotide polymorphism (SNP)-based consensus map of common wheat (Triticum aestivum L.) by integrating genetic maps from four recombinant inbred line populations. The populations were each genotyped using the wheat 90K Infinium iSelect SNP assay. A total of 29,692 SNP markers were mapped on 21 linkage groups corresponding to 21 hexaploid wheat chromosomes, covering 2,906.86 cM, with an overall marker density of 10.21 markers/cM. Comparison with previous maps based on the wheat 90K SNP chip detected 22,736 (76.6%) SNPs with consistent chromosomal locations, whereas 1,974 (6.7%) showed different chromosomal locations, and 4,982 (16.8%) were newly mapped. Alignment of the present consensus map and the wheat expressed sequence tags (ESTs) Chromosome Bin Map enabled assignment of 1,221 SNP markers to specific chromosome bins, and 819 ESTs were integrated into the consensus map. The marker orders of the consensus map were validated based on physical positions on the wheat genome, with Spearman rank correlation coefficients ranging from 0.69 (4D) to 0.97 (1A, 4B, 5B, and 6A), and were also confirmed by comparison with genetic positions on the previous 40K SNP consensus map, with Spearman rank correlation coefficients ranging from 0.84 (6D) to 0.99 (6A). Chromosomal rearrangements reported previously were confirmed in the present consensus map and new putative rearrangements were identified. In addition, an integrated consensus map was developed through the combination of five published maps with ours, containing 52,607 molecular markers. The consensus map described here provides a high-density SNP marker map and a reliable order of SNPs, representing a step forward in mapping and validation of chromosomal locations of SNPs on the wheat 90K array.
Moreover, it can be used as a reference for quantitative trait loci (QTL) mapping to facilitate exploitation of genes and QTL in wheat breeding. PMID:28848588
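Marker order is validated above with Spearman rank correlation between genetic and physical positions. The following sketch computes that coefficient for one chromosome using only the standard library. The five marker positions are invented for illustration, and the helper names are hypothetical; in practice a library routine such as `scipy.stats.spearmanr` would normally be used instead.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the mean of the ranks they span."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation, computed as the Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    vx = sum((a - mx) ** 2 for a in rx)
    vy = sum((b - my) ** 2 for b in ry)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / (vx * vy) ** 0.5


# Hypothetical markers on one chromosome: genetic (cM) versus physical (Mb) position.
genetic_cm = [0.0, 1.2, 3.5, 4.1, 7.8]
physical_mb = [0.5, 2.0, 9.0, 6.5, 15.0]
rho = spearman(genetic_cm, physical_mb)
print(round(rho, 2))
```

In this toy example a single swapped pair of adjacent markers lowers the coefficient from 1.0 to 0.9, which gives a feel for what values such as 0.69 or 0.97 imply about local order disagreements.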
The Development of Quality Control Genotyping Approaches: A Case Study Using Elite Maize Lines.
Chen, Jiafa; Zavala, Cristian; Ortega, Noemi; Petroli, Cesar; Franco, Jorge; Burgueño, Juan; Costich, Denise E; Hearne, Sarah J
2016-01-01
Quality control (QC) of germplasm identity and purity is a critical component of breeding and conservation activities. SNP genotyping technologies and increased availability of markers provide the opportunity to employ genotyping as a low-cost and robust component of this QC. In the public sector, the available low-cost SNP QC genotyping methods have been developed from very limited panels of 1,000 to 1,500 markers, without broad selection of the most informative SNPs. Selection of optimal SNPs and definition of appropriate germplasm sampling, in addition to platform selection, affect logistical and resource-use considerations for breeding and conservation applications when mainstreaming QC. In order to address these issues, we evaluated the selection and use of SNPs for QC applications from large DArTSeq data sets generated from CIMMYT maize inbred lines (CMLs). Two QC genotyping strategies were developed: the first is a "rapid QC", employing a small number of SNPs to identify potential mislabeling of seed packages or plots; the second is a "broad QC", employing a larger number of SNPs, used to identify each germplasm entry and to measure heterogeneity. The optimal marker selection strategies combined the selection of markers with high minor allele frequency, sampling of clustered SNPs in proportion to marker cluster distance, and selecting markers that maintain a uniform genomic distribution. The rapid and broad QC SNP panels selected using this approach were further validated using blind test assessments of related re-generation samples. The influence of sampling within each line was evaluated. Sampling 192 individuals would result in close to a 100% probability of detecting a 5% contamination in the entry, and approximately a 98% probability of detecting a 2% contamination of the line. These results provide a framework for the establishment of QC genotyping.
A comparison of financial and time costs for use of these approaches across different platforms is discussed providing a framework for institutions involved in maize conservation and breeding to assess the resource use effectiveness of QC genotyping. Application of these research findings, in combination with existing QC approaches, will ensure the regeneration, distribution and use in breeding of true to type inbred germplasm. These findings also provide an effective approach to optimize SNP selection for QC genotyping in other species.
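The sampling figures quoted in the abstract above (192 individuals, 5% and 2% contamination) can be reproduced with a simple binomial sketch. This is only an illustration under the assumption that each sampled individual is independently a contaminant with probability equal to the contamination rate; the abstract does not state which model the authors used, and the function name is hypothetical.

```python
def detection_probability(n_sampled: int, contamination_rate: float) -> float:
    """Probability that at least one contaminant is among n_sampled individuals.

    Assumption (not from the source): individuals are sampled independently
    and each is a contaminant with probability contamination_rate.
    """
    if n_sampled < 0:
        raise ValueError("n_sampled must be non-negative")
    if not 0.0 <= contamination_rate <= 1.0:
        raise ValueError("contamination_rate must be between 0 and 1")
    # P(no contaminant sampled) = (1 - rate) ** n, so detection is the complement.
    return 1.0 - (1.0 - contamination_rate) ** n_sampled


if __name__ == "__main__":
    for rate in (0.05, 0.02):
        prob = detection_probability(192, rate)
        print(f"contamination {rate:.2f}, 192 individuals: {prob:.4f}")
```

Under this assumption the two probabilities come out very close to 1 and at roughly 0.98, which is consistent with the values reported in the abstract.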
Sequence-Based Genotyping for Marker Discovery and Co-Dominant Scoring in Germplasm and Populations
Truong, Hoa T.; Ramos, A. Marcos; Yalcin, Feyruz; de Ruiter, Marjo; van der Poel, Hein J. A.; Huvenaars, Koen H. J.; Hogers, René C. J.; van Enckevort, Leonora. J. G.; Janssen, Antoine; van Orsouw, Nathalie J.; van Eijk, Michiel J. T.
2012-01-01
Conventional marker-based genotyping platforms are widely available, but not without their limitations. In this context, we developed Sequence-Based Genotyping (SBG), a technology for simultaneous marker discovery and co-dominant scoring, using next-generation sequencing. SBG offers users several advantages including a generic sample preparation method, a highly robust genome complexity reduction strategy to facilitate de novo marker discovery across entire genomes, and a uniform bioinformatics workflow strategy to achieve genotyping goals tailored to individual species, regardless of the availability of a reference sequence. The most distinguishing features of this technology are the ability to genotype any population structure, regardless of whether parental data is included, and the ability to co-dominantly score SNP markers segregating in populations. To demonstrate the capabilities of SBG, we performed marker discovery and genotyping in Arabidopsis thaliana and lettuce, two plant species of diverse genetic complexity and backgrounds. Initially we obtained 1,409 SNPs for arabidopsis, and 5,583 SNPs for lettuce. Further filtering of the SNP dataset produced over 1,000 high quality SNP markers for each species. We obtained a genotyping rate of 201.2 genotypes/SNP and 58.3 genotypes/SNP for arabidopsis (n = 222 samples) and lettuce (n = 87 samples), respectively. Linkage mapping using these SNPs resulted in stable map configurations. We have therefore shown that the SBG approach presented provides users with the utmost flexibility in garnering high quality markers that can be directly used for genotyping and downstream applications. Until advances and costs allow for routine whole-genome sequencing of populations, we expect that sequence-based genotyping technologies such as SBG will be essential for genotyping of model and non-model genomes alike. PMID:22662172
Singh, Amit Kumar; Kumar, Sundeep; Srinivasan, Kalyani; Tyagi, R. K.; Singh, N. K.; Singh, Rakesh
2013-01-01
Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers, the two most robust marker types for identifying rice varieties, were compared for assessment of genetic diversity and population structure. A total of 375 rice varieties from various regions of India, archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty-six markers each of hypervariable SSR (HvSSR) and SNP, distributed across the 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers, with an average of 2.22 alleles per locus, whereas 72 alleles were amplified with the SNP markers. Polymorphism information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; using an unrooted tree, all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that most of the diversity was partitioned among and within individuals rather than between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation, whereas with SNP markers the varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure was tested using K values from 1 to 20, but there was no clear population structure; therefore, Ln(PD)-derived Δk was plotted against K to determine the number of populations. With SSR markers the maximum Δk was at K=5, whereas with SNP markers the maximum Δk was at K=15, suggesting that resolution of population structure was higher with SNP markers, but that SSR markers were more efficient for diversity analysis. PMID:24367635
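The PIC values reported above can be related to allele frequencies with a short sketch. It assumes the commonly used Botstein-style definition, PIC = 1 - sum(p_i^2) - sum over pairs of 2 * p_i^2 * p_j^2; the abstract does not say which formula or software was used, so the function below is illustrative rather than a reproduction of the authors' analysis.

```python
from itertools import combinations


def pic(allele_freqs):
    """Polymorphism information content for one locus.

    allele_freqs: allele frequencies at the locus, summing to 1.
    Assumes the Botstein-style formula (an assumption, not stated in the source).
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-6:
        raise ValueError("allele frequencies must sum to 1")
    # Sum of squared frequencies (expected homozygosity).
    homozygosity = sum(p * p for p in allele_freqs)
    # Correction term over all unordered allele pairs.
    pair_term = sum(2 * (p * q) ** 2 for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - pair_term


if __name__ == "__main__":
    print(pic([0.5, 0.5]))        # biallelic locus with equal frequencies
    print(pic([0.9, 0.1]))        # biallelic locus with a rare allele
    print(pic([0.4, 0.3, 0.3]))   # multi-allelic locus, as for an SSR
```

With this definition a biallelic marker cannot exceed a PIC of 0.375, which fits the upper bound of 0.37 reported for the SNP markers, while multi-allelic SSRs can reach higher values.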
LD2SNPing: linkage disequilibrium plotter and RFLP enzyme mining for tag SNPs
Chang, Hsueh-Wei; Chuang, Li-Yeh; Chang, Yan-Jhu; Cheng, Yu-Huei; Hung, Yu-Chen; Chen, Hsiang-Chi; Yang, Cheng-Hong
2009-01-01
Background Linkage disequilibrium (LD) mapping is commonly used to evaluate markers for genome-wide association studies. Most types of LD software focus strictly on LD analysis and visualization, but lack supporting services for genotyping. Results We developed a freeware program called LD2SNPing, which provides a complete package of mining tools for genotyping and LD analysis environments. The software provides SNP ID- and gene-centric online retrievals for SNP information and tag SNP selection from dbSNP/NCBI and HapMap, respectively. Restriction fragment length polymorphism (RFLP) enzyme information for SNP genotyping is available for all SNP IDs and tag SNPs. Single and multiple SNP inputs are possible in order to perform LD analysis by online retrieval from HapMap and NCBI. An LD statistics section provides D, D', r², δQ, ρ, and the P values of the Hardy-Weinberg Equilibrium for each SNP marker, and Chi-square and likelihood-ratio tests for the pair-wise association of two SNPs in LD calculation. Finally, 2D and 3D plots, as well as plain-text output of the results, can be selected. Conclusion LD2SNPing thus provides a novel visualization environment for multiple SNP input, which facilitates SNP association studies. The software, user manual, and tutorial are freely available. PMID:19500380
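Three of the LD statistics listed above (D, D' and r²) follow directly from haplotype and allele frequencies. The sketch below uses the standard textbook definitions for two biallelic loci; it is not taken from LD2SNPing, and the function and parameter names are hypothetical.

```python
def ld_stats(p_ab: float, p_a: float, p_b: float):
    """Return (D, |D'|, r^2) for two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a:  frequency of allele A at the first locus
    p_b:  frequency of allele B at the second locus
    """
    d = p_ab - p_a * p_b
    # The maximum attainable |D| depends on the sign of D.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2


if __name__ == "__main__":
    # Complete LD: allele A always travels with allele B.
    print(ld_stats(0.5, 0.5, 0.5))
    # Linkage equilibrium: haplotype frequency equals the product of allele frequencies.
    print(ld_stats(0.25, 0.5, 0.5))
```

Here D' is reported as an absolute value, which is a common convention; a signed version would simply drop the `abs` call.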
Qiu, Gao-Feng; Xiong, Liang-Wei; Han, Zhi-Ke; Liu, Zhi-Qiang; Feng, Jian-Bin; Wu, Xu-Gan; Yan, Yin-Long; Shen, Hong; Huang, Long; Chen, Li
2017-01-01
The Chinese mitten crab Eriocheir sinensis is the most economically important cultivated crab species in China, and its genome has a high number of chromosomes (2n = 146). To obtain sufficient markers for construction of a dense genetic map for this species, we employed the recently developed specific-locus amplified fragment sequencing (SLAF-seq) method for large-scale SNP screening and genotyping in an F1 full-sib family of 149 individuals. SLAF-seq generated 127,677 polymorphic SNP markers, of which 20,803 valid markers were assigned into five segregation types and were used together with previous SSR markers for linkage map construction. The final integrated genetic map included 17,680 SNP and 629 SSR markers on the 73 linkage groups (LG), and spanned 14,894.9 cM with an average marker interval of 0.81 cM. QTL mapping localized three significant growth-related QTL to a 1.2 cM region in LG53 as well as 146 sex-linked markers in LG48. Genome-wide QTL-association analysis further identified four growth-related QTL genes named LNX2, PAK2, FMRFamide and octopamine receptors. These genes are involved in a variety of different signaling pathways including cell proliferation and growth. The map and SNP markers described here will be a valuable resource for the E. sinensis genome project and selective breeding programs. PMID:28045132
Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S
2018-01-01
Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.) is a major disease in wheat (Triticum aestivum L.). However, in recent years it has occurred rarely in Nebraska due to weather and the effective selection and pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied a genome-wide association study (GWAS) to a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers on chromosome 2D that were significantly (Bonferroni corrected P < 0.05) associated with the resistance. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of the Sr6 gene, which was expected in these genotypes based on pedigree information. A highly significant linkage disequilibrium (LD, r²) was found between the significant SNPs and the specific SSR marker for the Sr6 gene (Xcfd43). This suggests the significant SNP markers are tagging the Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All the 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection, genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) markers for the Sr6 gene. Novel SNPs for the Sr6 gene, an important stem rust resistance gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.
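The "Bonferroni corrected P < 0.05" criterion above can be made concrete with a small sketch. It assumes the standard Bonferroni procedure, in which each raw p-value is multiplied by the number of tests; the marker names and p-values are made up for illustration and the exact number of tests used in the study is only given approximately (about 35,000 SNPs).

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold that keeps the family-wise error at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def significant_markers(p_values: dict, alpha: float = 0.05) -> list:
    """Return marker names whose Bonferroni-adjusted p-value is below alpha.

    p_values maps a marker name to its raw (uncorrected) p-value; the number
    of tests is taken to be the number of markers supplied.
    """
    n_tests = len(p_values)
    return sorted(
        marker
        for marker, p in p_values.items()
        if min(1.0, p * n_tests) < alpha
    )


if __name__ == "__main__":
    # With roughly 35,000 SNPs the per-marker threshold is on the order of 1e-6.
    print(bonferroni_threshold(0.05, 35_000))
    # Hypothetical marker names and p-values, for illustration only.
    print(significant_markers({"snp1": 1e-4, "snp2": 0.03, "snp3": 0.5}))
```

The correction is conservative when markers are in strong LD, as the 32 associated SNPs in nine haplotype blocks suggest; that trade-off is inherent to the method rather than to this sketch.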
Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun
2016-06-04
Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.
SNP-markers in Allium species to facilitate introgression breeding in onion.
Scholten, Olga E; van Kaauwen, Martijn P W; Shahin, Arwa; Hendrickx, Patrick M; Keizer, L C Paul; Burger, Karin; van Heusden, Adriaan W; van der Linden, C Gerard; Vosman, Ben
2016-08-31
Within onion, Allium cepa L., the availability of disease resistance is limited. The identification of sources of resistance in related species, such as Allium roylei and Allium fistulosum, was a first step towards the improvement of onion cultivars by breeding. SNP markers linked to resistance and polymorphic between these related species and onion cultivars are a valuable tool to efficiently introgress disease resistance genes. In this paper we describe the identification and validation of SNP markers valuable for onion breeding. Transcriptome sequencing resulted in 192 million RNA seq reads from the interspecific F1 hybrid between A. roylei and A. fistulosum (RF) and nine onion cultivars. After assembly, reliable SNPs were discovered in about 36 % of the contigs. For genotyping of the interspecific three-way cross population, derived from a cross between an onion cultivar and the RF (CCxRF), 1100 SNPs that are polymorphic in RF and monomorphic in the onion cultivars (RF SNPs) were selected for the development of KASP assays. A molecular linkage map based on 667 RF-SNP markers was constructed for CCxRF. In addition, KASP assays were developed for 1600 onion-SNPs (SNPs polymorphic among onion cultivars). A second linkage map was constructed for an F2 of onion x A. roylei (F2(CxR)) that consisted of 182 onion-SNPs and 119 RF-SNPs, and 76 previously mapped markers. Markers co-segregating in both the F2(CxR) and the CCxRF population were used to assign the linkage groups of RF to onion chromosomes. To validate usefulness of these SNP markers, QTL mapping was applied in the CCxRF population that segregates for resistance to Botrytis squamosa and resulted in a QTL for resistance on chromosome 6 of A. roylei. Our research has more than doubled the publicly available marker sequences of expressed onion genes and two onion-related species. It resulted in a detailed genetic map for the interspecific CCxRF population. 
This is the first paper that reports the detection of a QTL for resistance to B. squamosa in A. roylei.
Unravelling the Genetic Diversity among Cassava Bemisia tabaci Whiteflies Using NextRAD Sequencing.
Wosula, Everlyne N; Chen, Wenbo; Fei, Zhangjun; Legg, James P
2017-11-01
Bemisia tabaci threatens production of cassava in Africa through vectoring viruses that cause cassava mosaic disease (CMD) and cassava brown streak disease (CBSD). B. tabaci sampled from cassava in eight countries in Africa were genotyped using NextRAD sequencing, and their phylogeny and population genetics were investigated using the resultant single nucleotide polymorphism (SNP) markers. SNP marker data and short sequences of mitochondrial DNA cytochrome oxidase I (mtCOI) obtained from the same insect were compared. Eight genetically distinct groups were identified based on mtCOI, whereas phylogenetic analysis using SNPs identified six major groups, which were further confirmed by PCA and multidimensional analyses. STRUCTURE analysis identified four ancestral B. tabaci populations that have contributed alleles to the six SNP-based groups. Significant gene flows were detected between several of the six SNP-based groups. Evidence of gene flow was strongest for SNP-based groups occurring in central Africa. Comparison of the mtCOI and SNP identities of sampled insects provided a strong indication that hybrid populations are emerging in parts of Africa recently affected by the severe CMD pandemic. This study reveals that mtCOI is not an effective marker at distinguishing cassava-colonizing B. tabaci haplogroups, and that more robust SNP-based multilocus markers should be developed. Significant gene flows between populations could lead to the emergence of haplogroups that might alter the dynamics of cassava virus spread and disease severity in Africa. Continuous monitoring of genetic compositions of whitefly populations should be an essential component in efforts to combat cassava viruses in Africa. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa
2012-01-01
Background Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, hardly any genetic studies have been performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results We developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of the estimated lily transcriptome and 39% of the estimated tulip transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice as abundant in UTRs (UnTranslated Regions) as in coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs, lily and tulip (12,017 groups), and among the three monocot species, lily, tulip, and rice (6,900 groups), were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSRs. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology.
Conclusions Two transcriptome sets were built that are valuable resources for marker development, comparative genomic studies and candidate gene approaches. Next generation sequencing of leaf transcriptome is very effective; however, deeper sequencing and using more tissues and stages is advisable for extended comparative studies. PMID:23167289
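The two SNP marker sets described in the Results differ only in how many neighbouring SNPs are tolerated within 50 bp of the target. A minimal sketch of such a filter is given below. It assumes SNP positions on a single contig and counts neighbours across both flanks together; the abstract does not specify these details, and the function name and example positions are hypothetical.

```python
def select_target_snps(positions, flank=50, max_flanking=0):
    """Return SNP positions with at most max_flanking other SNPs within flank bp.

    positions: distinct SNP coordinates (bp) on one contig.
    flank: size of the flanking window on either side of the target SNP.
    max_flanking: neighbouring SNPs tolerated inside the window (0 mimics
    the stricter first set, 1 the more permissive second set).
    """
    ordered = sorted(positions)
    selected = []
    for pos in ordered:
        neighbours = sum(
            1 for other in ordered if other != pos and abs(other - pos) <= flank
        )
        if neighbours <= max_flanking:
            selected.append(pos)
    return selected


if __name__ == "__main__":
    snps = [100, 130, 300, 340, 345, 600]  # made-up coordinates
    print(select_target_snps(snps, max_flanking=0))
    print(select_target_snps(snps, max_flanking=1))
```

In this toy example the permissive setting retains more targets than the strict one, which mirrors the direction of the 2 to 3 fold increase reported in the abstract without reproducing its magnitude.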
Meuwissen, Theo H E; Indahl, Ulf G; Ødegård, Jørgen
2017-12-27
Non-linear Bayesian genomic prediction models such as BayesA/B/C/R involve iteration and mostly Markov chain Monte Carlo (MCMC) algorithms, which are computationally expensive, especially when whole-genome sequence (WGS) data are analyzed. Singular value decomposition (SVD) of the genotype matrix can facilitate genomic prediction in large datasets, and can be used to estimate marker effects and their prediction error variances (PEV) in a computationally efficient manner. Here, we developed, implemented, and evaluated a direct, non-iterative method for the estimation of marker effects for the BayesC genomic prediction model. The BayesC model assumes a priori that markers have normally distributed effects with probability π and no effect with probability (1 - π). Marker effects and their PEV are estimated by using SVD and the posterior probability of the marker having a non-zero effect is calculated. These posterior probabilities are used to obtain marker-specific effect variances, which are subsequently used to approximate BayesC estimates of marker effects in a linear model. A computer simulation study was conducted to compare alternative genomic prediction methods, where a single reference generation was used to estimate marker effects, which were subsequently used for 10 generations of forward prediction, for which accuracies were evaluated. SVD-based posterior probabilities of markers having non-zero effects were generally lower than MCMC-based posterior probabilities, but for some regions the opposite occurred, resulting in clear signals for QTL-rich regions. The accuracies of breeding values estimated using SVD- and MCMC-based BayesC analyses were similar across the 10 generations of forward prediction.
For an intermediate number of generations (2 to 5) of forward prediction, accuracies obtained with the BayesC model tended to be slightly higher than accuracies obtained using the best linear unbiased prediction of SNP effects (SNP-BLUP model). When reducing marker density from WGS data to 30 K, SNP-BLUP tended to yield the highest accuracies, at least in the short term. Based on SVD of the genotype matrix, we developed a direct method for the calculation of BayesC estimates of marker effects. Although SVD- and MCMC-based marker effects differed slightly, their prediction accuracies were similar. Assuming that the SVD of the marker genotype matrix is already performed for other reasons (e.g. for SNP-BLUP), computation times for the BayesC predictions were comparable to those of SNP-BLUP.
Identification of SNP and SSR Markers in Finger Millet Using Next Generation Sequencing Technologies
Gimode, Davis; Odeny, Damaris A.; de Villiers, Etienne P.; Wanyonyi, Solomon; Dida, Mathews M.; Mneney, Emmarold E.; Muchugi, Alice; Machuka, Jesse; de Villiers, Santie M.
2016-01-01
Finger millet is an important cereal crop in eastern Africa and southern India with excellent grain storage quality and unique ability to thrive in extreme environmental conditions. Since negligible attention has been paid to improving this crop to date, the current study used Next Generation Sequencing (NGS) technologies to develop both Simple Sequence Repeat (SSR) and Single Nucleotide Polymorphism (SNP) markers. Genomic DNA from cultivated finger millet genotypes KNE755 and KNE796 was sequenced using both Roche 454 and Illumina technologies. Non-organelle sequencing reads were assembled into 207 Mbp representing approximately 13% of the finger millet genome. We identified 10,327 SSRs and 23,285 non-homeologous SNPs and tested 101 of each for polymorphism across a diverse set of wild and cultivated finger millet germplasm. For the 49 polymorphic SSRs, the mean polymorphism information content (PIC) was 0.42, ranging from 0.16 to 0.77. We also validated 92 SNP markers, 80 of which were polymorphic with a mean PIC of 0.29 across 30 wild and 59 cultivated accessions. Seventy-six of the 80 SNPs were polymorphic across 30 wild germplasm with a mean PIC of 0.30 while only 22 of the SNP markers showed polymorphism among the 59 cultivated accessions with an average PIC value of 0.15. Genetic diversity analysis using the polymorphic SNP markers revealed two major clusters; one of wild and another of cultivated accessions. Detailed STRUCTURE analysis confirmed this grouping pattern and further revealed 2 sub-populations within wild E. coracana subsp. africana. Both STRUCTURE and genetic diversity analysis assisted with the correct identification of the new germplasm collections. These polymorphic SSR and SNP markers are a significant addition to the existing 82 published SSRs, especially with regard to the previously reported low polymorphism levels in finger millet. 
Our results also reveal an unexploited finger millet genetic resource that can be included in the regional breeding programs in order to efficiently optimize productivity. PMID:27454301
Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela
2014-01-01
High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088
Montanari, Sara; Saeed, Munazza; Knäbel, Mareike; Kim, YoonKyeong; Troggio, Michela; Malnoy, Mickael; Velasco, Riccardo; Fontana, Paolo; Won, KyungHo; Durel, Charles-Eric; Perchepied, Laure; Schaffer, Robert; Wiedow, Claudia; Bus, Vincent; Brewer, Lester; Gardiner, Susan E.; Crowhurst, Ross N.; Chagné, David
2013-01-01
We have used new generation sequencing (NGS) technologies to identify single nucleotide polymorphism (SNP) markers from three European pear (Pyrus communis L.) cultivars and subsequently developed a subset of 1096 pear SNPs into high throughput markers by combining them with the set of 7692 apple SNPs on the IRSC apple Infinium® II 8K array. We then evaluated this apple and pear Infinium® II 9K SNP array for large-scale genotyping in pear across several species, using both pear and apple SNPs. The segregating populations employed for array validation included a segregating population of European pear (‘Old Home’בLouise Bon Jersey’) and four interspecific breeding families derived from Asian (P. pyrifolia Nakai and P. bretschneideri Rehd.) and European pear pedigrees. In total, we mapped 857 polymorphic pear markers to construct the first SNP-based genetic maps for pear, comprising 78% of the total pear SNPs included in the array. In addition, 1031 SNP markers derived from apple (13% of the total apple SNPs included in the array) were polymorphic and were mapped in one or more of the pear populations. These results are the first to demonstrate SNP transferability across the genera Malus and Pyrus. Our construction of high density SNP-based and gene-based genetic maps in pear represents an important step towards the identification of chromosomal regions associated with a range of horticultural characters, such as pest and disease resistance, orchard yield and fruit quality. PMID:24155917
Kaya, Hilal Betul; Cetin, Oznur; Kaya, Hulya; Sahin, Mustafa; Sefer, Filiz; Kahraman, Abdullah; Tanyolac, Bahattin
2013-01-01
Background The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. Methodology/Principal Findings Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. Conclusions/Significance This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. 
High levels of genetic variation among Turkish olive genotypes revealed by SNPs, AFLPs and SSRs allowed us to characterize the Turkish olive genotype. PMID:24058483
Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun
2016-01-01
Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to “Gopoong” and “K-1” were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information. PMID:27271615
Kuhn, David N; Motamayor, Juan Carlos; Meerow, Alan W; Borrone, James W; Schnell, Raymond J
2008-10-01
For well-studied plant species with whole genome sequence or extensive EST data, SNP markers are the logical choice for both genotyping and whole genome association studies. However, SNP markers may not address the needs of researchers working on specialty crops with limited available genomic information. Microsatellite markers have been frequently employed due to their robustness, but marker development can be difficult and may result in few polymorphic markers. SSCP markers, like microsatellites, are PCR-based and scored by electrophoretic mobility but, because they are based on SNPs rather than length differences, occur more frequently and are easier to develop than microsatellites. We have examined how well estimates of genetic diversity and genetic distance in a population or germplasm collection correlate when measured with 13 highly polymorphic microsatellite markers or with 20 SSCP markers. We observed a significant correlation in pairwise genetic distances of 82 individuals in an international cacao germplasm collection (Mantel test Rxy = 0.59, p < 0.0001 for 10,000 permutations). Both sets of markers could distinguish each individual in the population. These data provide strong support for the use of SSCP markers in the genotyping of plant species where development of microsatellites would be difficult or expensive.
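The Mantel test cited in the abstract above compares two pairwise distance matrices by permutation. The following is a minimal sketch of the general technique in plain Python, not the authors' software; the function names, the one-sided p-value convention and the toy matrices are all assumptions made for illustration.

```python
import math
import random


def upper_triangle(matrix):
    """Flatten the strict upper triangle of a square distance matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def mantel(dist_a, dist_b, permutations=999, seed=0):
    """Return (r, p) for a one-sided Mantel test of positive association."""
    rng = random.Random(seed)
    n = len(dist_a)
    flat_a = upper_triangle(dist_a)
    observed = pearson(flat_a, upper_triangle(dist_b))
    order = list(range(n))
    hits = 0
    for _ in range(permutations):
        rng.shuffle(order)
        # Permute rows and columns of the second matrix together,
        # which relabels individuals without changing the distances.
        permuted = [[dist_b[order[i]][order[j]] for j in range(n)]
                    for i in range(n)]
        if pearson(flat_a, upper_triangle(permuted)) >= observed:
            hits += 1
    # The observed arrangement is counted as one of the permutations.
    return observed, (hits + 1) / (permutations + 1)


# Toy data (hypothetical): five individuals placed on a line, with the
# second matrix an exact rescaling of the first.
points = [0, 1, 3, 6, 10]
dist_a = [[abs(p - q) for q in points] for p in points]
dist_b = [[2 * abs(p - q) for q in points] for p in points]
r, p_value = mantel(dist_a, dist_b)
print(round(r, 3), p_value)
```

With real marker data, `dist_a` and `dist_b` would be the pairwise genetic distance matrices computed from the two marker sets for the same individuals in the same order.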
Leonforte, Antonio; Sudheesh, Shimna; Cogan, Noel O I; Salisbury, Philip A; Nicolas, Marc E; Materne, Michael; Forster, John W; Kaur, Sukhjiwan
2013-10-17
Field pea (Pisum sativum L.) is a self-pollinating, diploid, cool-season food legume. Crop production is constrained by multiple biotic and abiotic stress factors, including salinity, that cause reduced growth and yield. Recent advances in genomics have permitted the development of low-cost high-throughput genotyping systems, allowing the construction of saturated genetic linkage maps for identification of quantitative trait loci (QTLs) associated with traits of interest. Genetic markers in close linkage with the relevant genomic regions may then be implemented in varietal improvement programs. In this study, single nucleotide polymorphism (SNP) markers associated with expressed sequence tags (ESTs) were developed and used to generate comprehensive linkage maps for field pea. From a set of 36,188 variant nucleotide positions detected through in silico analysis, 768 were selected for genotyping of a recombinant inbred line (RIL) population. A total of 705 SNPs (91.7%) successfully detected segregating polymorphisms. In addition to SNPs, genomic and EST-derived simple sequence repeats (SSRs) were assigned to the genetic map in order to obtain an evenly distributed genome-wide coverage. Sequences associated with the mapped molecular markers were used for comparative genomic analysis with other legume species. Higher levels of conserved synteny were observed with the genomes of Medicago truncatula Gaertn. and chickpea (Cicer arietinum L.) than with soybean (Glycine max [L.] Merr.), Lotus japonicus L. and pigeon pea (Cajanus cajan [L.] Millsp.). Parents and RIL progeny were screened at the seedling growth stage for responses to salinity stress, imposed by addition of NaCl in the watering solution at a concentration of 18 dS m-1. Salinity-induced symptoms showed normal distribution, and the severity of the symptoms increased over time. QTLs for salinity tolerance were identified on linkage groups Ps III and VII, with flanking SNP markers suitable for selection of resistant cultivars. Comparison of sequences underpinning these SNP markers to the M. truncatula genome defined genomic regions containing candidate genes associated with saline stress tolerance. The SNP assays and associated genetic linkage maps developed in this study permitted identification of salinity tolerance QTLs and candidate genes. This constitutes an important set of tools for marker-assisted selection (MAS) programs aimed at performance enhancement of field pea cultivars. PMID:24134188
Sabiel, Salih A I; Huang, Sisi; Hu, Xin; Ren, Xifeng; Fu, Chunjie; Peng, Junhua; Sun, Dongfa
2017-03-01
In the present study, 150 accessions of durum wheat germplasm (Triticum turgidum ssp. durum) of worldwide origin were observed for major seedling traits and their growth. The accessions were evaluated for major seedling traits under controlled hydroponic conditions at the 13th, 20th, 27th and 34th day after germination. Biomass traits were measured at the 34th day after germination. Correlation analysis was conducted among the seedling traits and three field traits at maturity (plant height, grain weight and 1000-grain weight) observed in four consecutive years. Associations between the measured seedling traits and SNP markers were analyzed based on the mixed linear model (MLM). The results indicated highly significant genetic variation and robust heritability for the seedling traits and the field traits at maturity. In total, 259 significant associations were detected across all traits and the four growth stages. The phenotypic variation explained (R²) by a single SNP marker was higher than 10% for most (84%) of the significant SNP markers. Forty-six SNP markers were associated with multiple traits, indicating non-negligible pleiotropy at the seedling stage. The associated SNP markers could be helpful for genetic analysis of seedling traits and for marker-assisted breeding of new wheat varieties with strong seedling vigor.
High-throughput RAD-SNP genotyping for characterization of sugar beet genotypes
USDA-ARS's Scientific Manuscript database
High-throughput SNP genotyping provides a rapid way of developing resourceful set of markers for delineating the genetic architecture and for effective species discrimination. In the presented research, we demonstrate a set of 192 SNPs for effective genotyping in sugar beet using high-throughput mar...
Weigel, K A; de los Campos, G; González-Recio, O; Naya, H; Wu, X L; Long, N; Rosa, G J M; Gianola, D
2009-10-01
The objective of the present study was to assess the predictive ability of subsets of single nucleotide polymorphism (SNP) markers for development of low-cost, low-density genotyping assays in dairy cattle. Dense SNP genotypes of 4,703 Holstein bulls were provided by the USDA Agricultural Research Service. A subset of 3,305 bulls born from 1952 to 1998 was used to fit various models (training set), and a subset of 1,398 bulls born from 1999 to 2002 was used to evaluate their predictive ability (testing set). After editing, data included genotypes for 32,518 SNP and August 2003 and April 2008 predicted transmitting abilities (PTA) for lifetime net merit (LNM$), the latter resulting from progeny testing. The Bayesian least absolute shrinkage and selection operator method was used to regress August 2003 PTA on marker covariates in the training set to arrive at estimates of marker effects and direct genomic PTA. The coefficient of determination (R²) from regressing the April 2008 progeny test PTA of bulls in the testing set on their August 2003 direct genomic PTA was 0.375. Subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP were created by choosing equally spaced and highly ranked SNP, with the latter based on the absolute value of their estimated effects obtained from the training set. The SNP effects were re-estimated from the training set for each subset of SNP, and the 2008 progeny test PTA of bulls in the testing set were regressed on corresponding direct genomic PTA. The R² values for subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP with largest effects (evenly spaced SNP) were 0.184 (0.064), 0.236 (0.111), 0.269 (0.190), 0.289 (0.179), 0.307 (0.228), 0.313 (0.268), and 0.322 (0.291), respectively. 
These results indicate that a low-density assay comprising selected SNP could be a cost-effective alternative for selection decisions and that significant gains in predictive ability may be achieved by increasing the number of SNP allocated to such an assay from 300 or fewer to 1,000 or more.
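The two subset designs compared in this abstract (SNP ranked by absolute estimated effect versus evenly spaced SNP) can be illustrated with a short sketch. The helper names `top_ranked` and `evenly_spaced` are hypothetical, and spacing here is by index in a map-ordered marker list, which is an assumption; the study may have spaced markers by genomic position.

```python
def top_ranked(effects, k):
    """Indices of the k SNP with the largest absolute estimated effects."""
    ranked = sorted(range(len(effects)),
                    key=lambda i: abs(effects[i]),
                    reverse=True)
    # Return the chosen indices in map order rather than rank order.
    return sorted(ranked[:k])


def evenly_spaced(n_snp, k):
    """Indices of k SNP spread evenly across n_snp map-ordered markers."""
    if k >= n_snp:
        return list(range(n_snp))
    step = n_snp / k
    # Take the marker nearest the middle of each of the k equal bins.
    return [int(i * step + step / 2) for i in range(k)]


# Toy effects (hypothetical values, not from the study).
effects = [0.1, -0.9, 0.3, 0.05, -0.4]
print(top_ranked(effects, 2))
print(len(evenly_spaced(32518, 300)))
```

Either index list would then be used to subset the genotype matrix before marker effects are re-estimated on the training set.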
Ma, G J; Song, Q J; Markell, S G; Qi, L L
2018-07-01
A novel rust resistance gene, R15, derived from the cultivated sunflower HA-R8 was assigned to linkage group 8 of the sunflower genome using a genotyping-by-sequencing approach. SNP markers closely linked to R15 were identified, facilitating marker-assisted selection of resistance genes. The rust virulence gene is co-evolving with the resistance gene in sunflower, leading to the emergence of new physiologic pathotypes. This presents a continuous threat to the sunflower crop, necessitating the development of resistant sunflower hybrids providing a more efficient, durable, and environmentally friendly host plant resistance. The inbred line HA-R8 carries a gene conferring resistance to all known races of the rust pathogen in North America and can be used as a broad-spectrum resistance resource. Based on phenotypic assessments of 140 F2 individuals derived from a cross of HA 89 with HA-R8, rust resistance in the population was found to be conferred by a single dominant gene (R15) originating from HA-R8. Genotypic analysis with the currently available SSR markers failed to find any association between rust resistance and any markers. Therefore, we used genotyping-by-sequencing (GBS) analysis to achieve better genomic coverage. The GBS data showed that R15 was located at the top end of linkage group (LG) 8. Saturation with 71 previously mapped SNP markers selected within this region further showed that it was located in a resistance gene cluster on LG8, and mapped to a 1.0-cM region between three co-segregating SNP markers SFW01920, SFW00128, and SFW05824 as well as the NSA_008457 SNP marker. These closely linked markers will facilitate marker-assisted selection and breeding in sunflower.
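A single dominant gene predicts a 3:1 ratio of resistant to susceptible plants in an F2 population, which is commonly checked with a chi-square goodness-of-fit test. The sketch below shows that calculation under stated assumptions: the abstract does not report the phenotype counts, so the counts used here are hypothetical, and the function name is illustrative.

```python
import math


def chi_square_3_to_1(resistant, susceptible):
    """Chi-square goodness-of-fit test against a 3:1 F2 ratio (1 df)."""
    total = resistant + susceptible
    observed = (resistant, susceptible)
    expected = (0.75 * total, 0.25 * total)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # For one degree of freedom, the upper-tail probability of the
    # chi-square distribution equals erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts for 140 F2 plants (not taken from the study).
chi2, p_value = chi_square_3_to_1(104, 36)
print(round(chi2, 3), round(p_value, 3))
```

A p-value above the chosen threshold (0.05 is conventional) means the observed counts do not depart significantly from the 3:1 expectation.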
2012-01-01
Background Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. Results An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. Conclusions The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison. PMID:22908993
Wu, Jianhui; Huang, Shuo; Zeng, Qingdong; Liu, Shengjie; Wang, Qilin; Mu, Jingmei; Yu, Shizhou; Han, Dejun; Kang, Zhensheng
2018-06-16
A major stripe rust resistance QTL on chromosome 4BL was localized to a 4.5-Mb interval using comparative QTL mapping methods and validated in 276 wheat genotypes by haplotype analysis. CIMMYT-derived wheat line P10103 was previously identified to have adult plant resistance (APR) to stripe rust in the greenhouse and field. The conventional approach for QTL mapping in common wheat is laborious. Here, we performed QTL detection of APR using a combination of genome-wide scanning and extreme pool-genotyping. SNP-based genetic maps were constructed using the Wheat55K SNP array to genotype a recombinant inbred line (RIL) population derived from the cross Mingxian 169 × P10103. Five stable QTL were detected across multiple environments. After comparing SNP profiles from contrasting, extreme DNA pools of RILs, six putative QTL were located to approximate chromosome positions. A major QTL on chromosome 4B was identified in F2:4 contrasting pools from cross Zhengmai 9023 × P10103. A consensus QTL (LOD = 26-40, PVE = 42-55%), named QYr.nwafu-4BL, was defined and localized to a 4.5-Mb interval flanked by SNP markers AX-110963704 and AX-110519862 in chromosome arm 4BL. Based on stripe rust response, marker genotypes, pedigree analysis and mapping data, QYr.nwafu-4BL is likely to be a new APR QTL. The applicability of the SNP-based markers flanking QYr.nwafu-4BL was validated on a diversity panel of 276 wheat lines. The additional minor QTL on chromosomes 4A, 5A, 5B and 6A enhanced the level of resistance conferred by QYr.nwafu-4BL. Marker-assisted pyramiding of QYr.nwafu-4BL and other favorable minor QTL in new wheat cultivars should improve the level of APR to stripe rust.
USDA-ARS's Scientific Manuscript database
The development of resources for genomic studies in Mangifera indica (mango) will allow marker-assisted selection and identification of genetically diverse germplasm, greatly aiding mango breeding programs. We report here a first step in developing such resources, our identification of thousands una...
Jin, Hui; Wen, Weie; Liu, Jindong; Zhai, Shengnan; Zhang, Yan; Yan, Jun; Liu, Zhiyong; Xia, Xianchun; He, Zhonghu
2016-01-01
Dough rheological and starch pasting properties play an important role in determining processing quality in bread wheat (Triticum aestivum L.). In the present study, a recombinant inbred line (RIL) population derived from a Gaocheng 8901/Zhoumai 16 cross grown in three environments was used to identify quantitative trait loci (QTLs) for dough rheological and starch pasting properties evaluated by Mixograph, Rapid Visco-Analyzer (RVA), and Mixolab parameters using the wheat 90 and 660 K single nucleotide polymorphism (SNP) chip assays. A high-density linkage map constructed with 46,961 polymorphic SNP markers from the wheat 90 and 660 K SNP assays spanned a total length of 4121 cM, with an average chromosome length of 196.2 cM and marker density of 0.09 cM/marker; 6596 new SNP markers were anchored to the bread wheat linkage map, with 1046 and 5550 markers from the 90 and 660 K SNP assays, respectively. Composite interval mapping identified 119 additive QTLs on 20 chromosomes except 4D; among them, 15 accounted for more than 10% of the phenotypic variation across two or three environments. Twelve QTLs for Mixograph parameters, 17 for RVA parameters and 55 for Mixolab parameters were new. Eleven QTL clusters were identified. The closely linked SNP markers can be used in marker-assisted wheat breeding in combination with the Kompetitive Allele Specific PCR (KASP) technique for improvement of processing quality in bread wheat. PMID:27486464
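The summary statistics quoted for this linkage map follow directly from the reported totals, as the short check below shows. The only value not stated in the abstract is the chromosome count: bread wheat has 21 chromosome pairs, and the function name is illustrative.

```python
def map_summary(total_cm, n_markers, n_chromosomes):
    """Return (average chromosome length, average marker spacing) in cM."""
    return total_cm / n_chromosomes, total_cm / n_markers


# Totals reported in the abstract; 21 is the chromosome number of
# hexaploid bread wheat (2n = 6x = 42).
avg_length, density = map_summary(total_cm=4121, n_markers=46961,
                                  n_chromosomes=21)
print(round(avg_length, 1))  # average chromosome length, cM
print(round(density, 2))     # cM per marker

# The newly anchored markers from the two assays sum to the stated total.
print(1046 + 5550)
```

Rounded to the precision used in the abstract, these reproduce the reported 196.2 cM average chromosome length, 0.09 cM/marker density and 6596 newly anchored markers.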
Pyne, Robert; Honig, Josh; Vaiciunas, Jennifer; Koroch, Adolfina; Wyenandt, Christian; Bonos, Stacy; Simon, James
2017-01-01
Limited understanding of sweet basil (Ocimum basilicum L.) genetics and genome structure has reduced efficiency of breeding strategies. This is evidenced by the rapid, worldwide dissemination of basil downy mildew (Peronospora belbahrii) in the absence of resistant cultivars. In an effort to improve available genetic resources, expressed sequence tag simple sequence repeat (EST-SSR) and single nucleotide polymorphism (SNP) markers were developed and used to genotype the MRI x SB22 F2 mapping population, which segregates for response to downy mildew. SNP markers were generated from genomic sequences derived from double digestion restriction site associated DNA sequencing (ddRADseq). Disomic segregation was observed in both SNP and EST-SSR markers providing evidence of an O. basilicum allotetraploid genome structure and allowing for subsequent analysis of the mapping population as a diploid intercross. A dense linkage map was constructed using 42 EST-SSR and 1,847 SNP markers spanning 3,030.9 cM. Multiple quantitative trait loci (QTL) model (MQM) analysis identified three QTL that explained 37–55% of phenotypic variance associated with downy mildew response across three environments. A single major QTL, dm11.1 explained 21–28% of phenotypic variance and demonstrated dominant gene action. Two minor QTL dm9.1 and dm14.1 explained 5–16% and 4–18% of phenotypic variance, respectively. Evidence is provided for an additive effect between the two minor QTL and the major QTL dm11.1 increasing downy mildew susceptibility. Results indicate that ddRADseq-facilitated SNP and SSR marker genotyping is an effective approach for mapping the sweet basil genome. PMID:28922359
Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.
USDA-ARS's Scientific Manuscript database
Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ~4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification pr...
USDA-ARS's Scientific Manuscript database
As an initial step to explore the transcriptome genetic diversity and to discover single nucleotide polymorphic (SNP)-biomarkers for marker assisted breeding within Pima (Gossypium barbadense L.) cotton, leaves from 25 day plants of three diverse genotypes were used to develop cDNA libraries. Using ...
Development and validation of a low-density SNP panel related to prolificacy in sheep
USDA-ARS's Scientific Manuscript database
High-density SNP panels (e.g., 50,000 and 600,000 markers) have been used in exploratory population genetic studies with commercial and minor breeds of sheep. However, routine genetic diversity evaluations of large numbers of samples with large panels are in general cost-prohibitive for gene banks. ...
Ali, Shahin S.; Shao, Jonathan; Strem, Mary D.; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W.; Bailey, Bryan A.
2015-01-01
Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri. PMID:26379633
Patil, Gunvant; Do, Tuyen; Vuong, Tri D.; Valliyodan, Babu; Lee, Jeong-Dong; Chaudhary, Juhi; Shannon, J. Grover; Nguyen, Henry T.
2016-01-01
Soil salinity is a limiting factor of crop yield. The soybean is sensitive to soil salinity, and a dominant gene, Glyma03g32900, is primarily responsible for salt-tolerance. The identification of high throughput and robust markers as well as the deployment of salt-tolerant cultivars are effective approaches to minimize yield loss under saline conditions. We utilized high quality (15x) whole-genome resequencing (WGRS) on 106 diverse soybean lines and identified three major structural variants and allelic variation in the promoter and genic regions of the GmCHX1 gene. The discovery of single nucleotide polymorphisms (SNPs) associated with structural variants facilitated the design of six KASPar assays. Additionally, haplotype analysis and pedigree tracking of 93 U.S. ancestral lines were performed using publicly available WGRS datasets. Identified SNP markers were validated, and a strong correlation was observed between the genotype and salt treatment phenotype (leaf scorch, chlorophyll content and Na+ accumulation) using a panel of 104 soybean lines and an interspecific bi-parental population (F8) from PI483463 x Hutcheson. These markers precisely identified salt-tolerant/sensitive genotypes (>91%), and different structural-variants (>98%). These SNP assays, supported by accurate phenotyping, haplotype analyses and pedigree tracking information, will accelerate marker-assisted selection programs to enhance the development of salt-tolerant soybean cultivars. PMID:26781337
Sun, Hua; Wang, Hong-Tao; Kwon, Woo-Saeng; Kim, Yeon-Ju; In, Jun-Gyo; Yang, Deok-Chun
2011-11-01
Yunpoong is an important Korean ginseng (Panax ginseng C. A. Meyer) cultivar, but no molecular marker has been available to distinguish Yunpoong from other cultivars. In this study, we developed a single nucleotide polymorphism (SNP) marker for Yunpoong based on analysis of expressed sequence tags (ESTs) in an exon region of the glyceraldehyde 3-phosphate dehydrogenase (GAPDH) gene. This SNP marker authenticated Yunpoong with high specificity among twelve main ginseng cultivars. To apply the marker, a rapid identification method was established based on the NaOH-Tris method and real-time polymerase chain reaction (PCR) to make cultivar selection more efficient. The main feature of the NaOH-Tris method is that it makes DNA extraction from young leaf tissue simple and rapid: extraction took only 1 min, and the extract was used directly for PCR. In this report, the conventional DNA extraction method was used during marker development, and the NaOH-Tris method was applied in screening large numbers of cultivars. Moreover, the greatest advantage of real-time PCR over traditional PCR is its time saving and high efficiency. Thus, this strategy provides a rapid and reliable method for the specific identification of Yunpoong in a large number of samples. Copyright © 2011 Elsevier B.V. All rights reserved.
Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple
Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron
2012-01-01
As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718
NASA Astrophysics Data System (ADS)
Li, Jiqin; Bao, Zhenmin; Li, Ling; Wang, Xiaojian; Wang, Shi; Hu, Xiaoli
2013-09-01
Zhikong scallop (Chlamys farreri) is an important maricultured species in China. Many studies on this species, such as population genetics and QTL fine-mapping, require a large number of molecular markers. In this study, a total of 300 putative single nucleotide polymorphisms (SNPs) were selected from expressed sequence tags (ESTs) and validated using high resolution melting (HRM) technology with unlabeled probes. Of them, 101 (33.7%) were found to be polymorphic in 48 individuals from 4 populations. Further evaluation with 48 individuals from the Qingdao population showed that all the polymorphic loci had two alleles, with the minor allele frequency ranging from 0.046 to 0.500. The observed and expected heterozygosities ranged from 0.000 to 0.925 and from 0.089 to 0.505, respectively. Fifteen loci deviated significantly from Hardy-Weinberg equilibrium, and significant linkage disequilibrium was detected in one pair of markers. BLASTx gave significant hits for 72 of the 101 polymorphic SNP-containing ESTs. Thirty-four polymorphic SNP loci were predicted to be non-synonymous substitutions, as they caused either a codon change (33 SNPs) or premature termination of translation (1 SNP). The markers developed can be used for population studies and genetic improvement of Zhikong scallop.
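The per-locus statistics reported above (minor allele frequency, observed and expected heterozygosity, deviation from Hardy-Weinberg equilibrium) can be illustrated with a minimal Python sketch. The function name and the genotype counts are hypothetical and are not taken from the study. The sample-size-corrected heterozygosity is included as an assumption: an upper value of 0.505 exceeds the 0.5 maximum of the uncorrected 2pq and is consistent with such a correction at n = 48.

```python
def locus_summary(n_aa, n_ab, n_bb):
    """Summarise one biallelic SNP from genotype counts (AA, AB, BB).

    Hypothetical helper for illustration; not the software used in the study.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotyped individual is required")

    p = (2 * n_aa + n_ab) / (2 * n)      # frequency of allele A
    q = 1.0 - p                          # frequency of allele B
    h_obs = n_ab / n                     # observed heterozygosity
    h_exp = 2 * p * q                    # expected heterozygosity under HWE
    # Sample-size-corrected estimate (assumed here, see lead-in).
    h_exp_unbiased = h_exp * (2 * n) / (2 * n - 1)

    # Chi-square goodness of fit against HWE expectations (1 degree of freedom).
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)

    return {
        "maf": min(p, q),
        "h_obs": h_obs,
        "h_exp": h_exp,
        "h_exp_unbiased": h_exp_unbiased,
        "chi2": chi2,
    }


# Hypothetical counts for 48 individuals with a heterozygote deficit.
summary = locus_summary(20, 8, 20)
print(summary)
```

A chi-square value above 3.841 corresponds to P < 0.05 at one degree of freedom; for small samples an exact test is often preferred, which this sketch does not implement.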
Ferchaud, Anne-Laure; Pedersen, Susanne H; Bekkevold, Dorte; Jian, Jianbo; Niu, Yongchao; Hansen, Michael M
2014-10-06
The threespine stickleback (Gasterosteus aculeatus) has become an important model species for studying both contemporary and parallel evolution. In particular, differential adaptation to freshwater and marine environments has led to high differentiation between freshwater and marine stickleback populations at the phenotypic trait of lateral plate morphology and the underlying candidate gene Ectodysplasin (EDA). Many studies have focused on this trait and candidate gene, although other genes involved in marine-freshwater adaptation may be equally important. In order to develop a resource for rapid and cost-efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. RAD (Restriction site Associated DNA) sequencing of sixty individuals representing two freshwater populations and one marine population led to the identification of 33,993 SNP markers. Ninety-six of these were chosen for the low-density SNP array, among which 70 represented SNPs under putatively directional selection in freshwater vs. marine environments, whereas 26 SNPs were assumed to be neutral. Annotation of these regions revealed several genes that are candidates for affecting stickleback phenotypic variation, some of which have been observed in previous studies whereas others are new. We have developed a cost-efficient low-density SNP array that allows for rapid screening of polymorphisms in threespine stickleback. The array provides a valuable tool for analyzing adaptive divergence between freshwater and marine stickleback populations beyond the well-established candidate gene Ectodysplasin (EDA).
A 48 SNP set for grapevine cultivar identification
2011-01-01
Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough as to provide sufficient information content for genetic identification in grapevine allowing for good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker. 
Furthermore, because SNP markers are bi-allelic, allele identification and genotype naming are extremely simple, and genotypes obtained with different equipment and by different laboratories are always fully comparable. PMID:22060012
Zhong, Chao; Sun, Suli; Li, Yinping; Duan, Canxing; Zhu, Zhendong
2018-03-01
A novel Phytophthora sojae resistance gene RpsHC18 was identified and finely mapped on soybean chromosome 3. Two NBS-LRR candidate genes were identified and two diagnostic markers of RpsHC18 were developed. Phytophthora root rot caused by Phytophthora sojae is a destructive disease of soybean. The most effective disease-control strategy is to deploy resistant cultivars carrying Phytophthora-resistant Rps genes. The soybean cultivar Huachun 18 has a broad and distinct resistance spectrum to 12 P. sojae isolates. Quantitative trait loci sequencing (QTL-seq), based on the whole-genome resequencing (WGRS) of two extreme resistant and susceptible phenotype bulks from an F2:3 population, was performed, and one 767-kb genomic region with ΔSNP-index ≥ 0.9 on chromosome 3 was identified as the RpsHC18 candidate region in Huachun 18. The candidate region was reduced to a 146-kb region by fine mapping. Nonsynonymous SNP and haplotype analyses were carried out in the 146-kb region among ten soybean genotypes using WGRS. Four specific nonsynonymous SNPs were identified in two nucleotide-binding site-leucine-rich repeat (NBS-LRR) genes, RpsHC18-NBL1 and RpsHC18-NBL2, which were considered to be the candidate genes. Finally, one specific SNP marker in each candidate gene was successfully developed using a tetra-primer ARMS-PCR assay, and the two markers were verified to be specific for RpsHC18 and to effectively distinguish it from other known Rps genes. In this study, we applied an integrated genomic-based strategy combining WGRS with traditional genetic mapping to identify RpsHC18 candidate genes and develop diagnostic markers. These results suggest that next-generation sequencing is a precise, rapid and cost-effective way to identify candidate genes and develop diagnostic markers, and it can accelerate Rps gene cloning and marker-assisted selection for breeding of P. sojae-resistant soybean cultivars.
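The ΔSNP-index criterion mentioned in this abstract can be sketched in a few lines of Python. The SNP-index of a bulk at one site is the fraction of reads carrying the non-reference allele, and the ΔSNP-index is the difference between the two bulks. The function names, the read counts, and the sign convention (resistant bulk minus susceptible bulk) are assumptions for illustration; QTL-seq pipelines usually average the index over sliding windows and apply depth filters, which this per-site sketch omits.

```python
def snp_index(alt_reads, depth):
    """Fraction of reads at a site that carry the non-reference allele."""
    if depth <= 0:
        raise ValueError("read depth must be positive")
    return alt_reads / depth


def delta_snp_index(res_alt, res_depth, sus_alt, sus_depth):
    """SNP-index of the resistant bulk minus that of the susceptible bulk.

    The direction of the subtraction is an assumption of this sketch.
    """
    return snp_index(res_alt, res_depth) - snp_index(sus_alt, sus_depth)


def candidate_sites(sites, threshold=0.9):
    """Return positions whose delta SNP-index meets the threshold.

    `sites` is an iterable of tuples:
    (position, res_alt, res_depth, sus_alt, sus_depth).
    """
    return [
        pos
        for pos, res_alt, res_depth, sus_alt, sus_depth in sites
        if delta_snp_index(res_alt, res_depth, sus_alt, sus_depth) >= threshold
    ]


# Hypothetical read counts at two sites; only the first is strongly differentiated.
example_sites = [
    (100, 30, 30, 1, 30),
    (200, 15, 30, 14, 30),
]
print(candidate_sites(example_sites))
```

Contiguous runs of sites that pass the threshold would then define a candidate interval such as the 767-kb region described above.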
Clarke, Shannon M.; Henry, Hannah M.; Dodds, Ken G.; Jowett, Timothy W. D.; Manley, Tim R.; Anderson, Rayna M.; McEwan, John C.
2014-01-01
Accurate pedigree information is critical to animal breeding systems to ensure the highest rate of genetic gain and management of inbreeding. The abundance of available genomic data, together with development of high throughput genotyping platforms, means that single nucleotide polymorphisms (SNPs) are now the DNA marker of choice for genomic selection studies. Furthermore, the superior qualities of SNPs compared to microsatellite markers allow for standardization between laboratories, a property that is crucial for developing an international set of markers for traceability studies. The objective of this study was to develop a high throughput SNP assay for use in the New Zealand sheep industry that gives accurate pedigree assignment and will allow a reduction in breeder input over lambing. This required two phases of development: first, a method of extracting quality DNA from ear-punch tissue in a high throughput, cost-efficient manner, and second, a SNP assay that has the ability to assign paternity to progeny resulting from mob mating. A likelihood-based approach to infer paternity was used, in which the sire with the highest LOD score (log of the ratio of the likelihood given parentage to the likelihood given non-parentage) is assigned. A “parentage SNP panel” of 84 markers was developed that assigned, on average, 99% of progeny to a sire in a problem where there were 3,000 progeny from 120 mob-mated sires that included numerous half-sib sires. In only 6% of those cases was there another sire with at least a 0.02 probability of paternity. Furthermore, dam information (either recorded, or by genotyping possible dams) was absent, highlighting the SNP test’s suitability for paternity testing. Utilization of this parentage SNP assay will allow implementation of progeny testing into large commercial farms where the improved accuracy of sire assignment and genetic evaluations will increase genetic gain in the sheep industry. PMID:24740141
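The LOD-based sire assignment described above can be illustrated with a minimal Python sketch for biallelic SNPs when the dam is unknown. It is not the authors' implementation: the function names, the genotype coding (count of allele A: 0, 1 or 2), the 1% error term, and the use of base-10 logarithms are all assumptions, and allele frequencies are assumed to lie strictly between 0 and 1.

```python
import math


def prob_given_sire(offspring, sire, p):
    """P(offspring genotype | sire genotype, unknown dam drawn from the population)."""
    t = sire / 2.0                       # chance the sire transmits allele A
    if offspring == 2:
        return t * p
    if offspring == 0:
        return (1 - t) * (1 - p)
    return t * (1 - p) + (1 - t) * p


def prob_unrelated(offspring, p):
    """Hardy-Weinberg genotype probability for an unrelated individual."""
    q = 1 - p
    return {0: q * q, 1: 2 * p * q, 2: p * p}[offspring]


def lod_score(offspring, sire, freqs, error=0.01):
    """Sum over loci of log10(L(parentage) / L(non-parentage))."""
    lod = 0.0
    for o, s, p in zip(offspring, sire, freqs):
        if o is None or s is None:       # skip missing genotype calls
            continue
        unrelated = prob_unrelated(o, p)
        # The error term keeps a single mismatch from giving a zero likelihood.
        related = (1 - error) * prob_given_sire(o, s, p) + error * unrelated
        lod += math.log10(related / unrelated)
    return lod


def assign_sire(offspring, candidates, freqs):
    """Return the candidate sire with the highest LOD score and all scores."""
    scores = {name: lod_score(offspring, g, freqs) for name, g in candidates.items()}
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical four-SNP example with allele A at frequency 0.5 at every locus.
freqs = [0.5, 0.5, 0.5, 0.5]
lamb = [2, 0, 2, 0]
sires = {"sireA": [2, 0, 2, 0], "sireB": [0, 2, 0, 2]}
print(assign_sire(lamb, sires, freqs))
```

In practice the gap between the best and second-best LOD scores is what supports a confident assignment, which is why the abstract reports how often a second sire retained an appreciable probability of paternity.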
Construction of a versatile SNP array for pyramiding useful genes of rice.
Kurokawa, Yusuke; Noda, Tomonori; Yamagata, Yoshiyuki; Angeles-Shim, Rosalyn; Sunohara, Hidehiko; Uehara, Kanako; Furuta, Tomoyuki; Nagai, Keisuke; Jena, Kshirod Kumar; Yasui, Hideshi; Yoshimura, Atsushi; Ashikari, Motoyuki; Doi, Kazuyuki
2016-01-01
DNA marker-assisted selection (MAS) has become an indispensable component of breeding. Single nucleotide polymorphisms (SNPs) are the most frequent polymorphism in the rice genome. However, SNP markers are not readily employed in MAS because of limitations in genotyping platforms. Here the authors report a Golden Gate SNP array that targets specific genes controlling yield-related traits and biotic stress resistance in rice. As a first step, the SNP genotypes were surveyed in 31 parental varieties using the Affymetrix Rice 44K SNP microarray. The haplotype information for 16 target genes was then converted to the Golden Gate platform with 143-plex markers. Haplotypes for the 14 useful alleles are unique and can discriminate among all other varieties. The genotyping consistency between the Affymetrix microarray and the Golden Gate array was 92.8%, and the accuracy of the Golden Gate array was confirmed in 3 F2 segregating populations. The concept of haplotype-based selection using the constructed SNP array was demonstrated. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd. All rights reserved.
De La Vega, Francisco M; Dailey, David; Ziegle, Janet; Williams, Julie; Madden, Dawn; Gilbert, Dennis A
2002-06-01
Since public and private efforts announced the first draft of the human genome last year, researchers have reported great numbers of single nucleotide polymorphisms (SNPs). We believe that the availability of well-mapped, quality SNP markers constitutes the gateway to a revolution in genetics and personalized medicine that will lead to better diagnosis and treatment of common complex disorders. A new generation of tools and public SNP resources for pharmacogenomic and genetic studies--specifically for candidate-gene, candidate-region, and whole-genome association studies--will form part of the new scientific landscape. This will only be possible through the greater accessibility of SNP resources and superior high-throughput instrumentation-assay systems that enable affordable, highly productive large-scale genetic studies. We are contributing to this effort by developing a high-quality linkage disequilibrium SNP marker map and an accompanying set of ready-to-use, validated SNP assays across every gene in the human genome. This effort incorporates both the public sequence and SNP data sources, and Celera Genomics' human genome assembly and enormous resource of physically mapped SNPs (approximately 4,000,000 unique records). This article discusses our approach and methodology for designing the map, choosing quality SNPs, designing and validating these assays, and obtaining population frequency of the polymorphisms. We also discuss an advanced, high-performance SNP assay chemistry--a new generation of the TaqMan probe-based, 5' nuclease assay--and high-throughput instrumentation-software system for large-scale genotyping. We provide the new SNP map and validation information, validated SNP assays and reagents, and instrumentation systems as a novel resource for genetic discoveries.
Shavrukov, Yuri; Suchecki, Radoslaw; Eliby, Serik; Abugalieva, Aigul; Kenebayev, Serik; Langridge, Peter
2014-09-28
New SNP marker platforms offer the opportunity to investigate the relationships between wheat cultivars from different regions and assess the mechanism and processes that have led to adaptation to particular production environments. Wheat breeding has a long history in Kazakhstan and the aim of this study was to explore the relationship between key varieties from Kazakhstan and germplasm from breeding programs for other regions. The study revealed 5,898 polymorphic markers amongst ten cultivars, of which 2,730 were mapped in the consensus genetic map. Mapped SNP markers were distributed almost equally across the A and B genomes, with between 279 and 484 markers assigned to each chromosome. Marker coverage was approximately 10-fold lower in the D genome. There were 863 SNP markers identified as unique to specific cultivars, and clusters of these markers (regions containing more than three closely mapped unique SNPs) showed specific patterns on the consensus genetic map for each cultivar. Significant intra-varietal genetic polymorphism was identified in three cultivars (Tzelinnaya 3C, Kazakhstanskaya rannespelaya and Kazakhstanskaya 15). Phylogenetic analysis based on inter-varietal polymorphism showed that the very old cultivar Erythrospermum 841 was the most genetically distinct from the other nine cultivars from Kazakhstan, falling in a clade together with the American cultivar Sonora and genotypes from Central and South Asia. The modern cultivar Kazakhstanskaya 19 also fell into a separate clade, together with the American cultivar Thatcher. The remaining eight cultivars shared a single sub-clade but were categorised into four clusters. The accumulated data for SNP marker polymorphisms amongst bread wheat genotypes from Kazakhstan may be used for studying genetic diversity in bread wheat, with potential application for marker-assisted selection and the preparation of a set of genotype-specific markers.
Development of a set of SNP markers present in expressed genes of the apple.
Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S
2008-11-01
Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequence tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, amplification was carried out in two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5 map, using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
USDA-ARS's Scientific Manuscript database
Among SNP markers that become increasingly valuable in molecular breeding of crop plants are the CAP and dCAP markers derived from the genes of interest. To date, the number of such gene-based markers is small in polyploid crop plants such as tetraploid cotton that has A and D subgenomes. The obje...
Nakatochi, Masahiro; Ushida, Yasunori; Yasuda, Yoshinari; Yoshida, Yasuko; Kawai, Shun; Kato, Ryuji; Nakashima, Toru; Iwata, Masamitsu; Kuwatsuka, Yachiyo; Ando, Masahiko; Hamajima, Nobuyuki; Kondo, Takaaki; Oda, Hiroaki; Hayashi, Mutsuharu; Kato, Sawako; Yamaguchi, Makoto; Maruyama, Shoichi; Matsuo, Seiichi; Honda, Hiroyuki
2015-01-01
Although many single nucleotide polymorphisms (SNPs) have been identified as associated with metabolic syndrome (MetS), the simple addition of SNPs to clinical risk markers has yielded only a slight improvement in the ability to predict future MetS. To improve the ability to predict future MetS, combinational effects, such as SNP-SNP interaction, SNP-environment interaction, and SNP-clinical parameter (SNP × CP) interaction, should also be considered. We performed a case-control study to explore novel SNP × CP interactions as risk markers for MetS based on health check-up data of Japanese male employees. We selected 99 SNPs that were previously reported to be associated with MetS and components of MetS; subsequently, we genotyped these SNPs in 360 cases and 1983 control subjects. First, we performed logistic regression analyses to assess the association of each SNP with MetS. Of these SNPs, five were significantly associated with MetS (P < 0.05): LRP2 rs2544390, rs1800592 between UCP1 and TBC1D9, APOA5 rs662799, VWF rs7965413, and rs1411766 between MYO16 and IRS2. Furthermore, we performed multiple logistic regression analyses, including an SNP term, a CP term, and an SNP × CP interaction term for each CP and SNP that was significantly associated with MetS. We identified a novel SNP × CP interaction between rs7965413 and platelet count that was significantly associated with MetS [SNP term: odds ratio (OR) = 0.78, P = 0.004; SNP × CP interaction term: OR = 1.33, P = 0.001]. This association of the SNP × CP interaction with MetS remained nominally significant in multiple logistic regression analysis after adjustment for either the number of MetS components or MetS components excluding obesity. Our results reveal new insight into platelet count as a risk marker for MetS.
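How an interaction term changes the reading of an SNP effect can be shown with a short worked example in Python. In a logistic model with terms b1·SNP + b2·CP + b3·SNP·CP, the per-allele odds ratio at a given CP value c is exp(b1 + b3·c), that is, OR_SNP × OR_interaction^c. The function names are hypothetical, and the scale on which platelet count entered the model (units, centering, standardization) is not stated in the abstract, so `cp_value` below is in unspecified model units.

```python
import math


def snp_odds_ratio_at(cp_value, or_snp, or_interaction):
    """Per-allele odds ratio of the SNP when the clinical parameter equals cp_value.

    Follows from logit = b0 + b1*SNP + b2*CP + b3*SNP*CP, where
    or_snp = exp(b1) and or_interaction = exp(b3).
    """
    return or_snp * or_interaction ** cp_value


def crossover_cp(or_snp, or_interaction):
    """CP value (in model units) at which the SNP's odds ratio equals 1."""
    return -math.log(or_snp) / math.log(or_interaction)


# Odds ratios reported in the abstract for rs7965413 and its interaction
# with platelet count; the CP values themselves are hypothetical.
OR_SNP = 0.78
OR_INTERACTION = 1.33

for cp in (-1.0, 0.0, 1.0):
    print(cp, round(snp_odds_ratio_at(cp, OR_SNP, OR_INTERACTION), 3))
print("crossover:", crossover_cp(OR_SNP, OR_INTERACTION))
```

Under these assumptions the allele is protective (OR = 0.78) at a CP value of 0, while one model unit higher the odds ratio is 0.78 × 1.33 ≈ 1.04, so the direction of the SNP effect depends on the platelet count.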
Recent advance in carrot genomics
USDA-ARS's Scientific Manuscript database
In recent years there has been an effort towards the development of genomic resources in carrot. The number of available sequences for carrot in public databases has increased recently. This has allowed the design of SSR markers, COS markers and a high-throughput SNP assay for genotyping. Additiona...
USDA-ARS's Scientific Manuscript database
The soybean Consensus Map 4.0 facilitated the anchoring of 95.6% of the soybean whole genome sequence developed by the Joint Genome Institute, Department of Energy but only properly oriented 66% of the sequence scaffolds. To find additional single nucleotide polymorphism (SNP) markers for additiona...
Candidate gene association analyses for ketosis resistance in Holsteins.
Kroezen, V; Schenkel, F S; Miglior, F; Baes, C F; Squires, E J
2018-06-01
High-yielding dairy cattle are susceptible to ketosis, a metabolic disease that negatively affects the health, fertility, and milk production of the cow. Interest in breeding for more robust dairy cattle with improved resistance to disease is global; however, genetic evaluations for ketosis would benefit from the additional information provided by genetic markers. Candidate genes that are proposed to have a biological role in the pathogenesis of ketosis were investigated in silico and a custom panel of 998 putative single nucleotide polymorphism (SNP) markers was developed. The objective of this study was to test the associations of these new markers with deregressed estimated breeding values (EBV) for ketosis. A sample of 653 Canadian Holstein cows that had been previously genotyped with a medium-density SNP chip were regenotyped with the custom panel. The EBV for ketosis in first and later lactations were obtained for each animal and deregressed for use as pseudo-phenotypes for association analyses. Results of the mixed inheritance model for single SNP association analyses suggested 15 markers in 6 unique candidate genes were associated with the studied trait. Genes encoding proteins involved in metabolic processes, including the synthesis and degradation of fatty acids and ketone bodies, gluconeogenesis, lipid mobilization, and the citric acid cycle, were identified to contain SNP associated with ketosis resistance. This work confirmed the presence of previously described quantitative trait loci for dairy cattle, suggested novel markers for ketosis-resistance, and provided insight into the underlying biology of this disease. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
A high-density intraspecific SNP linkage map of pigeonpea (Cajanus cajan L. Millsp.)
Mandal, Paritra; Bhutani, Shefali; Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram Pratap; Chaudhary, A. K.; Yadav, Rekha; Gaikwad, K.; Sevanthi, Amitha Mithra; Datta, Subhojit; Raje, Ranjeet S.; Sharma, Tilak R.; Singh, Nagendra Kumar
2017-01-01
Pigeonpea (Cajanus cajan (L.) Millsp.) is a major food legume cultivated in semi-arid tropical regions including the Indian subcontinent, Africa, and Southeast Asia. It is an important source of protein, minerals, and vitamins for nearly 20% of the world population. Due to high carbon sequestration and drought tolerance, pigeonpea is an important crop for the development of climate resilient agriculture and nutritional security. However, pigeonpea productivity has remained low for decades because of limited genetic and genomic resources, and sparse utilization of landraces and wild pigeonpea germplasm. Here, we present a dense intraspecific linkage map of pigeonpea comprising 932 markers that span a total adjusted map length of 1,411.83 cM. The consensus map is based on three different linkage maps that incorporate a large number of single nucleotide polymorphism (SNP) markers derived from next generation sequencing data, using Illumina GoldenGate bead arrays, and genotyping with restriction site associated DNA (RAD) sequencing. The genotyping-by-sequencing enhanced the marker density but was met with limited success due to lack of common markers across the genotypes of mapping population. The integrated map has 547 bead-array SNP, 319 RAD-SNP, and 65 simple sequence repeat (SSR) marker loci. We also show here correspondence between our linkage map and published genome pseudomolecules of pigeonpea. The availability of a high-density linkage map will help improve the anchoring of the pigeonpea genome to its chromosomes and the mapping of genes and quantitative trait loci associated with useful agronomic traits. PMID:28654689
Evolution of the Oat Genetic Road Map: From Tetraploid to Hexaploid
USDA-ARS's Scientific Manuscript database
The development of a genetic linkage map for hexaploid oat (Avena sativa L. 2n = 6 x = 42) that defines all 21 chromosomes has been hindered due to the lack of oat-based markers and the size and complexity of the oat genome. Recent efforts in oat DArT, SSR, and SNP marker development should improve...
Sawayama, Eitaro; Noguchi, Daiki; Nakayama, Kei; Takagi, Motohiro
2018-03-23
We previously reported a body color deformity in juvenile red sea bream, which shows transparency in the juvenile stage because of delayed chromatophore development compared with normal individuals, and this finding suggested a genetic cause based on parentage assessments. To conduct marker-assisted selection to eliminate broodstock inheriting the causative gene, DNA markers associated with the phenotype needed to be developed. We first conducted SNP mining based on AFLP analysis using bulked DNA from normal and transparent individuals. One SNP was identified from a transparent-specific AFLP fragment and was significantly associated with transparent individuals. Two alleles (A/G) were observed at this locus, and the genotype G/G was predominantly observed in the transparent groups (97.1%) collected from several production lots produced from different broodstock populations. A few normal individuals inherited the G/G genotype (5.0%), but the A/A and A/G genotypes were predominantly observed in the normal groups. The homologous region of the SNP was searched for in a medaka genome database, and intron 12 of the Nell2a gene (located on chromosome 6 of the medaka genome) was highly matched. We also mapped the red sea bream Nell2a gene on the previously developed linkage maps, and this gene was mapped on a male linkage group, LG4-M. The newly found SNP was useful in eliminating broodstock possessing the causative gene of the body color transparency observed in the juvenile stage of red sea bream.
Ulloa, Mauricio; Hulse-Kemp, Amanda M; De Santiago, Luis M; Stelly, David M; Burke, John J
2017-01-01
High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement, especially in paleopolyploids with exceptionally complex genomes, e.g., upland cotton (Gossypium hirsutum L., 2n = 52). Three independently developed intraspecific upland mapping populations were analyzed to generate 3 high-density genetic linkage single-nucleotide polymorphism (SNP) maps and a consensus map using the CottonSNP63K array. The populations consisted of a previously reported F2, a recombinant inbred line (RIL), and a reciprocal RIL population, from the "Phytogen 72" and "Stoneville 474" cultivars. The cluster file provided 7417 genotyped SNP markers, resulting in 26 linkage groups corresponding to the 26 chromosomes (c) of the allotetraploid upland cotton (AD)1, which arose from the merging of 2 genomes ("A" Old World and "D" New World). Patterns of chromosome-specific recombination were largely consistent across mapping populations. The high-density genetic consensus map included 7244 SNP markers that spanned 3538 cM and comprised 3824 SNP bins, of which 1783 and 2041 were in the At and Dt subgenomes with 1825 and 1713 cM map lengths, respectively. Subgenome average distances were nearly identical, indicating that subgenomic differences in bin number arose due to the high numbers of SNPs on the Dt subgenome. Examination of expected recombination frequency or crossovers (COs) on the chromosomes within each population of the 2 subgenomes revealed that COs were also not affected by the SNPs or SNP bin number in these subgenomes. Comparative alignment analyses identified historical ancestral At-subgenomic translocations of c02 and c03, as well as of c04 and c05. The consensus map SNP sequences aligned with high congruency to the NBI assembly of Gossypium hirsutum.
However, the genomic comparisons revealed evidence of additional unconfirmed possible duplications, inversions, and translocations, as well as unbalanced SNP sequence homology, SNP sequence/loci genomic dominance, or homeolog loci bias of the upland tetraploid At and Dt subgenomes. The alignments indicated that 364 SNP-associated, previously unintegrated scaffolds can be placed in pseudochromosomes of the NBI G. hirsutum assembly. This is the first intraspecific SNP genetic linkage consensus map assembled in G. hirsutum with a core of reproducible Mendelian SNP markers assayed on different populations, and it provides further knowledge of the chromosome arrangement of genic and nongenic SNPs. Together, the consensus map and RIL populations provide a synergistically useful platform for localizing and identifying agronomically important loci for improvement of the cotton crop.
Van Inghelandt, Delphine; Melchinger, Albrecht E; Lebreton, Claude; Stich, Benjamin
2010-05-01
Information about the genetic diversity and population structure in elite breeding material is of fundamental importance for the improvement of crops. The objectives of our study were to (a) examine the population structure and the genetic diversity in elite maize germplasm based on simple sequence repeat (SSR) markers, (b) compare these results with those obtained from single nucleotide polymorphism (SNP) markers, and (c) compare the coancestry coefficient calculated from pedigree records with genetic distance estimates calculated from SSR and SNP markers. Our study was based on 1,537 elite maize inbred lines genotyped with 359 SSR and 8,244 SNP markers. The average number of alleles per locus, of group specific alleles, and the gene diversity (D) were higher for SSRs than for SNPs. Modified Roger's distance (MRD) estimates and membership probabilities of the STRUCTURE matrices were higher for SSR than for SNP markers but the germplasm organization in four heterotic pools was consistent with STRUCTURE results based on SSRs and SNPs. MRD estimates calculated for the two marker systems were highly correlated (0.87). Our results suggested that the same conclusions regarding the structure and the diversity of heterotic pools could be drawn from both markers types. Furthermore, although our results suggested that the ratio of the number of SSRs and SNPs required to obtain MRD or D estimates with similar precision is not constant across the various precision levels, we propose that between 7 and 11 times more SNPs than SSRs should be used for analyzing population structure and genetic diversity.
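The modified Rogers' distance (MRD) used above to compare SSR and SNP markers can be illustrated with a minimal sketch. It assumes the commonly cited form, the square root of the summed squared allele-frequency differences divided by twice the number of loci; the abstract does not state the exact formula or software the authors used, and the function name and toy genotypes below are hypothetical.

```python
import math


def modified_rogers_distance(freqs_a, freqs_b):
    """Modified Rogers' distance between two entries.

    freqs_a, freqs_b: one dict per locus, mapping allele -> frequency.
    For a fully inbred line each locus has a single allele at frequency 1.0.
    Assumed formula: sqrt( sum_loci sum_alleles (p - q)^2 / (2 * n_loci) ).
    """
    if len(freqs_a) != len(freqs_b) or not freqs_a:
        raise ValueError("both entries need the same, non-zero number of loci")
    total = 0.0
    for locus_a, locus_b in zip(freqs_a, freqs_b):
        alleles = set(locus_a) | set(locus_b)
        total += sum(
            (locus_a.get(al, 0.0) - locus_b.get(al, 0.0)) ** 2 for al in alleles
        )
    return math.sqrt(total / (2 * len(freqs_a)))


# Hypothetical inbred lines scored at four bi-allelic SNP loci.
line_1 = [{"A": 1.0}, {"C": 1.0}, {"G": 1.0}, {"T": 1.0}]
line_2 = [{"A": 1.0}, {"C": 1.0}, {"G": 1.0}, {"C": 1.0}]

mrd = modified_rogers_distance(line_1, line_2)  # lines differ at 1 of 4 loci
```

Because the function sums over whatever alleles are present at each locus, the same code accepts multi-allelic SSR loci, which is one reason a distance of this kind can be compared across the two marker systems.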
Curk, Franck; Ancillo, Gema; Ollitrault, Frédérique; Perrier, Xavier; Jacquemoud-Collet, Jean-Pierre; Garcia-Lor, Andres; Navarro, Luis; Ollitrault, Patrick
2015-01-01
Most cultivated Citrus species originated from interspecific hybridisation between four ancestral taxa (C. reticulata, C. maxima, C. medica, and C. micrantha) with limited further interspecific recombination due to vegetative propagation. This evolution resulted in admixture genomes with frequent interspecific heterozygosity. Moreover, a major part of the phenotypic diversity of edible citrus results from the initial differentiation between these taxa. Deciphering the phylogenomic structure of citrus germplasm is therefore essential for an efficient utilization of citrus biodiversity in breeding schemes. The objective of this work was to develop a set of species-diagnostic single nucleotide polymorphism (SNP) markers for the four Citrus ancestral taxa covering the nine chromosomes, and to use these markers to infer the phylogenomic structure of secondary species and modern cultivars. Species-diagnostic SNPs were mined from 454 amplicon sequencing of 57 gene fragments from 26 genotypes of the four basic taxa. Of the 1,053 SNPs mined from 28,507 kb sequence, 273 were found to be highly diagnostic for a single basic taxon. Species-diagnostic SNP markers (105) were used to analyse the admixture structure of varieties and rootstocks. This revealed C. maxima introgressions in most of the old and in all recent selections of mandarins, and suggested that C. reticulata × C. maxima reticulation and introgression processes were important in edible mandarin domestication. The large range of phylogenomic constitutions between C. reticulata and C. maxima revealed in mandarins, tangelos, tangors, sweet oranges, sour oranges, grapefruits, and orangelos is favourable for genetic association studies based on phylogenomic structures of the germplasm. Inferred admixture structures were in agreement with previous hypotheses regarding the origin of several secondary species and also revealed the probable origin of several acid citrus varieties. 
The developed species-diagnostic SNP marker set will be useful for systematic estimation of admixture structure of citrus germplasm and for diverse genetic studies. PMID:25973611
Yang, Huaan; Jian, Jianbo; Li, Xuan; Renshaw, Daniel; Clements, Jonathan; Sweetingham, Mark W; Tan, Cong; Li, Chengdao
2015-09-02
Molecular marker-assisted breeding provides an efficient tool to develop improved crop varieties. A major challenge for the broad application of markers in marker-assisted selection is that the marker phenotypes must match plant phenotypes in a wide range of breeding germplasm. In this study, we used the legume crop species Lupinus angustifolius (lupin) to demonstrate the utility of whole genome sequencing and re-sequencing for the development of diagnostic markers for molecular plant breeding. Nine lupin cultivars released in Australia from 1973 to 2007 were subjected to whole genome re-sequencing. The re-sequencing data together with the reference genome sequence data were used in marker development, which revealed 180,596 to 795,735 SNP markers from pairwise comparisons among the cultivars. A total of 207,887 markers were anchored on the lupin genetic linkage map. Marker mining yielded an average of 387 SNP markers and 87 InDel markers for each of the 24 genome sequence assembly scaffolds bearing markers linked to 11 genes of agronomic interest. Using the R gene PhtjR conferring resistance to phomopsis stem blight disease as a test case, we discovered 17 candidate diagnostic markers by genotyping and selecting markers on a genetic linkage map. A further 243 candidate diagnostic markers were discovered by marker mining on a scaffold bearing non-diagnostic markers linked to the PhtjR gene. Nine of the ten tested candidate diagnostic markers were confirmed as truly diagnostic on a broad range of commercial cultivars. Markers developed using these strategies meet the requirements for broad application in molecular plant breeding. We demonstrated that low-cost genome sequencing and re-sequencing data were sufficient and very effective in the development of diagnostic markers for marker-assisted selection. The strategies used in this study may be applied to any trait or plant species.
Whole genome sequencing and re-sequencing provides a powerful tool to overcome current limitations in molecular plant breeding, which will enable plant breeders to precisely pyramid favourable genes to develop super crop varieties to meet future food demands.
Foresman, Bradley J.; Oliver, Rebekah E.; Jackson, Eric W.; Chao, Shiaoman; Arruda, Marcio P.; Kolb, Frederic L.
2016-01-01
Barley yellow dwarf viruses (BYDVs) are responsible for the disease barley yellow dwarf (BYD) and affect many cereals including oat (Avena sativa L.). Until recently, the molecular marker technology in oat has not allowed for many marker-trait association studies to determine the genetic mechanisms for tolerance. A genome-wide association study (GWAS) was performed on 428 spring oat lines using a recently developed high-density oat single nucleotide polymorphism (SNP) array as well as a SNP-based consensus map. Marker-trait associations were performed using a Q-K mixed model approach to control for population structure and relatedness. Six significant SNP-trait associations representing two QTL were found on chromosomes 3C (Mrg17) and 18D (Mrg04). This is the first report of BYDV tolerance QTL on chromosome 3C (Mrg17) and 18D (Mrg04). Haplotypes using the two QTL were evaluated and distinct classes for tolerance were identified based on the number of favorable alleles. A large number of lines carrying both favorable alleles were observed in the panel. PMID:27175781
Qi, L L; Foley, M E; Cai, X W; Gulya, T J
2016-04-01
A novel downy mildew resistance gene, Pl(18), was introgressed from wild Helianthus argophyllus into cultivated sunflower and genetically mapped to linkage group 2 of the sunflower genome. The new germplasm, HA-DM1, carrying Pl(18) has been released to the public. Sunflower downy mildew (DM) is considered to be the most destructive foliar disease that has spread to every major sunflower-growing country of the world, except Australia. A new dominant downy mildew resistance gene (Pl 18) transferred from wild Helianthus argophyllus (PI 494573) into cultivated sunflower was mapped to linkage group (LG) 2 of the sunflower genome using bulked segregant analysis with 869 simple sequence repeat (SSR) markers. Phenotyping 142 BC1F2:3 families derived from the cross of HA 89 and H. argophyllus confirmed the single gene inheritance of resistance. Since no other Pl gene has been mapped to LG2, this gene was novel and designated as Pl (18). SSR markers CRT214 and ORS203 flanked Pl(18) at a genetic distance of 1.1 and 0.4 cM, respectively. Forty-six single nucleotide polymorphism (SNP) markers that cover the Pl(18) region were surveyed for saturation mapping of the region. Six co-segregating SNP markers were 1.2 cM distal to Pl(18), and another four co-segregating SNP markers were 0.9 cM proximal to Pl(18). The new BC2F4-derived germplasm, HA-DM1, carrying Pl(18) has been released to the public. This new line is highly resistant to all Plasmopara halstedii races identified in the USA providing breeders with an effective new source of resistance against downy mildew in sunflower. The molecular markers that were developed will be especially useful in marker-assisted selection and pyramiding of Pl resistance genes because of their close proximity to the gene and the availability of high-throughput SNP detection assays.
On marker-based parentage verification via non-linear optimization.
Boerner, Vinzent
2017-06-15
Parentage verification by molecular markers is mainly based on short tandem repeat markers. Single nucleotide polymorphisms (SNPs) as bi-allelic markers have become the markers of choice for genotyping projects. Thus, the subsequent step is to use SNP genotypes for parentage verification as well. Recent developments of algorithms such as evaluating opposing homozygous SNP genotypes have drawbacks, for example the inability of rejecting all animals of a sample of potential parents. This paper describes an algorithm for parentage verification by constrained regression which overcomes the latter limitation and proves to be very fast and accurate even when the number of SNPs is as low as 50. The algorithm was tested on a sample of 14,816 animals with 50, 100 and 500 SNP genotypes randomly selected from 40k genotypes. The samples of putative parents of these animals contained either five random animals, or four random animals and the true sire. Parentage assignment was performed by ranking of regression coefficients, or by setting a minimum threshold for regression coefficients. The assignment quality was evaluated by the power of assignment (P[Formula: see text]) and the power of exclusion (P[Formula: see text]). If the sample of putative parents contained the true sire and parentage was assigned by coefficient ranking, P[Formula: see text] and P[Formula: see text] were both higher than 0.99 for the 500 and 100 SNP genotypes, and higher than 0.98 for the 50 SNP genotypes. When parentage was assigned by a coefficient threshold, P[Formula: see text] was higher than 0.99 regardless of the number of SNPs, but P[Formula: see text] decreased from 0.99 (500 SNPs) to 0.97 (100 SNPs) and 0.92 (50 SNPs). If the sample of putative parents did not contain the true sire and parentage was rejected using a coefficient threshold, the algorithm achieved a P[Formula: see text] of 1 (500 SNPs), 0.99 (100 SNPs) and 0.97 (50 SNPs). 
The algorithm described here is easy to implement, fast and accurate, and is able to assign parentage using genomic marker data with a size as low as 50 SNPs.
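For contrast with the constrained-regression approach, the simpler opposing-homozygotes check that the abstract mentions as an existing method can be sketched as follows. This is only an illustration of that baseline, not the paper's algorithm; the 0/1/2 genotype coding, the function names, and the toy genotypes are assumptions.

```python
def count_opposing_homozygotes(offspring, candidate):
    """Count loci where offspring and candidate carry opposite homozygotes.

    Genotypes are assumed to be coded as the number of copies of one allele
    (0, 1 or 2); None marks a missing call and is skipped.
    A true parent should show (near) zero such conflicts.
    """
    return sum(
        1
        for o, c in zip(offspring, candidate)
        if o is not None and c is not None and {o, c} == {0, 2}
    )


def rank_candidates(offspring, candidates):
    """Return (name, conflicts) pairs sorted from fewest to most conflicts."""
    counts = {
        name: count_opposing_homozygotes(offspring, genotype)
        for name, genotype in candidates.items()
    }
    return sorted(counts.items(), key=lambda item: item[1])


# Hypothetical six-SNP genotypes.
offspring = [0, 1, 2, 2, 0, 1]
candidates = {
    "sire_a": [0, 1, 2, 1, 1, 2],
    "sire_b": [2, 1, 0, 2, 2, 1],
}

ranking = rank_candidates(offspring, candidates)
```

Ranking alone always returns a best candidate, so it cannot by itself reject every animal in the sample; a conflict threshold would have to be added for that, which relates to the drawback the abstract attributes to this family of methods.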
Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao
Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos
2015-01-01
Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. PMID:26070980
Van Inghelandt, Delphine; Melchinger, Albrecht E.; Lebreton, Claude
2010-01-01
Information about the genetic diversity and population structure in elite breeding material is of fundamental importance for the improvement of crops. The objectives of our study were to (a) examine the population structure and the genetic diversity in elite maize germplasm based on simple sequence repeat (SSR) markers, (b) compare these results with those obtained from single nucleotide polymorphism (SNP) markers, and (c) compare the coancestry coefficient calculated from pedigree records with genetic distance estimates calculated from SSR and SNP markers. Our study was based on 1,537 elite maize inbred lines genotyped with 359 SSR and 8,244 SNP markers. The average number of alleles per locus, of group specific alleles, and the gene diversity (D) were higher for SSRs than for SNPs. Modified Roger’s distance (MRD) estimates and membership probabilities of the STRUCTURE matrices were higher for SSR than for SNP markers but the germplasm organization in four heterotic pools was consistent with STRUCTURE results based on SSRs and SNPs. MRD estimates calculated for the two marker systems were highly correlated (0.87). Our results suggested that the same conclusions regarding the structure and the diversity of heterotic pools could be drawn from both markers types. Furthermore, although our results suggested that the ratio of the number of SSRs and SNPs required to obtain MRD or D estimates with similar precision is not constant across the various precision levels, we propose that between 7 and 11 times more SNPs than SSRs should be used for analyzing population structure and genetic diversity. Electronic supplementary material The online version of this article (doi:10.1007/s00122-009-1256-2) contains supplementary material, which is available to authorized users. PMID:20063144
Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira
2014-01-01
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg., which is native to the Amazon rainforest, is the primary source of natural rubber. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular markers was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection.
Kafkas, Salih; Khodaeiaminjan, Mortaza; Güney, Murat; Kafkas, Ebru
2015-02-18
Pistachio (Pistacia vera L.) is a dioecious species that has a long juvenility period. Therefore, development of marker-assisted selection (MAS) techniques would greatly facilitate pistachio cultivar-breeding programs. The sex determination mechanism is presently unknown in pistachio. The generation of sex-linked markers is likely to reduce time, labor, and costs associated with breeding programs, and will help to clarify the sex determination system in pistachio. Restriction site-associated DNA (RAD) markers were used to identify sex-linked markers and to elucidate the sex determination system in pistachio. Eight male and eight female F1 progenies from a Pistacia vera L. Siirt × Bağyolu cross, along with the parents, were subjected to RAD sequencing in two lanes of a Hi-Seq 2000 sequencing platform. This generated 449 million reads, comprising approximately 37.7 Gb of sequences. There were 33,757 polymorphic single nucleotide polymorphism (SNP) loci between the parents. Thirty-eight of these, from 28 RAD reads, were detected as putative sex-associated loci in pistachio. Validation was performed by SNaPshot analysis in 42 mature F1 progenies and in 124 cultivars and genotypes in a germplasm collection. Eight loci could distinguish sex with 100% accuracy in pistachio. To ascertain cost-effective application of markers in a breeding program, high-resolution melting (HRM) analysis was performed; four markers were found to perfectly separate sexes in pistachio. Because of the female heterogamety in all candidate SNP loci, we report for the first time that pistachio has a ZZ/ZW sex determination system. As the reported female-to-male segregation ratio is 1:1 in all known segregating populations and there is no previous report of super-female genotypes or female heteromorphic chromosomes in pistachio, it appears that the WW genotype is not viable. Sex-linked SNP markers were identified and validated in a large germplasm and proved their suitability for MAS in pistachio. 
HRM analysis successfully validated the sex-linked markers for MAS. For the first time in dioecious pistachio, a female heterogamety ZW/ZZ sex determination system is suggested.
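The inference of a ZZ/ZW system rests on female heterogamety at the candidate loci. A minimal sketch of how such a pattern could be screened is shown below; the abstract does not give the authors' filtering criteria, so the rule used here (every female heterozygous, every male homozygous for the same allele) and all locus names and genotypes are assumptions for illustration.

```python
def is_heterozygous(genotype):
    """Genotype is a two-character string such as 'AG' or 'AA'."""
    return genotype[0] != genotype[1]


def fits_zw_pattern(female_genotypes, male_genotypes):
    """True if a locus shows the pattern expected under female heterogamety.

    Assumed rule: all females heterozygous (ZW-like) and all males
    homozygous for one and the same allele (ZZ-like).
    """
    females_het = all(is_heterozygous(g) for g in female_genotypes)
    males_identical = len(set(male_genotypes)) == 1
    males_hom = all(not is_heterozygous(g) for g in male_genotypes)
    return bool(female_genotypes) and females_het and males_identical and males_hom


# Hypothetical calls for two loci in four females and four males.
loci = {
    "locus_1": (["AG", "AG", "AG", "AG"], ["AA", "AA", "AA", "AA"]),
    "locus_2": (["AG", "AA", "AG", "GG"], ["AA", "AG", "AA", "AA"]),
}

sex_linked = [name for name, (females, males) in loci.items()
              if fits_zw_pattern(females, males)]
```

Under an XX/XY system the roles would be reversed, with males heterozygous and females homozygous, so swapping the two arguments gives the corresponding screen for male heterogamety.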
N’Diaye, Amidou; Haile, Jemanesh K.; Fowler, D. Brian; Ammar, Karim; Pozniak, Curtis J.
2017-01-01
Advances in sequencing and genotyping methods have enabled cost-effective production of high-throughput single nucleotide polymorphism (SNP) markers, making them the markers of choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called ‘large p, small n’ problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that an excess of co-segregating markers can lead to map expansion, but has little effect on marker order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example, in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers, respectively.
With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase making map expansion unavoidable. Therefore, we suggest developers improve linkage mapping algorithms for efficient analysis of high-throughput data. This study outlines a practical strategy to estimate the IF due to the proportion of co-segregating markers and outlines a method to scale the length of the map accordingly. PMID:28878789
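The proportion of co-segregating markers is the input the study relates to map inflation. A minimal sketch of how that proportion might be computed from a genotype matrix follows. It assumes complete data and treats markers with identical genotype calls across all individuals as co-segregating, counting every member of such a group; the abstract does not say whether the authors count all members or only the redundant ones, and the function name and toy data are hypothetical.

```python
from collections import defaultdict


def cosegregating_fraction(genotypes):
    """Fraction of markers that share their genotype pattern with another marker.

    genotypes: dict mapping marker name -> sequence of calls, one per
    individual (same individual order for every marker, no missing data).
    Returns (fraction, list of co-segregating marker groups).
    """
    if not genotypes:
        raise ValueError("no markers supplied")
    groups = defaultdict(list)
    for marker, calls in genotypes.items():
        groups[tuple(calls)].append(marker)
    shared = [sorted(members) for members in groups.values() if len(members) > 1]
    n_shared = sum(len(members) for members in shared)
    return n_shared / len(genotypes), shared


# Hypothetical calls ('A'/'B' parental alleles) for five markers in six lines.
calls = {
    "m1": "AABBAB",
    "m2": "AABBAB",
    "m3": "AABBAB",
    "m4": "ABBBAB",
    "m5": "BBABAA",
}

fraction, clusters = cosegregating_fraction(calls)
```

With real data, missing calls and genotyping errors would blur exact identity, so a tolerance on the number of mismatches would likely be needed before markers are grouped.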
2015-01-01
Background Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e., significant differences of personal genomes of patients from the reference human genome) can help physicians to improve treatment. Clinical research can validate SNP markers via genotyping of patients and demonstration that SNP alleles are significantly more frequent in patients than in healthy people. The search for biomedical SNP markers of interest can be accelerated by computer-based analysis of hundreds of millions of SNPs in the 1000 Genomes project because of selection of the most meaningful candidate SNP markers and elimination of neutral SNPs. Results We cross-validated the output of two computer-based methods: DNA sequence analysis using Web service SNP_TATA_Comparator and keyword search for articles on comorbidities of obesity. Near the sites binding to TATA-binding protein (TBP) in human gene promoters, we found 22 obesity-related candidate SNP markers, including rs10895068 (male breast cancer in obesity); rs35036378 (reduced risk of obesity after ovariectomy); rs201739205 (reduced risk of obesity-related cancers due to weight loss by diet/exercise in obese postmenopausal women); rs183433761 (obesity resistance during a high-fat diet); rs367732974 and rs549591993 (both: cardiovascular complications in obese patients with type 2 diabetes mellitus); rs200487063 and rs34104384 (both: obesity-caused hypertension); rs35518301, rs72661131, and rs562962093 (all: obesity); and rs397509430, rs33980857, rs34598529, rs33931746, rs33981098, rs34500389, rs63750953, rs281864525, rs35518301, and rs34166473 (all: chronic inflammation in comorbidities of obesity). 
Using an electrophoretic mobility shift assay under nonequilibrium conditions, we empirically validated the statistical significance (α < 0.00025) of the differences in TBP affinity values between the minor and ancestral alleles of 4 out of the 22 SNPs: rs200487063, rs201381696, rs34104384, and rs183433761. We also measured half-life (t1/2), Gibbs free energy change (ΔG), and the association and dissociation rate constants, ka and kd, of the TBP-DNA complex for these SNPs. Conclusions Validation of the 22 candidate SNP markers by proper clinical protocols appears to have a strong rationale and may advance postgenomic predictive preventive personalized medicine. PMID:26694100
Liu, Y; Yan, L; Li, Z; Huang, W-F; Pokhrel, S; Liu, X; Su, S
2016-06-01
Chalkbrood is a disease affecting honey bees that seriously impairs brood growth and productivity of diseased colonies. Although honey bees can develop chalkbrood resistance naturally, the details underlying the mechanisms of resistance are not fully understood, and no easy method is currently available for selecting and breeding resistant bees. Finding the genes involved in the development of resistance and identifying single nucleotide polymorphisms (SNPs) that can be used as molecular markers of resistance is therefore a high priority. We conducted genome resequencing to compare resistant (Res) and susceptible (Sus) larvae that were selected following in vitro chalkbrood inoculation. Twelve genomic libraries, including 14.4 Gb of sequence data, were analysed using SNP-finding algorithms. Unique SNPs derived from chromosomes 2 and 11 were analysed in this study. SNPs from resistant individuals were confirmed by PCR and Sanger sequencing using in vitro reared larvae and resistant colonies. We found strong support for an association between the C allele at SNP C2587245T and chalkbrood resistance. SNP C2587245T may be useful as a genetic marker for the selection of chalkbrood resistance and high royal jelly production honey bee lines, thereby helping to minimize the negative effects of chalkbrood on managed honey bees. © 2016 The Royal Entomological Society.
High-throughput SNP genotyping for breeding applications in rice using the BeadXpress platform
USDA-ARS's Scientific Manuscript database
Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...
Molecular genetic characterization of lesquerella new industrial crop using DArTseq markers
USDA-ARS's Scientific Manuscript database
DArTseq, a new SNP-based marker platform, was developed and used to analyze the genetic diversity of the US germplasm collection of lesquerella. Lesquerella is a new oilseed crop in the Brassica family found native in the American Southwest. The potential of the species as a domestic source of indu...
USDA-ARS's Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers of susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpes virus 3 (CyHV-3) is highly contagious and virulent in common carp. With the aim to investigate the gene...
USDA-ARS's Scientific Manuscript database
Knowledge of germplasm diversity and relationships among elite breeding materials is fundamentally important in crop improvement. We genotyped 450 maize lines developed and/or widely used by CIMMYT breeding programs both in Kenya and Zimbabwe using 1065 SNP markers to (i) investigate population stru...
Nakatochi, Masahiro; Ushida, Yasunori; Yasuda, Yoshinari; Yoshida, Yasuko; Kawai, Shun; Kato, Ryuji; Nakashima, Toru; Iwata, Masamitsu; Kuwatsuka, Yachiyo; Ando, Masahiko; Hamajima, Nobuyuki; Kondo, Takaaki; Oda, Hiroaki; Hayashi, Mutsuharu; Kato, Sawako; Yamaguchi, Makoto; Maruyama, Shoichi; Matsuo, Seiichi; Honda, Hiroyuki
2015-01-01
Although many single nucleotide polymorphisms (SNPs) have been identified as associated with metabolic syndrome (MetS), simply adding SNPs to clinical risk markers has only slightly improved the ability to predict future MetS. To improve this ability, combinational effects such as SNP-SNP interaction, SNP-environment interaction, and SNP-clinical parameter (SNP × CP) interaction should also be considered. We performed a case-control study to explore novel SNP × CP interactions as risk markers for MetS based on health check-up data of Japanese male employees. We selected 99 SNPs that were previously reported to be associated with MetS and components of MetS; subsequently, we genotyped these SNPs in 360 cases and 1983 control subjects. First, we performed logistic regression analyses to assess the association of each SNP with MetS. Of these, five SNPs were significantly associated with MetS (P < 0.05): LRP2 rs2544390, rs1800592 between UCP1 and TBC1D9, APOA5 rs662799, VWF rs7965413, and rs1411766 between MYO16 and IRS2. Furthermore, we performed multiple logistic regression analyses, including an SNP term, a CP term, and an SNP × CP interaction term for each CP and SNP that was significantly associated with MetS. We identified a novel SNP × CP interaction between rs7965413 and platelet count that was significantly associated with MetS [SNP term: odds ratio (OR) = 0.78, P = 0.004; SNP × CP interaction term: OR = 1.33, P = 0.001]. This association of the SNP × CP interaction with MetS remained nominally significant in multiple logistic regression analysis after adjustment for either the number of MetS components or MetS components excluding obesity. Our results reveal new insight into platelet count as a risk marker for MetS. PMID:25646961
Chao, Shiaoman; Singh, Ravi P.; Sorrells, Mark E.
2017-01-01
Wheat stem rust (Puccinia graminis f. sp. tritici Eriks. and E. Henn.) is one of the most destructive diseases world-wide. Races belonging to Ug99 (or TTKSK) continue to cause crop losses in East Africa and threaten global wheat production. Developing and deploying wheat varieties with multiple race-specific genes or complex adult plant resistance is necessary to achieve durability. In the present study, we applied genome-wide association studies (GWAS) for identifying loci associated with the Ug99 stem rust resistance (SR) in a panel of wheat lines developed at the International Maize and Wheat Improvement Center (CIMMYT). Genotyping was carried out using the wheat 9K iSelect single nucleotide polymorphism (SNP) chip. Phenotyping was done in the field in Kenya by infection of Puccinia graminis f. sp. tritici race TTKST, the Sr24-virulent variant of Ug99. Marker-trait association identified 12 SNP markers significantly associated with resistance. Among them, 7 were mapped on five chromosomes. Markers located on chromosomes 4A and 4B overlapped with the location of the Ug99 resistance genes SrND643 and Sr37, respectively. Markers identified on 7DL were collocated with Sr25. Additional significant markers were located in the regions where no Sr gene has been reported. The chromosome location for five of the SNP markers was unknown. A BLASTN search of the NCBI database using the flanking sequences of the SNPs associated with Ug99 resistance revealed that several markers were linked to plant disease resistance analogues, while others were linked to regulatory factors or metabolic enzymes. A KASP (Kompetitive Allele Specific PCR) assay was used for validating six marker loci linked to genes with resistance to Ug99. Of those, four co-segregated with the Sr25-pathotypes while the rest identified unknown resistance genes. With further investigation, these markers can be used for marker-assisted selection in breeding for Ug99 stem rust resistance in wheat. PMID:28241006
Keith R. Merrill; Craig E. Coleman; Susan E. Meyer; Elizabeth A. Leger; Katherine A. Collins
2016-01-01
Premise of the study: Bromus tectorum (Poaceae) is an annual grass species that is invasive in many areas of the world but most especially in the U.S. Intermountain West. Single-nucleotide polymorphism (SNP) markers were developed for use in investigating the geospatial and ecological diversity of B. tectorum in the Intermountain West to better understand the...
Manaffar, R; Zare, S; Agh, N; Abdolahzadeh, N; Soltanian, S; Sorgeloos, P; Bossier, P; Van Stappen, G
2011-01-01
In order to find a marker for differentiating between a bisexual and a parthenogenetic Artemia strain, Exon-7 of the Na/K ATPase α(1) subunit gene was screened by the RFLP technique. Digestion with the Tru1I enzyme revealed a constant synonymous SNP (single nucleotide polymorphism) that was consistent with these two types of Artemia. This SNP was identified as an accurate molecular marker for discrimination between bisexual and parthenogenetic Artemia. According to Nei's genetic distance (1973), the lowest genetic distance was found between individuals from Artemia urmiana Günther 1890 and parthenogenetic populations, making the described marker the first marker to easily distinguish between these two co-occurring species. © 2010 Blackwell Publishing Ltd.
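The RFLP screen described above rests on a simple idea: a SNP that creates or destroys a restriction site changes the fragment pattern after digestion. The Python sketch below illustrates this for Tru1I, which recognizes TTAA and cuts after the first T. The two amplicon sequences are hypothetical 20-bp examples, not sequences from the Na/K ATPase gene, and the function names are illustrative only.

```python
def cut_sites(seq, site="TTAA"):
    """Return 0-based start positions of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    width = len(site)
    return [i for i in range(len(seq) - width + 1) if seq[i:i + width] == site]

def fragment_lengths(seq, site="TTAA", cut_offset=1):
    """Fragment lengths after complete digestion (Tru1I cuts T^TAA, offset 1)."""
    cuts = [i + cut_offset for i in cut_sites(seq, site)]
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]

# Hypothetical amplicons differing by one base (G vs A at index 9).
allele_1 = "ACGTACGTTGAACGTACGTA"  # no TTAA site: amplicon stays uncut
allele_2 = "ACGTACGTTAAACGTACGTA"  # SNP creates TTAA: amplicon is cut once

print(fragment_lengths(allele_1))
print(fragment_lengths(allele_2))
```

On a gel, the uncut allele would show one band and the cut allele two, which is what makes such a SNP usable as a diagnostic marker.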
Transcriptome-enabled marker discovery and mapping of plastochron-related genes in Petunia spp.
Guo, Yufang; Wiegert-Rininger, Krystle E; Vallejo, Veronica A; Barry, Cornelius S; Warner, Ryan M
2015-09-24
Petunia (Petunia × hybrida), derived from a hybrid between P. axillaris and P. integrifolia, is one of the most economically important bedding plant crops and Petunia spp. serve as model systems for investigating the mechanisms underlying diverse mating systems and pollination syndromes. In addition, we have previously described genetic variation and quantitative trait loci (QTL) related to petunia development rate and morphology, which represent important breeding targets for the floriculture industry to improve crop production and performance. Despite the importance of petunia as a crop, the floriculture industry has been slow to adopt marker assisted selection to facilitate breeding strategies and there remains a limited availability of sequences and molecular markers from the genus compared to other economically important members of the Solanaceae family such as tomato, potato and pepper. Here we report the de novo assembly, annotation and characterization of transcriptomes from P. axillaris, P. exserta and P. integrifolia. Each transcriptome assembly was derived from five tissue libraries (callus, 3-week old seedlings, shoot apices, flowers of mixed developmental stages, and trichomes). A total of 74,573, 54,913, and 104,739 assembled transcripts were recovered from P. axillaris, P. exserta and P. integrifolia, respectively and following removal of multiple isoforms, 32,994 P. axillaris, 30,225 P. exserta, and 33,540 P. integrifolia high quality representative transcripts were extracted for annotation and expression analysis. The transcriptome data was mined for single nucleotide polymorphisms (SNP) and simple sequence repeat (SSR) markers, yielding 89,007 high quality SNPs and 2949 SSRs, respectively. 15,701 SNPs were computationally converted into user-friendly cleaved amplified polymorphic sequence (CAPS) markers and a subset of SNP and CAPS markers were experimentally verified. CAPS markers developed from plastochron-related homologous transcripts from P. axillaris were mapped in an interspecific Petunia population and evaluated for co-localization with QTL for development rate. The high quality of the three Petunia spp. transcriptomes coupled with the utility of the SNP data will serve as a resource for further exploration of genetic diversity within the genus and will facilitate efforts to develop genetic and physical maps to aid the identification of QTL associated with traits of interest.
Lu, Xia; Luan, Sheng; Hu, Long Yang; Mao, Yong; Tao, Ye; Zhong, Sheng Ping; Kong, Jie
2016-06-01
The Kuruma prawn, Marsupenaeus japonicus, is one of the most promising marine invertebrates in the industry in Asia, Europe and Australia. However, the increasing global temperatures result in considerable economic losses in M. japonicus farming. In the present study, to select genetically improved animals for the sustainable development of the Kuruma prawn industry, a high-resolution genetic linkage map and quantitative trait locus (QTL) identification were performed using the RAD technology. The maternal map contained 5849 SNP markers and spanned 3127.23 cM, with an average marker interval of 0.535 cM. Instead, the paternal map contained 3927 SNP markers and spanned 3326.19 cM, with an average marker interval of 0.847 cM. The consensus map contained 9289 SNP markers and spanned 3610.90 cM, with an average marker interval of 0.388 cM and coverage of 99.06 % of the genome. The markers were grouped into 41 linkage groups in the maps. Significantly, negative correlation was detected between high-temperature tolerance (UTT) and body weight (BW). The QTL mapping revealed 129 significant QTL loci for UTT and four significant QTL loci for BW at the genome-wide significance threshold. Among these QTLs, 129 overlapped with linked SNPs, and the remaining four were located in regions between contiguous SNPs. They explained the total phenotypic variance ranging from 8.9 to 12.4 %. Because of a significantly negative correlation between growth and high-temperature tolerance, we demonstrate that this high-resolution linkage map and QTLs would be useful for further marker-assisted selection in the genetic improvement of M. japonicus.
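The average marker intervals quoted above can be checked with simple arithmetic. The Python sketch below assumes the interval is the map length divided by the number of markers; the abstract does not state the formula, so this is an assumption. With it, the maternal and paternal figures are reproduced, and the consensus map comes out close to the reported 0.388 cM, differing only in the last digit.

```python
def average_interval(map_length_cm, n_markers):
    """Average spacing in cM, assuming spacing = map length / marker count."""
    return map_length_cm / n_markers

# (map length in cM, number of SNP markers) as quoted in the abstract.
maps = {
    "maternal": (3127.23, 5849),
    "paternal": (3326.19, 3927),
    "consensus": (3610.90, 9289),
}

for name, (length_cm, n_markers) in maps.items():
    print(f"{name}: {average_interval(length_cm, n_markers):.4f} cM per marker")
```

A stricter calculation would subtract one marker per linkage group before dividing, which gives slightly larger intervals; the reported values are more consistent with the simple ratio used here.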
2011-01-01
Background In a previously reported genome-wide association study based on a high-density bovine SNP genotyping array, 8 SNP were nominally associated (P ≤ 0.003) with average daily gain (ADG) and 3 of these were also associated (P ≤ 0.002) with average daily feed intake (ADFI) in a population of crossbred beef cattle. The SNP were clustered in a 570 kb region around 38 Mb on the draft sequence of bovine chromosome 6 (BTA6), an interval containing several positional and functional candidate genes including the bovine LAP3, NCAPG, and LCORL genes. The goal of the present study was to develop and examine additional markers in this region to optimize the ability to distinguish favorable alleles, with potential to identify functional variation. Results Animals from the original study were genotyped for 47 SNP within or near the gene boundaries of the three candidate genes. Sixteen markers in the NCAPG-LCORL locus displayed significant association with both ADFI and ADG even after stringent correction for multiple testing (P ≤ 005). These markers were evaluated for their effects on meat and carcass traits. The alleles associated with higher ADFI and ADG were also associated with higher hot carcass weight (HCW) and ribeye area (REA), and lower adjusted fat thickness (AFT). A reduced set of markers was genotyped on a separate, crossbred population including genetic contributions from 14 beef cattle breeds. Two of the markers located within the LCORL gene locus remained significant for ADG (P ≤ 0.04). Conclusions Several markers within the NCAPG-LCORL locus were significantly associated with feed intake and body weight gain phenotypes. These markers were also associated with HCW, REA and AFT suggesting that they are involved with lean growth and reduced fat deposition. 
Additionally, the two markers significant for ADG in the validation population of animals may be more robust for the prediction of ADG and possibly the correlated trait ADFI, across multiple breeds and populations of cattle. PMID:22168586
Zhang, Tiejun; Yu, Long-Xi; McCord, Per; Miller, David; Bhamidimarri, Suresh; Johnson, David; Monteros, Maria J.; Ho, Julie; Reisen, Peter; Samac, Deborah A.
2014-01-01
Verticillium wilt, caused by the soilborne fungus, Verticillium alfalfae, is one of the most serious diseases of alfalfa (Medicago sativa L.) worldwide. To identify loci associated with resistance to Verticillium wilt, a bulk segregant analysis was conducted in susceptible or resistant pools constructed from 13 synthetic alfalfa populations, followed by association mapping in two F1 populations consisting of 352 individuals. Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers were used for genotyping. Phenotyping was done by manual inoculation of the pathogen to replicated cloned plants of each individual and disease severity was scored using a standard scale. Marker-trait association was analyzed by TASSEL. Seventeen SNP markers significantly associated with Verticillium wilt resistance were identified and they were located on chromosomes 1, 2, 4, 7 and 8. SNP markers identified on chromosomes 2, 4 and 7 co-locate with regions of Verticillium wilt resistance loci reported in M. truncatula. Additional markers identified on chromosomes 1 and 8 were located in regions where no Verticillium resistance locus has been reported. This study highlights the value of SNP genotyping by high resolution melting to identify the disease resistance loci in tetraploid alfalfa. With further validation, the markers identified in this study could be used for improving resistance to Verticillium wilt in alfalfa breeding programs. PMID:25536106
Two Novel SNPs of PPARγ Significantly Affect Weaning Growth Traits of Nanyang Cattle.
Huang, Jieping; Chen, Ningbo; Li, Xin; An, Shanshan; Zhao, Minghui; Sun, Taihong; Hao, Ruijie; Ma, Yun
2018-01-02
Peroxisome-proliferator-activated receptor gamma (PPARγ) is a key transcription factor that controls adipocyte differentiation and energy in mammals. Therefore, PPARγ is a potential factor influencing animal growth traits. This study primarily evaluates PPARγ as a candidate gene for growth traits of cattle and identifies potential molecular markers for cattle breeding. Consistent with previous studies, quantitative real-time polymerase chain reaction analysis showed that PPARγ mRNA was expressed mainly, and at extremely high levels, in adipose tissues. Three novel SNPs of the bovine PPARγ gene were identified in 514 individuals from six Chinese cattle breeds: SNP1 (AC_000179.1 g.57386668 C > G) in intron 2 and SNP2 (AC_000179.1 g.57431964 C > T) and SNP3 (AC_000179.1 g.57431994 T > C) in exon 7. The present study also investigated genetic characteristics of these SNP loci in six populations. Association analysis showed that SNP1 and SNP3 loci significantly affect weaning growth traits, especially body weight of Nanyang cattle. These results revealed that SNP1 and SNP3 are potential molecular markers for cattle breeding.
Wang, Xuefeng; Lee, Seunggeun; Zhu, Xiaofeng; Redline, Susan; Lin, Xihong
2013-12-01
Family-based genetic association studies of related individuals provide opportunities to detect genetic variants that complement studies of unrelated individuals. Most statistical methods for family association studies for common variants are single marker based, testing one SNP at a time. In this paper, we consider testing the effect of an SNP set, e.g., SNPs in a gene, in family studies, for both continuous and discrete traits. Specifically, we propose a generalized estimating equations (GEEs) based kernel association test, a variance component based testing method, to test for the association between a phenotype and multiple variants in an SNP set jointly using family samples. The proposed approach allows for both continuous and discrete traits, where the correlation among family members is taken into account through the use of an empirical covariance estimator. We derive the theoretical distribution of the proposed statistic under the null and develop analytical methods to calculate the P-values. We also propose an efficient resampling method for correcting for small sample size bias in family studies. The proposed method allows for easily incorporating covariates and SNP-SNP interactions. Simulation studies show that the proposed method properly controls for type I error rates under both random and ascertained sampling schemes in family studies. We demonstrate through simulation studies that our approach has superior performance for association mapping compared to the single marker based minimum P-value GEE test for an SNP-set effect over a range of scenarios. We illustrate the application of the proposed method using data from the Cleveland Family GWAS Study. © 2013 WILEY PERIODICALS, INC.
High-density genetic map construction and comparative genome analysis in asparagus bean.
Huang, Haitao; Tan, Huaqiang; Xu, Dongmei; Tang, Yi; Niu, Yisong; Lai, Yunsong; Tie, Manman; Li, Huanxiu
2018-03-19
Genetic maps are a prerequisite for quantitative trait locus (QTL) analysis, marker-assisted selection (MAS), fine gene mapping, and assembly of genome sequences. So far, several asparagus bean linkage maps have been established using various kinds of molecular markers. However, these maps were all constructed by gel- or array-based markers. No maps based on sequencing method have been reported. In this study, an NGS-based strategy, SLAF-seq, was applied to create a high-density genetic map for asparagus bean. Through SLAF library construction and Illumina sequencing of two parents and 100 F2 individuals, a total of 55,437 polymorphic SLAF markers were developed and mined for SNP markers. The map consisted of 5,225 SNP markers in 11 LGs, spanning a total distance of 1,850.81 cM, with an average distance between markers of 0.35 cM. Comparative genome analysis with four other legume species, soybean, common bean, mung bean and adzuki bean showed that asparagus bean is genetically more related to adzuki bean. The results will provide a foundation for future genomic research, such as QTL fine mapping, comparative mapping in pulses, and offer support for assembling asparagus bean genome sequence.
Taranto, F; D'Agostino, N; Greco, B; Cardi, T; Tripodi, P
2016-11-21
Knowledge of population structure and genetic diversity in vegetable crops is essential for association mapping studies and genomic selection. Genotyping by sequencing (GBS) represents an innovative method for large scale SNP detection and genotyping of genetic resources. Herein we used the GBS approach for the genome-wide identification of SNPs in a collection of Capsicum spp. accessions and for the assessment of the level of genetic diversity in a subset of 222 cultivated pepper (Capsicum annuum) genotypes. GBS analysis generated a total of 7,568,894 master tags, of which 43.4% uniquely aligned to the reference genome CM334. A total of 108,591 SNP markers were identified, of which 105,184 were in C. annuum accessions. In order to explore the genetic diversity of C. annuum and to select a minimal core set representing most of the total genetic variation with minimum redundancy, a subset of 222 C. annuum accessions were analysed using 32,950 high quality SNPs. Based on Bayesian and Hierarchical clustering it was possible to divide the collection into three clusters. Cluster I had the majority of varieties and landraces mainly from Southern and Northern Italy, and from Eastern Europe, whereas clusters II and III comprised accessions of different geographical origins. Considering the genome-wide genetic variation among the accessions included in cluster I, a second round of Bayesian (K = 3) and Hierarchical (K = 2) clustering was performed. These analyses showed that genotypes were grouped not only based on geographical origin, but also on fruit-related features. GBS data has proven useful to assess the genetic diversity in a collection of C. annuum accessions. The high number of SNP markers, uniformly distributed on the 12 chromosomes, allowed the accessions to be distinguished according to geographical origin and fruit-related features. 
SNP markers and information on population structure developed in this study will undoubtedly support genome-wide association mapping studies and marker-assisted selection programs.
Daca-Roszak, P; Pfeifer, A; Żebracka-Gala, J; Jarząb, B; Witt, M; Ziętkiewicz, E
2016-01-01
Assays that allow analysis of the biogeographic origin of biological samples in a standard forensic laboratory have to target a small number of highly differentiating markers. Such markers should be easy to multiplex and the assay must perform well in the degraded and scarce biological material. SNPs localized in the genome regions, which in the past were subjected to differential selective pressure in various populations, are the most widely used markers in the studies of biogeographic affiliation. SNPs reflecting biogeographic differences not related to any phenotypic traits are not sufficiently explored. The goal of our study was to identify a small set of SNPs not related to any known pigmentation/phenotype-specific genes, which would allow efficient discrimination between populations of Europe and East Asia. The selection of SNPs was based on the comparative analysis of representative European and Chinese/Japanese samples (B-lymphocyte cell lines), genotyped using the Infinium HumanOmniExpressExome microarray (Illumina). The classifier, consisting of 24 unlinked SNPs (24-SNP classifier), was selected. The performance of a 14-SNP subset of this classifier (14-SNP subclassifier) was tested using genotype data from several populations. The 14-SNP subclassifier differentiated East Asians, Europeans and Africans with ∼100% accuracy; Palestinians, representative of the Middle East, clustered with Europeans, while Amerindians and Pakistani were placed between East Asian and European populations. Based on these results, we have developed a SNaPshot assay (EurEAs_Gplex) for genotyping SNPs from the 14-SNP subclassifier, combined with an additional marker for gender identification. Forensic utility of the EurEAs_Gplex was verified using degraded and low quantity DNA samples. 
The performance of the EurEAs_Gplex was satisfactory when using degraded DNA; tests using low quantity DNA samples revealed a previously not described source of genotyping errors, potentially important for any SNaPshot-based assays. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Comparison between genotyping by sequencing and SNP-chip genotyping in QTL mapping in wheat
USDA-ARS's Scientific Manuscript database
Array- or chip-based single nucleotide polymorphism (SNP) markers are widely used in genomic studies because of their abundance in a genome and cost less per data point compared to older marker technologies. Genotyping by sequencing (GBS), a relatively newer approach of genotyping, suggests equal or...
USDA-ARS's Scientific Manuscript database
In this investigation 45 parental cacao plants and five progeny derived from the parental stock studied were genotyped using six SNP markers to determine off-types or mislabeled clones and to authenticate crosses made in the Cocoa Research Institute of Ghana (CRIG) breeding program. Investigation wa...
USDA-ARS's Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers for susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpesvirus 3 (CyHV-3) is highly contagious and virulent in common carp (Cyprinus carpio). With the aim to de...
Bennett, G L; Shackelford, S D; Wheeler, T L; King, D A; Casas, E; Smith, T P L
2013-02-01
Genetic markers in casein (CSN1S1) and thyroglobulin (TG) genes have previously been associated with fat distribution in cattle. Determining the nature of these genetic associations (additive, recessive, or dominant) has been difficult, because both markers have small minor allele frequencies in most beef cattle populations. This results in few animals homozygous for the minor alleles. Selection to increase the frequencies of the minor alleles for 2 SNP markers in these genes was undertaken in a composite population. The objective was to obtain better estimates of genetic effects associated with these markers and determine if there were epistatic interactions. Selection increased the frequencies of minor alleles for both SNP from <0.30 to 0.45. Bulls (n = 24) heterozygous for both SNP were used in 3 yr to produce 204 steer progeny harvested at an average age of 474 d. The combined effect of the 9 CSN1S1 × TG genotypes was associated with carcass-adjusted fat thickness (P < 0.06) and meat tenderness predicted at the abattoir by visible and near-infrared reflectance spectroscopy (P < 0.04). Genotype did not affect BW from birth through harvest, ribeye area, marbling score, slice shear force, or image-based yield grade (P > 0.10). Additive, dominance, and epistatic SNP association effects were estimated from genotypic effects for adjusted fat thickness and predicted meat tenderness. Adjusted fat thickness showed a dominance association with TG SNP (P < 0.06) and an epistatic additive CSN1S1 × additive TG association (P < 0.03). For predicted meat tenderness, heterozygous TG meat was more tender than meat from either homozygote (P < 0.002). Dominance and epistatic associations can result in different SNP allele substitution effects in populations where SNP have the same linkage disequilibrium with causal mutations but have different frequencies. 
Although the complex associations estimated in this study would contribute little to within-population selection response, they could be important for marker-assisted management or reciprocal selection schemes.
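The additive and dominance effects mentioned above are conventionally derived from the three genotype means at a single locus: the additive effect is half the difference between the two homozygote means, and the dominance effect is the deviation of the heterozygote mean from the homozygote midpoint. The Python sketch below shows this single-locus textbook decomposition only. It is not the authors' model, which also includes two-locus epistatic terms, and the genotype means are hypothetical.

```python
def additive_dominance(mean_aa, mean_ab, mean_bb):
    """Single-locus decomposition of genotype means into additive (a) and dominance (d)."""
    midpoint = (mean_aa + mean_bb) / 2   # average of the two homozygotes
    a = (mean_bb - mean_aa) / 2          # additive effect
    d = mean_ab - midpoint               # dominance deviation of the heterozygote
    return a, d

# Hypothetical genotype means for a trait, NOT data from the study.
a, d = additive_dominance(mean_aa=10.0, mean_ab=13.0, mean_bb=12.0)
print(f"a = {a}, d = {d}")
```

When the absolute value of d exceeds that of a, the heterozygote lies outside the range of the two homozygotes, the pattern the abstract reports for TG and predicted tenderness.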
Cuenca, Jose; Aleza, Pablo; Garcia-Lor, Andres; Ollitrault, Patrick; Navarro, Luis
2016-01-01
Alternaria brown spot (ABS) is a serious disease affecting susceptible citrus genotypes, which is a strong concern regarding citrus breeding programs. Resistance is conferred by a recessive locus (ABSr) previously located by our group within a 3.3 Mb genome region near the centromere in chromosome III. This work addresses fine-linkage mapping of this region for identifying candidate resistance genes and develops new molecular markers for ABS-resistance effective marker-assisted selection (MAS). Markers closely linked to ABSr locus were used for fine mapping using a 268-segregating diploid progeny derived from a heterozygous susceptible × resistant cross. Fine mapping limited the genomic region containing the ABSr resistance gene to 366 kb, flanked by markers at 0.4 and 0.7 cM. This region contains nine genes related to pathogen resistance. Among them, eight are resistance (R) gene homologs, with two of them harboring a serine/threonine protein kinase domain. These two genes along with a gene encoding a S-adenosyl-L-methionine-dependent-methyltransferase protein, should be considered as strong candidates for ABS-resistance. Moreover, the closest SNP was genotyped in 40 citrus varieties, revealing very high association with the resistant/susceptible phenotype. This new marker is currently used in our citrus breeding program for ABS-resistant parent and cultivar selection, at diploid, triploid and tetraploid level. PMID:28066498
Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira
2014-01-01
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular markers was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection. PMID:25048025
Haplotype-Based Genotyping in Polyploids.
Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott
2018-01-01
Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.
Molecular Mapping of Restriction-Site Associated DNA Markers In Allotetraploid Upland Cotton.
Wang, Yangkun; Ning, Zhiyuan; Hu, Yan; Chen, Jiedan; Zhao, Rui; Chen, Hong; Ai, Nijiang; Guo, Wangzhen; Zhang, Tianzhen
2015-01-01
Upland cotton (Gossypium hirsutum L., 2n = 52, AADD) is an allotetraploid, which makes the discovery of single nucleotide polymorphism (SNP) markers difficult. The recent emergence of genome complexity reduction technologies based on the next-generation sequencing (NGS) platform has greatly expedited SNP discovery in crops with highly repetitive and complex genomes. Here we applied restriction-site associated DNA (RAD) sequencing technology for de novo SNP discovery in allotetraploid cotton. We identified 21,109 SNPs between the two parents and used these for genotyping of 161 recombinant inbred lines (RILs). Finally, a high-density linkage map comprising 4,153 loci over 3500 cM was developed from these results. Using this map, quantitative trait loci (QTLs) conferring fiber strength and Verticillium wilt (VW) resistance were mapped to a more accurate region in comparison to the 1576-cM interval determined using the simple sequence repeat (SSR) genetic map. This suggests that the newly constructed map has more power and resolution than the previous SSR map. It will pave the way for the rapid identification of markers for marker-assisted selection in cotton breeding and for the cloning of QTLs for traits of interest.
SNP discovery by high-throughput sequencing in soybean
2010-01-01
Background With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD) have been widely applied to identify genome-wide sequence variations. However, it remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost-effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology. Results A total of 39,022 putative SNPs were identified by the Illumina/Solexa sequencing system using a reduced representation DNA library of two parental lines of a mapping population. The validation rates of these putative SNPs predicted with low and high stringency were 72% and 85%, respectively. One hundred sixty-four SNP markers that resulted from the validation of putative SNPs were selected to target a known QTL, thereby increasing the marker density of the targeted region to one marker per 42 kb.
Conclusions We have demonstrated how to quickly identify large numbers of SNPs for fine mapping of QTL regions by applying massively parallel sequencing combined with genome complexity reduction techniques. This SNP discovery approach is more efficient for targeting multiple QTL regions in the same genetic population, and it can be applied to other crops. PMID:20701770
Dong, Lin-Lin; Chen, Zhong-Jian; Wang, Yong; Wei, Fu-Gang; Zhang, Lian-Juan; Xu, Jiang; Wei, Guang-Fei; Wang, Rui; Yang, Juan; Liu, Wei-Lin; Li, Xi-Wen; Yu, Yu-Qi; Chen, Shi-Lin
2017-01-01
DNA marker-assisted selection of medicinal plants is based on DNA polymorphism: it uses molecular hybridization, polymerase chain reaction and high-throughput sequencing to select DNA sequences related to phenotypes such as high yield, superior quality and stress resistance, and thereby assists the breeding of new cultivars. This study bred the first disease-resistant cultivar of notoginseng, "Miaoxiang Kangqi 1", using DNA marker-assisted selection of medicinal plants combined with systematic breeding. Analysis by Restriction-site Associated DNA Sequencing (RAD-Seq) showed that the disease-resistant cultivar contained 12 specific SNPs. Among them, one SNP (record_519688) was related to root rot resistance, which indicated that this SNP could serve as a genetic marker of disease-resistant cultivars and assist systematic breeding. Compared with the conventional cultivars, the incidence of root rot and rust rot in notoginseng seedlings decreased by 83.6% and 71.8%, respectively. The incidence of root rot in notoginseng cultivated for 2 and 3 years declined by 43.6% and 62.9%, respectively, compared with that of the conventional cultivars. Additionally, potential disease-resistant groups were screened based on the related SNP, and this model enlarged the target groups and improved breeding efficiency. DNA marker-assisted selection of medicinal plants accelerated the breeding and promotion of new cultivars, and helped ensure the healthy development of the Chinese medicinal materials industry. Copyright© by the Chinese Pharmaceutical Association.
Cuenca, José; Aleza, Pablo; Vicent, Antonio; Brunel, Dominique; Ollitrault, Patrick; Navarro, Luis
2013-01-01
Genetic analysis of phenotypical traits and marker-trait association in polyploid species is generally considered as a challenge. In the present work, different approaches were combined taking advantage of the particular genetic structures of 2n gametes resulting from second division restitution (SDR) to map a genome region linked to Alternaria brown spot (ABS) resistance in triploid citrus progeny. ABS in citrus is a serious disease caused by the tangerine pathotype of the fungus Alternaria alternata. This pathogen produces ACT-toxin, which induces necrotic lesions on fruit and young leaves, defoliation and fruit drop in susceptible genotypes. It is a strong concern for triploid breeding programs aiming to produce seedless mandarin cultivars. The monolocus dominant inheritance of susceptibility, proposed on the basis of diploid population studies, was corroborated in triploid progeny. Bulk segregant analysis coupled with genome scan using a large set of genetically mapped SNP markers and targeted genetic mapping by half tetrad analysis, using SSR and SNP markers, allowed locating a 3.3 Mb genomic region linked to ABS resistance near the centromere of chromosome III. Clusters of resistance genes were identified by gene ontology analysis of this genomic region. Some of these genes are good candidates to control the dominant susceptibility to the ACT-toxin. SSR and SNP markers were developed for efficient early marker-assisted selection of ABS resistant hybrids. PMID:24116149
Nakajima, Ayaka; Kawaguchi, Fuki; Uemoto, Yoshinobu; Fukushima, Moriyuki; Yoshida, Emi; Iwamoto, Eiji; Akiyama, Takayuki; Kohama, Namiko; Kobayashi, Eiji; Honda, Takeshi; Oyama, Kenji; Mannen, Hideyuki; Sasazaki, Shinji
2018-05-01
The objective of this study was to identify genomic regions associated with fat-related traits using a Japanese Black cattle population in Hyogo. From 1836 animals, those with high or low values were selected on the basis of corrected phenotype and then pooled into high and low groups (n = 100 each), respectively. DNA pool-based genome-wide association study (GWAS) was performed using Illumina BovineSNP50 BeadChip v2 with three replicate assays for each pooled sample. GWAS detected that two single nucleotide polymorphisms (SNPs) on BTA7 (ARS-BFGL-NGS-35463 and Hapmap23838-BTA-163815) and one SNP on BTA12 (ARS-BFGL-NGS-2915) significantly affected fat percentage (FAR). The significance of ARS-BFGL-NGS-35463 on BTA7 was confirmed by individual genotyping in all pooled samples. Moreover, association analysis between SNP and FAR in 803 Japanese Black cattle revealed a significant effect of SNP on FAR. Thus, further investigation of these regions is required to identify FAR-associated genes and mutations, which can lead to the development of DNA markers for marker-assisted selection for the genetic improvement of beef quality. © 2018 Japanese Society of Animal Science.
Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.
Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos
2015-08-01
Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Molecular differentiation of Russian wild ginseng using mitochondrial nad7 intron 3 region.
Li, Guisheng; Cui, Yan; Wang, Hongtao; Kwon, Woo-Saeng; Yang, Deok-Chun
2017-07-01
Cultivated ginseng is often introduced as a substitute and adulterant of Russian wild ginseng due to its lower cost or misidentification caused by similarity in appearance with wild ginseng. The aim of this study is to develop a simple and reliable method to differentiate Russian wild ginseng from cultivated ginseng. The mitochondrial NADH dehydrogenase subunit 7 (nad7) intron 3 regions of Russian wild ginseng and Chinese cultivated ginseng were analyzed. Based on the multiple sequence alignment result, a specific primer for Russian wild ginseng was designed by introducing an additional mismatch, and allele-specific polymerase chain reaction (PCR) was performed for identification of wild ginseng. Real-time allele-specific PCR with endpoint analysis was used for validation of the developed Russian wild ginseng single nucleotide polymorphism (SNP) marker. An SNP site specific to Russian wild ginseng was exploited by multiple alignments of mitochondrial nad7 intron 3 regions of different ginseng samples. With the SNP-based specific primer, Russian wild ginseng was successfully discriminated from Chinese and Korean cultivated ginseng samples by allele-specific PCR. The reliability and specificity of the SNP marker were validated by checking 20 individuals of Russian wild ginseng samples with the real-time allele-specific PCR assay. An effective DNA method for molecular discrimination of Russian wild ginseng from Chinese and Korean cultivated ginseng was developed. The established real-time allele-specific PCR was simple and reliable, and the present method should be a crucial complement of chemical analysis for authentication of Russian wild ginseng.
Liu, Yanfang; Liao, Huidan; Liu, Ying; Guo, Juanjuan; Sun, Yi; Fu, Xiaoliang; Xiao, Ding; Cai, Jifeng; Lan, Lingmei; Xie, Pingli; Zha, Lagabaiyila
2017-04-01
Nonbinary single-nucleotide polymorphisms (SNPs) are potential forensic genetic markers because their discrimination power is greater than that of normal binary SNPs and because they can be typed in highly degraded samples. We previously developed a nonbinary SNP multiplex typing assay. In this study, we selected 20 additional nonbinary SNPs from the NCBI SNP database and verified them through pyrosequencing. These 20 nonbinary SNPs were analyzed using the fluorescent-labeled SNaPshot multiplex SNP typing method. The allele frequencies and genetic parameters of these 20 nonbinary SNPs were determined among 314 unrelated individuals from Han populations from China. The total power of discrimination was 0.9999999999994, and the cumulative probability of exclusion was 0.9986. Moreover, combining this 20 nonbinary SNP assay with the 20 nonbinary SNP assay we previously developed showed that the cumulative probability of exclusion of the 40 nonbinary SNPs was 0.999991 and that no significant linkage disequilibrium was observed among the 40 nonbinary SNPs. Thus, we concluded that this new system consisting of 20 new nonbinary SNPs could provide highly informative polymorphic data for forensic application and would serve as a potentially valuable supplement to forensic DNA analysis. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
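The total power of discrimination and cumulative probability of exclusion reported above are conventionally obtained by combining per-locus values under the assumption that the loci are independent, which the absence of significant linkage disequilibrium supports. The following is a minimal sketch of that standard combination rule, not the authors' own code; the function name and the per-locus values are hypothetical and are not taken from the study.

```python
from math import prod

def combined_power(per_locus):
    """Combine per-locus probabilities assuming independent loci.

    Returns 1 - prod(1 - p_i), the usual formula for total power of
    discrimination or cumulative probability of exclusion.
    """
    return 1.0 - prod(1.0 - p for p in per_locus)

# Hypothetical per-locus values for illustration only.
example_pd = [0.5, 0.5, 0.5]
print(combined_power(example_pd))  # 0.875
```

Because each additional independent locus multiplies the residual (1 - p), a panel of moderately informative markers can approach a combined value very close to 1.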
Liu, Kaihua; Zhang, Bin; Teng, Zhaochun; Wang, Youtao; Dong, Guodong; Xu, Cong; Qin, Bo; Song, Chunlian; Chai, Jun; Li, Yang; Shi, Xianwei; Shu, Xianghua; Zhang, Yifang
2017-03-01
We investigated the associations between SLC11A1 polymorphisms and susceptibility to tuberculosis (TB) in Chinese Holstein cattle, using a case-control study of 136 animals that had positive reactions to TB tests and showed symptoms and 96 animals that had negative reactions to tests and showed no symptoms. Polymerase chain reaction (PCR) sequencing and the restriction fragment length polymorphism (RFLP) technique were used to detect and determine SLC11A1 polymorphisms. Association analysis identified significant correlations between SLC11A1 polymorphisms and susceptibility/resistance to TB, and two genetic markers for SLC11A1 were established using PCR-RFLP. Sequence alignment of SLC11A1 revealed seven single-nucleotide polymorphisms (SNPs). This is the first report of MaeII PCR-RFLP markers for the SLC11A1-SNP3 site and PstI PCR-RFLP markers for the SLC11A1-SNP5 and SLC11A1-SNP6 sites in Chinese Holstein cattle. Logistic regression analysis indicated that SLC11A1-SNP1, SLC11A1-SNP3, and SLC11A1-SNP5 were significantly associated with susceptibility/resistance to TB. Two genotypes of SLC11A1-SNP3 were susceptible to TB, whereas one genotype of SLC11A1-SNP1 and two genotypes of SLC11A1-SNP5 were resistant. Haplotype analysis showed that nine haplotypes were potentially resistant to TB. After Bonferroni correction, three of the haplotypes remained significantly associated with TB resistance. SLC11A1 is a useful candidate gene related to TB in Chinese Holstein cattle. Copyright © 2016 Elsevier Ltd. All rights reserved.
2010-01-01
Background The information provided by dense genome-wide markers using high throughput technology is of considerable potential in human disease studies and livestock breeding programs. Genome-wide association studies relate individual single nucleotide polymorphisms (SNP) from dense SNP panels to individual measurements of complex traits, with the underlying assumption being that any association is caused by linkage disequilibrium (LD) between SNP and quantitative trait loci (QTL) affecting the trait. Often SNP are in genomic regions of no trait variation. Whole genome Bayesian models are an effective way of incorporating this and other important prior information into modelling. However a full Bayesian analysis is often not feasible due to the large computational time involved. Results This article proposes an expectation-maximization (EM) algorithm called emBayesB which allows only a proportion of SNP to be in LD with QTL and incorporates prior information about the distribution of SNP effects. The posterior probability of being in LD with at least one QTL is calculated for each SNP along with estimates of the hyperparameters for the mixture prior. A simulated example of genomic selection from an international workshop is used to demonstrate the features of the EM algorithm. The accuracy of prediction is comparable to a full Bayesian analysis but the EM algorithm is considerably faster. The EM algorithm was accurate in locating QTL which explained more than 1% of the total genetic variation. A computational algorithm for very large SNP panels is described. Conclusions emBayesB is a fast and accurate EM algorithm for implementing genomic selection and predicting complex traits by mapping QTL in genome-wide dense SNP marker data. Its accuracy is similar to Bayesian methods but it takes only a fraction of the time. PMID:20969788
Brown, Allan F; Yousef, Gad G; Chebrolu, Kranthi K; Byrd, Robert W; Everhart, Koyt W; Thomas, Aswathy; Reid, Robert W; Parkin, Isobel A P; Sharpe, Andrew G; Oliver, Rebekah; Guzman, Ivette; Jackson, Eric W
2014-09-01
A high-resolution genetic linkage map of B. oleracea was developed from a B. napus SNP array. The work will facilitate genetic and evolutionary studies in Brassicaceae. A broccoli population, VI-158 × BNC, consisting of 150 F2:3 families was used to create a saturated Brassica oleracea (diploid: CC) linkage map using a recently developed rapeseed (Brassica napus) (tetraploid: AACC) Illumina Infinium single nucleotide polymorphism (SNP) array. The map consisted of 547 non-redundant SNP markers spanning 948.1 cM across nine chromosomes with an average interval size of 1.7 cM. As the SNPs are anchored to the genomic reference sequence of the rapid cycling B. oleracea TO1000, we were able to estimate that the map provides 96 % coverage of the diploid genome. Carotenoid analysis of 2 years data identified 3 QTLs on two chromosomes that are associated with up to half of the phenotypic variation associated with the accumulation of total or individual compounds. By searching the genome sequences of the two related diploid species (B. oleracea and B. rapa), we further identified putative carotenoid candidate genes in the region of these QTLs. This is the first description of the use of a B. napus SNP array to rapidly construct high-density genetic linkage maps of one of the constituent diploid species. The unambiguous nature of these markers with regard to genomic sequences provides evidence to the nature of genes underlying the QTL, and demonstrates the value and impact this resource will have on Brassica research.
USDA-ARS's Scientific Manuscript database
For the first time in many years a comprehensive genome map for cultivated oat has been constructed using a combination of single nucleotide polymorphism (SNP) markers and validated with a collection of cytogenetically defined germplasm lines. The markers were able to help distinguish the three geno...
USDA-ARS's Scientific Manuscript database
Genetic diversity, population structure, and genome-wide marker-trait association analyses were conducted on a special collection of 298 homozygous lettuce (Lactuca sativa L.) lines. Each of these lines was derived from a single plant that had been genotyped with 384 SNP markers using LSGermOPA. They...
USDA-ARS's Scientific Manuscript database
Watermelon (Citrullus lanatus var. lanatus) is an important vegetable fruit throughout the world. A high number of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers should provide large coverage of the watermelon genome and high phylogenetic resolution of germplasm acces...
Zhou, Lin; Matsumoto, Tracie; Tan, Hua-Wei; Meinhardt, Lyndel W; Mischke, Sue; Wang, Boyi; Zhang, Dapeng
2015-01-01
Pineapple (Ananas comosus [L.] Merr.) is the third most important tropical fruit in the world after banana and mango. As a crop with vegetative propagation, genetic redundancy is a major challenge for efficient genebank management and in breeding. Using expressed sequence tag and nucleotide sequences from public databases, we developed 213 single nucleotide polymorphism (SNP) markers and validated 96 SNPs by genotyping the United States Department of Agriculture - Agricultural Research Service pineapple germplasm collection, maintained in Hilo, Hawaii. The validation resulted in designation of a set of 57 polymorphic SNP markers that revealed a high rate of duplicates in this pineapple collection. Twenty-four groups of duplicates were detected, encompassing 130 of the total 170 A. comosus accessions. The results show that somatic mutation has been the main source of intra-cultivar variations in pineapple. Multivariate clustering and a model-based population stratification suggest that the modern pineapple cultivars are comprised of progenies that are derived from different wild Ananas botanical varieties. Parentage analysis further revealed that both A. comosus var. bracteatus and A. comosus var. ananassoides are likely progenitors of pineapple cultivars. However, the traditional classification of cultivated pineapple into horticultural groups (e.g. 'Cayenne', 'Spanish', 'Queen') was not well supported by the present study. These SNP markers provide robust and universally comparable DNA fingerprints; thus, they can serve as an efficient genotyping tool to assist pineapple germplasm management, propagation of planting material, and pineapple cultivar protection. The high rate of genetic redundancy detected in this pineapple collection suggests the potential impact of applying this technology on other clonally propagated perennial crops.
Joint Identification of Genetic Variants for Physical Activity in Korean Population
Kim, Jayoun; Kim, Jaehee; Min, Haesook; Oh, Sohee; Kim, Yeonjung; Lee, Andy H.; Park, Taesung
2014-01-01
There has been limited research on genome-wide association with physical activity (PA). This study ascertained genetic associations between PA and 344,893 single nucleotide polymorphism (SNP) markers in 8842 Korean samples. PA data were obtained from a validated questionnaire that included information on PA intensity and duration. Metabolic equivalents of task (METs) were calculated to estimate the total daily PA level for each individual. In addition to single- and multiple-SNP association tests, a pathway enrichment analysis was performed to identify the biological significance of SNP markers. Although no SNP reached genome-wide significance in single-SNP association tests, 59 genetic variants mapped to 76 genes were identified via a multiple-SNP approach using a bootstrap selection stability measure. Pathway analysis for these 59 variants showed that maturity onset diabetes of the young (MODY) was enriched. Joint identification of SNPs could enable the identification of multiple SNPs with good predictive power for PA and a pathway enriched for PA. PMID:25026172
The Minnesota Center for Twin and Family Research Genome-Wide Association Study
Miller, Michael B.; Basu, Saonli; Cunningham, Julie; Eskin, Eleazar; Malone, Steven M.; Oetting, William S.; Schork, Nicholas; Sul, Jae Hoon; Iacono, William G.; Mcgue, Matt
2012-01-01
As part of the Genes, Environment and Development Initiative (GEDI), the Minnesota Center for Twin and Family Research (MCTFR) undertook a genome-wide association study (GWAS), which we describe here. A total of 8405 research participants, clustered in 4-member families, have been successfully genotyped on 527,829 single nucleotide polymorphism (SNP) markers using Illumina’s Human660W-Quad array. Quality control screening of samples and markers as well as SNP imputation procedures are described. We also describe methods for ancestry control and how the familial clustering of the MCTFR sample can be accounted for in the analysis using a Rapid Feasible Generalized Least Squares algorithm. The rich longitudinal MCTFR assessments provide numerous opportunities for collaboration. PMID:23363460
Zhang, Ning; Zhang, Linan; Tao, Ye; Guo, Li; Sun, Juan; Li, Xia; Zhao, Nan; Peng, Jie; Li, Xiaojie; Zeng, Liang; Chen, Jinsa; Yang, Guanpin
2015-03-15
Kelp (Saccharina japonica) has been intensively cultured in China for almost a century. Its genetic improvement is comparable with that of rice. However, the development of molecular tools for kelp is extremely limited, and so is knowledge of its genes, genetics and genomics. Kelp has an alternating life cycle in which the sporophyte generation alternates with the gametophyte generation. The gametophytes of kelp can be cloned and crossed. Due to these characteristics, kelp may serve as a reference for the biological and genetic studies of Volvox, mosses and ferns. We constructed a high-density single nucleotide polymorphism (SNP) linkage map for kelp by restriction site associated DNA (RAD) sequencing. In total, 4,994 SNP-containing physical (tag-defined) RAD loci were mapped on 31 linkage groups. The map spanned a total genetic distance of 1,782.75 cM, covering 98.66% of the expected length (1,806.94 cM). The length of RAD tags (85 bp) was extended to 400-500 bp by MiSeq sequencing, which makes it easier to develop SNP chips and to shift SNP genotyping to a high-throughput track. The number of linkage groups was in accordance with that documented by cytological methods. In addition, we identified a set of microsatellites (99 in total) from the extended RAD tags. A gametophyte sex determining locus was mapped on linkage group 2 in a window about 9.0 cM in width, which was 2.66 cM up to marker_40567 and 6.42 cM down to marker_23595. A high-density SNP linkage map was constructed for kelp, an intensively cultured brown alga in China. The RAD tags were also extended so that a SNP chip could be developed. In addition, a set of microsatellites was identified among mapped loci, and a gametophyte sex determining locus was mapped. This map will facilitate the genetic studies of kelp including, for example, the evaluation of germplasm and the decipherment of the genetic bases of economic traits.
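The coverage figure and the width of the sex-locus window quoted in the kelp abstract follow from simple arithmetic on the reported distances. The sketch below only reproduces that arithmetic; the function and variable names are hypothetical, and how the expected map length was estimated is not stated in the abstract.

```python
def map_coverage(observed_cm, expected_cm):
    """Fraction of the expected genetic map length covered by the observed map."""
    return observed_cm / expected_cm

# Distances as reported in the abstract (cM).
coverage = map_coverage(1782.75, 1806.94)
print(f"{coverage:.2%}")  # 98.66%

# Window around the sex determining locus: distance to each flanking marker.
window_cm = 2.66 + 6.42
print(round(window_cm, 2))  # 9.08, i.e. about 9.0 cM
```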
Manivannan, Abinaya; Kim, Jin-Hee; Yang, Eun-Young; Ahn, Yul-Kyun; Lee, Eun-Su; Choi, Sena; Kim, Do-Sun
2018-01-01
Pepper is an economically important horticultural plant that has been widely used for its pungency and spicy taste in cuisines worldwide, and it has been domesticated since antiquity. To meet the growing demand for pepper with high quality, good organoleptic properties, nutraceutical content, and disease tolerance, genomics-assisted breeding techniques can be used to develop novel pepper varieties with desired traits. The application of next-generation sequencing (NGS) approaches has transformed plant breeding technology, especially in the area of molecular marker-assisted breeding. The availability of genomic information aids a deeper understanding of the molecular mechanisms behind vital physiological processes. In addition, NGS methods facilitate the genome-wide discovery of DNA-based markers linked to key genes involved in important biological phenomena. Among the molecular markers, single nucleotide polymorphisms (SNPs) offer various benefits in comparison with other existing DNA-based markers. The present review concentrates on the impact of NGS approaches on the discovery of useful SNP markers associated with pungency and disease resistance in pepper. The information provided here can be used to improve pepper breeding in the future.
USDA-ARS's Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) are the marker of choice for many researchers due to their abundance and the high-throughput methods available for their multiplex analysis. Only recently have SNP markers been available to researchers in soybean [Glycine max (L.) Merr.] with the release of th...
USDA-ARS's Scientific Manuscript database
An Upland cotton multi-parent advanced generation inter-cross (MAGIC) population was developed through random-mating of 11 diverse cultivars for five generations. In this study, fiber quality data obtained from four environments and 6,071 SNP markers generated via GBS and 223 microsatellite markers...
Lata, Charu; Bhutty, Sarita; Bahadur, Ranjit Prasad; Majee, Manoj; Prasad, Manoj
2011-06-01
The DREB genes code for important plant transcription factors involved in the abiotic stress response and signal transduction. Characterization of DREB genes and development of functional markers for effective alleles is important for marker-assisted selection in foxtail millet. Here the characterization of a cDNA (SiDREB2) encoding a putative dehydration-responsive element-binding protein 2 from foxtail millet and the development of an allele-specific marker (ASM) for dehydration tolerance is reported. A cDNA clone (GenBank accession no. GT090998) coding for a putative DREB2 protein was isolated as a differentially expressed gene from a 6 h dehydration stress SSH library. A 5' RACE (rapid amplification of cDNA ends) was carried out to obtain the full-length cDNA, and sequence analysis showed that SiDREB2 encoded a polypeptide of 234 amino acids with a predicted mol. wt of 25.72 kDa and a theoretical pI of 5.14. A theoretical model of the tertiary structure shows that it has a highly conserved GCC-box-binding N-terminal domain, and an acidic C-terminus that acts as an activation domain for transcription. Based on its similarity to AP2 domains, SiDREB2 was classified into the A-2 subgroup of the DREB subfamily. Quantitative real-time PCR analysis showed significant up-regulation of SiDREB2 by dehydration (polyethylene glycol) and salinity (NaCl), while its expression was less affected by other stresses. A synonymous single nucleotide polymorphism (SNP) associated with dehydration tolerance was detected at the 558th base pair (an A/G transition) in the SiDREB2 gene in a core set of 45 foxtail millet accessions used. Based on the identified SNP, three primers were designed to develop an ASM for dehydration tolerance. The ASM produced a 261 bp fragment in all the tolerant accessions and produced no amplification in the sensitive accessions. 
The use of this ASM might be faster, cheaper, and more reproducible than other SNP genotyping methods, and thus will enable marker-aided breeding of foxtail millet for dehydration tolerance.
A two step Bayesian approach for genomic prediction of breeding values.
Shariati, Mohammad M; Sørensen, Peter; Janss, Luc
2012-05-21
In genomic models that assign an individual variance to each marker, the contribution of one marker to the posterior distribution of the marker variance is only one degree of freedom (df), which introduces many variance parameters with little information per variance parameter. A better alternative could be to form clusters of markers with similar effects, where markers in a cluster have a common variance. The influence of each marker group of size p on the posterior distribution of the marker variances will then be p df. The simulated data from the 15th QTL-MAS workshop were analyzed such that SNP markers were ranked based on their effects and markers with similar estimated effects were grouped together. In step 1, all markers with minor allele frequency greater than 0.01 were included in a SNP-BLUP prediction model. In step 2, markers were ranked based on their estimated variance on the trait in step 1 and every 150 markers were assigned to one group with a common variance. In further analyses, subsets of 1500 and 450 markers with the largest effects in step 2 were kept in the prediction model. Grouping markers outperformed the SNP-BLUP model in terms of accuracy of predicted breeding values. However, the accuracies of predicted breeding values were lower than those of Bayesian methods with marker-specific variances. Grouping markers is less flexible than allowing each marker to have a specific marker variance but, by grouping, the power to estimate marker variances increases. Prior knowledge of the genetic architecture of the trait is necessary for clustering markers and for appropriate prior parameterization.
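The grouping in step 2 can be illustrated with a minimal sketch. It assumes that the step 1 marker effects are already available as a list, uses the squared effect as a stand-in for the "estimated variance" used for ranking, and uses the mean squared effect as a crude per-group variance. The function names are hypothetical and this is not the authors' Bayesian implementation.

```python
def group_markers_by_effect(effects, group_size=150):
    """Rank markers by squared effect (largest first) and assign
    consecutive blocks of `group_size` ranked markers to one group.
    Returns a list giving the group index of each marker."""
    order = sorted(range(len(effects)),
                   key=lambda i: effects[i] ** 2, reverse=True)
    groups = [0] * len(effects)
    for rank, marker in enumerate(order):
        groups[marker] = rank // group_size
    return groups


def group_variances(effects, groups):
    """Assumed proxy for a common group variance: the mean squared
    effect of the markers in each group."""
    sums, counts = {}, {}
    for effect, group in zip(effects, groups):
        sums[group] = sums.get(group, 0.0) + effect * effect
        counts[group] = counts.get(group, 0) + 1
    return {g: sums[g] / counts[g] for g in sums}


# Toy example with four markers and groups of two (the study used 150).
toy_effects = [0.1, -0.5, 0.3, 0.0]
toy_groups = group_markers_by_effect(toy_effects, group_size=2)
toy_vars = group_variances(toy_effects, toy_groups)
```

With p markers sharing one variance, each group contributes p squared effects to that variance estimate instead of one, which is the gain in information the abstract describes.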
Wang, Jun; Wang, Zhilan; Du, Xiaofen; Yang, Huiqing; Han, Fang; Han, Yuanhuai; Yuan, Feng; Zhang, Linyi; Peng, Shuzhong; Guo, Erhu
2017-01-01
Foxtail millet (Setaria italica), a very important grain crop in China, has become a new model plant for cereal crops and biofuel grasses. Although its reference genome sequence was released recently, the number of known quantitative trait loci (QTLs) controlling complex agronomic traits remains limited. The development of massively parallel genotyping methods and next-generation sequencing technologies provides an excellent opportunity for developing single-nucleotide polymorphisms (SNPs) for linkage map construction and QTL analysis of complex quantitative traits. In this study, a high-throughput and cost-effective RAD-seq approach was employed to generate a high-density genetic map for foxtail millet. A total of 2,668,587 SNP loci were detected according to the reference genome sequence; 9,968 SNP markers were used to genotype 124 F2 progenies derived from the cross between Hongmiaozhangu and Changnong35; a high-density genetic map spanning 1648.8 cM, with an average distance of 0.17 cM between adjacent markers, was constructed; 11 major QTLs for eight agronomic traits were identified; and five co-dominant DNA markers were developed. These findings will be of value for the identification of candidate genes and marker-assisted selection in foxtail millet. PMID:28644843
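The reported marker density can be checked with a short calculation. The sketch below assumes that the average distance is the map length divided by the number of intervals between adjacent markers, and that the map has nine linkage groups (one per foxtail millet chromosome), which the abstract does not state. The function name is hypothetical.

```python
def mean_marker_spacing(map_length_cm, n_markers, n_linkage_groups=1):
    """Average distance between adjacent markers on a genetic map.
    Each linkage group with k markers has k - 1 intervals, so the
    total number of intervals is n_markers - n_linkage_groups."""
    intervals = n_markers - n_linkage_groups
    if intervals <= 0:
        raise ValueError("need more markers than linkage groups")
    return map_length_cm / intervals


# Values from the abstract; nine linkage groups is an assumption.
spacing = mean_marker_spacing(1648.8, 9968, n_linkage_groups=9)
print(round(spacing, 2))
```

Under these assumptions the result rounds to the 0.17 cM given in the abstract.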
Huang, Chao-Wei; Lin, Yu-Tsung; Ding, Shih-Torng; Lo, Ling-Ling; Wang, Pei-Hwa; Lin, En-Chung; Liu, Fang-Wei; Lu, Yen-Wen
2015-01-01
Genetic markers associated with economic traits have been widely explored for animal breeding. Among these markers, single-nucleotide polymorphisms (SNPs) are gradually becoming a prevalent and effective evaluation tool. Because SNP genotyping focuses only on the genetic sequences of interest, it reduces evaluation time and cost. Compared to traditional approaches, SNP genotyping techniques incorporate informative genetic background, improve breeding prediction accuracy and raise breeding quality on the farm. This article therefore reviews the typical procedures of animal breeding using SNPs and the current status of related techniques. The associated SNP information and genotyping techniques, including microarray and Lab-on-a-Chip based platforms, along with their potential are highlighted. Examples in pig and poultry with different SNP loci linked to high economic trait values are given. The recommendations for utilizing SNP genotyping in animal breeding are summarized. PMID:27600241
USDA-ARS's Scientific Manuscript database
Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...
Mester, David; Ronin, Yefim; Schnable, Patrick; Aluru, Srinivas; Korol, Abraham
2015-01-01
Our aim was to develop a fast and accurate algorithm for constructing consensus genetic maps for chip-based SNP genotyping data with a high proportion of shared markers between mapping populations. Chip-based genotyping of SNP markers allows the production of high-density genetic maps with a relatively standardized set of marker loci for different mapping populations. The availability of a standard high-throughput mapping platform simplifies consensus analysis by ignoring unique markers at the stage of consensus mapping, thereby reducing the mathematical complexity of the problem and, in turn, allowing larger mapping data sets to be analyzed using global optimization criteria instead of local ones. Our three-phase analytical scheme includes automatic selection of ~100-300 of the most informative (resolvable by recombination) markers per linkage group, building a stable skeletal marker order for each data set and verifying it using jackknife re-sampling, and consensus mapping analysis based on a global optimization criterion. A novel Evolution Strategy optimization algorithm with a global optimization criterion presented in this paper is able to generate high-quality, ultra-dense consensus maps, with many thousands of markers per genome. This algorithm utilizes "potentially good orders" in the initial solution and in the new mutation procedures that generate trial solutions, making it possible to obtain a consensus order in reasonable time. The developed algorithm, tested on a wide range of simulated data and real-world data (Arabidopsis), outperformed two tested state-of-the-art algorithms in mapping accuracy and computation time. PMID:25867943
Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios
2011-01-19
Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food.
Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios
2011-01-01
Background: Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. Methods: The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. Conclusions: The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. Significance: The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food. PMID:21283808
Pandareesh, M D; Anand, T
2014-05-01
Sodium nitroprusside (SNP) is a widely used nitric oxide (NO) donor, known to exert nitrative stress by up-regulation of inducible nitric oxide synthase (iNOS). Nω-nitro-L-arginine-methyl ester (L-NAME), a NO inhibitor that inhibits iNOS expression, was used as a positive control. The present study was designed to assess the neuroprotective propensity of Bacopa monniera extract (BME) in SNP-induced neuronal damage and oxido-nitrative stress in PC12 cells via modulation of iNOS, heat shock proteins and apoptotic markers. Our results show that pre-treatment of PC12 cells with BME ameliorates the mitochondrial and plasma membrane damage induced by SNP (200 μM), as evidenced by MTT and LDH assays. BME pre-treatment inhibited NO generation by down-regulating iNOS expression. BME replenished the antioxidant status depleted by SNP treatment. SNP-induced damage to cellular, nuclear and mitochondrial integrity was also restored by BME, which was confirmed by ROS estimation, comet assay and mitochondrial membrane potential assays, respectively. BME pre-treatment efficiently attenuated the SNP-induced apoptotic protein biomarkers such as Bax, Bcl-2, cytochrome-c and caspase-3, which orchestrate the proteolytic damage of the cell. Q-PCR results further showed that the up-regulation of neuronal cell stress markers such as HO-1 and iNOS and the down-regulation of BDNF upon SNP exposure were attenuated by BME pre-treatment. Considering all these findings, we report that BME protects PC12 cells against SNP-induced toxicity via its free radical scavenging and neuroprotective mechanism.
McClure, Matthew C.; Sonstegard, Tad S.; Wiggans, George R.; Van Eenennaam, Alison L.; Weber, Kristina L.; Penedo, Cecilia T.; Berry, Donagh P.; Flynn, John; Garcia, Jose F.; Carmo, Adriana S.; Regitano, Luciana C. A.; Albuquerque, Milla; Silva, Marcos V. G. B.; Machado, Marco A.; Coffey, Mike; Moore, Kirsty; Boscher, Marie-Yvonne; Genestout, Lucie; Mazza, Raffaele; Taylor, Jeremy F.; Schnabel, Robert D.; Simpson, Barry; Marques, Elisa; McEwan, John C.; Cromie, Andrew; Coutinho, Luiz L.; Kuehn, Larry A.; Keele, John W.; Piper, Emily K.; Cook, Jim; Williams, Robert; Van Tassell, Curtis P.
2013-01-01
To help cattle producers transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification, we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could be accurately imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistencies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes by MS genotyping animals whose imputed MS alleles fail parentage verification and then incorporating those animals into the reference dataset. PMID:24065982
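The idea of imputing an MS allele from a SNP haplotype can be sketched as a lookup built from reference animals that have both data types. The haplotype strings, allele sizes and function names below are hypothetical, and the majority-vote rule is an assumption made for illustration; the abstract does not describe the authors' phasing or imputation procedure at this level of detail.

```python
from collections import Counter, defaultdict


def build_reference(pairs):
    """Build a haplotype -> MS allele table from (snp_haplotype, ms_allele)
    pairs observed in reference animals. When a haplotype is seen with
    several alleles, the most frequent one is kept (assumed rule)."""
    counts = defaultdict(Counter)
    for haplotype, allele in pairs:
        counts[haplotype][allele] += 1
    return {hap: c.most_common(1)[0][0] for hap, c in counts.items()}


def impute(reference, haplotype):
    """Return the imputed MS allele, or None for an unseen haplotype.
    Animals returning None would need direct MS genotyping."""
    return reference.get(haplotype)


# Hypothetical reference data: SNP haplotypes paired with MS allele sizes.
observed = [("AAGT", 181), ("AAGT", 181), ("AAGT", 183), ("CCGT", 177)]
reference = build_reference(observed)
```

In the workflow the abstract suggests, animals whose imputed alleles fail parentage verification are MS genotyped and added to the reference, which in this sketch would correspond to appending new pairs and rebuilding the table.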
Song, H; Li, L; Ma, P; Zhang, S; Su, G; Lund, M S; Zhang, Q; Ding, X
2018-06-01
This study investigated the efficiency of genomic prediction when adding markers identified by genome-wide association study (GWAS), using a data set of high-density (HD) markers imputed from 54K markers in Chinese Holsteins. Among 3,056 Chinese Holsteins with imputed HD data, 2,401 individuals born before October 1, 2009, were used for GWAS and as a reference population for genomic prediction, and the 220 younger cows were used as a validation population. In total, 1,403, 1,536, and 1,383 significant single nucleotide polymorphisms (SNP; false discovery rate at 0.05) associated with conformation final score, mammary system, and feet and legs were identified, respectively. About 2 to 3% of the genetic variance of the 3 traits was explained by these significant SNP. Only a very small proportion of significant SNP identified by GWAS was included in the 54K marker panel. Three new marker sets (54K+) were herein produced by adding significant SNP obtained by linear mixed model for each trait into the 54K marker panel. Genomic breeding values were predicted using a Bayesian variable selection (BVS) model. The accuracies of genomic breeding value by BVS based on the 54K+ data were 2.0 to 5.2% higher than those based on the 54K data. The imputed HD markers yielded 1.4% higher accuracy on average (BVS) than the 54K data. Both the 54K+ and HD data generated lower bias of genomic prediction, and the 54K+ data yielded the lowest bias in all situations. Our results show that the imputed HD data were not very useful for improving the accuracy of genomic prediction and that adding the significant markers derived from the imputed HD marker panel could improve the accuracy of genomic prediction and decrease the bias of genomic prediction. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Development of genetic markers in abalone through construction of a SNP database.
Kang, J-H; Appleyard, S A; Elliott, N G; Jee, Y-J; Lee, J B; Kang, S W; Baek, M K; Han, Y S; Choi, T-J; Lee, Y S
2011-06-01
In the absence of a reference genome, single-nucleotide polymorphism (SNP) discovery in a group of abalone species was undertaken by random sequence assembly. A web-based interface was constructed, and 11 932 DNA sequences from the genus Haliotis were assembled, with 1321 contigs built. Of these, 118 contigs that consisted of at least ten annotation groups were selected. A total of 1577 putative SNPs were identified from the 118 contigs, with SNPs in several HSP70 gene contigs confirmed by PCR amplification of an 809-bp DNA fragment. SNPs in the HSP70 gene were compared across eight abalone species. A total of 129 polymorphic sites, including heterozygote sites within and among species, were observed. Phylogenetic analysis of the partial HSP70 gene region showed separation of the tested abalone into two groups, one reflecting the southern hemisphere species and the other the northern hemisphere species. Interestingly, Haliotis iris from New Zealand showed a closer relationship to species distributed in the northern Pacific region. Although HSP genes are known to be highly conserved among taxa, the validation of polymorphic SNPs from HSP70 in this mollusc demonstrates the applicability of cross-species SNP markers in abalone and is the first step towards universal nuclear markers in Haliotis. © 2010 NFRDI, Animal Genetics © 2010 Stichting International Foundation for Animal Genetics.
USDA-ARS's Scientific Manuscript database
Our objective was to evaluate whether breed composition of crossbred cattle could be predicted using reference breed frequencies of SNP markers on the BovineSNP50 array. Semen DNA samples of over 2,000 bulls from 16 common commercial beef breeds were genotyped using the array and used to estimate cu...
Coding SNP in tenascin-C Fn-III-D domain associates with adult asthma.
Matsuda, Akira; Hirota, Tomomitsu; Akahoshi, Mitsuteru; Shimizu, Makiko; Tamari, Mayumi; Miyatake, Akihiko; Takahashi, Atsushi; Nakashima, Kazuko; Takahashi, Naomi; Obara, Kazuhiko; Yuyama, Noriko; Doi, Satoru; Kamogawa, Yumiko; Enomoto, Tadao; Ohshima, Koichi; Tsunoda, Tatsuhiko; Miyatake, Shoichiro; Fujita, Kimie; Kusakabe, Moriaki; Izuhara, Kenji; Nakamura, Yusuke; Hopkin, Julian; Shirakawa, Taro
2005-10-01
The extracellular matrix glycoprotein tenascin-C (TNC) has been accepted as a valuable histopathological subepithelial marker for evaluating the severity of asthmatic disease and the therapeutic response to drugs. We found an association between adult asthma and a SNP in the region encoding the TNC fibronectin type III-D (Fn-III-D) domain in a case-control study of a Japanese population including 446 adult asthmatic patients and 658 normal healthy controls. The SNP (44513A/T in exon 17) strongly associates with adult bronchial asthma (chi2 test, P=0.00019, odds ratio=1.76, 95% confidence interval=1.31-2.36). This coding SNP induces an amino acid substitution (Leu1677Ile) within the Fn-III-D domain of the alternative splicing region. Computer-assisted protein structure modeling suggests that the substituted amino acid is located at the outer edge of the beta-sheet in the Fn-III-D domain and causes instability of this beta-sheet. As the TNC fibronectin-III domain has molecular elasticity, the structural change may affect the integrity and stiffness of asthmatic airways. In addition, TNC expression in lung fibroblasts increases with Th2 immune cytokine stimulation. Thus, Leu1677Ile may be a valuable marker for evaluating the risk of developing asthma and may play a role in its pathogenesis.
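For readers unfamiliar with how an odds ratio and its confidence interval are obtained from case-control counts, a minimal sketch follows. It uses the standard Woolf (log) method, which the abstract does not confirm as the method used, and the 2x2 counts are hypothetical because the abstract does not report genotype counts.

```python
import math


def odds_ratio_ci(case_exposed, case_unexposed,
                  control_exposed, control_unexposed, z=1.96):
    """Odds ratio with a Woolf (log-scale) confidence interval.
    z = 1.96 gives an approximate 95% interval."""
    odds_ratio = (case_exposed * control_unexposed) / (
        case_unexposed * control_exposed)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(1 / case_exposed + 1 / case_unexposed
                          + 1 / control_exposed + 1 / control_unexposed)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, NOT taken from the study.
example_or, example_lower, example_upper = odds_ratio_ci(20, 10, 10, 20)
```

An interval that excludes 1, such as the reported 1.31-2.36, indicates an association at the chosen confidence level.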
Zhou, Lin; Matsumoto, Tracie; Tan, Hua-Wei; Meinhardt, Lyndel W; Mischke, Sue; Wang, Boyi; Zhang, Dapeng
2015-01-01
Pineapple (Ananas comosus [L.] Merr.) is the third most important tropical fruit in the world after banana and mango. As a crop with vegetative propagation, genetic redundancy is a major challenge for efficient genebank management and in breeding. Using expressed sequence tag and nucleotide sequences from public databases, we developed 213 single nucleotide polymorphism (SNP) markers and validated 96 SNPs by genotyping the United States Department of Agriculture - Agricultural Research Service pineapple germplasm collection, maintained in Hilo, Hawaii. The validation resulted in designation of a set of 57 polymorphic SNP markers that revealed a high rate of duplicates in this pineapple collection. Twenty-four groups of duplicates were detected, encompassing 130 of the total 170 A. comosus accessions. The results show that somatic mutation has been the main source of intra-cultivar variations in pineapple. Multivariate clustering and a model-based population stratification suggest that the modern pineapple cultivars are comprised of progenies that are derived from different wild Ananas botanical varieties. Parentage analysis further revealed that both A. comosus var. bracteatus and A. comosus var. ananassoides are likely progenitors of pineapple cultivars. However, the traditional classification of cultivated pineapple into horticultural groups (e.g. ‘Cayenne’, ‘Spanish’, ‘Queen’) was not well supported by the present study. These SNP markers provide robust and universally comparable DNA fingerprints; thus, they can serve as an efficient genotyping tool to assist pineapple germplasm management, propagation of planting material, and pineapple cultivar protection. The high rate of genetic redundancy detected in this pineapple collection suggests the potential impact of applying this technology on other clonally propagated perennial crops. PMID:26640697
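Detecting duplicate accessions from SNP fingerprints can be sketched as grouping accessions that share an identical multilocus genotype. The accession names and genotype calls below are hypothetical, and exact matching is a simplifying assumption: the abstract does not say how missing calls or genotyping errors were handled.

```python
from collections import defaultdict


def duplicate_groups(genotypes):
    """Group accessions whose SNP profiles are identical.
    `genotypes` maps accession name -> sequence of genotype calls
    in a fixed marker order. Returns only groups with 2+ members."""
    by_profile = defaultdict(list)
    for accession, profile in genotypes.items():
        by_profile[tuple(profile)].append(accession)
    return [sorted(members) for members in by_profile.values()
            if len(members) > 1]


# Hypothetical fingerprints at three SNP loci.
panel = {
    "acc_01": ["AA", "CT", "GG"],
    "acc_02": ["AA", "CT", "GG"],
    "acc_03": ["AG", "CC", "GG"],
    "acc_04": ["AA", "CT", "GG"],
}
groups = duplicate_groups(panel)
redundant = sum(len(g) for g in groups)
```

Counting the members of all groups gives the number of accessions involved in duplication, the same kind of figure as the 130 of 170 accessions reported in the abstract.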
Selection and Management of DNA Markers for Use in Genomic Evaluation
USDA-ARS's Scientific Manuscript database
A database was constructed to store genotypes for 50,972 single-nucleotide polymorphisms (SNP) from the Illumina BovineSNP50 BeadChip for over 30,000 animals. The database allows storage of multiple samples per animal and stores all SNP genotypes for a sample in a single row. An indicator specifies ...
Ulloa, Pilar E; Rincón, Gonzalo; Islas-Trejo, Alma; Araneda, Cristian; Iturra, Patricia; Neira, Roberto; Medrano, Juan F
2015-06-01
The objectives of this study were to measure gene expression in zebrafish and then identify SNP to be used as potential markers in a growth association study. We developed an approach where muscle samples collected from low- and high-growth fish were analyzed using RNA-Sequencing (RNA-seq), and SNP were chosen from the genes that were differentially expressed between the low and high groups. A population of 24 families was fed a plant protein-based diet from the larval to adult stages. From a total of 440 males, 5 % of the fish from both tails of the weight gain distribution were selected. Total RNA was extracted from individual muscle of 8 low-growth and 8 high-growth fish. Two pooled RNA-Seq libraries were prepared for each phenotype using 4 fish per library. Libraries were sequenced using the Illumina GAII Sequencer and analyzed using the CLCBio genomic workbench software. One hundred and twenty-four genes were differentially expressed between phenotypes (p value < 0.05 and FDR < 0.2). From these genes, 164 SNP were selected and genotyped in 240 fish samples. Marker-trait analysis revealed 5 SNP associated with growth in key genes (Nars, Lmod2b, Cuzd1, Acta1b, and Plac8l1). These genes are good candidates for further growth studies in fish and to consider for identification of potential SNPs associated with different growth rates in response to a plant protein-based diet.
Hagen, Ingerid J; Billing, Anna M; Rønning, Bernt; Pedersen, Sindre A; Pärn, Henrik; Slate, Jon; Jensen, Henrik
2013-05-01
With the advent of next generation sequencing, new avenues have opened to study genomics in wild populations of non-model species. Here, we describe a successful approach to a genome-wide medium density Single Nucleotide Polymorphism (SNP) panel in a non-model species, the house sparrow (Passer domesticus), through the development of a 10 K Illumina iSelect HD BeadChip. Genomic DNA and cDNA derived from six individuals were sequenced on a 454 GS FLX system and generated a total of 1.2 million sequences, in which SNPs were detected. As no reference genome exists for the house sparrow, we used the zebra finch (Taeniopygia guttata) reference genome to determine the most likely position of each SNP. The 10 000 SNPs on the SNP-chip were selected to be distributed evenly across 31 chromosomes, giving on average one SNP per 100 000 bp. The SNP-chip was screened across 1968 individual house sparrows from four island populations. Of the original 10 000 SNPs, 7413 were found to be variable, and 99% of these SNPs were successfully called in at least 93% of all individuals. We used the SNP-chip to demonstrate the ability of such genome-wide marker data to detect population sub-division, and compared these results to similar analyses using microsatellites. The SNP-chip will be used to map Quantitative Trait Loci (QTL) for fitness-related phenotypic traits in natural populations. © 2013 Blackwell Publishing Ltd.
Yuan, Congying; Wang, Meinan; Skinner, Danniel Z; See, Deven R; Xia, Chongjing; Guo, Xinhong; Chen, Xianming
2018-01-01
Puccinia striiformis f. sp. tritici, the wheat stripe rust pathogen, is a dikaryotic, biotrophic, and macrocyclic fungus. Genetic study of P. striiformis f. sp. tritici virulence was not possible until the recent discovery of Berberis spp. and Mahonia spp. as alternate hosts. To determine inheritance of virulence and map virulence genes, a segregating population of 119 isolates was developed by self-fertilizing P. striiformis f. sp. tritici isolate 08-220 (race PSTv-11) on barberry leaves under controlled greenhouse conditions. The progeny isolates were phenotyped on a set of 29 wheat lines with single genes for race-specific resistance and genotyped with simple sequence repeat (SSR) markers, single nucleotide polymorphism (SNP) markers derived from secreted protein genes, and SNP markers from genotyping-by-sequencing (GBS). Using the GBS technique, 10,163 polymorphic GBS-SNP markers were identified. Clustering and principal component analysis grouped these markers into six genetic groups, and a genetic map, consisting of six linkage groups, was constructed with 805 markers. The six clusters or linkage groups resulting from these analyses indicated a haploid chromosome number of six in P. striiformis f. sp. tritici. Through virulence testing of the progeny isolates, the parental isolate was found to be homozygous for the avirulence loci corresponding to resistance genes Yr5, Yr10, Yr15, Yr24, Yr32, YrSP, YrTr1, Yr45, and Yr53 and homozygous for the virulence locus corresponding to resistance gene Yr41. Segregation was observed for virulence phenotypes in response to the remaining 19 single-gene lines. A single dominant gene or two dominant genes with different nonallelic gene interactions were identified for each of the segregating virulence phenotypes. Of 27 dominant virulence genes identified, 17 were mapped to two chromosomes. Markers tightly linked to some of the virulence loci may facilitate further studies to clone these genes. 
The virulence genes and their inheritance information are useful for understanding the host-pathogen interactions and for selecting effective resistance genes or gene combinations for developing stripe rust resistant wheat cultivars.
An integrated SNP mining and utilization (ISMU) pipeline for next generation sequencing data.
Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A V S K; Varshney, Rajeev K
2014-01-01
Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly require working knowledge of the command line interface, massive computational resources and expertise, which can be daunting for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline called Integrated SNP Mining and Utilization (ISMU) has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface for SNP discovery and SNP utilization in developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in the standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for members of the plant genetics and breeding community with no computational expertise who want to discover SNPs and utilize them in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. 
It has been developed in Java and is available at http://hpc.icrisat.cgiar.org/ISMU as free standalone software.
Park, Jung Hun; Jang, Hyowon; Jung, Yun Kyung; Jung, Ye Lim; Shin, Inkyung; Cho, Dae-Yeon; Park, Hyun Gyu
2017-05-15
We herein describe a new mass spectrometry-based method for multiplex SNP genotyping by utilizing allele-specific ligation and strand displacement amplification (SDA) reaction. In this method, allele-specific ligation is first performed to discriminate base sequence variations at the SNP site within the PCR-amplified target DNA. The primary ligation probe is extended by a universal primer annealing site while the secondary ligation probe has base sequences as an overhang with a nicking enzyme recognition site and complementary mass marker sequence. The ligation probe pairs are ligated by DNA ligase only at specific allele in the target DNA and the resulting ligated product serves as a template to promote the SDA reaction using a universal primer. This process isothermally amplifies short DNA fragments, called mass markers, to be analyzed by mass spectrometry. By varying the sizes of the mass markers, we successfully demonstrated the multiplex SNP genotyping capability of this method by reliably identifying several BRCA mutations in a multiplex manner with mass spectrometry. Copyright © 2016 Elsevier B.V. All rights reserved.
Automated tetraploid genotype calling by hierarchical clustering
USDA-ARS's Scientific Manuscript database
SNP arrays are transforming breeding and genetics research for autotetraploids. To fully utilize these arrays, however, the relationship between signal intensity and allele dosage must be inferred independently for each marker. We developed an improved computational method to automate this process, ...
AncestrySNPminer: A bioinformatics tool to retrieve and develop ancestry informative SNP panels
Amirisetty, Sushil; Khurana Hershey, Gurjit K.; Baye, Tesfaye M.
2012-01-01
A wealth of genomic information is available in public and private databases. However, this information is underutilized for uncovering population specific and functionally relevant markers underlying complex human traits. Given the huge amount of SNP data available from the annotation of human genetic variation, data mining is a faster and more cost effective approach for investigating the number of SNPs that are informative for ancestry. In this study, we present AncestrySNPminer, the first web-based bioinformatics tool specifically designed to retrieve Ancestry Informative Markers (AIMs) from genomic data sets and link these informative markers to genes and ontological annotation classes. The tool includes an automated and simple “scripting at the click of a button” functionality that enables researchers to perform various population genomics statistical analysis methods with user friendly querying and filtering of data sets across various populations through a single web interface. AncestrySNPminer can be freely accessed at https://research.cchmc.org/mershalab/AncestrySNPminer/login.php. PMID:22584067
Ren, Jing; Sun, Daokun; Chen, Liang; You, Frank M; Wang, Jirui; Peng, Yunliang; Nevo, Eviatar; Sun, Dongfa; Luo, Ming-Cheng; Peng, Junhua
2013-03-28
Evaluation of genetic diversity and genetic structure in crops has important implications for plant breeding programs and the conservation of genetic resources. Newly developed single nucleotide polymorphism (SNP) markers are effective in detecting genetic diversity. In the present study, a worldwide durum wheat collection consisting of 150 accessions was used. Genetic diversity and genetic structure were investigated using 946 polymorphic SNP markers covering the whole genome of tetraploid wheat. Genetic structure was greatly impacted by multiple factors, such as environmental conditions, breeding methods reflected by release periods of varieties, and gene flows via human activities. A loss of genetic diversity was observed from landraces and old cultivars to the modern cultivars released during periods of the Early Green Revolution, but an increase in cultivars released during the Post Green Revolution. Furthermore, a comparative analysis of genetic diversity among the 10 mega ecogeographical regions indicated that South America, North America, and Europe possessed the richest genetic variability, while the Middle East showed moderate levels of genetic diversity.
Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Bellato, Cláudia M; Motilal, Lambert; Zhang, Dapeng
2014-01-15
Cacao (Theobroma cacao L.), the source of cocoa, is an economically important tropical crop. One problem with the premium cacao market is contamination with off-types adulterating raw premium material. Accurate determination of the genetic identity of single cacao beans is essential for ensuring cocoa authentication. Using nanofluidic single nucleotide polymorphism (SNP) genotyping with 48 SNP markers, we generated SNP fingerprints for small quantities of DNA extracted from the seed coat of single cacao beans. On the basis of the SNP profiles, we identified an assumed adulterant variety, which was unambiguously distinguished from the authentic beans by multilocus matching. Assignment tests based on both Bayesian clustering analysis and allele frequency clearly separated all 30 authentic samples from the non-authentic samples. Distance-based principal coordinate analysis further supported these results. The nanofluidic SNP protocol, together with forensic statistical tools, is sufficiently robust to establish authentication and to verify gourmet cacao varieties. This method shows significant potential for practical application.
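The idea of multilocus matching of SNP fingerprints can be sketched roughly as below. This is an illustration only, not the authors' procedure: the function name `match_fraction`, the locus names, and the representation of genotypes as two-letter strings are all assumptions.

```python
def match_fraction(profile_a, profile_b):
    """Fraction of SNP loci, called in both samples, with identical genotypes.

    Profiles map a locus name to a two-letter genotype such as "AG",
    or to None when the genotype call failed (hypothetical convention).
    """
    shared = [
        locus
        for locus in profile_a
        if profile_a[locus] is not None and profile_b.get(locus) is not None
    ]
    if not shared:
        raise ValueError("no loci were called in both profiles")
    # Sort the alleles so that "AG" and "GA" count as the same genotype
    same = sum(
        1 for locus in shared
        if sorted(profile_a[locus]) == sorted(profile_b[locus])
    )
    return same / len(shared)


if __name__ == "__main__":
    reference = {"snp1": "AA", "snp2": "AG", "snp3": "CC"}
    bean = {"snp1": "AA", "snp2": "GA", "snp3": "CC"}
    suspect = {"snp1": "AA", "snp2": "GG", "snp3": None}
    print(match_fraction(reference, bean))     # identical at every shared locus
    print(match_fraction(reference, suspect))  # differs at one of two shared loci
```

A sample would be treated as a match to a reference variety only when the fraction is 1.0 (or above a tolerance chosen to allow for genotyping error), which is a design decision the abstract does not specify.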
Massa, Alicia N; Manrique-Carpintero, Norma C; Coombs, Joseph J; Zarka, Daniel G; Boone, Anne E; Kirk, William W; Hackett, Christine A; Bryan, Glenn J; Douches, David S
2015-09-14
The objective of this study was to construct a single nucleotide polymorphism (SNP)-based genetic map at the cultivated tetraploid level to locate quantitative trait loci (QTL) contributing to economically important traits in potato (Solanum tuberosum L.). The 156 F1 progeny and parents of a cross (MSL603) between "Jacqueline Lee" and "MSG227-2" were genotyped using the Infinium 8303 Potato Array. Furthermore, the progeny and parents were evaluated for foliar late blight reaction to isolates of the US-8 genotype of Phytophthora infestans (Mont.) de Bary and vine maturity. Linkage analyses and QTL mapping were performed using a novel approach that incorporates allele dosage information. The resulting genetic maps contained 1972 SNP markers with an average density of 1.36 markers per cM. QTL mapping identified the major source of late blight resistance in "Jacqueline Lee." The best SNP marker mapped ~0.54 Mb from a resistance hotspot on the long arm of chromosome 9. For vine maturity, the major-effect QTL was located on chromosome 5 with allelic effects from both parents. A candidate SNP marker for this trait mapped ~0.25 Mb from the StCDF1 gene, which is a candidate gene for the maturity trait. The identification of markers for P. infestans resistance will enable the introgression of multiple sources of resistance through marker-assisted selection. Moreover, the discovery of a QTL for late blight resistance not linked to the QTL for vine maturity provides the opportunity to use marker-assisted selection for resistance independent of the selection for vine maturity classifications. Copyright © 2015 Massa et al.
Massa, Alicia N.; Manrique-Carpintero, Norma C.; Coombs, Joseph J.; Zarka, Daniel G.; Boone, Anne E.; Kirk, William W.; Hackett, Christine A.; Bryan, Glenn J.; Douches, David S.
2015-01-01
The objective of this study was to construct a single nucleotide polymorphism (SNP)-based genetic map at the cultivated tetraploid level to locate quantitative trait loci (QTL) contributing to economically important traits in potato (Solanum tuberosum L.). The 156 F1 progeny and parents of a cross (MSL603) between “Jacqueline Lee” and “MSG227-2” were genotyped using the Infinium 8303 Potato Array. Furthermore, the progeny and parents were evaluated for foliar late blight reaction to isolates of the US-8 genotype of Phytophthora infestans (Mont.) de Bary and vine maturity. Linkage analyses and QTL mapping were performed using a novel approach that incorporates allele dosage information. The resulting genetic maps contained 1972 SNP markers with an average density of 1.36 markers per cM. QTL mapping identified the major source of late blight resistance in “Jacqueline Lee.” The best SNP marker mapped ∼0.54 Mb from a resistance hotspot on the long arm of chromosome 9. For vine maturity, the major-effect QTL was located on chromosome 5 with allelic effects from both parents. A candidate SNP marker for this trait mapped ∼0.25 Mb from the StCDF1 gene, which is a candidate gene for the maturity trait. The identification of markers for P. infestans resistance will enable the introgression of multiple sources of resistance through marker-assisted selection. Moreover, the discovery of a QTL for late blight resistance not linked to the QTL for vine maturity provides the opportunity to use marker-assisted selection for resistance independent of the selection for vine maturity classifications. PMID:26374597
Mousel, Michelle R.; Reynolds, James O.; White, Stephen N.
2015-01-01
Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component; therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1×10⁻⁵) were identified including markers in or near PIK3CB (P = 2.22×10⁻⁶; additive model), KCNB1 (P = 2.93×10⁻⁶; dominance model), ZC3H12C (P = 3.25×10⁻⁶; genotypic model), JPH1 (P = 4.68×10⁻⁶; genotypic model), and MYO3B (P = 5.74×10⁻⁶; recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection. PMID:26098909
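The four inheritance models named above differ in how a SNP genotype is turned into regression predictors. The sketch below shows one common coding, assuming genotypes are stored as minor-allele counts (0, 1 or 2). It does not reproduce PLINK's internals, and the function name `encode_genotype` is hypothetical.

```python
def encode_genotype(minor_allele_count, model):
    """Return the predictor tuple for one genotype under a given model.

    minor_allele_count: 0, 1 or 2 copies of the minor allele.
    model: "additive", "dominant", "recessive" or "genotypic".
    """
    g = minor_allele_count
    if g not in (0, 1, 2):
        raise ValueError("minor allele count must be 0, 1 or 2")
    if model == "additive":
        return (g,)                    # each copy shifts the log-odds equally
    if model == "dominant":
        return (1 if g >= 1 else 0,)   # one copy is enough for the effect
    if model == "recessive":
        return (1 if g == 2 else 0,)   # two copies are required
    if model == "genotypic":
        # Two degrees of freedom: additive term plus heterozygote deviation
        return (g, 1 if g == 1 else 0)
    raise ValueError(f"unknown model: {model}")


if __name__ == "__main__":
    for model in ("additive", "dominant", "recessive", "genotypic"):
        print(model, [encode_genotype(g, model) for g in (0, 1, 2)])
```

Because the genotypic model spends an extra degree of freedom, it can detect effects the single-term models miss, at the cost of power when the true effect is simply additive.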
2012-01-01
Background: High-density linkage maps facilitate the mapping of target genes and the construction of partial linkage maps around target loci to develop markers for marker-assisted selection (MAS). MAS is quite challenging in conifers because of their large, complex, and poorly-characterized genomes. Our goal was to construct a high-density linkage map to facilitate the identification of markers that are tightly linked to a major recessive male-sterile gene (ms1) for MAS in C. japonica, a species that is important in Japanese afforestation but which causes serious social pollinosis problems. Results: We constructed a high-density saturated genetic linkage map for C. japonica using expressed sequence-derived co-dominant single nucleotide polymorphism (SNP) markers, most of which were genotyped using the GoldenGate genotyping assay. A total of 1261 markers were assigned to 11 linkage groups with an observed map length of 1405.2 cM and a mean distance between two adjacent markers of 1.1 cM; the number of linkage groups matched the basic chromosome number in C. japonica. Using this map, we located ms1 on the 9th linkage group and constructed a partial linkage map around the ms1 locus. This enabled us to identify a marker (hrmSNP970_sf) that is closely linked to the ms1 gene, being separated from it by only 0.5 cM. Conclusions: Using the high-density map, we located the ms1 gene on the 9th linkage group and constructed a partial linkage map around the ms1 locus. The map distance between the ms1 gene and the tightly linked marker was only 0.5 cM. The identification of markers that are tightly linked to the ms1 gene will facilitate the early selection of male-sterile trees, which should expedite C. japonica breeding programs aimed at alleviating pollinosis problems without harming productivity. PMID:22424262
RNA-Seq identifies SNP markers for growth traits in rainbow trout.
Salem, Mohamed; Vallejo, Roger L; Leeds, Timothy D; Palti, Yniv; Liu, Sixin; Sabbagh, Annas; Rexroad, Caird E; Yao, Jianbo
2012-01-01
Fast growth is an important and highly desired trait, which affects the profitability of food animal production, with feed costs accounting for the largest proportion of production costs. Traditional phenotype-based selection is typically used to select for growth traits; however, genetic improvement is slow over generations. Single nucleotide polymorphisms (SNPs) explain 90% of the genetic differences between individuals; therefore, they are most suitable for genetic evaluation and strategies that employ molecular genetics for selective breeding. SNPs found within or near a coding sequence are of particular interest because they are more likely to alter the biological function of a protein. We aimed to use SNPs to identify markers and genes associated with genetic variation in growth. RNA-Seq whole-transcriptome analysis of pooled cDNA samples from a population of rainbow trout selected for improved growth versus unselected genetic cohorts (10 fish from 1 full-sib family each) identified SNP markers associated with growth-rate. The allelic imbalances (the ratio between the allele frequencies of the fast growing sample and that of the slow growing sample) were considered at scores >5.0 as an amplification and <0.2 as loss of heterozygosity. A subset of SNPs (n = 54) were validated and evaluated for association with growth traits in 778 individuals of a three-generation parent/offspring panel representing 40 families. Twenty-two SNP markers and one mitochondrial haplotype were significantly associated with growth traits. Polymorphism of 48 of the markers was confirmed in other commercially important aquaculture stocks. Many markers were clustered into genes of metabolic energy production pathways and are suitable candidates for genetic selection. The study demonstrates that RNA-Seq at low sequence coverage of divergent populations is a fast and effective means of identifying SNPs, with allelic imbalances between phenotypes. 
This technique is suitable for marker development in non-model species lacking complete and well-annotated genome reference sequences.
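The allelic imbalance score defined in the abstract (allele frequency in the fast-growing sample divided by that in the slow-growing sample, with cut-offs at 5.0 and 0.2) can be written as a small sketch. The function names and the "balanced" label for intermediate scores are assumptions added for illustration; only the ratio and the two thresholds come from the text.

```python
def allelic_imbalance(freq_fast, freq_slow):
    """Ratio of an allele's frequency in the fast-growing pool to the slow-growing pool."""
    if freq_slow <= 0:
        raise ValueError("slow-growing allele frequency must be positive")
    return freq_fast / freq_slow


def classify_imbalance(score, high=5.0, low=0.2):
    """Label a score using the thresholds quoted in the abstract."""
    if score > high:
        return "amplification"
    if score < low:
        return "loss of heterozygosity"
    return "balanced"  # hypothetical label for scores between the cut-offs


if __name__ == "__main__":
    print(classify_imbalance(allelic_imbalance(0.60, 0.10)))  # ratio near 6
    print(classify_imbalance(allelic_imbalance(0.05, 0.50)))  # ratio of 0.1
    print(classify_imbalance(allelic_imbalance(0.40, 0.35)))  # ratio near 1.14
```

Note that the two thresholds are reciprocals of each other, so the classification is symmetric with respect to which pool is placed in the numerator.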
2012-01-01
Background: Brassica oleracea encompasses a family of vegetables and cabbage that are among the most widely cultivated crops. In 2009, the B. oleracea Genome Sequencing Project was launched using next generation sequencing technology. None of the available maps were detailed enough to anchor the sequence scaffolds for the Genome Sequencing Project. This report describes the development of a large number of SSR and SNP markers from the whole genome shotgun sequence data of B. oleracea, and the construction of a high-density genetic linkage map using a doubled haploid mapping population. Results: The B. oleracea high-density genetic linkage map that was constructed includes 1,227 markers in nine linkage groups spanning a total of 1197.9 cM with an average of 0.98 cM between adjacent loci. There were 602 SSR markers and 625 SNP markers on the map. The chromosome with the highest number of markers (186) was C03, and the chromosome with the smallest number of markers (99) was C09. Conclusions: This first high-density map allowed the assembled scaffolds to be anchored to pseudochromosomes. The map also provides useful information for positional cloning, molecular breeding, and integration of information of genes and traits in B. oleracea. All the markers on the map will be transferable and could be used for the construction of other genetic maps. PMID:23033896
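The reported average spacing of 0.98 cM can be checked from the map length and marker count given above. The sketch below is a back-of-the-envelope calculation, not the authors' method; the function name `mean_marker_spacing` is hypothetical, and whether the authors divided by markers or by marker intervals is not stated, so both are shown.

```python
def mean_marker_spacing(map_length_cm, n_markers, n_linkage_groups=0):
    """Average distance between adjacent loci on a genetic map.

    With n_linkage_groups > 0, the divisor is the number of intervals
    (markers minus one per linkage group) instead of the marker count.
    """
    divisor = n_markers - n_linkage_groups
    if divisor <= 0:
        raise ValueError("need more markers than linkage groups")
    return map_length_cm / divisor


if __name__ == "__main__":
    # Figures from the B. oleracea abstract: 1197.9 cM, 1,227 markers, 9 groups
    per_marker = mean_marker_spacing(1197.9, 1227)
    per_interval = mean_marker_spacing(1197.9, 1227, n_linkage_groups=9)
    print(round(per_marker, 2), round(per_interval, 2))
```

Both conventions round to 0.98 cM for this map, so the reported figure is consistent with the stated map length and marker number either way.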
Development of genome-wide SNP assays for rice
USDA-ARS's Scientific Manuscript database
With the introduction of new sequencing technologies, single nucleotide polymorphisms (SNPs) are rapidly replacing simple sequence repeats (SSRs) as the DNA marker of choice for applications in plant breeding and genetics because they are more abundant, stable, amenable to automation, efficient, and...
Zhu, Qian-Hao; Spriggs, Andrew; Taylor, Jennifer M.; Llewellyn, Danny; Wilson, Iain
2014-01-01
Varietal single nucleotide polymorphisms (SNPs) are the differences within one of the two subgenomes between different tetraploid cotton varieties and have not been practically used in cotton genetics and breeding because they are difficult to identify due to low genetic diversity and very high sequence identity between homeologous genes in cotton. We have used transcriptome and restriction site−associated DNA sequencing to identify varietal SNPs among 18 G. hirsutum varieties based on the rationale that varietal SNPs can be more confidently called when flanked by subgenome-specific SNPs. Using transcriptome data, we successfully identified 37,413 varietal SNPs and, of these, 22,121 did not have an additional varietal SNP within their 20-bp flanking regions so can be used in most SNP genotyping assays. From restriction site−associated DNA sequencing data, we identified an additional 3090 varietal SNPs between two of the varieties. Of the 1583 successful SNP assays achieved using different genotyping platforms, 1363 were verified. Many of the SNPs behaved as dominant markers because of coamplification from homeologous loci, but the number of SNPs acting as codominant markers increased when one or more subgenome-specific SNP(s) were incorporated in their assay primers, giving them greater utility for breeding applications. A G. hirsutum genetic map with 1244 SNP markers was constructed covering 5557.42 centiMorgan and used to map qualitative and quantitative traits. This collection of G. hirsutum varietal SNPs complements existing intra-specific SNPs and provides the cotton community with a valuable marker resource applicable to genetic analyses and breeding programs. PMID:25106949
Rašić, Gordana; Filipović, Igor; Weeks, Andrew R; Hoffmann, Ary A
2014-04-11
Genetic markers are widely used to understand the biology and population dynamics of disease vectors, but often markers are limited in the resolution they provide. In particular, the delineation of population structure, fine scale movement and patterns of relatedness are often obscured unless numerous markers are available. To address this issue in the major arbovirus vector, the yellow fever mosquito (Aedes aegypti), we used double digest Restriction-site Associated DNA (ddRAD) sequencing for the discovery of genome-wide single nucleotide polymorphisms (SNPs). We aimed to characterize the new SNP set and to test the resolution against previously described microsatellite markers in detecting broad and fine-scale genetic patterns in Ae. aegypti. We developed bioinformatics tools that support the customization of restriction enzyme-based protocols for SNP discovery. We showed that our approach for RAD library construction achieves unbiased genome representation that reflects true evolutionary processes. In Ae. aegypti samples from three continents we identified more than 18,000 putative SNPs. They were widely distributed across the three Ae. aegypti chromosomes, with 47.9% found in intergenic regions and 17.8% in exons of over 2,300 genes. The patterns of their imputed effects in ORFs and UTRs were consistent with those found in a recent transcriptome study. We demonstrated that individual mosquitoes from Indonesia, Australia, Vietnam and Brazil can be assigned with a very high degree of confidence to their region of origin using a large SNP panel. We also showed that familial relatedness of samples from a 0.4 km² area could be confidently established with a subset of SNPs. Using a cost-effective customized RAD sequencing approach supported by our bioinformatics tools, we characterized over 18,000 SNPs in field samples of the dengue fever mosquito Ae. aegypti. The variants were annotated and positioned onto the three Ae. aegypti chromosomes. 
The new SNP set provided much greater resolution in detecting population structure and estimating fine-scale relatedness than a set of polymorphic microsatellites. RAD-based markers demonstrate great potential to advance our understanding of mosquito population processes, critical for implementing new control measures against this major disease vector.
Muchero, Wellington; Diop, Ndeye N; Bhat, Prasanna R; Fenton, Raymond D; Wanamaker, Steve; Pottorff, Marti; Hearne, Sarah; Cisse, Ndiaga; Fatokun, Christian; Ehlers, Jeffrey D; Roberts, Philip A; Close, Timothy J
2009-10-27
Consensus genetic linkage maps provide a genomic framework for quantitative trait loci identification, map-based cloning, assessment of genetic diversity, association mapping, and applied breeding in marker-assisted selection schemes. Among "orphan crops" with limited genomic resources such as cowpea [Vigna unguiculata (L.) Walp.] (2n = 2x = 22), the use of transcript-derived SNPs in genetic maps provides opportunities for automated genotyping and estimation of genome structure based on synteny analysis. Here, we report the development and validation of a high-throughput EST-derived SNP assay for cowpea, its application in consensus map building, and determination of synteny to reference genomes. SNP mining from 183,118 ESTs sequenced from 17 cDNA libraries yielded approximately 10,000 high-confidence SNPs from which an Illumina 1,536-SNP GoldenGate genotyping array was developed and applied to 741 recombinant inbred lines from six mapping populations. Approximately 90% of the SNPs were technically successful, providing 1,375 dependable markers. Of these, 928 were incorporated into a consensus genetic map spanning 680 cM with 11 linkage groups and an average marker distance of 0.73 cM. Comparison of this cowpea genetic map to reference legumes, soybean (Glycine max) and Medicago truncatula, revealed extensive macrosynteny encompassing 85 and 82%, respectively, of the cowpea map. Regions of soybean genome duplication were evident relative to the simpler diploid cowpea. Comparison with Arabidopsis revealed extensive genomic rearrangement with some conserved microsynteny. These results support evolutionary closeness between cowpea and soybean and identify regions for synteny-based functional genomics studies in legumes.
Lipphardt, Mark F; Deryal, Mustafa; Ong, Mei Fang; Schmidt, Werner; Mahlknecht, Ulrich
2013-01-01
Estrogen and progesterone hormones are key regulators of a wide variety of biological processes. In addition to their influence on reproduction, cell differentiation and apoptosis, they affect inflammatory response and cell metabolism and, most importantly, they regulate physiological breast tissue proliferation and differentiation as well as the development and progression of breast cancer. In order to assess whether genetic variants in the steroid hormone receptor gene ESR1 (estrogen receptor alpha) had an effect on sporadic breast cancer susceptibility, we assessed 7 ESR1 single nucleotide polymorphisms (SNPs) for associations with breast cancer susceptibility and clinical parameters in 221 breast cancer patients and 221 controls. We identified ESR1 intron SNP +2464 C/T (rs3020314) and ESR1 intron SNP -4576 A/C (rs1514348) to correlate with breast cancer susceptibility and progesterone receptor expression status. Patients genotyped CT for ESR1 intron SNP +2464 (rs3020314) (p ≤ 0.045) or genotyped AC for ESR1 intron SNP -4576 (rs1514348) (p ≤ 0.000026) were found to carry a significant risk of developing breast cancer in the Central European Caucasian population (both together: p ≤ 0.000488). Our study confirmed previous associations and revealed new associations of SNP rs1514348 with susceptibility to breast cancer and clinical outcome, which might be used as new SNP markers.
High-throughput SNP-genotyping analysis of the relationships among Ponto-Caspian sturgeon species
Rastorguev, Sergey M; Nedoluzhko, Artem V; Mazur, Alexander M; Gruzdeva, Natalia M; Volkov, Alexander A; Barmintseva, Anna E; Mugue, Nikolai S; Prokhortchouk, Egor B
2013-01-01
Legally certified sturgeon fisheries require population protection and conservation methods, including DNA tests to identify the source of valuable sturgeon roe. However, the available genetic data are insufficient to distinguish between different sturgeon populations, and are even unable to distinguish between some species. We performed high-throughput single-nucleotide polymorphism (SNP)-genotyping analysis on different populations of Russian (Acipenser gueldenstaedtii), Persian (A. persicus), and Siberian (A. baerii) sturgeon species from the Caspian Sea region (Volga and Ural Rivers), the Azov Sea, and two Siberian rivers. We found that Russian sturgeons from the Volga and Ural Rivers were essentially indistinguishable, but they differed from Russian sturgeons in the Azov Sea, and from Persian and Siberian sturgeons. We identified eight SNPs that were sufficient to distinguish these sturgeon populations with 80% confidence, and allowed the development of markers to distinguish sturgeon species. Finally, on the basis of our SNP data, we propose that the A. baerii-like mitochondrial DNA found in some Russian sturgeons from the Caspian Sea arose via an introgression event during the Pleistocene glaciation. In the present study, the high-throughput genotyping analysis of several sturgeon populations was performed. SNP markers for species identification were defined. The possible explanation of the baerii-like mitotype presence in some Russian sturgeons in the Caspian Sea was suggested. PMID:24567827
Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David
2018-04-11
Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. 
It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.
A high-density genetic map of Arachis duranensis, a diploid ancestor of cultivated peanut
2012-01-01
Background: Cultivated peanut (Arachis hypogaea) is an allotetraploid species whose ancestral genomes are most likely derived from the A-genome species, A. duranensis, and the B-genome species, A. ipaensis. The very recent (several millennia) evolutionary origin of A. hypogaea has imposed a bottleneck for allelic and phenotypic diversity within the cultigen. However, wild diploid relatives are a rich source of alleles that could be used for crop improvement and their simpler genomes can be more easily analyzed while providing insight into the structure of the allotetraploid peanut genome. The objective of this research was to establish a high-density genetic map of the diploid species A. duranensis based on de novo generated EST databases. Arachis duranensis was chosen for mapping because it is the A-genome progenitor of cultivated peanut and also in order to circumvent the confounding effects of gene duplication associated with allopolyploidy in A. hypogaea. Results: More than one million expressed sequence tag (EST) sequences generated from normalized cDNA libraries of A. duranensis were assembled into 81,116 unique transcripts. Mining this dataset, 1236 EST-SNP markers were developed between two A. duranensis accessions, PI 475887 and Grif 15036. An additional 300 SNP markers also were developed from genomic sequences representing conserved legume orthologs. Of the 1536 SNP markers, 1054 were placed on a genetic map. In addition, 598 EST-SSR markers identified in A. hypogaea assemblies were included in the map along with 37 disease resistance gene candidate (RGC) and 35 other previously published markers. In total, 1724 markers spanning 1081.3 cM over 10 linkage groups were mapped. Gene sequences that provided mapped markers were annotated using similarity searches in three different databases, and gene ontology descriptions were determined using the Medicago Gene Atlas and TAIR databases. Synteny analysis between A. duranensis, Medicago and Glycine revealed significant stretches of conserved gene clusters spread across the peanut genome. A higher level of colinearity was detected between A. duranensis and Glycine than with Medicago. Conclusions: The first high-density, gene-based linkage map for A. duranensis was generated that can serve as a reference map for both wild and cultivated Arachis species. The markers developed here are valuable resources for the peanut, and more broadly, to the legume research community. The A-genome map will have utility for fine mapping in other peanut species and has already had application for mapping a nematode resistance gene that was introgressed into A. hypogaea from A. cardenasii. PMID:22967170
Idrissi, Omar; Udupa, Sripada M.; De Keyser, Ellen; McGee, Rebecca J.; Coyne, Clarice J.; Saha, Gopesh C.; Muehlbauer, Fred J.; Van Damme, Patrick; De Riek, Jan
2016-01-01
Drought is one of the major abiotic stresses limiting lentil productivity in rainfed production systems. Specific rooting patterns can be associated with drought avoidance mechanisms that can be used in lentil breeding programs. In all, 252 co-dominant and dominant markers were used for Quantitative Trait Loci (QTL) analysis on 132 lentil recombinant inbred lines based on greenhouse experiments for root and shoot traits during two seasons under progressive drought-stressed conditions. Eighteen QTLs controlling a total of 14 root and shoot traits were identified. A QTL-hotspot genomic region related to a number of root and shoot characteristics associated with drought tolerance such as dry root biomass, root surface area, lateral root number, dry shoot biomass and shoot length was identified. Interestingly, a QTL (QRSratioIX-2.30) related to root-shoot ratio, an important trait for drought avoidance, explaining the highest phenotypic variance of 27.6 and 28.9% for the two consecutive seasons, respectively, was detected. This QTL was close to the co-dominant SNP marker TP6337 and also flanked by the two SNP markers TP518 and TP1280. An important QTL (QLRNIII-98.64) related to lateral root number was found close to TP3371 and flanked by the TP5093 and TP6072 SNP markers. Also, a QTL (QSRLIV-61.63) associated with specific root length was identified close to TP1873 and flanked by the F7XEM6b SRAP marker and the TP1035 SNP marker. These two QTLs were detected in both seasons. Our results could be used for marker-assisted selection in lentil breeding programs targeting root and shoot characteristics conferring drought avoidance as an efficient alternative to slow and labor-intensive conventional breeding methods. PMID:27602034
Terracciano, Irma; Maccaferri, Marco; Bassi, Filippo; Mantovani, Paola; Sanguineti, Maria C; Salvi, Silvio; Simková, Hana; Doležel, Jaroslav; Massi, Andrea; Ammar, Karim; Kolmer, James; Tuberosa, Roberto
2013-04-01
Leaf rust (Puccinia triticina Eriks. & Henn.) is a major disease affecting durum wheat production. The Lr14a resistance gene present in the durum wheat cv. Creso and its derivative cv. Colosseo is one of the best characterized leaf-rust resistance sources deployed in durum wheat breeding. Lr14a has been mapped close to the simple sequence repeat markers gwm146, gwm344 and wmc10 in the distal portion of the chromosome arm 7BL, a gene-dense region. The objectives of this study were: (1) to enrich the Lr14a region with single nucleotide polymorphisms (SNPs) and high-resolution melting (HRM)-based markers developed from conserved ortholog set (COS) genes and from sequenced Diversity Array Technology (DArT®) markers; (2) to further investigate the gene content and colinearity of this region with the Brachypodium and rice genomes. Ten new COS-SNP and five HRM markers were mapped within an 8.0 cM interval spanning Lr14a. Two HRM markers pinpointed the locus in an interval of <1.0 cM and eight COS-SNPs were mapped 2.1-4.1 cM distal to Lr14a. Each marker was tested for its capacity to predict the state of Lr14a alleles (in particular, Lr14-Creso associated with resistance) in a panel of durum wheat elite germplasm including 164 accessions. Two of the most informative markers were converted into KASPar® markers. Single assay markers ubw14 and wPt-4038-HRM designed for agarose gel electrophoresis/KASPar® assays and high-resolution melting analysis, respectively, as well as the double-marker combinations ubw14/ubw18, ubw14/ubw35 and wPt-4038-HRM-ubw35 will be useful for germplasm haplotyping and for molecular-assisted breeding.
Upadhyaya, Hari D; Wang, Yi-Hong; Sharma, Rajan; Sharma, Shivali
2013-06-01
Anthracnose in sorghum caused by Colletotrichum sublineolum is one of the most destructive diseases affecting sorghum production under warm and humid conditions. Markers and genes linked to resistance to the disease are important for plant breeding. Using 14,739 SNP markers, we have mapped eight loci linked to resistance in sorghum through association analysis of a sorghum mini-core collection consisting of 242 diverse accessions evaluated for anthracnose resistance for 2 years in the field. The mini-core was representative of the International Crops Research Institute for the Semi-Arid Tropics' world-wide sorghum landrace collection. Eight marker loci were associated with anthracnose resistance in both years. Except locus 8, disease resistance-related genes were found in all loci based on their physical distance from linked SNP markers. These include two NB-ARC class of R genes on chromosome 10 that were partially homologous to the rice blast resistance gene Pib, two hypersensitive response-related genes: autophagy-related protein 3 on chromosome 1 and 4 harpin-induced 1 (Hin1) homologs on chromosome 8, a RAV transcription factor that is also part of R gene pathway, an oxysterol-binding protein that functions in the non-specific host resistance, and homologs of menthone:neomenthol reductase (MNR) that catalyzes a menthone reduction to produce the antimicrobial neomenthol. These genes and markers may be developed into molecular tools for genetic improvement of anthracnose resistance in sorghum.
Translational genomics for analysis of complex traits in peanut and sorghum
USDA-ARS's Scientific Manuscript database
The integration of sequencing and genotype data from natural variation studies (by whole genome resequencing [wgs] or genotype by sequencing [gbs]), transcriptome (RNA-seq) and mutant analysis (also by wgs) facilitated the development of DNA markers in the form of single nucleotide polymorphic (SNP)...
USDA-ARS's Scientific Manuscript database
High-density single nucleotide polymorphism (SNP) genotyping chips are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships among individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array includ...
A review on SNP and other types of molecular markers and their use in animal genetics
Vignal, Alain; Milan, Denis; SanCristobal, Magali; Eggen, André
2002-01-01
During the last ten years, the use of molecular markers, revealing polymorphism at the DNA level, has been playing an increasing part in animal genetics studies. Amongst others, the microsatellite DNA marker has been the most widely used, due to its easy use by simple PCR, followed by a denaturing gel electrophoresis for allele size determination, and to the high degree of information provided by its large number of alleles per locus. Despite this, a new marker type, named SNP, for Single Nucleotide Polymorphism, is now on the scene and has gained high popularity, even though it is only a bi-allelic type of marker. In this review, we will discuss the reasons for this apparent step backwards, and the pertinence of the use of SNPs in animal genetics, in comparison with other marker types. PMID:12081799
USDA-ARS's Scientific Manuscript database
Plants must respond to environmental cues and schedule their development in order to react to periods of abiotic stress and commit fully to growth and reproduction under favorable conditions. This study was initiated to identify SNP markers for characters expressed from the seedling stage to plant m...
Buitenhuis, Bart; Poulsen, Nina A; Larsen, Lotte B; Sehested, Jakob
2015-05-21
Bovine milk provides important minerals, essential for human nutrition and dairy product quality. For changing the mineral composition of the milk to improve dietary needs in human nutrition and technological properties of milk, a thorough understanding of the genetics underlying milk mineral contents is important. Therefore the aim of this study was to 1) estimate the genetic parameters for individual minerals in Danish Holstein (DH) (n=371) and Danish Jersey (DJ) (n=321) milk, and 2) detect genomic regions associated with mineral content in the milk using a genome-wide association study (GWAS) approach. For DH, high heritabilities were found for Ca (0.72), Zn (0.49), and P (0.46), while for DJ, high heritabilities were found for Ca (0.63), Zn (0.57), and Mg (0.57). Furthermore, intermediate heritabilities were found for Cu in DH, and for K, Na, P and Se in the DJ. The GWAS revealed a total of 649 significant SNP markers detected for Ca (24), Cu (90), Fe (111), Mn (3), Na (1), P (4), Se (12) and Zn (404) in DH, while for DJ, a total of 787 significant SNP markers were detected for Ca (44), Fe (43), K (498), Na (4), Mg (1), P (94) and Zn (3). Comparing the list of significant markers between DH and DJ revealed that the SNP ARS-BFGL-NGS-4939 was common in both breeds for Zn. This SNP marker is closely linked to the DGAT1 gene. Even though we found significant SNP markers on BTA14 in both DH and DJ for Ca and Fe, these significant SNPs did not overlap. The results show that Ca, Zn, P and Mg show high heritabilities. In combination with the GWAS results this opens up possibilities to select for specific minerals in bovine milk.
Tsuruta, S; Lourenco, D A L; Misztal, I; Lawlor, T J
2015-08-01
The objective of this study was to investigate genotype by environment interactions for culling rates and milk production in large and small dairy herds in 3 US regions, using genotypes, pedigree, and phenotypes. Single nucleotide polymorphism (SNP) marker variances were also estimated in these different environments. Culling rates including cow mortality were based on 6 Dairy Herd Improvement termination codes reported by dairy producers. Separate data sets for culling rates and 305-d milk yield were created for large and small dairy herds in the US regions of the Southeast (SE), Southwest (SW), and Northeast (NE) for the first 3 lactation cows that calved between 1999 and 2008. Genomic information from 42,503 SNP markers on 34,506 bulls was included in the analysis to predict genomic estimated breeding value (GEBV) of culling rates and 305-d milk yield with a single-step genomic BLUP using a bivariate threshold-linear model. Cow replacement rates in large SE and NE herds were higher. Heritability estimates of culling rates ranged from 0.03 to 0.11, but the differences were small between large and small herds and among the 3 US regions. Genetic correlations between culling rates and 305-d milk yield were medium to high for cows sold for poor production and reproduction problems. Correlations of GEBV for culling rates among the 3 US regions ranged from 0.34 to 0.92 and were lower between the SW and the other regions, especially in small herds. Correlations of GEBV between large and small herds ranged from 0.44 to 0.90 and were lower in the SW. These results indicate genotype by environment interactions of cow culling rate between the US regions and between large and small herds. Correlations of top 30 SNP marker effects for culling rates between 2 US regions ranged from 0.64 to 0.98 and were higher than those of more SNP marker effects except for a culling reason "sold for dairy purpose." Those correlations between large and small herds ranged from 0.67 to 0.98. 
High correlations of top SNP marker effects on culling reasons between the US regions and between large and small herds suggest that major markers can be useful for selection in different environments. The SNP variance shown in a marker gene segment on chromosome 14 was strongly associated with milk production in large and small herds in the NE but not in the SE and SW. Marker genes on chromosome 14 also showed a strong association with cow culling rates due to poor production and mortality in large herds in the NE. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Identification of SNP Haplotypes and Prospects of Association Mapping in Watermelon
USDA-ARS?s Scientific Manuscript database
Watermelon is the fifth most economically important vegetable crop cultivated world-wide. Implementing Single Nucleotide Polymorphism (SNP) marker technology in watermelon breeding and germplasm evaluation programs holds a key to improve horticulturally important traits. Next-generation sequencing...
The polymorphisms of bovine VEGF gene and their associations with growth traits in Chinese cattle.
Pang, Yonghong; Wang, Juqiang; Zhang, Chunlei; Lei, Chuzhao; Lan, Xianyong; Yue, Wangping; Gu, Chuanwen; Chen, Danxia; Chen, Hong
2011-02-01
PCR-SSCP and DNA sequencing methods were employed to screen the genetic variation of VEGF gene in 671 individuals belonging to three Chinese indigenous cattle breeds including Nanyang, Jiaxian Red and Qinchuan. Three haplotypes (A, B and C), four observed genotypes (AA, AB, BB and AC) and three new SNPs (6765T>C ss130456744, 6860A>G ss130456745, 6893T>C ss130456746) were detected. The analysis suggested that one SNP (ss130456744) in the bovine VEGF gene had significant effects on birth weight, body weight and heart girth at 6 months old in the Nanyang breed (P < 0.05). The results showed that the SNP (ss130456744) in intron 2 of the VEGF gene is associated with early development and growth of Chinese cattle. These findings raise hope that this polymorphism can be a molecular breeding marker in breeding strategies through marker assisted selection (MAS) in Chinese domestic cattle.
An Integrated SNP Mining and Utilization (ISMU) Pipeline for Next Generation Sequencing Data
Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M.; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A. V. S. K.; Varshney, Rajeev K.
2014-01-01
Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly require working knowledge of the command line interface, massive computational resources and expertise, which is daunting for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is useful for members of the plant genetics and breeding community with no computational expertise who wish to discover SNPs and use them in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. 
It has been developed in Java and is available at http://hpc.icrisat.cgiar.org/ISMU as standalone free software. PMID:25003610
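The polymorphism information content (PIC) value mentioned above can be illustrated with a minimal sketch. The abstract does not say how ISMU computes it, so the code below assumes the commonly used per-locus formula PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2; the function name `pic` is hypothetical and is not part of ISMU.

```python
def pic(allele_freqs):
    """Polymorphism information content for one locus.

    Assumption: the widely used formula
        PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    where p_i are the allele frequencies at the locus.
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Expected homozygosity term
    sum_sq = sum(p * p for p in allele_freqs)
    # Correction for uninformative matings between identical heterozygotes
    cross = 0.0
    n = len(allele_freqs)
    for i in range(n - 1):
        for j in range(i + 1, n):
            cross += 2.0 * allele_freqs[i] ** 2 * allele_freqs[j] ** 2
    return 1.0 - sum_sq - cross


if __name__ == "__main__":
    # A bi-allelic SNP is most informative when both alleles are equally common.
    print(pic([0.5, 0.5]))
    print(pic([0.9, 0.1]))
```

Under this formula a bi-allelic SNP cannot exceed a PIC of 0.375, which is one reason SNP panels compensate with larger numbers of markers.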
NASA Astrophysics Data System (ADS)
He, Feng; Wen, Haishen; Yu, Dahui; Li, Jifang; Shi, Bao; Chen, Caifang; Zhang, Jiaren; Jin, Guoxiong; Chen, Xiaoyan; Shi, Dan; Yang, Yanping
2010-12-01
Follicle stimulating hormone β (FSHβ) of Japanese flounder (Paralichthys olivaceus) plays a key role in the regulation of gonadal development. This study aimed to investigate molecular genetic characteristics of the FSHβ gene and elucidate the effects of single nucleotide polymorphisms (SNPs) of FSHβ on reproductive traits in Japanese flounder. We used polymerase chain reaction single-strand conformation polymorphism (PCR-SSCP) and sequencing of the FSHβ gene in 60 individuals. We identified only one SNP (T/C) in the coding region of exon 3 of FSHβ. The SNP (T/C), at position 340 bp of the FSHβ gene, did not lead to an amino acid change. Statistical analysis showed that the SNP was significantly associated with testosterone (T) level and gonadosomatic index (GSI) (P < 0.05). Individuals with genotype TC of the SNP had significantly higher serum T levels and GSI (P < 0.05) than those with genotype CC. Therefore, the FSHβ gene could be a useful molecular marker in selection for prominent reproductive traits in Japanese flounder.
Johnson, Katherine A; Barry, Edwina; Lambert, David; Fitzgerald, Michael; McNicholas, Fiona; Kirley, Aiveen; Gill, Michael; Bellgrove, Mark A; Hawi, Ziarih
2013-12-01
A naturalistic, prospective study of the influence of genetic variation on dose prescribed, clinical response, and side effects related to stimulant medication in 77 children with attention-deficit/hyperactivity disorder (ADHD) was undertaken. The influence of genetic variation of the CES1 gene coding for carboxylesterase 1A1 (CES1A1), the major enzyme responsible for the first-pass, stereoselective metabolism of methylphenidate, was investigated. Parent- and teacher-rated behavioral questionnaires were collected at baseline when the children were medication naïve, and again at 6 weeks while they were on medication. Medication dose, prescribed at the discretion of the treating clinician, and side effects, were recorded at week 6. Blood and saliva samples were collected for genotyping. Single nucleotide polymorphisms (SNPs) were selected in the coding, non-coding and the 3' flanking region of the CES1 gene. Genetic association between CES1 variants and ADHD was investigated in an expanded sample of 265 Irish ADHD families. Analyses were conducted using analysis of covariance (ANCOVA) and logistic regression models. None of the CES1 gene variants were associated with the dose of methylphenidate provided or the clinical response recorded at the 6 week time point. An association between two CES1 SNP markers and the occurrence of sadness as a side effect of short-acting methylphenidate was found. The two associated CES1 markers were in linkage disequilibrium and were significantly associated with ADHD in a larger sample of ADHD trios. The associated CES1 markers were also in linkage disequilibrium with two SNP markers of the noradrenaline transporter gene (SLC6A2). This study found an association between two CES1 SNP markers and the occurrence of sadness as a side effect of short-acting methylphenidate. These markers were in linkage disequilibrium together and with two SNP markers of the noradrenaline transporter gene.
Association mapping of stem rust race TTKSK resistance in US barley breeding germplasm.
Zhou, H; Steffenson, B J; Muehlbauer, Gary; Wanyera, Ruth; Njau, Peter; Ndeda, Sylvester
2014-06-01
Loci conferring resistance to the highly virulent African stem rust race TTKSK were identified in advanced barley breeding germplasm and positioned to chromosomes 5H and 7H using an association mapping approach. African races of the stem rust pathogen (Puccinia graminis f. sp. tritici) are a serious threat to barley production worldwide because of their wide virulence. To discover and characterize resistance to African stem rust race TTKSK in US barley breeding germplasm, over 3,000 lines/cultivars were assessed for resistance at the seedling stage in the greenhouse and also the adult plant stage in the field in Kenya. Only 12 (0.3 %) and 64 (2.1 %) lines exhibited a resistance level comparable to the resistant control at the seedling and adult plant stage, respectively. To map quantitative trait loci (QTL) for resistance to race TTKSK, an association mapping approach was conducted, utilizing 3,072 single nucleotide polymorphism (SNP) markers. At the seedling stage, two neighboring SNP markers (0.8 cM apart) on chromosome 7H (11_21491 and 12_30528) were found significantly associated with resistance. The most significant one found was 12_30528; thus, the resistance QTL was named Rpg-qtl-7H-12_30528. At the adult plant stage, two SNP markers on chromosome 5H (11_11355 and 12_31427) were found significantly associated with resistance. This resistance QTL was named Rpg-qtl-5H-11_11355 for the most significant marker identified. Adult plant resistance is of paramount importance for stem rust. The marker associated with Rpg-qtl-5H-11_11355 for adult plant resistance explained only a small portion of the phenotypic variation (0.02); however, this QTL reduced disease severity up to 55.0 % under low disease pressure and up to 21.1 % under heavy disease pressure. SNP marker 11_11355 will be valuable for marker-assisted selection of adult plant stem rust resistance in barley breeding.
USDA-ARS's Scientific Manuscript database
Aluminum (Al) toxicity is an important abiotic stress that affects soybean production in acidic soils. Development of Al-tolerant cultivars is an efficient and environmentally friendly solution to the problem. Effective selection of Al-tolerant genotypes in applied breeding requires an understanding...
Discovery of 100K SNP array and its utilization in sugarcane
USDA-ARS's Scientific Manuscript database
Next generation sequencing (NGS) enables us to identify thousands of single nucleotide polymorphism (SNP) markers for genotyping and fingerprinting. However, the process requires very precise bioinformatics analysis and filtering. High throughput SNP array with predefined genomic location co...
A ddRAD Based Linkage Map of the Cultivated Strawberry, Fragaria xananassa
Davik, Jahn; Sargent, Daniel James; Brurberg, May Bente; Lien, Sigbjørn; Kent, Matthew; Alsheikh, Muath
2015-01-01
The cultivated strawberry (Fragaria ×ananassa Duch.) is an allo-octoploid considered difficult to disentangle genetically due to its four relatively similar sub-genomic chromosome sets. This has been alleviated by the recent release of the strawberry IStraw90 whole genome genotyping array. However, array resolution relies on the genotypes used in the array construction and may be of limited general use. SNP detection based on reduced genomic sequencing approaches has the potential of providing better coverage in cases where the studied genotypes are only distantly related to the SNP array's construction foundation. Here we have used double digest restriction-associated DNA sequencing (ddRAD) to identify SNPs in a 145 seedling F1 hybrid population raised from the cross between the cultivars Sonata (♀) and Babette (♂). A linkage map containing 907 markers was constructed, spanning 1,581.5 cM across 31 linkage groups representing the 28 chromosomes of the species. Comparing the physical span of the SNP markers with the F. vesca genome sequence, the linkage groups resolved covered 79% of the estimated 830 Mb of the F. ×ananassa genome. Here, we have developed the first linkage map for F. ×ananassa using ddRAD and show that this technique and other related techniques are useful tools for linkage map development and downstream genetic studies in the octoploid strawberry. PMID:26398886
A Bayesian antedependence model for whole genome prediction.
Yang, Wenzhao; Tempelman, Robert J
2012-04-01
Hierarchical mixed effects models have been demonstrated to be powerful for predicting genomic merit of livestock and plants, on the basis of high-density single-nucleotide polymorphism (SNP) marker panels, and their use is being increasingly advocated for genomic predictions in human health. Two particularly popular approaches, labeled BayesA and BayesB, are based on specifying all SNP-associated effects to be independent of each other. BayesB extends BayesA by allowing a large proportion of SNP markers to be associated with null effects. We further extend these two models to specify SNP effects as being spatially correlated due to the chromosomally proximal effects of causal variants. These two models, which we dub ante-BayesA and ante-BayesB, respectively, are based on a first-order nonstationary antedependence specification between SNP effects. In a simulation study involving 20 replicate data sets, each analyzed at six different SNP marker densities with average LD levels ranging from r² = 0.15 to 0.31, the antedependence methods had significantly (P < 0.01) higher accuracies than their corresponding classical counterparts at higher LD levels (r² > 0.24) with differences exceeding 3%. A cross-validation study was also conducted on the heterogeneous stock mice data resource (http://mus.well.ox.ac.uk/mouse/HS/) using 6-week body weights as the phenotype. The antedependence methods increased cross-validation prediction accuracies by up to 3.6% compared to their classical counterparts (P < 0.001). Finally, we applied our method to other benchmark data sets and demonstrated that the antedependence methods were more accurate than their classical counterparts for genomic predictions, even for individuals several generations beyond the training data.
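The first-order antedependence structure between SNP effects can be illustrated with a minimal simulation sketch. It assumes the effect of SNP j is written as g[j] = t * g[j-1] + d[j], where t is an antedependence coefficient and d[j] is an independent innovation. The constant t, the normal innovations, and the function names are illustrative assumptions only; the models in the abstract are nonstationary (the coefficient may differ between marker pairs) and are fitted in a Bayesian framework rather than simulated forward.

```python
import random


def simulate_ante_effects(n_snps, t=0.5, innovation_sd=1.0, seed=0):
    """Simulate SNP effects along a chromosome with first-order antedependence.

    Assumed form (illustrative): g[0] = d[0]; g[j] = t * g[j-1] + d[j],
    with d[j] drawn independently from N(0, innovation_sd^2).
    Setting t = 0 recovers independent SNP effects.
    """
    rng = random.Random(seed)
    effects = []
    previous = 0.0
    for _ in range(n_snps):
        innovation = rng.gauss(0.0, innovation_sd)
        current = t * previous + innovation
        effects.append(current)
        previous = current
    return effects


def lag1_correlation(values):
    """Pearson correlation between each effect and its right-hand neighbour."""
    x, y = values[:-1], values[1:]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


if __name__ == "__main__":
    independent = simulate_ante_effects(5000, t=0.0, seed=2)
    correlated = simulate_ante_effects(5000, t=0.8, seed=2)
    print(lag1_correlation(independent), lag1_correlation(correlated))
```

With t = 0 neighbouring effects are uncorrelated, as in the classical independent-effects specification; a nonzero t induces correlation between chromosomally adjacent markers.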
Kazusa Marker DataBase: a database for genomics, genetics, and molecular breeding in plants.
Shirasawa, Kenta; Isobe, Sachiko; Tabata, Satoshi; Hirakawa, Hideki
2014-09-01
In order to provide useful genomic information for agronomical plants, we have established a database, the Kazusa Marker DataBase (http://marker.kazusa.or.jp). This database includes information on DNA markers, e.g., SSR and SNP markers, genetic linkage maps, and physical maps, that were developed at the Kazusa DNA Research Institute. Keyword searches for the markers, sequence data used for marker development, and experimental conditions are also available through this database. Currently, 10 plant species have been targeted: tomato (Solanum lycopersicum), pepper (Capsicum annuum), strawberry (Fragaria × ananassa), radish (Raphanus sativus), Lotus japonicus, soybean (Glycine max), peanut (Arachis hypogaea), red clover (Trifolium pratense), white clover (Trifolium repens), and eucalyptus (Eucalyptus camaldulensis). In addition, the number of plant species registered in this database will be increased as our research progresses. The Kazusa Marker DataBase will be a useful tool for both basic and applied sciences, such as genomics, genetics, and molecular breeding in crops.
Genome-wide association study for host response to bovine leukemia virus in Holstein cows.
Brym, P; Bojarojć-Nosowicz, B; Oleński, K; Hering, D M; Ruść, A; Kaczmarczyk, E; Kamiński, S
2016-07-01
The mechanisms of leukemogenesis induced by bovine leukemia virus (BLV) and the processes underlying the phenomenon of differential host response to BLV infection still remain poorly understood. The aim of the study was to screen the entire cattle genome to identify markers and candidate genes that might be involved in host response to bovine leukemia virus infection. A genome-wide association study was performed using Holstein cows naturally infected by BLV. The data set included 43 cows (BLV positive) and 30 cows (BLV negative) genotyped for 54,609 SNP markers (Illumina Bovine SNP50 BeadChip). The BLV status of cows was determined by serum ELISA, nested-PCR and hematological counts. Linear Regression Analysis with a False Discovery Rate and kinship matrix (computed on the autosomal SNPs) was used to find out which SNP markers significantly differentiate BLV-positive and BLV-negative cows. Nine markers reached genome-wide significance. The most significant SNPs were located on chromosomes 23 (rs41583098), 3 (rs109405425, rs110785500) and 8 (rs43564499) in close vicinity of the patatin-like phospholipase domain containing 1 (PNPLA1); adaptor-related protein complex 4, beta 1 subunit (AP4B1); tripartite motif-containing 45 (TRIM45) and cell division cycle associated 2 (CDCA2) genes, respectively. Furthermore, a list of 41 candidate genes was composed based on their proximity to significant markers (within a distance of ca. 1 Mb) and functional involvement in processes potentially underlying BLV-induced pathogenesis. In conclusion, it was demonstrated that host response to BLV infection involves nine sub-regions of the cattle genome (represented by 9 SNP markers), containing many genes which, based on the literature, could be involved in enzootic bovine leukemia progression. A new group of promising candidate genes associated with the host response to BLV infection was identified and could therefore be a target for future studies. 
The functions of candidate genes surrounding significant SNP markers imply that there is no single regulatory process that is solely targeted by BLV infection, but rather the network of interrelated pathways is deregulated, leading to the disruption of the control of B-cell proliferation and programmed cell death. Copyright © 2016 Elsevier B.V. All rights reserved.
Ma, Li; Runesha, H Birali; Dvorkin, Daniel; Garbe, John R; Da, Yang
2008-01-01
Background Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers provide opportunities to detect epistatic SNPs associated with quantitative traits and to detect the exact mode of an epistasis effect. Computational difficulty is the main bottleneck for epistasis testing in large scale GWAS. Results The EPISNPmpi and EPISNP computer programs were developed for testing single-locus and epistatic SNP effects on quantitative traits in GWAS, including tests of three single-locus effects for each SNP (SNP genotypic effect, additive and dominance effects) and five epistasis effects for each pair of SNPs (two-locus interaction, additive × additive, additive × dominance, dominance × additive, and dominance × dominance) based on the extended Kempthorne model. EPISNPmpi is the parallel computing program for epistasis testing in large scale GWAS and achieved excellent scalability for large scale analysis and portability for various parallel computing platforms. EPISNP is the serial computing program based on the EPISNPmpi code for epistasis testing in small scale GWAS using commonly available operating systems and computer hardware. Three serial computing utility programs were developed for graphical viewing of test results and epistasis networks, and for estimating CPU time and disk space requirements. Conclusion The EPISNPmpi parallel computing program provides an effective computing tool for epistasis testing in large scale GWAS, and the epiSNP serial computing programs are convenient tools for epistasis analysis in small scale GWAS using commonly available computer hardware. PMID:18644146
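The four pairwise epistasis components listed above (additive × additive, additive × dominance, dominance × additive, dominance × dominance) can be illustrated with a simplified covariate coding. This is only a sketch under stated assumptions: it uses a plain -1/0/1 additive code and a 0/1 heterozygote dominance code, not the extended Kempthorne model that EPISNP is based on, and the function names are hypothetical.

```python
def additive_code(allele_count):
    """Additive covariate: -1, 0, 1 for 0, 1, 2 copies of the reference allele."""
    return allele_count - 1


def dominance_code(allele_count):
    """Dominance covariate: 1 for heterozygotes, 0 for either homozygote."""
    return 1 if allele_count == 1 else 0


def epistasis_covariates(count_locus1, count_locus2):
    """Return the four pairwise interaction covariates for two SNP genotypes.

    Genotypes are given as reference-allele counts (0, 1 or 2). Each
    interaction covariate is the product of the corresponding single-locus
    codes (illustrative coding, not the EPISNP implementation).
    """
    for count in (count_locus1, count_locus2):
        if count not in (0, 1, 2):
            raise ValueError("genotype must be an allele count of 0, 1 or 2")
    a1, d1 = additive_code(count_locus1), dominance_code(count_locus1)
    a2, d2 = additive_code(count_locus2), dominance_code(count_locus2)
    return {"AxA": a1 * a2, "AxD": a1 * d2, "DxA": d1 * a2, "DxD": d1 * d2}


if __name__ == "__main__":
    # Homozygote at locus 1, heterozygote at locus 2
    print(epistasis_covariates(2, 1))
```

In a regression setting, each covariate would be tested for association with the quantitative trait; the number of SNP pairs grows quadratically with panel size, which is the computational bottleneck the abstract refers to.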
Ren, Yi; Di Jiao; Gong, Guoyi; Zhang, Haiying; Guo, Shaogui; Zhang, Jie; Xu, Yong
Fusarium wilt (FW) caused by Fusarium oxysporum f. sp. niveum (FON) is the major soilborne disease of watermelon (Citrullus lanatus L.). The development and deployment of resistant cultivars is generally considered to be an effective approach to control FW. In this study, an F8 population consisting of 103 recombinant inbred lines derived from a cross between the cultivar 97103 and a wild accession PI 296341-FR was used for FON race 1 and race 2 fungal inoculations. One major QTL on chromosome 1 for FON race 1 resistance was detected with a logarithm of odds of 13.2 and explained 48.1% of the phenotypic variation (R² = 48.1%); two QTLs of FON race 2 resistance on chromosomes 9 and 10 were discovered based on the high-density integrated genetic map we constructed. The nearest molecular marker should be useful for marker-assisted selection of FON race 1 and race 2 resistance. One receptor kinase, one glucan endo-1,3-β-glucosidase precursor and three acidic chitinases are located in the FON-1 QTL genomic region. In the Qfon2.1 QTL region, one lipoxygenase gene, five receptor-like kinases and four glutathione S-transferase genes were discovered. One arginine biosynthesis bifunctional protein, two receptor kinase proteins and one lipid-transfer protein are located in the Qfon2.2 QTL region. Based on SNP analysis using 20 re-sequenced accessions of watermelon and a 231-plant F2 population generated from Black Diamond × Calhoun Grey, we developed a SNP marker, Chr1SNP_502124, for FON-1 detection.
Blåhed, Ida-Maria; Königsson, Helena; Ericsson, Göran; Spong, Göran
2018-01-01
Monitoring of wild animal populations is challenging, yet reliable information about population processes is important for both management and conservation efforts. Access to molecular markers, such as SNPs, enables population monitoring through genotyping of various DNA sources. We have developed 96 high quality SNP markers for individual identification of moose (Alces alces), an economically and ecologically important top-herbivore in boreal regions. Reduced representation libraries constructed from 34 moose were high-throughput de novo sequenced, generating nearly 50 million read pairs. About 50 000 stacks of aligned reads containing one or more SNPs were discovered with the Stacks pipeline. Several quality criteria were applied to the candidate SNPs to find markers informative on the individual level and well representative for the population. An empirical validation by genotyping of sequenced individuals and additional moose resulted in the selection of a final panel of 86 high quality autosomal SNPs. Additionally, five sex-specific SNPs and five SNPs for sympatric species diagnostics are included in the panel. The genotyping error rate was 0.002 for the total panel and probabilities of identity were low enough to separate individuals with high confidence. Moreover, the autosomal SNPs were also highly informative for population level analyses. The potential applications of this SNP panel are thus many, including investigations of population size, sex ratios, relatedness, reproductive success and population structure. Ideally, SNP-based studies could improve today's population monitoring and increase our knowledge about moose population dynamics.
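The abstract reports that probabilities of identity were low enough to separate individuals, without giving the calculation. As a minimal sketch, the standard formula for unrelated individuals at a biallelic locus is PI = p^4 + q^4 + (2pq)^2, multiplied across loci. This assumes Hardy-Weinberg proportions and independent loci; the function names and allele frequencies below are hypothetical and are not taken from the moose panel.

```python
from math import prod


def pid_biallelic(p: float) -> float:
    """Probability that two unrelated individuals share a genotype at one
    biallelic locus with allele frequencies p and q = 1 - p."""
    q = 1.0 - p
    # Two matching homozygotes (p^4, q^4) or two matching heterozygotes ((2pq)^2).
    return p**4 + q**4 + (2.0 * p * q) ** 2


def pid_panel(freqs) -> float:
    """Multilocus probability of identity, assuming independent loci."""
    return prod(pid_biallelic(p) for p in freqs)


if __name__ == "__main__":
    # Hypothetical allele frequencies for a three-SNP toy panel.
    toy_panel = [0.5, 0.3, 0.4]
    print(pid_panel(toy_panel))
```

The per-locus value is smallest at p = 0.5, which is one reason panels of this kind tend to favour SNPs with intermediate allele frequencies.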
Ren, Jing; Chen, Liang; Jin, Xiaoli; Zhang, Miaomiao; You, Frank M; Wang, Jirui; Frenkel, Vladimir; Yin, Xuegui; Nevo, Eviatar; Sun, Dongfa; Luo, Ming-Cheng; Peng, Junhua
2017-01-01
Whole-genome scans with large numbers of genetic markers provide the opportunity to investigate local adaptation in natural populations and identify candidate genes under positive selection. In the present study, adaptive genetic differentiation associated with solar radiation was investigated using 695 polymorphic SNP markers in wild emmer wheat originating from a micro-site at Yehudiyya, Israel. The test involved two solar radiation niches: (1) sun, in-between trees; and (2) shade, under tree canopy, separated by a distance of 2-4 m. Analysis of molecular variance showed a small (0.53%) but significant portion of overall variation between the sun and shade micro-niches, indicating a non-negligible genetic differentiation between sun and shade habitats. Fifty SNP markers showed a medium (0.05 ≤ FST ≤ 0.15) or high (FST > 0.15) genetic differentiation. A total of 21 outlier loci under positive selection were identified by using four different FST-outlier testing algorithms. The markers and genome locations under positive selection are consistent with the known patterns of selection. These results suggested that genetic differentiation between sun and shade habitats is substantial, radiation-associated, and therefore ecologically determined. Hence, the results of this study reflected effects of natural selection through solar radiation on EST-related SNP genetic diversity, resulting presumably in different adaptive complexes at a micro-scale divergence. The present work highlights the evolutionary theory and application significance of solar radiation-driven natural selection in wheat improvement.
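The medium and high FST classes quoted above can be illustrated with a small sketch. It computes a simple heterozygosity-based FST, (HT - HS) / HT, for one biallelic SNP in two equally weighted subpopulations and applies the thresholds from the abstract. The estimator choice, the "low" label for values under 0.05, and the example frequencies are assumptions for illustration; the study itself may have used a different estimator.

```python
def fst_two_pops(p1: float, p2: float) -> float:
    """Heterozygosity-based FST for one biallelic SNP, given the frequency of
    the same allele in two equally weighted subpopulations."""
    p_bar = (p1 + p2) / 2.0
    h_total = 2.0 * p_bar * (1.0 - p_bar)  # expected heterozygosity, pooled
    h_sub = (2.0 * p1 * (1.0 - p1) + 2.0 * p2 * (1.0 - p2)) / 2.0  # mean within
    if h_total == 0.0:
        return 0.0  # monomorphic marker: no differentiation to measure
    return (h_total - h_sub) / h_total


def classify_fst(fst: float) -> str:
    """Thresholds from the abstract: medium is 0.05 to 0.15, high is above 0.15.
    The 'low' label for smaller values is an assumption."""
    if fst > 0.15:
        return "high"
    if fst >= 0.05:
        return "medium"
    return "low"


if __name__ == "__main__":
    # Hypothetical sun and shade allele frequencies.
    value = fst_two_pops(0.2, 0.8)
    print(value, classify_fst(value))
```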
Da, Yang; Wang, Chunkao; Wang, Shengwen; Hu, Guo
2014-01-01
We established a genomic model of quantitative trait with genomic additive and dominance relationships that parallels the traditional quantitative genetics model, which partitions a genotypic value as breeding value plus dominance deviation and calculates additive and dominance relationships using pedigree information. Based on this genomic model, two sets of computationally complementary but mathematically identical mixed model methods were developed for genomic best linear unbiased prediction (GBLUP) and genomic restricted maximum likelihood estimation (GREML) of additive and dominance effects using SNP markers. These two sets are referred to as the CE and QM sets, where the CE set was designed for large numbers of markers and the QM set was designed for large numbers of individuals. GBLUP and associated accuracy formulations for individuals in training and validation data sets were derived for breeding values, dominance deviations and genotypic values. Simulation study showed that GREML and GBLUP generally were able to capture small additive and dominance effects that each accounted for 0.00005–0.0003 of the phenotypic variance and GREML was able to differentiate true additive and dominance heritability levels. GBLUP of the total genetic value as the summation of additive and dominance effects had higher prediction accuracy than either additive or dominance GBLUP, causal variants had the highest accuracy of GREML and GBLUP, and predicted accuracies were in agreement with observed accuracies. Genomic additive and dominance relationship matrices using SNP markers were consistent with theoretical expectations. The GREML and GBLUP methods can be an effective tool for assessing the type and magnitude of genetic effects affecting a phenotype and for predicting the total genetic value at the whole genome level. PMID:24498162
Mapping the non-darkening trait from 'Wit-rood boontje' in bean (Phaseolus vulgaris).
Erfatpour, M; Navabi, A; Pauls, K P
2018-06-01
A QTL for non-darkening seed coat from 'Wit-rood boontje' was mapped in pinto bean population on chromosome Pv10, comprising 40 candidate genes. The seed coat colour darkens with age in some market classes of dry beans (Phaseolus vulgaris), including pinto bean. Beans with darkened seed coats are discounted in the market place, since they are believed to be associated with lower nutritional quality, increased cooking time, and decreased palatability. The objective of this research was to map a non-darkening gene from a cranberry-like bean 'Wit-rood boontje' using a recombinant inbred line population, derived from a cross between 'Wit-rood boontje' and a slow-darkening pinto bean (1533-15). The population was characterized for seed phenotype and genotyped with an Illumina BeadChip. A genetic linkage map was constructed with 1327 informative SNP markers plus an STS marker (OL4S 500 ) and an SSR marker (Pvsd-0028), previously associated with the J gene and Sd gene, respectively, as well as non-darkening and slow-darkening phenotypes. The linkage map spanned 1253.2 cM over 11 chromosomes. A major QTL for the non-darkening trait was flanked by SNP 715646341 and SNP 715646348 on chromosome Pv10. The region, which spanned 13.2 cM, explained 48% of the phenotypic variation for seed coat darkening. Forty candidate genes were identified in the QTL interval. This information can be used to develop a gene-based marker to facilitate breeding non-darkening pinto beans and may lead to a better understanding of the molecular mechanism for the postharvest darkening phenomenon in pinto bean.
Wang, Zi-nian; Cai, Han-fang; Li, Ming-xun; Cao, Xiu-kai; Lan, Xian-yong; Lei, Chu-zhao; Chen, Hong
2016-01-10
Patatin-like phospholipase domain-containing protein 3 (PNPLA3), a member of the patatin-like phospholipase domain-containing (PNPLA) family, plays an important role in energy balance, fat metabolism regulation, glucose metabolism and fatty liver disease. Tetra-primer amplification refractory mutation system PCR (T-ARMS-PCR) is a new method offering fast detection and extreme simplicity at a negligible cost for SNP genotyping. In this paper, we investigated the genetic variations at different ages of 660 Chinese indigenous cattle belonging to three breeds (QC, NY, JX) and applied T-ARMS-PCR and PCR-RFLP methods to genotype four SNPs, SNP1: g.A2980G, SNP2: g.A2996T, SNP3: g.A36718G, SNP4: g.G36850A. The statistical analyses indicated that these 4 SNPs affected growth traits markedly (P<0.05) in the QC population, whereas combined haplotypes did not (P>0.05). Quantitative PCR (qPCR) indicated that the bovine PNPLA3 gene was exclusively expressed in fat tissues. Besides, the analysis between SNP and mRNA expression revealed that, in SNP1, the expression of AG was much higher than that of AA and GG (P<0.05), which was in accordance with the results of the growth traits association analysis, while the results for SNP4 were not. These results supported a high potential for SNPs of the bovine PNPLA3 gene to be utilized as genetic markers in marker-assisted selection (MAS) for Chinese cattle breeding programs. Copyright © 2015 Elsevier B.V. All rights reserved.
Bose, Nikhil; Carlberg, Katie; Sensabaugh, George; Erlich, Henry; Calloway, Cassandra
2018-05-01
DNA from biological forensic samples can be highly fragmented and present in limited quantity. When DNA is highly fragmented, conventional PCR based Short Tandem Repeat (STR) analysis may fail as primer binding sites may not be present on a single template molecule. Single Nucleotide Polymorphisms (SNPs) can serve as an alternative type of genetic marker for analysis of degraded samples because the targeted variation is a single base. However, conventional PCR based SNP analysis methods still require intact primer binding sites for target amplification. Recently, probe capture methods for targeted enrichment have shown success in recovering degraded DNA as well as DNA from ancient bone samples using next-generation sequencing (NGS) technologies. The goal of this study was to design and test a probe capture assay targeting forensically relevant nuclear SNP markers for clonal and massively parallel sequencing (MPS) of degraded and limited DNA samples as well as mixtures. A set of 411 polymorphic markers totaling 451 nuclear SNPs (375 SNPs and 36 microhaplotype markers) was selected for the custom probe capture panel. The SNP markers were selected for a broad range of forensic applications including human individual identification, kinship, and lineage analysis as well as for mixture analysis. Performance of the custom SNP probe capture NGS assay was characterized by analyzing read depth and heterozygote allele balance across 15 samples at 25 ng input DNA. Performance thresholds were established based on read depth ≥500X and heterozygote allele balance within ±10% deviation from 50:50, which was observed for 426 out of 451 SNPs. These 426 SNPs were analyzed in size selected samples (at ≤75 bp, ≤100 bp, ≤150 bp, ≤200 bp, and ≤250 bp) as well as mock degraded samples fragmented to an average of 150 bp. Samples selected for ≤75 bp exhibited 99-100% reportable SNPs across varied DNA amounts and as low as 0.5 ng. 
Mock degraded samples at 1 ng and 10 ng exhibited >90% reportable SNPs. Finally, two-person male-male mixtures were tested at 10 ng in varying contributor ratios. Overall, 85-100% of alleles unique to the minor contributor were observed at all mixture ratios. Results from these studies using the SNP probe capture NGS system demonstrate proof of concept for application to forensically relevant degraded and mixed DNA samples. Copyright © 2018 Elsevier B.V. All rights reserved.
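The performance thresholds described in this abstract (read depth ≥500X and heterozygote allele balance within ±10% of 50:50) can be sketched as a simple per-SNP filter. This is an illustration only: it assumes that "±10%" means the reference-allele read fraction lies between 0.40 and 0.60, and the function name, parameter names and read counts are hypothetical.

```python
def passes_thresholds(ref_reads: int, alt_reads: int,
                      min_depth: int = 500, max_dev: float = 0.10) -> bool:
    """Return True if a heterozygous SNP call meets both thresholds.

    Assumption: allele balance is measured as the reference-read fraction,
    and max_dev is an absolute deviation from 0.5.
    """
    depth = ref_reads + alt_reads
    if depth < min_depth:
        return False
    balance = ref_reads / depth
    return abs(balance - 0.5) <= max_dev


if __name__ == "__main__":
    # Hypothetical (ref_reads, alt_reads) counts for three heterozygous SNPs.
    calls = {"snp_a": (450, 550), "snp_b": (200, 200), "snp_c": (700, 300)}
    for name, (ref, alt) in calls.items():
        print(name, passes_thresholds(ref, alt))
```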
Kim, H; Lee, S K; Hong, M W; Park, S R; Lee, Y S; Kim, J W; Lee, H K; Jeong, D K; Song, Y H; Lee, S J
2013-12-01
The akirin 2 gene, located on chromosome 9 in cattle, was previously reported to be associated with nuclear factor-kappa B (NF-κB), involved in immune reactions and marbling of meat. To determine whether a single nucleotide polymorphism (SNP) in akirin 2 is associated with economically important traits of Korean native cattle, the c.*188G>A SNP DNA marker in the 3'-UTR region of akirin 2 was analyzed for its association with carcass weight, longissimus muscle area and marbling. The c.*188G>A SNP was genotyped by polymerase chain reaction restriction fragment length polymorphism, and the frequencies of the AA, AG, and GG genotypes were 6.82%, 71.29% and 21.88%, respectively. This SNP was significantly associated with longissimus muscle area (Bonferroni corrected P < 0.05) and marbling score (Bonferroni corrected P < 0.01). These results suggest that the c.*188G>A SNP of akirin 2 might be useful as a DNA marker for longissimus muscle area and marbling scores in Korean native cattle. © 2013 The Authors, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.
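Allele frequencies follow directly from the genotype frequencies reported above, which the following sketch works through. The genotype percentages are taken from the abstract; the function names are hypothetical, and the Hardy-Weinberg expectation is added only as a standard point of comparison, not as an analysis the authors report.

```python
def allele_freqs(f_aa: float, f_ag: float, f_gg: float):
    """Allele frequencies (A, G) from genotype frequencies AA, AG, GG.

    Inputs are normalised by their sum, so percentages that do not add up
    to exactly 100 because of rounding are handled.
    """
    total = f_aa + f_ag + f_gg
    p_a = (f_aa + f_ag / 2.0) / total  # each heterozygote carries one A
    p_g = (f_gg + f_ag / 2.0) / total
    return p_a, p_g


def expected_het(p_a: float, p_g: float) -> float:
    """Heterozygote frequency expected under Hardy-Weinberg proportions."""
    return 2.0 * p_a * p_g


if __name__ == "__main__":
    # Genotype percentages as reported in the abstract.
    p_a, p_g = allele_freqs(6.82, 71.29, 21.88)
    print(p_a, p_g, expected_het(p_a, p_g))
```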
Fondevila, M; Børsting, C; Phillips, C; de la Puente, M; Consortium, Euroforen-NoE; Carracedo, A; Morling, N; Lareu, M V
2017-01-01
This review explores the key factors that influence the optimization, routine use, and profile interpretation of the SNaPshot single-base extension (SBE) system applied to forensic single-nucleotide polymorphism (SNP) genotyping. Despite being a mainly complementary DNA genotyping technique to routine STR profiling, use of SNaPshot is an important part of the development of SNP sets for a wide range of forensic applications with these markers, from genotyping highly degraded DNA with very short amplicons to the introduction of SNPs to ascertain the ancestry and physical characteristics of an unidentified contact trace donor. However, this technology, as resourceful as it is, displays several features that depart from the usual STR genotyping far enough to demand a certain degree of expertise from the forensic analyst before tackling the complex casework on which SNaPshot application provides an advantage. In order to provide the basis for developing such expertise, we cover in this paper the most challenging aspects of the SNaPshot technology, focusing on the steps taken to design primer sets, optimize the PCR and single-base extension chemistries, and the important features of the peak patterns observed in typical forensic SNP profiles using SNaPshot. With that purpose in mind, we provide guidelines and troubleshooting for multiplex-SNaPshot-oriented primer design and the resulting capillary electrophoresis (CE) profile interpretation (covering the most commonly observed artifacts and expected departures from the ideal conditions). Copyright © 2017 Central Police University.
An innovative SNP genotyping method adapting to multiple platforms and throughputs
USDA-ARS's Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) are highly abundant, distributed throughout the genome in various species, and therefore they are widely used as genetic markers. However, the usefulness of this genetic tool relies heavily on the availability of user-friendly SNP genotyping methods. We have d...
USDA-ARS's Scientific Manuscript database
Microsatellite markers (MS) have traditionally been used for parental verification and are still the international standard in spite of their higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP)-based assays. Despite domestic and international demands fro...
Analysis of genetic diversity using SNP markers in oat
USDA-ARS's Scientific Manuscript database
A large-scale single nucleotide polymorphism (SNP) discovery was carried out in cultivated oat using Roche 454 sequencing methods. DNA sequences were generated from cDNAs originating from a panel of 20 diverse oat cultivars, and from Diversity Array Technology (DArT) genomic complexity reductions fr...
White, Vanessa Linley; Endersby, Nancy Margaret; Chan, Janice; Hoffmann, Ary Anthony; Weeks, Andrew Raymond
2015-03-01
Aedes aegypti, Aedes notoscriptus, and Aedes albopictus are important vectors of many arboviruses implicated in human disease such as dengue fever. Genetic markers applied across vector species can provide important information on population structure, gene flow, insecticide resistance, and taxonomy; however, robust microsatellite markers have proven difficult to develop in these species and in mosquitoes generally. Here we consider the utility and transferability of 15 Ribosome protein (Rp) Exon-Primed Intron-Crossing (EPIC) markers for population genetic studies in these 3 Aedes species. Rp EPIC markers designed for Ae. aegypti also successfully amplified in populations of the sister species, Ae. albopictus, as well as the distantly related species, Ae. notoscriptus. High SNP and good indel diversity in sequenced alleles plus support for amplification of the same regions across populations and species were additional benefits of these markers. These findings point to the general value of EPIC markers in mosquito population studies. © 2014 Institute of Zoology, Chinese Academy of Sciences.
2012-01-01
Background In the last 30 years, a number of DNA fingerprinting methods such as RFLP, RAPD, AFLP, SSR, DArT, have been extensively used in marker development for molecular plant breeding. However, it remains a daunting task to identify highly polymorphic and closely linked molecular markers for a target trait for molecular marker-assisted selection. The next-generation sequencing (NGS) technology is far more powerful than any existing generic DNA fingerprinting methods in generating DNA markers. In this study, we employed a grain legume crop Lupinus angustifolius (lupin) as a test case, and examined the utility of an NGS-based method of RAD (restriction-site associated DNA) sequencing as DNA fingerprinting for rapid, cost-effective marker development tagging a disease resistance gene for molecular breeding. Results Twenty informative plants from a cross of RxS (disease resistant x susceptible) in lupin were subjected to RAD single-end sequencing by multiplex identifiers. The entire RAD sequencing products were resolved in two lanes of the 16-lanes per run sequencing platform Solexa HiSeq2000. A total of 185 million raw reads, approximately 17 Gb of sequencing data, were collected. Sequence comparison among the 20 test plants discovered 8207 SNP markers. Filtration of DNA sequencing data with marker identification parameters resulted in the discovery of 38 molecular markers linked to the disease resistance gene Lanr1. Five randomly selected markers were converted into cost-effective, simple PCR-based markers. Linkage analysis using marker genotyping data and disease resistance phenotyping data on a F8 population consisting of 186 individual plants confirmed that all these five markers were linked to the R gene. 
Two of these newly developed sequence-specific PCR markers, AnSeq3 and AnSeq4, flanked the target R gene at a genetic distance of 0.9 centiMorgan (cM), and are now replacing the markers previously developed by a traditional DNA fingerprinting method for marker-assisted selection in the Australian national lupin breeding program. Conclusions We demonstrated that more than 30 molecular markers linked to a target gene of agronomic trait of interest can be identified from a small portion (1/8) of one sequencing run on HiSeq2000 by applying NGS based RAD sequencing in marker development. The markers developed by the strategy described in this study are all co-dominant SNP markers, which can readily be converted into high throughput multiplex format or low-cost, simple PCR-based markers desirable for large scale marker implementation in plant breeding programs. The high density and closely linked molecular markers associated with a target trait help to overcome a major bottleneck for implementation of molecular markers on a wide range of germplasm in breeding programs. We conclude that application of NGS based RAD sequencing as DNA fingerprinting is a very rapid and cost-effective strategy for marker development in molecular plant breeding. The strategy does not require any prior genome knowledge or molecular information for the species under investigation, and it is applicable to other plant species. PMID:22805587
Yang, Huaan; Tao, Ye; Zheng, Zequn; Li, Chengdao; Sweetingham, Mark W; Howieson, John G
2012-07-17
In the last 30 years, a number of DNA fingerprinting methods such as RFLP, RAPD, AFLP, SSR, DArT, have been extensively used in marker development for molecular plant breeding. However, it remains a daunting task to identify highly polymorphic and closely linked molecular markers for a target trait for molecular marker-assisted selection. The next-generation sequencing (NGS) technology is far more powerful than any existing generic DNA fingerprinting methods in generating DNA markers. In this study, we employed a grain legume crop Lupinus angustifolius (lupin) as a test case, and examined the utility of an NGS-based method of RAD (restriction-site associated DNA) sequencing as DNA fingerprinting for rapid, cost-effective marker development tagging a disease resistance gene for molecular breeding. Twenty informative plants from a cross of RxS (disease resistant x susceptible) in lupin were subjected to RAD single-end sequencing by multiplex identifiers. The entire RAD sequencing products were resolved in two lanes of the 16-lanes per run sequencing platform Solexa HiSeq2000. A total of 185 million raw reads, approximately 17 Gb of sequencing data, were collected. Sequence comparison among the 20 test plants discovered 8207 SNP markers. Filtration of DNA sequencing data with marker identification parameters resulted in the discovery of 38 molecular markers linked to the disease resistance gene Lanr1. Five randomly selected markers were converted into cost-effective, simple PCR-based markers. Linkage analysis using marker genotyping data and disease resistance phenotyping data on a F8 population consisting of 186 individual plants confirmed that all these five markers were linked to the R gene. 
Two of these newly developed sequence-specific PCR markers, AnSeq3 and AnSeq4, flanked the target R gene at a genetic distance of 0.9 centiMorgan (cM), and are now replacing the markers previously developed by a traditional DNA fingerprinting method for marker-assisted selection in the Australian national lupin breeding program. We demonstrated that more than 30 molecular markers linked to a target gene of agronomic trait of interest can be identified from a small portion (1/8) of one sequencing run on HiSeq2000 by applying NGS based RAD sequencing in marker development. The markers developed by the strategy described in this study are all co-dominant SNP markers, which can readily be converted into high throughput multiplex format or low-cost, simple PCR-based markers desirable for large scale marker implementation in plant breeding programs. The high density and closely linked molecular markers associated with a target trait help to overcome a major bottleneck for implementation of molecular markers on a wide range of germplasm in breeding programs. We conclude that application of NGS based RAD sequencing as DNA fingerprinting is a very rapid and cost-effective strategy for marker development in molecular plant breeding. The strategy does not require any prior genome knowledge or molecular information for the species under investigation, and it is applicable to other plant species.
2012-01-01
Background Genome-wide association studies (GWAS) do not provide a full account of the heritability of genetic diseases since gene-gene interactions, also known as epistasis are not considered in single locus GWAS. To address this problem, a considerable number of methods have been developed for identifying disease-associated gene-gene interactions. However, these methods typically fail to identify interacting markers explaining more of the disease heritability over single locus GWAS, since many of the interactions significant for disease are obscured by uninformative marker interactions e.g., linkage disequilibrium (LD). Results In this study, we present a novel SNP interaction prioritization algorithm, named iLOCi (Interacting Loci). This algorithm accounts for marker dependencies separately in case and control groups. Disease-associated interactions are then prioritized according to a novel ranking score calculated from the difference in marker dependencies for every possible pair between case and control groups. The analysis of a typical GWAS dataset can be completed in less than a day on a standard workstation with parallel processing capability. The proposed framework was validated using simulated data and applied to real GWAS datasets using the Wellcome Trust Case Control Consortium (WTCCC) data. The results from simulated data showed the ability of iLOCi to identify various types of gene-gene interactions, especially for high-order interaction. From the WTCCC data, we found that among the top ranked interacting SNP pairs, several mapped to genes previously known to be associated with disease, and interestingly, other previously unreported genes with biologically related roles. Conclusion iLOCi is a powerful tool for uncovering true disease interacting markers and thus can provide a more complete understanding of the genetic basis underlying complex disease. The program is available for download at http://www4a.biotec.or.th/GI/tools/iloci. PMID:23281813
Emebiri, L C; Tan, M-K; El-Bouhssini, M; Wildman, O; Jighly, A; Tadesse, W; Ogbonnaya, F C
2017-02-01
This research provides the first report of a major locus controlling wheat resistance to Sunn pest. It developed and validated SNP markers that will be useful for marker-assisted selection. Sunn pest (Eurygaster integriceps Puton) is the most destructive insect pest of bread wheat and durum wheat in West and Central Asia and East Europe. Breeding for resistance at the vegetative stage of growth is vital in reducing the damage caused by overwintered adult populations that feed on shoot and leaves of seedlings, and in reducing the next generation of pest populations (nymphs and adults), which can cause damage to grain quality by feeding on spikes. In the present study, two doubled haploid (DH) populations involving resistant landraces from Afghanistan were genotyped with the 90k SNP iSelect assay and candidate gene-based KASP markers. The DH lines and parents were phenotyped for resistance to Sunn pest feeding, using artificial infestation cages at Terbol station, in Lebanon, over three years. Quantitative trait locus (QTL) analysis identified a single major locus on chromosome 4BS in the two populations, with the resistance allele derived from the landrace accessions, IG139431 and IG139883. The QTL explained a maximum of 42 % of the phenotypic variation in the Cham6 × IG139431 and 56 % in the Cham6 × IG139883 populations. SNP markers closest to the QTL showed high similarity to rice genes that putatively encode proteins for defense response to herbivory and wounding. The markers were validated in a large, unrelated population of parental wheat genotypes. All wheat lines carrying the 'C-G' haplotype at the identified SNPs were resistant, suggesting that selection based on a haplotype of favourable alleles would be effective in predicting resistance status of unknown genotypes.
Sallam, Ahmed; Arbaoui, Mustapha; El-Esawi, Mohamed; Abshire, Nathan; Martsch, Regina
2016-01-01
Frost stress is one of the abiotic stresses that causes a significant reduction in winter faba bean yield in Europe. The main objective of this work is to genetically improve frost tolerance in winter faba bean by identifying and validating QTL associated with frost tolerance to be used in marker-assisted selection (MAS). Two different genetic backgrounds were used: a biparental population (BPP) consisting of 101 inbred lines, and 189 genotypes from single seed descent (SSD) from the Gottingen Winter bean Population (GWBP). All experiments were conducted in a frost growth chamber under controlled conditions. Both populations were genotyped using the same set of 189 SNP markers. Visual scoring for frost stress symptoms was used to define frost tolerance in both populations. In addition, leaf fatty acid composition (FAC) and proline content were analyzed in BPP as physiological traits. QTL mapping (for BPP) and genome wide association studies (for GWBP) were performed to detect QTL associated with frost tolerance. High genetic variation between genotypes, and repeatability estimates, were found for all traits. QTL mapping and GWAS identified new putative QTL associated with promising frost tolerance and related traits. A set of 54 SNP markers common in both genetic backgrounds showed a high genetic diversity with polymorphic information content (PIC) ranging from 0.31 to 0.37 and gene diversity ranging from 0.39 to 0.50. This indicates that these markers may be polymorphic for many faba bean populations. Five SNP markers showed a significant marker-trait association with frost tolerance and related traits in both populations. Moreover, synteny analysis between Medicago truncatula (a model legume) and faba bean genomes was performed to identify candidate genes for these markers. 
Collinearity was evaluated between the faba bean genetic map constructed in this study and the faba bean consensus map, resulting in identifying possible genomic regions in faba bean which may control frost tolerance genes. The two genetic backgrounds were useful in detecting new variation for improving frost tolerance in winter faba bean. Of the five validated SNP markers, one (VF_Mt3g086600) was found to be associated with frost tolerance and FAC in both populations. This marker was also associated with winter hardiness and high yield in earlier studies. This marker is located in a gene of unknown function. PMID:27540381
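The PIC and gene diversity ranges reported above sit just below the theoretical ceilings for biallelic markers (0.375 and 0.50). The abstract does not say which estimators were used, so the following is only a minimal sketch, assuming the commonly used PIC formula of Botstein et al. and Nei's gene diversity (expected heterozygosity); the function names are illustrative and not taken from the study.

```python
from collections import Counter


def allele_frequencies(alleles):
    """Return allele frequencies from a flat list of observed alleles."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return [n / total for n in counts.values()]


def gene_diversity(freqs):
    """Nei's gene diversity (expected heterozygosity): 1 - sum(p_i^2)."""
    return 1.0 - sum(p * p for p in freqs)


def pic(freqs):
    """Polymorphic information content, assuming the Botstein et al. form:
    1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2."""
    homozygosity = sum(p * p for p in freqs)
    cross = sum(
        2 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return 1.0 - homozygosity - cross


if __name__ == "__main__":
    # A biallelic SNP with equal allele frequencies gives the maximum values.
    balanced = [0.5, 0.5]
    print(gene_diversity(balanced), pic(balanced))
    # A more skewed SNP (hypothetical major allele frequency of 0.73).
    skewed = [0.73, 0.27]
    print(round(gene_diversity(skewed), 2), round(pic(skewed), 2))
```

Under these assumptions, a biallelic SNP cannot exceed a PIC of 0.375 or a gene diversity of 0.50, which is why values in the reported ranges indicate markers with fairly balanced allele frequencies.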
A unified SNP map of sunflower (Helianthus annuus L.) derived from current genomic resources
USDA-ARS's Scientific Manuscript database
Dense genetic maps are critical tools for plant breeders and geneticists. While many maps have been developed for sunflower in the last few decades, most have been based on low-throughput technologies and include marker numbers in the hundreds. However, two maps with reasonably dense coverage of a...
Construction of a SNP and SSR linkage map in autotetraploid blueberry using genotyping by sequencing
USDA-ARS's Scientific Manuscript database
A mapping population developed from a cross between two key highbush blueberry cultivars, Draper × Jewel (Vaccinium corymbosum), segregating for a number of important phenotypic traits, has been utilized to produce a genetic linkage map. Data on 233 simple sequence repeat (SSR) markers and 1794 sing...
USDA-ARS's Scientific Manuscript database
Pineapple (Ananas comosus [L.] Merr.) is the third most important tropical fruit in the world after banana and mango and a major agricultural commodity in Hawaii. As a crop with vegetative propagation, genetic redundancy is a major challenge for efficient genebank management and in breeding. Using E...
Pacifiplex: an ancestry-informative SNP panel centred on Australia and the Pacific region.
Santos, Carla; Phillips, Christopher; Fondevila, Manuel; Daniel, Runa; van Oorschot, Roland A H; Burchard, Esteban G; Schanfield, Moses S; Souto, Luis; Uacyisrael, Jolame; Via, Marc; Carracedo, Ángel; Lareu, Maria V
2016-01-01
The analysis of human population variation is an area of considerable interest in the forensic, medical genetics and anthropological fields. Several forensic single nucleotide polymorphism (SNP) assays provide ancestry-informative genotypes in sensitive tests designed to work with limited DNA samples, including a 34-SNP multiplex differentiating African, European and East Asian ancestries. Although assays capable of differentiating Oceanian ancestry at a global scale have become available, this study describes markers compiled specifically for differentiation of Oceanian populations. A sensitive multiplex assay, termed Pacifiplex, was developed and optimized in a small-scale test applicable to forensic analyses. The Pacifiplex assay comprises 29 ancestry-informative marker SNPs (AIM-SNPs) selected to complement the 34-plex test, that in a combined set distinguish Africans, Europeans, East Asians and Oceanians. Nine Pacific region study populations were genotyped with both SNP assays, then compared to four reference population groups from the HGDP-CEPH human diversity panel. STRUCTURE analyses estimated population cluster membership proportions that aligned with the patterns of variation suggested for each study population's currently inferred demographic histories. Aboriginal Taiwanese and Philippine samples indicated high East Asian ancestry components, Papua New Guinean and Aboriginal Australians samples were predominantly Oceanian, while other populations displayed cluster patterns explained by the distribution of divergence amongst Melanesians, Polynesians and Micronesians. Genotype data from Pacifiplex and 34-plex tests is particularly well suited to analysis of Australian Aboriginal populations and when combined with Y and mitochondrial DNA variation will provide a powerful set of markers for ancestry inference applied to modern Australian demographic profiles. 
On a broader geographic scale, Pacifiplex adds highly informative data for inferring the ancestry of individuals from Oceanian populations. The sensitivity of Pacifiplex enabled successful genotyping of population samples from 50-year-old serum samples obtained from several Oceanian regions that would otherwise be unlikely to produce useful population data. This indicates tests primarily developed for forensic ancestry analysis also provide an important contribution to studies of populations where useful samples are in limited supply. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing
2014-01-01
Background Genetic linkage maps are useful tools for mapping quantitative trait loci (QTL) influencing variation in traits of interest in a population. Genotyping-by-sequencing approaches such as Restriction-site Associated DNA sequencing (RAD-Seq) now enable the rapid discovery and genotyping of genome-wide SNP markers suitable for the development of dense SNP linkage maps, including in non-model organisms such as Atlantic salmon (Salmo salar). This paper describes the development and characterisation of a high density SNP linkage map based on SbfI RAD-Seq SNP markers from two Atlantic salmon reference families. Results Approximately 6,000 SNPs were assigned to 29 linkage groups, utilising markers from known genomic locations as anchors. Linkage maps were then constructed for the four mapping parents separately. Overall map lengths were comparable between male and female parents, but the distribution of the SNPs showed sex-specific patterns with a greater degree of clustering of sire-segregating SNPs to single chromosome regions. The maps were integrated with the Atlantic salmon draft reference genome contigs, allowing the unique assignment of ~4,000 contigs to a linkage group. 112 genome contigs mapped to two or more linkage groups, highlighting regions of putative homeology within the salmon genome. A comparative genomics analysis with the stickleback reference genome identified putative genes closely linked to approximately half of the ordered SNPs and demonstrated blocks of orthology between the Atlantic salmon and stickleback genomes. A subset of 47 RAD-Seq SNPs were successfully validated using a high-throughput genotyping assay, with a correspondence of 97% between the two assays. Conclusions This Atlantic salmon RAD-Seq linkage map is a resource for salmonid genomics research as genotyping-by-sequencing becomes increasingly common. 
This is aided by the integration of the SbfI RAD-Seq SNPs with existing reference maps and the draft reference genome, as well as the identification of putative genes proximal to the SNPs. Differences in the distribution of recombination events between the sexes is evident, and regions of homeology have been identified which are reflective of the recent salmonid whole genome duplication. PMID:24571138
Gene-Based Single Nucleotide Polymorphism Markers for Genetic and Association Mapping in Common Bean
2012-01-01
Background In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. Results In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. 
Conclusions In short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation to the usefulness of microsatellites, the molecular markers par excellence in this crop. PMID:22734675
Generation of a Saturated Genetic Recombination Map for Avocado (Persea americana)
USDA-ARS's Scientific Manuscript database
Two large mapping populations of avocado consisting of 1582 trees were genotyped with 5050 SNP markers from transcribed genes using an Illumina Infinium SNP chip. A Florida mapping population consisted of 527 progeny from 'Tonnage' x 'Simmonds' and 249 from 'Simmonds' x 'Tonnage'. A California map...
NIH CIDR Program Studies For whole exome sequencing projects, we pretest all samples using a high-density SNP array (>200,000 markers). For custom targeted sequencing, we pretest all samples using a 96 SNP GoldenGate assay. This extensive pretesting allows us to unambiguously tie
Yu, Long-Xi; Zheng, Ping; Zhang, Tiejun; Rodringuez, Jonas; Main, Dorrie
2017-02-01
Verticillium wilt (VW) is a fungal disease that causes severe yield losses in alfalfa. The most effective method to control the disease is through the development and use of resistant varieties. The identification of marker loci linked to VW resistance can facilitate breeding for disease-resistant alfalfa. In the present investigation, we applied an integrated framework of genome-wide association with genotyping-by-sequencing (GBS) to identify VW resistance loci in a panel of elite alfalfa breeding lines. Phenotyping was performed by manual inoculation of the pathogen to healthy seedlings, and scoring for disease resistance was carried out according to the standard test of the North America Alfalfa Improvement Conference (NAAIC). Marker-trait association by linkage disequilibrium identified 10 single nucleotide polymorphism (SNP) markers significantly associated with VW resistance. Alignment of the SNP marker sequences to the M. truncatula genome revealed multiple quantitative trait loci (QTLs). Three, two, one and five markers were located on chromosomes 5, 6, 7 and 8, respectively. Resistance loci found on chromosomes 7 and 8 in the present study co-localized with the QTLs reported previously. A pairwise alignment (blastn) using the flanking sequences of the resistance loci against the M. truncatula genome identified potential candidate genes with putative disease resistance function. With further investigation, these markers may be implemented into breeding programmes using marker-assisted selection, ultimately leading to improved VW resistance in alfalfa. PUBLISHED 2016. THIS ARTICLE IS A U.S. GOVERNMENT WORK AND IS IN THE PUBLIC DOMAIN IN THE USA.
Nuñez-Acuña, Gustavo; Valenzuela-Muñoz, Valentina; Gallardo-Escárate, Cristian
2014-06-01
The salmon louse Caligus rogercresseyi is the dominant ectoparasite species affecting the salmon aquaculture industry in the Southern hemisphere, and it is currently the main cause for economic losses in Chilean aquaculture. However, despite the great concern over Caligus infestations, genomic information on this louse is still scarce, even while the need to develop high-resolution molecular markers is growing. This study provides the first deep transcriptome survey to identify thousands of SNP markers from C. rogercresseyi, with a total of 69,466 SNPs identified using the MiSeq platform (Illumina®), 30,605 (52%) of which were found in contigs successfully annotated against known protein databases. Furthermore, in silico gene expression profiles associated with SNP variants were evaluated, and the results evidenced a wide array of genes that were down- and upregulated throughout the developmental stages of C. rogercresseyi. Interestingly, putative KEGG pathways involved in resistance to antiparasitic agents were also identified, where ten pathways were associated with the nervous system and one was related to ABC transporters. Taken together, this information could be highly useful for investigating the molecular underpinnings involved in the susceptibility or resistance of salmon lice to chemical treatments. Copyright © 2014 Elsevier Inc. All rights reserved.
Chang, Hao-Xun; Hartman, Glen L.
2017-01-01
Management of insects that cause economic damage to soybean yields relies mainly on insecticide applications. Sources of resistance in soybean plant introductions (PIs) to different insect pests have been reported, and some of these sources, such as those for the soybean aphid (SBA), have been used to develop resistant soybean cultivars. With the availability of SoySNP50K and the statistical power of genome-wide association studies, we integrated phenotypic data for beet armyworm, Mexican bean beetle (MBB), potato leafhopper (PLH), SBA, soybean looper (SBL), velvetbean caterpillar (VBC), and chewing damage caused by unspecified insects for a comprehensive understanding of insect resistance in the United States Department of Agriculture Soybean Germplasm Collection. We identified significant single nucleotide polymorphism (SNP) markers for MBB, PLH, SBL, and VBC, and we highlighted several leucine-rich repeat-containing genes and myeloblastosis transcription factors within the high linkage disequilibrium region surrounding significant SNP markers. Specifically for soybean resistance to PLH, we found that the PLH locus is close to but distinct from a locus for soybean pubescence density on chromosome 12. The results provide genetic support that pubescence density may not be directly linked to PLH resistance. This study offers novel insight into soybean resistance to four insect pests and reviews resistance mapping studies for major soybean insects. PMID:28555141
Bouakaze, Caroline; Keyser, Christine; Crubézy, Eric; Montagnon, Daniel; Ludes, Bertrand
2009-07-01
In the present study, a multiplexed genotyping assay for ten single nucleotide polymorphisms (SNPs) located within six pigmentation candidate genes was developed on modern biological samples and applied to DNA retrieved from 25 archeological human remains from southern central Siberia dating from the Bronze and Iron Ages. SNP genotyping was successful for the majority of ancient samples and revealed that most probably had typical European pigment features, i.e., blue or green eye color, light hair color and skin type, and were likely of European individual ancestry. To our knowledge, this study reports for the first time the multiplexed typing of autosomal SNPs on aged and degraded DNA. By providing valuable information on pigment traits of an individual and allowing individual biogeographical ancestry estimation, autosomal SNP typing can improve ancient DNA studies and aid human identification in some forensic casework situations when used to complement conventional molecular markers.
Chadaeva, Irina V; Ponomarenko, Mikhail P; Rasskazov, Dmitry A; Sharypova, Ekaterina B; Kashina, Elena V; Matveeva, Marina Yu; Arshinova, Tatjana V; Ponomarenko, Petr M; Arkova, Olga V; Bondar, Natalia P; Savinkova, Ludmila K; Kolchanov, Nikolay A
2016-12-28
Aggressiveness in humans is a hereditary behavioral trait that mobilizes all systems of the body-first of all, the nervous and endocrine systems, and then the respiratory, vascular, muscular, and others-e.g., for the defense of oneself, children, family, shelter, territory, and other possessions as well as personal interests. The level of aggressiveness of a person determines many other characteristics of quality of life and lifespan, acting as a stress factor. Aggressive behavior depends on many parameters such as age, gender, diseases and treatment, diet, and environmental conditions. Among them, genetic factors are believed to be the main parameters that are well-studied at the factual level, but in actuality, genome-wide studies of aggressive behavior appeared relatively recently. One of the biggest projects of the modern science-1000 Genomes-involves identification of single nucleotide polymorphisms (SNPs), i.e., differences of individual genomes from the reference genome. SNPs can be associated with hereditary diseases, their complications, comorbidities, and responses to stress or a drug. Clinical comparisons between cohorts of patients and healthy volunteers (as a control) allow for identifying SNPs whose allele frequencies significantly separate them from one another as markers of the above conditions. Computer-based preliminary analysis of millions of SNPs detected by the 1000 Genomes project can accelerate clinical search for SNP markers due to preliminary whole-genome search for the most meaningful candidate SNP markers and discarding of neutral and poorly substantiated SNPs. Here, we combine two computer-based search methods for SNPs (that alter gene expression) {i} Web service SNP_TATA_Comparator (DNA sequence analysis) and {ii} PubMed-based manual search for articles on aggressiveness using heuristic keywords. 
Near the known binding sites for TATA-binding protein (TBP) in human gene promoters, we found aggressiveness-related candidate SNP markers, including rs1143627 (associated with higher aggressiveness in patients undergoing cytokine immunotherapy), rs544850971 (higher aggressiveness in old women taking lipid-lowering medication), and rs10895068 (childhood aggressiveness-related obesity in adolescence with cardiovascular complications in adulthood). After validation of these candidate markers by clinical protocols, these SNPs may become useful for physicians (may help to improve treatment of patients) and for the general population (a lifestyle choice preventing aggressiveness-related complications).
SNP-VISTA: An interactive SNP visualization tool
Shah, Nameeta; Teplitsky, Michael V; Minovitsky, Simon; Pennacchio, Len A; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L
2005-01-01
Background Recent advances in sequencing technologies promise to provide a better understanding of the genetics of human disease as well as the evolution of microbial populations. Single Nucleotide Polymorphisms (SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it has become possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease in an attempt to identify causative mutations. In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples enables more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at [1]. Results We have developed and present two modifications of an interactive visualization tool, SNP-VISTA, to aid in the analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein evolutionary conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. Conclusion The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNP data by the user. PMID:16336665
Li, Yuan; Yang, Kai; Yang, Wei; Chu, Liwei; Chen, Chunhai; Zhao, Bo; Li, Yisong; Jian, Jianbo; Yin, Zhichao; Wang, Tianqi; Wan, Ping
2017-01-01
The adzuki bean (Vigna angularis) is an important grain legume. Fine mapping of quantitative trait loci (QTL) and qualitative trait genes plays an important role in gene cloning, molecular-marker-assisted selection (MAS), and trait improvement. However, the genetic control of agronomic traits in the adzuki bean remains poorly understood. Single-nucleotide polymorphisms (SNPs) are invaluable in the construction of high-density genetic maps. We mapped 26 agronomic QTLs and five qualitative trait genes related to pigmentation using 1,571 polymorphic SNP markers from the adzuki bean genome via restriction-site-associated DNA sequencing of 150 members of an F2 population derived from a cross between cultivated and wild adzuki beans. We mapped 11 QTLs for flowering time and pod maturity on chromosomes 4, 7, and 10. Six 100-seed weight (SD100WT) QTLs were detected. Two major flowering time QTLs were located on chromosome 4, firstly VaFld4.1 (PEVs 71.3%), co-segregating with SNP marker s690-144110, and VaFld4.2 (PEVs 67.6%) at a 0.974 cM genetic distance from the SNP marker s165-116310. Three QTLs for seed number per pod (Snp3.1, Snp3.2, and Snp4.1) were mapped on chromosomes 3 and 4. One QTL VaSdt4.1 of seed thickness (SDT) and three QTLs for branch number on the main stem were detected on chromosome 4. QTLs for maximum leaf width (LFMW) and stem internode length were mapped to chromosomes 2 and 9, respectively. Trait genes controlling the color of the seed coat, pod, stem and flower were mapped to chromosomes 3 and 1. Three candidate genes, VaAGL, VaPhyE, and VaAP2, were identified for flowering time and pod maturity. VaAGL encodes an agamous-like MADS-box protein of 379 amino acids. VaPhyE encodes a phytochrome E protein of 1,121 amino acids. Four phytochrome genes (VaPhyA1, VaPhyA2, VaPhyB, and VaPhyE) were identified in the adzuki bean genome. We found candidate genes VaAP2/ERF.81 and VaAP2/ERF.82 of SD100WT, VaAP2-s4 of SDT, and VaAP2/ERF.86 of LFMW. 
A candidate gene VaUGT related to black seed coat color was identified. These mapped QTL and qualitative trait genes provide information helpful for future adzuki bean candidate gene cloning and MAS breeding to improve cultivars with desirable growth periods, yields, and seed coat color types.
Jo, Jinkwan; Venkatesh, Jelli; Han, Koeun; Lee, Hea-Young; Choi, Gyung Ja; Lee, Hee Jae; Choi, Doil; Kang, Byoung-Cheorl
2017-01-01
Powdery mildew, caused by Leveillula taurica, is a major fungal disease affecting greenhouse-grown pepper (Capsicum annuum). Powdery mildew resistance has a complex mode of inheritance. In the present study, we investigated a novel powdery mildew resistance locus, PMR1, using two mapping populations: 102 ‘VK515' F2:3 families (derived from a cross between resistant parental line ‘VK515R' and susceptible parental line ‘VK515S') and 80 ‘PM Singang' F2 plants (derived from the F1 ‘PM Singang' commercial hybrid). Genetic analysis of the F2:3 ‘VK515' and F2 ‘PM Singang' populations revealed a single dominant locus for inheritance of the powdery mildew resistance trait. Genetic mapping showed that the PMR1 locus is located on syntenic regions of pepper chromosome 4 in a 4-Mb region between markers CZ2_11628 and HRM4.1.6 in ‘VK515R'. Six molecular markers including one SCAR marker and five SNP markers were localized to a region 0 cM from the PMR1 locus. Two putative nucleotide-binding site leucine-rich repeat (NBS-LRR)-type disease resistance genes were identified in this PMR1 region. Genotyping-by-sequencing (GBS) and genetic mapping analysis revealed suppressed recombination in the PMR1 region, perhaps due to alien introgression. In addition, a comparison of species-specific InDel markers as well as GBS-derived SNP markers indicated that C. baccatum represents a possible source of such alien introgression of powdery mildew resistance into ‘VK515R'. The molecular markers developed in this study will be especially helpful for marker-assisted selection in pepper breeding programs for powdery mildew resistance. PMID:29276524
Dong, Yan; Zhang, Yan; Xiao, Yonggui; Yan, Jun; Liu, Jindong; Wen, Weie; Zhang, Yong; Jing, Ruilian; Xia, Xianchun; He, Zhonghu
2016-05-01
We cloned TaSST genes, developed a gene-specific marker for TaSST-D1, and identified three QTL in the Doumai/Shi 4185 RIL population. TaSST-D1 is within one of the three QTL. Sucrose:sucrose-1-fructosyltransferase (1-SST), a critical enzyme in the fructan biosynthetic pathway, is significantly and positively associated with water soluble carbohydrate (WSC) content in bread wheat stems. In the present study, wheat 1-SST genes (TaSST) were isolated and located on chromosomes 4A, 7A and 7D. Sequence analysis of TaSST-D1 revealed 15 single nucleotide polymorphisms (SNP) in the third exon between cultivars with higher and lower WSC content. A cleaved amplified polymorphism sequence (CAPS) marker, WSC7D, based on the polymorphism at position 1216 (C-G) was developed to discriminate the two alleles. WSC7D was located on chromosome 7DS using a recombinant inbred line (RIL) population from a Doumai/Shi 4185 cross, and a set of Chinese Spring nullisomic-tetrasomic lines. TaSST-D1 co-segregated with the CAPS marker WSC7D and was linked to SNP marker BS00108793_51 on chromosome 7DS at a genetic distance of 6.1 cM. It explained 8.8, 10.9, and 11.3% of the phenotypic variances in trials at Beijing and Shijiazhuang as well as the averaged data from those environments, respectively. Two additional QTL (QWSC.caas-4BS and QWSC.caas-7AS) besides TaSST-D1 were mapped in the RIL population. One hundred and forty-nine Chinese wheat cultivars and advanced lines tested in four environments were used to validate a highly significant (P < 0.01) association between WSC7D and WSC content in wheat stems. WSC7D can be used as a gene-specific marker for improvement of stem WSC content in wheat breeding programs.
Using next generation sequencing for multiplexed trait-linked markers in wheat
USDA-ARS's Scientific Manuscript database
With the advent of next generation sequencing (NGS) technologies, single nucleotide polymorphisms (SNPs) have become the major type of marker for genotyping in many crops. However, the availability of SNP markers for important traits of bread wheat (Triticum aestivum L.) that can be effectively used...
Ganal, Martin W.; Durstewitz, Gregor; Polley, Andreas; Bérard, Aurélie; Buckler, Edward S.; Charcosset, Alain; Clarke, Joseph D.; Graner, Eva-Maria; Hansen, Mark; Joets, Johann; Le Paslier, Marie-Christine; McMullen, Michael D.; Montalent, Pierre; Rose, Mark; Schön, Chris-Carolin; Sun, Qi; Walter, Hildrun; Martin, Olivier C.; Falque, Matthieu
2011-01-01
SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations – IBM (B73×Mo17) and LHRF (F2×F252) – were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding. PMID:22174790
Klaften, Matthias; Hrabé de Angelis, Martin
2005-07-01
Genome-wide mapping in the identification of novel candidate genes has always been the standard method in genetics and genomics to correlate a clinically interesting phenotypic trait with a genotype. However, the performance of a mapping experiment using classical microsatellite approaches can be very time consuming. The high-throughput analysis of single-nucleotide polymorphisms (SNPs) has the potential of being the successor of microsatellite analysis routinely used for these mapping approaches, where one of the major obstacles is the design of the appropriate SNP marker set itself. Here we report on ARTS, an advanced retrieval tool for SNPs, which allows researchers to comb freely the public mouse dbSNP database for multiple reference and test strains. Several filters can be applied in order to improve the sensitivity and the specificity of the search results. By employing the panel generator function of this program, it is possible to abbreviate the extraction of reliable sequence data for a large marker panel including several different mouse strains from days to minutes. The concept of ARTS is easily adaptable to other species for which SNP databases are available, making it a versatile tool for the use of SNPs as markers for genotyping. The web interface is accessible at http://andromeda.gsf.de/arts.
Gilbey, John; Cauwelier, Eef; Coulson, Mark W.; Stradmeyer, Lee; Sampayo, James N.; Armstrong, Anja; Verspoor, Eric; Corrigan, Laura; Shelley, Jonathan; Middlemas, Stuart
2016-01-01
Understanding the habitat use patterns of migratory fish, such as Atlantic salmon (Salmo salar L.), and the natural and anthropogenic impacts on them, is aided by the ability to identify individuals to their stock of origin. Presented here are the results of an analysis of informative single nucleotide polymorphic (SNP) markers for detecting genetic structuring in Atlantic salmon in Scotland and NE England and their ability to allow accurate genetic stock identification. 3,787 fish from 147 sites covering 27 rivers were screened at 5,568 SNP markers. In order to identify a cost-effective subset of SNPs, they were ranked according to their ability to differentiate between fish from different rivers. A panel of 288 SNPs was used to examine both individual assignments and mixed stock fisheries and eighteen assignment units were defined. The results improved greatly on previously available methods and, for the first time, fish caught in the marine environment can be confidently assigned to geographically coherent units within Scotland and NE England, including individual rivers. As such, this SNP panel has the potential to aid understanding of the various influences acting upon Atlantic salmon on their marine migrations, be they natural environmental variations and/or anthropogenic impacts, such as mixed stock fisheries and interactions with marine power generation installations. PMID:27723810
Jiang, Rong; French, John E.; Stober, Vandy P.; Kang-Sickel, Juei-Chuan C.; Zou, Fei
2012-01-01
Background: Individual genetic variation that results in differences in systemic response to xenobiotic exposure is not accounted for as a predictor of outcome in current exposure assessment models. Objective: We developed a strategy to investigate individual differences in single-nucleotide polymorphisms (SNPs) as genetic markers associated with naphthyl–keratin adduct (NKA) levels measured in the skin of workers exposed to naphthalene. Methods: The SNP-association analysis was conducted in PLINK using candidate-gene analysis and genome-wide analysis. We identified significant SNP–NKA associations and investigated the potential impact of these SNPs along with personal and workplace factors on NKA levels using a multiple linear regression model and the Pratt index. Results: In candidate-gene analysis, a SNP (rs4852279) located near the CYP26B1 gene contributed to the 2-naphthyl–keratin adduct (2NKA) level. In the multiple linear regression model, the SNP rs4852279, dermal exposure, exposure time, task replacing foam, age, and ethnicity all were significant predictors of 2NKA level. In genome-wide analysis, no single SNP reached genome-wide significance for NKA levels (all p ≥ 1.05 × 10⁻⁵). Pathway and network analyses predicted that SNPs associated with NKA levels are involved in the regulation of cellular processes and homeostasis. Conclusions: These results provide evidence that a quantitative biomarker can be used as an intermediate phenotype when investigating the association between genetic markers and exposure–dose relationship in a small, well-characterized exposed worker population. PMID:22391508
The use of SNP data for the monitoring of genetic diversity in cattle breeds
USDA-ARS's Scientific Manuscript database
LD between SNPs contains information about effective population size. In this study, we investigate the use of genome-wide SNP data for marker based estimation of effective population size for two taurine cattle breeds of Africa and two local cattle breeds of Switzerland. Estimated recombination rat...
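The link between LD and effective population size mentioned above can be illustrated with Sved's classical approximation, E[r²] ≈ 1 / (1 + 4·Ne·c), where c is the recombination distance in Morgans. The sketch below is a minimal illustration under that assumption only; the abstract is truncated, so the estimator actually used in the study is not known here, and the function names are hypothetical.

```python
def expected_r2(ne: float, c: float) -> float:
    """Expected LD (r^2) between two loci under Sved's approximation.

    ne: effective population size; c: recombination distance in Morgans.
    """
    return 1.0 / (1.0 + 4.0 * ne * c)


def ne_from_ld(r2: float, c: float) -> float:
    """Solve E[r^2] = 1 / (1 + 4 * Ne * c) for Ne.

    This is an illustrative assumption, not the study's stated method.
    """
    if not 0.0 < r2 < 1.0:
        raise ValueError("r2 must lie strictly between 0 and 1")
    if c <= 0.0:
        raise ValueError("c must be positive")
    return (1.0 / r2 - 1.0) / (4.0 * c)


if __name__ == "__main__":
    # Hypothetical inputs: mean r^2 of 0.2 for marker pairs 1 cM (0.01 Morgans) apart.
    print(ne_from_ld(0.2, 0.01))
```

In practice, estimates of this kind are usually computed per distance bin, and a sample-size correction is often applied to r² first; both steps are omitted here.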
USDA-ARS's Scientific Manuscript database
Microsatellite markers (MS) have traditionally been used for parental verification and are still the international standard in spite of their higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP) -based assays. Despite domestic and international demands fr...
SNPConvert: SNP Array Standardization and Integration in Livestock Species.
Nicolazzi, Ezequiel Luis; Marras, Gabriele; Stella, Alessandra
2016-06-09
One of the main advantages of single nucleotide polymorphism (SNP) array technology is providing genotype calls for a specific number of SNP markers at a relatively low cost. Since its first application in animal genetics, the number of available SNP arrays for each species has been constantly increasing. However, unlike whole genome sequence data, SNP array data do not have a common set of file formats or coding conventions for allele calling. Therefore, the standardization and integration of SNP array data from multiple sources have become an obstacle, especially for users with basic or no programming skills. Here, we describe the difficulties related to handling SNP array data, focusing on file formats, SNP allele coding, and mapping. We also present the SNPConvert suite, a multi-platform, open-source, and user-friendly set of tools to overcome these issues. This tool, which can be integrated with open-source and open-access tools already available, is a first step towards an integrated system to standardize and integrate any type of raw SNP array data. The tool is available at: https://github.com/nicolazzie/SNPConvert.git.
Diversity in 113 cowpea [Vigna unguiculata (L) Walp] accessions assessed with 458 SNP markers.
Egbadzor, Kenneth F; Ofori, Kwadwo; Yeboah, Martin; Aboagye, Lawrence M; Opoku-Agyeman, Michael O; Danquah, Eric Y; Offei, Samuel K
2014-01-01
Single Nucleotide Polymorphism (SNP) markers were used to characterize 113 cowpea accessions, comprising 108 from Ghana and 5 from abroad. Leaf tissues from plants cultivated at the University of Ghana were genotyped at KBioscience in the United Kingdom. Data were generated for 477 SNPs, of which 458 revealed polymorphism. The results were used to analyze genetic dissimilarity among the accessions using DARwin 5 software. The markers discriminated among all of the cowpea accessions, and the dissimilarity values, which ranged from 0.006 to 0.63, were used for a factorial plot. Unexpectedly high levels of heterozygosity were observed in some of the accessions. Accessions known to be closely related clustered together in a dendrogram drawn with the WPGMA method. A maximum length sub-tree comprising 48 core accessions was constructed. The software package STRUCTURE was used to separate accessions into three groups, and the programme correctly identified varieties that were known hybrids. The hybrids were those accessions with numerous heterozygous loci. The STRUCTURE plot showed closely related accessions with similar genome patterns. The SNP markers were more efficient in discriminating among the cowpea germplasm than morphological, seed protein polymorphism and simple sequence repeat studies reported earlier on the same collection.
USDA-ARS's Scientific Manuscript database
High-throughput genotyping arrays provide a standardized resource for crop research communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), candidate marker and quantitative trait loci (QTL) ide...
Identification of bovine NPC1 gene cSNPs and their effects on body size traits of Qinchuan cattle.
Dang, Yonglong; Li, Mingxun; Yang, Mingjuan; Cao, Xiukai; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Lin, Qing; Chen, Hong
2014-05-01
NPC1 gene is an important gene closely related to the Niemann-Pick type C (NPC). Mutations in the NPC1 gene tend to cause Niemann-Pick type C, a lysosomal storage disorder. Previous studies have shown that NPC1 protein plays an important role in subcellular lipid transport, homeostasis, platelet function and formation, which are basic metabolic activities in the process of development. In this study, to explore the association between the NPC1 gene variation and body size traits in Qinchuan cattle, we detected four novel coding single nucleotide polymorphisms (cSNPs) in the bovine NPC1 gene, including one missense mutation (SNP1) and three synonymous mutations (SNP2, SNP3 and SNP4). Population genetic analyses of 518 individuals and association correlations between cSNPs and bovine body size traits were conducted in this research. A missense mutation at SNP1 locus was found to be significantly related to the heart girth, hip width and body weight (P<0.01 or P<0.05, 3.5-year-old). Two synonymous mutations at SNP2 and SNP3 loci also showed significant effects on hip width (P<0.05, 3.5-year-old). One synonymous mutation at SNP4 locus showed significant effect on body weight (P<0.05, 2.0-year-old). Combined haplotypes H2H6 and H6H6 showed significant effects on body size traits such as heart girth, hip width, and body weight (3.5-year-old, P<0.01 or P<0.05). This study provides evidence that the NPC1 gene might be involved in the regulation of bovine growth and body development, and may be considered as a candidate gene for marker assisted selection (MAS) in beef cattle breeding industry. Copyright © 2014. Published by Elsevier B.V.
Ain, Qurat-ul; Rasheed, Awais; Anwar, Alia; Mahmood, Tariq; Imtiaz, Muhammad; Mahmood, Tariq; Xia, Xianchun; He, Zhonghu; Quraishi, Umar M.
2015-01-01
Genome-wide association studies (GWAS) were undertaken to identify SNP markers associated with yield and yield-related traits in 123 Pakistani historical wheat cultivars evaluated during 2011–2014 seasons under rainfed field conditions. The population was genotyped by using high-density Illumina iSelect 90K single nucleotide polymorphism (SNP) assay, and finally 14,960 high quality SNPs were used in GWAS. Population structure examined using 1000 unlinked markers identified seven subpopulations (K = 7) that were representative of different breeding programs in Pakistan, in addition to local landraces. Forty-four stable marker-trait associations (MTAs) with -log p > 4 were identified for nine yield-related traits. Nine multi-trait MTAs were found on chromosomes 1AL, 1BS, 2AL, 2BS, 2BL, 4BL, 5BL, 6AL, and 6BL, and those on 5BL and 6AL were stable across two seasons. Gene annotation and synteny identified that 14 trait-associated SNPs were linked to genes having significant importance in plant development. Favorable alleles for days to heading (DH), plant height (PH), thousand grain weight (TGW), and grain yield (GY) showed minor additive effects and their frequencies were slightly higher in cultivars released after 2000. However, no selection pressure on any favorable allele was identified. These genomic regions identified have historically contributed to achieving yield gains from 2.63 million tons in 1947 to 25.7 million tons in 2015. Future breeding strategies can be devised to initiate marker assisted breeding to accumulate these favorable alleles of SNPs associated with yield-related traits to increase grain yield. Additionally, in silico identification of 454-contigs corresponding to MTAs will facilitate fine mapping and subsequent cloning of candidate genes and functional marker development. PMID:26442056
Fontanesi, L; Galimberti, G; Calò, D G; Fronza, R; Martelli, P L; Scotti, E; Colombo, M; Schiavo, G; Casadio, R; Buttazzoni, L; Russo, V
2012-08-01
Combining different approaches (resequencing of portions of 54 obesity candidate genes, literature mining for pig markers associated with fat deposition or related traits in 77 genes, and in silico mining of porcine expressed sequence tags and other sequences available in databases), we identified and analyzed 736 SNP within candidate genes to identify markers associated with back fat thickness (BFT) in Italian Large White sows. Animals were chosen using a selective genotyping approach according to their EBV for BFT (276 with most negative and 279 with most positive EBV) within a population of ≈ 12,000 pigs. Association analysis between the SNP and BFT has been carried out using the MAX test proposed for case-control studies. The designed assays were successful for 656 SNP: 370 were excluded (low call rate or minor allele frequency <5%), whereas the remaining 286 in 212 genes were taken for subsequent analyses, among which 64 showed a P(nominal) value <0.1. To deal with the multiple testing problem in a candidate gene approach, we applied the proportion of false positives (PFP) method. Thirty-eight SNP were significant (P(PFP) < 0.20). The most significant SNP was the IGF2 intron3-g.3072G>A polymorphism (P(nominal) < 1.0E-50). The second most significant SNP was the MC4R c.1426A>G polymorphism (P(nominal) = 8.0E-05). The third top SNP (P(nominal) = 6.2E-04) was the intronic TBC1D1 g.219G>A polymorphic site, in agreement with our previous results obtained in an independent study. The list of significant markers also included SNP in additional genes (ABHD16A, ABHD5, ACP2, ALMS1, APOA2, ATP1A2, CALR, COL14A1, CTSF, DARS, DECR1, ENPP1, ESR1, GH1, GHRL, GNMT, IKBKB, JAK3, MTTP, NFKBIA, NT5E, PLAT, PPARG, PPP2R5D, PRLR, RRAGD, RFC2, SDHD, SERPINF1, UBE2H, VCAM1, and WAT). Functional relationships between genes were obtained using the Ingenuity Pathway Analysis (IPA) Knowledge Base. 
The top scoring pathway included 19 genes with a P(nominal) < 0.1, 2 of which (IKBKB and NFKBIA) are involved in the hypothalamic IKKβ/NFκB program that could represent a key axis to affect fat deposition traits in pigs. These results represent a starting point to plan marker-assisted selection in Italian Large White nuclei for BFT. Because of similarities between humans and pigs, this study might also provide useful clues to investigate genetic factors affecting human obesity.
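The marker exclusion step described above (low call rate or minor allele frequency below 5%) can be sketched as follows. This is a minimal illustration, not the authors' pipeline: the genotype coding (0, 1, 2 copies of one allele, `None` for a failed call), the function names, and the 0.90 call-rate threshold are assumptions; only the 5% MAF cut-off comes from the abstract.

```python
from typing import Optional, Sequence

Genotype = Optional[int]  # 0, 1 or 2 copies of allele B; None = no call


def call_rate(genotypes: Sequence[Genotype]) -> float:
    """Fraction of samples with a successful genotype call."""
    if not genotypes:
        return 0.0
    called = [g for g in genotypes if g is not None]
    return len(called) / len(genotypes)


def minor_allele_frequency(genotypes: Sequence[Genotype]) -> float:
    """Frequency of the rarer allele among called genotypes."""
    called = [g for g in genotypes if g is not None]
    if not called:
        return 0.0
    freq_b = sum(called) / (2 * len(called))
    return min(freq_b, 1.0 - freq_b)


def passes_qc(genotypes: Sequence[Genotype],
              min_call_rate: float = 0.90,  # assumed threshold, not from the source
              min_maf: float = 0.05) -> bool:  # 5% cut-off as in the abstract
    """Keep a SNP only if both call rate and MAF meet the thresholds."""
    return (call_rate(genotypes) >= min_call_rate
            and minor_allele_frequency(genotypes) >= min_maf)


if __name__ == "__main__":
    snp = [0, 1, 2, 1, 0, 1, 1, 0, 2, 1]  # hypothetical genotypes for one SNP
    print(call_rate(snp), minor_allele_frequency(snp), passes_qc(snp))
```

Applying `passes_qc` to each marker in turn would reproduce the kind of split reported above (markers excluded versus markers retained for association analysis).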
Analysis of high-order SNP barcodes in mitochondrial D-loop for chronic dialysis susceptibility.
Yang, Cheng-Hong; Lin, Yu-Da; Chuang, Li-Yeh; Chang, Hsueh-Wei
2016-10-01
Positively identifying disease-associated single nucleotide polymorphism (SNP) markers in genome-wide studies entails the complex association analysis of a huge number of SNPs. Such large numbers of SNP barcode (SNP/genotype combinations) continue to pose serious computational challenges, especially for high-dimensional data. We propose a novel exploiting SNP barcode method based on differential evolution, termed IDE (improved differential evolution). IDE uses a "top combination strategy" to improve the ability of differential evolution to explore high-order SNP barcodes in high-dimensional data. We simulate disease data and use real chronic dialysis data to test four global optimization algorithms. In 48 simulated disease models, we show that IDE outperforms existing global optimization algorithms in terms of exploring ability and power to detect the specific SNP/genotype combinations with a maximum difference between cases and controls. In real data, we show that IDE can be used to evaluate the relative effects of each individual SNP on disease susceptibility. IDE generated significant SNP barcode with less computational complexity than the other algorithms, making IDE ideally suited for analysis of high-order SNP barcodes. Copyright © 2016 Elsevier Inc. All rights reserved.
Winkler, Cheryl A.; Li, Ji; Guan, Li; Tang, Minzhong; Liao, Jian; Deng, Hong; de Thé, Guy; Zeng, Yi; O'Brien, Stephen J.
2014-01-01
Genetic factors, as well as environmental factors, play a role in development of nasopharyngeal carcinoma (NPC). A number of single nucleotide polymorphisms (SNPs) have been reported to be associated with NPC. To confirm these genetic associations with NPC, two independent case-control studies from Southern China comprising 1166 NPC cases and 2340 controls were conducted. Seven SNPs in ITGA9 at 3p21.3 and 9 SNPs within the 6p21.3 HLA region were genotyped. To explore the potential clinical application of these genetic markers in NPC, we further evaluated the predictive/diagnostic role of significant SNPs by calculating the area under the curve (AUC). The reported associations between ITGA9 variants and NPC were not replicated. Multiple loci of GABBR1, HLA-F, HLA-A, and HCG9 were statistically significant in both cohorts (P(combined) range from 5.96 × 10⁻¹⁷ to 0.02). We show for the first time that these factors influence NPC development independent of environmental risk factors. This study also indicated that the SNP alone cannot serve as a predictive/diagnostic marker for NPC. Integrating the most significant SNP with IgA antibodies status to EBV, which is presently used as screening/diagnostic marker for NPC in Chinese populations, did not improve the AUC estimate for diagnosis of NPC. PMID:25180181
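The AUC used above to judge a marker's predictive/diagnostic value equals the probability that a randomly chosen case scores higher than a randomly chosen control, with ties counted as one half. The sketch below computes it directly from that definition; it is an illustration only, and using the risk-allele count (0, 1, 2) as the score is an assumption, not the scoring described in the study.

```python
from typing import Sequence


def auc(case_scores: Sequence[float], control_scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison (Mann-Whitney form).

    Counts case/control pairs where the case has the higher score;
    tied pairs contribute 0.5.
    """
    if not case_scores or not control_scores:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for x in case_scores:
        for y in control_scores:
            if x > y:
                wins += 1.0
            elif x == y:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


if __name__ == "__main__":
    # Hypothetical risk-allele counts for three cases and three controls.
    print(auc([2, 1, 1], [0, 1, 0]))
```

An AUC near 0.5 means the score separates cases from controls no better than chance, which is the sense in which a single SNP can be statistically associated with disease yet still be a poor diagnostic marker.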
Guo, Xiuchan; Winkler, Cheryl A; Li, Ji; Guan, Li; Tang, Minzhong; Liao, Jian; Deng, Hong; de Thé, Guy; Zeng, Yi; O'Brien, Stephen J
2014-01-01
Genetic factors, as well as environmental factors, play a role in development of nasopharyngeal carcinoma (NPC). A number of single nucleotide polymorphisms (SNPs) have been reported to be associated with NPC. To confirm these genetic associations with NPC, two independent case-control studies from Southern China comprising 1166 NPC cases and 2340 controls were conducted. Seven SNPs in ITGA9 at 3p21.3 and 9 SNPs within the 6p21.3 HLA region were genotyped. To explore the potential clinical application of these genetic markers in NPC, we further evaluated the predictive/diagnostic role of significant SNPs by calculating the area under the curve (AUC). The reported associations between ITGA9 variants and NPC were not replicated. Multiple loci of GABBR1, HLA-F, HLA-A, and HCG9 were statistically significant in both cohorts (P(combined) range from 5.96 × 10⁻¹⁷ to 0.02). We show for the first time that these factors influence NPC development independent of environmental risk factors. This study also indicated that the SNP alone cannot serve as a predictive/diagnostic marker for NPC. Integrating the most significant SNP with IgA antibodies status to EBV, which is presently used as screening/diagnostic marker for NPC in Chinese populations, did not improve the AUC estimate for diagnosis of NPC.
A massively parallel strategy for STR marker development, capture, and genotyping.
Kistler, Logan; Johnson, Stephen M; Irwin, Mitchell T; Louis, Edward E; Ratan, Aakrosh; Perry, George H
2017-09-06
Short tandem repeat (STR) variants are highly polymorphic markers that facilitate powerful population genetic analyses. STRs are especially valuable in conservation and ecological genetic research, yielding detailed information on population structure and short-term demographic fluctuations. Massively parallel sequencing has not previously been leveraged for scalable, efficient STR recovery. Here, we present a pipeline for developing STR markers directly from high-throughput shotgun sequencing data without a reference genome, and an approach for highly parallel target STR recovery. We employed our approach to capture a panel of 5000 STRs from a test group of diademed sifakas (Propithecus diadema, n = 3), endangered Malagasy rainforest lemurs, and we report extremely efficient recovery of targeted loci: 97.3-99.6% of STRs characterized with ≥10x non-redundant sequence coverage. We then tested our STR capture strategy on P. diadema fecal DNA, and report robust initial results and suggestions for future implementations. In addition to STR targets, this approach also generates large, genome-wide single nucleotide polymorphism (SNP) panels from flanking regions. Our method provides a cost-effective and scalable solution for rapid recovery of large STR and SNP datasets in any species without needing a reference genome, and can be used even with suboptimal DNA more easily acquired in conservation and ecological studies. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.
Lu, Fu-Hao; Kwon, Soon-Wook; Yoon, Min-Young; Kim, Ki-Taek; Cho, Myeong-Cheoul; Yoon, Moo-Kyung; Park, Yong-Jin
2012-01-01
Red pepper, Capsicum annuum L., has been attracting geneticists’ and breeders’ attention as one of the important agronomic crops. This study aimed to integrate 41 SNP markers newly developed from comparative transcriptomes into a previous linkage map, and to map 12 agronomic and morphological traits onto the integrated map. A total of 39 markers were placed at precise positions and assigned to 13 linkage groups (LGs) as well as the unassigned LGe, giving a total of 458 molecular markers in this genetic map. Linkage mapping was supported by physical mapping to the tomato and potato genomes using BLAST searches, which showed that at least two-thirds of the markers mapped to the corresponding LGs. A total of 23 quantitative trait loci for 11 traits were detected using the composite interval mapping algorithm. A consistent interval between a035_1 and a170_1 on LG5 was detected as a main-effect locus among the resistance QTLs to Phytophthora capsici at high-, intermediate- and low-level tests, and interactions between the QTLs for the high-level resistance test were found. Considering the epistatic effect, those QTLs could explain up to 98.25% of the phenotypic variation in resistance. Moreover, 17 QTLs for another eight traits were located mostly on LG3, 4, and 12, with varying phenotypic contributions. Furthermore, the locus for corolla color was mapped to LG10 as a marker. The integrated map and the QTLs identified should be helpful for current genetics research and crop breeding, especially in the Solanaceae family. PMID:22684870
Piedra, María; Berja, Ana; García-Unzueta, María Teresa; Ramos, Laura; Valero, Carmen; Amado, José Antonio
2015-01-01
The CLDN14 gene encodes a protein involved in the regulation of paracellular permeability or ion transport at epithelial tight junctions as in the nephron. The C allele of the rs219780 SNP (single nucleotide polymorphism) of CLDN14 has been associated with renal lithiasis, high levels of parathormone (PTH), and with low bone mineral density (BMD) in healthy women. Our aim is to study the relationship between rs219780 SNP of CLDN14 and renal lithiasis, fractures, and BMD in patients with primary hyperparathyroidism (PHPT). We enrolled 298 Caucasian patients with PHPT and 328 healthy volunteers in a cross-sectional study. We analysed anthropometric data, history of fractures or kidney stones, biochemical parameters including markers for bone remodelling, abdominal ultrasound, and BMD and genotyping for the rs219780 SNP of CLDN14. We did not find any difference in the frequency of fractures or renal lithiasis between the genotype groups in PHPT patients. Moreover, we did not find any relationship between the T or C alleles and BMD or biochemical parameters. rs219780 SNP of CLDN14 does not appear to be a risk factor for the development of PHPT nor does it seem to influence the clinical expression of PHPT.
USDA-ARS's Scientific Manuscript database
Breeding and selection for the traits with polygenic inheritance is a challenging task that can be done by phenotypic selection, by marker-assisted selection or by genome wide selection. We tested predictive ability of four selection models in a biparental population genotyped with 95 SNP markers an...
Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S; Jaing, C
2012-03-27
The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interim report, we describe the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.
Fine Mapping of Ur-3, a Historically Important Rust Resistance Locus in Common Bean
Hurtado-Gonzales, Oscar P.; Valentini, Giseli; Gilio, Thiago A. S.; Martins, Alexandre M.; Song, Qijian; Pastor-Corrales, Marcial A.
2016-01-01
Bean rust, caused by Uromyces appendiculatus, is a devastating disease of common bean (Phaseolus vulgaris) in the Americas and Africa. The historically important Ur-3 gene confers resistance to many races of the highly variable bean rust pathogen that overcome other rust resistance genes. Existing molecular markers tagging Ur-3 for use in marker-assisted selection produce false results. Here, we describe the fine mapping of the Ur-3 locus for the development of highly accurate markers linked to Ur-3. An F2 population from the cross Pinto 114 (susceptible) × Aurora (resistant with Ur-3) was evaluated for its reaction to four different races of U. appendiculatus. A bulked segregant analysis using the SNP chip BARCBEAN6K_3 placed the approximate location of Ur-3 in the lower arm of chromosome Pv11. Specific SSR and SNP markers and haplotype analysis of 18 sequenced bean varieties positioned Ur-3 in a 46.5 kb genomic region from 46.96 to 47.01 Mb on Pv11. We discovered in this region the SS68 KASP marker that was tightly linked to Ur-3. Validation of SS68 on a panel of 130 diverse common bean cultivars containing all known rust resistance genes revealed that SS68 was highly accurate and produced no false results. The SS68 marker will be of great value in pyramiding Ur-3 with other rust resistance genes. It will also significantly reduce time and labor associated with the current phenotypic detection of Ur-3. This is the first utilization of fine mapping to discover markers linked to rust resistance in common bean. PMID:28031244
Fine Mapping of Ur-3, a Historically Important Rust Resistance Locus in Common Bean.
Hurtado-Gonzales, Oscar P; Valentini, Giseli; Gilio, Thiago A S; Martins, Alexandre M; Song, Qijian; Pastor-Corrales, Marcial A
2017-02-09
Bean rust, caused by Uromyces appendiculatus, is a devastating disease of common bean (Phaseolus vulgaris) in the Americas and Africa. The historically important Ur-3 gene confers resistance to many races of the highly variable bean rust pathogen that overcome other rust resistance genes. Existing molecular markers tagging Ur-3 for use in marker-assisted selection produce false results. Here, we describe the fine mapping of the Ur-3 locus for the development of highly accurate markers linked to Ur-3. An F2 population from the cross Pinto 114 (susceptible) × Aurora (resistant with Ur-3) was evaluated for its reaction to four different races of U. appendiculatus. A bulked segregant analysis using the SNP chip BARCBEAN6K_3 placed the approximate location of Ur-3 in the lower arm of chromosome Pv11. Specific SSR and SNP markers and haplotype analysis of 18 sequenced bean varieties positioned Ur-3 in a 46.5 kb genomic region from 46.96 to 47.01 Mb on Pv11. We discovered in this region the SS68 KASP marker that was tightly linked to Ur-3. Validation of SS68 on a panel of 130 diverse common bean cultivars containing all known rust resistance genes revealed that SS68 was highly accurate and produced no false results. The SS68 marker will be of great value in pyramiding Ur-3 with other rust resistance genes. It will also significantly reduce time and labor associated with the current phenotypic detection of Ur-3. This is the first utilization of fine mapping to discover markers linked to rust resistance in common bean. Copyright © 2017 Hurtado-Gonzales et al.
Fonseca, João Eurico; Cavaleiro, João; Teles, José; Sousa, Elsa; Andreozzi, Valeska L; Antunes, Marília; Amaral-Turkman, Maria A; Canhão, Helena; Mourão, Ana F; Lopes, Joana; Caetano-Lopes, Joana; Weinmann, Pamela; Sobral, Marta; Nero, Patrícia; Saavedra, Maria J; Malcata, Armando; Cruz, Margarida; Melo, Rui; Braña, Araceli; Miranda, Luis; Patto, José V; Barcelos, Anabela; da Silva, José Canas; Santos, Luís M; Figueiredo, Guilherme; Rodrigues, Mário; Jesus, Herberto; Quintal, Alberto; Carvalho, Teresa; da Silva, José A Pereira; Branco, Jaime; Queiroz, Mário Viana
2007-01-01
The objective of this study was to assess whether clinical measures of rheumatoid arthritis activity and severity were influenced by tumor necrosis factor-alpha (TNF-α) promoter genotype/haplotype markers. Each patient's disease activity was assessed by the disease activity score using 28 joint counts (DAS28) and functional capacity by the Health Assessment Questionnaire (HAQ) score. Systemic manifestations, radiological damage evaluated by the Sharp/van der Heijde (SvdH) score, disease-modifying anti-rheumatic drug use, joint surgeries, and work disability were also assessed. The promoter region of the TNF-α gene, between nucleotides -1,318 and +49, was sequenced using an automated platform. Five hundred fifty-four patients were evaluated and genotyped for 10 single-nucleotide polymorphism (SNP) markers, but 5 of these markers were excluded due to failure to fall within Hardy-Weinberg equilibrium or to monomorphism. Patients with more than 10 years of disease duration (DD) presented significant associations between the -857 SNP and systemic manifestations, as well as joint surgeries. Associations were also found between the -308 SNP and work disability in patients with more than 2 years of DD and radiological damage in patients with less than 10 years of DD. A borderline effect was found between the -238 SNP and HAQ score and radiological damage in patients with 2 to 10 years of DD. An association was also found between haplotypes and the SvdH score for those with more than 10 years of DD. An association was found between some TNF-α promoter SNPs and systemic manifestations, radiological progression, HAQ score, work disability, and joint surgeries, particularly in some classes of DD and between haplotypes and radiological progression for those with more than 10 years of DD. PMID:17408492
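The exclusion of markers that fail Hardy-Weinberg equilibrium or are monomorphic, as described above, is commonly done with a one-degree-of-freedom chi-square test on genotype counts. The sketch below shows that standard test; it is an assumption that a test of this form was used, since the abstract does not name the method or its significance threshold, and the function names are hypothetical.

```python
def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int) -> float:
    """Chi-square statistic comparing observed genotype counts with
    Hardy-Weinberg expectations (p^2, 2pq, q^2) for a biallelic SNP."""
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expectation (monomorphic markers).
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


def is_monomorphic(n_aa: int, n_ab: int, n_bb: int) -> bool:
    """True when only one allele is observed in the sample."""
    return n_ab == 0 and (n_aa == 0 or n_bb == 0)


if __name__ == "__main__":
    # Hypothetical genotype counts; 3.841 is the chi-square critical value
    # for 1 degree of freedom at alpha = 0.05 (an assumed threshold).
    stat = hwe_chi_square(50, 0, 50)
    print(stat, stat > 3.841)
```

For small samples or rare alleles, an exact test is generally preferred over the chi-square approximation.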
Bengtsson, Therése; Åhman, Inger; Manninen, Outi; Reitan, Lars; Christerson, Therese; Due Jensen, Jens; Krusell, Lene; Jahoor, Ahmed; Orabi, Jihad
2017-01-01
The powdery mildew fungus, Blumeria graminis f. sp. hordei is a worldwide threat to barley (Hordeum vulgare L. ssp. vulgare) production. One way to control the disease is by the development and deployment of resistant cultivars. A genome-wide association study was performed in a Nordic spring barley panel consisting of 169 genotypes, to identify marker-trait associations significant for powdery mildew. Powdery mildew was scored during three years (2012–2014) in four different locations within the Nordic region. There were strong correlations between data from all locations and years. In total four QTLs were identified, one located on chromosome 4H in the same region as the previously identified mlo locus and three on chromosome 6H. Out of these three QTLs identified on chromosome 6H, two are in the same region as previously reported QTLs for powdery mildew resistance, whereas one QTL appears to be novel. The top NCBI BLASTn hit of the SNP markers within the novel QTL predicted the responsible gene to be the 26S proteasome regulatory subunit, RPN1, which is required for innate immunity and powdery mildew-induced cell death in Arabidopsis. The results from this study have revealed SNP marker candidates that can be exploited for use in marker-assisted selection and stacking of genes for powdery mildew resistance in barley. PMID:29184565
USDA-ARS's Scientific Manuscript database
One focus of the Sorghum Translational Genomics Lab (part of sorghum CRIS, PSGD, CSRL, USDA-ARS, Lubbock TX) is to utilize nucleotide variation between sorghum germplasm such as those derived from RNA seq for translation and validation of Single Nucleotide Polymorphism (SNP) into easy access DNA m...
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Gretchen H. Roffler; Stephen J. Amish; Seth Smith; Ted Cosart; Marty Kardos; Michael K. Schwartz; Gordon Luikart
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding...
USDA-ARS's Scientific Manuscript database
Stemphylium leaf spot, caused by Stemphylium botryosum f. sp. spinacia is an important disease in spinach. Use of genetic resistance is an efficient, economic and environment-friendly method to control this disease. The objective of this research was to conduct association analysis and identify SNP ...
Bertolini, F; Galimberti, G; Schiavo, G; Mastrangelo, S; Di Gerlando, R; Strillacci, M G; Bagnato, A; Portolano, B; Fontanesi, L
2018-01-01
Commercial single nucleotide polymorphism (SNP) arrays have been recently developed for several species and can be used to identify informative markers to differentiate breeds or populations for several downstream applications. To identify the most discriminating genetic markers among thousands of genotyped SNPs, a few statistical approaches have been proposed. In this work, we compared several methods of SNP preselection (Delta, FST and principal component analyses (PCA)) in addition to Random Forest classifications to analyse SNP data from six dairy cattle breeds, including cosmopolitan (Holstein, Brown and Simmental) and autochthonous Italian breeds raised in two different regions and subjected to limited or no breeding programmes (Cinisara and Modicana, raised only in Sicily, and Reggiana, raised only in Emilia Romagna). From these classifications, two panels of 96 and 48 SNPs that contain the most discriminant SNPs were created for each preselection method. These panels were evaluated in terms of the ability to discriminate as a whole and breed-by-breed, as well as linkage disequilibrium within each panel. The obtained results showed that for the 48-SNP panel, the error rate increased mainly for autochthonous breeds, probably as a consequence of their admixed origin, lower selection pressure and ascertainment bias in the construction of the SNP chip. The 96-SNP panels were generally better able to discriminate all breeds. The panel derived by PCA-chrom (obtained by a preselection chromosome by chromosome) could identify informative SNPs that were particularly useful for the assignment of minor breeds that reached the lowest value of Out Of Bag error even in the Cinisara, whose value was quite high in all other panels. Moreover, this panel also contained the lowest number of SNPs in linkage disequilibrium. Several selected SNPs are located near genes affecting breed-specific phenotypic traits (coat colour and stature) or associated with production traits. In general, our results demonstrated the usefulness of Random Forest in combination with other reduction techniques to identify population informative SNPs.
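One of the preselection statistics named in the abstract above, Delta, is commonly taken to be the absolute allele-frequency difference between two populations. The sketch below ranks SNPs on that basis for a single pair of breeds; the 0/1/2 genotype coding, the function names and the SNP identifiers are assumptions for illustration, and the study's six-breed pipeline is not reproduced here.

```python
def allele_freq(genotypes):
    """Frequency of the counted allele from genotypes coded 0, 1 or 2 copies."""
    return sum(genotypes) / (2 * len(genotypes))


def rank_by_delta(breed_a, breed_b, top_k):
    """Return the top_k SNP ids with the largest |p_a - p_b|.

    breed_a and breed_b map a SNP id to a list of genotype codes (0/1/2)
    for the animals of that breed. Only SNPs present in both are compared.
    """
    shared = [snp for snp in breed_a if snp in breed_b]
    deltas = {
        snp: abs(allele_freq(breed_a[snp]) - allele_freq(breed_b[snp]))
        for snp in shared
    }
    return sorted(deltas, key=lambda snp: deltas[snp], reverse=True)[:top_k]


if __name__ == "__main__":
    # Hypothetical toy data: three SNPs, four animals per breed.
    breed_a = {"snp1": [2, 2, 2, 2], "snp2": [1, 1, 1, 1], "snp3": [0, 1, 0, 1]}
    breed_b = {"snp1": [0, 0, 0, 0], "snp2": [1, 1, 0, 2], "snp3": [2, 2, 1, 1]}
    print(rank_by_delta(breed_a, breed_b, top_k=2))
```

The retained SNPs would then be passed to a classifier such as Random Forest; that step is left out because it depends on an external library.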
The clinical application of single-sperm-based SNP haplotyping for PGD of osteogenesis imperfecta.
Chen, Linjun; Diao, Zhenyu; Xu, Zhipeng; Zhou, Jianjun; Yan, Guijun; Sun, Haixiang
2018-05-15
Osteogenesis imperfecta (OI) is a genetically heterogeneous disorder, presenting either autosomal dominant, autosomal recessive or X-linked inheritance patterns. The majority of OI cases are autosomal dominant and are caused by heterozygous mutations in either the COL1A1 or COL1A2 gene. In these dominant disorders, allele dropout (ADO) can lead to misdiagnosis in preimplantation genetic diagnosis (PGD). Polymorphic markers linked to the mutated genes have been used to establish haplotypes for identifying ADO and ensuring the accuracy of PGD. However, the haplotype of male patients cannot be determined without data from affected relatives. Here, we developed a method for single-sperm-based single-nucleotide polymorphism (SNP) haplotyping via next-generation sequencing (NGS) for the PGD of OI. After NGS, 10 informative polymorphic SNP markers located upstream and downstream of the COL1A1 gene and its pathogenic mutation site were linked to individual alleles in a single sperm from an affected male. After haplotyping, a normal blastocyst was transferred to the uterus for a subsequent frozen embryo transfer cycle. The accuracy of PGD was confirmed by amniocentesis at 19 weeks of gestation. A healthy infant weighing 4,250 g was born via vaginal delivery at the 40th week of gestation. Single-sperm-based SNP haplotyping can be applied for PGD of any monogenic disorders or de novo mutations in males in whom the haplotype of paternal mutations cannot be determined due to a lack of affected relatives. 
ADO: allele dropout; DI: dentinogenesis imperfecta; ESHRE: European Society of Human Reproduction and Embryology; FET: frozen embryo transfer; gDNA: genomic DNA; ICSI: intracytoplasmic sperm injection; IVF: in vitro fertilization; MDA: multiple displacement amplification; NGS: next-generation sequencing; OI: osteogenesis imperfecta; PBS: phosphate buffer saline; PCR: polymerase chain reaction; PGD: preimplantation genetic diagnosis; SNP: single-nucleotide polymorphism; STR: short tandem repeat; TE: trophectoderm; WGA: whole-genome amplification.
2009-01-01
Background Expressed sequence tags (ESTs) are an important source of gene-based markers such as those based on insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). Several gel based methods have been reported for the detection of sequence variants; however, they have not been widely exploited in common bean, an important legume crop of the developing world. The objectives of this project were to develop and map EST based markers using analysis of single strand conformation polymorphisms (SSCPs), to create a transcript map for common bean and to compare synteny of the common bean map with sequenced chromosomes of other legumes. Results A set of 418 EST based amplicons were evaluated for parental polymorphisms using the SSCP technique and 26% of these presented a clear conformational or size polymorphism between Andean and Mesoamerican genotypes. The amplicon based markers were then used for genetic mapping with segregation analysis performed in the DOR364 × G19833 recombinant inbred line (RIL) population. A total of 118 new marker loci were placed into an integrated molecular map for common bean consisting of 288 markers. Of these, 218 were used for synteny analysis and 186 presented homology with segments of the soybean genome with an e-value lower than 7 × 10^-12. The synteny analysis with soybean showed a mosaic pattern of syntenic blocks with most segments of any one common bean linkage group associated with two soybean chromosomes. The analysis with Medicago truncatula and Lotus japonicus presented fewer syntenic regions, consistent with the more distant phylogenetic relationship between the galegoid and phaseoloid legumes. Conclusion The SSCP technique is a useful and inexpensive alternative to other SNP or Indel detection techniques for saturating the common bean genetic map with functional markers that may be useful in marker assisted selection. In addition, the genetic markers based on ESTs allowed the construction of a transcript map and, given their high conservation between species, allowed synteny comparisons to be made to sequenced genomes. This synteny analysis may support positional cloning of target genes in common bean through the use of genomic information from these other legumes. PMID:20030833
Erbe, M; Hayes, B J; Matukumalli, L K; Goswami, S; Bowman, P J; Reich, C M; Mason, B A; Goddard, M E
2012-07-01
Achieving accurate genomic estimated breeding values for dairy cattle requires a very large reference population of genotyped and phenotyped individuals. Assembling such reference populations has been achieved for breeds such as Holstein, but is challenging for breeds with fewer individuals. An alternative is to use a multi-breed reference population, such that smaller breeds gain some advantage in accuracy of genomic estimated breeding values (GEBV) from information from larger breeds. However, this requires that marker-quantitative trait loci associations persist across breeds. Here, we assessed the gain in accuracy of GEBV in Jersey cattle as a result of using a combined Holstein and Jersey reference population, with either 39,745 or 624,213 single nucleotide polymorphism (SNP) markers. The surrogate used for accuracy was the correlation of GEBV with daughter trait deviations in a validation population. Two methods were used to predict breeding values, either a genomic BLUP (GBLUP_mod), or a new method, BayesR, which used a mixture of normal distributions as the prior for SNP effects, including one distribution that set SNP effects to zero. The GBLUP_mod method scaled both the genomic relationship matrix and the additive relationship matrix to a base at the time the breeds diverged, and regressed the genomic relationship matrix to account for sampling errors in estimating relationship coefficients due to a finite number of markers, before combining the 2 matrices. Although these modifications did result in less biased breeding values for Jerseys compared with an unmodified genomic relationship matrix, BayesR gave the highest accuracies of GEBV for the 3 traits investigated (milk yield, fat yield, and protein yield), with an average increase in accuracy compared with GBLUP_mod across the 3 traits of 0.05 for both Jerseys and Holsteins. 
The advantage was limited for either Jerseys or Holsteins in using 624,213 SNP rather than 39,745 SNP (0.01 for Holsteins and 0.03 for Jerseys, averaged across traits). Even this limited and nonsignificant advantage was only observed when BayesR was used. An alternative panel, which extracted the SNP in the transcribed part of the bovine genome from the 624,213 SNP panel (to give 58,532 SNP), performed better, with an increase in accuracy of 0.03 for Jerseys across traits. This panel captures much of the increased genomic content of the 624,213 SNP panel, with the advantage of a greatly reduced number of SNP effects to estimate. Taken together, using this panel, a combined breed reference and using BayesR rather than GBLUP_mod increased the accuracy of GEBV in Jerseys from 0.43 to 0.52, averaged across the 3 traits. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
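The abstract above uses the correlation of GEBV with daughter trait deviations in a validation population as its surrogate for accuracy. A minimal sketch of that calculation follows, assuming a plain Pearson correlation over paired values; the function name and the example numbers are hypothetical, and none of the study's GBLUP_mod or BayesR machinery is shown.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length sequences of numbers."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0.0 or syy == 0.0:
        raise ValueError("correlation undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical validation animals: predicted GEBV vs. daughter trait deviation.
    gebv = [0.8, -0.2, 1.5, 0.1, -1.0]
    dtd = [1.1, 0.3, 0.9, -0.4, -0.7]
    print(round(pearson(gebv, dtd), 3))
```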
SNP discovery in the bovine milk transcriptome using RNA-Seq technology.
Cánovas, Angela; Rincon, Gonzalo; Islas-Trejo, Alma; Wickramasinghe, Saumya; Medrano, Juan F
2010-12-01
High-throughput sequencing of RNA (RNA-Seq) was developed primarily to analyze global gene expression in different tissues. However, it also is an efficient way to discover coding SNPs. The objective of this study was to perform a SNP discovery analysis in the milk transcriptome using RNA-Seq. Seven milk samples from Holstein cows were analyzed by sequencing cDNAs using the Illumina Genome Analyzer system. We detected 19,175 genes expressed in milk samples corresponding to approximately 70% of the total number of genes analyzed. The SNP detection analysis revealed 100,734 SNPs in Holstein samples, and a large number of those corresponded to differences between the Holstein breed and the Hereford bovine genome assembly Btau4.0. The number of polymorphic SNPs within Holstein cows was 33,045. The accuracy of RNA-Seq SNP discovery was tested by comparing SNPs detected in a set of 42 candidate genes expressed in milk that had been resequenced earlier using Sanger sequencing technology. Seventy of 86 SNPs were detected using both RNA-Seq and Sanger sequencing technologies. The KASPar Genotyping System was used to validate unique SNPs found by RNA-Seq but not observed by Sanger technology. Our results confirm that analyzing the transcriptome using RNA-Seq technology is an efficient and cost-effective method to identify SNPs in transcribed regions. This study creates guidelines to maximize the accuracy of SNP discovery and prevention of false-positive SNP detection, and provides more than 33,000 SNPs located in coding regions of genes expressed during lactation that can be used to develop genotyping platforms to perform marker-trait association studies in Holstein cattle.
Martínez-García, Pedro J; Fresnedo-Ramírez, Jonathan; Parfitt, Dan E; Gradziel, Thomas M; Crisosto, Carlos H
2013-01-01
Single nucleotide polymorphisms (SNPs) are a fundamental source of genomic variation. Large SNP panels have been developed for Prunus species. Fruit quality traits are essential peach breeding program objectives since they determine consumer acceptance, fruit consumption, industry trends and cultivar adoption. For many cultivars, these traits are negatively impacted by cold storage, used to extend fruit market life. The major symptoms of chilling injury are lack of flavor, off flavor, mealiness, flesh browning, and flesh bleeding. A set of 1,109 SNPs was mapped previously and 67 were linked with these complex traits. The prediction of the effects associated with these SNPs on downstream products from the 'peach v1.0' genome sequence was carried out. A total of 2,163 effects were detected, 282 effects (non-synonymous, synonymous or stop codon gained) were located in exonic regions (13.04 %) and 294 placed in intronic regions (13.59 %). An extended list of genes and proteins that could be related to these traits was developed. Two SNP markers that explain a high percentage of the observed phenotypic variance, UCD_SNP_1084 and UCD_SNP_46, are associated with zinc finger (C3HC4-type RING finger) family protein and AOX1A (alternative oxidase 1a) protein groups, respectively. In addition, phenotypic variation suggests that the observed polymorphism for SNP UCD_SNP_1084 [A/G] mutation could be a candidate quantitative trait nucleotide affecting quantitative trait loci for mealiness. The interaction and expression of affected proteins could explain the variation observed in each individual and facilitate understanding of gene regulatory networks for fruit quality traits in peach.
Maximization of Markers Linked in Coupling for Tetraploid Potatoes via Monoparental Haploids
Bartkiewicz, Annette M.; Chilla, Friederike; Terefe-Ayana, Diro; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Linde, Marcus; Debener, Thomas
2018-01-01
Haploid potato populations derived from a single tetraploid donor constitute an efficient strategy to analyze markers segregating from a single donor genotype. Analysis of marker segregation in populations derived from crosses between polysomic tetraploids is complicated by a maximum of eight segregating alleles, multiple dosages of the markers and problems related to linkage analysis of marker segregation in repulsion. Here, we present data on two monoparental haploid populations generated by prickle pollination of two tetraploid cultivars with Solanum phureja and genotyped with the 12.8 k SolCAP single nucleotide polymorphism (SNP) array. We show that in a population of monoparental haploids, the number of biallelic SNP markers segregating in linkage to loci from the tetraploid donor genotype is much larger than in putative crosses of this genotype to a diverse selection of 125 tetraploid cultivars. Although this strategy is more laborious than conventional breeding, the generation of haploid progeny for efficient marker analysis is straightforward if morphological markers and flow cytometry are utilized to select true haploid progeny. The level of introgressed fragments from S. phureja, the haploid inducer, is very low, supporting its suitability for genetic analysis. Mapping with single-dose markers allowed the analysis of quantitative trait loci (QTL) for four phenotypic traits. PMID:29868076
Khrustaleva, A M; Volkov, A A; Stoklitskaia, D S; Miuge, N S; Zelenina, D A
2010-11-01
Sockeye salmon samples from the five largest lacustrine-riverine systems of the Kamchatka Peninsula were tested for polymorphism at six microsatellite (STR) and five single nucleotide polymorphism (SNP) loci. Statistically significant genetic differentiation was demonstrated among the local populations from the examined part of the species range. The data point to pronounced genetic divergence between the populations of two geographical regions, Eastern and Western Kamchatka. For sockeye salmon, the accuracy of the individual identification test was higher for microsatellites than for a similar number of SNP markers. Pooling the STR and SNP allele frequency data sets provided the highest accuracy of individual assignment of fish to populations.
USDA-ARS's Scientific Manuscript database
A high-throughput genotyping platform is needed to enable marker-assisted breeding in the allo-octoploid cultivated strawberry Fragaria ×ananassa. Short-read sequences from one diploid and 19 octoploid accessions were aligned to the diploid Fragaria vesca ‘Hawaii 4’ reference genome to identify sing...
Evaluation of soybean germplasm conserved in NIAS genebank and development of mini core collections
Kaga, Akito; Shimizu, Takehiko; Watanabe, Satoshi; Tsubokura, Yasutaka; Katayose, Yuichi; Harada, Kyuya; Vaughan, Duncan A.; Tomooka, Norihiko
2012-01-01
Genetic variation and population structure among 1603 soybean accessions, consisting of 832 Japanese landraces, 109 old and 57 recent Japanese varieties, 341 landraces from 16 Asian countries and 264 wild soybean accessions, were characterized using 191 SNP markers. Although the gene diversity of Japanese soybean germplasm was slightly lower than that of exotic soybean germplasm, population differentiation and clustering analyses indicated clear genetic differentiation among Japanese cultivated soybeans, exotic cultivated soybeans and wild soybeans. Nine hundred ninety-eight Japanese accessions were separated to a certain extent into groups corresponding to their agro-morphologic characteristics, such as photosensitivity and seed characteristics, rather than their geographical origin. Based on the assessment of the SNP markers and several agro-morphologic traits, accessions that retain the gene diversity of the whole collection were selected to develop several soybean sets of different sizes using a heuristic approach; a minimum of 12 accessions can represent the observed gene diversity; a mini-core collection of 96 accessions can represent a major proportion of both geographic origin and agro-morphologic trait variation. These selected sets of germplasm will provide an effective platform for enhancing soybean diversity studies and assist in finding novel traits for crop improvement. PMID:23136496
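Two ideas in the abstract above lend themselves to a short sketch: gene diversity at a locus, taken here as Nei's expected heterozygosity (1 minus the sum of squared allele frequencies), and a heuristic that picks a small set of accessions retaining the alleles of the whole collection. The abstract does not describe the heuristic the authors used, so the greedy allele-coverage rule below is an assumption, as are the function names and accession labels.

```python
from collections import Counter


def gene_diversity(alleles):
    """Expected heterozygosity at one locus: 1 - sum of squared allele frequencies."""
    n = len(alleles)
    if n == 0:
        raise ValueError("no alleles supplied")
    counts = Counter(alleles)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def greedy_core(accessions):
    """Greedy pick of accessions until every observed allele is covered.

    accessions maps an accession name to the set of alleles it carries
    (for example "locus:allele" strings). Ties go to the alphabetically
    first name so the result is deterministic.
    """
    remaining = set().union(*accessions.values())
    chosen = []
    while remaining:
        best = max(sorted(accessions), key=lambda name: len(accessions[name] & remaining))
        chosen.append(best)
        remaining -= accessions[best]
    return chosen


if __name__ == "__main__":
    print(gene_diversity(["A", "A", "G", "G"]))
    toy = {
        "acc1": {"L1:A", "L2:G"},
        "acc2": {"L1:A"},
        "acc3": {"L1:T", "L2:C"},
    }
    print(greedy_core(toy))
```

A greedy cover is only one of several possible selection rules; it is shown because it is short and deterministic, not because it is known to match the published sets of 12 or 96 accessions.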
Li, Feng; Kitashiba, Hiroyasu; Inaba, Kiyofumi; Nishio, Takeshi
2009-01-01
For identification of genes responsible for varietal differences in flowering time and leaf morphological traits, we constructed a linkage map of Brassica rapa DNA markers including 170 EST-based markers, 12 SSR markers, and 59 BAC sequence-based markers, of which 151 are single nucleotide polymorphism (SNP) markers. By BLASTN, 223 markers were shown to have homologous regions in Arabidopsis thaliana, and these homologous loci covered nearly the whole genome of A. thaliana. Synteny analysis between B. rapa and A. thaliana revealed 33 large syntenic regions. Three quantitative trait loci (QTLs) for flowering time were detected. BrFLC1 and BrFLC2 were linked to the QTLs for bolting time, budding time, and flowering time. Three SNPs in the promoter, which may be the cause of low expression of BrFLC2 in the early-flowering parental line, were identified. For leaf lobe depth and leaf hairiness, one major QTL corresponding to a syntenic region containing GIBBERELLIN 20 OXIDASE 3 and one major QTL containing BrGL1, respectively, were detected. Analysis of nucleotide sequences and expression of these genes suggested possible involvement of these genes in leaf morphological traits. PMID:19884167
Joseph, S; Schmidt, L M; Danquah, W B; Timper, P; Mekete, T
2017-02-01
To generate single spore lines of a population of the bacterial parasite of root-knot nematode (RKN), Pasteuria penetrans, isolated from Florida, and to examine the genotypic variation and virulence characteristics that exist within the population. Six single spore lines (SSP), 16SSP, 17SSP, 18SSP, 25SSP, 26SSP and 30SSP, were generated. Genetic variability was evaluated by comparing single-nucleotide polymorphisms (SNPs) in six protein-coding genes and the 16S rRNA gene. An average of one SNP was observed for every 69 bp in the 16S rRNA, whereas no SNPs were observed in the protein-coding sequences. Hierarchical cluster analysis of 16S rRNA sequences placed the clones into three distinct clades. Bio-efficacy analysis revealed significant heterogeneity in the level of virulence and host specificity between the individual clones. The SNP markers developed to the 5' hypervariable region of the 16S rRNA gene may be useful in biotype differentiation within a population of P. penetrans. This study demonstrates an efficient method for generating single spore lines of P. penetrans and gives a deep insight into the genetic heterogeneity and varying levels of virulence that exist within a population parasitizing a specific Meloidogyne sp. host. The results also suggest that the application of generalist spore lines in nematode management may achieve broad RKN control. © 2016 The Society for Applied Microbiology.
Ravelombola, Waltram; Shi, Ainong; Weng, Yuejin; Mou, Beiquan; Motes, Dennis; Clark, John; Chen, Pengyin; Srivastava, Vibha; Qin, Jun; Dong, Lingdi; Yang, Wei; Bhattarai, Gehendra; Sugihara, Yuichi
2018-01-01
This is the first report on association analysis of salt tolerance and identification of SNP markers associated with salt tolerance in cowpea. Cowpea (Vigna unguiculata (L.) Walp) is one of the most important cultivated legumes in Africa. The worldwide annual production of cowpea dry seed is 5.4 million metric tons. However, cowpea is unfavorably affected by salinity stress at germination and seedling stages, which is exacerbated by the effects of climate change. The lack of knowledge on the genetics underlying salt tolerance in cowpea limits the establishment of a breeding strategy for developing salt-tolerant cowpea cultivars. The objectives of this study were to conduct association mapping for salt tolerance at germination and seedling stages and to identify SNP markers associated with salt tolerance in cowpea. We analyzed the salt tolerance index of 116 and 155 cowpea accessions at germination and seedling stages, respectively. A total of 1049 SNPs postulated from genotyping-by-sequencing were used for association analysis. Population structure was inferred using Structure 2.3.4; the optimal K was determined using Structure Harvester. TASSEL 5, GAPIT, and FarmCPU, involving three models (single marker regression, general linear model, and mixed linear model), were used for the association study. Substantial variation in salt tolerance index for germination rate, plant height reduction, fresh and dry shoot biomass reduction, foliar leaf injury, and inhibition of the first trifoliate leaf was observed. The cowpea accessions were structured into two subpopulations. Three SNPs, Scaffold87490_622, Scaffold87490_630, and C35017374_128, were highly associated with salt tolerance at germination stage. Seven SNPs, Scaffold93827_270, Scaffold68489_600, Scaffold87490_633, Scaffold87490_640, Scaffold82042_3387, C35069468_1916, and Scaffold93942_1089, were found to be associated with salt tolerance at seedling stage. The SNP markers were consistent across the three models and could be used as a tool to select salt-tolerant lines for breeding improved cowpea tolerance to salinity.
Development of a genetic tool for product regulation in the diverse British pig breed market.
Wilkinson, Samantha; Archibald, Alan L; Haley, Chris S; Megens, Hendrik-Jan; Crooijmans, Richard P M A; Groenen, Martien A M; Wiener, Pamela; Ogden, Rob
2012-11-15
The application of DNA markers for the identification of biological samples from both human and non-human species is widespread and includes use in food authentication. In the food industry the financial incentive to substitute the true name of a food product with a higher value alternative is driving food fraud. This applies to British pork products where products derived from traditional pig breeds are of premium value. The objective of this study was to develop a genetic assay for regulatory authentication of traditional pig breed-labelled products in the porcine food industry in the United Kingdom. The dataset comprised comprehensive coverage of breed types present in Britain: 460 individuals from 7 traditional breeds, 5 commercial purebreds, 1 imported European breed and 1 imported Asian breed were genotyped using the PorcineSNP60 beadchip. Following breed-informative SNP selection, assignment power was calculated for increasing SNP panel size. A 96-plex assay created using the most informative SNPs revealed remarkably high genetic differentiation between the British pig breeds, with an average FST of 0.54, and Bayesian clustering analysis also indicated that they were distinct homogeneous populations. The posterior probability of assignment of any individual of a presumed origin actually originating from that breed given an alternative breed origin was > 99.5% in 174 out of 182 contrasts, at a test value of log(LR) > 0. Validation of the 96-plex assay using independent test samples of known origin was successful; a subsequent survey of market samples revealed a high level of breed label conformity. The newly created 96-plex assay using selected markers from the PorcineSNP60 beadchip enables powerful assignment of samples to traditional breed origin and can effectively identify mislabelling, providing a highly effective tool for DNA analysis in food forensics. PMID:23150935
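The pig-breed abstract evaluates assignment at a test value of log(LR) > 0. A minimal sketch of one way to form such a log-likelihood ratio for a sample follows, assuming independent biallelic SNPs, Hardy-Weinberg genotype probabilities within each breed, and allele frequencies clamped away from 0 and 1. These modelling choices, the clamp value and the function names are assumptions for illustration; the study's own assignment software is not named in the abstract and is not reproduced.

```python
import math


def genotype_log_likelihood(genotypes, freqs, eps=0.01):
    """log10 likelihood of a multi-SNP genotype given one breed's allele frequencies.

    genotypes: copies (0, 1 or 2) of the counted allele at each SNP.
    freqs: frequency of that allele in the breed at each SNP.
    eps: clamp so that an allele unseen in the reference sample does not give log(0).
    """
    total = 0.0
    for g, p in zip(genotypes, freqs):
        p = min(max(p, eps), 1.0 - eps)
        if g == 0:
            likelihood = (1.0 - p) ** 2
        elif g == 1:
            likelihood = 2.0 * p * (1.0 - p)
        elif g == 2:
            likelihood = p ** 2
        else:
            raise ValueError("genotype must be 0, 1 or 2")
        total += math.log10(likelihood)
    return total


def log_lr(genotypes, freqs_claimed, freqs_alternative):
    """log10 likelihood ratio of the claimed breed against an alternative breed."""
    return (genotype_log_likelihood(genotypes, freqs_claimed)
            - genotype_log_likelihood(genotypes, freqs_alternative))


if __name__ == "__main__":
    # Hypothetical three-SNP example.
    sample = [2, 2, 0]
    claimed = [0.9, 0.8, 0.1]
    alternative = [0.1, 0.2, 0.9]
    print(log_lr(sample, claimed, alternative) > 0)
```

A positive value supports the claimed breed over the alternative; with 96 SNPs the sum simply runs over more loci.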
Zanke, Christine D; Rodemann, Bernd; Ling, Jie; Muqaddasi, Quddoos H; Plieske, Jörg; Polley, Andreas; Kollers, Sonja; Ebmeyer, Erhard; Korzun, Viktor; Argillier, Odile; Stiewe, Gunther; Zschäckel, Thomas; Ganal, Martin W; Röder, Marion S
2017-03-01
Genotypes with recombination events in the Triticum ventricosum introgression on chromosome 7D allowed fine-mapping of the resistance gene Pch1, the main source of eyespot resistance in European winter wheat cultivars. Eyespot (also called Strawbreaker) is a common and serious fungal disease of winter wheat caused by the necrotrophic fungi Oculimacula yallundae and Oculimacula acuformis (former name Pseudocercosporella herpotrichoides). A genome-wide association study (GWAS) for eyespot was performed with 732 microsatellite markers (SSR) and 7761 mapped SNP markers derived from the 90 K iSELECT wheat array using a panel of 168 European winter wheat varieties as well as three spring wheat varieties and phenotypic evaluation of eyespot in field tests in three environments. Best linear unbiased estimations (BLUEs) were calculated across all trials and ranged from 1.20 (most resistant) to 5.73 (most susceptible) with an average value of 4.24 and a heritability of H² = 0.91. A total of 108 SSR and 235 SNP marker-trait associations (MTAs) were identified by considering associations with a -log10(P value) ≥ 3.0. Significant MTAs for eyespot-score BLUEs were found on chromosomes 1D, 2A, 2D, 3D, 5A, 5D, 6A, 7A and 7D for the SSR markers and chromosomes 1B, 2A, 2B, 2D, 3B and 7D for the SNP markers. For 18 varieties (10.5%), a highly resistant phenotype was detected that was linked to the presence of the resistance gene Pch1 on chromosome 7D. The identification of genotypes with recombination events in the introgressed genomic segment from Triticum ventricosum harboring the Pch1 resistance gene on chromosome 7DL allowed the fine-mapping of this gene using additional SNP markers, and a potential candidate gene, Traes_7DL_973A33763, coding for a CC-NBS-LRR class protein was identified.
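The eyespot study reports marker-trait associations at a -log10(P value) of at least 3.0. As a small illustration of that thresholding step only, the sketch below filters a table of per-marker P values; the marker names and P values are hypothetical and the function name `significant_mtas` is not from the study.

```python
import math


def significant_mtas(p_values, threshold=3.0):
    """Keep markers whose -log10(P) is at least the threshold.

    p_values maps a marker name to its association P value.
    Returns a dict of marker name -> -log10(P) for the retained markers.
    """
    kept = {}
    for marker, p in p_values.items():
        if not 0.0 < p <= 1.0:
            raise ValueError("P value out of range for marker " + marker)
        score = -math.log10(p)
        if score >= threshold:
            kept[marker] = score
    return kept


if __name__ == "__main__":
    # Hypothetical markers and P values.
    table = {"marker_a": 0.0005, "marker_b": 0.002, "marker_c": 1e-8}
    print(sorted(significant_mtas(table)))
```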
2011-01-01
Background Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequence Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map made it possible to align the 12 linkage groups of both species. Conclusions Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome.
This SNP-array will be extended thanks to recent sequencing effort using new generation sequencing technologies and will include SNPs from comparative orthologous sequences that were identified in the present study, providing a wider collection of anchor points for comparative genomics among the conifers. PMID:21767361
USDA-ARS's Scientific Manuscript database
Dominant and co-dominant molecular markers are routinely used in plant genetic diversity research. In the present study we assessed the success-rate of three marker-systems for estimating genotypic diversity, clustering varieties into populations, and assigning a single variety into the expected pop...
USDA-ARS's Scientific Manuscript database
Genomic selection (GS) simultaneously incorporates dense SNP marker genotypes with phenotypic data from related animals to predict animal-specific genomic breeding value (GEBV), which circumvents the need to measure the disease phenotype in potential breeders. Marker assisted selection (MAS) involv...
Liu, Jia; Wang, Jun; Wang, Hui; Wang, Wenxiang; Zhou, Rijin; Mei, Desheng; Cheng, Hongtao; Yang, Juan; Raman, Harsh; Hu, Qiong
2016-01-01
The majority of rapeseed cultivars shatter seeds upon maturity especially under hot-dry and windy conditions, reducing yield and gross margin return to growers. Here, we identified quantitative trait loci (QTL) for resistance to pod shatter in an unstructured diverse panel of 143 rapeseed accessions, and two structured populations derived from bi-parental doubled haploid (DH) and inter-mated (IF2) crosses derived from R1 (resistant to pod shattering) and R2 (prone to pod shattering) accessions. Genome-wide association analysis identified six significant QTL for resistance to pod shatter located on chromosomes A01, A06, A07, A09, C02, and C05. Two of the QTL, qSRI.A09 delimited with the SNP marker Bn-A09-p30171993 (A09) and qSRI.A06 delimited with the SNP marker Bn-A06-p115948 (A06) could be repeatedly detected across environments in a diversity panel, DH and IF2 populations, suggesting that at least two loci on chromosomes A06 and A09 were the main contributors to pod shatter resistance in Chinese germplasm. Significant SNP markers identified in this study especially those that appeared repeatedly across environments provide a cost-effective and an efficient method for introgression and pyramiding of favorable alleles for pod shatter resistance via marker-assisted selection in rapeseed improvement programs. PMID:27493651
USDA-ARS's Scientific Manuscript database
The genome-wide association study (GWAS) is a useful tool for detecting and characterizing traits of interest including those associated with disease resistance in soybean. The availability of 50,000 single nucleotide polymorphism (SNP) markers (SoySNP50K iSelect BeadChip; www.soybase.org) on 19,652...
USDA-ARS's Scientific Manuscript database
We will present an ultra-dense genetic linkage map for the octoploid, cultivated strawberry (Fragaria x ananassa) consisting of over 13K Axiom® based SNP markers and 150 previously mapped reference SSR loci. The high quality of the map is demonstrated by the short sizes of each of the 28 linkage gro...
Brøndum, R F; Su, G; Janss, L; Sahana, G; Guldbrandtsen, B; Boichard, D; Lund, M S
2015-06-01
This study investigated the effect on the reliability of genomic prediction when a small number of significant variants from single marker analysis based on whole genome sequence data were added to the regular 54k single nucleotide polymorphism (SNP) array data. The extra markers were selected with the aim of augmenting the custom low-density Illumina BovineLD SNP chip (San Diego, CA) used in the Nordic countries. The single-marker analysis was done breed-wise on all 16 index traits included in the breeding goals for Nordic Holstein, Danish Jersey, and Nordic Red cattle plus the total merit index itself. Depending on the trait's economic weight, 15, 10, or 5 quantitative trait loci (QTL) were selected per trait per breed and 3 to 5 markers were selected to tag each QTL. After removing duplicate markers (same marker selected for more than one trait or breed) and filtering for high pairwise linkage disequilibrium and assaying performance on the array, a total of 1,623 QTL markers were selected for inclusion on the custom chip. Genomic prediction analyses were performed for Nordic and French Holstein and Nordic Red animals using either a genomic BLUP or a Bayesian variable selection model. When using the genomic BLUP model including the QTL markers in the analysis, reliability was increased by up to 4 percentage points for production traits in Nordic Holstein animals, up to 3 percentage points for Nordic Reds, and up to 5 percentage points for French Holstein. Smaller gains of up to 1 percentage point were observed for mastitis, but only a 0.5 percentage point increase was seen for fertility. When using a Bayesian model, accuracies were generally higher with only 54k data compared with the genomic BLUP approach, but increases in reliability were relatively smaller when QTL markers were included.
Results from this study indicate that the reliability of genomic prediction can be increased by including markers significant in genome-wide association studies on whole genome sequence data alongside the 54k SNP set. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Luo, Zhijing; Chen, Mingjiao; Zhao, Xiangxiang; Zhang, Dabing; Qi, Yiping; Yuan, Zheng
2016-01-01
Rapid and accurate genome-wide marker detection is essential to the marker-assisted breeding and functional genomics studies. In this work, we developed an integrated software, AgroMarker Finder (AMF: http://erp.novelbio.com/AMF), for providing graphical user interface (GUI) to facilitate the recently developed restriction-site associated DNA (RAD) sequencing data analysis in rice. By application of AMF, a total of 90,743 high-quality markers (82,878 SNPs and 7,865 InDels) were detected between rice varieties JP69 and Jiaoyuan5A. The density of the identified markers is 0.2 per Kb for SNP markers, and 0.02 per Kb for InDel markers. Sequencing validation revealed that the accuracy of genome-wide marker detection by AMF is 93%. In addition, a validated subset of 82 SNPs and 31 InDels were found to be closely linked to 117 important agronomic trait genes, providing a basis for subsequent marker-assisted selection (MAS) and variety identification. Furthermore, we selected 12 markers from 31 validated InDel markers to identify seed authenticity of variety Jiaoyuanyou69, and we also identified 10 markers closely linked to the fragrant gene BADH2 to minimize linkage drag for Wuxiang075 (BADH2 donor)/Jiachang1 recombinants selection. Therefore, this software provides an efficient approach for marker identification from RAD-seq data, and it would be a valuable tool for plant MAS and variety protection. PMID:26799713
Fan, Wei; Zong, Jie; Luo, Zhijing; Chen, Mingjiao; Zhao, Xiangxiang; Zhang, Dabing; Qi, Yiping; Yuan, Zheng
2016-01-01
Rapid and accurate genome-wide marker detection is essential to the marker-assisted breeding and functional genomics studies. In this work, we developed an integrated software, AgroMarker Finder (AMF: http://erp.novelbio.com/AMF), for providing graphical user interface (GUI) to facilitate the recently developed restriction-site associated DNA (RAD) sequencing data analysis in rice. By application of AMF, a total of 90,743 high-quality markers (82,878 SNPs and 7,865 InDels) were detected between rice varieties JP69 and Jiaoyuan5A. The density of the identified markers is 0.2 per Kb for SNP markers, and 0.02 per Kb for InDel markers. Sequencing validation revealed that the accuracy of genome-wide marker detection by AMF is 93%. In addition, a validated subset of 82 SNPs and 31 InDels were found to be closely linked to 117 important agronomic trait genes, providing a basis for subsequent marker-assisted selection (MAS) and variety identification. Furthermore, we selected 12 markers from 31 validated InDel markers to identify seed authenticity of variety Jiaoyuanyou69, and we also identified 10 markers closely linked to the fragrant gene BADH2 to minimize linkage drag for Wuxiang075 (BADH2 donor)/Jiachang1 recombinants selection. Therefore, this software provides an efficient approach for marker identification from RAD-seq data, and it would be a valuable tool for plant MAS and variety protection.
Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.
2015-01-01
ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133
Feltus, F. Alex; Wan, Jun; Schulze, Stefan R.; Estill, James C.; Jiang, Ning; Paterson, Andrew H.
2004-01-01
Dense coverage of the rice genome with polymorphic DNA markers is an invaluable tool for DNA marker-assisted breeding, positional cloning, and a wide range of evolutionary studies. We have aligned drafts of two rice subspecies, indica and japonica, and analyzed levels and patterns of genetic diversity. After filtering multiple copy and low quality sequence, 408,898 candidate DNA polymorphisms (SNPs/INDELs) were discerned between the two subspecies. These filters have the consequence that our data set includes only a subset of the available SNPs (in particular excluding large numbers of SNPs that may occur between repetitive DNA alleles) but increase the likelihood that this subset is useful: Direct sequencing suggests that 79.8% ± 7.5% of the in silico SNPs are real. The SNP sample in our database is not randomly distributed across the genome. In fact, 566 rice genomic regions had unusually high (328 contigs/48.6 Mb/13.6% of genome) or low (237 contigs/64.7 Mb/18.1% of genome) polymorphism rates. Many SNP-poor regions were substantially longer than most SNP-rich regions, covering up to 4 Mb, and possibly reflecting introgression between the respective gene pools that may have occurred hundreds of years ago. Although 46.2% ± 8.3% of the SNPs differentiate other pairs of japonica and indica genotypes, SNP rates in rice were not predictive of evolutionary rates for corresponding genes in another grass species, sorghum. The data set is freely available at http://www.plantgenome.uga.edu/snp. PMID:15342564
Garzón-Martínez, Gina A.; Osorio-Guarín, Jaime A.; Delgadillo-Durán, Paola; Mayorga, Franklin; Enciso-Rodríguez, Felix E.; Landsman, David
2015-01-01
The genus Physalis is common in the Americas and includes several economically important species, among them Physalis peruviana that produces appetizing edible fruits. We studied the genetic diversity and population structure of P. peruviana and characterized 47 accessions of this species along with 13 accessions of related taxa consisting of 222 individuals from the Colombian Corporation of Agricultural Research (CORPOICA) germplasm collection, using Conserved Orthologous Sequences (COSII) and Immunity Related Genes (IRGs). In addition, 642 Single Nucleotide Polymorphism (SNPs) markers were identified and used for the genetic diversity analysis. A total of 121 alleles were detected in 24 InDels loci ranging from 2 to 9 alleles per locus, with an average of 5.04 alleles per locus. The average number of alleles in the SNP markers was two. The observed heterozygosity for P. peruviana with InDel and SNP markers was higher (0.48 and 0.59) than the expected heterozygosity (0.30 and 0.41). Interestingly, the observed heterozygosity in related taxa (0.4 and 0.12) was lower than the expected heterozygosity (0.59 and 0.25). The coefficient of population differentiation FST was 0.143 (InDels) and 0.038 (SNPs), showing a relatively low level of genetic differentiation among P. peruviana and related taxa. Higher levels of genetic variation were instead observed within populations based on the AMOVA analysis. Population structure analysis supported the presence of two main groups and PCA analysis based on SNP markers revealed two distinct clusters in the P. peruviana accessions corresponding to their state of cultivation. In this study, we identified molecular markers useful to detect genetic variation in Physalis germplasm for assisting conservation and crossbreeding strategies. PMID:26550601
Garzón-Martínez, Gina A; Osorio-Guarín, Jaime A; Delgadillo-Durán, Paola; Mayorga, Franklin; Enciso-Rodríguez, Felix E; Landsman, David; Mariño-Ramírez, Leonardo; Barrero, Luz Stella
2015-12-01
The genus Physalis is common in the Americas and includes several economically important species, among them Physalis peruviana that produces appetizing edible fruits. We studied the genetic diversity and population structure of P. peruviana and characterized 47 accessions of this species along with 13 accessions of related taxa consisting of 222 individuals from the Colombian Corporation of Agricultural Research (CORPOICA) germplasm collection, using Conserved Orthologous Sequences (COSII) and Immunity Related Genes (IRGs). In addition, 642 Single Nucleotide Polymorphism (SNPs) markers were identified and used for the genetic diversity analysis. A total of 121 alleles were detected in 24 InDels loci ranging from 2 to 9 alleles per locus, with an average of 5.04 alleles per locus. The average number of alleles in the SNP markers was two. The observed heterozygosity for P. peruviana with InDel and SNP markers was higher (0.48 and 0.59) than the expected heterozygosity (0.30 and 0.41). Interestingly, the observed heterozygosity in related taxa (0.4 and 0.12) was lower than the expected heterozygosity (0.59 and 0.25). The coefficient of population differentiation F ST was 0.143 (InDels) and 0.038 (SNPs), showing a relatively low level of genetic differentiation among P. peruviana and related taxa. Higher levels of genetic variation were instead observed within populations based on the AMOVA analysis. Population structure analysis supported the presence of two main groups and PCA analysis based on SNP markers revealed two distinct clusters in the P. peruviana accessions corresponding to their state of cultivation. In this study, we identified molecular markers useful to detect genetic variation in Physalis germplasm for assisting conservation and crossbreeding strategies.
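The comparison of observed and expected heterozygosity reported in this abstract can be illustrated for a single biallelic SNP. The sketch below is a minimal example that assumes genotypes coded as 0, 1, or 2 copies of one allele and uses the simple Hardy-Weinberg expectation 2p(1 - p). The abstract does not state which estimator the authors used, and the genotype vector is hypothetical.

```python
def heterozygosity(genotypes):
    """Return (observed, expected) heterozygosity for one biallelic locus.

    genotypes: list of allele counts (0, 1 or 2) for each individual.
    Expected heterozygosity uses the Hardy-Weinberg value 2p(1 - p),
    without any small-sample correction.
    """
    n = len(genotypes)
    observed = sum(1 for g in genotypes if g == 1) / n
    p = sum(genotypes) / (2 * n)  # frequency of the counted allele
    expected = 2 * p * (1 - p)
    return observed, expected


# Hypothetical genotypes for eight individuals (not data from the study).
genos = [0, 1, 1, 1, 2, 1, 1, 0]
ho, he = heterozygosity(genos)
print(f"Ho = {ho:.3f}, He = {he:.3f}")
```

In this made-up sample the observed value exceeds the expected one, the same direction the abstract reports for P. peruviana.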
The GCP molecular marker toolkit, an instrument for use in breeding food security crops.
Van Damme, Veerle; Gómez-Paniagua, Humberto; de Vicente, M Carmen
2011-12-01
Crop genetic resources carry variation useful for overcoming the challenges of modern agriculture. Molecular markers can facilitate the selection of agronomically important traits. The pervasiveness of genomics research has led to an overwhelming number of publications and databases, which are, nevertheless, scattered and hence often difficult for plant breeders to access, particularly those in developing countries. This situation separates them from developed countries, which have better endowed programs for developing varieties. To close this growing knowledge gap, we conducted an intensive literature review and consulted with more than 150 crop experts on the use of molecular markers in the breeding program of 19 food security crops. The result was a list of effectively used and highly reproducible sequence tagged site (STS), simple sequence repeat (SSR), single nucleotide polymorphism (SNP), and sequence characterized amplified region (SCAR) markers. However, only 12 food crops had molecular markers suitable for improvement. That is, marker-assisted selection is not yet used for Musa spp., coconut, lentils, millets, pigeonpea, sweet potato, and yam. For the other 12 crops, 214 molecular markers were found to be effectively used in association with 74 different traits. Results were compiled as the GCP Molecular Marker Toolkit, a free online tool that aims to promote the adoption of molecular approaches in breeding activities.
Rincon, Gonzalo; Islas-Trejo, Alma; Castillo, Alejandro R; Bauman, Dale E; German, Bruce J; Medrano, Juan F
2012-02-01
Genes in the sterol regulatory element-binding protein-1 (SREBP1) pathway play a central role in regulation of milk fat synthesis, especially the de-novo synthesis of saturated fatty acids. SCD, a SREBP-responsive gene, is the key enzyme in the synthesis of monounsaturated fatty acids in the mammary gland. In the present study, we discovered SNP in candidate genes associated with this signalling pathway and SCD to identify genetic markers that can be used for genetic and metabolically directed selection in cattle. We resequenced six candidate genes in the SREBP1 pathway (SREBP1, SCAP, INSIG1, INSIG2, MBTPS1, MBTPS2) and two genes for SCD (SCD1 and SCD5) and discovered 47 Tag SNP that were used in a marker-trait association study. Milk and blood samples were collected from Holstein cows in their 1st or 2nd parity at 100-150 days of lactation. Individual fatty acids from C4 to C20, saturated fatty acid (SFA) content, monounsaturated fatty acid content, polyunsaturated fatty acid content and desaturase indexes were measured and used to perform the association analysis. Polymorphisms in the SCD5 and INSIG2 genes were the most representative markers associated with SFA/unsaturated fatty acid (UFA) ratio in milk. The analysis of desaturation activity determined that markers in the SCD1 and SCD5 genes showed the most significant effects. DGAT1 K232A marker was included in the study to examine the effect of this marker on the variation of milk fatty acids in our Holstein population. The percentage of variance explained by DGAT1 in the analysis was only 6% of SFA/UFA ratio. Milk fat depression was observed in one of the dairy herds and in this particular dairy one SNP in the SREBP1 gene (rs41912290) accounted for 40% of the phenotypic variance.
Our results provide detailed SNP information for key genes in the SREBP1 signalling pathway and SCD that can be used to change milk fat composition by marker-assisted breeding to meet consumer demands regarding human health, as well as furthering understanding of technological aspects of cows' milk.
2014-01-01
Background Although the X chromosome is the second largest bovine chromosome, markers on the X chromosome are not used for genomic prediction in some countries and populations. In this study, we presented a method for computing genomic relationships using X chromosome markers, investigated the accuracy of imputation from a low density (7K) to the 54K SNP (single nucleotide polymorphism) panel, and compared the accuracy of genomic prediction with and without using X chromosome markers. Methods The impact of considering X chromosome markers on prediction accuracy was assessed using data from Nordic Holstein bulls and different sets of SNPs: (a) the 54K SNPs for reference and test animals, (b) SNPs imputed from the 7K to the 54K SNP panel for test animals, (c) SNPs imputed from the 7K to the 54K panel for half of the reference animals, and (d) the 7K SNP panel for all animals. Beagle and Findhap were used for imputation. GBLUP (genomic best linear unbiased prediction) models with or without X chromosome markers and with or without a residual polygenic effect were used to predict genomic breeding values for 15 traits. Results Averaged over the two imputation datasets, correlation coefficients between imputed and true genotypes for autosomal markers, pseudo-autosomal markers, and X-specific markers were 0.971, 0.831 and 0.935 when using Findhap, and 0.983, 0.856 and 0.937 when using Beagle. Estimated reliabilities of genomic predictions based on the imputed datasets using Findhap or Beagle were very close to those using the real 54K data. Genomic prediction using all markers gave slightly higher reliabilities than predictions without X chromosome markers. Based on our data which included only bulls, using a G matrix that accounted for sex-linked relationships did not improve prediction, compared with a G matrix that did not account for sex-linked relationships. 
A model that included a polygenic effect did not recover the loss of prediction accuracy from exclusion of X chromosome markers. Conclusions The results from this study suggest that markers on the X chromosome contribute to accuracy of genomic predictions and should be used for routine genomic evaluation. PMID:25080199
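GBLUP models such as those in this abstract rest on a genomic relationship (G) matrix. The sketch below shows one widely used autosomal formulation, in which genotypes are centred by twice the allele frequency and scaled by 2Σp(1 - p). This is an assumption chosen for illustration: the abstract does not describe how its X-chromosome G matrix is built, so the code should not be read as the authors' method. The genotype matrix is hypothetical.

```python
def genomic_relationship(genotypes):
    """Build a G matrix from allele counts (0, 1 or 2).

    genotypes: list of rows, one row of marker allele counts per animal.
    Allele frequencies are estimated from the supplied animals.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    # Allele frequency of each marker across all animals.
    freqs = [sum(row[j] for row in genotypes) / (2 * n) for j in range(m)]
    # Centre every genotype by twice the marker's allele frequency.
    z = [[row[j] - 2 * freqs[j] for j in range(m)] for row in genotypes]
    # Scaling factor: 2 * sum of p(1 - p) over markers.
    scale = 2 * sum(p * (1 - p) for p in freqs)
    return [
        [sum(z[a][j] * z[b][j] for j in range(m)) / scale for b in range(n)]
        for a in range(n)
    ]


# Hypothetical genotypes: three animals, four autosomal markers.
genos = [
    [0, 1, 2, 1],
    [1, 1, 0, 2],
    [2, 0, 1, 1],
]
G = genomic_relationship(genos)
for row in G:
    print(" ".join(f"{value:7.3f}" for value in row))
```

Because the frequencies are estimated from the same animals, each row of G sums to zero here. Production evaluations usually take base-population frequencies instead.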
Marker traits association of agronomical traits correlated with stagnant flooding tolerance in rice
NASA Astrophysics Data System (ADS)
Sitaresmi, T.; Utami, D. W.; Suwarno, W. B.; Ardie, S. W.; Susanto, U.; Aswidinnoor, H.
2017-05-01
In deep-water areas, the water depth increases gradually throughout the year and stays at more than 50 cm for long periods. In these situations, elongation ability is necessary to allow the plants to keep up with rising floodwater. The elongation of internodes during submergence is regulated by environmental and hormonal factors. The objective of this study was to identify, among 384 SNPs, markers linked with agronomic and morphological traits related to stagnant flooding tolerance. The research was conducted at the Indonesian Center for Rice Research and the Indonesian Centre for Agricultural Biotechnology and Genetic Resources Research and Development. The phenotypic data were collected from an F2 population derived from a bi-parental cross of IR 42 and IRRI 119. IR 42 was the sensitive parent, and IRRI 119 was the tolerant parent. DNA was extracted from rice using a modified version of the Murray and Thompson method with cetyl trimethylammonium bromide (CTAB). Genotyping was carried out using a 384-SNP Illumina GoldenGate assay. Association analysis between SNP markers and phenotypic data was performed using the General Linear Model (GLM) in TASSEL version 5.0. Based on the GLM analysis, the markers significantly associated with plant height (P value < 0.05) were TBGI275345, TBGI275367, and TBGI424383. The markers significantly associated with number of tillers were TBGI000722, TBGI258600, TBGI270843, TBGI271066, TBGI271076, TBGI272122, TBGI272241, and TBGI327790. Two of them, TBGI424383 and TBGI271066, are expected to be associated with a family of protein kinases that play a role in plant stress signalling.
Mapping of the Gynoecy in Bitter Gourd (Momordica charantia) Using RAD-Seq Analysis
Matsumura, Hideo; Miyagi, Norimichi; Taniai, Naoki; Fukushima, Mai; Tarora, Kazuhiko; Shudo, Ayano; Urasaki, Naoya
2014-01-01
Momordica charantia is a monoecious plant of the Cucurbitaceae family that has both male and female unisexual flowers. Its unique gynoecious line, OHB61-5, is essential as a maternal parent in the production of F1 cultivars. To identify the DNA markers for this gynoecy, a RAD-seq (restriction-associated DNA tag sequencing) analysis was employed to reveal genome-wide DNA polymorphisms and to genotype the F2 progeny from a cross between OHB61-5 and a monoecious line. Based on a RAD-seq analysis of F2 individuals, a linkage map was constructed using 552 co-dominant markers. In addition, after analyzing the pooled genomic DNA from monoecious or gynoecious F2 plants, several SNP loci that are genetically linked to gynoecy were identified. GTFL-1, the closest SNP locus to the putative gynoecious locus, was converted to a conventional DNA marker using invader assay technology, which is applicable to the marker-assisted selection of gynoecy in M. charantia breeding. PMID:24498029
Koning-Boucoiran, Carole F S; Esselink, G Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P C; Gitonga, Virginia W; Krens, Frans A; Voorrips, Roeland E; van de Weg, W Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J M
2015-01-01
In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this, RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total, 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these, 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.
USDA-ARS's Scientific Manuscript database
Leaf rust (Puccinia triticina Eriks. & Henn.) is a major disease affecting durum wheat production. The Lr14a leaf rust resistant gene present in the durum wheat cv. Creso and its derivative Colosseo is one of the best characterized leaf rust resistance sources presently deployed in durum wheat breed...
Bangera, Rama; Correa, Katharina; Lhorente, Jean P; Figueroa, René; Yáñez, José M
2017-01-31
Salmon Rickettsial Syndrome (SRS) caused by Piscirickettsia salmonis is a major disease affecting the Chilean salmon industry. Genomic selection (GS) is a method wherein genome-wide markers and phenotype information of full-sibs are used to predict genomic EBV (GEBV) of selection candidates and is expected to have increased accuracy and response to selection over traditional pedigree based Best Linear Unbiased Prediction (PBLUP). Widely used GS methods such as genomic BLUP (GBLUP), SNPBLUP, Bayes C and Bayesian Lasso may perform differently with respect to accuracy of GEBV prediction. Our aim was to compare the accuracy, in terms of reliability of genome-enabled prediction, from different GS methods with PBLUP for resistance to SRS in an Atlantic salmon breeding program. Number of days to death (DAYS), binary survival status (STATUS) phenotypes, and 50 K SNP array genotypes were obtained from 2601 smolts challenged with P. salmonis. The reliability of different GS methods at different SNP densities with and without pedigree were compared to PBLUP using a five-fold cross validation scheme. Heritability estimated from GS methods was significantly higher than PBLUP. Pearson's correlation between predicted GEBV from PBLUP and GS models ranged from 0.79 to 0.91 and 0.79-0.95 for DAYS and STATUS, respectively. The relative increase in reliability from different GS methods for DAYS and STATUS with 50 K SNP ranged from 8 to 25% and 27-30%, respectively. All GS methods outperformed PBLUP at all marker densities. DAYS and STATUS showed superior reliability over PBLUP even at the lowest marker density of 3 K and 500 SNP, respectively. 20 K SNP showed close to maximal reliability for both traits with little improvement using higher densities. These results indicate that genomic predictions can accelerate genetic progress for SRS resistance in Atlantic salmon and implementation of this approach will contribute to the control of SRS in Chile. 
We recommend GBLUP for routine GS evaluation because this method is computationally faster and the results are very similar with other GS methods. The use of lower density SNP or the combination of low density SNP and an imputation strategy may help to reduce genotyping costs without compromising gain in reliability.
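The five-fold cross-validation and the "relative increase in reliability" reported above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names, the random seed, and the example reliabilities (0.50 and 0.40) are hypothetical and are not taken from the study, and the abstract does not say how animals were assigned to folds.

```python
import random


def five_fold_indices(n_animals, k=5, seed=0):
    """Randomly split animal indices 0..n_animals-1 into k validation folds.

    Assumption: simple random assignment; the study may have used a
    different (e.g. family-aware) scheme.
    """
    idx = list(range(n_animals))
    random.Random(seed).shuffle(idx)
    # Take every k-th shuffled index so fold sizes differ by at most one.
    return [idx[i::k] for i in range(k)]


def relative_gain_percent(rel_gs, rel_pblup):
    """Relative increase (%) in reliability of a GS method over PBLUP."""
    return 100.0 * (rel_gs - rel_pblup) / rel_pblup


folds = five_fold_indices(2601)          # 2601 challenged smolts
fold_sizes = [len(f) for f in folds]     # one fold of 521, four of 520
gain = relative_gain_percent(0.50, 0.40) # hypothetical reliabilities
print(fold_sizes, round(gain, 1))
```

In each round, one fold would be masked as the validation set and the remaining four used to train the prediction model; the reliabilities compared above would then be averaged over the five rounds.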
Dosage Transmission Disequilibrium Test (dTDT) for Linkage and Association Detection
Zhang, Zhehao; Wang, Jen-Chyong; Howells, William; Lin, Peng; Agrawal, Arpana; Edenberg, Howard J.; Tischfield, Jay A.; Schuckit, Marc A.; Bierut, Laura J.; Goate, Alison; Rice, John P.
2013-01-01
Both linkage and association studies have been successfully applied to identify disease susceptibility genes with genetic markers such as microsatellites and Single Nucleotide Polymorphisms (SNPs). As one of the traditional family-based studies, the Transmission/Disequilibrium Test (TDT) measures the over-transmission of an allele in a trio from its heterozygous parents to the affected offspring and can be potentially useful to identify genetic determinants for complex disorders. However, there is reduced information when complete trio information is unavailable. In this study, we developed a novel approach to “infer” the transmission of SNPs by combining both the linkage and association data, which uses microsatellite markers from families informative for linkage together with SNP markers from the offspring who are genotyped for both linkage and a Genome-Wide Association Study (GWAS). We generalized the traditional TDT to process these inferred dosage probabilities, which we name the dosage-TDT (dTDT). For evaluation purposes, we developed a simulation procedure to assess its operating characteristics. We applied the dTDT to the simulated data and documented the power of the dTDT under a number of different realistic scenarios. Finally, we applied our methods to a family study of alcohol dependence (COGA) and performed individual genotyping on complete families for the top signals. One SNP (rs4903712 on chromosome 14) remained significant after correcting for multiple testing. Methods developed in this study can be adapted to other platforms and will have widespread applicability in genomic research when case-control GWAS data are collected in families with existing linkage data. PMID:23691058
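For orientation, the traditional TDT that this abstract generalizes can be written as a McNemar-type statistic on transmissions from heterozygous parents. The sketch below shows only that classic test; the function names and the example counts are hypothetical, and it does not reproduce the dTDT itself, whose exact form is not given in the abstract.

```python
import math


def tdt_chi_square(transmitted, untransmitted):
    """Classic TDT statistic, (b - c)^2 / (b + c).

    b = number of times heterozygous parents transmitted the allele of
    interest to an affected child, c = number of times they did not.
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative (heterozygous-parent) transmissions")
    return (transmitted - untransmitted) ** 2 / total


def chi2_1df_p_value(stat):
    """Upper-tail p-value of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(stat / 2.0))


# Hypothetical counts: 60 transmissions versus 40 non-transmissions.
stat = tdt_chi_square(60, 40)
p_value = chi2_1df_p_value(stat)
print(stat, round(p_value, 4))
```

The function accepts non-integer inputs, so summed transmission probabilities could in principle be passed in place of counts; whether that matches the published dTDT is an assumption, not something the abstract confirms.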
Nucleotide diversity maps reveal variation in diversity among wheat genomes and chromosomes
2010-01-01
Background A genome-wide assessment of nucleotide diversity in a polyploid species must minimize the inclusion of homoeologous sequences into diversity estimates and reliably allocate individual haplotypes into their respective genomes. The same requirements complicate the development and deployment of single nucleotide polymorphism (SNP) markers in polyploid species. We report here a strategy that satisfies these requirements and deploy it in the sequencing of genes in cultivated hexaploid wheat (Triticum aestivum, genomes AABBDD) and wild tetraploid wheat (Triticum turgidum ssp. dicoccoides, genomes AABB) from the putative site of wheat domestication in Turkey. Data are used to assess the distribution of diversity among and within wheat genomes and to develop a panel of SNP markers for polyploid wheat. Results Nucleotide diversity was estimated in 2114 wheat genes and was similar between the A and B genomes and reduced in the D genome. Within a genome, diversity was diminished on some chromosomes. Low diversity was always accompanied by an excess of rare alleles. A total of 5,471 SNPs was discovered in 1791 wheat genes. Totals of 1,271, 1,218, and 2,203 SNPs were discovered in 488, 463, and 641 genes of wheat putative diploid ancestors, T. urartu, Aegilops speltoides, and Ae. tauschii, respectively. A public database containing genome-specific primers, SNPs, and other information was constructed. A total of 987 genes with nucleotide diversity estimated in one or more of the wheat genomes was placed on an Ae. tauschii genetic map, and the map was superimposed on wheat deletion-bin maps. The agreement between the maps was assessed. Conclusions In a young polyploid, exemplified by T. aestivum, ancestral species are the primary source of genetic diversity. Low effective recombination due to self-pollination and a genetic mechanism precluding homoeologous chromosome pairing during polyploid meiosis can lead to the loss of diversity from large chromosomal regions. 
The net effect of these factors in T. aestivum is large variation in diversity among genomes and chromosomes, which impacts the development of SNP markers and their practical utility. Accumulation of new mutations in older polyploid species, such as wild emmer, results in increased diversity and its more uniform distribution across the genome. PMID:21156062
Genetic, metabolite and developmental determinism of fruit friction discolouration in pear.
Saeed, Munazza; Brewer, Lester; Johnston, Jason; McGhie, Tony K; Gardiner, Susan E; Heyes, Julian A; Chagné, David
2014-09-16
The unattractive appearance of the surface of pear fruit caused by the postharvest disorder friction discolouration (FD) is responsible for significant consumer dissatisfaction in markets, leading to lower returns to growers. Developing an understanding of the genetic control of FD is essential to enable the full application of genomics-informed breeding for the development of new pear cultivars. Biochemical constituents [phenolic compounds and ascorbic acid (AsA)], polyphenol oxidase (PPO) activity, as well as skin anatomy, have been proposed to play important roles in FD susceptibility in studies on a limited number of cultivars. However, to date there has been no investigation on the biochemical and genetic control of FD, employing segregating populations. In this study, we used 250 seedlings from two segregating populations (POP369 and POP356) derived from interspecific crosses between Asian (Pyrus pyrifolia Nakai and P. bretschneideri Rehd.) and European (P. communis) pears to identify genetic factors associated with susceptibility to FD. Single nucleotide polymorphism (SNP)-based linkage maps suitable for QTL analysis were developed for the parents of both populations. The maps for population POP369 comprised 174 and 265 SNP markers for the male and female parent, respectively, while POP356 maps comprised 353 and 398 SNP markers for the male and female parent, respectively. Phenotypic data for 22 variables were measured over two successive years (2011 and 2012) for POP369 and one year (2011) only for POP356. A total of 221 QTLs were identified that were linked to 22 phenotyped variables, including QTLs associated with FD for both populations that were stable over the successive years. In addition, clear evidence of the influence of developmental factors (fruit maturity) on FD and other variables was also recorded. 
The QTLs associated with fruit firmness, PPO activity, AsA concentration and concentration of polyphenol compounds as well as FD are the first reported for pear. We conclude that the postharvest disorder FD is controlled by multiple small effect QTLs and that it will be very challenging to apply marker-assisted selection based on these QTLs. However, genomic selection could be employed to select elite genotypes with lower or no susceptibility to FD early in the breeding cycle.
Peace, Cameron; Bassil, Nahla; Main, Dorrie; Ficklin, Stephen; Rosyara, Umesh R.; Stegmeir, Travis; Sebolt, Audrey; Gilmore, Barbara; Lawley, Cindy; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Iezzoni, Amy
2012-01-01
High-throughput genome scans are important tools for genetic studies and breeding applications. Here, a 6K SNP array for use with the Illumina Infinium® system was developed for diploid sweet cherry (Prunus avium) and allotetraploid sour cherry (P. cerasus). This effort was led by RosBREED, a community initiative to enable marker-assisted breeding for rosaceous crops. Next-generation sequencing in diverse breeding germplasm provided 25 billion basepairs (Gb) of cherry DNA sequence from which were identified genome-wide SNPs for sweet cherry and for the two sour cherry subgenomes derived from sweet cherry (avium subgenome) and P. fruticosa (fruticosa subgenome). Anchoring to the peach genome sequence, recently released by the International Peach Genome Initiative, predicted relative physical locations of the 1.9 million putative SNPs detected, preliminarily filtered to 368,943 SNPs. Further filtering was guided by results of a 144-SNP subset examined with the Illumina GoldenGate® assay on 160 accessions. A 6K Infinium® II array was designed with SNPs evenly spaced genetically across the sweet and sour cherry genomes. SNPs were developed for each sour cherry subgenome by using minor allele frequency in the sour cherry detection panel to enrich for subgenome-specific SNPs followed by targeting to either subgenome according to alleles observed in sweet cherry. The array was evaluated using panels of sweet (n = 269) and sour (n = 330) cherry breeding germplasm. Approximately one third of array SNPs were informative for each crop. A total of 1825 polymorphic SNPs were verified in sweet cherry, 13% of these originally developed for sour cherry. Allele dosage was resolved for 2058 polymorphic SNPs in sour cherry, one third of these being originally developed for sweet cherry. 
This publicly available genomics resource represents a significant advance in cherry genome-scanning capability that will accelerate marker-locus-trait association discovery, genome structure investigation, and genetic diversity assessment in this diploid-tetraploid crop group. PMID:23284615
Gao, Yangchun; Li, Shiguo; Zhan, Aibin
2018-04-01
Invasive species cause huge damage to ecology, environment and economy globally. A comprehensive understanding of invasion mechanisms, particularly the genetic bases of micro-evolutionary processes responsible for invasion success, is essential for reducing potential damage caused by invasive species. The golden star tunicate, Botryllus schlosseri, has become a model species in invasion biology, mainly owing to its highly invasive nature and small, well-sequenced genome. However, genome-wide genetic markers have not been well developed in this highly invasive species, thus limiting the comprehensive understanding of genetic mechanisms of invasion success. Using restriction site-associated DNA (RAD) tag sequencing, here we developed a high-quality resource of 14,119 out of 158,821 SNPs for B. schlosseri. These SNPs were relatively evenly distributed on each chromosome. SNP annotations showed that the majority of SNPs (63.20%) were located in intergenic regions, and 21.51% and 14.58% were located in introns and exons, respectively. In addition, the potential use of the developed SNPs for population genomics studies was primarily assessed, such as the estimate of observed heterozygosity (HO), expected heterozygosity (HE), nucleotide diversity (π), Wright's inbreeding coefficient (FIS) and effective population size (Ne). Our developed SNP resource would provide future studies with genome-wide genetic markers for genetic and genomic investigations, such as the genetic bases of micro-evolutionary processes responsible for invasion success.
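Three of the summary statistics named above (observed heterozygosity, expected heterozygosity and Wright's inbreeding coefficient) have simple textbook definitions for a single biallelic SNP. The sketch below assumes genotype counts at one locus and uses the uncorrected estimators; the function name and the example counts are hypothetical, and the abstract does not state which estimators or software the authors used.

```python
def snp_diversity(n_aa, n_ab, n_bb):
    """Return (H_O, H_E, F_IS) for one biallelic SNP from genotype counts.

    H_O  = fraction of heterozygous individuals
    H_E  = 2pq under Hardy-Weinberg proportions (no small-sample correction)
    F_IS = 1 - H_O / H_E
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotyped individuals")
    p = (2 * n_aa + n_ab) / (2 * n)      # frequency of allele A
    h_obs = n_ab / n
    h_exp = 2 * p * (1 - p)
    # F_IS is undefined for a monomorphic locus (H_E = 0).
    f_is = 1 - h_obs / h_exp if h_exp > 0 else float("nan")
    return h_obs, h_exp, f_is


# Hypothetical counts: 30 AA, 40 AB, 30 BB.
h_obs, h_exp, f_is = snp_diversity(30, 40, 30)
print(round(h_obs, 3), round(h_exp, 3), round(f_is, 3))
```

Genome-wide values would typically be averages of these per-locus quantities over all SNPs; a positive F_IS, as in the example, indicates fewer heterozygotes than expected.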
Rice SNP-seek database update: new SNPs, indels, and queries.
Mansueto, Locedie; Fuentes, Roven Rommel; Borja, Frances Nikki; Detras, Jeffery; Abriol-Santos, Juan Miguel; Chebotarov, Dmytro; Sanciangco, Millicent; Palis, Kevin; Copetti, Dario; Poliakov, Alexandre; Dubchak, Inna; Solovyev, Victor; Wing, Rod A; Hamilton, Ruaraidh Sackville; Mauleon, Ramil; McNally, Kenneth L; Alexandrov, Nickolai
2017-01-04
We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Oh, Chang Seok; Lee, Soong Deok; Kim, Yi-Suk; Shin, Dong Hoon
2015-01-01
A previous study showed that East Asian mtDNA haplogroups, especially those of Koreans, could be successfully assigned by the coupled use of analyses on coding region SNP markers and control region mutation motifs. In this study, we examined whether the same triple multiplex analysis for coding region SNPs could also be applied to ancient samples from East Asia as a complement to sequence analysis of the mtDNA control region. In Joseon skeleton samples, the mtDNA haplogroup determined by coding region SNP markers fell within the same haplogroup assigned by sequence analysis of the control region. Considering that control region mtDNA sequencing of ancient samples in previous studies has produced a considerable number of errors, coding region SNP analysis can be used as a good complement to conventional haplogroup determination, especially for archaeological human bone samples buried underground over long periods. PMID:26345190
Roden, Suzanne E; Dutton, Peter H; Morin, Phillip A
2009-01-01
The green sea turtle, Chelonia mydas, was used as a case study for single nucleotide polymorphism (SNP) discovery in a species that has little genetic sequence information available. As green turtles have a complex population structure, additional nuclear markers other than microsatellites could add to our understanding of their complex life history. Amplified fragment length polymorphism technique was used to generate sets of random fragments of genomic DNA, which were then electrophoretically separated with precast gels, stained with SYBR green, excised, and directly sequenced. It was possible to perform this method without the use of polyacrylamide gels, radioactive or fluorescent labeled primers, or hybridization methods, reducing the time, expense, and safety hazards of SNP discovery. Within 13 loci, 2547 base pairs were screened, resulting in the discovery of 35 SNPs. Using this method, it was possible to yield a sufficient number of loci to screen for SNP markers without the availability of prior sequence information.
Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography
Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi
2013-01-01
New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO-defined ‘elimination’ status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools was applied to a group of 201 multibacillary patients including six multi-case families from eleven departments. The locations (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR was developed for rapid and inexpensive strain typing of M. leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54], predominantly distributed in the Atlantic departments, and SNP7614(T)/27-5(4)/12-5(5) [T45], associated with the Andean departments. A novel genotype SNP7614(C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river, which separates the Andean from the Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large-scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420
Wu, Peipei; Xie, Jingzhong; Hu, Jinghuang; Qiu, Dan; Liu, Zhiyong; Li, Jingting; Li, Miaomiao; Zhang, Hongjun; Yang, Li; Liu, Hongwei; Zhou, Yang; Zhang, Zhongjun; Li, Hongjie
2018-01-01
Powdery mildew resistance gene Pm4b, originating from Triticum persicum, is effective against the prevalent Blumeria graminis f. sp. tritici (Bgt) isolates from certain regions of wheat production in China. The lack of tightly linked molecular markers with the target gene prevents the precise identification of Pm4b during the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F2:3 mapping population (237 families) derived from a pair of isogenic lines VPM1/7∗Bainong 3217 F4 (carrying Pm4b) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F2:3 families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in an about 25 Mb region at the distal end of chromosome arm 2AL. Four Pm4b-linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking the Pm4b gene. Three SSR markers, Xics13, Xics43, and Xics76, were incorporated in the new genetic linkage map, which located Pm4b in a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship with Brachypodium distachyon chromosome 5, rice chromosome 4, and sorghum chromosome 6. 
Seven genes associated with disease resistance were predicted in this collinear genomic region, which included C2 domain protein, peroxidase activity protein, protein kinases of PKc_like super family, Mlo family protein, and catalytic domain of the serine/threonine kinases (STKc_IRAK like super family). The markers developed in the present study facilitate identification of Pm4b during its MAS practice. PMID:29491869
A second generation genetic linkage map of Japanese flounder (Paralichthys olivaceus)
2010-01-01
Background Japanese flounder (Paralichthys olivaceus) is one of the most economically important marine species in Northeast Asia. Information on genetic markers associated with quantitative trait loci (QTL) can be used in breeding programs to identify and select individuals carrying desired traits. Commercial production of Japanese flounder could be increased by developing disease-resistant fish and improving commercially important traits. Previous maps have been constructed with AFLP markers and a limited number of microsatellite markers. In this study, improved genetic linkage maps are presented. In contrast with previous studies, these maps were built mainly with a large number of codominant markers so they can potentially be used to analyze different families and populations. Results Sex-specific genetic linkage maps were constructed for the Japanese flounder including a total of 1,375 markers [1,268 microsatellites, 105 single nucleotide polymorphisms (SNPs) and two genes]; 1,167 markers are linked to the male map and 1,067 markers are linked to the female map. The lengths of the male and female maps are 1,147.7 cM and 833.8 cM, respectively. Based on estimations of map lengths, the female and male maps covered 79 and 82% of the genome, respectively. Recombination ratio in the new maps revealed F:M of 1:0.7. All linkage groups in the maps presented large differences in the location of sex-specific recombination hot-spots. Conclusions The improved genetic linkage maps are very useful for QTL analyses and marker-assisted selection (MAS) breeding programs for economically important traits in Japanese flounder. In addition, SNP flanking sequences were blasted against Tetraodon nigroviridis (puffer fish) and Danio rerio (zebrafish), and synteny analysis has been carried out. 
The ability to detect synteny among species or genera based on homology analysis of SNP flanking sequences may provide opportunities to complement initial QTL experiments with candidate gene approaches from homologous chromosomal locations identified in related model organisms. PMID:20937088
Guerini, Franca R; Bolognesi, Elisabetta; Manca, Salvatorica; Sotgiu, Stefano; Zanzottera, Milena; Agliardi, Cristina; Usai, Sonia; Clerici, Mario
2009-03-01
Analyses of a 6-Mb region spanning the human leukocyte antigen (HLA) region from the HLA-DR to the HFE gene were performed in 37 families of Sardinian ancestry, all of whom had at least one autistic child, to identify genetic markers associated with autism spectrum disorders (ASD) development. In particular, four microsatellites (MIB, D6S265, MOGc, and D6S2239) and three single-nucleotide polymorphisms (SNPs; two in positions -308 and -238 in the promoter of the TNF-alpha and SNP rs2857766 [V142L] in exon 3 of the MOG gene) were analyzed. An intrafamilial case-control method (affected family-based controls) and transmission disequilibrium test analysis were used to evaluate the association of microsatellite and SNP markers with ASD-affected children. Results indicated positive associations with ASD for D6S265*220 (p < 0.01) and MOGc*131 (p < 0.05) and negative associations for MOGc*117 and MIB*346 alleles (p < 0.01) in ASD children. Polymorphism haplotype analysis indicated that D6S265 allele *220 and MOGc allele *131 were significantly more likely to be transmitted together, as a whole haplotype, to ASD children (p < 0.05). Conversely, the D6S265*224-MOGc*117-rs2857766(G) haplotype was significantly less frequently transmitted to ASD children (p < 0.01). The results present novel gene markers, reinforcing the hypothesis that genetic factors play a pivotal role in the pathogenesis of ASD.
Sturgeon conservation genomics: SNP discovery and validation using RAD sequencing.
Ogden, R; Gharbi, K; Mugue, N; Martinsohn, J; Senn, H; Davey, J W; Pourkazemi, M; McEwing, R; Eland, C; Vidotto, M; Sergeev, A; Congiu, L
2013-06-01
Caviar-producing sturgeons belonging to the genus Acipenser are considered to be one of the most endangered species groups in the world. Continued overfishing in spite of increasing legislation, zero catch quotas and extensive aquaculture production have led to the collapse of wild stocks across Europe and Asia. The evolutionary relationships among Adriatic, Russian, Persian and Siberian sturgeons are complex because of past introgression events and remain poorly understood. Conservation management, traceability and enforcement suffer a lack of appropriate DNA markers for the genetic identification of sturgeon at the species, population and individual level. This study employed RAD sequencing to discover and characterize single nucleotide polymorphism (SNP) DNA markers for use in sturgeon conservation in these four tetraploid species over three biological levels, using a single sequencing lane. Four population meta-samples and eight individual samples from one family were barcoded separately before sequencing. Analysis of 14.4 Gb of paired-end RAD data focused on the identification of SNPs in the paired-end contig, with subsequent in silico and empirical validation of candidate markers. Thousands of putatively informative markers were identified including, for the first time, SNPs that show population-wide differentiation between Russian and Persian sturgeons, representing an important advance in our ability to manage these cryptic species. The results highlight the challenges of genotyping-by-sequencing in polyploid taxa, while establishing the potential genetic resources for developing a new range of caviar traceability and enforcement tools. © 2013 John Wiley & Sons Ltd.
Zhang, Yingxiao; Iaffaldano, Brian J; Zhuang, Xiaofeng; Cardina, John; Cornish, Katrina
2017-02-02
Rubber dandelion (Taraxacum kok-saghyz, TK) is being developed as a domestic source of natural rubber to meet increasing global demand. However, the domestication of TK is complicated by its colocation with two weedy dandelion species, Taraxacum brevicorniculatum (TB) and the common dandelion (Taraxacum officinale, TO). TB is often present as a seed contaminant within TK accessions, while TO is a pandemic weed, which may have the potential to hybridize with TK. To discriminate these species at the molecular level, and facilitate gene flow studies between the potential rubber crop, TK, and its weedy relatives, we generated genomic and marker resources for these three dandelion species. Complete chloroplast genome sequences of TK (151,338 bp), TO (151,299 bp), and TB (151,282 bp) were obtained using the Illumina GAII and MiSeq platforms. Chloroplast sequences were analyzed and annotated for all the three species. Phylogenetic analysis within Asteraceae showed that TK has a closer genetic distance to TB than to TO and Taraxacum species were most closely related to lettuce (Lactuca sativa). By sequencing multiple genotypes for each species and testing variants using gel-based methods, four chloroplast Single Nucleotide Polymorphism (SNP) variants were found to be fixed between TK and TO in large populations, and between TB and TO. Additionally, Expressed Sequence Tag (EST) resources developed for TO and TK permitted the identification of five nuclear species-specific SNP markers. The availability of chloroplast genomes of these three dandelion species, as well as chloroplast and nuclear molecular markers, will provide a powerful genetic resource for germplasm differentiation and purification, and the study of potential gene flow among Taraxacum species.
Genome-wide association analyses for carcass quality in crossbred beef cattle
2013-01-01
Background Genetic improvement of beef quality will benefit both producers and consumers, and can be achieved by selecting animals that carry desired quantitative trait nucleotides (QTN), which result from intensive searches using genetic markers. This paper presents a genome-wide association approach utilizing single nucleotide polymorphisms (SNP) in the Illumina BovineSNP50 BeadChip to seek genomic regions that potentially harbor genes or QTN underlying variation in carcass quality of beef cattle. This study used 747 genotyped animals, mainly crossbred, with phenotypes on twelve carcass quality traits, including hot carcass weight (HCW), back fat thickness (BF), Longissimus dorsi muscle area or ribeye area (REA), marbling scores (MRB), lean yield grade by Beef Improvement Federation formulae (BIFYLD), steak tenderness by Warner-Bratzler shear force 7-day post-mortem (LM7D) as well as body composition as determined by partial rib (IMPS 103) dissection presented as a percentage of total rib weight including body cavity fat (BDFR), lean (LNR), bone (BNR), intermuscular fat (INFR), subcutaneous fat (SQFR), and total fat (TLFR). Results At the genome wide level false discovery rate (FDR < 10%), eight SNP were found significantly associated with HCW. Seven of these SNP were located on Bos taurus autosome (BTA) 6. At a less stringent significance level (P < 0.001), 520 SNP were found significantly associated with mostly individual traits (473 SNP), and multiple traits (47 SNP). Of these significant SNP, 48 were located on BTA6, and 22 of them were in association with hot carcass weight. There were 53 SNP associated with percentage of rib bone, and 12 of them were on BTA20. The rest of the significant SNP were scattered over other chromosomes. They accounted for 1.90 - 5.89% of the phenotypic variance of the traits. A region of approximately 4 Mbp long on BTA6 was found to be a potential area to harbor candidate genes influencing growth. 
One marker on BTA25 accounting for 2.67% of the variation in LM7D may be worth further investigation for the improvement of beef tenderness. Conclusion This study provides useful information to further assist the identification of chromosome regions and subsequently genes affecting carcass quality traits in beef cattle. It also revealed many SNP that acted pleiotropically to affect carcass quality. This knowledge is important in selecting subsets of SNP to improve the performance of beef cattle. PMID:24024930
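The genome-wide threshold above is expressed as a false discovery rate (FDR < 10%). The abstract does not say which FDR procedure was used; the sketch below assumes the common Benjamini-Hochberg step-up rule, and the function name and example p-values are hypothetical.

```python
def benjamini_hochberg(p_values, fdr=0.10):
    """Return sorted indices of tests declared significant at the given FDR.

    Benjamini-Hochberg step-up rule: find the largest rank r such that the
    r-th smallest p-value is <= fdr * r / m, then reject the r smallest.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    cutoff_rank = 0
    for rank, i in enumerate(order, start=1):
        if p_values[i] <= fdr * rank / m:
            cutoff_rank = rank
    return sorted(order[:cutoff_rank])


# Hypothetical per-SNP p-values for one trait.
p_snps = [0.001, 0.04, 0.03, 0.5, 0.012]
significant = benjamini_hochberg(p_snps, fdr=0.10)
print(significant)
```

With the roughly 50,000 SNPs on a 50K array, the same rule applies unchanged; only the list of p-values grows, which is why far fewer SNPs pass the FDR threshold than the nominal P < 0.001 cut-off.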
Mora, Freddy; Quitral, Yerko A.; Matus, Ivan; Russell, Joanne; Waugh, Robbie; del Pozo, Alejandro
2016-01-01
This study identified single nucleotide polymorphism (SNP) markers associated with 15 complex traits in a breeding population of barley (Hordeum vulgare L.) consisting of 137 recombinant chromosome substitution lines (RCSL), evaluated under contrasting water availability conditions in the Mediterranean climatic region of central Chile. Given that markers showed a very strong segregation distortion, a quantitative trait locus/loci (QTL) mapping mixed model was used to account for the heterogeneity in genetic relatedness between genotypes. Fifty-seven QTL were detected under rain-fed conditions, which accounted for 5–22% of the phenotypic variation. In full irrigation conditions, 84 SNPs were significantly associated with the traits studied, explaining 5–35% of phenotypic variation. Most of the QTL were co-localized on chromosomes 2H and 3H. Environment-specific genomic regions were detected for 12 of the 15 traits scored. Although most QTL-trait associations were environment and trait specific, some important and stable associations were also detected. In full irrigation conditions, a relatively major genomic region was found underlying hectoliter weight (HW), on chromosome 1H, which explained between 27% (SNP 2711-234) and 35% (SNP 1923-265) of the phenotypic variation. Interestingly, the locus 1923-265 was also detected for grain yield at both environmental conditions, accounting for 9 and 18%, in the rain-fed and irrigation conditions, respectively. Analysis of QTL in this breeding population identified significant genomic regions that can be used for marker-assisted selection (MAS) of barley in areas where drought is a significant constraint. PMID:27446139
Jo, Ick-Hyun; Sung, Jwakyung; Hong, Chi-Eun; Raveendar, Sebastin; Bang, Kyong-Hwan; Chung, Jong-Wook
2018-05-01
Licorice (Glycyrrhiza glabra) is an important medicinal crop often used in health foods or medicine worldwide. Molecular genetic studies of licorice are scarce owing to a lack of molecular markers. Here, we have developed cleaved amplified polymorphic sequence (CAPS) and high-resolution melting (HRM) markers based on single nucleotide polymorphisms (SNP) by comparing the chloroplast genomes of two Glycyrrhiza species (G. glabra and G. lepidota). The CAPS and HRM markers were tested for diversity analysis with 24 Glycyrrhiza accessions. The restriction profiles generated with CAPS markers classified the accessions (2-4 genotypes) and melting curves (2-3) were obtained from the HRM markers. The number of alleles and major allele frequency were 2-6 and 0.31-0.92, respectively. The genetic distance and polymorphism information content values were 0.16-0.76 and 0.15-0.72, respectively. The phylogenetic relationships among the 24 accessions were estimated using a dendrogram, which classified them into four clades. Except for clade III, the remaining three clades included the same species, confirming interspecies genetic correlation. These 18 CAPS and HRM markers might be helpful for genetic diversity assessment and rapid identification of licorice species.
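As an illustration of the marker statistics reported above, the following sketch computes major allele frequency and polymorphism information content (PIC) for one marker. The abstract does not state which PIC formula was used; the commonly cited Botstein et al. definition is assumed, and the allele frequencies are hypothetical.

```python
from itertools import combinations


def major_allele_frequency(freqs):
    """Frequency of the most common allele at a marker."""
    return max(freqs)


def pic(freqs):
    """Polymorphism information content (assumed Botstein et al. formula).

    PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    """
    homozygosity = sum(p * p for p in freqs)
    correction = sum(2 * (p * q) ** 2 for p, q in combinations(freqs, 2))
    return 1 - homozygosity - correction


# Hypothetical allele frequencies for a two-allele marker.
example_freqs = [0.5, 0.5]
example_pic = pic(example_freqs)  # 1 - 0.5 - 0.125 = 0.375
```

For a biallelic marker, PIC cannot exceed 0.375, so the higher values in the reported range necessarily come from markers with more than two alleles.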
Adiponectin and resistin gene polymorphisms in association with their respective adipokine levels.
Lau, Cia-Hin; Muniandy, Sekaran
2011-05-01
Single nucleotide polymorphisms (SNPs) at the adiponectin and resistin loci are strongly associated with hypoadiponectinemia and hyperresistinemia, which may eventually increase risk of insulin resistance, type 2 diabetes (T2DM), metabolic syndrome (MS), and cardiovascular disease. Real-time PCR was used to genotype SNPs of the adiponectin (SNP+45T>G, SNP+276G>T, SNP+639T>C, and SNP+1212A>G) and resistin (SNP-420C>G and SNP+299G>A) genes in 809 Malaysian men (208 controls, 174 MS without T2DM, 171 T2DM without MS, 256 T2DM with MS) whose ages ranged between 40 and 70 years. The genotyping results for each SNP marker were verified by sequencing. The anthropometric, clinical, and metabolic parameters of subjects were recorded. None of these SNPs at the adiponectin and resistin loci were associated with T2DM and MS susceptibility in Malaysian men. SNP+45T>G, SNP+276G>T, and SNP+639T>C of the adiponectin gene did not influence circulating levels of adiponectin. However, the G-allele of SNP+1212A>G at the adiponectin locus was marginally associated (P = 0.0227) with reduced circulating adiponectin levels. SNP-420C>G (df = 2; F = 16.026; P = 1.50×10^-7) and SNP+299G>A (df = 2; F = 22.944; P = 2.04×10^-10) of the resistin gene were strongly associated with serum resistin levels. Thus, SNP-420C>G and SNP+299G>A of the resistin gene are strongly associated with the risk of hyperresistinemia in Malaysian men. © 2011 The Authors Annals of Human Genetics © 2011 Blackwell Publishing Ltd/University College London.
Qi, L L; Ma, G J; Long, Y M; Hulke, B S; Gong, L; Markell, S G
2015-03-01
The rust resistance gene R 2 was reassigned to linkage group 14 of the sunflower genome. DNA markers linked to R 2 were identified and used for marker-assisted gene pyramiding in a confection type genetic background. Due to the frequent evolution of new pathogen races, sunflower rust is a recurring threat to sunflower production worldwide. The inbred line Morden Cross 29 (MC29) carries the rust resistance gene, R 2 , conferring resistance to numerous races of rust fungus in the US, Canada, and Australia, and can be used as a broad-spectrum resistance resource. Based on phenotypic assessments and SSR marker analyses on the 117 F2 individuals derived from a cross of HA 89 with MC29 (USDA), R 2 was mapped to linkage group (LG) 14 of the sunflower, and not to the previously reported location on LG9. The closest SSR marker HT567 was located at 4.3 cM distal to R 2 . Furthermore, 36 selected SNP markers from LG14 were used to saturate the R 2 region. Two SNP markers, NSA_002316 and SFW01272, flanked R 2 at a genetic distance of 2.8 and 1.8 cM, respectively. Of the three closely linked markers, SFW00211 amplified an allele specific for the presence of R 2 in a marker validation set of 46 breeding lines, and SFW01272 was also shown to be diagnostic for R 2 . These newly developed markers, together with the previously identified markers linked to the gene R 13a , were used to screen 524 F2 individuals from a cross of a confection R 2 line and HA-R6 carrying R 13a . Eleven homozygous double-resistant F2 plants with the gene combination of R 2 and R 13a were obtained. This double-resistant line will be extremely useful in confection sunflower, where few rust R genes are available, risking evolution of new virulence phenotypes and further disease epidemics.
van Geest, Geert; Voorrips, Roeland E; Esselink, Danny; Post, Aike; Visser, Richard Gf; Arens, Paul
2017-08-07
Cultivated chrysanthemum is an outcrossing hexaploid (2n = 6× = 54) with a disputed mode of inheritance. In this paper, we present a single nucleotide polymorphism (SNP) selection pipeline that was used to design an Affymetrix Axiom array with 183 k SNPs from RNA sequencing data (1). With this array, we genotyped four bi-parental populations (with sizes of 405, 53, 76 and 37 offspring plants respectively), and a cultivar panel of 63 genotypes. Further, we present a method for dosage scoring in hexaploids from signal intensities of the array based on mixture models (2) and validation of selection steps in the SNP selection pipeline (3). The resulting genotypic data is used to draw conclusions on the mode of inheritance in chrysanthemum (4), and to make an inference on allelic expression bias (5). With use of the mixture model approach, we successfully called the dosage of 73,936 out of 183,130 SNPs (40.4%) that segregated in any of the bi-parental populations. To investigate the mode of inheritance, we analysed markers that segregated in the large bi-parental population (n = 405). Analysis of segregation of duplex x nulliplex SNPs resulted in evidence for genome-wide hexasomic inheritance. This evidence was substantiated by the absence of strong linkage between markers in repulsion, which indicated absence of full disomic inheritance. We present the success rate of SNP discovery out of RNA sequencing data as affected by different selection steps, among which SNP coverage over genotypes and use of different types of sequence read mapping software. Genomic dosage highly correlated with relative allele coverage from the RNA sequencing data, indicating that most alleles are expressed according to their genomic dosage. The large population, genotyped with a very large number of markers, is a unique framework for extensive genetic analyses in hexaploid chrysanthemum. As starting point, we show conclusive evidence for genome-wide hexasomic inheritance.
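The expected segregation of a duplex x nulliplex marker under hexasomic inheritance can be worked out by counting gametes. The sketch below is an illustration only: it assumes random chromosome pairing with no double reduction, and the function name is hypothetical rather than taken from the paper.

```python
from fractions import Fraction
from math import comb


def hexasomic_gamete_distribution(parent_dosage, ploidy=6):
    """Probability of each allele dosage in a gamete of a polysomic parent.

    Assumption: each gamete is a random sample of ploidy/2 of the parent's
    homologues (hypergeometric sampling, no double reduction).
    """
    gamete_size = ploidy // 2
    total = comb(ploidy, gamete_size)
    distribution = {}
    for k in range(gamete_size + 1):
        # Ways to pick k copies of the marker allele and the rest from the others.
        ways = comb(parent_dosage, k) * comb(ploidy - parent_dosage, gamete_size - k)
        if ways:
            distribution[k] = Fraction(ways, total)
    return distribution


# Duplex parent (dosage 2) crossed with a nulliplex parent (dosage 0):
# the nulliplex parent contributes no marker alleles, so offspring dosage
# equals the dosage of the gamete from the duplex parent.
duplex_by_nulliplex = hexasomic_gamete_distribution(2)
```

Under these assumptions the offspring fall into nulliplex, simplex and duplex classes in a 1:3:1 ratio. Observed counts that fit this ratio are the kind of evidence the abstract describes for hexasomic inheritance.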
Valenzuela-Muñoz, Valentina; Araya-Garay, José Miguel; Gallardo-Escárate, Cristian
2013-06-01
The California red abalone, Haliotis rufescens, which belongs to the family Haliotidae, is the largest species of abalone in the world and has sustained the major fishery and aquaculture production in the USA and Mexico. This native mollusk has not been evaluated or assigned a conservation category, even though in the last few decades it was heavily exploited until it disappeared in some areas along the California coast. In Chile, the red abalone was introduced in the 1970s from California wild abalone stocks for the purposes of aquaculture. Despite the number of years that the red abalone has been cultivated in Chile, crucial genetic information is scarce and critical issues remain unresolved. This study reports and validates novel single nucleotide polymorphism (SNP) markers for the red abalone H. rufescens using cDNA pyrosequencing. A total of 622 high quality SNPs were identified in 146 sequences, with an estimated frequency of 1 SNP per 1000 bp. Forty-five SNP markers with functional information for gene ontology were selected. Of these, 8 were polymorphic among the individuals screened: Heat shock protein 70 (HSP70), vitellogenin (VTG), lysin, alginate lyase enzyme (AL), Glucose-regulated protein 94 (GRP94), fructose-bisphosphate aldolase (FBA), sulfatase 1A precursor (S1AP) and ornithine decarboxylase antizyme (ODC). Two additional sequences with polymorphisms were also identified, but they showed no similarity to known proteins. To validate the putative SNP markers, High Resolution Melting Analysis (HRMA) was conducted in a wild and a hatchery-bred population. Additionally, SNP cross-amplifications were tested in two further native abalone species, Haliotis fulgens and Haliotis corrugata. This study provides novel candidate genes that could be used to evaluate loss of genetic diversity due to hatchery selection or inbreeding effects. Copyright © 2013 Elsevier B.V. All rights reserved.
2012-01-01
Background Cucurbita pepo is a member of the Cucurbitaceae family, the second-most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). Results We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Conclusion Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. 
This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most of these markers are located in the coding regions of genes involved in different physiological processes. The platform will also be useful for future mapping and diversity studies, and will be essential in order to accelerate the process of breeding new and better-adapted squash varieties. PMID:22356647
Welderufael, B G; Løvendahl, Peter; de Koning, Dirk-Jan; Janss, Lucas L G; Fikse, W F
2018-01-01
Because mastitis is very frequent and unavoidable, adding recovery information into the analysis for genetic evaluation of mastitis is of great interest from economic and animal welfare points of view. Here we performed genome-wide association studies (GWAS) to identify associated single nucleotide polymorphisms (SNPs) and to investigate the genetic background not only of susceptibility to mastitis but also of recoverability from it. Somatic cell count records from 993 Danish Holstein cows genotyped for a total of 39378 autosomal SNP markers were used for the association analysis. Single SNP regression analysis was performed using the statistical software package DMU. The substitution effect of each SNP was tested with a t-test, and a genome-wide significance level of P-value < 10^-4 was used to declare a significant SNP-trait association. A number of significant SNP variants were identified for both traits. Many of the SNP variants associated with either susceptibility to or recoverability from mastitis were located in or very near genes that have been reported for their role in the immune system. Genes involved in lymphocyte development (e.g., MAST3 and STAB2) and genes involved in macrophage recruitment and regulation of inflammation (PDGFD and PTX3) were suggested as possible causal genes for susceptibility to and recoverability from mastitis, respectively. However, this is the first GWAS for recoverability from mastitis and our results need to be validated. The findings in the current study are, therefore, a starting point for further investigations in identifying causal genetic variants or chromosomal regions for both susceptibility to and recoverability from mastitis.
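To make the single-SNP test concrete, here is a simplified sketch that regresses a phenotype on allele count (0, 1, 2) and returns the allele substitution effect with its t statistic. It uses ordinary least squares with no fixed or random effects, which is an assumption for illustration; the study itself used the DMU package, whose model is not reproduced here. The data are hypothetical.

```python
from math import sqrt


def snp_substitution_t(genotypes, phenotypes):
    """Slope (allele substitution effect) and t statistic from simple OLS.

    genotypes: allele counts coded 0, 1 or 2; phenotypes: trait values.
    Simplification: no covariates, no polygenic or herd effects.
    """
    n = len(genotypes)
    mean_x = sum(genotypes) / n
    mean_y = sum(phenotypes) / n
    sxx = sum((x - mean_x) ** 2 for x in genotypes)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(genotypes, phenotypes))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Residual sum of squares with n - 2 degrees of freedom.
    rss = sum((y - intercept - slope * x) ** 2 for x, y in zip(genotypes, phenotypes))
    standard_error = sqrt(rss / (n - 2) / sxx)
    return slope, slope / standard_error


# Hypothetical data for six animals at one SNP.
example_genotypes = [0, 1, 2, 0, 1, 2]
example_phenotypes = [1.0, 2.1, 2.9, 1.2, 1.9, 3.1]
slope, t_stat = snp_substitution_t(example_genotypes, example_phenotypes)
```

The t statistic would then be compared against a t distribution with n - 2 degrees of freedom, and the resulting P-value against the genome-wide threshold.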
A case of false mother included with 46 autosomal STR markers.
Li, Li; Lin, Yuan; Liu, Yan; Zhu, Ruxin; Zhao, Zhenmin; Que, Tingzhi
2015-01-01
To solve a maternity case, 19 autosomal short tandem repeats (STRs) were amplified using the AmpFℓSTR® Sinofiler™ kit and PowerPlex® 16 System. An additional 27 autosomal STR loci were analyzed using two domestic kits, AGCU 21+1 and STRtyper-10G. The combined maternity index (CMI) was calculated to be 3.3 × 10^13, but the putative mother denied that she had given birth to the child. In order to reach an accurate conclusion, further testing of 20 X-chromosomal short tandem repeats (X-STRs), 40 single nucleotide polymorphism (SNP) loci, and mitochondrial DNA (mtDNA) was carried out. The putative mother and the boy shared at least one allele at all 46 tested autosomal STR loci. However, according to the profile data of the 20 X-STR and 40 SNP markers, different genotypes at 13 X-STR loci and five SNP loci excluded maternity. Mitochondrial profiles also clearly excluded the putative mother as a parent of the boy because they showed multiple differences. It was finally found that the putative mother is the sister of the biological father. Different kinds of genetic markers are needed to supplement autosomal STR loci in cases where the putative parent is suspected to be related to the true parent.
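The allele-sharing logic behind the exclusion can be sketched as follows. This is a simplified illustration that ignores mutation and genotyping error; the locus names and genotypes are hypothetical, not the case data.

```python
def inconsistent_loci(putative_parent, child):
    """Return loci at which parent and child share no allele.

    Both arguments map a locus name to a tuple of alleles. A hemizygous
    locus (e.g. an X-STR in a boy) is given as a one-element tuple.
    Simplification: mutation and genotyping error are ignored.
    """
    shared_loci = putative_parent.keys() & child.keys()
    return sorted(
        locus
        for locus in shared_loci
        if not set(putative_parent[locus]) & set(child[locus])
    )


# Hypothetical genotypes (illustration only).
example_parent = {"LocusA": (12, 14), "LocusB": (9, 9), "LocusC": (15, 17)}
example_child = {"LocusA": (14, 16), "LocusB": (10, 11), "LocusC": (17,)}
exclusions = inconsistent_loci(example_parent, example_child)
```

An empty result, as with the 46 autosomal STRs in this case, does not prove parentage when the putative parent is a close relative of the true parent, which is why additional marker types were needed.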
Applications of random forest feature selection for fine-scale genetic population assignment.
Sylvester, Emma V A; Bentzen, Paul; Bradbury, Ian R; Clément, Marie; Pearce, Jon; Horne, John; Beiko, Robert G
2018-02-01
Genetic population assignment used to inform wildlife management and conservation efforts requires panels of highly informative genetic markers and sensitive assignment tests. We explored the utility of machine-learning algorithms (random forest, regularized random forest and guided regularized random forest) compared with F_ST ranking for selection of single nucleotide polymorphisms (SNP) for fine-scale population assignment. We applied these methods to an unpublished SNP data set for Atlantic salmon (Salmo salar) and a published SNP data set for Alaskan Chinook salmon (Oncorhynchus tshawytscha). In each species, we identified the minimum panel size required to obtain a self-assignment accuracy of at least 90% using each method to create panels of 50-700 markers. Panels of SNPs identified using random forest-based methods performed up to 7.8 and 11.2 percentage points better than F_ST-selected panels of similar size for the Atlantic salmon and Chinook salmon data, respectively. Self-assignment accuracy ≥90% was obtained with panels of 670 and 384 SNPs for each data set, respectively, a level of accuracy never reached for these species using F_ST-selected panels. Our results demonstrate a role for machine-learning approaches in marker selection across large genomic data sets to improve assignment for management and conservation of exploited populations.
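The F_ST-ranking baseline that the random forest methods were compared against can be sketched in a few lines. The estimator below is a basic Wright-style F_ST from population allele frequencies with equal population weights, which is an assumption; the paper's exact estimator is not given in the abstract. SNP names and frequencies are hypothetical.

```python
def fst(pop_freqs):
    """Wright-style F_ST for one biallelic SNP: (H_T - H_S) / H_T.

    pop_freqs: frequency of one allele in each population.
    Assumption: populations are weighted equally, no sample-size correction.
    """
    mean_p = sum(pop_freqs) / len(pop_freqs)
    h_total = 2 * mean_p * (1 - mean_p)
    if h_total == 0:
        return 0.0  # monomorphic SNP carries no differentiation signal
    h_within = sum(2 * p * (1 - p) for p in pop_freqs) / len(pop_freqs)
    return (h_total - h_within) / h_total


def rank_snps_by_fst(freq_table, panel_size):
    """Pick the panel_size SNPs with the highest F_ST."""
    ranked = sorted(freq_table, key=lambda snp: fst(freq_table[snp]), reverse=True)
    return ranked[:panel_size]


# Hypothetical allele frequencies in two populations.
example_table = {"snp1": [0.5, 0.5], "snp2": [0.1, 0.9], "snp3": [0.4, 0.6]}
panel = rank_snps_by_fst(example_table, panel_size=2)
```

Ranking scores each SNP independently, so it cannot account for redundancy or interactions between markers. That is one plausible reason a multivariate method such as random forest could select a more informative panel of the same size.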
Tumor Touch Imprints as Source for Whole Genome Analysis of Neuroblastoma Tumors
Brunner, Clemens; Brunner-Herglotz, Bettina; Ziegler, Andrea; Frech, Christian; Amann, Gabriele; Ladenstein, Ruth; Ambros, Inge M.; Ambros, Peter F.
2016-01-01
Introduction Tumor touch imprints (TTIs) are routinely used for the molecular diagnosis of neuroblastomas by interphase fluorescence in-situ hybridization (I-FISH). However, in order to facilitate a comprehensive, up-to-date molecular diagnosis of neuroblastomas and to identify new markers to refine risk and therapy stratification methods, whole genome approaches are needed. We examined the applicability of an ultra-high density SNP array platform that identifies copy number changes of varying sizes down to a few exons for the detection of genomic changes in tumor DNA extracted from TTIs. Material and Methods DNAs were extracted from TTIs of 46 neuroblastoma and 4 other pediatric tumors. The DNAs were analyzed on the Cytoscan HD SNP array platform to evaluate numerical and structural genomic aberrations. The quality of the data obtained from TTIs was compared to that from randomly chosen fresh or fresh frozen solid tumors (n = 212) and I-FISH validation was performed. Results SNP array profiles were obtained from 48 (out of 50) TTI DNAs of which 47 showed genomic aberrations. The high marker density allowed for single gene analysis, e.g. loss of nine exons in the ATRX gene and the visualization of chromothripsis. Data quality was comparable to fresh or fresh frozen tumor SNP profiles. SNP array results were confirmed by I-FISH. Conclusion TTIs are an excellent source for SNP array processing with the advantage of simple handling, distribution and storage of tumor tissue on glass slides. The minimal amount of tumor tissue needed to analyze whole genomes makes TTIs an economic surrogate source in the molecular diagnostic work up of tumor samples. PMID:27560999
Technical note: Equivalent genomic models with a residual polygenic effect.
Liu, Z; Goddard, M E; Hayes, B J; Reinhardt, F; Reents, R
2016-03-01
Routine genomic evaluations in animal breeding are usually based on either a BLUP with genomic relationship matrix (GBLUP) or single nucleotide polymorphism (SNP) BLUP model. For a multi-step genomic evaluation, these 2 alternative genomic models were proven to give equivalent predictions for genomic reference animals. The model equivalence was verified also for young genotyped animals without phenotypes. Due to incomplete linkage disequilibrium of SNP markers to genes or causal mutations responsible for genetic inheritance of quantitative traits, SNP markers cannot explain all the genetic variance. A residual polygenic effect is normally fitted in the genomic model to account for the incomplete linkage disequilibrium. In this study, we start by showing the proof that the multi-step GBLUP and SNP BLUP models are equivalent for the reference animals, when they have a residual polygenic effect included. Second, the equivalence of both multi-step genomic models with a residual polygenic effect was also verified for young genotyped animals without phenotypes. Additionally, we derived formulas to convert genomic estimated breeding values of the GBLUP model to its components, direct genomic values and residual polygenic effect. Third, we made a proof that the equivalence of these 2 genomic models with a residual polygenic effect holds also for single-step genomic evaluation. Both the single-step GBLUP and SNP BLUP models lead to equal prediction for genotyped animals with phenotypes (e.g., reference animals), as well as for (young) genotyped animals without phenotypes. Finally, these 2 single-step genomic models with a residual polygenic effect were proven to be equivalent for estimation of SNP effects, too. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
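The core of the GBLUP and SNP BLUP equivalence can be shown in a simplified setting. The derivation below is a sketch under stated assumptions, not the paper's proof: it omits the residual polygenic effect, treats the mean as known, assumes independent residuals with equal variance, and uses an unscaled genomic relationship matrix. The symbols are introduced here for illustration.

```latex
% SNP BLUP: y = 1\mu + Zu + e, with u \sim N(0, I\sigma_u^2), e \sim N(0, I\sigma_e^2),
% and \lambda = \sigma_e^2 / \sigma_u^2.
\hat{u} = (Z'Z + \lambda I)^{-1} Z' (y - 1\mu), \qquad
\widehat{\mathrm{DGV}}_{\mathrm{SNP}} = Z\hat{u}

% GBLUP: g = Zu \sim N(0, G\sigma_u^2) with G = ZZ'.
\hat{g} = G\,(G + \lambda I)^{-1} (y - 1\mu)

% Since Z'(ZZ' + \lambda I) = (Z'Z + \lambda I) Z', it follows that
(Z'Z + \lambda I)^{-1} Z' = Z' (ZZ' + \lambda I)^{-1}

% and therefore
Z\hat{u} = ZZ'(ZZ' + \lambda I)^{-1}(y - 1\mu) = \hat{g}
```

The same identity also recovers the SNP effects from the GBLUP solution, since the GBLUP model gives \hat{u} = Z'(ZZ' + \lambda I)^{-1}(y - 1\mu).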
Identification of single nucleotide polymorphism in ginger using expressed sequence tags
Chandrasekar, Arumugam; Riju, Aikkal; Sithara, Kandiyl; Anoop, Sahadevan; Eapen, Santhosh J
2009-01-01
Ginger (Zingiber officinale Rosc.) (family Zingiberaceae) is a herbaceous perennial whose rhizomes are used as a spice, and it is well known for its medicinal applications. EST-derived SNPs are a free by-product of the currently expanding EST (Expressed Sequence Tag) databases. The development of high-throughput methods for the detection of SNPs (single nucleotide polymorphisms) and small indels (insertions/deletions) has led to a revolution in their use as molecular markers. The available ginger EST sequences (38139) were mined from dbEST of NCBI. The CAP3 program was used to assemble the EST sequences into contigs. Candidate SNPs and indel polymorphisms were detected using the Perl script AutoSNP version 1.0, which used 31905 ESTs for detecting SNP and indel sites. We found 64026 SNP sites and 7034 indel polymorphisms, with a frequency of 0.84 SNPs per 100 bp. Among the three tissues from which the EST libraries had been generated, rhizomes had the highest frequency (1.08 SNPs/indels per 100 bp), leaves had the lowest (0.63 per 100 bp), and roots were intermediate (0.82 per 100 bp). The transition to transversion ratio was 0.90; that is, transversions outnumbered transitions among the detected SNPs. These detected SNPs can be used as markers for genetic studies. Availability: The results of the present study are hosted on our web server at www.spices.res.in/spicesnip PMID:20198184
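The transition to transversion ratio reported above can be computed as in the following sketch. It is an illustration only: the function names are hypothetical, the substitutions are made up, and this is not the AutoSNP implementation.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref, alt):
    """Label a single-base substitution as a transition or a transversion."""
    pair = {ref.upper(), alt.upper()}
    if len(pair) != 2 or not pair <= (PURINES | PYRIMIDINES):
        raise ValueError(f"not a valid substitution: {ref}>{alt}")
    # Transitions stay within purines (A<->G) or within pyrimidines (C<->T).
    if pair <= PURINES or pair <= PYRIMIDINES:
        return "transition"
    return "transversion"


def ts_tv_ratio(substitutions):
    """Ratio of transitions to transversions over (ref, alt) pairs."""
    labels = [classify_substitution(ref, alt) for ref, alt in substitutions]
    transversions = labels.count("transversion")
    if transversions == 0:
        raise ValueError("ratio undefined without transversions")
    return labels.count("transition") / transversions


# Hypothetical substitutions (illustration only).
example_snps = [("A", "G"), ("C", "T"), ("A", "C"), ("G", "T")]
example_ratio = ts_tv_ratio(example_snps)
```

A ratio below 1, such as the 0.90 reported for ginger, means that transversions are the more frequent class.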
Stark, Klaus; Reinhard, Wibke; Grassl, Martina; Erdmann, Jeanette; Schunkert, Heribert; Illig, Thomas; Hengstenberg, Christian
2009-11-05
Recently, a large meta-analysis including over 28,000 participants identified nine different loci with association to serum uric acid (UA) levels. Since elevated serum UA levels potentially cause gout and are a possible risk factor for coronary artery disease (CAD) and myocardial infarction (MI), we performed two large case-control association analyses with participants from the German MI Family Study. In the first study, we assessed the association of the qualitative trait gout and ten single nucleotide polymorphisms (SNP) markers that showed association to UA serum levels. In the second study, the same genetic polymorphisms were analyzed for association with CAD. A total of 683 patients suffering from gout and 1,563 healthy controls from the German MI Family Study were genotyped. Nine SNPs were identified from a recently performed genome-wide meta-analysis on serum UA levels (rs12129861, rs780094, rs734553, rs2231142, rs742132, rs1183201, rs12356193, rs17300741 and rs505802). Additionally, the marker rs6855911 was included which has been associated with gout in our cohort in a previous study. SNPs rs734553 and rs6855911, located in SLC2A9, and SNP rs2231142, known to be a missense polymorphism in ABCG2, were associated with gout (p=5.6*10(-7), p=1.1*10(-7), and p=1.3*10(-3), respectively). Other SNPs in the genes PDZK1, GCKR, LRRC16A, SLC17A1-SLC17A3, SLC16A9, SLC22A11 and SLC22A12 failed the significance level. None of the ten markers were associated with risk to CAD in our study sample of 1,473 CAD cases and 1,241 CAD-free controls. SNP markers in SLC2A9 and ABCG2 genes were found to be strongly associated with the phenotype gout. However, not all SNP markers influencing serum UA levels were also directly associated with the clinical manifestation of gout in our study sample. In addition, none of these SNPs showed association with the risk to CAD in the German MI Family Study.
Ling, Kai-Shu; Harris, Karen R; Meyer, Jenelle D F; Levi, Amnon; Guner, Nihat; Wehner, Todd C; Bendahmane, Abdelhafid; Havey, Michael J
2009-12-01
Zucchini yellow mosaic virus (ZYMV) is one of the most economically important potyviruses infecting cucurbit crops worldwide. Using a candidate gene approach, we cloned and sequenced eIF4E and eIF(iso)4E gene segments in watermelon. Analysis of the nucleotide sequences between the ZYMV-resistant watermelon plant introduction PI 595203 (Citrullus lanatus var. lanatus) and the ZYMV-susceptible watermelon cultivar 'New Hampshire Midget' ('NHM') showed the presence of single nucleotide polymorphisms (SNPs). Initial analysis of the identified SNPs in association studies indicated that SNPs in eIF4E, but not eIF(iso)4E, were closely associated with the ZYMV-resistance phenotype in 70 F(2) and 114 BC(1R) progenies. Subsequently, we focused our efforts on obtaining the entire genomic sequence of watermelon eIF4E. Three SNPs were identified between PI 595203 and NHM. One of the SNPs (A241C) was in exon 1 and the other two SNPs (C309A and T554G) were in the first intron of the gene. SNP241, which resulted in an amino acid substitution (proline to threonine), was shown to be located in the critical cap recognition and binding area, similar to that of several plant species resistant to potyviruses. Analysis of a cleaved amplified polymorphic sequence (CAPS) marker derived from this SNP in F(2) and BC(1R) populations demonstrated cosegregation between the CAPS-2 marker and the ZYMV resistance or susceptibility phenotype. When we investigated whether this SNP mutation in eIF4E was also conserved in several other PIs of C. lanatus var. citroides, we identified a different SNP (A171G) resulting in another amino acid substitution (D71G) in four ZYMV-resistant C. lanatus var. citroides accessions (PI 244018, PI 482261, PI 482299, and PI 482322). Additional CAPS markers were also identified. Availability of all these CAPS markers will enable marker-aided breeding of watermelon for ZYMV resistance.
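A CAPS marker works when a SNP creates or abolishes a restriction enzyme recognition site, so that the two alleles give different digestion patterns. The sketch below checks this for a pair of allele sequences. The sequences are hypothetical and are not the watermelon eIF4E alleles; EcoRI (recognition sequence GAATTC) is used only as a familiar example, and only the given strand is searched, which is sufficient for palindromic sites.

```python
def site_positions(sequence, site):
    """Start positions of a recognition site in a sequence (given strand only)."""
    sequence = sequence.upper()
    site = site.upper()
    return [
        i
        for i in range(len(sequence) - len(site) + 1)
        if sequence[i : i + len(site)] == site
    ]


def is_caps_candidate(allele_a, allele_b, site):
    """True if the two alleles differ in the number of recognition sites."""
    return len(site_positions(allele_a, site)) != len(site_positions(allele_b, site))


# Hypothetical amplicon alleles differing at one base (A vs G at position 4).
example_allele_a = "TTGAATTCAA"  # contains GAATTC
example_allele_b = "TTGAGTTCAA"  # SNP abolishes the site
ecori_site = "GAATTC"
caps_ok = is_caps_candidate(example_allele_a, example_allele_b, ecori_site)
```

In practice, the amplicon from each plant would be digested and sized on a gel, with cut and uncut fragments distinguishing the two alleles.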
Henshall, John M; Dierens, Leanne; Sellars, Melony J
2014-09-02
While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. 
Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are sufficiently accurate to provide useful information for a breeding program. Treating genotypes as quantitative values is an alternative to perturbing genotypes using an assumed error distribution, but can produce very different results. An understanding of the distribution of the error is required for SNP genotyping platforms.
Development of a Single Nucleotide Polymorphism Barcode to Genotype Plasmodium vivax Infections
Baniecki, Mary Lynn; Faust, Aubrey L.; Schaffner, Stephen F.; Park, Daniel J.; Galinsky, Kevin; Daniels, Rachel F.; Hamilton, Elizabeth; Ferreira, Marcelo U.; Karunaweera, Nadira D.; Serre, David; Zimmerman, Peter A.; Sá, Juliana M.; Wellems, Thomas E.; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E.; Volkman, Sarah K.; Wirth, Dyann F.; Sabeti, Pardis C.
2015-01-01
Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25–40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890
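The informativeness criterion quoted above (average minor allele frequency > 0.1) can be checked with a short calculation. The sketch below assumes each sample's barcode is a string with one base call per SNP and that `N` marks a missing call. The four-SNP barcodes are invented for illustration and are not data from the study.

```python
from collections import Counter


def minor_allele_frequency(calls):
    """MAF at one SNP: frequency of the second most common called base."""
    counts = Counter(c for c in calls if c in "ACGT")  # ignore missing calls
    total = sum(counts.values())
    if total == 0 or len(counts) < 2:
        return 0.0  # monomorphic or uncalled position
    return counts.most_common(2)[1][1] / total


def per_snp_maf(barcodes):
    """Transpose sample barcodes into SNP columns and score each column."""
    return [minor_allele_frequency(column) for column in zip(*barcodes)]


# Hypothetical barcodes for four samples at four SNP positions.
barcodes = ["ACGT", "ACGA", "GCGT", "ANGT"]
mafs = per_snp_maf(barcodes)
print(mafs, sum(mafs) / len(mafs))
```

Mixed-genotype infections would need an extra call state. This sketch simply drops anything that is not A, C, G or T.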
Transcriptome sequencing for high throughput SNP development and genetic mapping in Pea
2014-01-01
Background Pea has a complex genome of 4.3 Gb for which only limited genomic resources are available to date. Although SNP markers are now highly valuable for research and modern breeding, only a few are described and used in pea for genetic diversity and linkage analysis. Results We developed a large resource by cDNA sequencing of 8 genotypes representative of modern breeding material using the Roche 454 technology, combining both long reads (400 bp) and high coverage (3.8 million reads, reaching a total of 1,369 megabases). Sequencing data were assembled and generated a 68 K unigene set, from which 41 K were annotated from their best blast hit against the model species Medicago truncatula. Annotated contigs showed an even distribution along M. truncatula pseudochromosomes, suggesting a good representation of the pea genome. 10 K pea contigs were found to be polymorphic among the genetic material surveyed, corresponding to 35 K SNPs. We validated a subset of 1538 SNPs through the GoldenGate assay, proving their ability to structure a diversity panel of breeding germplasm. Among them, 1340 were genetically mapped and used to build a new consensus map comprising a total of 2070 markers. Based on blast analysis, we could establish 1252 bridges between our pea consensus map and the pseudochromosomes of M. truncatula, which provides new insight on synteny between the two species. Conclusions Our approach created significant new resources in pea, i.e. the most comprehensive genetic map to date tightly linked to the model species M. truncatula and a large SNP resource for both academic research and breeding. PMID:24521263
2013-01-01
Background Efficient screening of bacterial artificial chromosome (BAC) libraries with polymerase chain reaction (PCR)-based markers is feasible provided that a multidimensional pooling strategy is implemented. Single nucleotide polymorphisms (SNPs) can be screened in multiplexed format, therefore this marker type lends itself particularly well for medium- to high-throughput applications. Combining the power of multiplex-PCR assays with a multidimensional pooling system may prove to be especially challenging in a polyploid genome. In polyploid genomes two classes of SNPs need to be distinguished, polymorphisms between accessions (intragenomic SNPs) and those differentiating between homoeologous genomes (intergenomic SNPs). We have assessed whether the highly parallel Illumina GoldenGate® Genotyping Assay is suitable for the screening of a BAC library of the polyploid Brassica napus genome. Results A multidimensional screening platform was developed for a Brassica napus BAC library which is composed of almost 83,000 clones. Intragenomic and intergenomic SNPs were included in Illumina’s GoldenGate® Genotyping Assay and both SNP classes were used successfully for screening of the multidimensional BAC pools of the Brassica napus library. An optimized scoring method is proposed which is especially valuable for SNP calling of intergenomic SNPs. Validation of the genotyping results by independent methods revealed a success of approximately 80% for the multiplex PCR-based screening regardless of whether intra- or intergenomic SNPs were evaluated. Conclusions Illumina’s GoldenGate® Genotyping Assay can be efficiently used for screening of multidimensional Brassica napus BAC pools. SNP calling was specifically tailored for the evaluation of BAC pool screening data. The developed scoring method can be implemented independently of plant reference samples. It is demonstrated that intergenomic SNPs represent a powerful tool for BAC library screening of a polyploid genome. 
PMID:24010766
Liu, Dewu; Zhang, Yushan; Du, Yinjun; Yang, Guanfu; Zhang, Xiquan
2007-06-01
The growth-correlated genes that are part of the neuroendocrine growth axis play crucial roles in the regulation of growth and development in pigs. Identifying genetic polymorphisms in these genes will make it possible to evaluate their biological relevance and to gain a better understanding of quantitative traits such as growth. In the present study, seven pairs of primers were designed to obtain unknown sequences of growth-correlated genes, and another 25 pairs of primers were designed to identify single nucleotide polymorphisms (SNPs) using denaturing high-performance liquid chromatography (DHPLC) in four pig breeds (Duroc, Landrace, Lantang and Wuzhishan) that differ significantly in growth and development characteristics. A total of 101 polymorphisms were discovered in 10,707 base pairs (bp) from six genes: ghrelin (GHRL), leptin (LEP), insulin-like growth factor II (IGF-II), insulin-like growth factor binding protein 2 (IGFBP-2), insulin-like growth factor binding protein 3 (IGFBP-3), and somatostatin (SS). The observed average distances between SNPs in the 5'UTR, coding regions, introns and 3'UTR were 134, 521, 81 and 92 bp, respectively. Four SNPs were found in the coding regions of IGF-II, IGFBP-2 and LEP: two synonymous mutations, in IGF-II and LEP, and two non-synonymous mutations, in IGFBP-2 and LEP. Seven other mutations were also observed. Thirty-two PCR-RFLP markers were found among the 101 polymorphisms of the six genes. The SNPs discovered in this study should provide suitable markers for association studies of candidate genes with growth-related traits in pigs.
Li, Faji; Wen, Weie; He, Zhonghu; Liu, Jindong; Jin, Hui; Cao, Shuanghe; Geng, Hongwei; Yan, Jun; Zhang, Pingzhi; Wan, Yingxiu; Xia, Xianchun
2018-06-01
We identified 21 new and stable QTL, and 11 QTL clusters for yield-related traits in three bread wheat populations using the wheat 90 K SNP assay. Identification of quantitative trait loci (QTL) for yield-related traits and closely linked molecular markers is important in order to identify gene/QTL for marker-assisted selection (MAS) in wheat breeding. The objectives of the present study were to identify QTL for yield-related traits and dissect the relationships among different traits in three wheat recombinant inbred line (RIL) populations derived from crosses Doumai × Shi 4185 (D × S), Gaocheng 8901 × Zhoumai 16 (G × Z) and Linmai 2 × Zhong 892 (L × Z). Using the available high-density linkage maps previously constructed with the wheat 90 K iSelect single nucleotide polymorphism (SNP) array, 65, 46 and 53 QTL for 12 traits were identified in the three RIL populations, respectively. Among them, 34, 23 and 27 were likely to be new QTL. Eighteen common QTL were detected across two or three populations. Eleven QTL clusters harboring multiple QTL were detected in different populations, and the interval 15.5-32.3 cM around the Rht-B1 locus on chromosome 4BS harboring 20 QTL is an important region determining grain yield (GY). Thousand-kernel weight (TKW) is significantly affected by kernel width and plant height (PH), whereas flag leaf width can be used to select lines with large kernel number per spike. Eleven candidate genes were identified, including eight cloned genes for kernel, heading date (HD) and PH-related traits as well as predicted genes for TKW, spike length and HD. The closest SNP markers of stable QTL or QTL clusters can be used for MAS in wheat breeding using kompetitive allele-specific PCR or semi-thermal asymmetric reverse PCR assays for improvement of GY.
Shiokai, Sachiko; Kitashiba, Hiroyasu; Nishio, Takeshi
2010-08-01
Although the dot-blot-SNP technique is a simple cost-saving technique suitable for genotyping of many plant individuals, optimization of hybridization and washing conditions for each SNP marker requires much time and labor. For prediction of the optimum hybridization conditions for each probe, we compared Tm values estimated from nucleotide sequences using the DINAMelt web server, measured Tm values, and hybridization conditions yielding allele-specific signals. The estimated Tm values were comparable to the measured Tm values with small differences of less than 3 °C for most of the probes. There were differences of approximately 14 °C between the specific signal detection conditions and estimated Tm values. Change of one level of SSC concentrations of 0.1, 0.2, 0.5, and 1.0x SSC corresponded to a difference of approximately 5 °C in optimum signal detection temperature. Increasing the sensitivity of signal detection by shortening the exposure time to X-ray film changed the optimum hybridization condition for specific signal detection. Addition of competitive oligonucleotides to the hybridization mixture increased the suitable hybridization conditions by 1.8. Based on these results, optimum hybridization conditions for newly produced dot-blot-SNP markers will become predictable.
USDA-ARS's Scientific Manuscript database
Downy mildew, which is caused by fungus Plasmopara halstedii (Farl.) Berlese & de Toni, is one of the most important diseases that affect sunflower production globally. Two downy mildew resistance genes, PlArg and Pl8, were discovered in the late 1980s. Over two decades, PlArg is still effective aga...
Edet, Offiong Ukpong; Kim, June-Sik; Okamoto, Masanori; Hanada, Kousuke; Takeda, Tomoyuki; Kishii, Masahiro; Gorafi, Yasir Serag Alnor; Tsujimoto, Hisashi
2018-03-27
The tertiary gene pool of bread wheat, to which Leymus racemosus belongs, has remained underutilized due to the current limited genomic resources of the species that constitute it. Continuous enrichment of public databases with useful information regarding these species is, therefore, needed to provide insights on their genome structures and aid successful utilization of their genes to develop improved wheat cultivars for effective management of environmental stresses. We generated de novo DNA and mRNA sequence information of L. racemosus and developed 110 polymorphic PCR-based markers from the data, and to complement the PCR markers, DArT-seq genotyping was applied to develop additional 9990 SNP markers. Approximately 52% of all the markers enabled us to clearly genotype 22 wheat-L. racemosus chromosome introgression lines, and L. racemosus chromosome-specific markers were highly efficient in detailed characterization of the translocation and recombination lines analyzed. A further analysis revealed remarkable transferability of the PCR markers to three other important Triticeae perennial species: L. mollis, Psathyrostachys huashanica and Elymus ciliaris, indicating their suitability for characterizing wheat-alien chromosome introgressions carrying chromosomes of these genomes. The efficiency of the markers in characterizing wheat-L. racemosus chromosome introgression lines proves their reliability, and their high transferability further broadens their scope of application. This is the first report on sequencing and development of markers from L. racemosus genome and the application of DArT-seq to develop markers from a perennial wild relative of wheat, marking a paradigm shift from the seeming concentration of the technology on cultivated species. Integration of these markers with appropriate cytogenetic methods would accelerate development and characterization of wheat-alien chromosome introgression lines.
Otto, Lars-Gernot; Mondal, Prodyut; Brassac, Jonathan; Preiss, Susanne; Degenhardt, Jörg; He, Sang; Reif, Jochen Christoph; Sharbel, Timothy Francis
2017-08-10
Chamomile (Matricaria recutita L.) has a long history of use in herbal medicine with various applications, and the flower heads contain numerous secondary metabolites which are medicinally active. In the major crop plants, next generation sequencing (NGS) approaches are intensely applied to exploit genetic resources, to develop genomic resources and to enhance breeding. Here, genotyping-by-sequencing (GBS) has been used in the non-model medicinal plant chamomile to evaluate the genetic structure of the cultivated varieties/populations, and to perform genome wide association study (GWAS) focusing on genes with large effect on flowering time and the medicinally important alpha-bisabolol content. GBS analysis allowed the identification of 6495 high-quality SNP-markers in our panel of 91 M. recutita plants from 33 origins (2-4 genotypes each) and 4 M. discoidea plants as outgroup, grown in the greenhouse in Gatersleben, Germany. M. recutita proved to be clearly distinct from the outgroup, as was demonstrated by different cluster and principal coordinate analyses using the SNP-markers. Chamomile genotypes from the same origin were mostly genetically similar. Model-based cluster analysis revealed one large group of tetraploid genotypes with low genetic differentiation including 39 plants from 14 origins. Tetraploids tended to display lower genetic diversity than diploids, probably reflecting their origin by artificial polyploidisation from only a limited set of genetic backgrounds. Analyses of flowering time demonstrated that diploids generally flowered earlier than tetraploids, and the analysis of alpha-bisabolol identified several tetraploid genotypes with a high content. GWAS identified highly significant (P < 0.01) SNPs for flowering time (9) and alpha-bisabolol (71). 
One sequence harbouring SNPs associated with flowering time was described to play a role in self-pollination in Arabidopsis thaliana, whereas four sequences harbouring SNPs associated with alpha-bisabolol were identified to be involved in plant biotic and abiotic stress response in various plant species. The first genomic resource for future applications to enhance breeding in chamomile was created, and analyses of diversity will facilitate the exploitation of these genetic resources. The GWAS data pave the way for future research towards the genetics underlying important traits in chamomile, the identification of marker-trait associations, and development of reliable markers for practical breeding.
Lind, Mårten; Källman, Thomas; Chen, Jun; Ma, Xiao-Fei; Bousquet, Jean; Morgante, Michele; Zaina, Giusi; Karlsson, Bo; Elfstrand, Malin; Lascoux, Martin; Stenlid, Jan
2014-01-01
A consensus linkage map of Picea abies, an economically important conifer, was constructed based on the segregation of 686 SNP markers in an F1 progeny population consisting of 247 individuals. The total length of 1889.2 cM covered 96.5% of the estimated genome length and comprised 12 large linkage groups, corresponding to the number of haploid P. abies chromosomes. The sizes of the groups (from 5.9 to 9.9% of the total map length) correlated well with previous estimates of chromosome sizes (from 5.8 to 10.8% of total genome size). Any locus in the genome has a 97% probability of being within 10 cM of a mapped marker, which makes the map suited for QTL mapping. Infecting the progeny trees with the root rot pathogen Heterobasidion parviporum allowed for mapping of four different resistance traits: lesion length at the inoculation site, fungal spread within the sapwood, exclusion of the pathogen from the host after initial infection, and ability to prevent the infection from establishing at all. These four traits were associated with two, four, four and three QTL regions respectively, none of which overlapped between the traits. Each QTL explained between 4.6 and 10.1% of the respective trait's phenotypic variation. Although the QTL regions contain many more genes than the ones represented by the SNP markers, at least four markers within the confidence intervals originated from genes with known function in conifer defence; a leucoanthocyanidine reductase, which has previously been shown to upregulate during H. parviporum infection, and three intermediates of the lignification process; a hydroxycinnamoyl CoA shikimate/quinate hydroxycinnamoyltransferase, a 4-coumarate CoA ligase, and an R2R3-MYB transcription factor. PMID:25036209
Tao, Aifen; Huang, Long; Wu, Guifen; Afshar, Reza Keshavarz; Qi, Jianmin; Xu, Jiantang; Fang, Pingping; Lin, Lihui; Zhang, Liwu; Lin, Peiqing
2017-05-08
Genetic mapping and quantitative trait locus (QTL) detection are powerful methodologies in plant improvement and breeding. White jute (Corchorus capsularis L.) is an important industrial raw material fiber crop because of its elite characteristics. However, construction of a high-density genetic map and identification of QTLs has been limited in white jute due to a lack of sufficient molecular markers. The specific locus amplified fragment sequencing (SLAF-seq) strategy combines locus-specific amplification and high-throughput sequencing to carry out de novo single nucleotide polymorphism (SNP) discovery and large-scale genotyping. In this study, SLAF-seq was employed to obtain sufficient markers to construct a high-density genetic map for white jute. Moreover, with the development of abundant markers, genetic dissection of fiber yield traits such as plant height was also possible. Here, we present QTLs associated with plant height that were identified using our newly constructed genetic linkage groups. An F8 population consisting of 100 lines was developed. In total, 69,446 high-quality SLAFs were detected of which 5,074 SLAFs were polymorphic; 913 polymorphic markers were used for the construction of a genetic map. The average coverage for each SLAF marker was 43-fold in the parents, and 9.8-fold in each F8 individual. A linkage map was constructed that contained 913 SLAFs on 11 linkage groups (LGs) covering 1621.4 cM with an average density of 1.61 cM per locus. Among the 11 LGs, LG1 was the largest with 210 markers, a length of 406.34 cM, and an average distance of 1.93 cM between adjacent markers. LG11 was the smallest with only 25 markers, a length of 29.66 cM, and an average distance of 1.19 cM between adjacent markers. 'SNP_only' markers accounted for 85.54% and were the predominant markers on the map.
QTL mapping based on the F8 phenotypes detected 11 plant height QTLs including one major effect QTL across two cultivation locations, with each QTL accounting for 4.14-15.63% of the phenotypic variance. To our knowledge, the linkage map constructed here is the densest one available to date for white jute. This analysis also identified the first QTL in white jute. The results will provide an important platform for gene/QTL mapping, sequence assembly, genome comparisons, and marker-assisted selection breeding for white jute.
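The per-group marker spacings reported for the white jute map appear consistent with dividing each linkage group's length by its marker count, which the sketch below reproduces for LG1 and LG11. The helper name is hypothetical, and the division rule is inferred from the reported numbers rather than stated in the abstract. The map-wide figure of 1.61 cM per locus does not follow from the same division (1621.4 / 913 is about 1.78) and may have been computed differently in the paper.

```python
def mean_spacing(n_markers, length_cm):
    """Average map distance per marker, taken as length / marker count."""
    return length_cm / n_markers


# Values quoted in the abstract: (number of markers, length in cM).
linkage_groups = {"LG1": (210, 406.34), "LG11": (25, 29.66)}

for name, (n_markers, length_cm) in linkage_groups.items():
    print(name, round(mean_spacing(n_markers, length_cm), 2))
```

Dividing by the number of intervals (markers minus one) would be the stricter definition of distance between adjacent markers and gives slightly larger values.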
Shimosako, Nana; Kerr, Jonathan R
2014-12-01
We have reported gene expression changes in patients with chronic fatigue syndrome/myalgic encephalomyelitis (CFS/ME) and the fact that such gene expression data can be used to identify subtypes of CFS/ME with distinct clinical phenotypes. Due to the difficulties in using a comparative gene expression method as an aid to CFS/ME disease and subtype-specific diagnosis, we have attempted to develop such a method based on single-nucleotide polymorphism (SNP) analysis. To identify SNP allele associations with CFS/ME and CFS/ME subtypes, we tested genomic DNA of patients with CFS/ME (n=108), patients with endogenous depression (n=17) and normal blood donors (n=68) for 504 human SNP alleles located within 88 CFS-associated human genes using the SNP Genotyping GoldenGate Assay (Illumina, San Diego, California, USA). 360 ancestry informative markers (AIM) were also examined. 21 SNPs were significantly associated with CFS/ME compared with depression and normal groups. 148 SNP alleles had a significant association with one or more CFS/ME subtypes. For each subtype, associated SNPs tended to be grouped together within particular genes. AIM SNPs indicated that 4 subjects were of Asian origin while the remainder were Caucasian. Hierarchical clustering of AIM data revealed the relatedness between 2 couples of patients with CFS only and confirmed the overall heterogeneity of all subjects. This study provides evidence that human SNPs located within CFS/ME associated genes are associated with particular genomic subtypes of CFS/ME. Further work is required to develop this into a clinically useful subtype-specific diagnostic test. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Analysis of SNP rs16754 of WT1 gene in a series of de novo acute myeloid leukemia patients.
Luna, Irene; Such, Esperanza; Cervera, Jose; Barragán, Eva; Jiménez-Velasco, Antonio; Dolz, Sandra; Ibáñez, Mariam; Gómez-Seguí, Inés; López-Pavía, María; Llop, Marta; Fuster, Óscar; Oltra, Silvestre; Moscardó, Federico; Martínez-Cuadrón, David; Senent, M Leonor; Gascón, Adriana; Montesinos, Pau; Martín, Guillermo; Bolufer, Pascual; Sanz, Miguel A
2012-12-01
The single nucleotide polymorphism (SNP) rs16754 of the WT1 gene has been previously described as a possible prognostic marker in normal karyotype acute myeloid leukemia (AML) patients. Nevertheless, the findings in this field are not always reproducible in different series. One hundred and seventy-five adult de novo AML patients were screened with two different methods for the detection of SNP rs16754: high-resolution melting (HRM) and FRET hybridization probes. Direct sequencing was used to validate both techniques. The SNP was detected in 52 out of 175 patients (30%), both by HRM and hybridization probes. Direct sequencing confirmed that every positive sample in the screening methods had a variation in the DNA sequence. Patients with the wild-type genotype (WT1(AA)) for the SNP rs16754 were significantly younger than those with the heterozygous WT1(AG) genotype. No other difference was observed in baseline characteristics or outcome between patients with or without the SNP. Both techniques are equally reliable and reproducible as screening methods for the detection of the SNP rs16754, allowing for the selection of those samples that will need to be sequenced. We were unable to confirm the suggested favorable outcome of SNP rs16754 in de novo AML.
Clevenger, Josh; Chu, Ye; Chavarro, Carolina; Botton, Stephanie; Culbreath, Albert; Isleib, Thomas G.; Holbrook, C. C.; Ozias-Akins, Peggy
2018-01-01
Late leaf spot (LLS; Cercosporidium personatum) is a major fungal disease of cultivated peanut (Arachis hypogaea). A recombinant inbred line population segregating for quantitative field resistance was used to identify quantitative trait loci (QTL) using QTL-seq. High rates of false positive SNP calls using established methods in this allotetraploid crop obscured significant QTLs. To resolve this problem, robust parental SNPs were first identified using polyploid-specific SNP identification pipelines, leading to discovery of significant QTLs for LLS resistance. These QTLs were confirmed over 4 years of field data. Selection with markers linked to these QTLs resulted in a significant increase in resistance, showing that these markers can be immediately applied in breeding programs. This study demonstrates that QTL-seq can be used to rapidly identify QTLs controlling highly quantitative traits in polyploid crops with complex genomes. Markers identified can then be deployed in breeding programs, increasing the efficiency of selection using molecular tools. Key Message: Field resistance to late leaf spot is a quantitative trait controlled by many QTLs. Using polyploid-specific methods, QTL-seq is faster and more cost effective than QTL mapping. PMID:29459876
Wang, Hongbo; Ye, Shengtuo; Mou, Tongmin
2016-12-01
The development of hybrid rice is a practical approach for increasing rice production. However, the brown planthopper (BPH), Nilaparvata lugens Stål, causes severe yield loss of rice (Oryza sativa L.) and can threaten food security. Therefore, breeding hybrid rice resistant to BPH is the most effective and economical strategy to maintain high and stable production. Fortunately, numerous BPH resistance genes have been identified, and abundant linkage markers are available for molecular marker-assisted selection (MAS) in breeding programs. Hence, we pyramided two BPH resistance genes, Bph14 and Bph15, into a susceptible CMS restorer line Huahui938 and its derived hybrids using MAS to improve the BPH resistance of hybrid rice. Three near-isogenic lines (NILs) with pyramided Bph14 and Bph15 were obtained by molecular marker-assisted backcross (MAB) and phenotypic selection. The genomic components of these NILs were detected using the whole-genome SNP (single nucleotide polymorphism) array, RICE6K, suggesting that the recurrent parent genome (RPG) recovery of the NILs was 87.88, 87.70 and 86.62%, respectively. BPH bioassays showed that the improved NILs and their derived hybrids carrying homozygous Bph14 and Bph15 were resistant to BPH. However, the hybrids with heterozygous Bph14 and Bph15 remained susceptible to BPH. The developed NILs showed no significant differences in major agronomic traits and rice qualities compared with the recurrent parent. Moreover, the improved hybrids derived from the NILs exhibited better agronomic performance and rice quality compared with the controls under natural field conditions. This study demonstrates that it is essential to stack Bph14 and Bph15 into both the maternal and paternal parents for developing BPH-resistant hybrid rice varieties.
The SNP array with abundant DNA markers is an efficient tool for analyzing the RPG recovery of progenies and can be used to monitor the donor segments in NILs, thus being extremely important for rice molecular breeding.
Associations of genetic markers in cattle receiving differing implant protocols.
King, D A; Shackelford, S D; McDaneld, T G; Kuehn, L A; Kemp, C M; Smith, T P L; Wheeler, T L; Koohmaraie, M
2012-07-01
The potential interaction of growth-promoting implants and genetic markers previously reported to be associated with growth, carcass traits, and tenderness was evaluated. Two implant protocols were applied to subsets of steers (n = 383) and heifers (n = 65) that were also genotyped for 47 SNP reported to be associated with variation in growth, fat thickness, LM area, marbling, or tenderness. The "mild" protocol consisted of a single terminal implant [16 mg estradiol benzoate (EB), 80 mg trenbolone acetate (TBA) or 8 mg EB, 80 mg TBA given to steers and heifers, respectively]. The "aggressive" protocol consisted of both a growing implant (8 mg EB, 40 mg TBA) for the lightest half of the animals on the aggressive protocol and 2 successive implants (28 mg EB, 200 mg TBA) given to all animals assigned to the aggressive treatment. Implant protocol had a measurable impact on BW and ADG (P < 0.05), with the aggressive protocol increasing these traits before the terminal implant (relative to the mild protocol), whereas the mild protocol increased ADG after the terminal implant so that the final BW and ADG over the experimental period were similar between protocols. Animals on the aggressive protocol had significantly increased (P < 0.05) LM area (1.9 cm²), slice shear force (1.4 kg), and intact desmin (0.05 units), but decreased (P < 0.05) marbling score (49 units), adjusted fat thickness (0.1 cm), and yield grade (0.15 units). Among both treatments, 8 of 9 growth-related SNP were associated with BW or ADG, and 6 of 17 tenderness-related SNP were associated with slice shear force or intact desmin. Favorable growth alleles generally were associated with increased carcass yield traits but decreased tenderness. Similarly, favorable tenderness genotypes for some markers were associated with decreased BW and ADG. Some interactions of implant protocol and genotype were noted, with some growth SNP alleles increasing the effect of the aggressive protocol.
In contrast, putative beneficial effects of favorable tenderness SNP alleles were mitigated by the effects of the aggressive implant protocol. These types of antagonisms between management variables and genotypes must be accounted for in marker-assisted selection (MAS) programs, and our results suggest that MAS could be used to manage, but likely will not eliminate, the negative impact of implants on quality.
Ponomarenko, Petr; Chadaeva, Irina; Rasskazov, Dmitry A.; Sharypova, Ekaterina; Kashina, Elena V.; Drachkova, Irina; Zhechev, Dmitry; Ponomarenko, Mikhail P.; Savinkova, Ludmila K.; Kolchanov, Nikolay
2017-01-01
While year after year, conditions, quality, and duration of human lives have been improving due to the progress in science, technology, education, and medicine, only eight diseases have been increasing in prevalence and shortening human lives because of premature deaths according to the retrospective official review on the state of US health, 1990-2010. These diseases are kidney cancer, chronic kidney diseases, liver cancer, diabetes, drug addiction, poisoning cases, consequences of falls, and Alzheimer's disease (AD) as one of the leading pathologies. There are familial AD of hereditary nature (~4% of cases) and sporadic AD of unclear etiology (remaining ~96% of cases; i.e., non-familial AD). Therefore, sporadic AD is no longer a purely medical problem, but rather a social challenge when someone asks oneself: “What can I do in my own adulthood to reduce the risk of sporadic AD at my old age to save the years of my lifespan from the destruction caused by it?” Here, we combine two computational approaches for regulatory SNPs: Web service SNP_TATA_Comparator for sequence analysis and a PubMed-based keyword search for articles on the biochemical markers of diseases. Our purpose was to try to find answers to the question: “What can be done in adulthood to reduce the risk of sporadic AD in old age to prevent the lifespan reduction caused by it?” As a result, we found 89 candidate SNP markers of familial and sporadic AD (e.g., rs562962093 is associated with sporadic AD in the elderly as a complication of stroke in adulthood, where natural marine diets can reduce risks of both diseases in case of the minor allele of this SNP). In addition, rs768454929, and rs761695685 correlate with sporadic AD as a comorbidity of short stature, where maximizing stature in childhood and adolescence as an integral indicator of health can minimize (or even eliminate) the risk of sporadic AD in the elderly. 
After validation by clinical protocols, these candidate SNP markers may become interesting to the general population [may help to choose a lifestyle (in childhood, adolescence, and adulthood) that can reduce the risks of sporadic AD, its comorbidities, and complications in the elderly]. PMID:28775688
Performance of the SNPforID 52 SNP-plex assay in paternity testing.
Børsting, Claus; Sanchez, Juan J; Hansen, Hanna E; Hansen, Anders J; Bruun, Hanne Q; Morling, Niels
2008-09-01
The performance of a multiplex assay with 52 autosomal single nucleotide polymorphisms (SNPs) developed for human identification was tested on 124 mother-child-father trios. The typical paternity indices (PIs) were 10(5)-10(6) for the trios and 10(3)-10(4) for the child-father duos. Using the SNP profiles from the randomly selected trios and 700 previously typed individuals, a total of 83,096 comparisons between mother, child and an unrelated man were performed. On average, 9-10 mismatches per comparison were detected. Four mismatches were genetic inconsistencies and 5-6 mismatches were opposite homozygosities. In only two of the 83,096 comparisons did an unrelated man match perfectly to a mother-child duo, and in both cases the PI of the true father was much higher than the PI of the unrelated man. The trios were also typed for 15 short tandem repeats (STRs) and seven variable number of tandem repeats (VNTRs). The typical PIs based on 15 STRs or seven VNTRs were 5-50 times higher than the typical PIs based on 52 SNPs. Six mutations in tandem repeats were detected among the randomly selected trios. In contrast, no mutations were found in the SNP loci. The results showed that the 52 SNP-plex assay is a very useful alternative to currently used methods in relationship testing. The usefulness of SNP markers with low mutation rates in paternity and immigration casework is discussed.
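The two mismatch categories mentioned in the abstract above can be illustrated with a minimal sketch. It assumes biallelic SNP genotypes coded as the number of copies of one allele (0, 1 or 2); the function names and the exact classification rule are hypothetical and are not taken from the paper, whose definitions may differ in detail.

```python
def possible_gametes(genotype):
    """Alleles (0 or 1) that a parent with this genotype can transmit."""
    return {0: {0}, 1: {0, 1}, 2: {1}}[genotype]


def classify_mismatch(mother, child, man):
    """Return the mismatch type at one SNP, or None if the trio is compatible.

    Assumed rule: an opposite homozygosity is a man and child homozygous for
    different alleles; any other Mendelian incompatibility of the trio is
    labelled a genetic inconsistency.
    """
    if abs(child - man) == 2:
        return "opposite_homozygosity"
    compatible = any(
        m + f == child
        for m in possible_gametes(mother)
        for f in possible_gametes(man)
    )
    return None if compatible else "genetic_inconsistency"


def count_mismatches(mothers, children, men):
    """Tally mismatch types across all SNPs of one mother-child-man comparison."""
    counts = {"opposite_homozygosity": 0, "genetic_inconsistency": 0}
    for m, c, f in zip(mothers, children, men):
        kind = classify_mismatch(m, c, f)
        if kind is not None:
            counts[kind] += 1
    return counts
```

Applied to a 52-SNP profile, a true father would be expected to give zero counts in both categories (barring mutation or typing error), whereas the abstract reports 9-10 mismatches on average for unrelated men.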
A whole genome analyses of genetic variants in two Kelantan Malay individuals.
Wan Juhari, Wan Khairunnisa; Md Tamrin, Nur Aida; Mat Daud, Mohd Hanif Ridzuan; Isa, Hatin Wan; Mohd Nasir, Nurfazreen; Maran, Sathiya; Abdul Rajab, Nur Shafawati; Ahmad Amin Noordin, Khairul Bariah; Nik Hassan, Nik Norliza; Tearle, Rick; Razali, Rozaimi; Merican, Amir Feisal; Zilfalil, Bin Alwi
2014-12-01
Sequencing the genomes of two members of the Royal Kelantan Malay family will provide insights into Kelantan Malay whole genome sequences. The two Kelantan Malay genomes were analyzed for the SNP markers associated with thalassemia and Helicobacter pylori infection. Helicobacter pylori infection was reported to have a low prevalence in the north-east compared with the west coast of Peninsular Malaysia, and beta-thalassemia is known to be one of the most common inherited genetic disorders in Malaysia. By combining SNP information from the literature, GWAS studies and NCBI ClinVar, 18 unique SNPs were selected for further analysis. Of these 18 SNPs, 10 came from a previous study of Helicobacter pylori infection among Malay patients, 6 were from NCBI ClinVar and 2 were from GWAS studies. The analysis reveals that both Royal Kelantan Malay genomes shared all 10 SNPs identified by Maran (Single Nucleotide Polymorphims (SNPs) genotypic profiling of Malay patients with and without Helicobacter pylori infection in Kelantan, 2011) and one SNP from a GWAS study. In addition, the analysis reveals that both Royal Kelantan Malay genomes shared 3 SNP markers, HBG1 (rs1061234), HBB (rs1609812) and BCL11A (rs766432), all three of which were associated with beta-thalassemia. Our findings suggest that the Royal Kelantan Malays carry SNPs that are associated with protection against Helicobacter pylori infection. In addition, they carry SNPs that are associated with beta-thalassemia. These findings are in line with those of other researchers who conducted studies on thalassemia and Helicobacter pylori infection in the non-royal Malay population.
Repnik, Katja; Koder, Silvo; Skok, Pavel; Ferkolj, Ivan; Potočnik, Uroš
2016-08-01
Tumor necrosis factor α inhibitors (anti-TNF) have improved treatment of several complex diseases, including Crohn's disease (CD). However, the effect varies and approximately one-third of the patients do not respond. Since blood parameters as well as genetic factors have shown great potential to predict response during treatment, the aim of the study was to evaluate the association of response to anti-TNF treatment with adalimumab (ADA) with the genes HFE and TF and with haematological parameters in Slovenian refractory CD patients. Single nucleotide polymorphisms (SNPs) rs1799852 in gene TF and rs2071303 in gene HFE were genotyped in 68 refractory CD patients for whom response had been measured using the inflammatory bowel disease questionnaire (IBDQ) index. Haematological parameters and IBDQ index were determined before therapy and after 4, 12, 20 and 30 weeks. We found a novel strong association between SNP rs2071303 in gene HFE and response to ADA treatment; in particular, patients with the G allele had a better response after 20 weeks than patients with the A allele (p = 0.008). Further, we found a strong association between transferrin level at baseline and treatment response after 12, 20 and 30 weeks, where the average transferrin level before therapy was lower in responders (2.38 g/L) than in non-responders (2.89 g/L, p = 0.005). An association was found between transferrin level in week 30 and SNP rs1799852 (p = 0.023), and between MCHC level before treatment and SNP rs2071303 (p = 0.007). Our results suggest that the SNP in gene HFE as well as haematological markers serve as promising prognostic markers of response to anti-TNF treatment in CD patients.
Ertiro, Berhanu Tadesse; Semagn, Kassa; Das, Biswanath; Olsen, Michael; Labuschagne, Maryke; Worku, Mosisa; Wegary, Dagne; Azmach, Girum; Ogugo, Veronica; Keno, Tolera; Abebe, Beyene; Chibsa, Temesgen; Menkir, Abebe
2017-10-12
Molecular characterization is important for efficient utilization of germplasm and development of improved varieties. In the present study, we investigated the genetic purity, relatedness and population structure of 265 maize inbred lines from the Ethiopian Institute of Agricultural Research (EIAR), the International Maize and Wheat Improvement Centre (CIMMYT) and the International Institute of Tropical Agriculture (IITA) using 220,878 single nucleotide polymorphic (SNP) markers obtained using genotyping by sequencing (GBS). Only 22% of the inbred lines were considered pure with <5% heterogeneity, while the remaining 78% of the inbred lines had a heterogeneity ranging from 5.1 to 31.5%. Pairwise genetic distances among the 265 inbred lines varied from 0.011 to 0.345, with 89% of the pairs falling between 0.301 and 0.345. Only <1% of the pairs had a genetic distance lower than 0.200, which included 14 pairs of sister lines that were nearly identical. Relative kinship analysis showed that the kinship coefficients for 59% of the pairs of lines was close to zero, which agrees with the genetic distance estimates. Principal coordinate analysis, discriminant analysis of principal components (DAPC) and the model-based population structure analysis consistently suggested the presence of three groups, which generally agreed with pedigree information (genetic background). Although not distinct enough, the SNP markers showed some level of separation between the two CIMMYT heterotic groups A and B established based on pedigree and combining ability information. The high level of heterogeneity detected in most of the inbred lines suggested the requirement for purification or further inbreeding except those deliberately maintained at early inbreeding level. The genetic distance and relative kinship analysis clearly indicated the uniqueness of most of the inbred lines in the maize germplasm available for breeders in the mid-altitude maize breeding program of Ethiopia. 
Results from the present study facilitate the maize breeding work in Ethiopia and germplasm exchange among breeding programs in Africa. We suggest the incorporation of high density molecular marker information in future heterotic group assignments.
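The purity criterion in the abstract above (<5% heterogeneity) can be sketched as follows. This is only an illustration under stated assumptions: heterogeneity is taken here to be the fraction of non-missing SNP calls that are heterozygous, genotypes are two-letter strings such as "AG", and "NN" marks a missing call. The function names, the coding and the missing-data symbol are hypothetical; the study's actual computation on GBS data is not described in the abstract.

```python
def heterogeneity(calls, missing="NN"):
    """Fraction of non-missing two-letter genotype calls that are heterozygous."""
    scored = [c for c in calls if c != missing]
    if not scored:
        raise ValueError("no non-missing genotype calls")
    het = sum(1 for c in scored if c[0] != c[1])
    return het / len(scored)


def is_pure(calls, threshold=0.05):
    """Apply the <5% heterogeneity cut-off quoted in the abstract."""
    return heterogeneity(calls) < threshold
```

For example, a line with one heterozygous call among 100 scored markers would have a heterogeneity of 0.01 and be classed as pure under this rule.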
Cytosolic labile zinc: a marker for apoptosis in the developing rat brain.
Lee, Joo-Yong; Hwang, Jung Jin; Park, Mi-Ha; Koh, Jae-Young
2006-01-01
Cytosolic zinc accumulation was thought to occur specifically in neuronal death (necrosis) following acute injury. However, a recent study demonstrated that zinc accumulation also occurs in adult rat neurons undergoing apoptosis following target ablation, and in vitro experiments have shown that zinc accumulation may play a causal role in various forms of apoptosis. Here, we examined whether intraneuronal zinc accumulation occurs in central neurons undergoing apoptosis during development. Embryonic and newborn Sprague-Dawley rat brains were double-stained for terminal deoxynucleotidyl transferase-mediated dUTP-biotin nick end labelling (TUNEL) detection of apoptosis and immunohistochemical detection of stage-specific neuronal markers, such as nestin, proliferating cell nuclear antigen (PCNA), TuJ1 and neuronal nuclear specific protein (NeuN). The results revealed that apoptotic cell death occurred in neurons of diverse stages (neural stem cells, and dividing, young and adult neurons) throughout the brain during the embryonic and early postnatal periods. Further staining of brain sections with acid fuchsin or zinc-specific fluorescent dyes showed that all of the apoptotic neurons were acidophilic and contained labile zinc in their cell bodies. Cytosolic zinc accumulation was also observed in cultured cortical neurons undergoing staurosporine- or sodium nitroprusside (SNP)-induced apoptosis. In contrast, zinc chelation with CaEDTA or N,N,N',N'-tetrakis(2-pyridylmethyl)ethylenediamine (TPEN) reduced SNP-induced apoptosis but not staurosporine-induced apoptosis, indicating that cytosolic zinc accumulation does not play a causal role in all forms of apoptosis. Finally, the specific cytosolic zinc accumulation may have a practical application as a relatively simple marker for neurons undergoing developmental apoptosis.
Kawakami, Takeshi; Backström, Niclas; Burri, Reto; Husby, Arild; Olason, Pall; Rice, Amber M; Ålund, Murielle; Qvarnström, Anna; Ellegren, Hans
2014-01-01
With access to draft genome sequence assemblies and whole-genome resequencing data from population samples, molecular ecology studies will be able to take truly genome-wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million single-nucleotide polymorphisms (SNPs) in population resequencing of this species and its sister species, pied flycatcher. Here, we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. Using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. Linkage disequilibrium declined quickly to the background level at an average distance of 17 kb, but the extent of LD varied markedly within the genome and was more than 10-fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later-generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from a much smaller number of markers. With an estimated divergence time as recent as <1 Ma, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system. PMID:24784959
Kamphuis, Lars G; Hane, James K; Nelson, Matthew N; Gao, Lingling; Atkins, Craig A; Singh, Karam B
2015-01-01
Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map. PMID:25060816
Genomic analysis of cow mortality and milk production using a threshold-linear model.
Tsuruta, S; Lourenco, D A L; Misztal, I; Lawlor, T J
2017-09-01
The objective of this study was to investigate the feasibility of genomic evaluation for cow mortality and milk production using a single-step methodology. Genomic relationships between cow mortality and milk production were also analyzed. Data included 883,887 (866,700) first-parity, 733,904 (711,211) second-parity, and 516,256 (492,026) third-parity records on cow mortality (305-d milk yields) of Holsteins from Northeast states in the United States. The pedigree consisted of up to 1,690,481 animals including 34,481 bulls genotyped with 36,951 SNP markers. Analyses were conducted with a bivariate threshold-linear model for each parity separately. Genomic information was incorporated as a genomic relationship matrix in the single-step BLUP. Traditional and genomic estimated breeding values (GEBV) were obtained with Gibbs sampling using fixed variances, whereas reliabilities were calculated from variances of GEBV samples. Genomic EBV were then converted into single nucleotide polymorphism (SNP) marker effects. Those SNP effects were categorized according to values corresponding to 1 to 4 standard deviations. Moving averages and variances of SNP effects were calculated for windows of 30 adjacent SNP, and Manhattan plots were created for SNP variances with the same window size. Using Gibbs sampling, the reliability for genotyped bulls for cow mortality was 28 to 30% in EBV and 70 to 72% in GEBV. The reliability for genotyped bulls for 305-d milk yields was 53 to 65% in EBV and 81 to 85% in GEBV. Correlations of SNP effects between mortality and 305-d milk yields within categories were the highest with the largest SNP effects and reached >0.7 at 4 standard deviations. All SNP regions explained less than 0.6% of the genetic variance for both traits, except regions close to the DGAT1 gene, which explained up to 2.5% for cow mortality and 4% for 305-d milk yields. Reliability for GEBV with a moderate number of genotyped animals can be calculated by Gibbs samples. 
Genomic information can greatly increase the reliability of predictions not only for milk but also for mortality. The existence of a common region on Bos taurus autosome 14 affecting both traits may indicate a major gene with a pleiotropic effect on milk and mortality. Copyright © 2017 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
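The windowing step described in the abstract above (moving averages and variances over 30 adjacent SNP) could look roughly like the sketch below. It assumes the SNP effects are already ordered by genomic position and uses the population variance; the function name, the choice of variance estimator and the handling of chromosome boundaries are assumptions, since the abstract does not specify them.

```python
from statistics import fmean, pvariance


def window_stats(effects, window=30):
    """Moving mean and variance of SNP effects over windows of adjacent SNP.

    Returns one (mean, variance) tuple per window start position. The
    population variance is used here as an assumption.
    """
    if window < 1 or window > len(effects):
        raise ValueError("window must be between 1 and the number of SNP")
    stats = []
    for start in range(len(effects) - window + 1):
        chunk = effects[start:start + window]
        stats.append((fmean(chunk), pvariance(chunk)))
    return stats
```

Plotting the variance column against window position gives the kind of Manhattan plot of windowed SNP variances that the abstract mentions; in a real analysis the windows would normally be restricted to SNP on the same chromosome.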
Larmer, S G; Sargolzaei, M; Schenkel, F S
2014-05-01
Genomic selection requires a large reference population to accurately estimate single nucleotide polymorphism (SNP) effects. In some Canadian dairy breeds, the available reference populations are not large enough for accurate estimation of SNP effects for traits of interest. If marker phase is highly consistent across multiple breeds, it is theoretically possible to increase the accuracy of genomic prediction for one or all breeds by pooling several breeds into a common reference population. This study investigated the extent of linkage disequilibrium (LD) in 5 major dairy breeds using a 50,000 (50K) SNP panel and 3 of the same breeds using the 777,000 (777K) SNP panel. Correlation of pair-wise SNP phase was also investigated on both panels. The level of LD was measured using the squared correlation of alleles at 2 loci (r(2)), and the consistency of SNP gametic phases was correlated using the signed square root of these values. Because of the high cost of the 777K panel, the accuracy of imputation from lower density marker panels [6,000 (6K) or 50K] was examined both within breed and using a multi-breed reference population in Holstein, Ayrshire, and Guernsey. Imputation was carried out using FImpute V2.2 and Beagle 3.3.2 software. Imputation accuracies were then calculated as both the proportion of correct SNP filled in (concordance rate) and allelic R(2). Computation time was also explored to determine the efficiency of the different algorithms for imputation. Analysis showed that LD values >0.2 were found in all breeds at distances at or shorter than the average adjacent pair-wise distance between SNP on the 50K panel. Correlations of r-values, however, did not reach high levels (<0.9) at these distances. High correlation values of SNP phase between breeds were observed (>0.94) when the average pair-wise distances using the 777K SNP panel were examined. 
High concordance rate (0.968-0.995) and allelic R(2) (0.946-0.991) were found for all breeds when imputation was carried out with FImpute from 50K to 777K. Imputation accuracy for Guernsey and Ayrshire was slightly lower when using the imputation method in Beagle. Computing time was significantly greater when using Beagle software, with all comparable procedures being 9 to 13 times less efficient, in terms of time, compared with FImpute. These findings suggest that use of a multi-breed reference population might increase prediction accuracy using the 777K SNP panel and that 777K genotypes can be efficiently and effectively imputed using the lower density 50K SNP panel. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
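Two of the quantities used in the abstract above, the signed LD correlation r between two loci (whose square is r(2)) and the imputation concordance rate, can be written down in a few lines. This is a minimal sketch assuming phased haplotypes coded 0/1 at each locus and genotypes coded 0/1/2; the function names are hypothetical, and the software named in the abstract (FImpute, Beagle) is not involved here.

```python
from math import sqrt


def ld_r(hap_a, hap_b):
    """Signed correlation r between alleles at two loci from phased haplotypes.

    r = D / sqrt(pA (1 - pA) pB (1 - pB)), with D = pAB - pA * pB.
    Squaring the result gives the usual LD measure r^2.
    """
    n = len(hap_a)
    p_a = sum(hap_a) / n
    p_b = sum(hap_b) / n
    p_ab = sum(a * b for a, b in zip(hap_a, hap_b)) / n
    denom = sqrt(p_a * (1 - p_a) * p_b * (1 - p_b))
    if denom == 0:
        raise ValueError("at least one locus is monomorphic")
    return (p_ab - p_a * p_b) / denom


def concordance_rate(true_genotypes, imputed_genotypes):
    """Proportion of imputed genotypes that equal the true genotypes."""
    matches = sum(t == i for t, i in zip(true_genotypes, imputed_genotypes))
    return matches / len(true_genotypes)
```

Consistency of gametic phase between two breeds can then be examined by computing `ld_r` for the same SNP pairs in each breed and correlating the two sets of signed values, which is the idea behind the "correlation of r-values" in the abstract.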
Ma, G J; Markell, S G; Song, Q J; Qi, L L
2017-07-01
Genotyping-by-sequencing revealed a new downy mildew resistance gene, Pl 20 , from wild Helianthus argophyllus located on linkage group 8 of the sunflower genome and closely linked to SNP markers that facilitate the marker-assisted selection of resistance genes. Downy mildew (DM), caused by Plasmopara halstedii, is one of the most devastating and yield-limiting diseases of sunflower. Downy mildew resistance identified in wild Helianthus argophyllus accession PI 494578 was determined to be effective against the predominant and virulent races of P. halstedii occurring in the United States. The evaluation of 114 BC 1 F 2:3 families derived from the cross between HA 89 and PI 494578 against P. halstedii race 734 revealed that a single dominant gene controls downy mildew resistance in the population. Genotyping-by-sequencing analysis conducted in the BC 1 F 2 population indicated that the DM resistance gene derived from wild H. argophyllus PI 494578 is located on the upper end of linkage group (LG) 8 of the sunflower genome, as determined from single nucleotide polymorphism (SNP) markers associated with DM resistance. Analysis of 11 additional SNP markers previously mapped to this region revealed that the resistance gene, named Pl 20 , co-segregated with four markers, SFW02745, SFW09076, S8_11272025, and S8_11272046, and is flanked by SFW04358 and S8_100385559 at an interval of 1.8 cM. The newly discovered P. halstedii resistance gene has been introgressed from the wild species into cultivated sunflower to provide a novel gene with DM resistance. The homozygous resistant individuals were selected from BC 2 F 2 progenies with the use of markers linked to the Pl 20 gene, and these lines should benefit the sunflower community for Helianthus improvement.
A comprehensive SNP and indel imputability database.
Duan, Qing; Liu, Eric Yi; Croteau-Chonka, Damien C; Mohlke, Karen L; Li, Yun
2013-02-15
Genotype imputation has become an indispensable step in genome-wide association studies (GWAS). Imputation accuracy, which directly influences downstream analysis, has been shown to improve with re-sequencing-based reference panels; however, this comes at the cost of a high computational burden due to the huge number of potentially imputable markers (tens of millions) discovered through sequencing a large number of individuals. Therefore, there is an increasing need for access to imputation quality information without actually conducting imputation. To facilitate this process, we have established a publicly available SNP and indel imputability database, aiming to provide direct access to imputation accuracy information for markers identified by the 1000 Genomes Project across four major populations and covering multiple GWAS genotyping platforms. SNP and indel imputability information can be retrieved through a user-friendly interface by providing the ID(s) of the desired variant(s) or by specifying the desired genomic region. The query results can be refined by selecting relevant GWAS genotyping platform(s). This is the first database providing variant imputability information specific to each continental group and to each genotyping platform. In Filipino individuals from the Cebu Longitudinal Health and Nutrition Survey, our database can achieve an area under the receiver-operating characteristic curve of 0.97, 0.91, 0.88 and 0.79 for markers with minor allele frequency >5%, 3-5%, 1-3% and 0.5-1%, respectively. Specifically, by filtering out 48.6% of markers (corresponding to a reduction of up to 48.6% in computational costs for actual imputation) based on the imputability information in our database, we can remove 77%, 58%, 51% and 42% of the poorly imputed markers at the cost of only 0.3%, 0.8%, 1.5% and 4.6% of the well-imputed markers with minor allele frequency >5%, 3-5%, 1-3% and 0.5-1%, respectively. http://www.unc.edu/∼yunmli/imputability.html
An imputed genotype resource for the laboratory mouse
Szatkiewicz, Jin P.; Beane, Glen L.; Ding, Yueming; Hutchins, Lucie; de Villena, Fernando Pardo-Manuel; Churchill, Gary A.
2009-01-01
We have created a high-density SNP resource encompassing 7.87 million polymorphic loci across 49 inbred mouse strains of the laboratory mouse by combining data available from public databases and training a hidden Markov model to impute missing genotypes in the combined data. The strong linkage disequilibrium found in dense sets of SNP markers in the laboratory mouse provides the basis for accurate imputation. Using genotypes from eight independent SNP resources, we empirically validated the quality of the imputed genotypes and demonstrate that they are highly reliable for most inbred strains. The imputed SNP resource will be useful for studies of natural variation and complex traits. It will facilitate association study designs by providing high density SNP genotypes for large numbers of mouse strains. We anticipate that this resource will continue to evolve as new genotype data become available for laboratory mouse strains. The data are available for bulk download or query at http://cgd.jax.org/. PMID:18301946
Miyagawa, Taku; Honda, Makoto; Kawashima, Minae; Shimada, Mihoko; Tanaka, Susumu; Honda, Yutaka; Tokunaga, Katsushi
2009-01-01
Background: SNP rs5770917 located between CPT1B and CHKB, and HLA-DRB1*1501-DQB1*0602 haplotype were previously identified as susceptibility loci for narcolepsy with cataplexy. This study was conducted in order to investigate whether these genetic markers are associated with Japanese CNS hypersomnias (essential hypersomnia: EHS) other than narcolepsy with cataplexy. Principal Findings: EHS was significantly associated with SNP rs5770917 (P allele = 3.6×10(-3); OR = 1.56; 95% c.i.: 1.12–2.15) and HLA-DRB1*1501-DQB1*0602 haplotype (P positivity = 9.2×10(-11); OR = 3.97; 95% c.i.: 2.55–6.19). No interaction between the two markers (SNP rs5770917 and HLA-DRB1*1501-DQB1*0602 haplotype) was observed in EHS. Conclusion: CPT1B, CHKB and HLA are candidates for susceptibility to CNS hypersomnias (EHS), as well as narcolepsy with cataplexy. PMID:19404393
Huynh, Bao-Lam; Ehlers, Jeffrey D; Ndeve, Arsenio; Wanamaker, Steve; Lucas, Mitchell R; Close, Timothy J; Roberts, Philip A
The cowpea aphid Aphis craccivora Koch (CPA) is a destructive insect pest of cowpea, a staple legume crop in Sub-Saharan Africa and other semiarid warm tropics and subtropics. In California, CPA causes damage on all local cultivars from early vegetative to pod development growth stages. Sources of CPA resistance are available in African cowpea germplasm. However, their utilization in breeding is limited by the lack of information on inheritance, genomic location and marker linkage associations of the resistance determinants. In the research reported here, a recombinant inbred line (RIL) population derived from a cross between a susceptible California blackeye cultivar (CB27) and a resistant African breeding line (IT97K-556-6) was genotyped with 1,536 SNP markers. The RILs and parents were phenotyped for CPA resistance using field-based screenings during two main crop seasons in a 'hotspot' location for this pest within the primary growing region of the Central Valley of California. One minor and one major quantitative trait locus (QTL) were consistently mapped on linkage groups 1 and 7, respectively, both with favorable alleles contributed from IT97K-556-6. The major QTL appeared dominant based on a validation test in a related F2 population. SNP markers flanking each QTL were positioned in physical contigs carrying genes involved in plant defense based on synteny with related legumes. These markers could be used to introgress resistance alleles from IT97K-556-6 into susceptible local blackeye varieties by backcrossing.
Birth Characteristics and Childhood Leukemia Risk: Correlations With Genetic Markers.
Kennedy, Amy E; Kamdar, Kala Y; Lupo, Philip J; Okcu, Mehmet F; Scheurer, Michael E; Dorak, Mehmet T
2015-07-01
Birth characteristics such as birth order, birth weight, birth defects, and Down syndrome showed some of the first risk associations with childhood leukemia. Examinations of correlations between birth characteristics and leukemia risk markers have been limited to birth weight-related genetic polymorphisms. We integrated information on nongenetic and genetic markers by evaluating the relationship of birth characteristics, genetic markers for childhood acute lymphoblastic leukemia (ALL) susceptibility, and ALL risk together. The multiethnic study consisted of cases with childhood ALL (n=161) and healthy controls (n=261). Birth characteristic data were collected through questionnaires, and genotyping was achieved by TaqMan SNP Genotyping Assays. We observed risk associations for birth weight over 4000 g (odds ratios [OR]=1.93; 95% confidence interval [CI], 1.16-3.19), birth length (OR=1.18 per inch; 95% CI, 1.01-1.38), and with gestational age (OR=1.10 per week; 95% CI, 1.00-1.21). Only the HFE tag single-nucleotide polymorphism (SNP) rs9366637 showed an inverse correlation with a birth characteristic, gestational age, with a gene-dosage effect (P=0.005), and in interaction with a transferrin receptor rs3817672 genotype (Pinteraction=0.05). This correlation translated into a strong association for rs9366637 with preterm birth (OR=5.0; 95% CI, 1.19-20.9). Our study provides evidence for the involvement of prenatal events in the development of childhood ALL. The inverse correlation of rs9366637 with gestational age has implications on the design of HFE association studies in birth weight and childhood conditions using full-term newborns as controls.
USDA-ARS's Scientific Manuscript database
Sunflower rust, which is incited by the fungus Puccinia helianthi Schwein., is the most common disease in Australia, Argentina, South Africa, and North America. Three independent genes, R5, R4, and R13 with two alleles R13a and R13b, were discovered in sunflower and are promising sources of resistan...
Performance of different SNP panels for parentage testing in two East Asian cattle breeds.
Strucken, E M; Gudex, B; Ferdosi, M H; Lee, H K; Song, K D; Gibson, J P; Kelly, M; Piper, E K; Porto-Neto, L R; Lee, S H; Gondro, C
2014-08-01
The International Society for Animal Genetics (ISAG) proposed a panel of single nucleotide polymorphisms (SNPs) for parentage testing in cattle (a core panel of 100 SNPs and an additional list of 100 SNPs). However, markers specific to East Asian taurine cattle breeds were not included, and no information is available as to whether the ISAG panel performs adequately for these breeds. We tested ISAG's core (100 SNP) and full (200 SNP) panels on two East Asian taurine breeds: the Korean Hanwoo and the Japanese Wagyu, the latter from the Australian herd. Even though the power of exclusion was high at 0.99 for both ISAG panels, the core panel performed poorly with 3.01% false-positive assignments in the Hanwoo population and 3.57% in the Wagyu. The full ISAG panel identified all sire-offspring relations correctly in both populations with 0.02% of relations wrongly excluded in the Hanwoo population. Based on these results, we created and tested two population-specific marker panels: one for the Wagyu population, which showed no false-positive assignments with either 100 or 200 SNPs, and a second panel for the Hanwoo, which still had some false-positive assignments with 100 SNPs but no false positives using 200 SNPs. In conclusion, for parentage assignment in East Asian cattle breeds, only the full ISAG panel is adequate for parentage testing. If fewer markers should be used, it is advisable to use population-specific markers rather than the ISAG panel. © 2014 Stichting International Foundation for Animal Genetics.
Welderufael, B. G.; Løvendahl, Peter; de Koning, Dirk-Jan; Janss, Lucas L. G.; Fikse, W. F.
2018-01-01
Because mastitis is very frequent and unavoidable, adding recovery information to the genetic evaluation of mastitis is of great interest from an economic and animal welfare point of view. Here we performed genome-wide association studies (GWAS) to identify associated single nucleotide polymorphisms (SNPs) and to investigate the genetic background not only of susceptibility to, but also of recoverability from, mastitis. Somatic cell count records from 993 Danish Holstein cows genotyped for a total of 39378 autosomal SNP markers were used for the association analysis. Single SNP regression analysis was performed using the statistical software package DMU. The substitution effect of each SNP was tested with a t-test, and a genome-wide significance level of P-value < 10^-4 was used to declare a significant SNP-trait association. A number of significant SNP variants were identified for both traits. Many of the SNP variants associated with either susceptibility to or recoverability from mastitis were located in or very near genes that have been reported for their role in the immune system. Genes involved in lymphocyte development (e.g., MAST3 and STAB2) and genes involved in macrophage recruitment and regulation of inflammation (PDGFD and PTX3) were suggested as possible causal genes for susceptibility to and recoverability from mastitis, respectively. However, this is the first GWAS for recoverability from mastitis and our results need to be validated. The findings of the current study are, therefore, a starting point for further investigations to identify causal genetic variants or chromosomal regions for both susceptibility to and recoverability from mastitis. PMID:29755506
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.
Thyssen, Gregory N; Fang, David D; Turley, Rickie B; Florane, Christopher; Li, Ping; Naoumkina, Marina
2015-09-01
Mapping-by-sequencing and SNP marker analysis were used to fine map the Ligon-lintless-1 (Li1) short fiber mutation in tetraploid cotton to a 255-kb region that contains 16 annotated proteins. The Ligon-lintless-1 (Li1) mutant of cotton (Gossypium hirsutum L.) has been studied as a model for cotton fiber development since its identification in 1929; however, the causative mutation has not been identified yet. Here we report the fine genetic mapping of the mutation to a 255-kb region that contains only 16 annotated genes in the reference Gossypium raimondii genome. We took advantage of the incompletely dominant dwarf vegetative phenotype to identify 100 mutants (Li1/Li1) and 100 wild-type (li1/li1) homozygotes from a mapping population of 2567 F2 plants, which we bulked and deep sequenced. Since only homozygotes were sequenced, we were able to use a high stringency in SNP calling to rapidly narrow down the region harboring the Li1 locus, and designed subgenome-specific SNP markers to test the population. We characterized the expression of all sixteen genes in the region by RNA sequencing of elongating fibers and by RT-qPCR at seven time points spanning fiber development. One of the most highly expressed genes found in this interval in wild-type fiber cells is 40-fold under-expressed at the day of anthesis (DOA) in the mutant fiber cells. This gene is a major facilitator superfamily protein, part of the large family of proteins that includes auxin and sugar transporters. Interestingly, nearly all genes in this region were most highly expressed at DOA and showed a high degree of co-expression. Further characterization is required to determine if transport of hormones or carbohydrates is involved in both the dwarf and lintless phenotypes of Li1 plants.
Birdsell, Dawn N.; Pearson, Talima; Price, Erin P.; Hornstra, Heidie M.; Nera, Roxanne D.; Stone, Nathan; Gruendike, Jeffrey; Kaufman, Emily L.; Pettus, Amanda H.; Hurbon, Audriana N.; Buchhagen, Jordan L.; Harms, N. Jane; Chanturia, Gvantsa; Gyuranecz, Miklos; Wagner, David M.; Keim, Paul S.
2012-01-01
Single nucleotide polymorphisms (SNPs) are abundant in genomes of all species and biologically informative markers extensively used across broad scientific disciplines. Newly identified SNP markers are publicly available at an ever-increasing rate due to advancements in sequencing technologies. Efficient, cost-effective SNP genotyping methods to screen sample populations are in great demand in well-equipped laboratories, but also in developing world situations. Dual Probe TaqMan assays are robust but can be cost-prohibitive and require specialized equipment. The Mismatch Amplification Mutation Assay, coupled with melt analysis (Melt-MAMA), is flexible, efficient and cost-effective. However, Melt-MAMA traditionally suffers from high rates of assay design failures and knowledge gaps on assay robustness and sensitivity. In this study, we identified strategies that improved the success of Melt-MAMA. We examined the performance of 185 Melt-MAMAs across eight different pathogens using various optimization parameters. We evaluated the effects of genome size and %GC content on assay development. When used collectively, specific strategies markedly improved the rate of successful assays at the first design attempt from ∼50% to ∼80%. We observed that Melt-MAMA accurately genotypes across a broad DNA range (∼100 ng to ∼0.1 pg). Genomic size and %GC content influence the rate of successful assay design in an independent manner. Finally, we demonstrated the versatility of these assays by the creation of a duplex Melt-MAMA real-time PCR (two SNPs) and conversion to a size-based genotyping system, which uses agarose gel electrophoresis. Melt-MAMA is comparable to Dual Probe TaqMan assays in terms of design success rate and accuracy. Although sensitivity is less robust than Dual Probe TaqMan assays, Melt-MAMA is superior in terms of cost-effectiveness, speed of development and versatility. 
We detail the parameters most important for the successful application of Melt-MAMA, which should prove useful to the wider scientific community. PMID:22438886
Association mapping of seed and disease resistance traits in Theobroma cacao L.
Motilal, Lambert A; Zhang, Dapeng; Mischke, Sue; Meinhardt, Lyndel W; Boccara, Michel; Fouet, Olivier; Lanaud, Claire; Umaharan, Pathmanathan
2016-12-01
Microsatellite and single nucleotide polymorphism markers that could be used in marker assisted breeding of cacao were identified for number of filled seeds, black pod resistance and witches' broom disease resistance. An association mapping approach was employed to identify markers for seed number and resistance to black pod and witches' broom disease (WBD) in cacao (Theobroma cacao L.). Ninety-five microsatellites (SSRs) and 775 single nucleotide polymorphisms (SNPs) were assessed on 483 unique trees in the International Cocoa Genebank Trinidad (ICGT). Linkage disequilibrium (LD) and association mapping studies were conducted to identify markers to tag the phenotypic traits. Decay of LD occurred over an average 9.3 cM for chromosomes 1-9 and 2.5 cM for chromosome 10. Marker/trait associations were generally identified based on general linear models (GLMs) that incorporated principal components from molecular information on relatedness factor. Seven markers (mTcCIR 8, 66, 126, 212; TcSNP368, 697, 1370) on chromosomes 1 and 9 were identified for number of filled seeds (NSEED). A single marker was found for black pod resistance (mTcCIR280) on chromosome 3, whereas six markers on chromosomes 4, 5, 6, 8, and 10 were detected for WBD (mTcCIR91, 183; TcSNP375, 720, 1230 and 1374). It is expected that this association mapping study in cacao would contribute to the knowledge of the genetic determinism of cocoa traits and that the markers identified herein would prove useful in marker assisted breeding of cacao.
Maroso, Francesco; Franch, Rafaella; Dalla Rovere, Giulia; Arculeo, Marco; Bargelloni, Luca
2016-08-01
Dolphinfish is an important fish species for both commercial and sport fishing, but so far limited information is available on the genetic variability and pattern of differentiation of dolphinfish populations in the Mediterranean basin. Recently developed techniques allow genome-wide identification of genetic markers for better understanding of population structure in species with limited genome information. Using restriction-site associated DNA analysis we successfully genotyped 140 individuals of dolphinfish from eight locations in the Mediterranean Sea at 3324 SNP loci. We identified 311 sex-related loci that were used to assess sex ratio in dolphinfish populations. In addition, we identified a weak signature of genetic differentiation of the population closer to the Strait of Gibraltar in comparison to other Mediterranean populations, which might be related to introgression of individuals from the Atlantic. No further genetic differentiation could be detected in the other populations sampled, as expected considering the known high mobility of the species. The results obtained improve our knowledge of the species and can help in managing dolphinfish stocks in the future. Copyright © 2016 Elsevier B.V. All rights reserved.
Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar
2016-01-01
Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango. PMID:27736892
Nunes, José de Ribamar da Silva; Liu, Shikai; Pértille, Fábio; Perazza, Caio Augusto; Villela, Priscilla Marqui Schmidt; de Almeida-Val, Vera Maria Fonseca; Hilsdorf, Alexandre Wagner Silva; Liu, Zhanjiang; Coutinho, Luiz Lehmann
2017-01-01
Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping by sequencing method. In all, 68,584 SNPs were initially identified using minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups which allowed for functional SNP annotations. The large-scale SNP discovery obtained here, allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs. PMID:28387238
2014-01-01
Background: Recent advancements in next-generation sequencing technology have enabled cost-effective sequencing of whole or partial genomes, permitting the discovery and characterization of molecular polymorphisms. Double-digest restriction-site associated DNA sequencing (ddRAD-seq) is a powerful and inexpensive approach to developing numerous single nucleotide polymorphism (SNP) markers and constructing a high-density genetic map. To enrich genomic resources for Japanese eel (Anguilla japonica), we constructed a ddRAD-based genetic map using an Ion Torrent Personal Genome Machine and anchored scaffolds of the current genome assembly to 19 linkage groups of the Japanese eel. Furthermore, we compared the Japanese eel genome with genomes of model fishes to infer the history of genome evolution after the teleost-specific genome duplication. Results: We generated the ddRAD-based linkage map of the Japanese eel, where the maps for female and male spanned 1748.8 cM and 1294.5 cM, respectively, and were arranged into 19 linkage groups. A total of 2,672 SNP markers and 115 Simple Sequence Repeat markers provide anchor points to 1,252 scaffolds covering 151 Mb (13%) of the current genome assembly of the Japanese eel. Comparisons among the Japanese eel, medaka, zebrafish and spotted gar genomes showed highly conserved synteny among teleosts and revealed part of the eight major chromosomal rearrangement events that occurred soon after the teleost-specific genome duplication. Conclusions: The ddRAD-seq approach combined with the Ion Torrent Personal Genome Machine sequencing allowed us to conduct efficient and flexible SNP genotyping. The integration of the genetic map and the assembled sequence provides a valuable resource for fine mapping and positional cloning of quantitative trait loci associated with economically important traits and for investigating comparative genomics of the Japanese eel. PMID:24669946
Kai, Wataru; Nomura, Kazuharu; Fujiwara, Atushi; Nakamura, Yoji; Yasuike, Motoshige; Ojima, Nobuhiko; Masaoka, Tetsuji; Ozaki, Akiyuki; Kazeto, Yukinori; Gen, Koichiro; Nagao, Jiro; Tanaka, Hideki; Kobayashi, Takanori; Ototake, Mitsuru
2014-03-26
Recent advancements in next-generation sequencing technology have enabled cost-effective sequencing of whole or partial genomes, permitting the discovery and characterization of molecular polymorphisms. Double-digest restriction-site associated DNA sequencing (ddRAD-seq) is a powerful and inexpensive approach to developing numerous single nucleotide polymorphism (SNP) markers and constructing a high-density genetic map. To enrich genomic resources for Japanese eel (Anguilla japonica), we constructed a ddRAD-based genetic map using an Ion Torrent Personal Genome Machine and anchored scaffolds of the current genome assembly to 19 linkage groups of the Japanese eel. Furthermore, we compared the Japanese eel genome with genomes of model fishes to infer the history of genome evolution after the teleost-specific genome duplication. We generated the ddRAD-based linkage map of the Japanese eel, where the maps for female and male spanned 1748.8 cM and 1294.5 cM, respectively, and were arranged into 19 linkage groups. A total of 2,672 SNP markers and 115 Simple Sequence Repeat markers provide anchor points to 1,252 scaffolds covering 151 Mb (13%) of the current genome assembly of the Japanese eel. Comparisons among the Japanese eel, medaka, zebrafish and spotted gar genomes showed highly conserved synteny among teleosts and revealed part of the eight major chromosomal rearrangement events that occurred soon after the teleost-specific genome duplication. The ddRAD-seq approach combined with the Ion Torrent Personal Genome Machine sequencing allowed us to conduct efficient and flexible SNP genotyping. The integration of the genetic map and the assembled sequence provides a valuable resource for fine mapping and positional cloning of quantitative trait loci associated with economically important traits and for investigating comparative genomics of the Japanese eel.
Zhu, Yufeng; Yin, Yanfei; Yang, Keqiang; Li, Jihong; Sang, Yalin; Huang, Long; Fan, Shu
2015-08-18
Walnut (Juglans regia, 2n = 32, approximately 606 Mb per 1C genome) is an economically important tree crop. Resistance to anthracnose, caused by Colletotrichum gloeosporioides, is a major objective of walnut genetic improvement in China. The recently developed specific length amplified fragment sequencing (SLAF-seq) is an efficient strategy that can obtain large numbers of markers with sufficient sequence information to construct high-density genetic maps and permits detection of quantitative trait loci (QTLs) for molecular breeding. SLAF-seq generated 161.64 M paired-end reads. 153,820 SLAF markers were obtained, of which 49,174 were polymorphic. 13,635 polymorphic markers were sorted into five segregation types and 2,577 markers of them were used to construct genetic linkage maps: 2,395 of these fell into 16 linkage groups (LGs) for the female map, 448 markers for the male map, and 2,577 markers for the integrated map. Taking into account the size of all LGs, the marker coverage was 2,664.36 cM for the female map, 1,305.58 cM for the male map, and 2,457.82 cM for the integrated map. The average intervals between two adjacent mapped markers were 1.11 cM, 2.91 cM and 0.95 cM for three maps, respectively. 'SNP_only' markers accounted for 89.25% of the markers on the integrated map. Mapping markers contained 5,043 single nucleotide polymorphisms (SNPs) loci, which corresponded to two SNP loci per SLAF marker. According to the integrated map, we used interval mapping (Logarithm of odds, LOD > 3.0) to detect our quantitative trait. One QTL was detected for anthracnose resistance. The interval of this QTL ranged from 165.51 cM to 176.33 cM on LG14, and ten markers in this interval that were above the threshold value were considered to be linked markers to the anthracnose resistance trait. The phenotypic variance explained by each marker ranged from 16.2 to 19.9%, and their LOD scores varied from 3.22 to 4.04. 
High-density genetic maps for walnut containing 16 LGs were constructed using the SLAF-seq method with an F1 population. One QTL for walnut anthracnose resistance was identified based on the map. The results will aid molecular marker-assisted breeding and walnut resistance genes identification.
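The average marker intervals quoted in this abstract can be checked with simple arithmetic. The sketch below assumes the reported values correspond to total map length divided by the number of mapped markers, which is an inference from the numbers and not something the abstract states; the variable and function names are illustrative.

```python
# Map length (cM) and number of mapped markers, as given in the abstract.
maps = {
    "female": (2664.36, 2395),
    "male": (1305.58, 448),
    "integrated": (2457.82, 2577),
}

def average_interval(length_cm: float, n_markers: int) -> float:
    """Average spacing, assumed here to be map length / marker count."""
    return length_cm / n_markers

intervals = {
    name: round(average_interval(length, n), 2)
    for name, (length, n) in maps.items()
}

for name, value in intervals.items():
    print(f"{name}: {value} cM")
```

Under that assumption the three results round to the 1.11 cM, 2.91 cM and 0.95 cM given in the abstract.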
Single-feature polymorphism discovery in the barley transcriptome
Rostoks, Nils; Borevitz, Justin O; Hedley, Peter E; Russell, Joanne; Mudie, Sharon; Morris, Jenny; Cardle, Linda; Marshall, David F; Waugh, Robbie
2005-01-01
A probe-level model for analysis of GeneChip gene-expression data is presented which identified more than 10,000 single-feature polymorphisms (SFP) between two barley genotypes. The method has good sensitivity, as 67% of known single-nucleotide polymorphisms (SNP) were called as SFPs. This method is applicable to all oligonucleotide microarray data, accounts for SNP effects in gene-expression data and represents an efficient and versatile approach for highly parallel marker identification in large genomes. PMID:15960806
Patterns of diversity and recombination along chromosome 1 of maize (Zea mays ssp. mays L.).
Tenaillon, Maud I; Sawkins, Mark C; Anderson, Lorinda K; Stack, Stephen M; Doebley, John; Gaut, Brandon S
2002-01-01
We investigate the interplay between genetic diversity and recombination in maize (Zea mays ssp. mays). Genetic diversity was measured in three types of markers: single-nucleotide polymorphisms, indels, and microsatellites. All three were examined in a sample of previously published DNA sequences from 21 loci on maize chromosome 1. Small indels (1-5 bp) were numerous and far more common than large indels. Furthermore, large indels (>100 bp) were infrequent in the population sample, suggesting they are slightly deleterious. The 21 loci also contained 47 microsatellites, of which 33 were polymorphic. Diversity in SNPs, indels, and microsatellites was compared to two measures of recombination: C (=4Nc) estimated from DNA sequence data and R based on a quantitative recombination nodule map of maize synaptonemal complex 1. SNP diversity was correlated with C (r = 0.65; P = 0.007) but not with R (r = -0.10; P = 0.69). Given the lack of correlation between R and SNP diversity, the correlation between SNP diversity and C may be driven by demography. In contrast to SNP diversity, microsatellite diversity was correlated with R (r = 0.45; P = 0.004) but not C (r = -0.025; P = 0.55). The correlation could arise if recombination is mutagenic for microsatellites, or it may be consistent with background selection that is apparent only in this class of rapidly evolving markers. PMID:12454083
Genomics of pear and other Rosaceae fruit trees
Yamamoto, Toshiya; Terakami, Shingo
2016-01-01
The family Rosaceae includes many economically important fruit trees, such as pear, apple, peach, cherry, quince, apricot, plum, raspberry, and loquat. Over the past few years, whole-genome sequences have been released for Chinese pear, European pear, apple, peach, Japanese apricot, and strawberry. These sequences help us to conduct functional and comparative genomics studies and to develop new cultivars with desirable traits by marker-assisted selection in breeding programs. These genomics resources also allow identification of evolutionary relationships in Rosaceae, development of genome-wide SNP and SSR markers, and construction of reference genetic linkage maps, which are available through the Genome Database for the Rosaceae website. Here, we review the recent advances in genomics studies and their practical applications for Rosaceae fruit trees, particularly pear, apple, peach, and cherry. PMID:27069399
USDA-ARS's Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are the most common genetic markers in Theobroma cacao, occurring approximately once in every 200 nucleotides. SNPs, like microsatellites, are co-dominant and PCR-based, but they have several advantages over microsatellites. They are unambiguous, so that a SN...
USDA-ARS's Scientific Manuscript database
Marker-assisted breeding is now routinely used in major crops to facilitate more efficient cultivar improvement. This has been significantly enabled by the use of next-generation sequencing technology to identify loci and markers associated with traits of interest. While rich in a variety of nutriti...
European Population Substructure: Clustering of Northern and Southern Populations
Seldin, Michael F; Shigeta, Russell; Villoslada, Pablo; Selmi, Carlo; Tuomilehto, Jaakko; Silva, Gabriel; Belmont, John W; Klareskog, Lars; Gregersen, Peter K
2006-01-01
Using a genome-wide single nucleotide polymorphism (SNP) panel, we observed population structure in a diverse group of Europeans and European Americans. Under a variety of conditions and tests, there is a consistent and reproducible distinction between “northern” and “southern” European population groups: most individual participants with southern European ancestry (Italian, Spanish, Portuguese, and Greek) have >85% membership in the “southern” population; and most northern, western, eastern, and central Europeans have >90% in the “northern” population group. Ashkenazi Jewish as well as Sephardic Jewish origin also showed >85% membership in the “southern” population, consistent with a later Mediterranean origin of these ethnic groups. Based on this work, we have developed a core set of informative SNP markers that can control for this partition in European population structure in a variety of clinical and genetic studies. PMID:17044734
Ben Ayed, Rayda; Ben Hassen, Hanen; Ennouri, Karim; Rebai, Ahmed
2016-12-01
The genetic diversity of 22 olive tree cultivars (Olea europaea L.) sampled from different Mediterranean countries was assessed using 5 SNP markers (FAD2.1; FAD2.3; CALC; SOD and ANTHO3) located in four different genes. The genotyping analysis of the 22 cultivars with 5 SNP loci revealed 11 alleles (average 2.2 per locus). The dendrogram based on cultivar genotypes revealed three clusters consistent with the cultivar classification. In addition, the results obtained with the five SNPs were compared to those obtained with the SSR markers using bioinformatic analyses and by computing a cophenetic correlation coefficient, indicating the usefulness of the UPGMA method for clustering plant genotypes. In a principal coordinate analysis based on a similarity matrix, the first two coordinates explained 54.94% of the total variance. This work provides a more comprehensive explanation of the diversity available in Tunisian olive cultivars, and an important contribution to olive breeding and olive oil authenticity.
Chono, Makiko; Matsunaka, Hitoshi; Seki, Masako; Fujita, Masaya; Kiribuchi-Otobe, Chikako; Oda, Shunsuke; Kojima, Hisayo; Nakamura, Shingo
2015-03-01
In the wheat (Triticum aestivum L.) cultivar 'Zenkoujikomugi', a single nucleotide polymorphism (SNP) in the promoter of MOTHER OF FT AND TFL1 on chromosome 3A (MFT-3A) causes an increase in the level of gene expression, resulting in strong grain dormancy. We used a DNA marker to detect the 'Zenkoujikomugi'-type (Zen-type) SNP and examined the genotype of MFT-3A in Japanese wheat varieties, and we found that 169 of 324 varieties carry the Zen-type SNP. In Japanese commercial varieties, the frequency of the Zen-type SNP was remarkably high in the southern part of Japan, but low in the northern part. To examine the relationship between MFT-3A genotype and grain dormancy, we performed a germination assay in three wheat-growing seasons. On average, the varieties carrying the Zen-type SNP showed stronger grain dormancy than the varieties carrying the non-Zen-type SNP. Among commercial cultivars, 'Iwainodaichi' (Kyushu), 'Junreikomugi' (Kinki-Chugoku-Shikoku), 'Kinuhime' (Kanto-Tokai), 'Nebarigoshi' (Tohoku-Hokuriku), and 'Kitamoe' (Hokkaido) showed the strongest grain dormancy in each geographical group, and all these varieties, except for 'Kitamoe', were found to carry the Zen-type SNP. In recent years, the number of varieties carrying the Zen-type SNP has increased in the Tohoku-Hokuriku region, but not in the Hokkaido region.
Wu, Mian; Wu, Wen-Ping; Liu, Cheng-Chen; Liu, Ying-Na; Wu, Xiao-Yi; Ma, Fang-Fang; Zhu, An-Qi; Yang, Jia-Yin; Wang, Bin; Chen, Jian-Qun
2018-06-16
In the soybean cultivar Suweon 97, the BCMV-resistance gene was fine-mapped to a 58.1-kb region co-localizing with the Soybean mosaic virus (SMV)-resistance gene Rsv1-h, raising the possibility that the same gene is utilized against both viral pathogens. Certain soybean cultivars exhibit resistance against soybean mosaic virus (SMV) or bean common mosaic virus (BCMV). Although several SMV-resistance loci have been reported, the understanding of the mechanism underlying BCMV resistance in soybean is limited. Here, by crossing a resistant cultivar Suweon 97 with a susceptible cultivar Williams 82 and inoculating 220 F2 individuals with a BCMV strain (HZZB011), we observed a 3:1 (resistant/susceptible) segregation ratio, suggesting that Suweon 97 possesses a single dominant resistance gene against BCMV. By performing bulked segregant analysis with 186 polymorphic simple sequence repeat (SSR) markers across the genome, the resistance gene was determined to be linked with marker BARSOYSSR_13_1109. Examining the genotypes of nearby SSR markers on all 220 F2 individuals then narrowed down the gene to the interval between markers BARSOYSSR_13_1109 and BARSOYSSR_13_1122. Furthermore, 14 previously established F2:3 lines showing crossovers between the two markers were assayed for their phenotypes upon BCMV inoculation. By developing six more SNP (single nucleotide polymorphism) markers, the resistance gene was finally delimited to a 58.1-kb interval flanked by BARSOYSSR_13_1114 and SNP-49. Five genes were annotated in this interval of the Williams 82 genome, including a characteristic coiled-coil nucleotide-binding site-leucine-rich repeat (CC-NBS-LRR, CNL)-type resistance gene, Glyma13g184800. Coincidentally, the SMV-resistance allele Rsv1-h was previously mapped to almost the same region, thereby suggesting that soybean Suweon 97 likely relies on the same CNL-type R gene to resist both viral pathogens.
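A 3:1 segregation ratio of this kind is usually checked with a chi-square goodness-of-fit test. The abstract reports only the total of 220 F2 individuals, not the resistant and susceptible counts, so the counts in this Python sketch are hypothetical and chosen purely to illustrate the calculation.

```python
# Critical value of the chi-square distribution, 1 degree of freedom, alpha = 0.05.
CHI2_CRITICAL_DF1_005 = 3.841

def chi_square_3_to_1(resistant: int, susceptible: int) -> float:
    """Goodness-of-fit statistic against a 3:1 resistant:susceptible ratio."""
    total = resistant + susceptible
    expected_r = 0.75 * total
    expected_s = 0.25 * total
    return ((resistant - expected_r) ** 2 / expected_r
            + (susceptible - expected_s) ** 2 / expected_s)

# Hypothetical counts summing to the 220 F2 plants mentioned in the abstract.
resistant, susceptible = 170, 50

chi2 = chi_square_3_to_1(resistant, susceptible)
fits_3_to_1 = chi2 < CHI2_CRITICAL_DF1_005

print(f"chi-square = {chi2:.4f}, consistent with 3:1: {fits_3_to_1}")
```

A statistic below the critical value means the 3:1 hypothesis, and hence a single dominant gene model, is not rejected for these illustrative counts.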
Interest in genomic SNP testing for prostate cancer risk: a pilot survey.
Hall, Michael J; Ruth, Karen J; Chen, David Yt; Gross, Laura M; Giri, Veda N
2015-01-01
Advancements in genomic testing have led to the identification of single nucleotide polymorphisms (SNPs) associated with prostate cancer. The clinical utility of SNP tests to evaluate prostate cancer risk is unclear. Studies have not examined predictors of interest in novel genomic SNP tests for prostate cancer risk in a diverse population. Consecutive participants in the Fox Chase Prostate Cancer Risk Assessment Program (PRAP) (n = 40) and unselected men from surgical urology clinics (n = 40) completed a one-time survey. Items examined interest in genomic SNP testing for prostate cancer risk, knowledge, impact of unsolicited findings, and psychosocial factors including health literacy. Knowledge of genomic SNP tests was low in both groups, but interest was higher among PRAP men (p < 0.001). The prospect of receiving unsolicited results about ancestral genomic markers increased interest in testing in both groups. Multivariable modeling identified several predictors of higher interest in a genomic SNP test including higher perceived risk (p = 0.025), indicating zero reasons for not wanting testing (vs ≥1 reason) (p = 0.013), and higher health literacy (p = 0.016). Knowledge of genomic SNP testing was low in this sample, but higher among high-risk men. High-risk status may increase interest in novel genomic tests, while low literacy may lessen interest.
KinSNP software for homozygosity mapping of disease genes using SNP microarrays
2010-01-01
Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from http://bioinfo.bgu.ac.il/bsu/software/kinSNP. PMID:20846928
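The block search described in this abstract can be illustrated with a minimal Python sketch. It is not KinSNP's code: the genotype encoding ('AA', 'AB', 'BB'), the function names, and the greedy scan are assumptions made for illustration only. A SNP counts as consistent when every affected individual carries the same homozygous call, and a block may contain up to `max_errors` inconsistent SNPs.

```python
def consistent_flags(genotypes):
    """Flag each SNP where all patients share the same homozygous call.

    genotypes: dict mapping patient id -> list of calls ('AA', 'AB', 'BB')
    in chromosome order (hypothetical encoding, not KinSNP's input format).
    """
    patients = list(genotypes.values())
    flags = []
    for i in range(len(patients[0])):
        calls = {p[i] for p in patients}
        flags.append(len(calls) == 1 and next(iter(calls)) in ("AA", "BB"))
    return flags


def homozygous_blocks(flags, min_snps=3, max_errors=0):
    """Greedy scan for blocks of consistent SNPs, tolerating a few errors.

    Returns (start, end) index pairs, inclusive. A block starts and ends on
    a consistent SNP; its length counts any tolerated errors inside it.
    """
    blocks = []
    i, n = 0, len(flags)
    while i < n:
        if not flags[i]:
            i += 1
            continue
        start = end = i
        errors = 0
        j = i + 1
        while j < n:
            if flags[j]:
                end = j
            else:
                if errors == max_errors:
                    break
                errors += 1
            j += 1
        if end - start + 1 >= min_snps:
            blocks.append((start, end))
        i = end + 1
    return blocks


# Toy data for three affected relatives (invented for the example).
family = {
    "p1": ["AA", "AA", "AB", "BB", "BB", "BB", "AA"],
    "p2": ["AA", "AA", "AA", "BB", "BB", "BB", "AB"],
    "p3": ["AA", "AA", "AA", "BB", "AB", "BB", "AA"],
}
flags = consistent_flags(family)
strict = homozygous_blocks(flags, min_snps=2, max_errors=0)
tolerant = homozygous_blocks(flags, min_snps=3, max_errors=1)
```

Raising `max_errors` merges neighbouring runs that are separated by a single discordant call, which is the role the abstract assigns to the user-specified error parameter.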
Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography.
Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi
2013-03-01
New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined 'elimination' status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools was applied to a group of 201 multibacillary patients, including six multi-case families, from eleven departments. The locations (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR was developed for rapid and inexpensive strain typing of Mycobacterium leprae based on copy numbers of two VNTR minisatellite loci, 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected by combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54], predominantly distributed in the Atlantic departments, and SNP7614(T)/27-5(4)/12-5(5) [T45], associated with the Andean departments. A novel genotype, SNP7614(C)/27-5(6)/12-5(4) [C64], was detected in cities along the Magdalena river, which separates the Andean from the Atlantic departments; a subset was further characterized, showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large-scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. Copyright © 2012 Elsevier B.V. All rights reserved.
Jiang, Y; Zhao, Y; Rodemann, B; Plieske, J; Kollers, S; Korzun, V; Ebmeyer, E; Argillier, O; Hinze, M; Ling, J; Röder, M S; Ganal, M W; Mette, M F; Reif, J C
2015-03-01
Genome-wide mapping approaches in diverse populations are powerful tools to unravel the genetic architecture of complex traits. The main goals of our study were to investigate the potential and limits to unravel the genetic architecture and to identify the factors determining the accuracy of prediction of the genotypic variation of Fusarium head blight (FHB) resistance in wheat (Triticum aestivum L.) based on data collected with a diverse panel of 372 European varieties. The wheat lines were phenotyped in multi-location field trials for FHB resistance and genotyped with 782 simple sequence repeat (SSR) markers, and 9k and 90k single-nucleotide polymorphism (SNP) arrays. We applied genome-wide association mapping in combination with fivefold cross-validations and observed surprisingly high accuracies of prediction for marker-assisted selection based on the detected quantitative trait loci (QTLs). Using a random sample of markers not selected for marker-trait associations revealed only a slight decrease in prediction accuracy compared with marker-based selection exploiting the QTL information. The same picture was confirmed in a simulation study, suggesting that relatedness is a main driver of the accuracy of prediction in marker-assisted selection of FHB resistance. When the accuracy of prediction of three genomic selection models was contrasted for the three marker data sets, no significant differences in accuracies among marker platforms and genomic selection models were observed. Marker density impacted the accuracy of prediction only marginally. Consequently, genomic selection of FHB resistance can be implemented most cost-efficiently based on low- to medium-density SNP arrays.
Erdoğan, Onur; Aydin Son, Yeşim
2014-01-01
Single Nucleotide Polymorphisms (SNPs) are the most common genomic variations, where only a single nucleotide differs between individuals. Individual SNPs and SNP profiles associated with diseases can be utilized as biological markers, but there is a need to determine the SNP subsets and patients' clinical data that are informative for diagnosis. Data mining approaches have the highest potential for extracting knowledge from genomic datasets and selecting the representative SNPs, as well as the most effective and informative clinical features, for the clinical diagnosis of diseases. In this study, we applied one of the most widely used data mining classification methodologies, the "decision tree", to associate SNP biomarkers and significant clinical data with Alzheimer's disease (AD), which is the most common form of dementia. Different tree construction parameters have been compared for optimization, and the most accurate tree for predicting AD is presented.
Mikheecheva, Natalya E; Zaychikova, Marina V; Melerzanov, Alexander V; Danilenko, Valery N
2017-04-01
Mycobacterium tuberculosis is divided into several distinct lineages, and various genetic markers such as IS-elements, VNTR, and SNPs are used for lineage identification. We propose an M. tuberculosis classification approach based on functional polymorphisms in virulence genes. An M. tuberculosis virulence gene catalog has been established, including 319 genes from various protein groups, such as proteases, cell wall proteins, fatty acid and lipid metabolism proteins, sigma factors, and toxin-antitoxin systems. Another catalog of 1,573 M. tuberculosis isolates of different lineages has been developed. The developed SNP-calling program has identified 3,563 nonsynonymous SNPs. The constructed SNP-based phylogeny reflected the evolutionary relationship between lineages and detected new sublineages. SNP analysis of sublineage F15/LAM4/KZN revealed four lineage-specific mutations in cyp125, mce3B, vapC25, and vapB34. The Ural lineage has been divided into two geographical clusters based on different SNPs in virulence genes. A new sublineage, B0/N-90, was detected inside the Beijing-B0/W-148 sublineage by SNPs in irtB, mce3F and vapC46. We have found 27 members of B0/N-90 among the 227 available genomes of the Beijing-B0/W-148 sublineage. Whole-genome sequencing of strain B9741, isolated from an HIV-positive patient, demonstrated that it belongs to the new B0/N-90 group. A primer set for PCR detection of B0/N-90 lineage-specific mutations has been developed. The prospective use of mce3 mutant genes as a genetically engineered vaccine is discussed. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Chen, Qian; Song, Jun; Du, Wen-Ping; Xu, Li-Yuan; Jiang, Yun; Zhang, Jie; Xiang, Xiao-Li; Yu, Gui-Rong
2018-06-27
Semi-dwarfism is an agronomically important trait in breeding for stable high yields and for resistance to damage by wind and rain (lodging resistance). Many QTLs and genes causing dwarf phenotypes have been found in maize. However, because of the yield loss associated with these QTLs and genes, they have been difficult to use in breeding for dwarf stature in maize. Therefore, it is important to find new dwarfing genes or materials without undesirable characters. The objectives of this study were (1) to determine the inheritance of semi-dwarfism in mutants and (2) to map the dwarfing gene or QTL. The study used the maize inbred line '18599' and 'DM173', a dwarf mutant derived from the maize inbred line '173' through 60Co-γ ray irradiation. F2 and BC1F1 populations were used for genetic analysis. Whole-genome resequencing-based mapping (QTL-seq) was performed on a dwarf bulk and a tall bulk from the F2 population to map the dwarfing gene and to identify SNP markers in the predicted region. Based on the polymorphic SNP markers from QTL-seq, the dwarfing gene was fine-mapped using the F2 population. In the F2 population, 398 plants were dwarf and 135 were tall. χ2 tests indicated that the ratio of dwarf to tall plants fitted a 3:1 ratio. Furthermore, χ2 tests of the BC1F1 population showed that the ratio fitted a 1:1 ratio. Based on QTL-seq, the dwarfing gene was located in the region from 111.07 to 124.56 Mb of chromosome 9, and we named it rht-DM. Using traditional QTL mapping with SNP markers, rht-DM was narrowed down to a 400-kb region between SNP-21 and SNP-24. The two SNPs were located at 0.43 and 0.11 cM. Segregation analysis of the F2 and BC1F1 populations indicated that the dwarfing gene was likely a dominant gene. This dwarfing gene was located in the region between 115.02 and 115.42 Mb on chromosome 9.
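The goodness-of-fit check behind the reported 3:1 segregation can be reproduced from the counts given in the abstract (398 dwarf, 135 tall). The following Python sketch is an illustration, not the authors' analysis script; the function name is hypothetical, and 3.841 is the standard 5% critical value of the χ2 distribution with one degree of freedom.

```python
def chi_square_gof(observed, ratio):
    """Pearson chi-square statistic for observed counts against a ratio."""
    total = sum(observed)
    ratio_sum = sum(ratio)
    expected = [total * r / ratio_sum for r in ratio]
    stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return stat, expected


# F2 counts from the abstract: 398 dwarf and 135 tall plants, tested
# against the 3:1 ratio expected for a single dominant gene.
stat, expected = chi_square_gof([398, 135], [3, 1])

CRITICAL_DF1_ALPHA_05 = 3.841  # chi-square critical value, df = 1, alpha = 0.05
fits_3_to_1 = stat < CRITICAL_DF1_ALPHA_05

print(f"expected = {expected}, chi2 = {stat:.4f}, fits 3:1 = {fits_3_to_1}")
```

The statistic is far below the critical value, so the observed counts do not reject a 3:1 ratio, in line with the abstract's conclusion of a single dominant gene.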
Cañas-Álvarez, J J; Mouresan, E F; Varona, L; Díaz, C; Molina, A; Baro, J A; Altarriba, J; Carabaño, M J; Casellas, J; Piedrafita, J
2016-07-01
Linkage disequilibrium (LD) and persistence of phase are fundamental approaches for exploring the genetic basis of economically important traits in cattle, including the identification of QTL for genomic selection and the estimation of effective population size (Ne) to determine the size of the training populations. In this study, we have used the Illumina BovineHD chip in 168 trios of 7 Spanish beef cattle breeds to obtain an overview of the magnitude of LD and the persistence of LD phase through the physical distance between markers. Also, we estimated the time of divergence based on the persistence of the LD phase and calculated past Ne from LD estimates using different alternatives to define the recombination rate. Estimates of average r² (as a measure of LD) for adjacent markers were close to 0.52 in the 7 breeds and decreased with the distance between markers, although at long distances some LD still remained (0.07 and 0.05 for markers 200 kb and 1 Mb apart, respectively). A panel with a lower boundary of 38,000 SNP would be necessary to launch a successful within-breed genomic selection program. Persistence of phase, measured as the pairwise correlations between estimates of r in 2 breeds, was in the 0.89 to 0.94 range at short distances (10 kb) and decreased from a range of 0.33 to 0.52 to a range of 0.01 to 0.08 when marker distance increased from 200 kb to 1 Mb, respectively. The magnitude of the persistence of phase between the Spanish beef breeds was similar to that found in dairy breeds. For across-breed genomic selection, the size of the SNP panels must be in the range of 50,000 to 83,000 SNP. Estimates of past Ne showed values ranging from 26 to 31 for 1 generation ago in all breeds. The divergence among breeds occurred between 129 and 207 generations ago. The results of this study are relevant for the future implementation of within- and across-breed genomic selection programs in the Spanish beef cattle populations.
Our results suggest that a reduced subset of the SNP panel would be enough to achieve an adequate precision of the genomic predictions.
Jighly, Abdulqader; Oyiga, Benedict C; Makdis, Farid; Nazari, Kumarse; Youssef, Omran; Tadesse, Wuletaw; Abdalla, Osman; Ogbonnaya, Francis C
2015-07-01
Identified DArT and SNP markers including a first reported QTL on 3AS, validated large effect APR on 3BS. The different genes can be used to incorporate stripe resistance in cultivated varieties. Stripe rust [yellow rust, caused by Puccinia striiformis f. sp. tritici (Pst)] is a serious disease in wheat (Triticum aestivum). This study employed genome-wide association mapping (GWAM) to identify markers linked to stripe rust resistance genes using Diversity Arrays Technology (DArT(®)) and single-nucleotide polymorphism (SNP) Infinium 9K assays in 200 ICARDA wheat genotypes, phenotyped for seedling and adult plant resistance in two sites over two growing seasons in Syria. Only 25.8 % of the genotypes showed resistance at seedling stage while about 33 and 44 % showed moderate resistance and resistance response, respectively. Mixed-linear model adjusted for false discovery rate at p < 0.05 identified 12 DArT and 29 SNP markers on chromosome arms 3AS, 3AL, 1AL, 2AL, 2BS, 2BL, 3BS, 3BL, 5BL, 6AL, and 7DS significantly linked to Pst resistance genes. Of these, the locus on 3AS has not been previously reported to confer resistance to stripe rust in wheat. The QTL on 3AS, 3AL, 1AL, 2AL, and 2BS were effective at seedling and adult plant growth stages while those on 3BS, 3BL, 5BL, 6AL and 7DS were effective at adult plant stage. The 3BS QTL was validated in Cham-6 × Cham-8 recombinant inbred line population; composite interval analysis identified a stripe resistance QTL flanked by the DArT marker, wPt-798970, contributed by Cham-6 parent which accounted for 31.2 % of the phenotypic variation. The DArT marker "wPt-798970" lies 1.6 cM away from the 3BS QTL detected within GWAM. Epistatic interactions were also investigated; only the QTL on 1AL, 3AS and 6AL exhibited interactions with other loci. These results suggest that GWAM can be an effective approach for identifying and improving resistance to stripe rust in wheat.
Shen, Qi; Zhang, Dong; Sun, Wei; Zhang, Yu-Jun; Shang, Zhi-Wei; Chen, Shi-Lin
2017-05-01
Perilla frutescens is one of 60 food and medicine plants in the initial directory announced by the health ministry of China. With the development of the Perilla sector in recent years, the breeding and application of good varieties has become the main bottleneck of its development. This study reports the breeding of Perilla varieties by systematic selection combined with marker-assisted methods. Through whole-genome sequencing and consistency matching, mutation loci were annotated according to genome data and compared with the Perilla common variants database; finally, 30 non-synonymous SNPs were selected as characteristic markers of Zhongyan Feishu No.1. These SNP markers were used as the selection standard for Perilla varieties. The new Perilla variety Zhongyan Feishu No.1 was finally bred; it is dual-use for leaf and seed, high yielding and highly resistant, and can be used as green manure. Zhongyan Feishu No.1 obtained new plant variety identification from Beijing city, with identification number 2016054. Marker-assisted identification can guide the breeding of new plant varieties and provides a new reference for the breeding of medicinal plants. Copyright© by the Chinese Pharmaceutical Association.
Dettogni, Raquel Spinassé; Sá, Ricardo Tristão; Tovar, Thaís Tristão; Louro, Iúri Drumond
2013-08-01
Mapping single nucleotide polymorphisms (SNPs) in genes potentially involved in immune responses may help understand the pathophysiology of infectious diseases in specific geographical regions. In this context, we have aimed to analyze the frequency of immunogenetic markers, focusing on genes CD209 (SNP -336A/G), FCγRIIa (SNP -131H/R), TNF-α (SNP -308A/G) and VDR (SNP Taq I) in two populations of the Espirito Santo State (ES), Brazil: general and Pomeranian populations. Peripheral blood genomic DNA was extracted from one hundred healthy individuals of the general population and from 59 Pomeranians. Polymorphic variant identification was performed by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). SNP genotype frequencies were in Hardy-Weinberg Equilibrium. There was no statistically significant difference in allelic and genotypic distributions between the two populations studied. Statistically significant differences were observed for SNP genotype distribution in genes CD209, TNF-α and VDR when comparing the ES populations with other Brazilian populations. This is the first report of CD209, FcγRIIa, TNF-α and VDR allelic frequencies for the general and Pomeranian populations of ES.
Lu, Timothy Tehua; Lao, Oscar; Nothnagel, Michael; Junge, Olaf; Freitag-Wolf, Sandra; Caliebe, Amke; Balascakova, Miroslava; Bertranpetit, Jaume; Bindoff, Laurence Albert; Comas, David; Holmlund, Gunilla; Kouvatsi, Anastasia; Macek, Milan; Mollet, Isabelle; Nielsen, Finn; Parson, Walther; Palo, Jukka; Ploski, Rafal; Sajantila, Antti; Tagliabracci, Adriano; Gether, Ulrik; Werge, Thomas; Rivadeneira, Fernando; Hofman, Albert; Uitterlinden, André Gerardus; Gieger, Christian; Wichmann, Heinz-Erich; Ruether, Andreas; Schreiber, Stefan; Becker, Christian; Nürnberg, Peter; Nelson, Matthew Roberts; Kayser, Manfred; Krawczak, Michael
2009-07-01
Genetic matching potentially provides a means to alleviate the effects of incomplete Mendelian randomization in population-based gene-disease association studies. We therefore evaluated the genetic-matched pair study design on the basis of genome-wide SNP data (309,790 markers; Affymetrix GeneChip Human Mapping 500K Array) from 2457 individuals, sampled at 23 different recruitment sites across Europe. Using pair-wise identity-by-state (IBS) as a matching criterion, we tried to derive a subset of markers that would allow identification of the best overall matching (BOM) partner for a given individual, based on the IBS status for the subset alone. However, our results suggest that, by following this approach, the prediction accuracy is only notably improved by the first 20 markers selected, and increases proportionally to the marker number thereafter. Furthermore, in a considerable proportion of cases (76.0%), the BOM of a given individual, based on the complete marker set, came from a different recruitment site than the individual itself. A second marker set, specifically selected for ancestry sensitivity using singular value decomposition, performed even more poorly and was no more capable of predicting the BOM than randomly chosen subsets. This leads us to conclude that, at least in Europe, the utility of the genetic-matched pair study design depends critically on the availability of comprehensive genotype information for both cases and controls.
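The matching criterion used in this study, pair-wise identity-by-state (IBS), can be sketched in a few lines of Python. The sketch assumes genotypes coded as minor-allele counts (0, 1, 2) and uses hypothetical function names and toy data; it illustrates the criterion only and is not the authors' pipeline.

```python
def mean_ibs(g1, g2, markers=None):
    """Mean proportion of alleles shared identical-by-state.

    g1, g2: genotypes coded as minor-allele counts (0, 1 or 2) per marker.
    markers: optional list of marker indices to restrict the comparison.
    """
    idx = range(len(g1)) if markers is None else markers
    shared = [2 - abs(g1[i] - g2[i]) for i in idx]
    return sum(shared) / (2 * len(shared))


def best_match(target, panel, markers=None):
    """Return the individual in the panel with the highest IBS to the target."""
    candidates = (name for name in panel if name != target)
    return max(candidates,
               key=lambda name: mean_ibs(panel[target], panel[name], markers))


# Toy panel of three individuals and four markers (invented values).
panel = {
    "a": [0, 1, 2, 2],
    "b": [0, 1, 2, 1],
    "c": [2, 1, 0, 0],
}
bom_full = best_match("a", panel)                    # all markers
bom_subset = best_match("a", panel, markers=[0, 1])  # marker subset only
```

Comparing `bom_subset` with `bom_full` across many individuals gives the kind of prediction accuracy the abstract reports for marker subsets.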
Liu, Shi; Gao, Peng; Zhu, Qianglong; Luan, Feishi; Davis, Angela R.; Wang, Xiaolu
2016-01-01
Cleaved amplified polymorphic sequence (CAPS) markers are useful tools for detecting single nucleotide polymorphisms (SNPs). This study detected SNP sites and converted them into CAPS markers based on high-throughput re-sequencing data in watermelon, for linkage map construction and quantitative trait locus (QTL) analysis. Two inbred lines, Cream of Saskatchewan (COS) and LSW-177, were re-sequenced and analyzed with custom Perl scripts for CAPS marker development. 88.7% and 78.5% of the assembled sequences of the two parental materials could be mapped to the reference watermelon genome, respectively. Comparative analysis of the assembled genome data provided 225,693 and 19,268 SNPs and indels between the two materials. 532 pairs of CAPS markers were designed with 16 restriction enzymes, among which 271 pairs of primers gave distinct bands of the expected length and polymorphic bands, via PCR and enzyme digestion, with a polymorphic rate of 50.94%. Using the new CAPS markers, an initial CAPS-based genetic linkage map was constructed with the F2 population, spanning 1836.51 cM with 11 linkage groups and 301 markers. 12 QTLs were detected related to fruit flesh color, length, width, shape index, and brix content. These new CAPS markers will be a valuable resource for breeding programs and genetic studies of watermelon. PMID:27162496
2013-07-01
as a statistical graphic, and Pearson product moment correlation coefficients as measures of the strength of linear association; 4) performing SNP ...determine if there are differences in single nucleotide polymorphisms ( SNPs ) in selected candidate genes implicated in metabolic syndrome, obesity, chronic...samples for the serum and SNP analyses. We have reached a target of 500 patients at the end of year 2; however, some of the patients turned out to be
High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).
Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C
2016-03-01
Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies. © 2015 John Wiley & Sons Ltd.
Jaiswal, Preeti; Guhathakurta, Subhrangshu; Singh, Asem Surindro; Verma, Deepak; Pandey, Mritunjay; Varghese, Merina; Sinha, Swagata; Ghosh, Saurabh; Mohanakumar, Kochupurackal P; Rajamma, Usha
2015-01-02
Presence of platelet hyperserotonemia and effective amelioration of behavioral dysfunctions by selective serotonin reuptake inhibitors (SSRI) in autism spectrum disorders (ASD) indicate that irregularities in serotonin (5-HT) reuptake and its homeostasis could be the basis of behavioral impairments in ASD patients. SLC6A4, the gene encoding serotonin transporter (SERT) is considered as a potential susceptibility gene for ASD, since it is a quantitative trait locus for blood 5-HT levels. Three functional polymorphisms, 5-HTTLPR, STin2 and 3'UTR-SNP of SLC6A4 are extensively studied for possible association with the disorder, with inconclusive outcome. In the present study, we investigated association of these polymorphisms with platelet 5-HT content and symptoms severity as revealed by childhood autism rating scale in ASD children from an Indian population. Higher 5-HT level observed in ASD was highly significant in children with heterozygous and homozygous genotypes comprising of minor alleles of the markers. Quantitative transmission disequilibrium test demonstrated significant genetic effect of STin2 allele as well as STin2/3'UTR-SNP and 5-HTTLPR/3'UTR-SNP haplotypes on 5-HT levels, but no direct association with overall CARS score and ASD phenotype. Significant genetic effect of the markers on specific behavioral phenotypes was observed for various sub-phenotypes of CARS in quantitative trait analysis. Even though the 5-HT level was not associated with severity of behavioral CARS score, a significant negative relationship was observed for 5-HT levels and level and consistency of intellectual response and general impression in ASD children. Population-based study revealed higher distribution of the haplotype 10/G of STin2/3'UTR-SNP in male controls, suggesting protective effect of this haplotype in male cases. 
Overall, the results of the study suggest that SLC6A4 markers have specific genetic effects on individual ASD behavioral attributes, possibly through the modulation of 5-HT content. Copyright © 2014 Elsevier Inc. All rights reserved.
Shahid, Muhammad Qasim; Çiftçi, Vahdettin; E. Sáenz de Miera, Luis; Aasim, Muhammad; Nadeem, Muhammad Azhar; Aktaş, Husnu; Özkan, Hakan; Hatipoğlu, Rüştü
2017-01-01
Until now, little attention has been paid to the geographic distribution and evaluation of genetic diversity of durum wheat from the Central Fertile Crescent (modern-day Turkey and Syria). Turkey and Syria are considered primary centers of wheat diversity, and thousands of locally adapted wheat landraces are still present in farmers' small fields. We planned this study to evaluate the genetic diversity of durum wheat landraces from the Central Fertile Crescent by genotyping based on DArTseq and SNP analysis. A total of 39,568 DArTseq and 20,661 SNP markers were used to characterize the genetic characteristics of 91 durum wheat landraces. Clustering based on neighbor-joining analysis, principal coordinate analysis, and the Bayesian model implemented in STRUCTURE clearly showed that the grouping pattern is not associated with the geographical distribution of the durum wheat, due to the mixing of the Turkish and Syrian landraces. A significant correlation between DArTseq and SNP markers was observed in the Mantel test. However, we detected a non-significant relationship between geographical coordinates and DArTseq (r = -0.085) and SNP (r = -0.039) loci. These results showed that unconscious farmer selection and the lack of commercial varieties might have resulted in the exchange of genetic material, and this was apparent in the genetic structure of durum wheat in Turkey and Syria. The genomic characterization presented here is an essential step towards future exploitation of the available durum wheat genetic resources in genomic and breeding programs. The results of this study also provide clear insight into the genetic diversity of wheat accessions from the Central Fertile Crescent. PMID:28099442
Serin, Elise A. R.; Snoek, L. B.; Nijveen, Harm; Willems, Leo A. J.; Jiménez-Gómez, Jose M.; Hilhorst, Henk W. M.; Ligterink, Wilco
2017-01-01
High-density genetic maps are essential for high resolution mapping of quantitative traits. Here, we present a new genetic map for an Arabidopsis Bayreuth × Shahdara recombinant inbred line (RIL) population, built on RNA-seq data. RNA-seq analysis on 160 RILs of this population identified 30,049 single-nucleotide polymorphisms (SNPs) covering the whole genome. Based on a 100-kbp window SNP binning method, 1059 bin-markers were identified, physically anchored on the genome. The total length of the RNA-seq genetic map spans 471.70 centimorgans (cM) with an average marker distance of 0.45 cM and a maximum marker distance of 4.81 cM. This high resolution genotyping revealed new recombination breakpoints in the population. To highlight the advantages of such high-density map, we compared it to two publicly available genetic maps for the same population, comprising 69 PCR-based markers and 497 gene expression markers derived from microarray data, respectively. In this study, we show that SNP markers can effectively be derived from RNA-seq data. The new RNA-seq map closes many existing gaps in marker coverage, saturating the previously available genetic maps. Quantitative trait locus (QTL) analysis for published phenotypes using the available genetic maps showed increased QTL mapping resolution and reduced QTL confidence interval using the RNA-seq map. The new high-density map is a valuable resource that facilitates the identification of candidate genes and map-based cloning approaches. PMID:29259624
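The 100-kbp window binning mentioned in this abstract can be pictured with a short Python sketch. The abstract does not describe how a bin genotype is called, so the majority vote, the 'A'/'B' parental coding, the 'N' code for ties, and the function name are all assumptions made for illustration.

```python
from collections import Counter, defaultdict

WINDOW_BP = 100_000  # window size stated in the abstract


def bin_genotypes(snps, window=WINDOW_BP):
    """Collapse SNP calls of one line on one chromosome into bin markers.

    snps: iterable of (position_bp, call), where call is 'A' or 'B' for the
    two parental alleles (hypothetical coding). Each bin receives the
    majority call; ties are reported as 'N' (undetermined).
    """
    by_bin = defaultdict(list)
    for pos, call in snps:
        by_bin[pos // window].append(call)

    result = {}
    for bin_index, calls in sorted(by_bin.items()):
        ranked = Counter(calls).most_common()
        if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
            result[bin_index] = "N"
        else:
            result[bin_index] = ranked[0][0]
    return result


# Toy SNP calls for a single recombinant inbred line (invented values).
calls = [
    (1_200, "A"), (50_000, "A"), (99_999, "B"),
    (100_000, "B"), (150_000, "B"),
    (250_000, "A"), (260_000, "B"),
]
bins = bin_genotypes(calls)
```

Binning trades single-SNP resolution for robustness: one miscalled SNP inside a window, such as the 'B' at 99,999 bp above, is outvoted by its neighbours.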
Saxena, Rachit K.; Varma Penmetsa, R.; Upadhyaya, Hari D.; Kumar, Ashish; Carrasquilla-Garcia, Noelia; Schlueter, Jessica A.; Farmer, Andrew; Whaley, Adam M.; Sarma, Birinchi K.; May, Gregory D.; Cook, Douglas R.; Varshney, Rajeev K.
2012-01-01
Single-nucleotide polymorphisms (SNPs, >2000) were discovered by using RNA-seq and allele-specific sequencing approaches in pigeonpea (Cajanus cajan). For making the SNP genotyping cost-effective, successful competitive allele-specific polymerase chain reaction (KASPar) assays were developed for 1616 SNPs and referred to as PKAMs (pigeonpea KASPar assay markers). Screening of PKAMs on 24 genotypes [23 from cultivated species and 1 wild species (Cajanus scarabaeoides)] defined a set of 1154 polymorphic markers (77.4%) with a polymorphism information content (PIC) value from 0.04 to 0.38. One thousand and ninety-four PKAMs showed polymorphisms between parental lines of the reference mapping population (C. cajan ICP 28 × C. scarabaeoides ICPW 94). By using high-quality marker genotyping data on 167 F2 lines from the population, a comprehensive genetic map comprising 875 PKAMs with an average inter-marker distance of 1.11 cM was developed. Previously mapped 35 simple sequence repeat markers were integrated into the PKAM map and an integrated genetic map of 996.21 cM was constructed. Mapped PKAMs showed a higher degree of synteny with the genome of Glycine max followed by Medicago truncatula and Lotus japonicus and least with Vigna unguiculata. These PKAMs will be useful for genetics research and breeding applications in pigeonpea and for utilizing genome information from other legume species. PMID:23103470
Gao, Guangtu; Nome, Torfinn; Pearse, Devon E; Moen, Thomas; Naish, Kerry A; Thorgaard, Gary H; Lien, Sigbjørn; Palti, Yniv
2018-01-01
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout (Oncorhynchus mykiss), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were doubled haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup, followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the doubled haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the 29 chromosomes of the rainbow trout genome and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used.
Intra-population polymorphism assessment revealed a high level of polymorphism and heterozygosity within each population. We also provide functional annotation based on the genome position of each SNP and evaluate the use of clonal lines for filtering of PSVs and MSVs. These SNPs form a new database, which provides an important resource for a new high density SNP array design and for other SNP genotyping platforms used for genetic and genomics studies of this iconic salmonid fish species.
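The two summary figures in this abstract, SNP density and the MAF > 0.25 threshold, rest on simple arithmetic that can be sketched as follows. The function names and the genotype counts are hypothetical; only the 64 bp spacing, the 0.25 threshold, and the sample size of 61 come from the abstract.

```python
def snps_per_kb(bp_per_snp):
    """Convert an average spacing of one SNP per N bp into SNPs per 1 kb."""
    return 1000.0 / bp_per_snp

def minor_allele_frequency(n_ref_hom, n_het, n_alt_hom):
    """MAF of a biallelic SNP from diploid genotype counts."""
    total_alleles = 2 * (n_ref_hom + n_het + n_alt_hom)
    if total_alleles == 0:
        raise ValueError("no genotyped samples")
    alt_freq = (n_het + 2 * n_alt_hom) / total_alleles
    return min(alt_freq, 1.0 - alt_freq)

# One SNP per 64 bp, as reported for the chromosomes.
print(snps_per_kb(64))

# Hypothetical genotype counts across 61 samples.
maf = minor_allele_frequency(n_ref_hom=30, n_het=20, n_alt_hom=11)
print(maf, maf > 0.25)
```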
USDA-ARS's Scientific Manuscript database
Verticillium wilt (VW) of alfalfa is a soilborne disease that causes severe yield loss in alfalfa. To identify molecular markers associated with VW resistance, an integrated framework of genome-wide association study (GWAS) with high-throughput genotyping by sequencing (GBS) was used for mapping lo...
X-linked infantile spinal muscular atrophy: clinical definition and molecular mapping.
Dressman, Devin; Ahearn, Mary Ellen; Yariz, Kemal O; Basterrecha, Hugo; Martínez, Francisco; Palau, Francesc; Barmada, M Michael; Clark, Robin Dawn; Meindl, Alfons; Wirth, Brunhilde; Hoffman, Eric P; Baumbach-Reardon, Lisa
2007-01-01
X-linked infantile spinal-muscular atrophy (XL-SMA) is a rare disorder, which presents with the clinical characteristics of hypotonia, areflexia, and multiple congenital contractures (arthrogryposis) associated with loss of anterior horn cells and death in infancy. We have previously reported a single family with XL-SMA that mapped to Xp11.3-q11.2. Here we report further clinical description of XL-SMA plus an additional seven unrelated XL-SMA families from North America and Europe that show linkage data consistent with the same region. We first investigated linkage to the candidate disease gene region using microsatellite repeat markers. We further saturated the candidate disease gene region using polymorphic microsatellite repeat markers and single nucleotide polymorphisms in an effort to narrow the critical region. Two-point and multipoint linkage analysis was performed using the Allegro software package. Linkage analysis of all XL-SMA families displayed linkage consistent with the original XL-SMA region. The addition of new families and new markers has narrowed the disease gene interval for an XL-SMA locus to the region between SNP FLJ22843, near marker DXS8080, and SNP ARHGEF9, near DXS7132 (Xp11.3-Xq11.1).
He, Yanxia; Yuan, Wangjun; Dong, Meifang; Han, Yuanji; Shang, Fude
2017-01-01
Osmanthus fragrans is an ornamental plant of substantial commercial value, and no genetic linkage maps of this species have previously been reported. Specific-locus amplified fragment sequencing (SLAF-seq) is a recently developed technology that allows large numbers of single nucleotide polymorphisms (SNPs) to be identified and genotyped at high resolution. In our current research, we generated the first genetic map of O. fragrans using SLAF-seq, which produced 206.92 M paired-end reads and 173,537 SLAF markers. Among a total of 90,715 polymorphic SLAF markers, 15,317 could be used for genetic map construction. The integrated map contained 14,189 high quality SLAFs that were grouped in 23 genetic linkage groups, with a total length of 2962.46 cM and an average distance of 0.21 cM between two adjacent markers. In addition, 23,664 SNPs were identified from the mapped markers. As far as we know, this is the first genetic map of O. fragrans. Our results further demonstrate that SLAF-seq is a very effective method for developing markers and constructing high-density linkage maps. The SNP markers and the genetic map reported in this study should be a valuable resource in future research. PMID:29018460
Mapping of a major QTL for salt tolerance of mature field-grown maize plants based on SNP markers.
Luo, Meijie; Zhao, Yanxin; Zhang, Ruyang; Xing, Jinfeng; Duan, Minxiao; Li, Jingna; Wang, Naishun; Wang, Wenguang; Zhang, Shasha; Chen, Zhihui; Zhang, Huasheng; Shi, Zi; Song, Wei; Zhao, Jiuran
2017-08-15
Salt stress significantly restricts plant growth and production. Maize is an important food and economic crop but is also a salt sensitive crop. Identifying the genetic architecture controlling salt tolerance helps breeders to select salt tolerant lines. However, the critical quantitative trait loci (QTLs) responsible for the salt tolerance of field-grown maize plants are still unknown. To map the main genetic factors contributing to salt tolerance in mature maize, a double haploid population (240 individuals) and 1317 single nucleotide polymorphism (SNP) markers were employed to produce a genetic linkage map covering 1462.05 cM. Plant height of mature maize cultivated in the saline field (SPH) and plant height-based salt tolerance index (ratio of plant height between saline and control fields, PHI) were used to evaluate salt tolerance of mature maize plants. A major QTL for SPH was detected on Chromosome 1 with a LOD score of 22.4, which explained 31.2% of the phenotypic variation. In addition, the major QTL conditioning PHI was also mapped at the same position on Chromosome 1, and two candidate genes involved in ion homeostasis were identified within the confidence interval of this QTL. The detection of the major QTL in adult maize plants establishes the basis for the map-based cloning of genes associated with salt tolerance and provides a potential target for marker assisted selection in developing maize varieties with salt tolerance.
Non-Cholesterol Sterol Levels Predict Hyperglycemia and Conversion to Type 2 Diabetes in Finnish Men
Cederberg, Henna; Gylling, Helena; Miettinen, Tatu A.; Paananen, Jussi; Vangipurapu, Jagadish; Pihlajamäki, Jussi; Kuulasmaa, Teemu; Stančáková, Alena; Smith, Ulf; Kuusisto, Johanna; Laakso, Markku
2013-01-01
We investigated the levels of non-cholesterol sterols as predictors for the development of hyperglycemia (an increase in the glucose area under the curve in an oral glucose tolerance test) and incident type 2 diabetes in a 5-year follow-up study of a population-based cohort of Finnish men (METSIM Study, N = 1,050) having non-cholesterol sterols measured at baseline. Additionally we determined the association of 538,265 single nucleotide polymorphisms (SNP) with non-cholesterol sterol levels in a cross-sectional cohort of non-diabetic offspring of type 2 diabetes (the Kuopio cohort of the EUGENE2 Study, N = 273). We found that in a cross-sectional METSIM Study the levels of sterols indicating cholesterol absorption were reduced as a function of increasing fasting glucose levels, whereas the levels of sterols indicating cholesterol synthesis were increased as a function of increasing 2-hour glucose levels. A cholesterol synthesis marker desmosterol significantly predicted an increase, and two absorption markers (campesterol and avenasterol) a decrease in the risk of hyperglycemia and incident type 2 diabetes in a 5-year follow-up of the METSIM cohort, mainly attributable to insulin sensitivity. A SNP of ABCG8 was associated with fasting plasma glucose levels in a cross-sectional study but did not predict hyperglycemia or incident type 2 diabetes. In conclusion, the levels of some, but not all non-cholesterol sterols are markers of the worsening of hyperglycemia and type 2 diabetes. PMID:23840693
Ju, Yiqian; Jiao, Yao; Feng, Lu; Pan, Huitang; Cheng, Tangren; Zhang, Qixiang
2016-01-01
The genetic control of plant architecture is a promising approach to breed desirable cultivars, particularly in ornamental flowers. In this study, the F1 population (142 seedlings) derived from Lagerstroemia fauriei (non-dwarf) × L. indica ‘Pocomoke’ (dwarf) was phenotyped for six traits (plant height (PH), internode length (IL), internode number, primary lateral branch height (PLBH), secondary lateral branch height and primary branch number), and the IL and PLBH traits were positively correlated with the PH trait and considered representative indexes of PH. Fifty non-dwarf and dwarf seedlings were pooled and subjected to a specific-locus amplified fragment sequencing (SLAF-seq) method, which screened 1221 polymorphic markers. A total of 3 markers segregating between bulks were validated in the F1 population, with the M16337 and M38412 markers highly correlated with the IL trait and the M25207 marker highly correlated with the PLBH trait. These markers provide a predictability of approximately 80% using a single marker (M25207) and a predictability of 90% using marker combinations (M16337 + M25207) in the F1 population, which revealed that the IL and the PLBH traits, especially the PLBH, were the decisive elements for PH in terms of molecular regulation. Further validation was performed in the BC1 population and a set of 28 Lagerstroemia stocks using allele-specific PCR (AS-PCR) technology, and the results showed the stability and reliability of the SNP markers and the co-determination of PH by multiple genes. Our findings provide an important theoretical and practical basis for the early prediction and indirect selection of PH using the IL and the PLBH, and the detected SNPs may be useful for marker-assisted selection (MAS) in crape myrtle. PMID:27404662
Lusk, Christine M.; Dyson, Greg; Clark, Andrew G.; Ballantyne, Christie M.; Frikke-Schmidt, Ruth; Tybjærg-Hansen, Anne; Boerwinkle, Eric
2014-01-01
Markers of the chromosome 9p21 region are regarded as the strongest and most reliably significant genome-wide association study (GWAS) signals for Coronary heart disease (CHD) risk; this was recently confirmed by the CARDIoGRAMplusC4D Consortium meta-analysis. However, while these associations are significant at the population level, they may not be clinically relevant predictors of risk for all individuals. We describe here the results of a study designed to address the question: What is the contribution of context defined by traditional risk factors in determining the utility of DNA sequence variations marking the 9p21 region for explaining variation in CHD risk? We analyzed a sample of 7,589 (3,869 females and 3,720 males) European American participants of the Atherosclerosis Risk in Communities study. We confirmed CHD-SNP genotype associations for two 9p21 region marker SNPs previously identified by the CARDIoGRAMplusC4D Consortium study, of which ARIC was a part. We then tested each marker SNP genotype effect on prediction of CHD within sub-groups of the ARIC sample defined by traditional CHD risk factors by applying a novel multi-model strategy, PRIM. We observed that the effects of SNP genotypes in the 9p21 region were strongest in a subgroup of hypertensives. We subsequently validated the effect of the region in an independent sample from the Copenhagen City Heart Study. Our study suggests that marker SNPs identified as predictors of CHD risk in large population based GWAS may have their greatest utility in explaining risk of disease in particular sub-groups characterized by biological and environmental effects measured by the traditional CHD risk factors. PMID:24889828
Ye, Changrong; Tenorio, Fatima A; Redoña, Edilberto D; Morales-Cortezano, Portia S; Cabrega, Gleizl A; Jagadish, Krishna S V; Gregorio, Glenn B
2015-08-01
This study fine mapped and validated a QTL on rice chromosome 4 that increases spikelet fertility under high temperature (over 37 °C) at the flowering stage. Climate change has a negative effect on crop production and food security. Understanding the genetic mechanism of heat tolerance and developing heat-tolerant varieties is essential to cope with future global warming. Previously, we reported on a QTL (qHTSF4.1) from an IR64/N22 population responsible for rice spikelet fertility under high-temperature stress at the flowering stage. To further fine map and validate the effect of qHTSF4.1, PCR-based SNP markers were developed and used to genotype BC2F2, BC3F2, BC3F3, and BC5F2 populations from the same cross. The interval of the QTL was narrowed down to about 1.2 Mb; however, further recombination was not identified even with a large BC5F2 population that was subsequently developed and screened. The sequence in the QTL region is highly conserved and a large number of genes in the same gene family were observed to be clustered in the region. The QTL qHTSF4.1 consistently increased spikelet fertility in all of the backcross populations. This was confirmed using 24 rice varieties. Most of the rice varieties with the QTL showed a certain degree of heat tolerance under high-temperature conditions. In a BC5F2 population with clean background of IR64, QTL qHTSF4.1 increased spikelet fertility by about 15%. It could be an important source for enhancing heat tolerance in rice at the flowering stage. PCR-based SNP markers developed in this study can be used for QTL introgression and for pyramiding with other agronomically important QTLs/genes through marker-assisted selection.
Muleta, Kebede T; Bulli, Peter; Zhang, Zhiwu; Chen, Xianming; Pumphrey, Michael
2017-11-01
Harnessing diversity from germplasm collections is more feasible today because of the development of lower-cost and higher-throughput genotyping methods. However, the cost of phenotyping is still generally high, so efficient methods of sampling and exploiting useful diversity are needed. Genomic selection (GS) has the potential to enhance the use of desirable genetic variation in germplasm collections through predicting the genomic estimated breeding values (GEBVs) for all traits that have been measured. Here, we evaluated the effects of various scenarios of population genetic properties and marker density on the accuracy of GEBVs in the context of applying GS for wheat (Triticum aestivum L.) germplasm use. Empirical data for adult plant resistance to stripe rust (Puccinia striiformis f. sp. tritici) collected on 1163 spring wheat accessions and genotypic data based on the wheat 9K single nucleotide polymorphism (SNP) iSelect assay were used for various genomic prediction tests. Unsurprisingly, the results of the cross-validation tests demonstrated that prediction accuracy increased with an increase in training population size and marker density. It was evident that using all the available markers (5619) was unnecessary for capturing the trait variation in the germplasm collection, with no further gain in prediction accuracy beyond 1 SNP per 3.2 cM (∼1850 markers), which is close to the linkage disequilibrium decay rate in this population. Collectively, our results suggest that larger germplasm collections may be efficiently sampled via lower-density genotyping methods, whereas genetic relationships between the training and validation populations remain critical when exploiting GS to select from germplasm collections. Copyright © 2017 Crop Science Society of America.
Holdsworth, William L.; LaPlant, Kyle E.; Bell, Duane C.; Jahn, Molly M.; Mazourek, Michael
2016-01-01
Powdery mildew is a major fungal disease on squash and pumpkin (Cucurbita spp.) in the US and throughout the world. Genetic resistance to the disease is not known to occur naturally within Cucurbita pepo and only infrequently in Cucurbita moschata, but has been achieved in both species through the introgression of a major resistance gene from the wild species Cucurbita okeechobeensis subsp. martinezii. At present, this gene, Pm-0, is used extensively in breeding, and is found in nearly all powdery mildew-resistant C. pepo and C. moschata commercial cultivars. In this study, we mapped C. okeechobeensis subsp. martinezii-derived single nucleotide polymorphism (SNP) alleles in a set of taxonomically and morphologically diverse and resistant C. pepo and C. moschata cultivars bred at Cornell University that, by common possession of Pm-0, form a shared-trait introgression panel. High marker density was achieved using genotyping-by-sequencing, which yielded over 50,000 de novo SNP markers in each of the three Cucurbita species genotyped. A single 516.4 kb wild-derived introgression was present in all of the resistant cultivars and absent in a diverse set of heirlooms that predated the Pm-0 introgression. The contribution of this interval to powdery mildew resistance was confirmed by association mapping in a C. pepo cultivar panel that included the Cornell lines, heirlooms, and 68 additional C. pepo cultivars and with an independent F2 population derived from C. okeechobeensis subsp. martinezii x C. moschata. The interval was refined to a final candidate interval of 76.4 kb and CAPS markers were developed inside this interval to facilitate marker-assisted selection. PMID:27936008
Liabeuf, Debora; Sim, Sung-Chur; Francis, David M
2018-03-01
Bacterial spot affects tomato crops (Solanum lycopersicum) grown under humid conditions. Major genes and quantitative trait loci (QTL) for resistance have been described, and multiple loci from diverse sources need to be combined to improve disease control. We investigated genomic selection (GS) prediction models for resistance to Xanthomonas euvesicatoria and experimentally evaluated the accuracy of these models. The training population consisted of 109 families combining resistance from four sources and directionally selected from a population of 1,100 individuals. The families were evaluated on a plot basis in replicated inoculated trials and genotyped with single nucleotide polymorphisms (SNPs). We compared the prediction ability of models developed with 14 to 387 SNPs. Genomic estimated breeding values (GEBV) were derived using Bayesian least absolute shrinkage and selection operator regression (BL) and ridge regression (RR). Evaluations were based on leave-one-out cross validation and on empirical observations in replicated field trials using the next generation of inbred progeny and a hybrid population resulting from selections in the training population. Prediction ability was evaluated based on correlations between GEBV and phenotypes (r_g), percentage of coselection between genomic and phenotypic selection, and relative efficiency of selection (r_g/r_p). Results were similar with BL and RR models. Two kinds of models offered greater accuracy and a high percentage of coselection: models using only markers previously identified as significantly associated with resistance but weighted based on GEBV, and mixed models with markers associated with resistance treated as fixed effects and markers distributed in the genome treated as random effects. The accuracy of these models in predicting the performance of progeny and hybrids exceeded the accuracy of phenotypic selection.
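Two of the evaluation metrics named in this abstract, the correlation between GEBV and phenotype and the percentage of coselection, can be sketched in a few lines. The abstract does not give the exact definitions used, so the top-fraction overlap below, the assumption that higher values are better, and all values are hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def coselection(gebv, phenotype, fraction=0.2):
    """Share of the top `fraction` by GEBV that is also in the top by phenotype.

    Assumes higher values are better on both scales (an assumption; a disease
    score where lower is better would need its sign flipped first).
    """
    n = len(gebv)
    k = max(1, round(n * fraction))

    def top(values):
        return set(sorted(range(n), key=lambda i: values[i], reverse=True)[:k])

    return len(top(gebv) & top(phenotype)) / k

# Hypothetical values for five families.
gebv = [0.9, 0.1, 0.8, 0.3, 0.5]
phenotype = [10.0, 2.0, 3.0, 9.0, 5.0]
print(pearson(gebv, phenotype), coselection(gebv, phenotype, fraction=0.4))
```

The relative efficiency r_g/r_p mentioned in the abstract would then be a ratio of two such correlations, one from genomic and one from phenotypic prediction.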
Qi, L L; Long, Y M; Jan, C C; Ma, G J; Gulya, T J
2015-04-01
Pl17, a novel downy mildew resistance gene independent of known downy mildew resistance genes in sunflowers, was genetically mapped to linkage group 4 of the sunflower genome. Downy mildew (DM), caused by Plasmopara halstedii (Farl.) Berl. et de Toni, is one of the serious sunflower diseases in the world due to its high virulence and the variability of the pathogen. DM resistance in the USDA inbred line, HA 458, has been shown to be effective against all virulent races of P. halstedii currently identified in the USA. To determine the chromosomal location of this resistance, 186 F2:3 families derived from a cross of HA 458 with HA 234 were phenotyped for their resistance to race 734 of P. halstedii. The segregation ratio of the population supported that the resistance was controlled by a single dominant gene, Pl17. Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) primers were used to identify molecular markers linked to Pl17. Bulked segregant analysis using 849 SSR markers located Pl17 to linkage group (LG) 4, which is the first DM gene discovered in this linkage group. An F2 population of 186 individuals was screened with polymorphic SSR and SNP primers from LG4. Two flanking markers, SNP SFW04052 and SSR ORS963, delineated Pl17 in an interval of 3.0 cM. The markers linked to Pl17 were validated in a BC3 population. A search for the physical location of flanking markers in sunflower genome sequences revealed that the Pl17 region had a recombination frequency of 0.59 Mb/cM, which was a fourfold higher recombination rate relative to the genomic average. This region can be considered amenable to molecular manipulation for further map-based cloning of Pl17.
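The statement that the segregation ratio supports a single dominant gene is usually backed by a chi-square goodness-of-fit test. A minimal sketch follows, assuming the expected 1:2:1 ratio (homozygous resistant : segregating : homozygous susceptible) for F2:3 families. The family counts are hypothetical, since the abstract reports only the total of 186.

```python
from math import exp

def chi_square_gof(observed, ratio):
    """Chi-square goodness-of-fit statistic for counts against an expected ratio."""
    total = sum(observed)
    ratio_sum = sum(ratio)
    expected = [total * r / ratio_sum for r in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

# Hypothetical counts of resistant, segregating, and susceptible families.
observed = [45, 95, 46]
stat = chi_square_gof(observed, ratio=[1, 2, 1])

# With three classes there are 2 degrees of freedom, for which the
# upper-tail probability has the closed form exp(-x / 2).
p_value = exp(-stat / 2)
print(stat, p_value)
```

A p-value well above 0.05 means the counts do not depart significantly from 1:2:1, which is what a single-gene model predicts.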
Børsting, Claus; Morling, Niels
2012-02-01
In some relationship cases, the initial investigations of autosomal short tandem repeats (STRs) lead to an ambiguous conclusion and supplementary investigations become necessary. Six unusual paternity cases were previously investigated by other researchers and published as case work examples in forensic journals. Here, the cases were reinvestigated by typing the samples for 49 autosomal single-nucleotide polymorphisms (SNPs) using the SNPforID multiplex assay. Three cases were solved by the SNP investigation without the need for any additional testing. In two cases, the SNP results supported the conclusions based on STRs. In the last case, the SNP results spoke in favor of paternity, and the combined paternity index based on autosomal STRs and SNPs was 12.3 billion. Nevertheless, the alleged father was excluded by X-chromosome typing. The case work examples underline the importance of performing supplementary investigations, and they advocate for the implementation of several panels that may be used in the highly unusual cases. Panels with SNPs or other markers with low mutation probabilities are preferable as supplementary markers, because the risk of detecting (additional) mutations is very low. © 2012 American Association of Blood Banks.
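The combined paternity index quoted above is conventionally the product of the per-locus paternity indices, assuming the loci are independent. The sketch below shows that product and the conversion to a posterior probability under a prior of 0.5. The per-locus values are hypothetical, and the abstract does not describe how linkage or mutation was handled.

```python
from math import prod

def combined_paternity_index(locus_indices):
    """Product of per-locus paternity indices (assumes independent loci)."""
    return prod(locus_indices)

def probability_of_paternity(cpi, prior=0.5):
    """Posterior probability of paternity from a combined index and a prior."""
    return cpi * prior / (cpi * prior + (1.0 - prior))

# Hypothetical per-locus indices for three markers.
cpi = combined_paternity_index([2.0, 1.5, 4.0])
print(cpi, probability_of_paternity(cpi))

# The combined index reported in the abstract for the last case.
print(probability_of_paternity(12.3e9))
```

As the last case in the abstract shows, even a very large index is not proof: the alleged father was still excluded by X-chromosome typing.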
Hernández, N; Martínez-González, J C; Parra-Bracamonte, G M; Sifuentes-Rincón, A M; López-Villalobos, N; Morris, S T; Briones-Encinia, F; Ortega-Rivas, E; Pacheco-Contreras, V I; Meza-García, L A
2016-09-02
Polymorphisms in candidate genes can produce significant and favorable changes in the phenotype, and therefore are useful for the identification of the best combination of favorable variants for marker-assisted selection. In the present study, an assessment to evaluate the effect of 11 single nucleotide polymorphisms (SNPs) in candidate genes on live weight traits of registered Brahman cattle was performed. Data from purebred bulls were used in this assessment. The dataset included birth (BW), weaning (WW), and yearling (YW) weights. A panel of 11 SNP markers, selected by their formerly reported or apparent direct and indirect association with live weight traits, was included in an assessment previously confirming their minimum allele frequency (<0.05). Live weights were adjusted BW (aBW), WW (aWW), and YW (aYW) using a generalized linear model, which included the fixed effects of herd and season of birth and the random effect of the sire and year of birth. An SNP in a growth hormone gene (GH4.1) was significantly related to aWW (P = 0.035) with an estimated substitution effect of 3.97 kg (P = 0.0210). In addition, a leptin SNP (LEPg.978) was significantly associated with aYW (P = 0.003) with an estimated substitution effect of 9.57 kg (P = 0.0007). The results suggest that markers GH4.1 and LEPg.978 can be considered as candidate loci for assisted genetic improvement programs in Mexican Brahman cattle.
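An allele substitution effect such as the 3.97 kg and 9.57 kg values above is, in its simplest form, the slope of the phenotype regressed on the number of copies of one allele. The sketch below shows only that simplified idea. The study itself used a generalized linear model with herd, season, sire, and year effects, which is not reproduced here, and the data are hypothetical.

```python
def substitution_effect(allele_counts, phenotypes):
    """Least-squares slope of phenotype on allele count (0, 1, or 2 copies)."""
    n = len(allele_counts)
    if n != len(phenotypes) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(allele_counts) / n
    mean_y = sum(phenotypes) / n
    sxx = sum((x - mean_x) ** 2 for x in allele_counts)
    if sxx == 0:
        raise ValueError("all animals carry the same genotype")
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(allele_counts, phenotypes))
    return sxy / sxx

# Hypothetical adjusted weaning weights (kg) for six animals.
counts = [0, 1, 2, 1, 0, 2]
weights = [200.0, 204.0, 208.0, 204.0, 200.0, 208.0]
print(substitution_effect(counts, weights))  # kg per extra copy of the allele
```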
Vieira, Alexandre R.; McHenry, Toby G.; Daack-Hirsch, Sandra; Murray, Jeffrey C.; Marazita, Mary L.
2009-01-01
We revisited 42 families with two or more cleft affected siblings that participated in previous studies and collected complete dental information. Genotypes from 1489 single nucleotide polymorphism (SNP) markers located in 150 candidate genes/loci were reanalyzed. Two sets of association analyses were carried out. First we ran the analysis solely on the cleft status. Second we assigned affection to any cleft or dental anomaly (tooth agenesis, supernumerary teeth, and microdontia), and repeated the analysis. Significant over-transmission was seen for a SNP in ANKS6 (rs4742741, 9q22.33; p=0.0004) when a dental anomaly phenotype was included in the analysis. Significant over-transmission was also seen for a SNP in ERBB2 (rs1810132, 17q21.1; p=0.0006). In the clefts only data, the most significant result was also for ERBB2 (p=0.0006). Other markers with suggestive p-values included IRF6 and 6q21-q23 loci. In contrast to the above results, suggestive over-transmission of markers in GART, DPF3, and NRXN3 were seen only when the dental anomaly phenotype was included in the analysis. These findings support the hypothesis that some loci may contribute to both clefts and congenital dental anomalies. Thus, including dental anomalies information in the genetics analysis of cleft lip and palate will provide new opportunities to map susceptibility loci for clefts. PMID:18978678
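Over-transmission of an allele in family data is often assessed with the transmission disequilibrium test (TDT). The abstract does not name the test used, so the TDT here is an assumed stand-in, and the transmission counts are hypothetical.

```python
from math import erfc, sqrt

def tdt(transmitted, untransmitted):
    """TDT statistic and p-value from heterozygous-parent transmissions.

    transmitted / untransmitted: how often heterozygous parents did or did not
    pass the allele of interest to an affected child.
    """
    informative = transmitted + untransmitted
    if informative == 0:
        raise ValueError("no informative transmissions")
    stat = (transmitted - untransmitted) ** 2 / informative
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = erfc(sqrt(stat / 2))
    return stat, p_value

# Hypothetical counts: the allele was transmitted 30 times and withheld 10 times.
stat, p_value = tdt(30, 10)
print(stat, p_value)
```

Only heterozygous parents are informative here, which is why family-based tests of this kind resist confounding by population structure.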
Chopra, Ratan; Burow, Gloria; Farmer, Andrew; Mudge, Joann; Simpson, Charles E; Wilkins, Thea A; Baring, Michael R; Puppala, Naveen; Chamberlin, Kelly D; Burow, Mark D
2015-06-01
Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker-trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. Copy DNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, comprising 43,108 contigs; 263,840 SNP and indel variants were identified among the four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.
Littlejohn, Mathew D; Turner, Sally-Anne; Walker, Caroline G; Berry, Sarah D; Tiplady, Kathryn; Sherlock, Ric G; Sutherland, Greg; Swift, Simon; Garrick, Dorian; Lacy-Hulbert, S Jane; McDougall, Scott; Spelman, Richard J; Snell, Russell G; Hillerton, J Eric
2018-05-01
Inflammation of the mammary gland following bacterial infection, commonly known as mastitis, affects all mammalian species. Although the aetiology and epidemiology of mastitis in the dairy cow are well described, the genetic factors mediating resistance to mammary gland infection are not well known, due in part to the difficulty in obtaining robust phenotypic information from sufficiently large numbers of individuals. To address this problem, a mammary gland infection experiment was undertaken using a Friesian-Jersey crossbred F2 herd. A total of 604 animals received an intramammary infusion of Streptococcus uberis in one gland, and the clinical response over 13 milkings was used for linkage mapping and genome-wide association analysis. A quantitative trait locus (QTL) was detected on bovine chromosome 11 for clinical mastitis status using microsatellite and Affymetrix 10 K SNP markers, and exome and genome sequence data from the six F1 sires of the experimental animals were then used to examine this region in more detail. A total of 485 sequence variants were typed in the QTL interval, and association mapping using these and an additional 37 986 genome-wide markers from the Illumina SNP50 bovine SNP panel revealed association with markers encompassing the interleukin-1 gene cluster locus. This study highlights a region on bovine chromosome 11, consistent with earlier studies, as conferring resistance to experimentally induced mammary gland infection, and newly prioritises the IL1 gene cluster for further analysis in genetic resistance to mastitis.
Chono, Makiko; Matsunaka, Hitoshi; Seki, Masako; Fujita, Masaya; Kiribuchi-Otobe, Chikako; Oda, Shunsuke; Kojima, Hisayo; Nakamura, Shingo
2015-01-01
In the wheat (Triticum aestivum L.) cultivar ‘Zenkoujikomugi’, a single nucleotide polymorphism (SNP) in the promoter of MOTHER OF FT AND TFL1 on chromosome 3A (MFT-3A) causes an increase in the level of gene expression, resulting in strong grain dormancy. We used a DNA marker to detect the ‘Zenkoujikomugi’-type (Zen-type) SNP and examined the genotype of MFT-3A in Japanese wheat varieties, and we found that 169 of 324 varieties carry the Zen-type SNP. In Japanese commercial varieties, the frequency of the Zen-type SNP was remarkably high in the southern part of Japan, but low in the northern part. To examine the relationship between MFT-3A genotype and grain dormancy, we performed a germination assay in three wheat-growing seasons. On average, the varieties carrying the Zen-type SNP showed stronger grain dormancy than the varieties carrying the non-Zen-type SNP. Among commercial cultivars, ‘Iwainodaichi’ (Kyushu), ‘Junreikomugi’ (Kinki-Chugoku-Shikoku), ‘Kinuhime’ (Kanto-Tokai), ‘Nebarigoshi’ (Tohoku-Hokuriku), and ‘Kitamoe’ (Hokkaido) showed the strongest grain dormancy in each geographical group, and all these varieties, except for ‘Kitamoe’, were found to carry the Zen-type SNP. In recent years, the number of varieties carrying the Zen-type SNP has increased in the Tohoku-Hokuriku region, but not in the Hokkaido region. PMID:25931984
2012-01-01
Background Tocopherols, which are vitamin E compounds, play an important role in maintaining human health. Compared with other staple foods, maize grains contain a high level of tocopherols. Results Two F2 populations (K22/CI7 and K22/Dan340, referred to as POP-1 and POP-2, respectively), which share a common parent (K22), were developed and genotyped using a GoldenGate assay containing 1,536 single nucleotide polymorphism (SNP) markers. An integrated genetic linkage map was constructed using 619 SNP markers, spanning a total of 1649.03 cM of the maize genome with an average interval of 2.67 cM. Seventeen quantitative trait loci (QTLs) for all the traits were detected in the first map and 13 in the second. In these two maps, QTLs for different traits were localized to the same genomic regions and some were co-located with candidate genes in the tocopherol biosynthesis pathway. Individual QTLs were responsible for 3.03% to 52.75% of the phenotypic variation and the QTLs in sum explained 23.4% to 66.52% of the total phenotypic variation. A major QTL (qc5-1/qd5-1) affecting α-tocopherol (αT) was identified on chromosome 5 between PZA03161.1 and PZA02068.1 in POP-2. The QTL region was narrowed down from 18.7 Mb to 5.4 Mb by estimating the recombination using high-density markers of the QTL region. This allowed the identification of the candidate gene VTE4, which encodes γ-tocopherol methyltransferase, an enzyme that transforms γ-tocopherol (γT) to αT. Conclusions These results demonstrate that a few QTLs with major effects and several QTLs with medium to minor effects might contribute to the natural variation of tocopherols in maize grain. The high-density markers will help to fine map and identify the QTLs with major effects even in the preliminary segregating populations. Furthermore, this study provides a simple guideline for breeders to improve traits that minimize the risk of malnutrition, especially in developing countries. PMID:23122295
Regularized quantile regression for SNP marker estimation of pig growth curves.
Barroso, L M A; Nascimento, M; Nascimento, A C C; Silva, F F; Serão, N V L; Cruz, C D; Resende, M D V; Silva, F L; Azevedo, C F; Lopes, P S; Guimarães, S E F
2017-01-01
Genomic growth curves are generally defined only in terms of population mean; an alternative approach that has not yet been exploited in genomic analyses of growth curves is the Quantile Regression (QR). This methodology allows for the estimation of marker effects at different levels of the variable of interest. We aimed to propose and evaluate a regularized quantile regression for SNP marker effect estimation of pig growth curves, as well as to identify the chromosome regions of the most relevant markers and to estimate the genetic individual weight trajectory over time (genomic growth curve) under different quantiles (levels). The regularized quantile regression (RQR) enabled the discovery, at different levels of interest (quantiles), of the most relevant markers allowing for the identification of QTL regions. We found the same relevant markers simultaneously affecting different growth curve parameters (mature weight and maturity rate): two (ALGA0096701 and ALGA0029483) for RQR(0.2), one (ALGA0096701) for RQR(0.5), and one (ALGA0003761) for RQR(0.8). Three average genomic growth curves were obtained and the behavior was explained by the curve in quantile 0.2, which differed from the others. RQR allowed for the construction of genomic growth curves, which is the key to identifying and selecting the most desirable animals for breeding purposes. Furthermore, the proposed model enabled us to find, at different levels of interest (quantiles), the most relevant markers for each trait (growth curve parameter estimates) and their respective chromosomal positions (identification of new QTL regions for growth curves in pigs). These markers can be exploited under the context of marker assisted selection while aiming to change the shape of pig growth curves.
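As a reading aid for the abstract above, the estimator can be written out as a worked equation. This is a sketch only: the abstract does not state which penalty was used, so the L1 (LASSO-type) penalty below is an assumption, as are the symbols $y_i$ (growth curve parameter estimate for animal $i$), $x_i$ (its SNP genotype vector), $\tau$ (quantile, e.g. 0.2, 0.5, 0.8) and $\lambda$ (regularization weight).

```latex
% Regularized quantile regression for marker effects at quantile \tau
% (L1 penalty assumed; the abstract does not specify the penalty form)
\hat{\beta}(\tau) \;=\; \arg\min_{\beta}\;
  \sum_{i=1}^{n} \rho_{\tau}\!\left(y_i - x_i^{\top}\beta\right)
  \;+\; \lambda \sum_{j=1}^{p} \lvert \beta_j \rvert ,
\qquad
\rho_{\tau}(u) \;=\; u\,\bigl(\tau - \mathbb{1}[u < 0]\bigr).
```

With $\tau = 0.5$ the check loss $\rho_\tau$ reduces to half the absolute error, so RQR(0.5) corresponds to a penalized median regression.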
USDA-ARS's Scientific Manuscript database
Selection of the composite MARC III population for markers allowed better estimates of effects and inheritance of markers for targeted carcass quality traits (n=254) and nontargeted traits and an evaluation of SNP specific residual variance models for tenderness. Genotypic effects of CAPN1 haplotyp...
Genomic Prediction of Testcross Performance in Canola (Brassica napus)
Jan, Habib U.; Abbadi, Amine; Lücke, Sophie; Nichols, Richard A.; Snowdon, Rod J.
2016-01-01
Genomic selection (GS) is a modern breeding approach where genome-wide single-nucleotide polymorphism (SNP) marker profiles are simultaneously used to estimate performance of untested genotypes. In this study, the potential of genomic selection methods to predict testcross performance for hybrid canola breeding was assessed for various agronomic traits based on genome-wide marker profiles. A total of 475 genetically diverse spring-type canola pollinator lines were genotyped at 24,403 single-copy, genome-wide SNP loci. In parallel, the 950 F1 testcross combinations between the pollinators and two representative testers were evaluated for a number of important agronomic traits including seedling emergence, days to flowering, lodging, oil yield and seed yield along with essential seed quality characters including seed oil content and seed glucosinolate content. A ridge-regression best linear unbiased prediction (RR-BLUP) model was applied in combination with 500 cross-validations for each trait to predict testcross performance, both across the whole population as well as within individual subpopulations or clusters, based solely on SNP profiles. Subpopulations were determined using multidimensional scaling and K-means clustering. Genomic prediction accuracy across the whole population was highest for seed oil content (0.81) followed by oil yield (0.75) and lowest for seedling emergence (0.29). For seed yield, seed glucosinolate, lodging resistance and days to onset of flowering (DTF), prediction accuracies were 0.45, 0.61, 0.39 and 0.56, respectively. Prediction accuracies could be increased for some traits by treating subpopulations separately; a strategy which only led to moderate improvements for some traits with low heritability, like seedling emergence. No useful or consistent increase in accuracy was obtained by inclusion of a population substructure covariate in the model.
Testcross performance prediction using genome-wide SNP markers shows considerable potential for pre-selection of promising hybrid combinations prior to resource-intensive field testing over multiple locations and years. PMID:26824924
A genome-wide association study of social genetic effects in Landrace pigs.
Hong, Joon Ki; Jeong, Yong Dae; Cho, Eun Seok; Choi, Tae Jeong; Kim, Yong Min; Cho, Kyu Ho; Lee, Jae Bong; Lim, Hyun Tae; Lee, Deuk Hwan
2018-06-01
The genetic effects of an individual on the phenotypes of its social partners, such as its pen mates, are known as social genetic effects. This study aims to identify the candidate genes for social (pen-mates') average daily gain (ADG) in pigs by using the genome-wide association approach. Social ADG (sADG) was the average ADG of unrelated pen-mates (strangers). We used the phenotype data (16,802 records) after correcting for batch (week), sex, pen, number of strangers (1 to 7 pigs) in the pen, full-sib rate (0% to 80%) within pen, and age at the end of the test. A total of 1,041 pigs from Landrace breeds were genotyped using the Illumina PorcineSNP60 v2 BeadChip panel, which comprised 61,565 single nucleotide polymorphism (SNP) markers. After quality control, 909 individuals and 39,837 markers remained for sADG in the genome-wide association study. We detected five new SNPs, all on chromosome 6, which have not been associated with social ADG or other growth traits to date. One SNP was inside the prostaglandin F2α receptor (PTGFR) gene, another SNP was located 22 kb upstream of the interferon-induced protein 44 (IFI44) gene, and the last three SNPs were between 161 kb and 191 kb upstream of the EGF latrophilin and seven transmembrane domain-containing protein 1 (ELTD1) gene. PTGFR, IFI44, and ELTD1 have not been associated with social interaction or social genetic effects in any previous study. The identification of several genomic regions and candidate genes associated with social genetic effects reported here could contribute to a better understanding of the genetic basis of interaction traits for ADG. In conclusion, we suggest that PTGFR, IFI44, and ELTD1 may be used as molecular markers for sADG, although their functional effect has not yet been defined. Thus, it will be of interest to execute association studies in those genes.
Accuracy of direct genomic values in Holstein bulls and cows using subsets of SNP markers
2010-01-01
Background At the current price, the use of high-density single nucleotide polymorphisms (SNP) genotyping assays in genomic selection of dairy cattle is limited to applications involving elite sires and dams. The objective of this study was to evaluate the use of low-density assays to predict direct genomic value (DGV) on five milk production traits, an overall conformation trait, a survival index, and two profit index traits (APR, ASI). Methods Dense SNP genotypes were available for 42,576 SNP for 2,114 Holstein bulls and 510 cows. A subset of 1,847 bulls born between 1955 and 2004 was used as a training set to fit models with various sets of pre-selected SNP. A group of 297 bulls born between 2001 and 2004 and all cows born between 1992 and 2004 were used to evaluate the accuracy of DGV prediction. Ridge regression (RR) and partial least squares regression (PLSR) were used to derive prediction equations and to rank SNP based on the absolute value of the regression coefficients. Four alternative strategies were applied to select subset of SNP, namely: subsets of the highest ranked SNP for each individual trait, or a single subset of evenly spaced SNP, where SNP were selected based on their rank for ASI, APR or minor allele frequency within intervals of approximately equal length. Results RR and PLSR performed very similarly to predict DGV, with PLSR performing better for low-density assays and RR for higher-density SNP sets. When using all SNP, DGV predictions for production traits, which have a higher heritability, were more accurate (0.52-0.64) than for survival (0.19-0.20), which has a low heritability. The gain in accuracy using subsets that included the highest ranked SNP for each trait was marginal (5-6%) over a common set of evenly spaced SNP when at least 3,000 SNP were used. Subsets containing 3,000 SNP provided more than 90% of the accuracy that could be achieved with a high-density assay for cows, and 80% of the high-density assay for young bulls. 
Conclusions Accurate genomic evaluation of the broader bull and cow population can be achieved with a single genotyping assay containing ~3,000 to 5,000 evenly spaced SNP. PMID:20950478
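The "evenly spaced" subset strategy described in this abstract can be illustrated with a short sketch. It is not the authors' implementation: the function name `pick_evenly_spaced`, the tuple layout and the toy scores are hypothetical, and the score could stand for any ranking the abstract mentions (absolute regression coefficient for a trait, or minor allele frequency).

```python
def pick_evenly_spaced(snps, n_intervals):
    """Keep the top-scoring SNP in each of n_intervals equal-length intervals.

    snps: list of (name, position_bp, score) tuples on one chromosome
          (hypothetical layout chosen for this illustration).
    Returns SNP names ordered by interval; empty intervals are skipped.
    """
    positions = [pos for _, pos, _ in snps]
    lo, hi = min(positions), max(positions)
    # Interval width; fall back to 1.0 if all SNPs share one position.
    width = (hi - lo) / n_intervals or 1.0
    best = {}
    for name, pos, score in snps:
        # The last SNP sits exactly on the upper edge, so clamp its index.
        idx = min(int((pos - lo) / width), n_intervals - 1)
        if idx not in best or score > best[idx][2]:
            best[idx] = (name, pos, score)
    return [best[i][0] for i in sorted(best)]


# Toy data: five SNPs on a 100 bp segment, scored for example by MAF.
toy_snps = [
    ("s1", 0, 0.10),
    ("s2", 10, 0.40),
    ("s3", 40, 0.30),
    ("s4", 55, 0.50),
    ("s5", 100, 0.20),
]
selected = pick_evenly_spaced(toy_snps, n_intervals=2)
print(selected)
```

Choosing one SNP per interval keeps genome coverage uniform, which is the property the abstract credits for a common low-density panel performing close to trait-specific subsets.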
Multiplex-Ready Technology for mid-throughput genotyping of molecular markers.
Bonneau, Julien; Hayden, Matthew
2014-01-01
Screening molecular markers across large populations in breeding programs is generally time consuming and expensive. The Multiplex-Ready Technology (MRT) (Hayden et al., BMC Genomics 9:80, 2008) was created to optimize polymorphism screening and genotyping using standardized PCR reaction conditions. The flexibility of this method maximizes the number of markers screened (up to 24 SSR or SNP markers, ideally highly polymorphic and with small PCR products of <500 bp) by combining fluorescent dyes (VIC, FAM, NED, and PET) with capillary electrophoresis on a semiautomated DNA fragment analyzer (ABI3730) for large numbers of DNA samples (96 or 384 samples).
Isolation of New Gravitropic Mutants under Hypergravity Conditions
Mori, Akiko; Toyota, Masatsugu; Shimada, Masayoshi; Mekata, Mika; Kurata, Tetsuya; Tasaka, Masao; Morita, Miyo T.
2016-01-01
Forward genetics is a powerful approach used to link genotypes and phenotypes, and mutant screening/analysis has provided deep insights into many aspects of plant physiology. Gravitropism is a tropistic response in plants, in which hypocotyls and stems sense the direction of gravity and grow upward. Previous studies of gravitropic mutants have suggested that shoot endodermal cells in Arabidopsis stems and hypocotyls are capable of sensing gravity (i.e., statocytes). In the present study, we report a new screening system using hypergravity conditions to isolate enhancers of gravitropism mutants, and we also describe a rapid and efficient genome mapping method, using next-generation sequencing (NGS) and single nucleotide polymorphism (SNP)-based markers. Using the endodermal-amyloplast less 1 (eal1) mutant, which exhibits defective development of endodermal cells and gravitropism, we found that hypergravity (10 g) restored the reduced gravity responsiveness in eal1 hypocotyls and could, therefore, be used to obtain mutants with further reduction in gravitropism in the eal1 background. Using the new screening system, we successfully isolated six ene (enhancer of eal1) mutants that exhibited little or no gravitropism under hypergravity conditions, and using NGS and map-based cloning with SNP markers, we narrowed down the potential causative genes, which revealed a new genetic network for shoot gravitropism in Arabidopsis. PMID:27746791
KinSNP software for homozygosity mapping of disease genes using SNP microarrays.
Amir, El-Ad David; Bartal, Ofer; Morad, Efrat; Nagar, Tal; Sheynin, Jony; Parvari, Ruti; Chalifa-Caspi, Vered
2010-08-01
Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from.
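The block search summarized in this abstract (stretches of SNPs homozygous for the same allele in all affected individuals, with a user-set tolerance for genotyping "errors") can be sketched as follows. This is an illustration under stated assumptions and not KinSNP's code: the function names, the two-letter genotype strings and the parameters `min_len` and `max_errors` are hypothetical.

```python
def is_shared_homozygous(column):
    """True if every affected individual has the same homozygous genotype."""
    first = column[0]
    return len(first) == 2 and first[0] == first[1] and all(g == first for g in column)


def shared_homozygous_blocks(genotypes, min_len=3, max_errors=0):
    """Find runs of SNPs homozygous for the same allele in all individuals.

    genotypes: one list per affected individual, each holding genotype
               strings such as "AA", "AB", "BB" in chromosomal order.
    max_errors: inconsistent SNPs tolerated inside a block.
    Returns (start_index, end_index) pairs, inclusive.
    """
    n_snps = len(genotypes[0])
    flags = [is_shared_homozygous([ind[i] for ind in genotypes]) for i in range(n_snps)]
    blocks = []
    i = 0
    while i < n_snps:
        if not flags[i]:
            i += 1
            continue
        start, end, errors = i, i, 0
        j = i + 1
        while j < n_snps:
            if flags[j]:
                end = j  # a block always ends on a consistent SNP
            else:
                errors += 1
                if errors > max_errors:
                    break
            j += 1
        if end - start + 1 >= min_len:
            blocks.append((start, end))
        i = end + 1
    return blocks


# Two affected relatives typed at six SNPs (toy data).
affected = [
    ["AA", "BB", "AB", "AA", "AA", "AB"],
    ["AA", "BB", "AA", "AA", "AA", "BB"],
]
strict = shared_homozygous_blocks(affected, min_len=2, max_errors=0)
tolerant = shared_homozygous_blocks(affected, min_len=2, max_errors=1)
print(strict, tolerant)
```

Allowing one mismatch merges the two short runs into a single candidate region, which is why the error tolerance matters on dense arrays where occasional miscalls would otherwise fragment a true homozygous segment.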
Characterization and mapping of leaf rust resistance in four durum wheat cultivars
Kthiri, Dhouha; Loladze, Alexander; MacLachlan, P. R.; N’Diaye, Amidou; Walkowiak, Sean; Nilsen, Kirby; Dreisigacker, Susanne; Ammar, Karim; Pozniak, Curtis J.
2018-01-01
Widening the genetic basis of leaf rust resistance is a primary objective of the global durum wheat breeding effort at the International Wheat and Maize Improvement Center (CIMMYT). Breeding programs in North America are following suit, especially after the emergence of new races of Puccinia triticina such as BBG/BP and BBBQD in Mexico and the United States, respectively. This study was conducted to characterize and map previously undescribed genes for leaf rust resistance in durum wheat and to develop reliable molecular markers for marker-assisted breeding. Four recombinant inbred line (RIL) mapping populations derived from the resistance sources Amria, Byblos, Geromtel_3 and Tunsyr_2, which were crossed to the susceptible line ATRED #2, were evaluated for their reaction to the Mexican race BBG/BP of P. triticina. Genetic analyses of host reactions indicated that leaf rust resistance in these genotypes was based on major seedling resistance genes. Allelism tests among resistant parents supported that Amria and Byblos carried allelic or closely linked genes. The resistance in Geromtel_3 and Tunsyr_2 also appeared to be allelic. Bulked segregant analysis using the Infinium iSelect 90K single nucleotide polymorphism (SNP) array identified two genomic regions for leaf rust resistance; one on chromosome 6BS for Geromtel_3 and Tunsyr_2 and the other on chromosome 7BL for Amria and Byblos. Polymorphic SNPs identified within these regions were converted to kompetitive allele-specific PCR (KASP) assays and used to genotype the RIL populations. KASP markers usw215 and usw218 were the closest to the resistance genes in Geromtel_3 and Tunsyr_2, while usw260 was closely linked to the resistance genes in Amria and Byblos. DNA sequences associated with these SNP markers were anchored to the wild emmer wheat (WEW) reference sequence, which identified several candidate resistance genes. 
The molecular markers reported herein will be useful to effectively pyramid these resistance genes with other previously marked genes into adapted, elite durum wheat genotypes. PMID:29746580
Hess, Jon E; Campbell, Nathan R; Docker, Margaret F; Baker, Cyndi; Jackson, Aaron; Lampman, Ralph; McIlraith, Brian; Moser, Mary L; Statler, David P; Young, William P; Wildbill, Andrew J; Narum, Shawn R
2015-01-01
Next-generation sequencing data can be mined for highly informative single nucleotide polymorphisms (SNPs) to develop high-throughput genomic assays for nonmodel organisms. However, choosing a set of SNPs to address a variety of objectives can be difficult because SNPs are often not equally informative. We developed an optimal combination of 96 high-throughput SNP assays from a total of 4439 SNPs identified in a previous study of Pacific lamprey (Entosphenus tridentatus) and used them to address four disparate objectives: parentage analysis, species identification and characterization of neutral and adaptive variation. Nine of these SNPs are FST outliers, and five of these outliers are localized within genes and significantly associated with geography, run-timing and dwarf life history. Two of the 96 SNPs were diagnostic for two other lamprey species that were morphologically indistinguishable at early larval stages and were sympatric in the Pacific Northwest. The majority (85) of SNPs in the panel were highly informative for parentage analysis, that is, putatively neutral with high minor allele frequency across the species' range. Results from three case studies are presented to demonstrate the broad utility of this panel of SNP markers in this species. As Pacific lamprey populations are undergoing rapid decline, these SNPs provide an important resource to address critical uncertainties associated with the conservation and recovery of this imperiled species. © 2014 John Wiley & Sons Ltd.
Genomic selection for slaughter age in pigs using the Cox frailty model.
Santos, V S; Martins Filho, S; Resende, M D V; Azevedo, C F; Lopes, P S; Guimarães, S E F; Glória, L S; Silva, F F
2015-10-19
The aim of this study was to compare genomic selection methodologies using a linear mixed model and the Cox survival model. We used data from an F2 population of pigs, in which the response variable was the time in days from birth to the culling of the animal and the covariates were 238 markers [237 single nucleotide polymorphism (SNP) plus the halothane gene]. The data were corrected for fixed effects, and the accuracy of the method was determined based on the correlation of the ranks of predicted genomic breeding values (GBVs) in both models with the corrected phenotypic values. The analysis was repeated with a subset of SNP markers with largest absolute effects. The results were in agreement with the GBV prediction and the estimation of marker effects for both models for uncensored data and for normality. However, when considering censored data, the Cox model with a normal random effect (S1) was more appropriate. Since there was no agreement between the linear mixed model and the imputed data (L2) for the prediction of genomic values and the estimation of marker effects, the model S1 was considered superior as it took into account the latent variable and the censored data. Marker selection increased correlations between the ranks of predicted GBVs by the linear and Cox frailty models and the corrected phenotypic values, and 120 markers were required to increase the predictive ability for the characteristic analyzed.
Yáñez, J M; Naswa, S; López, M E; Bassini, L; Correa, K; Gilbey, J; Bernatchez, L; Norris, A; Neira, R; Lhorente, J P; Schnable, P S; Newman, S; Mileham, A; Deeb, N; Di Genova, A; Maass, A
2016-07-01
A considerable number of single nucleotide polymorphisms (SNPs) are required to elucidate genotype-phenotype associations and determine the molecular basis of important traits. In this work, we carried out de novo SNP discovery accounting for both genome duplication and genetic variation from American and European salmon populations. A total of 9 736 473 nonredundant SNPs were identified across a set of 20 fish by whole-genome sequencing. After applying six bioinformatic filtering steps, 200 K SNPs were selected to develop an Affymetrix Axiom® myDesign Custom Array. This array was used to genotype 480 fish representing wild and farmed salmon from Europe, North America and Chile. A total of 159 099 (79.6%) SNPs were validated as high quality based on clustering properties. A total of 151 509 validated SNPs showed a unique position in the genome. When comparing these SNPs against 238 572 markers currently available in two other Atlantic salmon arrays, only 4.6% of the SNPs overlapped with the panel developed in this study. This novel high-density SNP panel will be very useful for the dissection of economically and ecologically relevant traits, enhancing breeding programmes through genomic selection as well as supporting genetic studies in both wild and farmed populations of Atlantic salmon using high-resolution genomewide information. © 2016 John Wiley & Sons Ltd.
Natural Allelic Diversity, Genetic Structure and Linkage Disequilibrium Pattern in Wild Chickpea
Kujur, Alice; Das, Shouvik; Badoni, Saurabh; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.
2014-01-01
Characterization of natural allelic diversity and understanding the genetic structure and linkage disequilibrium (LD) pattern in wild germplasm accessions by large-scale genotyping of informative microsatellite and single nucleotide polymorphism (SNP) markers is requisite to facilitate chickpea genetic improvement. Large-scale validation and high-throughput genotyping of genome-wide physically mapped 478 genic and genomic microsatellite markers and 380 transcription factor gene-derived SNP markers using gel-based assay, fluorescent dye-labelled automated fragment analyser and matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass array have been performed. Outcome revealed their high genotyping success rate (97.5%) and existence of a high level of natural allelic diversity among 94 wild and cultivated Cicer accessions. High intra- and inter-specific polymorphic potential and wider molecular diversity (11–94%) along with a broader genetic base (13–78%) specifically in the functional genic regions of wild accessions was assayed by mapped markers. It suggested their utility in monitoring introgression and transferring target trait-specific genomic (gene) regions from wild to cultivated gene pool for the genetic enhancement. Distinct species/gene pool-wise differentiation, admixed domestication pattern, and differential genome-wide recombination and LD estimates/decay observed in a six structured population of wild and cultivated accessions using mapped markers further signifies their usefulness in chickpea genetics, genomics and breeding. PMID:25222488
Mdladla, K; Dzomba, E F; Huson, H J; Muchadeyi, F C
2016-08-01
The sustainability of goat farming in marginal areas of southern Africa depends on local breeds that are adapted to specific agro-ecological conditions. Unimproved non-descript goats are the main genetic resources used for the development of commercial meat-type breeds of South Africa. Little is known about genetic diversity and the genetics of adaptation of these indigenous goat populations. This study investigated the genetic diversity, population structure and breed relations, linkage disequilibrium, effective population size and persistence of gametic phase in goat populations of South Africa. Three locally developed meat-type breeds of the Boer (n = 33), Savanna (n = 31), Kalahari Red (n = 40), a feral breed of Tankwa (n = 25) and unimproved non-descript village ecotypes (n = 110) from four goat-producing provinces of the Eastern Cape, KwaZulu-Natal, Limpopo and North West were assessed using the Illumina Goat 50K SNP BeadChip assay. The proportion of SNPs with minor allele frequencies >0.05 ranged from 84.22% in the Tankwa to 97.58% in the Xhosa ecotype, with a mean of 0.32 ± 0.13 across populations. Principal components analysis, admixture and pairwise FST identified Tankwa as a genetically distinct population and supported clustering of the populations according to their historical origins. Genome-wide FST identified 101 markers potentially under positive selection in the Tankwa. Average linkage disequilibrium was highest in the Tankwa (r² = 0.25 ± 0.26) and lowest in the village ecotypes (r² range = 0.09 ± 0.12 to 0.11 ± 0.14). We observed an effective population size of <150 for all populations 13 generations ago. The estimated correlations for all breed pairs were lower than 0.80 at marker distances >100 kb with the exception of those in Savanna and Tswana populations.
This study highlights the high level of genetic diversity in South African indigenous goats as well as the utility of the genome-wide SNP marker panels in genetic studies of these populations. © 2016 Stichting International Foundation for Animal Genetics.
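Two of the summary statistics in this abstract, minor allele frequency (MAF) and pairwise linkage disequilibrium (r²), can be computed from array genotypes along the following lines. This is a minimal sketch with assumptions: genotypes are coded as 0/1/2 copies of one allele with `None` for missing calls, r² is taken as the squared Pearson correlation of those allele counts (a common estimate for unphased data, which may differ from the estimator used in the study), and all names and toy values are hypothetical.

```python
def minor_allele_frequency(genotypes):
    """MAF from allele counts (0, 1 or 2 per individual; None = missing)."""
    called = [g for g in genotypes if g is not None]
    p = sum(called) / (2 * len(called))
    return min(p, 1 - p)


def r_squared(a, b):
    """Squared Pearson correlation of allele counts at two SNPs."""
    pairs = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    if sxx == 0 or syy == 0:
        return 0.0  # a monomorphic SNP carries no LD information
    return sxy ** 2 / (sxx * syy)


# Toy genotypes for six animals at three SNPs.
snp_a = [0, 1, 2, 2, 1, 0]
snp_b = [0, 1, 2, 2, 1, 0]  # identical calls, so complete LD with snp_a
snp_c = [0, 0, 0, 1, 0, 0]  # rare allele

maf_a = minor_allele_frequency(snp_a)
maf_c = minor_allele_frequency(snp_c)
ld_ab = r_squared(snp_a, snp_b)
passes_filter = [m > 0.05 for m in (maf_a, maf_c)]  # threshold quoted in the abstract
print(maf_a, maf_c, ld_ab, passes_filter)
```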
Design and Characterization of a 52K SNP Chip for Goats
Tosser-Klopp, Gwenola; Bardou, Philippe; Bouchez, Olivier; Cabau, Cédric; Crooijmans, Richard; Dong, Yang; Donnadieu-Tonon, Cécile; Eggen, André; Heuven, Henri C. M.; Jamli, Saadiah; Jiken, Abdullah Johari; Klopp, Christophe; Lawley, Cynthia T.; McEwan, John; Martin, Patrice; Moreno, Carole R.; Mulsant, Philippe; Nabihoudine, Ibouniyamine; Pailhoux, Eric; Palhière, Isabelle; Rupp, Rachel; Sarry, Julien; Sayre, Brian L.; Tircazes, Aurélie; Jun Wang; Wang, Wen; Zhang, Wenguang
2014-01-01
The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50–60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years. PMID:24465974
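The selection criteria named in this abstract (even spacing on the genome and suitable minor allele frequency) can be sketched as a simple binning rule. This is an illustration only: the consortium's actual selection relied on Illumina design tools whose details are not given here, and the function name, bin size, MAF cut-off and toy positions are all assumptions.

```python
def pick_evenly_spaced(snps, chrom_length, spacing, min_maf=0.05):
    """Keep at most one SNP per bin of `spacing` bp: the candidate closest to
    the bin centre among those with MAF >= min_maf.
    `snps` is a list of (position_bp, maf) tuples for one chromosome."""
    chosen = []
    n_bins = -(-chrom_length // spacing)  # ceiling division
    for b in range(n_bins):
        start = b * spacing
        end = min(start + spacing, chrom_length)
        centre = (start + end) / 2
        candidates = [s for s in snps if start <= s[0] < end and s[1] >= min_maf]
        if candidates:
            chosen.append(min(candidates, key=lambda s: abs(s[0] - centre)))
    return chosen


# Hypothetical candidates on a 200 kb stretch, one target SNP per 50 kb.
toy_candidates = [
    (5_000, 0.30), (24_000, 0.02), (26_000, 0.10),
    (70_000, 0.25), (99_000, 0.40), (160_000, 0.01),
]
toy_selected = pick_evenly_spaced(toy_candidates, 200_000, 50_000)
```

Bins with no candidate above the MAF cut-off stay empty, which is one reason a final chip can hold fewer loci than the number of target positions.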
Howard, Nicholas P; van de Weg, Eric; Bedford, David S; Peace, Cameron P; Vanderzande, Stijn; Clark, Matthew D; Teh, Soon Li; Cai, Lichun; Luby, James J
2017-01-01
The apple (Malus×domestica) cultivar Honeycrisp has become important economically and as a breeding parent. An earlier study with SSR markers indicated the original recorded pedigree of ‘Honeycrisp’ was incorrect and ‘Keepsake’ was identified as one putative parent, the other being unknown. The objective of this study was to verify ‘Keepsake’ as a parent and identify and genetically describe the unknown parent and its grandparents. A multi-family based dense and high-quality integrated SNP map was created using the apple 8 K Illumina Infinium SNP array. This map was used alongside a large pedigree-connected data set from the RosBREED project to build extended SNP haplotypes and to identify pedigree relationships. ‘Keepsake’ was verified as one parent of ‘Honeycrisp’ and ‘Duchess of Oldenburg’ and ‘Golden Delicious’ were identified as grandparents through the unknown parent. Following this finding, siblings of ‘Honeycrisp’ were identified using the SNP data. Breeding records from several of these siblings suggested that the previously unreported parent is a University of Minnesota selection, MN1627. This selection is no longer available, but now is genetically described through imputed SNP haplotypes. We also present the mosaic grandparental composition of ‘Honeycrisp’ for each of its 17 chromosome pairs. This new pedigree and genetic information will be useful in future pedigree-based genetic studies to connect ‘Honeycrisp’ with other cultivars used widely in apple breeding programs. The created SNP linkage map will benefit future research using the data from the Illumina apple 8 and 20 K and Affymetrix 480 K SNP arrays. PMID:28243452
[Genetic diversity analysis of Andrographis paniculata in China based on SRAP and SNP].
Chen, Rong; Wang, Xiao-Yun; Song, Yu-Ning; Zhu, Yun-feng; Wang, Peng-liang; Li, Min; Zhong, Guo-Yue
2014-12-01
To reveal the genetic diversity of domestic Andrographis paniculata and its impact on quality, the genetic backgrounds of 103 samples from 7 provinces in China were analyzed using SRAP and SNP markers. Genetic structures of the A. paniculata populations were estimated with Powermarker V 3.25 and Mega 6.0 software, and polymorphic SNPs were identified with CodonCode Aligner software. The results showed that the genetic distances of domestic A. paniculata germplasm ranged from 0.01 to 0.09, and no polymorphic SNPs were discovered in coding sequence fragments of ent-copalyl diphosphate synthase. A. paniculata germplasm from various regions in China had poor genetic diversity. This phenomenon was closely related to strict self-fertilization and earlier introduction from the same origin. Therefore, genetic background had little impact on the variable quality of A. paniculata in the domestic market. Mutation breeding, polyploid breeding and molecular breeding were proposed as promising strategies in germplasm innovation.
Single-nucleotide polymorphism genotyping on optical thin-film biosensor chips.
Zhong, Xiao-Bo; Reynolds, Robert; Kidd, Judith R; Kidd, Kenneth K; Jenison, Robert; Marlar, Richard A; Ward, David C
2003-09-30
Single-nucleotide polymorphisms (SNPs) constitute the bulk of human genetic variation and provide excellent markers to identify genetic factors contributing to complex disease susceptibility. A rapid, sensitive, and inexpensive assay is important for large-scale SNP scoring. Here we report the development of a multiplex SNP detection system using silicon chips coated to create a thin-film optical biosensor. Allele-discriminating, aldehyde-labeled oligonucleotides are arrayed and covalently attached to a hydrazine-derivatized chip surface. Target sequences (e.g., PCR amplicons) then are hybridized in the presence of a mixture of biotinylated detector probes, one for each SNP, and a thermostable DNA ligase. After a stringent wash (0.01 M NaOH), ligation of biotinylated detector probes to perfectly matched capture oligomers is visualized as a color change on the chip surface (gold to blue/purple) after brief incubations with an anti-biotin IgG-horseradish peroxidase conjugate and a precipitable horseradish peroxidase substrate. Testing of PCR fragments is completed in 30-40 min. Up to several hundred SNPs can be assayed on a 36-mm² chip, and SNP scoring can be done by eye or with a simple digital-camera system. This assay is extremely robust, exhibits high sensitivity and specificity, and is format-flexible and economical. In studies of mutations associated with risk for venous thrombosis and genotyping/haplotyping of African-American samples, we document high-fidelity analysis with 0 misassignments in 500 assays performed in duplicate.
Mei, C G; Gui, L S; Fu, C Z; Wang, H C; Wang, J L; Cheng, G; Zan, L S
2015-08-07
Previous studies have shown that the cell death-inducing DFF45-like effector-C (CIDEC) gene is involved in lipid storage and energy metabolism, suggesting that it is a potential candidate gene that affects body measurement traits (BMTs) and meat quality traits (MQTs). The aim of this study was to identify polymorphisms of the bovine CIDEC gene and analyze their possible associations with BMTs and MQTs in 531 randomly selected Qinchuan cattle aged between 18 and 24 months. DNA sequencing and polymerase chain reaction-restriction fragment length polymorphism were employed to detect CIDEC single nucleotide polymorphisms (SNPs). We found five SNPs: two in exon 5 (SNP1, g.9815G>A and SNP2, g.9924C>T) and three in the 3'-untranslated region (SNP3, g.13281C>T; SNP4, g.13297A>G; and SNP5, g.13307G>A). SNP1 was a missense mutation that resulted in an arginine to glutamine amino acid change, and exhibited two genotypes (GG and AG). SNP2 was a synonymous mutation that exhibited three genotypes (CC, CT, and TT). SNP3, 4, and 5 were completely linked, and only exhibited two genotypes (CC-AA-GG and CT-AG-GA). We found significant associations between these polymorphisms and BMTs and MQTs (P < 0.05); GG, CT, and CT-AG-GA appeared to be the most beneficial genotypes. Therefore, CIDEC may affect BMTs and MQTs in Qinchuan cattle, and could be used in marker-assisted selection.
Zhang, Hong; Zhang, Lu; Wang, Changyou; Wang, Yajuan; Zhou, Xinli; Lv, Shikai; Liu, Xinlun; Kang, Zhensheng; Ji, Wanquan
2016-02-01
YrSM139-1B may be a new gene for effective resistance to stripe rust, and useful flanking markers for marker-assisted selection were developed. Stripe rust, caused by Puccinia striiformis f. sp. tritici, is an important foliar disease of wheat. Two dominant stripe rust resistance genes, YrSM139-1B and YrSM139-2D, were pyramided in bread wheat cultivar Shaanmai 139; one from wild emmer and the other from Thinopyrum intermedium. Three near-isogenic F7:8 line pairs (contrasting RILs), N122-1013R/S, N122-185R/S, and N122-1812R/S, independently derived from different F2 plants and differing at the YrSM139-1B locus, were generated from the cross Shaanmai 139 × Hu 901-19 through marker-assisted selection. A large F2:3 population from cross N122-1013R × N122-1013S was tested for stripe rust response and subjected to analysis with markers in the 1BS10-0.5 bin region using SSR, expressed sequence tag (EST) and site-specific sequence markers developed from the 90 K Illumina iSelect SNP array. Five EST-STS markers and four allele-specific PCR markers were mapped to the YrSM139-1B region. The 30.5 cM genetic map for YrSM139-1B consisted of nine markers, two of which were closer to YrSM139-1B than Xgwm273, which was used in producing the contrasting RIL pairs. Race response data and allelism tests showed that YrSM139-1B is different from Yr10, Yr15, and Yr24/26/CH42.
USDA-ARS's Scientific Manuscript database
Theobroma cacao is a tree cultivated in the tropics around the world for its seeds that are the source of both chocolate and cocoa butter. The cacao genome sequencing project initiated as a collaboration between USDA, Mars, Inc. and IBM has generated a great deal of transcriptome and genome sequenc...
Use of direct and iterative solvers for estimation of SNP effects in genome-wide selection
2010-01-01
The aim of this study was to compare iterative and direct solvers for estimation of marker effects in genomic selection. One iterative and two direct methods were used: Gauss-Seidel with Residual Update, Cholesky Decomposition and Gentleman-Givens rotations. To represent different scenarios with respect to the number of markers and of genotyped animals, a simulated data set divided into 25 subsets was used. The number of markers ranged from 1,200 to 5,925 and the number of animals ranged from 1,200 to 5,865. The methods were also applied to real data comprising 3,081 individuals genotyped for 45,181 SNPs. Results from simulated data showed that the iterative solver was substantially faster than the direct methods for larger numbers of markers. Use of a direct solver may allow for computing (co)variances of SNP effects. When applied to real data, performance of the iterative method varied substantially, depending on the level of ill-conditioning of the coefficient matrix. From results with real data, Gentleman-Givens rotations would be the method of choice in this particular application, as it provided an exact solution within a fairly reasonable time frame (less than two hours). It would indeed be the preferred method whenever computer resources allow its use. PMID:21637627
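The iterative solver named in this abstract, Gauss-Seidel with Residual Update, can be sketched for a ridge-type marker model. The sketch assumes the system (X'X + λI) b = X'y with no fixed effects and a single shrinkage parameter λ. The abstract does not spell out its model, so the function name, λ, the stopping rule and the toy data are illustrative assumptions.

```python
def gsru(X, y, lam, n_iter=1000, tol=1e-20):
    """Gauss-Seidel with residual update for (X'X + lam*I) b = X'y.
    X is a list of rows (animals) by columns (markers); y holds phenotypes."""
    n, p = len(X), len(X[0])
    b = [0.0] * p
    e = list(y)  # residuals y - Xb, which equal y while b is all zeros
    xtx = [sum(X[i][j] ** 2 for i in range(n)) for j in range(p)]
    for _ in range(n_iter):
        change = 0.0
        for j in range(p):
            # The right-hand side for marker j is built from current residuals,
            # so the full X'X matrix is never formed.
            rhs = sum(X[i][j] * e[i] for i in range(n)) + xtx[j] * b[j]
            new_bj = rhs / (xtx[j] + lam)
            delta = new_bj - b[j]
            for i in range(n):
                e[i] -= X[i][j] * delta  # keep residuals in step with b
            b[j] = new_bj
            change += delta * delta
        if change < tol:  # stop once a full sweep barely moves the solution
            break
    return b


# Hypothetical toy data: four animals, three markers coded 0/1/2.
X_toy = [[0, 1, 2], [1, 1, 0], [2, 0, 1], [1, 2, 1]]
y_toy = [1.0, 0.5, 2.0, 1.5]
b_hat = gsru(X_toy, y_toy, lam=1.0)
```

Each sweep costs on the order of animals × markers operations. That is consistent with the abstract's finding that the iterative solver scales better with marker number, and also with its caveat that convergence depends on how well-conditioned the coefficient matrix is.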
2009-01-01
Background Genomic selection (GS) uses molecular breeding values (MBV) derived from dense markers across the entire genome for selection of young animals. The accuracy of MBV prediction is important for a successful application of GS. Recently, several methods have been proposed to estimate MBV. Initial simulation studies have shown that these methods can accurately predict MBV. In this study we compared the accuracies and possible bias of five different regression methods in an empirical application in dairy cattle. Methods Genotypes of 7,372 SNP and highly accurate EBV of 1,945 dairy bulls were used to predict MBV for protein percentage (PPT) and a profit index (Australian Selection Index, ASI). Marker effects were estimated by least squares regression (FR-LS), Bayesian regression (Bayes-R), random regression best linear unbiased prediction (RR-BLUP), partial least squares regression (PLSR) and nonparametric support vector regression (SVR) in a training set of 1,239 bulls. Accuracy and bias of MBV prediction were calculated from cross-validation of the training set and tested against a test team of 706 young bulls. Results For both traits, FR-LS using a subset of SNP was significantly less accurate than all other methods which used all SNP. Accuracies obtained by Bayes-R, RR-BLUP, PLSR and SVR were very similar for ASI (0.39-0.45) and for PPT (0.55-0.61). Overall, SVR gave the highest accuracy. All methods resulted in biased MBV predictions for ASI, for PPT only RR-BLUP and SVR predictions were unbiased. A significant decrease in accuracy of prediction of ASI was seen in young test cohorts of bulls compared to the accuracy derived from cross-validation of the training set. This reduction was not apparent for PPT. Combining MBV predictions with pedigree based predictions gave 1.05 - 1.34 times higher accuracies compared to predictions based on pedigree alone. 
Some methods have very different computational requirements, with PLSR and RR-BLUP requiring the least computing time. Conclusions The four methods which use information from all SNP, namely RR-BLUP, Bayes-R, PLSR and SVR, generate similar accuracies of MBV prediction for genomic selection, and their use in the selection of immediate future generations in dairy cattle will be comparable. The use of FR-LS in genomic selection is not recommended. PMID:20043835
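The abstract reports the accuracy and bias of MBV prediction without defining them. A common convention, assumed here, takes accuracy as the correlation between MBV and EBV in the validation animals and reads bias from the slope of the regression of EBV on MBV, where a slope of 1 indicates no bias. The sketch below follows that convention with hypothetical values; it is not the authors' code.

```python
from math import sqrt


def accuracy_and_bias(mbv, ebv):
    """Return (Pearson correlation of MBV with EBV, slope of EBV regressed on MBV)."""
    n = len(mbv)
    mean_m = sum(mbv) / n
    mean_e = sum(ebv) / n
    sxy = sum((m - mean_m) * (e - mean_e) for m, e in zip(mbv, ebv))
    sxx = sum((m - mean_m) ** 2 for m in mbv)
    syy = sum((e - mean_e) ** 2 for e in ebv)
    accuracy = sxy / sqrt(sxx * syy)
    slope = sxy / sxx  # 1.0 would mean MBV are neither inflated nor deflated
    return accuracy, slope


# Hypothetical validation set in which the MBV are compressed twofold.
toy_accuracy, toy_slope = accuracy_and_bias([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0])
```

In the toy case the ranking is perfect (correlation 1) but the slope of 2 shows that the predictions understate the spread of the EBV. The two statistics answer different questions, which is why the abstract reports them separately.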
Khatkar, Mehar S; Nicholas, Frank W; Collins, Andrew R; Zenger, Kyall R; Cavanagh, Julie A L; Barris, Wes; Schnabel, Robert D; Taylor, Jeremy F; Raadsma, Herman W
2008-04-24
The extent of linkage disequilibrium (LD) within a population determines the number of markers that will be required for successful association mapping and marker-assisted selection. Most studies on LD in cattle reported to date are based on microsatellite markers or small numbers of single nucleotide polymorphisms (SNPs) covering one or only a few chromosomes. This is the first comprehensive study of the extent of LD in cattle, analyzing data on 1,546 Holstein-Friesian bulls genotyped for 15,036 SNP markers covering all regions of all autosomes. Furthermore, most studies in cattle have used relatively small sample sizes and, consequently, may have had biased estimates of measures commonly used to describe LD. We examine minimum sample sizes required to estimate LD without bias and loss in accuracy. Finally, relatively little information is available on comparative LD structures including other mammalian species such as human and mouse, and we compare LD structure in cattle with public-domain data from both human and mouse. We computed three LD estimates, D', Dvol and r2, for 1,566,890 syntenic SNP pairs and a sample of 365,400 non-syntenic pairs. Mean D' is 0.189 among syntenic SNPs, and 0.105 among non-syntenic SNPs; mean r2 is 0.024 among syntenic SNPs and 0.0032 among non-syntenic SNPs. All three measures of LD for syntenic pairs decline with distance; the decline is much steeper for r2 than for D' and Dvol. The values of D' and Dvol are quite similar. Significant LD in cattle extends to 40 kb (when estimated as r2) and 8.2 Mb (when estimated as D'). The mean values for LD at large physical distances are close to those for non-syntenic SNPs. The minor allele frequency threshold affects the distribution and extent of LD. For unbiased and accurate estimates of LD across marker intervals spanning < 1 kb to > 50 Mb, minimum sample sizes of 400 (for D') and 75 (for r2) are required. The bias due to small sample sizes increases with inter-marker interval.
LD in cattle is much less extensive than in a mouse population created from crossing inbred lines, and more extensive than in humans. For association mapping in Holstein-Friesian cattle, for a given design, at least one SNP is required for each 40 kb, giving a total requirement of at least 75,000 SNPs for a low power whole-genome scan (median r2 > 0.19) and up to 300,000 markers at 10 kb intervals for a high power genome scan (median r2 > 0.62). For estimation of LD by D' and Dvol with sufficient precision, a sample size of at least 400 is required, whereas for r2 a minimum sample of 75 is adequate.
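Two calculations in this abstract lend themselves to a short sketch: the pairwise LD measures D' and r2, and the marker counts implied by a given spacing. The sketch assumes phased haplotype counts for two biallelic loci, whereas the study worked from genotype data, so haplotype frequencies would first have to be estimated. The genome length of 3,000,000 kb is not stated in the abstract; it is inferred here from 75,000 SNPs × 40 kb. Function names and the toy counts are hypothetical.

```python
def ld_measures(n_AB, n_Ab, n_aB, n_ab):
    """Return (D', r2) from counts of the four haplotypes at two biallelic loci.
    Both loci must be polymorphic, otherwise the denominators are zero."""
    n = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / n
    p_A = (n_AB + n_Ab) / n  # frequency of allele A at locus 1
    p_B = (n_AB + n_aB) / n  # frequency of allele B at locus 2
    d = p_AB - p_A * p_B     # raw disequilibrium coefficient D
    if d >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    d_prime = abs(d) / d_max
    r2 = d * d / (p_A * (1 - p_A) * p_B * (1 - p_B))
    return d_prime, r2


def markers_needed(genome_kb, spacing_kb):
    """Number of evenly spaced markers needed to cover genome_kb (rounded up)."""
    return -(-genome_kb // spacing_kb)


toy_d_prime, toy_r2 = ld_measures(40, 10, 10, 40)  # hypothetical counts
low_power_scan = markers_needed(3_000_000, 40)     # assumed 3,000,000 kb genome
high_power_scan = markers_needed(3_000_000, 10)
```

With the assumed genome length the two calls reproduce the 75,000 and 300,000 markers quoted in the abstract. The toy haplotype counts give D' = 0.6 but r2 = 0.36, a small reminder that the two measures are on different scales, as the abstract's contrasting decay distances also show.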
Multilocus nuclear DNA markers reveal population structure and demography of Anopheles minimus.
Dixit, Jyotsana; Arunyawat, Uraiwan; Huong, Ngo Thi; Das, Aparup
2014-11-01
Using multiple putatively neutral DNA markers is considered the most robust approach for inferring the evolutionary history of species populations. Molecular population genetic studies have been conducted in many species of the genus Anopheles, but studies based on single nucleotide polymorphism (SNP) data are still very scarce. Anopheles minimus is one of the principal malaria vectors of Southeast (SE) Asia, including Northeastern (NE) India. Although population genetic studies with mitochondrial genetic variation data have been used to infer the phylogeography of the SE Asian populations of this species, limited information on the population structure and demography of Indian An. minimus is available. We have developed a multilocus nuclear genetic approach with SNP markers located on the X chromosome of An. minimus in eight Indian and two SE Asian population samples (121 individual mosquitoes in total) to infer population history and test several hypotheses on the phylogeography of this species. While the Thai population sample of An. minimus presented the highest nucleotide diversity, the majority of the Indian samples were also fairly diverse. In general, An. minimus populations were moderately substructured in the distribution range covering SE Asia and NE India, largely falling under three distinct genetic clusters. Moreover, demographic expansion events could be detected in the majority of the presently studied populations of An. minimus. Additional DNA sequencing of the mitochondrial COII region in a subset of the samples (40 individual mosquitoes) corroborated the existing hypothesis of Indian An. minimus falling under the earlier reported mitochondrial lineage B. © 2014 John Wiley & Sons Ltd.
Li, X; Buitenhuis, A J; Lund, M S; Li, C; Sun, D; Zhang, Q; Poulsen, N A; Su, G
2015-11-01
The identification of causal genes or genomic regions associated with fatty acids (FA) will enhance our understanding of the pathways underlying FA synthesis and provide opportunities for changing milk fat composition through a genetic approach. The linkage disequilibrium between adjacent markers is highly consistent between the Chinese and Danish Holstein populations, such that a joint genome-wide association study (GWAS) can be performed. In this study, a joint GWAS was performed for 16 milk FA traits based on data of 784 Chinese and 371 Danish Holstein cows genotyped by a high-density bovine single nucleotide polymorphism (SNP) array. A total of 486,464 SNP markers on 29 bovine autosomes were used. Bonferroni corrections were applied to adjust the significance thresholds for multiple testing at the genome- and chromosome-wide levels. According to the analysis of either the Chinese or Danish data individually, the total numbers of overlapping SNP that were significant at the chromosome level were 94 for C14:1, 208 for the C14 index, and 1 for C18:0. Joint analysis using the combined data of the 2 populations detected greater numbers of significant SNP compared with either of the individual populations alone for 7 and 10 traits at the genome- and chromosome-wide significance levels, respectively. Greater numbers of significant SNP were detected for C18:0 and the C18 index in the Chinese population compared with the joint analysis. Sixty-five significant SNP across all traits had significantly different effects in the 2 populations. Ten FA were influenced by a quantitative trait locus (QTL) region including DGAT1. Both C14:1 and the C14 index were influenced by a QTL region including SCD1 in the combined population. Other QTL regions also showed significant associations with the studied FA. A large region (14.9-24.9 Mbp) in BTA26 significantly influenced C14:1 and the C14 index in both populations, most likely due to the SNP in SCD1.
A QTL region (69.97-73.69 Mbp) on BTA9 showed a significantly different effect on C18:0 between the 2 populations. Detection of these important SNP and the corresponding QTL regions will be helpful for follow-up studies to identify causal mutations and their interaction with environments for milk FA in dairy cattle. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
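The Bonferroni adjustment mentioned in this abstract divides the nominal significance level by the number of tests. The sketch below assumes a nominal level of 0.05, which the abstract does not state. The genome-wide test count is the 486,464 SNP reported, while the per-chromosome count of 20,000 is purely hypothetical.

```python
def bonferroni_threshold(n_tests, alpha=0.05):
    """Per-test P-value threshold that keeps the family-wise error rate at alpha."""
    return alpha / n_tests


# Genome-wide: all 486,464 SNP reported in the abstract count as tests.
genome_wide = bonferroni_threshold(486_464)      # about 1.03e-7

# Chromosome-wide: only the SNP on one chromosome count (hypothetical 20,000).
chromosome_wide = bonferroni_threshold(20_000)   # 2.5e-6
```

Because the chromosome-wide threshold is less stringent, more SNP pass it. This matches the abstract, where the joint analysis outperformed the single-population analyses for more traits at the chromosome-wide level (10) than at the genome-wide level (7).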
McClure, Matthew C.; McCarthy, John; Flynn, Paul; McClure, Jennifer C.; Dair, Emma; O'Connell, D. K.; Kearney, John F.
2018-01-01
A major use of genetic data is parentage verification and identification, as inaccurate pedigrees negatively affect genetic gain. Since 2012 the international standard for single nucleotide polymorphism (SNP) verification in Bos taurus cattle has been the ISAG SNP panels. While these ISAG panels provide an increased level of parentage accuracy over microsatellite markers (MS), they can validate the wrong parent at ≤1% misconcordance rate levels, indicating that more SNP are needed if a more accurate pedigree is required. With rapidly increasing numbers of cattle being genotyped in Ireland, representing 61 B. taurus breeds from a wide range of farm types (beef/dairy, AI/pedigree/commercial, purebred/crossbred, and large to small herd size), the Irish Cattle Breeding Federation (ICBF) analyzed different SNP densities and determined that at least 500 SNP are needed to consistently predict only one set of parents at a ≤1% misconcordance rate. For parentage validation and prediction ICBF uses 800 SNP (ICBF800) selected based on SNP clustering quality, ISAG200 inclusion, call rate (CR), and minor allele frequency (MAF) in the Irish cattle population. Large datasets require sample and SNP quality control (QC). Most publications only deal with SNP QC via CR, MAF, parent-progeny conflicts, and Hardy-Weinberg deviation, but not sample QC. We report here parentage, SNP QC, and genomic sample QC pipelines to deal with the unique challenges of >1 million genotypes from a national herd, such as SNP genotype errors from mis-tagging of animals, lab errors, farm errors, and multiple other issues that can arise. We divide the pipeline into two parts: a Genotype QC and an Animal QC pipeline. The Genotype QC identifies samples with low call rate, missing or mixed genotype classes (no BB genotype or ABTG alleles present), and low genotype frequencies.
The Animal QC handles situations where the genotype might not belong to the listed individual by identifying: >1 non-matching genotypes per animal, SNP duplicates, sex and breed prediction mismatches, parentage and progeny validation results, and other situations. The Animal QC pipeline makes use of the ICBF800 SNP set where appropriate to identify errors in a computationally efficient yet still highly accurate manner. PMID:29599798
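Two of the quantities this abstract relies on, call rate and the parent-progeny misconcordance rate, can be sketched as follows. The sketch assumes genotypes coded as 0/1/2 copies of the B allele with `None` for a no-call, and it counts only opposing homozygotes as conflicts. The ICBF pipeline's exact rules are not given in the abstract, so the function names, the coding and the toy genotypes are assumptions.

```python
def call_rate(genotypes):
    """Fraction of SNP with a genotype call (None marks a no-call)."""
    return sum(g is not None for g in genotypes) / len(genotypes)


def misconcordance_rate(parent, progeny):
    """Share of jointly called SNP where parent and progeny are opposing
    homozygotes (0 vs 2), which a true parent-offspring pair cannot show
    apart from genotyping errors."""
    compared = conflicts = 0
    for p, o in zip(parent, progeny):
        if p is None or o is None:
            continue  # skip SNP missing in either animal
        compared += 1
        if {p, o} == {0, 2}:
            conflicts += 1
    return conflicts / compared if compared else None


# Hypothetical genotypes at eight SNP.
toy_parent = [0, 1, 2, 2, 0, None, 1, 2]
toy_progeny = [1, 1, 1, 0, 0, 2, None, 2]
toy_rate = misconcordance_rate(toy_parent, toy_progeny)
parent_accepted = toy_rate is not None and toy_rate <= 0.01  # ≤1% as in the abstract
```

With only a handful of SNP a single conflict already exceeds 1%, which illustrates why panel size matters for how many conflicts a candidate parent can carry before being excluded.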
Phetsuksiri, Benjawan; Srisungngam, Sopa; Rudeeaneksin, Janisara; Bunchoo, Supranee; Lukebua, Atchariya; Wongtrungkapun, Ruch; Paitoon, Soontara; Sakamuri, Rama Murthy; Brennan, Patrick J; Vissa, Varalakshmi
2012-01-01
Based on the discovery of three single nucleotide polymorphisms (SNPs) in Mycobacterium leprae, it has been previously reported that there are four major SNP types associated with different geographic regions around the world. Another typing system for global differentiation of M. leprae is the analysis of the variable number of short tandem repeats within the rpoT gene. To expand the analysis of geographic distribution of M. leprae, classified by SNP and rpoT gene polymorphisms, we studied 85 clinical isolates from Thai patients and compared the findings with those reported from Asian isolates. SNP genotyping by PCR amplification and sequencing revealed that all strains like those in Myanmar were SNP type 1 and 3, with the former being predominant, while in Japan, Korea, and Indonesia, the SNP type 3 was found to be more frequent. The pattern of M. leprae distribution in Thailand and Myanmar is quite similar, except that SNP type 2 was not found in Thailand. In addition, the 3-copy hexamer genotype in the rpoT gene is shared among the isolates from these two neighboring countries. On the basis of these two markers, we postulate that M. leprae in leprosy patients from Myanmar and Thailand has a common historical origin. Further differentiation among Thai isolates was possible by assessing copy numbers of the TTC sequence, a more polymorphic microsatellite locus.
2013-01-01
Background The availability of a large expressed sequence tags (EST) resource and recent advances in high-throughput genotyping technology have made it possible to develop highly multiplexed SNP arrays for multi-objective genetic applications, including the construction of meiotic maps. Such approaches are particularly useful in species with a large genome size, precluding the use of whole-genome shotgun assembly with current technologies. Results In this study, a 12 k-SNP genotyping array was developed for maritime pine from an extensive EST resource assembled into a unigene set. The offspring of three-generation outbred and inbred mapping pedigrees were then genotyped. The inbred pedigree consisted of a classical F2 population resulting from the selfing of a single inter-provenance (Landes x Corsica) hybrid tree, whereas the outbred pedigree (G2) resulted from a controlled cross of two intra-provenance (Landes x Landes) hybrid trees. This resulted in the generation of three linkage maps based on SNP markers: one from the parental genotype of the F2 population (1,131 markers in 1,708 centimorgan (cM)), and one for each parent of the G2 population (1,015 and 1,110 markers in 1,447 and 1,425 cM for the female and male parents, respectively). A comparison of segregation patterns in the progeny obtained from the two types of mating (inbreeding and outbreeding) led to the identification of a chromosomal region carrying an embryo viability locus with a semi-lethal allele. Following selfing and segregation, zygote mortality resulted in a deficit of Corsican homozygous genotypes in the F2 population. This dataset was also used to study the extent and distribution of meiotic recombination along the length of the chromosomes and the effect of sex and/or genetic background on recombination. The genetic background of trees in which meiotic recombination occurred was found to have a significant effect on the frequency of recombination. 
Furthermore, only a small proportion of the recombination hot- and cold-spots were common to all three genotypes, suggesting that the spatial pattern of recombination was genetically variable. Conclusion This study led to the development of classical genomic tools for this ecologically and economically important species. It also identified a chromosomal region bearing a semi-lethal recessive allele and demonstrated the genetic variability of recombination rate over the genome. PMID:23597128
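The deficit of Corsican homozygotes described in this abstract is a case of segregation distortion. The abstract does not say which test was used. A standard choice, assumed here, is a chi-square goodness-of-fit test of the F2 genotype counts against the expected 1:2:1 ratio, with 2 degrees of freedom. The counts below are hypothetical.

```python
CHI2_CRITICAL_DF2_P05 = 5.991  # chi-square critical value, 2 df, alpha = 0.05


def chi_square_f2(n_aa, n_ab, n_bb):
    """Chi-square statistic for F2 genotype counts against a 1:2:1 expectation."""
    total = n_aa + n_ab + n_bb
    expected = (0.25 * total, 0.5 * total, 0.25 * total)
    observed = (n_aa, n_ab, n_bb)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical marker showing a shortage of one homozygous class.
toy_chi2 = chi_square_f2(10, 60, 30)
distorted = toy_chi2 > CHI2_CRITICAL_DF2_P05
```

For the toy counts the statistic is 12, well above the critical value. Scanning this statistic along a linkage group is one way a region carrying a semi-lethal allele would show up as a run of distorted markers.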
Valentini, Giseli; Gonçalves-Vidigal, Maria Celeste; Hurtado-Gonzales, Oscar P; de Lima Castro, Sandra Aparecida; Cregan, Perry B; Song, Qijian; Pastor-Corrales, Marcial A
2017-08-01
Co-segregation analysis and high-throughput genotyping using SNP, SSR, and KASP markers demonstrated genetic linkage between the Ur-14 and Co-3⁴/Phg-3 loci conferring resistance to the rust, anthracnose and angular leaf spot diseases of common bean. Rust, anthracnose, and angular leaf spot are major diseases of common bean in the Americas and Africa. The cultivar Ouro Negro has the Ur-14 gene that confers broad spectrum resistance to rust and the gene cluster Co-3⁴/Phg-3 containing two tightly linked genes conferring resistance to anthracnose and angular leaf spot, respectively. We used co-segregation analysis and high-throughput genotyping of 179 F2:3 families from the Rudá (susceptible) × Ouro Negro (resistant) cross, phenotyped separately with races of the rust and anthracnose pathogens. The results confirmed that Ur-14 and the Co-3⁴/Phg-3 cluster in Ouro Negro conferred resistance to rust and anthracnose, respectively, and that Ur-14 and the Co-3⁴/Phg-3 cluster were closely linked. Genotyping the F2:3 families, first with 5398 SNPs on the Illumina BeadChip BARCBEAN6K_3 and then with 15 SSR and eight KASP markers specifically designed for the candidate region containing Ur-14 and Co-3⁴/Phg-3, permitted the creation of a high-resolution genetic linkage map which revealed that Ur-14 was positioned at 2.2 cM from Co-3⁴/Phg-3 on the short arm of chromosome Pv04 of the common bean genome. Five flanking SSR markers were tightly linked at 0.1 and 0.2 cM from Ur-14, and two flanking KASP markers were tightly linked at 0.1 and 0.3 cM from Co-3⁴/Phg-3. Many other SSR, SNP, and KASP markers were also linked to these genes. These markers will be useful for the development of common bean cultivars combining the important Ur-14 and Co-3⁴/Phg-3 genes conferring resistance to three of the most destructive diseases of common bean.
Tian, Chao; Kosoy, Roman; Nassir, Rami; Lee, Annette; Villoslada, Pablo; Klareskog, Lars; Hammarström, Lennart; Garchon, Henri-Jean; Pulver, Ann E.; Ransom, Michael; Gregersen, Peter K.; Seldin, Michael F.
2009-01-01
The definition of European population genetic substructure and its application to understanding complex phenotypes is becoming increasingly important. In the current study, using over 4000 subjects genotyped for 300,000 SNPs, we provide further insight into relationships among European population groups and identify sets of SNP ancestry informative markers (AIMs) for application in genetic studies. In general, the graphical description of these principal components analyses (PCA) of diverse European subjects showed a strong correspondence to the geographical relationships of specific countries or regions of origin. Clearer separation of different ethnic and regional populations was observed when northern and southern European groups were considered separately, and the PCA results were influenced by the inclusion or exclusion of different self-identified population groups, including Ashkenazi Jewish, Sardinian, and Orcadian ethnic groups. SNP AIM sets were identified that could distinguish the regional and ethnic population groups. Moreover, the studies demonstrated that most allele frequency differences between different European groups could be effectively controlled in analyses using these AIM sets. The European substructure AIMs should be widely applicable to ongoing studies to confirm and delineate specific disease susceptibility candidate regions without the necessity of performing additional genome-wide SNP studies in additional subject sets. PMID:19707526
A potential molecular marker for selection against abdominal fatness in chickens.
Wu, G Q; Deng, X M; Li, J Y; Li, N; Yang, N
2006-11-01
The peroxisome proliferator-activated receptor-gamma coactivator-1alpha (PGC-1alpha) was investigated as a candidate gene for growth and fatness traits in chicken because of its prominent role in muscle fiber specialization and adipogenesis. A single nucleotide polymorphism (SNP) from G to A at position 646 of the open reading frame of the chicken PGC-1alpha gene, causing an Asp216Asn amino acid substitution, was identified. The frequencies of alleles and genotypes were significantly different among 6 chicken breeds (P < 0.01). The White Plymouth Rock had the highest frequency (0.67) of allele G, whereas the White Leghorn had the lowest (0.18). The associations of the SNP with the growth and fatness traits were evaluated in 332 F(2) birds from an experimental cross of White Plymouth Rock × Silkies. No association was found between the SNP and growth-related traits. However, abdominal fat weight at 12 wk of age for birds with genotype GG was 34.26 and 28.71% higher than for those with genotypes AA and AG, respectively (P < 0.01), indicating that the Asp216Asn polymorphism of the PGC-1alpha gene could be used as a novel potential molecular marker for selection against abdominal fatness without interfering with regular breeding for growth rate in chickens.
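The link between ORF position 646 and residue 216 can be checked with a few lines of arithmetic. The sketch below is illustrative only: the helper name is hypothetical, and because the abstract does not give the actual codon sequence, the Asp and Asn codons are taken from the standard genetic code as an assumption.

```python
def codon_of(orf_pos: int) -> tuple[int, int]:
    """Map a 1-based ORF nucleotide position to (codon number, position in codon)."""
    codon = (orf_pos - 1) // 3 + 1
    offset = (orf_pos - 1) % 3 + 1
    return codon, offset


# Standard genetic code entries relevant here (DNA alphabet).
ASP_CODONS = {"GAT", "GAC"}
ASN_CODONS = {"AAT", "AAC"}

codon, offset = codon_of(646)
print(f"ORF position 646 -> codon {codon}, base {offset} of the codon")

# A G-to-A change at the first base of any Asp codon yields an Asn codon.
for asp in sorted(ASP_CODONS):
    mutated = "A" + asp[1:]
    print(asp, "->", mutated, "is Asn:", mutated in ASN_CODONS)
```

Position 646 falls on the first base of codon 216, which agrees with the reported Asp216Asn substitution.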
Abo-Al-Ela, Haitham G; El-Magd, Mohammed Abu; El-Nahas, Abeer F; Mansour, Ali A
2014-08-01
Insulin-like growth factor 2 (IGF2) plays an important role in muscle growth and might be used as a marker in selection strategies for growth traits in farm animals. The objectives of this study were to detect polymorphisms in exon 10 of IGF2 and to determine associations between these polymorphisms and growth traits in Egyptian water buffalo. PCR-single-strand conformation polymorphism (SSCP) and DNA sequencing methods were used to detect polymorphisms. A novel single nucleotide polymorphism (SNP), C287A, was detected. It was a non-synonymous mutation that led to the replacement of glutamine (Q) by histidine (H). Three different SSCP patterns were observed: AA, AC, and CC, with frequencies of 0.540, 0.325, and 0.135, respectively. Association analyses revealed that the AA individuals had a higher average daily gain (ADG) than other individuals (CC and AC) from birth to 9 months of age. We conclude that the AA genotype at the C287A SNP in exon 10 of the IGF2 gene is associated with ADG from birth to 9 months of age and could be used as a potential genetic marker for selection of growth traits in Egyptian buffalo.
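The reported genotype frequencies imply allele frequencies that the abstract does not state. A minimal sketch of that calculation follows, with Hardy-Weinberg expectations added for comparison. The function names are hypothetical, and because the abstract gives no sample size, no formal test of equilibrium is attempted.

```python
# Genotype frequencies as reported in the abstract.
genotype_freq = {"AA": 0.540, "AC": 0.325, "CC": 0.135}


def allele_frequencies(gf: dict[str, float]) -> tuple[float, float]:
    """Return (freq of A, freq of C); each heterozygote carries one copy of each allele."""
    p = gf["AA"] + gf["AC"] / 2
    q = gf["CC"] + gf["AC"] / 2
    return p, q


def hwe_expected(p: float, q: float) -> dict[str, float]:
    """Genotype frequencies expected under Hardy-Weinberg proportions."""
    return {"AA": p * p, "AC": 2 * p * q, "CC": q * q}


p_a, q_c = allele_frequencies(genotype_freq)
expected = hwe_expected(p_a, q_c)

print(f"freq(A) = {p_a:.4f}, freq(C) = {q_c:.4f}")
for genotype, observed in genotype_freq.items():
    print(f"{genotype}: observed {observed:.3f}, HWE expected {expected[genotype]:.3f}")
```

Turning the observed and expected proportions into a chi-square test would require the number of animals genotyped, which the abstract does not report.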
Hao, Chenyang; Wang, Yuquan; Chao, Shiaoman; Li, Tian; Liu, Hongxia; Wang, Lanfen; Zhang, Xueyong
2017-01-30
A Chinese wheat mini core collection was genotyped using the wheat 9K iSelect SNP array. In total, 2420 and 2396 polymorphic SNPs were detected on the A and the B genome chromosomes, respectively, which formed 878 haplotype blocks. There were more blocks in the B genome, but their average size was significantly (P < 0.05) smaller than that of blocks in the A genome. Intense selection (domestication and breeding) had a stronger effect on the A than on the B genome chromosomes. Based on the genetic pedigrees, many blocks can be traced back to a well-known Strampelli cross, which was made one century ago. Furthermore, polyploidization of wheat (both tetraploidization and hexaploidization) induced revolutionary changes in both the A and the B genomes, with a greater increase of gene diversity compared to their diploid ancestors. Modern breeding has dramatically increased diversity in the gene coding regions, though obvious blocks were formed on most of the chromosomes in both tetraploid and hexaploid wheats. Tag-SNP markers identified in this study can be used for marker-assisted selection using haplotype blocks as a wheat breeding strategy. This strategy can also be employed to facilitate genome selection in other self-pollinating crop species.
Tasker, Esiri; LaRue, Bobby; Beherec, Charity; Gangitano, David; Hughes-Stamm, Sheree
2017-05-01
Improvised explosive devices (IEDs) such as pipe bombs are weapons used to detrimentally affect people and communities. A readily accessible brand of exploding targets called Tannerite® has been identified as a potential material for abuse as an explosive in pipe bombs. The ability to recover and genotype DNA from such weapons may be vital in the effort to identify suspects associated with these devices. While it is possible to genotype DNA recovered from post-blast fragments using short tandem repeat (STR) markers, genotyping success can be negatively affected by low quantities of DNA, degradation, and/or PCR inhibitors. Alternative markers such as insertion/null markers (INNULs) and single nucleotide polymorphisms (SNPs) are bi-allelic genetic markers that present shorter amplification targets than STRs and are therefore more likely to resist degradation. In this study, we constructed pipe bombs that were spiked with known amounts of biological material to: 1) recover "touch" DNA from the surface of the device, and 2) recover traces of blood from the ends of wires (simulated finger prick). The bombs were detonated with the binary explosive Tannerite® using double-base smokeless powder to initiate the reaction. DNA extracted from the post-blast fragments was quantified with the Quantifiler® Trio DNA Quantification Kit. STR analysis was conducted using the GlobalFiler® Amplification Kit, INNULs were amplified using an early-access version of the InnoTyper™ 21 Kit, and SNP analysis via massively parallel sequencing (MPS) was performed using the HID-Ion Ampliseq™ Identity and Ancestry panels with the Ion Chef and Ion PGM sequencing system. The results of this study showed that INNUL markers produced the most complete genetic profiles when compared to STR and SNP profiles. The random match probabilities calculated for samples using INNULs were lower than with STRs when fewer than 14 STR alleles were reported. These results suggest that INNUL analysis may be well suited for low-template and/or degraded DNA samples, and may be used to supplement incomplete or failed STR analysis. Human identification using SNP analysis via MPS showed variable success with low-level post-blast samples in this study (<150 pg). While neat DNA samples (6 μL input as recommended) resulted in <50% of SNP calls, samples that were concentrated from 15 μL to 6 μL (15 μL was added for STR and INNUL typing) resulted in more complete SNP profiles. Five out of six blood samples recovered from the wires attached to the pipe bombs resulted in the correct ancestry predictions. Copyright © 2017 Elsevier B.V. All rights reserved.
Functional Analysis and Marker Development of TaCRT-D Gene in Common Wheat (Triticum aestivum L.).
Wang, Jiping; Li, Runzhi; Mao, Xinguo; Jing, Ruilian
2017-01-01
Calreticulin (CRT), an endoplasmic reticulum (ER)-localized Ca2+-binding/buffering protein, is highly conserved and extensively expressed in animal and plant cells. To understand the function of CRTs in wheat (Triticum aestivum L.), particularly their roles in stress tolerance, we cloned the full-length genomic sequence of the TaCRT-D isoform from the D genome of common hexaploid wheat and characterized its function in a transgenic Arabidopsis system. TaCRT-D exhibited different expression patterns in wheat seedlings under different abiotic stresses. Transgenic Arabidopsis plants overexpressing the ORF of TaCRT-D displayed more tolerance to drought, cold, salt, mannitol, and other abiotic stresses at both the seed germination and seedling stages, compared with the wild-type controls. Furthermore, DNA polymorphism analysis and gene mapping were employed to develop functional markers of this gene for marker-assisted selection in wheat breeding programs. One SNP, S440 (T→C), was detected at the TaCRT-D locus by genotyping a wheat recombinant inbred line (RIL) population (114 lines) developed from Opata 85 × W7984. TaCRT-D was then fine mapped between markers Xgwm645 and Xgwm664 on chromosome 3DL, corresponding to genetic distances of 3.5 and 4.4 cM, respectively, using the RIL population and Chinese Spring nulli-tetrasomic lines. Finally, genome-specific and allele-specific markers were developed for the TaCRT-D gene. These findings indicate that TaCRT-D plays an important role in plant stress responses, providing a gene target for genetic engineering to increase plant stress tolerance and functional markers of TaCRT-D for marker-assisted selection in wheat breeding. PMID:28955354