density snp-based linkage: Topics by Science.gov

Sample records for density snp-based linkage

An improved consensus linkage map of barley based on flow-sorted chromosomes and SNP markers

USDA-ARS?s Scientific Manuscript database

Recent advances in high-throughput genotyping have made it easier to combine information from different mapping populations into consensus genetic maps, which provide increased marker density and genome coverage compared to individual maps. Previously, a SNP-based genotyping platform was developed a...
Construction of a high density SNP linkage map of kelp (Saccharina japonica) by sequencing Taq I site associated DNA and mapping of a sex determining locus.

PubMed

Zhang, Ning; Zhang, Linan; Tao, Ye; Guo, Li; Sun, Juan; Li, Xia; Zhao, Nan; Peng, Jie; Li, Xiaojie; Zeng, Liang; Chen, Jinsa; Yang, Guanpin

2015-03-15

Kelp (Saccharina japonica) has been intensively cultured in China for almost a century. Its genetic improvement is comparable with that of rice. However, the development of its molecular tools is extremely limited, thus its genes, genetics and genomics. Kelp performs an alternative life cycle during which sporophyte generation alternates with gametophyte generation. The gametophytes of kelp can be cloned and crossed. Due to these characteristics, kelp may serve as a reference for the biological and genetic studies of Volvox, mosses and ferns. We constructed a high density single nucleotide polymorphism (SNP) linkage map for kelp by restriction site associated DNA (RAD) sequencing. In total, 4,994 SNP-containing physical (tag-defined) RAD loci were mapped on 31 linkage groups. The map expanded a total genetic distance of 1,782.75 cM, covering 98.66% of the expected (1,806.94 cM). The length of RAD tags (85 bp) was extended to 400-500 bp with Miseq method, offering us an easiness of developing SNP chips and shifting SNP genotyping to a high throughput track. The number of linkage groups was in accordance with the documented with cytological methods. In addition, we identified a set of microsatellites (99 in total) from the extended RAD tags. A gametophyte sex determining locus was mapped on linkage group 2 in a window about 9.0 cM in width, which was 2.66 cM up to marker_40567 and 6.42 cM down to marker_23595. A high density SNP linkage map was constructed for kelp, an intensively cultured brown alga in China. The RAD tags were also extended so that a SNP chip could be developed. In addition, a set of microsatellites were identified among mapped loci, and a gametophyte sex determining locus was mapped. This map will facilitate the genetic studies of kelp including for example the evaluation of germplasm and the decipherment of the genetic bases of economic traits.
A high density linkage map of the ancestral diploid strawberry F. iinumae using SNP markers from the ISTRAW90 array and GBS

USDA-ARS?s Scientific Manuscript database

Fragaria iinumae is recognized as an ancestor of the octoploid strawberry species, including the cultivated strawberry, Fragaria ×ananassa. Here we report the construction of the first high density linkage map for F. iinumae. The map is based on two high-throughput techniques of single nucleotide p...
A high-density intraspecific SNP linkage map of pigeonpea (Cajanas cajan L. Millsp.)

PubMed Central

Mandal, Paritra; Bhutani, Shefali; Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram Pratap; Chaudhary, A. K.; Yadav, Rekha; Gaikwad, K.; Sevanthi, Amitha Mithra; Datta, Subhojit; Raje, Ranjeet S.; Sharma, Tilak R.; Singh, Nagendra Kumar

2017-01-01

Pigeonpea (Cajanus cajan (L.) Millsp.) is a major food legume cultivated in semi-arid tropical regions including the Indian subcontinent, Africa, and Southeast Asia. It is an important source of protein, minerals, and vitamins for nearly 20% of the world population. Due to high carbon sequestration and drought tolerance, pigeonpea is an important crop for the development of climate resilient agriculture and nutritional security. However, pigeonpea productivity has remained low for decades because of limited genetic and genomic resources, and sparse utilization of landraces and wild pigeonpea germplasm. Here, we present a dense intraspecific linkage map of pigeonpea comprising 932 markers that span a total adjusted map length of 1,411.83 cM. The consensus map is based on three different linkage maps that incorporate a large number of single nucleotide polymorphism (SNP) markers derived from next generation sequencing data, using Illumina GoldenGate bead arrays, and genotyping with restriction site associated DNA (RAD) sequencing. The genotyping-by-sequencing enhanced the marker density but was met with limited success due to lack of common markers across the genotypes of mapping population. The integrated map has 547 bead-array SNP, 319 RAD-SNP, and 65 simple sequence repeat (SSR) marker loci. We also show here correspondence between our linkage map and published genome pseudomolecules of pigeonpea. The availability of a high-density linkage map will help improve the anchoring of the pigeonpea genome to its chromosomes and the mapping of genes and quantitative trait loci associated with useful agronomic traits. PMID:28654689
Construction and Annotation of a High Density SNP Linkage Map of the Atlantic Salmon (Salmo salar) Genome.

PubMed

Tsai, Hsin Y; Robledo, Diego; Lowe, Natalie R; Bekaert, Michael; Taggart, John B; Bron, James E; Houston, Ross D

2016-07-07

High density linkage maps are useful tools for fine-scale mapping of quantitative trait loci, and characterization of the recombination landscape of a species' genome. Genomic resources for Atlantic salmon (Salmo salar) include a well-assembled reference genome, and high density single nucleotide polymorphism (SNP) arrays. Our aim was to create a high density linkage map, and to align it with the reference genome assembly. Over 96,000 SNPs were mapped and ordered on the 29 salmon linkage groups using a pedigreed population comprising 622 fish from 60 nuclear families, all genotyped with the 'ssalar01' high density SNP array. The number of SNPs per group showed a high positive correlation with physical chromosome length (r = 0.95). While the order of markers on the genetic and physical maps was generally consistent, areas of discrepancy were identified. Approximately 6.5% of the previously unmapped reference genome sequence was assigned to chromosomes using the linkage map. Male recombination rate was lower than females across the vast majority of the genome, but with a notable peak in subtelomeric regions. Finally, using RNA-Seq data to annotate the reference genome, the mapped SNPs were categorized according to their predicted function, including annotation of ∼2500 putative nonsynonymous variants. The highest density SNP linkage map for any salmonid species has been created, annotated, and integrated with the Atlantic salmon reference genome assembly. This map highlights the marked heterochiasmy of salmon, and provides a useful resource for salmonid genetics and genomics research. Copyright © 2016 Tsai et al.
Construction of an SNP-based high-density linkage map for flax (Linum usitatissimum L.) using specific length amplified fragment sequencing (SLAF-seq) technology.

PubMed

Yi, Liuxi; Gao, Fengyun; Siqin, Bateer; Zhou, Yu; Li, Qiang; Zhao, Xiaoqing; Jia, Xiaoyun; Zhang, Hui

2017-01-01

Flax is an important crop for oil and fiber, however, no high-density genetic maps have been reported for this species. Specific length amplified fragment sequencing (SLAF-seq) is a high-resolution strategy for large scale de novo discovery and genotyping of single nucleotide polymorphisms. In this study, SLAF-seq was employed to develop SNP markers in an F2 population to construct a high-density genetic map for flax. In total, 196.29 million paired-end reads were obtained. The average sequencing depth was 25.08 in male parent, 32.17 in the female parent, and 9.64 in each F2 progeny. In total, 389,288 polymorphic SLAFs were detected, from which 260,380 polymorphic SNPs were developed. After filtering, 4,638 SNPs were found suitable for genetic map construction. The final genetic map included 4,145 SNP markers on 15 linkage groups and was 2,632.94 cM in length, with an average distance of 0.64 cM between adjacent markers. To our knowledge, this map is the densest SNP-based genetic map for flax. The SNP markers and genetic map reported in here will serve as a foundation for the fine mapping of quantitative trait loci (QTLs), map-based gene cloning and marker assisted selection (MAS) for flax.
Evaluation of Bovine High-Density SNP Genotyping Array in Indigenous Dairy Cattle Breeds.

PubMed

Dash, S; Singh, A; Bhatia, A K; Jayakumar, S; Sharma, A; Singh, S; Ganguly, I; Dixit, S P

2018-04-03

In total 52 samples of Sahiwal ( 19 ), Tharparkar ( 17 ), and Gir ( 16 ) were genotyped by using BovineHD SNP chip to analyze minor allele frequency (MAF), genetic diversity, and linkage disequilibrium among these cattle. The common SNPs of BovineHD and 54K SNP Chips were also extracted and evaluated for their performance. Only 40%-50% SNPs of these arrays was found informative for genetic analysis in these cattle breeds. The overall mean of MAF for SNPs of BovineHD SNPChip was 0.248 ± 0.006, 0.241 ± 0.007, and 0.242 ± 0.009 in Sahiwal, Tharparkar and Gir, respectively, while that for 54K SNPs was on lower side. The average Reynold's genetic distance between breeds ranged from 0.042 to 0.055 based on BovineHD Beadchip, and from 0.052 to 0.084 based on 54K SNP Chip. The estimates of genetic diversity based on HD and 54K chips were almost same and, hence, low density chip seems to be good enough to decipher genetic diversity of these cattle breeds. The linkage disequilibrium started decaying (r 2 < 0.2) at 140 kb inter-marker distance and, hence, a 20K low density customized SNP array from HD chip could be designed for genomic selection in these cattle else the 54K Bead Chip as such will be useful.
Insights Into Upland Cotton (Gossypium hirsutum L.) Genetic Recombination Based on 3 High-Density Single-Nucleotide Polymorphism and a Consensus Map Developed Independently With Common Parents.

PubMed

Ulloa, Mauricio; Hulse-Kemp, Amanda M; De Santiago, Luis M; Stelly, David M; Burke, John J

2017-01-01

High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement, especially in paleopolyploids with exceptionally complex genomes, eg, upland cotton ( Gossypium hirsutum L., "2n = 52"). Three independently developed intraspecific upland mapping populations were analyzed to generate 3 high-density genetic linkage single-nucleotide polymorphism (SNP) maps and a consensus map using the CottonSNP63K array. The populations consisted of a previously reported F 2 , a recombinant inbred line (RIL), and reciprocal RIL population, from "Phytogen 72" and "Stoneville 474" cultivars. The cluster file provided 7417 genotyped SNP markers, resulting in 26 linkage groups corresponding to the 26 chromosomes (c) of the allotetraploid upland cotton (AD) 1 arisen from the merging of 2 genomes ("A" Old World and "D" New World). Patterns of chromosome-specific recombination were largely consistent across mapping populations. The high-density genetic consensus map included 7244 SNP markers that spanned 3538 cM and comprised 3824 SNP bins, of which 1783 and 2041 were in the A t and D t subgenomes with 1825 and 1713 cM map lengths, respectively. Subgenome average distances were nearly identical, indicating that subgenomic differences in bin number arose due to the high numbers of SNPs on the D t subgenome. Examination of expected recombination frequency or crossovers (COs) on the chromosomes within each population of the 2 subgenomes revealed that COs were also not affected by the SNPs or SNP bin number in these subgenomes. Comparative alignment analyses identified historical ancestral A t -subgenomic translocations of c02 and c03, as well as of c04 and c05. The consensus map SNP sequences aligned with high congruency to the NBI assembly of Gossypium hirsutum . However, the genomic comparisons revealed evidence of additional unconfirmed possible duplications, inversions and translocations, and unbalance SNP sequence homology or SNP sequence/loci genomic dominance, or homeolog loci bias of the upland tetraploid A t and D t subgenomes. The alignments indicated that 364 SNP-associated previously unintegrated scaffolds can be placed in pseudochromosomes of the NBI G hirsutum assembly. This is the first intraspecific SNP genetic linkage consensus map assembled in G hirsutum with a core of reproducible mendelian SNP markers assayed on different populations and it provides further knowledge of chromosome arrangement of genic and nongenic SNPs. Together, the consensus map and RIL populations provide a synergistically useful platform for localizing and identifying agronomically important loci for improvement of the cotton crop.
Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array.

PubMed

Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J

2012-05-25

A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the 'Golden Delicious' reference sequence will assist in the continued improvement of the genome sequence assembly for that variety.
A SNP based high-density linkage map of Apis cerana reveals a high recombination rate similar to Apis mellifera.

PubMed

Shi, Yuan Yuan; Sun, Liang Xian; Huang, Zachary Y; Wu, Xiao Bo; Zhu, Yong Qiang; Zheng, Hua Jun; Zeng, Zhi Jiang

2013-01-01

The Eastern honey bee, Apis cerana Fabricius, is distributed in southern and eastern Asia, from India and China to Korea and Japan and southeast to the Moluccas. This species is also widely kept for honey production besides Apis mellifera. Apis cerana is also a model organism for studying social behavior, caste determination, mating biology, sexual selection, and host-parasite interactions. Few resources are available for molecular research in this species, and a linkage map was never constructed. A linkage map is a prerequisite for quantitative trait loci mapping and for analyzing genome structure. We used the Chinese honey bee, Apis cerana cerana to construct the first linkage map in the Eastern honey bee. F2 workers (N = 103) were genotyped for 126,990 single nucleotide polymorphisms (SNPs). After filtering low quality and those not passing the Mendel test, we obtained 3,000 SNPs, 1,535 of these were informative and used to construct a linkage map. The preliminary map contains 19 linkage groups, we then mapped the 19 linkage groups to 16 chromosomes by comparing the markers to the genome of A. mellfiera. The final map contains 16 linkage groups with a total of 1,535 markers. The total genetic distance is 3,942.7 centimorgans (cM) with the largest linkage group (180 loci) measuring 574.5 cM. Average marker interval for all markers across the 16 linkage groups is 2.6 cM. We constructed a high density linkage map for A. c. cerana with 1,535 markers. Because the map is based on SNP markers, it will enable easier and faster genotyping assays than randomly amplified polymorphic DNA or microsatellite based maps used in A. mellifera.
An ultra-high density linkage map and QTL mapping for sex and growth-related traits of common carp (Cyprinus carpio)

PubMed Central

Peng, Wenzhu; Xu, Jian; Zhang, Yan; Feng, Jianxin; Dong, Chuanju; Jiang, Likun; Feng, Jingyan; Chen, Baohua; Gong, Yiwen; Chen, Lin; Xu, Peng

2016-01-01

High density genetic linkage maps are essential for QTL fine mapping, comparative genomics and high quality genome sequence assembly. In this study, we constructed a high-density and high-resolution genetic linkage map with 28,194 SNP markers on 14,146 distinct loci for common carp based on high-throughput genotyping with the carp 250 K single nucleotide polymorphism (SNP) array in a mapping family. The genetic length of the consensus map was 10,595.94 cM with an average locus interval of 0.75 cM and an average marker interval of 0.38 cM. Comparative genomic analysis revealed high level of conserved syntenies between common carp and the closely related model species zebrafish and medaka. The genome scaffolds were anchored to the high-density linkage map, spanning 1,357 Mb of common carp reference genome. QTL mapping and association analysis identified 22 QTLs for growth-related traits and 7 QTLs for sex dimorphism. Candidate genes underlying growth-related traits were identified, including important regulators such as KISS2, IGF1, SMTLB, NPFFR1 and CPE. Candidate genes associated with sex dimorphism were also identified including 3KSR and DMRT2b. The high-density and high-resolution genetic linkage map provides an important tool for QTL fine mapping and positional cloning of economically important traits, and improving common carp genome assembly. PMID:27225429
Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array

PubMed Central

2012-01-01

Background A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the ‘Golden Delicious’ reference sequence will assist in the continued improvement of the genome sequence assembly for that variety. PMID:22631220
A high-density, SNP-based consensus map of tetraploid wheat as a bridge to integrate durum and bread wheat genomics and breeding

USDA-ARS?s Scientific Manuscript database

Consensus linkage maps are important tools in crop genomics. We have assembled a high-density tetraploid wheat consensus map by integrating 13 datasets from independent biparental populations involving durum wheat cultivars (Triticum turgidum ssp. durum), cultivated emmer (T. turgidum ssp. dicoccum...
The construction of a high-density linkage map for identifying SNP markers that are tightly linked to a nuclear-recessive major gene for male sterility in Cryptomeria japonica D. Don

PubMed Central

2012-01-01

Background High-density linkage maps facilitate the mapping of target genes and the construction of partial linkage maps around target loci to develop markers for marker-assisted selection (MAS). MAS is quite challenging in conifers because of their large, complex, and poorly-characterized genomes. Our goal was to construct a high-density linkage map to facilitate the identification of markers that are tightly linked to a major recessive male-sterile gene (ms1) for MAS in C. japonica, a species that is important in Japanese afforestation but which causes serious social pollinosis problems. Results We constructed a high-density saturated genetic linkage map for C. japonica using expressed sequence-derived co-dominant single nucleotide polymorphism (SNP) markers, most of which were genotyped using the GoldenGate genotyping assay. A total of 1261 markers were assigned to 11 linkage groups with an observed map length of 1405.2 cM and a mean distance between two adjacent markers of 1.1 cM; the number of linkage groups matched the basic chromosome number in C. japonica. Using this map, we located ms1 on the 9th linkage group and constructed a partial linkage map around the ms1 locus. This enabled us to identify a marker (hrmSNP970_sf) that is closely linked to the ms1 gene, being separated from it by only 0.5 cM. Conclusions Using the high-density map, we located the ms1 gene on the 9th linkage group and constructed a partial linkage map around the ms1 locus. The map distance between the ms1 gene and the tightly linked marker was only 0.5 cM. The identification of markers that are tightly linked to the ms1 gene will facilitate the early selection of male-sterile trees, which should expedite C. japonica breeding programs aimed at alleviating pollinosis problems without harming productivity. PMID:22424262
Large-scale SNP discovery and construction of a high-density genetic map of Colossoma macropomum through genotyping-by-sequencing

PubMed Central

Nunes, José de Ribamar da Silva; Liu, Shikai; Pértille, Fábio; Perazza, Caio Augusto; Villela, Priscilla Marqui Schmidt; de Almeida-Val, Vera Maria Fonseca; Hilsdorf, Alexandre Wagner Silva; Liu, Zhanjiang; Coutinho, Luiz Lehmann

2017-01-01

Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping by sequencing method. In all, 68,584 SNPs were initially identified using minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups which allowed for functional SNP annotations. The large-scale SNP discovery obtained here, allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs. PMID:28387238
Genome survey and high-density genetic map construction provide genomic and genetic resources for the Pacific White Shrimp Litopenaeus vannamei

PubMed Central

Yu, Yang; Zhang, Xiaojun; Yuan, Jianbo; Li, Fuhua; Chen, Xiaohan; Zhao, Yongzhen; Huang, Long; Zheng, Hongkun; Xiang, Jianhai

2015-01-01

The Pacific white shrimp Litopenaeus vannamei is the dominant crustacean species in global seafood mariculture. Understanding the genome and genetic architecture is useful for deciphering complex traits and accelerating the breeding program in shrimp. In this study, a genome survey was conducted and a high-density linkage map was constructed using a next-generation sequencing approach. The genome survey was used to identify preliminary genome characteristics and to generate a rough reference for linkage map construction. De novo SNP discovery resulted in 25,140 polymorphic markers. A total of 6,359 high-quality markers were selected for linkage map construction based on marker coverage among individuals and read depths. For the linkage map, a total of 6,146 markers spanning 4,271.43 cM were mapped to 44 sex-averaged linkage groups, with an average marker distance of 0.7 cM. An integration analysis linked 5,885 genome scaffolds and 1,504 BAC clones to the linkage map. Based on the high-density linkage map, several QTLs for body weight and body length were detected. This high-density genetic linkage map reveals basic genomic architecture and will be useful for comparative genomics research, genome assembly and genetic improvement of L. vannamei and other penaeid shrimp species. PMID:26503227
A high-density, multi-parental SNP genetic map on apple validates a new mapping approach for outcrossing species.

PubMed

Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma Jj; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco Cam; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric

2016-01-01

Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple ( Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species.
A high-density, multi-parental SNP genetic map on apple validates a new mapping approach for outcrossing species

PubMed Central

Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma JJ; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco CAM; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric

2016-01-01

Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species. PMID:27917289
The utility of low-density genotyping for imputation in the Thoroughbred horse

PubMed Central

2014-01-01

Background Despite the dramatic reduction in the cost of high-density genotyping that has occurred over the last decade, it remains one of the limiting factors for obtaining the large datasets required for genomic studies of disease in the horse. In this study, we investigated the potential for low-density genotyping and subsequent imputation to address this problem. Results Using the haplotype phasing and imputation program, BEAGLE, it is possible to impute genotypes from low- to high-density (50K) in the Thoroughbred horse with reasonable to high accuracy. Analysis of the sources of variation in imputation accuracy revealed dependence both on the minor allele frequency of the single nucleotide polymorphisms (SNPs) being imputed and on the underlying linkage disequilibrium structure. Whereas equidistant spacing of the SNPs on the low-density panel worked well, optimising SNP selection to increase their minor allele frequency was advantageous, even when the panel was subsequently used in a population of different geographical origin. Replacing base pair position with linkage disequilibrium map distance reduced the variation in imputation accuracy across SNPs. Whereas a 1K SNP panel was generally sufficient to ensure that more than 80% of genotypes were correctly imputed, other studies suggest that a 2K to 3K panel is more efficient to minimize the subsequent loss of accuracy in genomic prediction analyses. The relationship between accuracy and genotyping costs for the different low-density panels, suggests that a 2K SNP panel would represent good value for money. Conclusions Low-density genotyping with a 2K SNP panel followed by imputation provides a compromise between cost and accuracy that could promote more widespread genotyping, and hence the use of genomic information in horses. In addition to offering a low cost alternative to high-density genotyping, imputation provides a means to combine datasets from different genotyping platforms, which is becoming necessary since researchers are starting to use the recently developed equine 70K SNP chip. However, more work is needed to evaluate the impact of between-breed differences on imputation accuracy. PMID:24495673
Accurate genomic predictions for BCWD resistance in rainbow trout are achieved using low-density SNP panels: Evidence that long-range LD is a major contributing factor.

PubMed

Vallejo, Roger L; Silva, Rafael M O; Evenhuis, Jason P; Gao, Guangtu; Liu, Sixin; Parsons, James E; Martin, Kyle E; Wiens, Gregory D; Lourenco, Daniela A L; Leeds, Timothy D; Palti, Yniv

2018-06-05

Previously accurate genomic predictions for Bacterial cold water disease (BCWD) resistance in rainbow trout were obtained using a medium-density single nucleotide polymorphism (SNP) array. Here, the impact of lower-density SNP panels on the accuracy of genomic predictions was investigated in a commercial rainbow trout breeding population. Using progeny performance data, the accuracy of genomic breeding values (GEBV) using 35K, 10K, 3K, 1K, 500, 300 and 200 SNP panels as well as a panel with 70 quantitative trait loci (QTL)-flanking SNP was compared. The GEBVs were estimated using the Bayesian method BayesB, single-step GBLUP (ssGBLUP) and weighted ssGBLUP (wssGBLUP). The accuracy of GEBVs remained high despite the sharp reductions in SNP density, and even with 500 SNP accuracy was higher than the pedigree-based prediction (0.50-0.56 versus 0.36). Furthermore, the prediction accuracy with the 70 QTL-flanking SNP (0.65-0.72) was similar to the panel with 35K SNP (0.65-0.71). Genomewide linkage disequilibrium (LD) analysis revealed strong LD (r 2 ≥ 0.25) spanning on average over 1 Mb across the rainbow trout genome. This long-range LD likely contributed to the accurate genomic predictions with the low-density SNP panels. Population structure analysis supported the hypothesis that long-range LD in this population may be caused by admixture. Results suggest that lower-cost, low-density SNP panels can be used for implementing genomic selection for BCWD resistance in rainbow trout breeding programs. © 2018 The Authors. This article is a U.S. Government work and is in the public domain in the USA. Journal of Animal Breeding and Genetics published by Blackwell Verlag GmbH.

Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing

PubMed Central

2014-01-01

Background Genetic linkage maps are useful tools for mapping quantitative trait loci (QTL) influencing variation in traits of interest in a population. Genotyping-by-sequencing approaches such as Restriction-site Associated DNA sequencing (RAD-Seq) now enable the rapid discovery and genotyping of genome-wide SNP markers suitable for the development of dense SNP linkage maps, including in non-model organisms such as Atlantic salmon (Salmo salar). This paper describes the development and characterisation of a high density SNP linkage map based on SbfI RAD-Seq SNP markers from two Atlantic salmon reference families. Results Approximately 6,000 SNPs were assigned to 29 linkage groups, utilising markers from known genomic locations as anchors. Linkage maps were then constructed for the four mapping parents separately. Overall map lengths were comparable between male and female parents, but the distribution of the SNPs showed sex-specific patterns with a greater degree of clustering of sire-segregating SNPs to single chromosome regions. The maps were integrated with the Atlantic salmon draft reference genome contigs, allowing the unique assignment of ~4,000 contigs to a linkage group. 112 genome contigs mapped to two or more linkage groups, highlighting regions of putative homeology within the salmon genome. A comparative genomics analysis with the stickleback reference genome identified putative genes closely linked to approximately half of the ordered SNPs and demonstrated blocks of orthology between the Atlantic salmon and stickleback genomes. A subset of 47 RAD-Seq SNPs were successfully validated using a high-throughput genotyping assay, with a correspondence of 97% between the two assays. Conclusions This Atlantic salmon RAD-Seq linkage map is a resource for salmonid genomics research as genotyping-by-sequencing becomes increasingly common. This is aided by the integration of the SbfI RAD-Seq SNPs with existing reference maps and the draft reference genome, as well as the identification of putative genes proximal to the SNPs. Differences in the distribution of recombination events between the sexes is evident, and regions of homeology have been identified which are reflective of the recent salmonid whole genome duplication. PMID:24571138
Genetic dissection of seed oil and protein content and identification of networks associated with oil content in Brassica napus.

PubMed

Chao, Hongbo; Wang, Hao; Wang, Xiaodong; Guo, Liangxing; Gu, Jianwei; Zhao, Weiguo; Li, Baojun; Chen, Dengyan; Raboanatahiry, Nadia; Li, Maoteng

2017-04-10

High-density linkage maps can improve the precision of QTL localization. A high-density SNP-based linkage map containing 3207 markers covering 3072.7 cM of the Brassica napus genome was constructed in the KenC-8 × N53-2 (KNDH) population. A total of 67 and 38 QTLs for seed oil and protein content were identified with an average confidence interval of 5.26 and 4.38 cM, which could explain up to 22.24% and 27.48% of the phenotypic variation, respectively. Thirty-eight associated genomic regions from BSA overlapped with and/or narrowed the SOC-QTLs, further confirming the QTL mapping results based on the high-density linkage map. Potential candidates related to acyl-lipid and seed storage underlying SOC and SPC, respectively, were identified and analyzed, among which six were checked and showed expression differences between the two parents during different embryonic developmental periods. A large primary carbohydrate pathway based on potential candidates underlying SOC- and SPC-QTLs, and interaction networks based on potential candidates underlying SOC-QTLs, was constructed to dissect the complex mechanism based on metabolic and gene regulatory features, respectively. Accurate QTL mapping and potential candidates identified based on high-density linkage map and BSA analyses provide new insights into the complex genetic mechanism of oil and protein accumulation in the seeds of rapeseed.
An ultra-dense SNP linkage map for the octoploid, cultivated strawberry and its application in genetic research

USDA-ARS?s Scientific Manuscript database

We will present an ultra-dense genetic linkage map for the octoploid, cultivated strawberry (Fragaria x ananassa) consisting of over 13K Axiom® based SNP markers and 150 previously mapped reference SSR loci. The high quality of the map is demonstrated by the short sizes of each of the 28 linkage gro...
A high-density SNP genetic linkage map for the silver-lipped pearl oyster, Pinctada maxima: a valuable resource for gene localisation and marker-assisted selection.

PubMed

Jones, David B; Jerry, Dean R; Khatkar, Mehar S; Raadsma, Herman W; Zenger, Kyall R

2013-11-20

The silver-lipped pearl oyster, Pinctada maxima, is an important tropical aquaculture species extensively farmed for the highly sought "South Sea" pearls. Traditional breeding programs have been initiated for this species in order to select for improved pearl quality, but many economic traits under selection are complex, polygenic and confounded with environmental factors, limiting the accuracy of selection. The incorporation of a marker-assisted selection (MAS) breeding approach would greatly benefit pearl breeding programs by allowing the direct selection of genes responsible for pearl quality. However, before MAS can be incorporated, substantial genomic resources such as genetic linkage maps need to be generated. The construction of a high-density genetic linkage map for P. maxima is not only essential for unravelling the genomic architecture of complex pearl quality traits, but also provides indispensable information on the genome structure of pearl oysters. A total of 1,189 informative genome-wide single nucleotide polymorphisms (SNPs) were incorporated into linkage map construction. The final linkage map consisted of 887 SNPs in 14 linkage groups, spans a total genetic distance of 831.7 centimorgans (cM), and covers an estimated 96% of the P. maxima genome. Assessment of sex-specific recombination across all linkage groups revealed limited overall heterochiasmy between the sexes (i.e. 1.15:1 F/M map length ratio). However, there were pronounced localised differences throughout the linkage groups, whereby male recombination was suppressed near the centromeres compared to female recombination, but inflated towards telomeric regions. Mean values of LD for adjacent SNP pairs suggest that a higher density of markers will be required for powerful genome-wide association studies. Finally, numerous nacre biomineralization genes were localised providing novel positional information for these genes. This high-density SNP genetic map is the first comprehensive linkage map for any pearl oyster species. It provides an essential genomic tool facilitating studies investigating the genomic architecture of complex trait variation and identifying quantitative trait loci for economically important traits useful in genetic selection programs within the P. maxima pearling industry. Furthermore, this map provides a foundation for further research aiming to improve our understanding of the dynamic process of biomineralization, and pearl oyster evolution and synteny.
High-density single nucleotide polymorphism (SNP) array mapping in Brassica oleracea: identification of QTL associated with carotenoid variation in broccoli florets.

PubMed

Brown, Allan F; Yousef, Gad G; Chebrolu, Kranthi K; Byrd, Robert W; Everhart, Koyt W; Thomas, Aswathy; Reid, Robert W; Parkin, Isobel A P; Sharpe, Andrew G; Oliver, Rebekah; Guzman, Ivette; Jackson, Eric W

2014-09-01

A high-resolution genetic linkage map of B. oleracea was developed from a B. napus SNP array. The work will facilitate genetic and evolutionary studies in Brassicaceae. A broccoli population, VI-158 × BNC, consisting of 150 F2:3 families was used to create a saturated Brassica oleracea (diploid: CC) linkage map using a recently developed rapeseed (Brassica napus) (tetraploid: AACC) Illumina Infinium single nucleotide polymorphism (SNP) array. The map consisted of 547 non-redundant SNP markers spanning 948.1 cM across nine chromosomes with an average interval size of 1.7 cM. As the SNPs are anchored to the genomic reference sequence of the rapid cycling B. oleracea TO1000, we were able to estimate that the map provides 96 % coverage of the diploid genome. Carotenoid analysis of 2 years data identified 3 QTLs on two chromosomes that are associated with up to half of the phenotypic variation associated with the accumulation of total or individual compounds. By searching the genome sequences of the two related diploid species (B. oleracea and B. rapa), we further identified putative carotenoid candidate genes in the region of these QTLs. This is the first description of the use of a B. napus SNP array to rapidly construct high-density genetic linkage maps of one of the constituent diploid species. The unambiguous nature of these markers with regard to genomic sequences provides evidence to the nature of genes underlying the QTL, and demonstrates the value and impact this resource will have on Brassica research.
Genetic dissection of seed oil and protein content and identification of networks associated with oil content in Brassica napus

PubMed Central

Chao, Hongbo; Wang, Hao; Wang, Xiaodong; Guo, Liangxing; Gu, Jianwei; Zhao, Weiguo; Li, Baojun; Chen, Dengyan; Raboanatahiry, Nadia; Li, Maoteng

2017-01-01

High-density linkage maps can improve the precision of QTL localization. A high-density SNP-based linkage map containing 3207 markers covering 3072.7 cM of the Brassica napus genome was constructed in the KenC-8 × N53-2 (KNDH) population. A total of 67 and 38 QTLs for seed oil and protein content were identified with an average confidence interval of 5.26 and 4.38 cM, which could explain up to 22.24% and 27.48% of the phenotypic variation, respectively. Thirty-eight associated genomic regions from BSA overlapped with and/or narrowed the SOC-QTLs, further confirming the QTL mapping results based on the high-density linkage map. Potential candidates related to acyl-lipid and seed storage underlying SOC and SPC, respectively, were identified and analyzed, among which six were checked and showed expression differences between the two parents during different embryonic developmental periods. A large primary carbohydrate pathway based on potential candidates underlying SOC- and SPC-QTLs, and interaction networks based on potential candidates underlying SOC-QTLs, was constructed to dissect the complex mechanism based on metabolic and gene regulatory features, respectively. Accurate QTL mapping and potential candidates identified based on high-density linkage map and BSA analyses provide new insights into the complex genetic mechanism of oil and protein accumulation in the seeds of rapeseed. PMID:28393910
Construction and analysis of a high-density genetic linkage map in cabbage (Brassica oleracea L. var. capitata)

PubMed Central

2012-01-01

Background Brassica oleracea encompass a family of vegetables and cabbage that are among the most widely cultivated crops. In 2009, the B. oleracea Genome Sequencing Project was launched using next generation sequencing technology. None of the available maps were detailed enough to anchor the sequence scaffolds for the Genome Sequencing Project. This report describes the development of a large number of SSR and SNP markers from the whole genome shotgun sequence data of B. oleracea, and the construction of a high-density genetic linkage map using a double haploid mapping population. Results The B. oleracea high-density genetic linkage map that was constructed includes 1,227 markers in nine linkage groups spanning a total of 1197.9 cM with an average of 0.98 cM between adjacent loci. There were 602 SSR markers and 625 SNP markers on the map. The chromosome with the highest number of markers (186) was C03, and the chromosome with smallest number of markers (99) was C09. Conclusions This first high-density map allowed the assembled scaffolds to be anchored to pseudochromosomes. The map also provides useful information for positional cloning, molecular breeding, and integration of information of genes and traits in B. oleracea. All the markers on the map will be transferable and could be used for the construction of other genetic maps. PMID:23033896
SNP Discovery and Linkage Map Construction in Cultivated Tomato

PubMed Central

Shirasawa, Kenta; Isobe, Sachiko; Hirakawa, Hideki; Asamizu, Erika; Fukuoka, Hiroyuki; Just, Daniel; Rothan, Christophe; Sasamoto, Shigemi; Fujishiro, Tsunakazu; Kishida, Yoshie; Kohara, Mitsuyo; Tsuruoka, Hisano; Wada, Tsuyuko; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

2010-01-01

Few intraspecific genetic linkage maps have been reported for cultivated tomato, mainly because genetic diversity within Solanum lycopersicum is much less than that between tomato species. Single nucleotide polymorphisms (SNPs), the most abundant source of genomic variation, are the most promising source of polymorphisms for the construction of linkage maps for closely related intraspecific lines. In this study, we developed SNP markers based on expressed sequence tags for the construction of intraspecific linkage maps in tomato. Out of the 5607 SNP positions detected through in silico analysis, 1536 were selected for high-throughput genotyping of two mapping populations derived from crosses between ‘Micro-Tom’ and either ‘Ailsa Craig’ or ‘M82’. A total of 1137 markers, including 793 out of the 1338 successfully genotyped SNPs, along with 344 simple sequence repeat and intronic polymorphism markers, were mapped onto two linkage maps, which covered 1467.8 and 1422.7 cM, respectively. The SNP markers developed were then screened against cultivated tomato lines in order to estimate the transferability of these SNPs to other breeding materials. The molecular markers and linkage maps represent a milestone in the genomics and genetics, and are the first step toward molecular breeding of cultivated tomato. Information on the DNA markers, linkage maps, and SNP genotypes for these tomato lines is available at http://www.kazusa.or.jp/tomato/. PMID:21044984
Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp.

PubMed Central

Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.

2015-01-01

High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569
Genome-Wide QTL Mapping for Wheat Processing Quality Parameters in a Gaocheng 8901/Zhoumai 16 Recombinant Inbred Line Population.

PubMed

Jin, Hui; Wen, Weie; Liu, Jindong; Zhai, Shengnan; Zhang, Yan; Yan, Jun; Liu, Zhiyong; Xia, Xianchun; He, Zhonghu

2016-01-01

Dough rheological and starch pasting properties play an important role in determining processing quality in bread wheat (Triticum aestivum L.). In the present study, a recombinant inbred line (RIL) population derived from a Gaocheng 8901/Zhoumai 16 cross grown in three environments was used to identify quantitative trait loci (QTLs) for dough rheological and starch pasting properties evaluated by Mixograph, Rapid Visco-Analyzer (RVA), and Mixolab parameters using the wheat 90 and 660 K single nucleotide polymorphism (SNP) chip assays. A high-density linkage map constructed with 46,961 polymorphic SNP markers from the wheat 90 and 660 K SNP assays spanned a total length of 4121 cM, with an average chromosome length of 196.2 cM and marker density of 0.09 cM/marker; 6596 new SNP markers were anchored to the bread wheat linkage map, with 1046 and 5550 markers from the 90 and 660 K SNP assays, respectively. Composite interval mapping identified 119 additive QTLs on 20 chromosomes except 4D; among them, 15 accounted for more than 10% of the phenotypic variation across two or three environments. Twelve QTLs for Mixograph parameters, 17 for RVA parameters and 55 for Mixolab parameters were new. Eleven QTL clusters were identified. The closely linked SNP markers can be used in marker-assisted wheat breeding in combination with the Kompetitive Allele Specific PCR (KASP) technique for improvement of processing quality in bread wheat.
Genome-Wide QTL Mapping for Wheat Processing Quality Parameters in a Gaocheng 8901/Zhoumai 16 Recombinant Inbred Line Population

PubMed Central

Jin, Hui; Wen, Weie; Liu, Jindong; Zhai, Shengnan; Zhang, Yan; Yan, Jun; Liu, Zhiyong; Xia, Xianchun; He, Zhonghu

2016-01-01

Dough rheological and starch pasting properties play an important role in determining processing quality in bread wheat (Triticum aestivum L.). In the present study, a recombinant inbred line (RIL) population derived from a Gaocheng 8901/Zhoumai 16 cross grown in three environments was used to identify quantitative trait loci (QTLs) for dough rheological and starch pasting properties evaluated by Mixograph, Rapid Visco-Analyzer (RVA), and Mixolab parameters using the wheat 90 and 660 K single nucleotide polymorphism (SNP) chip assays. A high-density linkage map constructed with 46,961 polymorphic SNP markers from the wheat 90 and 660 K SNP assays spanned a total length of 4121 cM, with an average chromosome length of 196.2 cM and marker density of 0.09 cM/marker; 6596 new SNP markers were anchored to the bread wheat linkage map, with 1046 and 5550 markers from the 90 and 660 K SNP assays, respectively. Composite interval mapping identified 119 additive QTLs on 20 chromosomes except 4D; among them, 15 accounted for more than 10% of the phenotypic variation across two or three environments. Twelve QTLs for Mixograph parameters, 17 for RVA parameters and 55 for Mixolab parameters were new. Eleven QTL clusters were identified. The closely linked SNP markers can be used in marker-assisted wheat breeding in combination with the Kompetitive Allele Specific PCR (KASP) technique for improvement of processing quality in bread wheat. PMID:27486464
Linkage disequilibrium levels in Bos indicus and Bos taurus cattle using medium and high density SNP chip data and different minor allele frequency distributions

USDA-ARS?s Scientific Manuscript database

Linkage disequilibrium (LD), the observed correlation between alleles at different loci in the genome, is a determinant parameter in many applications of molecular genetics. With the wider use of genomic technologies in animal breeding and animal genetics, it is worthwhile revising and improving the...
A high density integrated genetic linkage map of soybean and the development of a 1,536 Universal Soy Linkage Panel for QTL mapping

USDA-ARS?s Scientific Manuscript database

Single nucleotide polymorphisms (SNPs) are the marker of choice for many researchers due to their abundance and the high-throughput methods available for their multiplex analysis. Only recently have SNP markers been available to researchers in soybean [Glycine max (L.) Merr.] with the release of th...
A High-Density Consensus Map of Common Wheat Integrating Four Mapping Populations Scanned by the 90K SNP Array

PubMed Central

Wen, Weie; He, Zhonghu; Gao, Fengmei; Liu, Jindong; Jin, Hui; Zhai, Shengnan; Qu, Yanying; Xia, Xianchun

2017-01-01

A high-density consensus map is a powerful tool for gene mapping, cloning and molecular marker-assisted selection in wheat breeding. The objective of this study was to construct a high-density, single nucleotide polymorphism (SNP)-based consensus map of common wheat (Triticum aestivum L.) by integrating genetic maps from four recombinant inbred line populations. The populations were each genotyped using the wheat 90K Infinium iSelect SNP assay. A total of 29,692 SNP markers were mapped on 21 linkage groups corresponding to 21 hexaploid wheat chromosomes, covering 2,906.86 cM, with an overall marker density of 10.21 markers/cM. Compared with the previous maps based on the wheat 90K SNP chip detected 22,736 (76.6%) of the SNPs with consistent chromosomal locations, whereas 1,974 (6.7%) showed different chromosomal locations, and 4,982 (16.8%) were newly mapped. Alignment of the present consensus map and the wheat expressed sequence tags (ESTs) Chromosome Bin Map enabled assignment of 1,221 SNP markers to specific chromosome bins and 819 ESTs were integrated into the consensus map. The marker orders of the consensus map were validated based on physical positions on the wheat genome with Spearman rank correlation coefficients ranging from 0.69 (4D) to 0.97 (1A, 4B, 5B, and 6A), and were also confirmed by comparison with genetic position on the previously 40K SNP consensus map with Spearman rank correlation coefficients ranging from 0.84 (6D) to 0.99 (6A). Chromosomal rearrangements reported previously were confirmed in the present consensus map and new putative rearrangements were identified. In addition, an integrated consensus map was developed through the combination of five published maps with ours, containing 52,607 molecular markers. The consensus map described here provided a high-density SNP marker map and a reliable order of SNPs, representing a step forward in mapping and validation of chromosomal locations of SNPs on the wheat 90K array. Moreover, it can be used as a reference for quantitative trait loci (QTL) mapping to facilitate exploitation of genes and QTL in wheat breeding. PMID:28848588
Polymorphisms in the Estrogen Receptor β (ESR2) Gene Are Associated with Bone Mineral Density in Caucasian Men and Women

PubMed Central

Ichikawa, Shoji; Koller, Daniel L.; Peacock, Munro; Johnson, Michelle L.; Lai, Dongbing; Hui, Siu L.; Johnston, C. Conrad; Foroud, Tatiana M.; Econs, Michael J.

2007-01-01

Context A major determinant of osteoporotic fractures is peak bone mineral density (BMD), which is a highly heritable trait. Recently, we identified significant linkage for hip BMD in premenopausal sister pairs at chromosome 14q (LOD score = 3.5), where the estrogen receptor β gene (ESR2) is located. Objective The objective of the study was to determine whether ESR2 polymorphisms are associated with normal BMD variation. Design This was a population‐based genetic association study, using 11 single nucleotide polymorphisms (SNPs) distributed across the ESR2 gene. Setting The study was conducted at an academic research laboratory and medical center. Patients and Other Participants A total of 411 healthy men (aged 18–61 yr) and 1291 healthy premenopausal women (aged 20–50 yr) living in Indiana participated in the study. Intervention(s) There were no interventions. Main Outcome Measure(s) The main outcome measures were SNP genotype distributions and their association with BMD at the femoral neck and lumbar spine. Results Significant association of spine BMD was found with three SNPs in men and one SNP in women (P ≤ 0.05). The conditional linkage analysis using the ESR2 haplotypes showed that the ESR2 gene accounts for, at most, 18% of the original linkage. Conclusions ESR2 polymorphisms are significantly associated with bone mass in both men and women. However, the ESR2 gene is not entirely responsible for our original linkage, and an additional gene(s) in chromosome 14q contributes to the determination of BMD. PMID:16118344
Genome-Wide Linkage and Association Analysis Identifies Major Gene Loci for Guttural Pouch Tympany in Arabian and German Warmblood Horses

PubMed Central

Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

2012-01-01

Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16–26 Mb and 34–55 Mb and for Arabian on ECA15 at 64–65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT. PMID:22848553
A High-Resolution SNP Array-Based Linkage Map Anchors a New Domestic Cat Draft Genome Assembly and Provides Detailed Patterns of Recombination.

PubMed

Li, Gang; Hillier, LaDeana W; Grahn, Robert A; Zimin, Aleksey V; David, Victor A; Menotti-Raymond, Marilyn; Middleton, Rondo; Hannah, Steven; Hendrickson, Sher; Makunin, Alex; O'Brien, Stephen J; Minx, Pat; Wilson, Richard K; Lyons, Leslie A; Warren, Wesley C; Murphy, William J

2016-06-01

High-resolution genetic and physical maps are invaluable tools for building accurate genome assemblies, and interpreting results of genome-wide association studies (GWAS). Previous genetic and physical maps anchored good quality draft assemblies of the domestic cat genome, enabling the discovery of numerous genes underlying hereditary disease and phenotypes of interest to the biomedical science and breeding communities. However, these maps lacked sufficient marker density to order thousands of shorter scaffolds in earlier assemblies, which instead relied heavily on comparative mapping with related species. A high-resolution map would aid in validating and ordering chromosome scaffolds from existing and new genome assemblies. Here, we describe a high-resolution genetic linkage map of the domestic cat genome based on genotyping 453 domestic cats from several multi-generational pedigrees on the Illumina 63K SNP array. The final maps include 58,055 SNP markers placed relative to 6637 markers with unique positions, distributed across all autosomes and the X chromosome. Our final sex-averaged maps span a total autosomal length of 4464 cM, the longest described linkage map for any mammal, confirming length estimates from a previous microsatellite-based map. The linkage map was used to order and orient the scaffolds from a substantially more contiguous domestic cat genome assembly (Felis catus v8.0), which incorporated ∼20 × coverage of Illumina fragment reads. The new genome assembly shows substantial improvements in contiguity, with a nearly fourfold increase in N50 scaffold size to 18 Mb. We use this map to report probable structural errors in previous maps and assemblies, and to describe features of the recombination landscape, including a massive (∼50 Mb) recombination desert (of virtually zero recombination) on the X chromosome that parallels a similar desert on the porcine X chromosome in both size and physical location. Copyright © 2016 Li et al.
An imputed genotype resource for the laboratory mouse

PubMed Central

Szatkiewicz, Jin P.; Beane, Glen L.; Ding, Yueming; Hutchins, Lucie; de Villena, Fernando Pardo-Manuel; Churchill, Gary A.

2009-01-01

We have created a high-density SNP resource encompassing 7.87 million polymorphic loci across 49 inbred mouse strains of the laboratory mouse by combining data available from public databases and training a hidden Markov model to impute missing genotypes in the combined data. The strong linkage disequilibrium found in dense sets of SNP markers in the laboratory mouse provides the basis for accurate imputation. Using genotypes from eight independent SNP resources, we empirically validated the quality of the imputed genotypes and demonstrate that they are highly reliable for most inbred strains. The imputed SNP resource will be useful for studies of natural variation and complex traits. It will facilitate association study designs by providing high density SNP genotypes for large numbers of mouse strains. We anticipate that this resource will continue to evolve as new genotype data become available for laboratory mouse strains. The data are available for bulk download or query at http://cgd.jax.org/. PMID:18301946
Development of new SNP derived cleaved amplified polymorphic sequence marker set and its successful utilization in the genetic analysis of seed color variation in barley.

PubMed

Bungartz, Annemarie; Klaus, Marius; Mathew, Boby; Léon, Jens; Naz, Ali Ahmad

2016-03-01

The aim of the present study was to develop a new cost effective PCR based CAPS marker set using advantages of high-throughput SNP genotyping. Initially, SNP survey was made using 20 diverse barley genotypes via 9k iSelect array genotyping that resulted in 6334 polymorphic SNP markers. Principle component analysis using this marker data showed fine differentiation of barley diverse gene pool. Till this end, we developed 200 SNP derived CAPS markers distributed across the genome covering around 991cM with an average marker density of 5.09cM. Further, we genotyped 68 CAPS markers in an F2 population (Cheri×ICB181160) segregating for seed color variation in barley. Genetic mapping of seed color revealed putative linkage of single nuclear gene on chromosome 1H. These findings showed the proof of concept for the development and utility of a newer cost effective genomic tool kit to analyze broader genetic resources of barley worldwide. Copyright © 2016 Elsevier Inc. All rights reserved.
Construction of a high-density, high-resolution genetic map and its integration with BAC-based physical map in channel catfish

PubMed Central

Li, Yun; Liu, Shikai; Qin, Zhenkui; Waldbieser, Geoff; Wang, Ruijia; Sun, Luyang; Bao, Lisui; Danzmann, Roy G.; Dunham, Rex; Liu, Zhanjiang

2015-01-01

Construction of genetic linkage map is essential for genetic and genomic studies. Recent advances in sequencing and genotyping technologies made it possible to generate high-density and high-resolution genetic linkage maps, especially for the organisms lacking extensive genomic resources. In the present work, we constructed a high-density and high-resolution genetic map for channel catfish with three large resource families genotyped using the catfish 250K single-nucleotide polymorphism (SNP) array. A total of 54,342 SNPs were placed on the linkage map, which to our knowledge had the highest marker density among aquaculture species. The estimated genetic size was 3,505.4 cM with a resolution of 0.22 cM for sex-averaged genetic map. The sex-specific linkage maps spanned a total of 4,495.1 cM in females and 2,593.7 cM in males, presenting a ratio of 1.7 : 1 between female and male in recombination fraction. After integration with the previously established physical map, over 87% of physical map contigs were anchored to the linkage groups that covered a physical length of 867 Mb, accounting for ∼90% of the catfish genome. The integrated map provides a valuable tool for validating and improving the catfish whole-genome assembly and facilitates fine-scale QTL mapping and positional cloning of genes responsible for economically important traits. PMID:25428894

Construction of an ultrahigh-density genetic linkage map for Jatropha curcas L. and identification of QTL for fruit yield.

PubMed

Xia, Zhiqiang; Zhang, Shengkui; Wen, Mingfu; Lu, Cheng; Sun, Yufang; Zou, Meiling; Wang, Wenquan

2018-01-01

As an important biofuel plant, the demand for higher yield Jatropha curcas L. is rapidly increasing. However, genetic analysis of Jatropha and molecular breeding for higher yield have been hampered by the limited number of molecular markers available. An ultrahigh-density linkage map for a Jatropha mapping population of 153 individuals was constructed and covered 1380.58 cM of the Jatropha genome, with average marker density of 0.403 cM. The genetic linkage map consisted of 3422 SNP and indel markers, which clustered into 11 linkage groups. With this map, 13 repeatable QTLs (reQTLs) for fruit yield traits were identified. Ten reQTLs, qNF - 1 , qNF - 2a , qNF - 2b , qNF - 2c , qNF - 3 , qNF - 4 , qNF - 6 , qNF - 7a , qNF - 7b and qNF - 8, that control the number of fruits (NF) mapped to LGs 1, 2, 3, 4, 6, 7 and 8, whereas three reQTLs, qTWF - 1 , qTWF - 2 and qTWF - 3, that control the total weight of fruits (TWF) mapped to LGs 1, 2 and 3, respectively. It is interesting that there are two candidate critical genes, which may regulate Jatropha fruit yield. We also identified three pleiotropic reQTL pairs associated with both the NF and TWF traits. This study is the first to report an ultrahigh-density Jatropha genetic linkage map construction, and the markers used in this study showed great potential for QTL mapping. Thirteen fruit-yield reQTLs and two important candidate genes were identified based on this linkage map. This genetic linkage map will be a useful tool for the localization of other economically important QTLs and candidate genes for Jatropha .
Development and Validation of a High-Density SNP Genotyping Array for African Oil Palm.

PubMed

Kwong, Qi Bin; Teh, Chee Keng; Ong, Ai Ling; Heng, Huey Ying; Lee, Heng Leng; Mohamed, Mohaimi; Low, Joel Zi-Bin; Apparow, Sukganah; Chew, Fook Tim; Mayes, Sean; Kulaveerasingam, Harikrishna; Tammi, Martti; Appleton, David Ross

2016-08-01

High-density single nucleotide polymorphism (SNP) genotyping arrays are powerful tools that can measure the level of genetic polymorphism within a population. To develop a whole-genome SNP array for oil palms, SNP discovery was performed using deep resequencing of eight libraries derived from 132 Elaeis guineensis and Elaeis oleifera palms belonging to 59 origins, resulting in the discovery of >3 million putative SNPs. After SNP filtering, the Illumina OP200K custom array was built with 170 860 successful probes. Phenetic clustering analysis revealed that the array could distinguish between palms of different origins in a way consistent with pedigree records. Genome-wide linkage disequilibrium declined more slowly for the commercial populations (ranging from 120 kb at r(2) = 0.43 to 146 kb at r(2) = 0.50) when compared with the semi-wild populations (19.5 kb at r(2) = 0.22). Genetic fixation mapping comparing the semi-wild and commercial population identified 321 selective sweeps. A genome-wide association study (GWAS) detected a significant peak on chromosome 2 associated with the polygenic component of the shell thickness trait (based on the trait shell-to-fruit; S/F %) in tenera palms. Testing of a genomic selection model on the same trait resulted in good prediction accuracy (r = 0.65) with 42% of the S/F % variation explained. The first high-density SNP genotyping array for oil palm has been developed and shown to be robust for use in genetic studies and with potential for developing early trait prediction to shorten the oil palm breeding cycle. Copyright © 2016 The Author. Published by Elsevier Inc. All rights reserved.
Linkage disequilibrium, SNP frequency change due to selection, and association mapping in popcorn chromosome regions containing QTLs for quality traits

PubMed Central

Paes, Geísa Pinheiro; Viana, José Marcelo Soriano; Silva, Fabyano Fonseca e; Mundim, Gabriel Borges

2016-01-01

Abstract The objectives of this study were to assess linkage disequilibrium (LD) and selection-induced changes in single nucleotide polymorphism (SNP) frequency, and to perform association mapping in popcorn chromosome regions containing quantitative trait loci (QTLs) for quality traits. Seven tropical and two temperate popcorn populations were genotyped for 96 SNPs chosen in chromosome regions containing QTLs for quality traits. The populations were phenotyped for expansion volume, 100-kernel weight, kernel sphericity, and kernel density. The LD statistics were the difference between the observed and expected haplotype frequencies (D), the proportion of D relative to the expected maximum value in the population, and the square of the correlation between the values of alleles at two loci. Association mapping was based on least squares and Bayesian approaches. In the tropical populations, D-values greater than 0.10 were observed for SNPs separated by 100-150 Mb, while most of the D-values in the temperate populations were less than 0.05. Selection for expansion volume indirectly led to increase in LD values, population differentiation, and significant changes in SNP frequency. Some associations were observed for expansion volume and the other quality traits. The candidate genes are involved with starch, storage protein, lipid, and cell wall polysaccharides synthesis. PMID:27007903
Linkage disequilibrium, SNP frequency change due to selection, and association mapping in popcorn chromosome regions containing QTLs for quality traits.

PubMed

Paes, Geísa Pinheiro; Viana, José Marcelo Soriano; Silva, Fabyano Fonseca E; Mundim, Gabriel Borges

2016-03-01

The objectives of this study were to assess linkage disequilibrium (LD) and selection-induced changes in single nucleotide polymorphism (SNP) frequency, and to perform association mapping in popcorn chromosome regions containing quantitative trait loci (QTLs) for quality traits. Seven tropical and two temperate popcorn populations were genotyped for 96 SNPs chosen in chromosome regions containing QTLs for quality traits. The populations were phenotyped for expansion volume, 100-kernel weight, kernel sphericity, and kernel density. The LD statistics were the difference between the observed and expected haplotype frequencies (D), the proportion of D relative to the expected maximum value in the population, and the square of the correlation between the values of alleles at two loci. Association mapping was based on least squares and Bayesian approaches. In the tropical populations, D-values greater than 0.10 were observed for SNPs separated by 100-150 Mb, while most of the D-values in the temperate populations were less than 0.05. Selection for expansion volume indirectly led to increase in LD values, population differentiation, and significant changes in SNP frequency. Some associations were observed for expansion volume and the other quality traits. The candidate genes are involved with starch, storage protein, lipid, and cell wall polysaccharides synthesis.
Pharmacogenetics.

PubMed

Roses, A D

2001-10-01

Pharmacogenetics is the variability of drug response due to inherited characteristics in individuals. Drug metabolizing enzymes have been studied for decades, first as chemical reactions and, more recently, as specific polymorphisms of known molecules. With the availability of whole-genome single-nucleotide polymorphism (SNP) maps, it will soon be possible to create an SNP profile for patients who experience adverse events (AEs) or who respond clinically to the medicine (efficacy). Proof-of-principle experiments have demonstrated that high density SNP maps in chromosomal regions of genetic linkage facilitate the identification of susceptibility disease genes. Whole-genome SNP mapping analyses aimed at determining linkage disequilibrium (LD) profiles along an ordered human genome backbone are in progress. SNP 'fingerprints' or SNP PRINTs(sm) will be used to identify patients at greater risk of an AE, or those patients with a greater chance of responding to a medicine. As LD maps for various ethnic populations are constructed, the number of SNPs necessary to measure for an individual will decrease. Standardized pharmacogenetic maps for drug registration and post-marketing surveillance will result in safer, more effective and more cost-efficient medicines. The timing of these pharmacogenetic applications will occur over the next 5 years. In contrast, the benefits of pharmacogenomic applications such as the identification of new tractable targets will not be visible as new medicines for 7-12 years, due to the lengthy drug development and registration processes.
Large-Scale SNP Discovery and Genotyping for Constructing a High-Density Genetic Map of Tea Plant Using Specific-Locus Amplified Fragment Sequencing (SLAF-seq)

PubMed Central

Ma, Chun-Lei; Jin, Ji-Qiang; Li, Chun-Fang; Wang, Rong-Kai; Zheng, Hong-Kun; Yao, Ming-Zhe; Chen, Liang

2015-01-01

Genetic maps are important tools in plant genomics and breeding. The present study reports the large-scale discovery of single nucleotide polymorphisms (SNPs) for genetic map construction in tea plant. We developed a total of 6,042 valid SNP markers using specific-locus amplified fragment sequencing (SLAF-seq), and subsequently mapped them into the previous framework map. The final map contained 6,448 molecular markers, distributing on fifteen linkage groups corresponding to the number of tea plant chromosomes. The total map length was 3,965 cM, with an average inter-locus distance of 1.0 cM. This map is the first SNP-based reference map of tea plant, as well as the most saturated one developed to date. The SNP markers and map resources generated in this study provide a wealth of genetic information that can serve as a foundation for downstream genetic analyses, such as the fine mapping of quantitative trait loci (QTL), map-based cloning, marker-assisted selection, and anchoring of scaffolds to facilitate the process of whole genome sequencing projects for tea plant. PMID:26035838
A ddRAD Based Linkage Map of the Cultivated Strawberry, Fragaria xananassa

PubMed Central

Davik, Jahn; Sargent, Daniel James; Brurberg, May Bente; Lien, Sigbjørn; Kent, Matthew; Alsheikh, Muath

2015-01-01

The cultivated strawberry (Fragaria ×ananassa Duch.) is an allo-octoploid considered difficult to disentangle genetically due to its four relatively similar sub-genomic chromosome sets. This has been alleviated by the recent release of the strawberry IStraw90 whole genome genotyping array. However, array resolution relies on the genotypes used in the array construction and may be of limited general use. SNP detection based on reduced genomic sequencing approaches has the potential of providing better coverage in cases where the studied genotypes are only distantly related from the SNP array’s construction foundation. Here we have used double digest restriction-associated DNA sequencing (ddRAD) to identify SNPs in a 145 seedling F1 hybrid population raised from the cross between the cultivars Sonata (♀) and Babette (♂). A linkage map containing 907 markers which spanned 1,581.5 cM across 31 linkage groups representing the 28 chromosomes of the species. Comparing the physical span of the SNP markers with the F. vesca genome sequence, the linkage groups resolved covered 79% of the estimated 830 Mb of the F. ×ananassa genome. Here, we have developed the first linkage map for F. ×ananassa using ddRAD and show that this technique and other related techniques are useful tools for linkage map development and downstream genetic studies in the octoploid strawberry. PMID:26398886
No association between SNP rs498055 on chromosome 10 and late-onset Alzheimer disease in multiple datasets.

PubMed

Liang, Xueying; Schnetz-Boutaud, Nathalie; Bartlett, Jackie; Allen, Melissa J; Gwirtsman, Harry; Schmechel, Don E; Carney, Regina M; Gilbert, John R; Pericak-Vance, Margaret A; Haines, Jonathan L

2008-01-01

SNP rs498055 in the predicted gene LOC439999 on chromosome 10 was recently identified as being strongly associated with late-onset Alzheimer disease (LOAD). This SNP falls within a chromosomal region that has engendered continued interest generated from both preliminary genetic linkage and candidate gene studies. To independently evaluate this interesting candidate SNP we examined four independent datasets, three family-based and one case-control. All the cases were late-onset AD Caucasian patients with minimum age at onset >or= 60 years. None of the three family samples or the combined family-based dataset showed association in either allelic or genotypic family-based association tests at p < 0.05. Both original and OSA two-point LOD scores were calculated. However, there was no evidence indicating linkage no matter what covariates were applied (the highest LOD score was 0.82). The case-control dataset did not demonstrate any association between this SNP and AD (all p-values > 0.52). Our results do not confirm the previous association, but are consistent with a more recent negative association result that used family-based association tests to examine the effect of this SNP in two family datasets. Thus we conclude that rs498055 is not associated with an increased risk of LOAD.
Genome-wide patterns of recombination, linkage disequilibrium and nucleotide diversity from pooled resequencing and single nucleotide polymorphism genotyping unlock the evolutionary history of Eucalyptus grandis.

PubMed

Silva-Junior, Orzenil B; Grattapaglia, Dario

2015-11-01

We used high-density single nucleotide polymorphism (SNP) data and whole-genome pooled resequencing to examine the landscape of population recombination (ρ) and nucleotide diversity (ϴw ), assess the extent of linkage disequilibrium (r(2) ) and build the highest density linkage maps for Eucalyptus. At the genome-wide level, linkage disequilibrium (LD) decayed within c. 4-6 kb, slower than previously reported from candidate gene studies, but showing considerable variation from absence to complete LD up to 50 kb. A sharp decrease in the estimate of ρ was seen when going from short to genome-wide inter-SNP distances, highlighting the dependence of this parameter on the scale of observation adopted. Recombination was correlated with nucleotide diversity, gene density and distance from the centromere, with hotspots of recombination enriched for genes involved in chemical reactions and pathways of the normal metabolic processes. The high nucleotide diversity (ϴw = 0.022) of E. grandis revealed that mutation is more important than recombination in shaping its genomic diversity (ρ/ϴw = 0.645). Chromosome-wide ancestral recombination graphs allowed us to date the split of E. grandis (1.7-4.8 million yr ago) and identify a scenario for the recent demographic history of the species. Our results have considerable practical importance to Genome Wide Association Studies (GWAS), while indicating bright prospects for genomic prediction of complex phenotypes in eucalypt breeding. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
High-density genetic linkage map construction by F2 populations and QTL analysis of early-maturity traits in upland cotton (Gossypium hirsutum L.).

PubMed

Li, Libei; Zhao, Shuqi; Su, Junji; Fan, Shuli; Pang, Chaoyou; Wei, Hengling; Wang, Hantao; Gu, Lijiao; Zhang, Chi; Liu, Guoyuan; Yu, Dingwei; Liu, Qibao; Zhang, Xianlong; Yu, Shuxun

2017-01-01

Due to China's rapidly increasing population, the total arable land area has dramatically decreased; as a consequence, the competition for farming land allocated for grain and cotton production has become fierce. Therefore, to overcome the existing contradiction between cotton grain and fiber production and the limited farming land, development of early-maturing cultivars is necessary. In this research, a high-density linkage map of upland cotton was constructed using genotyping by sequencing (GBS) to discover single nucleotide polymorphism (SNP) markers associated with early maturity in 170 F2 individuals derived from a cross between LU28 and ZHONG213. The high-density genetic map, which was composed of 3978 SNP markers across the 26 cotton chromosomes, spanned 2480 cM with an average genetic distance of 0.62 cM. Collinearity analysis showed that the genetic map was of high quality and accurate and agreed well with the Gossypium hirsutum reference genome. Based on this high-density linkage map, QTL analysis was performed on cotton early-maturity traits, including FT, FBP, WGP, NFFB, HNFFB and PH. A total 47 QTLs for the six traits were detected; each of these QTLs explained between 2.61% and 32.57% of the observed phenotypic variation. A major region controlling early-maturity traits in Gossypium hirsutum was identified for FT, FBP, WGP, NFFB and HNFFB on chromosome D03. QTL analyses revealed that phenotypic variation explained (PVE) ranged from 10.42% to 32.57%. Two potential candidate genes, Gh_D03G0885 and Gh_D03G0922, were predicted in a stable QTL region and had higher expression levels in the early-maturity variety ZHONG213 than in the late-maturity variety LU28. However, further evidence is required for functional validation. This study could provide useful information for the dissection of early-maturity traits and guide valuable genetic loci for molecular-assisted selection (MAS) in cotton breeding.
High-density genetic linkage map construction by F2 populations and QTL analysis of early-maturity traits in upland cotton (Gossypium hirsutum L.)

PubMed Central

Li, Libei; Zhao, Shuqi; Su, Junji; Fan, Shuli; Pang, Chaoyou; Wei, Hengling; Wang, Hantao; Gu, Lijiao; Zhang, Chi; Liu, Guoyuan; Yu, Dingwei; Liu, Qibao; Zhang, Xianlong

2017-01-01

Due to China’s rapidly increasing population, the total arable land area has dramatically decreased; as a consequence, the competition for farming land allocated for grain and cotton production has become fierce. Therefore, to overcome the existing contradiction between cotton grain and fiber production and the limited farming land, development of early-maturing cultivars is necessary. In this research, a high-density linkage map of upland cotton was constructed using genotyping by sequencing (GBS) to discover single nucleotide polymorphism (SNP) markers associated with early maturity in 170 F2 individuals derived from a cross between LU28 and ZHONG213. The high-density genetic map, which was composed of 3978 SNP markers across the 26 cotton chromosomes, spanned 2480 cM with an average genetic distance of 0.62 cM. Collinearity analysis showed that the genetic map was of high quality and accurate and agreed well with the Gossypium hirsutum reference genome. Based on this high-density linkage map, QTL analysis was performed on cotton early-maturity traits, including FT, FBP, WGP, NFFB, HNFFB and PH. A total 47 QTLs for the six traits were detected; each of these QTLs explained between 2.61% and 32.57% of the observed phenotypic variation. A major region controlling early-maturity traits in Gossypium hirsutum was identified for FT, FBP, WGP, NFFB and HNFFB on chromosome D03. QTL analyses revealed that phenotypic variation explained (PVE) ranged from 10.42% to 32.57%. Two potential candidate genes, Gh_D03G0885 and Gh_D03G0922, were predicted in a stable QTL region and had higher expression levels in the early-maturity variety ZHONG213 than in the late-maturity variety LU28. However, further evidence is required for functional validation. This study could provide useful information for the dissection of early-maturity traits and guide valuable genetic loci for molecular-assisted selection (MAS) in cotton breeding. PMID:28809947
A ddRAD-based genetic map and its integration with the genome assembly of Japanese eel (Anguilla japonica) provides insights into genome evolution after the teleost-specific genome duplication

PubMed Central

2014-01-01

Background Recent advancements in next-generation sequencing technology have enabled cost-effective sequencing of whole or partial genomes, permitting the discovery and characterization of molecular polymorphisms. Double-digest restriction-site associated DNA sequencing (ddRAD-seq) is a powerful and inexpensive approach to developing numerous single nucleotide polymorphism (SNP) markers and constructing a high-density genetic map. To enrich genomic resources for Japanese eel (Anguilla japonica), we constructed a ddRAD-based genetic map using an Ion Torrent Personal Genome Machine and anchored scaffolds of the current genome assembly to 19 linkage groups of the Japanese eel. Furthermore, we compared the Japanese eel genome with genomes of model fishes to infer the history of genome evolution after the teleost-specific genome duplication. Results We generated the ddRAD-based linkage map of the Japanese eel, where the maps for female and male spanned 1748.8 cM and 1294.5 cM, respectively, and were arranged into 19 linkage groups. A total of 2,672 SNP markers and 115 Simple Sequence Repeat markers provide anchor points to 1,252 scaffolds covering 151 Mb (13%) of the current genome assembly of the Japanese eel. Comparisons among the Japanese eel, medaka, zebrafish and spotted gar genomes showed highly conserved synteny among teleosts and revealed part of the eight major chromosomal rearrangement events that occurred soon after the teleost-specific genome duplication. Conclusions The ddRAD-seq approach combined with the Ion Torrent Personal Genome Machine sequencing allowed us to conduct efficient and flexible SNP genotyping. The integration of the genetic map and the assembled sequence provides a valuable resource for fine mapping and positional cloning of quantitative trait loci associated with economically important traits and for investigating comparative genomics of the Japanese eel. PMID:24669946
A ddRAD-based genetic map and its integration with the genome assembly of Japanese eel (Anguilla japonica) provides insights into genome evolution after the teleost-specific genome duplication.

PubMed

Kai, Wataru; Nomura, Kazuharu; Fujiwara, Atushi; Nakamura, Yoji; Yasuike, Motoshige; Ojima, Nobuhiko; Masaoka, Tetsuji; Ozaki, Akiyuki; Kazeto, Yukinori; Gen, Koichiro; Nagao, Jiro; Tanaka, Hideki; Kobayashi, Takanori; Ototake, Mitsuru

2014-03-26

Recent advancements in next-generation sequencing technology have enabled cost-effective sequencing of whole or partial genomes, permitting the discovery and characterization of molecular polymorphisms. Double-digest restriction-site associated DNA sequencing (ddRAD-seq) is a powerful and inexpensive approach to developing numerous single nucleotide polymorphism (SNP) markers and constructing a high-density genetic map. To enrich genomic resources for Japanese eel (Anguilla japonica), we constructed a ddRAD-based genetic map using an Ion Torrent Personal Genome Machine and anchored scaffolds of the current genome assembly to 19 linkage groups of the Japanese eel. Furthermore, we compared the Japanese eel genome with genomes of model fishes to infer the history of genome evolution after the teleost-specific genome duplication. We generated the ddRAD-based linkage map of the Japanese eel, where the maps for female and male spanned 1748.8 cM and 1294.5 cM, respectively, and were arranged into 19 linkage groups. A total of 2,672 SNP markers and 115 Simple Sequence Repeat markers provide anchor points to 1,252 scaffolds covering 151 Mb (13%) of the current genome assembly of the Japanese eel. Comparisons among the Japanese eel, medaka, zebrafish and spotted gar genomes showed highly conserved synteny among teleosts and revealed part of the eight major chromosomal rearrangement events that occurred soon after the teleost-specific genome duplication. The ddRAD-seq approach combined with the Ion Torrent Personal Genome Machine sequencing allowed us to conduct efficient and flexible SNP genotyping. The integration of the genetic map and the assembled sequence provides a valuable resource for fine mapping and positional cloning of quantitative trait loci associated with economically important traits and for investigating comparative genomics of the Japanese eel.
Genome-wide distribution of genetic diversity and linkage disequilibrium in a mass-selected population of maritime pine

PubMed Central

2014-01-01

Background The accessibility of high-throughput genotyping technologies has contributed greatly to the development of genomic resources in non-model organisms. High-density genotyping arrays have only recently been developed for some economically important species such as conifers. The potential for using genomic technologies in association mapping and breeding depends largely on the genome wide patterns of diversity and linkage disequilibrium in current breeding populations. This study aims to deepen our knowledge regarding these issues in maritime pine, the first species used for reforestation in south western Europe. Results Using a new map merging algorithm, we first established a 1,712 cM composite linkage map (comprising 1,838 SNP markers in 12 linkage groups) by bringing together three already available genetic maps. Using rigorous statistical testing based on kernel density estimation and resampling we identified cold and hot spots of recombination. In parallel, 186 unrelated trees of a mass-selected population were genotyped using a 12k-SNP array. A total of 2,600 informative SNPs allowed to describe historical recombination, genetic diversity and genetic structure of this recently domesticated breeding pool that forms the basis of much of the current and future breeding of this species. We observe very low levels of population genetic structure and find no evidence that artificial selection has caused a reduction in genetic diversity. By combining these two pieces of information, we provided the map position of 1,671 SNPs corresponding to 1,192 different loci. This made it possible to analyze the spatial pattern of genetic diversity (H e ) and long distance linkage disequilibrium (LD) along the chromosomes. We found no particular pattern in the empirical variogram of H e across the 12 linkage groups and, as expected for an outcrossing species with large effective population size, we observed an almost complete lack of long distance LD. Conclusions These results are a stepping stone for the development of strategies for studies in population genomics, association mapping and genomic prediction in this economical and ecologically important forest tree species. PMID:24581176
Molecular mapping of QTLs for plant type and earliness traits in pigeonpea (Cajanus cajan L. Millsp.).

PubMed

Kumawat, Giriraj; Raje, Ranjeet S; Bhutani, Shefali; Pal, Jitendra K; Mithra, Amitha S V C R; Gaikwad, Kishor; Sharma, Tilak R; Singh, Nagendra K

2012-10-08

Pigeonpea is an important grain legume of the semi-arid tropics and sub-tropical regions where it plays a crucial role in the food and nutritional security of the people. The average productivity of pigeonpea has remained very low and stagnant for over five decades due to lack of genomic information and intensive breeding efforts. Previous SSR-based linkage maps of pigeonpea used inter-specific crosses due to low inter-varietal polymorphism. Here our aim was to construct a high density intra-specific linkage map using genic-SNP markers for mapping of major quantitative trait loci (QTLs) for key agronomic traits, including plant height, number of primary and secondary branches, number of pods, days to flowering and days to maturity in pigeonpea. A population of 186 F2:3 lines derived from an intra-specific cross between inbred lines 'Pusa Dwarf' and 'HDM04-1' was used to construct a dense molecular linkage map of 296 genic SNP and SSR markers covering a total adjusted map length of 1520.22 cM for the 11 chromosomes of the pigeonpea genome. This is the first dense intra-specific linkage map of pigeonpea with the highest genome length coverage. Phenotypic data from the F2:3 families were used to identify thirteen QTLs for the six agronomic traits. The proportion of phenotypic variance explained by the individual QTLs ranged from 3.18% to 51.4%. Ten of these QTLs were clustered in just two genomic regions, indicating pleiotropic effects or close genetic linkage. In addition to the main effects, significant epistatic interaction effects were detected between the QTLs for number of pods per plant. A large amount of information on transcript sequences, SSR markers and draft genome sequence is now available for pigeonpea. However, there is need to develop high density linkage maps and identify genes/QTLs for important agronomic traits for practical breeding applications. This is the first report on identification of QTLs for plant type and maturity traits in pigeonpea. The QTLs identified in this study provide a strong foundation for further validation and fine mapping for utilization in the pigeonpea improvement.
Linear reduction methods for tag SNP selection.

PubMed

He, Jingwu; Zelikovsky, Alex

2004-01-01

It is widely hoped that constructing a complete human haplotype map will help to associate complex diseases with certain SNP's. Unfortunately, the number of SNP's is huge and it is very costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNP's that should be sequenced to considerably small number of informative representatives, so called tag SNP's. In this paper, we propose a new linear algebra based method for selecting and using tag SNP's. Our method is purely combinatorial and can be combined with linkage disequilibrium (LD) and block based methods. We measure the quality of our tag SNP selection algorithm by comparing actual SNP's with SNP's linearly predicted from linearly chosen tag SNP's. We obtain an extremely good compression and prediction rates. For example, for long haplotypes (>25000 SNP's), knowing only 0.4% of all SNP's we predict the entire unknown haplotype with 2% accuracy while the prediction method is based on a 10% sample of the population.
The recombination landscape around forensic STRs: Accurate measurement of genetic distances between syntenic STR pairs using HapMap high density SNP data.

PubMed

Phillips, C; Ballard, D; Gill, P; Court, D Syndercombe; Carracedo, A; Lareu, M V

2012-05-01

Family studies can be used to measure the genetic distance between same-chromosome (syntenic) STRs in order to detect physical linkage or linkage disequilibrium. However, family studies are expensive and time consuming, in many cases uninformative, and lack a reliable means to infer the phase of the diplotypes obtained. HapMap provides a more comprehensive and fine-scale estimation of recombination rates using high density multi-point SNP data (average inter-SNP distance: 900 nucleotides). Data at this fine scale detects sub-kilobase genetic distances across the whole recombining human genome. We have used the most recent HapMap SNP data release 22 to measure and compare genetic distances, and by inference fine-scale recombination rates, between 29 syntenic STR pairs identified from 39 validated STRs currently available for forensic use. The 39 STRs comprise 23 core loci: SE33, Penta D & E, 13 CODIS and 7 non-CODIS European Standard Set STRs, plus supplementary STRs in the recently released Promega CS-7™ and Qiagen Investigator HDplex™ kits. Also included were D9S1120, a marker we developed for forensic use unique to chromosome 9, and the novel D6S1043 component STR of SinoFiler™ (Applied Biosystems). The data collated provides reliable estimates of recombination rates between each STR pair, that can then be placed into haplotype frequency calculators for short pedigrees with multiple meiotic inputs and which just requires the addition of allele frequencies. This allows all current STR sets or their combinations to be used in supplemented paternity analyses without the need for further adjustment for physical linkage. The detailed analysis of recombination rates made for autosomal forensic STRs was extended to the more than 50 X chromosome STRs established or in development for complex kinship analyses. Copyright Â© 2011 Elsevier Ireland Ltd. All rights reserved.
High-Resolution Detection of Identity by Descent in Unrelated Individuals

PubMed Central

Browning, Sharon R.; Browning, Brian L.

2010-01-01

Detection of recent identity by descent (IBD) in population samples is important for population-based linkage mapping and for highly accurate genotype imputation and haplotype-phase inference. We present a method for detection of recent IBD in population samples. Our method accounts for linkage disequilibrium between SNPs to enable full use of high-density SNP data. We find that our method can detect segments of a length of 2 cM with moderate power and negligible false discovery rate in Illumina 550K data in Northwestern Europeans. We compare our method with GERMLINE and PLINK, and we show that our method has a level of resolution that is significantly better than these existing methods, thus extending the usefulness of recent IBD in analysis of high-density SNP data. We survey four genomic regions in a sample of UK individuals of European descent and find that on average, at a given location, our method detects IBD in 2.7 per 10,000 pairs of individuals in Illumina 550K data. We also present methodology and results for detection of homozygosity by descent (HBD) and survey the whole genome in a sample of 1373 UK individuals of European descent. We detect HBD in 4.7 individuals per 10,000 on average at a given location. Our methodology is implemented in the freely available BEAGLE software package. PMID:20303063
A Larger Chocolate Chip-Development of a 15K Theobroma cacao L. SNP Array to Create High-Density Linkage Maps.

PubMed

Livingstone, Donald; Stack, Conrad; Mustiga, Guiliana M; Rodezno, Dayana C; Suarez, Carmen; Amores, Freddy; Feltus, Frank A; Mockaitis, Keithanne; Cornejo, Omar E; Motamayor, Juan C

2017-01-01

Cacao ( Theobroma cacao L.) is an important cash crop in tropical regions around the world and has a rich agronomic history in South America. As a key component in the cosmetic and confectionary industries, millions of people worldwide use products made from cacao, ranging from shampoo to chocolate. An Illumina Infinity II array was created using 13,530 SNPs identified within a small diversity panel of cacao. Of these SNPs, 12,643 derive from variation within annotated cacao genes. The genotypes of 3,072 trees were obtained, including two mapping populations from Ecuador. High-density linkage maps for these two populations were generated and compared to the cacao genome assembly. Phenotypic data from these populations were combined with the linkage maps to identify the QTLs for yield and disease resistance.
High-density genetic map construction and comparative genome analysis in asparagus bean.

PubMed

Huang, Haitao; Tan, Huaqiang; Xu, Dongmei; Tang, Yi; Niu, Yisong; Lai, Yunsong; Tie, Manman; Li, Huanxiu

2018-03-19

Genetic maps are a prerequisite for quantitative trait locus (QTL) analysis, marker-assisted selection (MAS), fine gene mapping, and assembly of genome sequences. So far, several asparagus bean linkage maps have been established using various kinds of molecular markers. However, these maps were all constructed by gel- or array-based markers. No maps based on sequencing method have been reported. In this study, an NGS-based strategy, SLAF-seq, was applied to create a high-density genetic map for asparagus bean. Through SLAF library construction and Illumina sequencing of two parents and 100 F2 individuals, a total of 55,437 polymorphic SLAF markers were developed and mined for SNP markers. The map consisted of 5,225 SNP markers in 11 LGs, spanning a total distance of 1,850.81 cM, with an average distance between markers of 0.35 cM. Comparative genome analysis with four other legume species, soybean, common bean, mung bean and adzuki bean showed that asparagus bean is genetically more related to adzuki bean. The results will provide a foundation for future genomic research, such as QTL fine mapping, comparative mapping in pulses, and offer support for assembling asparagus bean genome sequence.

Analysis of population structure and genetic history of cattle breeds based on high-density SNP data

USDA-ARS?s Scientific Manuscript database

Advances in single nucleotide polymorphism (SNP) genotyping microarrays have facilitated a new understanding of population structure and evolutionary history for several species. Most existing studies in livestock were based on low density SNP arrays. The first wave of low density SNP studies on cat...
Development and validation of a 20K single nucleotide polymorphism (SNP) whole genome genotyping array for apple (Malus × domestica Borkh).

PubMed

Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

2014-01-01

High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.
Development and Validation of a 20K Single Nucleotide Polymorphism (SNP) Whole Genome Genotyping Array for Apple (Malus × domestica Borkh)

PubMed Central

Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

2014-01-01

High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088
High-Density SNP Map Construction and QTL Identification for the Apetalous Character in Brassica napus L.

PubMed Central

Wang, Xiaodong; Yu, Kunjiang; Li, Hongge; Peng, Qi; Chen, Feng; Zhang, Wei; Chen, Song; Hu, Maolong; Zhang, Jiefu

2015-01-01

The apetalous genotype is a morphological ideotype for increasing seed yield and should be of considerable agricultural use; however, only a few studies have focused on the genetic control of this trait in Brassica napus. In the present study, a recombinant inbred line, the AH population, containing 189 individuals was derived from a cross between an apetalous line ‘APL01’ and a normally petalled variety ‘Holly’. The Brassica 60 K Infinium BeadChip Array harboring 52,157 single nucleotide polymorphism (SNP) markers was used to genotype the AH individuals. A high-density genetic linkage map was constructed based on 2,755 bins involving 11,458 SNPs and 57 simple sequence repeats, and was used to identify loci associated with petalous degree (PDgr). The linkage map covered 2,027.53 cM, with an average marker interval of 0.72 cM. The AH map had good collinearity with the B. napus reference genome, indicating its high quality and accuracy. After phenotypic analyses across five different experiments, a total of 19 identified quantitative trait loci (QTLs) distributed across chromosomes A3, A5, A6, A9 and C8 were obtained, and these QTLs were further integrated into nine consensus QTLs by a meta-analysis. Interestingly, the major QTL qPD.C8-2 was consistently detected in all five experiments, and qPD.A9-2 and qPD.C8-3 were stably expressed in four experiments. Comparative mapping between the AH map and the B. napus reference genome suggested that there were 328 genes underlying the confidence intervals of the three steady QTLs. Based on the Gene Ontology assignments of 52 genes to the regulation of floral development in published studies, 146 genes were considered as potential candidate genes for PDgr. The current study carried out a QTL analysis for PDgr using a high-density SNP map in B. napus, providing novel targets for improving seed yield. These results advanced our understanding of the genetic control of PDgr regulation in B. napus. PMID:26779193
High-Density SNP Map Construction and QTL Identification for the Apetalous Character in Brassica napus L.

PubMed

Wang, Xiaodong; Yu, Kunjiang; Li, Hongge; Peng, Qi; Chen, Feng; Zhang, Wei; Chen, Song; Hu, Maolong; Zhang, Jiefu

2015-01-01

The apetalous genotype is a morphological ideotype for increasing seed yield and should be of considerable agricultural use; however, only a few studies have focused on the genetic control of this trait in Brassica napus. In the present study, a recombinant inbred line, the AH population, containing 189 individuals was derived from a cross between an apetalous line 'APL01' and a normally petalled variety 'Holly'. The Brassica 60 K Infinium BeadChip Array harboring 52,157 single nucleotide polymorphism (SNP) markers was used to genotype the AH individuals. A high-density genetic linkage map was constructed based on 2,755 bins involving 11,458 SNPs and 57 simple sequence repeats, and was used to identify loci associated with petalous degree (PDgr). The linkage map covered 2,027.53 cM, with an average marker interval of 0.72 cM. The AH map had good collinearity with the B. napus reference genome, indicating its high quality and accuracy. After phenotypic analyses across five different experiments, a total of 19 identified quantitative trait loci (QTLs) distributed across chromosomes A3, A5, A6, A9 and C8 were obtained, and these QTLs were further integrated into nine consensus QTLs by a meta-analysis. Interestingly, the major QTL qPD.C8-2 was consistently detected in all five experiments, and qPD.A9-2 and qPD.C8-3 were stably expressed in four experiments. Comparative mapping between the AH map and the B. napus reference genome suggested that there were 328 genes underlying the confidence intervals of the three steady QTLs. Based on the Gene Ontology assignments of 52 genes to the regulation of floral development in published studies, 146 genes were considered as potential candidate genes for PDgr. The current study carried out a QTL analysis for PDgr using a high-density SNP map in B. napus, providing novel targets for improving seed yield. These results advanced our understanding of the genetic control of PDgr regulation in B. napus.
Is a gene important for bone resorption a candidate for obesity? An association and linkage study on the RANK (receptor activator of nuclear factor-kappaB) gene in a large Caucasian sample.

PubMed

Zhao, Lan-Juan; Guo, Yan-Fang; Xiong, Dong-Hai; Xiao, Peng; Recker, Robert R; Deng, Hong-Wen

2006-11-01

In light of findings that osteoporosis and obesity may share some common genetic determination and previous reports that RANK (receptor activator of nuclear factor-kappaB) is expressed in skeletal muscles which are important for energy metabolism, we hypothesize that RANK, a gene essential for osteoclastogenesis, is also important for obesity. In order to test the hypothesis with solid data we first performed a linkage analysis around the RANK gene in 4,102 Caucasian subjects from 434 pedigrees, then we genotyped 19 SNPs in or around the RANK gene. A family-based association test (FBAT) was performed with both a quantitative measure of obesity [fat mass, lean mass, body mass index (BMI), and percentage fat mass (PFM)] and a dichotomously defined obesity phenotype-OB (OB if BMI > or = 30 kg/m(2)). In the linkage analysis, an empirical P = 0.004 was achieved at the location of the RANK gene for BMI. Family-based association analysis revealed significant associations of eight SNPs with at least one obesity-related phenotype (P < 0.05). Evidence of association was obtained at SNP10 (P = 0.002) and SNP16 (P = 0.001) with OB; SNP1 with fat mass (P = 0.003); SNP1 (P = 0.003) and SNP7 (P = 0.003) with lean mass; SNP1 (P = 0.002) and SNP7 (P = 0.002) with BMI; SNP1 (P = 0.003), SNP4 (P = 0.007), and SNP7 (P = 0.002) with PFM. In order to deal with the complex multiple testing issues, we performed FBAT multi-marker test (FBAT-MM) to evaluate the association between all the 18 SNPs and each obesity phenotype. The P value is 0.126 for OB, 0.033 for fat mass, 0.021 for lean mass, 0.016 for BMI, and 0.006 for PFM. The haplotype data analyses provide further association evidence. In conclusion, for the first time, our results suggest that RANK is a novel candidate for determination of obesity.
A SNP genetic linkage map based on the ‘Hamilton’ by ‘Spencer’ recombinant inbred line (RIL) population identified QTL for seed Isoflavone contents in soybean

USDA-ARS?s Scientific Manuscript database

Soybean is one of the most important crops worldwide for its protein, oil as well as the health beneficial phytoestrogens or isoflavone. This study reports a relatively dense SNP-Based genetic map based on ‘Hamilton’ by ‘Spencer’ recombinant inbred line (RIL) population and quantitative t...
Association of single-nucleotide polymorphisms of the tau gene with late-onset Parkinson disease.

PubMed

Martin, E R; Scott, W K; Nance, M A; Watts, R L; Hubble, J P; Koller, W C; Lyons, K; Pahwa, R; Stern, M B; Colcher, A; Hiner, B C; Jankovic, J; Ondo, W G; Allen, F H; Goetz, C G; Small, G W; Masterman, D; Mastaglia, F; Laing, N G; Stajich, J M; Ribble, R C; Booze, M W; Rogala, A; Hauser, M A; Zhang, F; Gibson, R A; Middleton, L T; Roses, A D; Haines, J L; Scott, B L; Pericak-Vance, M A; Vance, J M

2001-11-14

The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. To investigate whether the tau gene is involved in idiopathic PD. Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Family-based tests of association, calculated using asymptotic distributions. Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P =.03; SNP 9i, P =.04; and SNP 11, P =.04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P =.11, and SNP 9iii, P =.87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P =.009) and a negative association with another haplotype (P =.007). Substantial linkage disequilibrium (P<.001) was detected between 4 of the 5 SNPs (SNPs 3, 9i, 9ii, and 11). This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD.
A Larger Chocolate Chip—Development of a 15K Theobroma cacao L. SNP Array to Create High-Density Linkage Maps

PubMed Central

Livingstone, Donald; Stack, Conrad; Mustiga, Guiliana M.; Rodezno, Dayana C.; Suarez, Carmen; Amores, Freddy; Feltus, Frank A.; Mockaitis, Keithanne; Cornejo, Omar E.; Motamayor, Juan C.

2017-01-01

Cacao (Theobroma cacao L.) is an important cash crop in tropical regions around the world and has a rich agronomic history in South America. As a key component in the cosmetic and confectionary industries, millions of people worldwide use products made from cacao, ranging from shampoo to chocolate. An Illumina Infinity II array was created using 13,530 SNPs identified within a small diversity panel of cacao. Of these SNPs, 12,643 derive from variation within annotated cacao genes. The genotypes of 3,072 trees were obtained, including two mapping populations from Ecuador. High-density linkage maps for these two populations were generated and compared to the cacao genome assembly. Phenotypic data from these populations were combined with the linkage maps to identify the QTLs for yield and disease resistance. PMID:29259608
Genetic Linkage Mapping of Economically Important Traits in Cultivated Tetraploid Potato (Solanum tuberosum L.).

PubMed

Massa, Alicia N; Manrique-Carpintero, Norma C; Coombs, Joseph J; Zarka, Daniel G; Boone, Anne E; Kirk, William W; Hackett, Christine A; Bryan, Glenn J; Douches, David S

2015-09-14

The objective of this study was to construct a single nucleotide polymorphism (SNP)-based genetic map at the cultivated tetraploid level to locate quantitative trait loci (QTL) contributing to economically important traits in potato (Solanum tuberosum L.). The 156 F1 progeny and parents of a cross (MSL603) between "Jacqueline Lee" and "MSG227-2" were genotyped using the Infinium 8303 Potato Array. Furthermore, the progeny and parents were evaluated for foliar late blight reaction to isolates of the US-8 genotype of Phytophthora infestans (Mont.) de Bary and vine maturity. Linkage analyses and QTL mapping were performed using a novel approach that incorporates allele dosage information. The resulting genetic maps contained 1972 SNP markers with an average density of 1.36 marker per cM. QTL mapping identified the major source of late blight resistance in "Jacqueline Lee." The best SNP marker mapped ~0.54 Mb from a resistance hotspot on the long arm of chromosome 9. For vine maturity, the major-effect QTL was located on chromosome 5 with allelic effects from both parents. A candidate SNP marker for this trait mapped ~0.25 Mb from the StCDF1 gene, which is a candidate gene for the maturity trait. The identification of markers for P. infestans resistance will enable the introgression of multiple sources of resistance through marker-assisted selection. Moreover, the discovery of a QTL for late blight resistance not linked to the QTL for vine maturity provides the opportunity to use marker-assisted selection for resistance independent of the selection for vine maturity classifications. Copyright © 2015 Massa et al.
Genetic Linkage Mapping of Economically Important Traits in Cultivated Tetraploid Potato (Solanum tuberosum L.)

PubMed Central

Massa, Alicia N.; Manrique-Carpintero, Norma C.; Coombs, Joseph J.; Zarka, Daniel G.; Boone, Anne E.; Kirk, William W.; Hackett, Christine A.; Bryan, Glenn J.; Douches, David S.

2015-01-01

The objective of this study was to construct a single nucleotide polymorphism (SNP)-based genetic map at the cultivated tetraploid level to locate quantitative trait loci (QTL) contributing to economically important traits in potato (Solanum tuberosum L.). The 156 F1 progeny and parents of a cross (MSL603) between “Jacqueline Lee” and “MSG227-2” were genotyped using the Infinium 8303 Potato Array. Furthermore, the progeny and parents were evaluated for foliar late blight reaction to isolates of the US-8 genotype of Phytophthora infestans (Mont.) de Bary and vine maturity. Linkage analyses and QTL mapping were performed using a novel approach that incorporates allele dosage information. The resulting genetic maps contained 1972 SNP markers with an average density of 1.36 marker per cM. QTL mapping identified the major source of late blight resistance in “Jacqueline Lee.” The best SNP marker mapped ∼0.54 Mb from a resistance hotspot on the long arm of chromosome 9. For vine maturity, the major-effect QTL was located on chromosome 5 with allelic effects from both parents. A candidate SNP marker for this trait mapped ∼0.25 Mb from the StCDF1 gene, which is a candidate gene for the maturity trait. The identification of markers for P. infestans resistance will enable the introgression of multiple sources of resistance through marker-assisted selection. Moreover, the discovery of a QTL for late blight resistance not linked to the QTL for vine maturity provides the opportunity to use marker-assisted selection for resistance independent of the selection for vine maturity classifications. PMID:26374597
Efficient selection of tagging single-nucleotide polymorphisms in multiple populations.

PubMed

Howie, Bryan N; Carlson, Christopher S; Rieder, Mark J; Nickerson, Deborah A

2006-08-01

Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
Molecular mapping of QTLs for plant type and earliness traits in pigeonpea (Cajanus cajan L. Millsp.)

PubMed Central

2012-01-01

Background Pigeonpea is an important grain legume of the semi-arid tropics and sub-tropical regions where it plays a crucial role in the food and nutritional security of the people. The average productivity of pigeonpea has remained very low and stagnant for over five decades due to lack of genomic information and intensive breeding efforts. Previous SSR-based linkage maps of pigeonpea used inter-specific crosses due to low inter-varietal polymorphism. Here our aim was to construct a high density intra-specific linkage map using genic-SNP markers for mapping of major quantitative trait loci (QTLs) for key agronomic traits, including plant height, number of primary and secondary branches, number of pods, days to flowering and days to maturity in pigeonpea. Results A population of 186 F2:3 lines derived from an intra-specific cross between inbred lines ‘Pusa Dwarf’ and ‘HDM04-1’ was used to construct a dense molecular linkage map of 296 genic SNP and SSR markers covering a total adjusted map length of 1520.22 cM for the 11 chromosomes of the pigeonpea genome. This is the first dense intra-specific linkage map of pigeonpea with the highest genome length coverage. Phenotypic data from the F2:3 families were used to identify thirteen QTLs for the six agronomic traits. The proportion of phenotypic variance explained by the individual QTLs ranged from 3.18% to 51.4%. Ten of these QTLs were clustered in just two genomic regions, indicating pleiotropic effects or close genetic linkage. In addition to the main effects, significant epistatic interaction effects were detected between the QTLs for number of pods per plant. Conclusions A large amount of information on transcript sequences, SSR markers and draft genome sequence is now available for pigeonpea. However, there is need to develop high density linkage maps and identify genes/QTLs for important agronomic traits for practical breeding applications. This is the first report on identification of QTLs for plant type and maturity traits in pigeonpea. The QTLs identified in this study provide a strong foundation for further validation and fine mapping for utilization in the pigeonpea improvement. PMID:23043321
Dissection of genetic factors underlying wheat kernel shape and size in an elite x nonadapted cross using a high density SNP linkage map

USDA-ARS?s Scientific Manuscript database

Wheat kernel shape and size has been under selection since early domestication. Kernel morphology is a major consideration in wheat breeding, as it impacts grain yield and quality. A population of 160 recombinant inbred lines (RIL), developed using an elite (ND 705) and a nonadapted genotype (PI 414...
Effect of Co-segregating Markers on High-Density Genetic Maps and Prediction of Map Expansion Using Machine Learning Algorithms.

PubMed

N'Diaye, Amidou; Haile, Jemanesh K; Fowler, D Brian; Ammar, Karim; Pozniak, Curtis J

2017-01-01

Advances in sequencing and genotyping methods have enable cost-effective production of high throughput single nucleotide polymorphism (SNP) markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called 'large p, small n' problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that excess of co-segregating markers can lead to map expansion, but has little effect on markers order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers. With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase making map expansion unavoidable. Therefore, we suggest developers improve linkage mapping algorithms for efficient analysis of high-throughput data. This study outlines a practical strategy to estimate the IF due to the proportion of co-segregating markers and outlines a method to scale the length of the map accordingly.
Effect of Co-segregating Markers on High-Density Genetic Maps and Prediction of Map Expansion Using Machine Learning Algorithms

PubMed Central

N’Diaye, Amidou; Haile, Jemanesh K.; Fowler, D. Brian; Ammar, Karim; Pozniak, Curtis J.

2017-01-01

Advances in sequencing and genotyping methods have enable cost-effective production of high throughput single nucleotide polymorphism (SNP) markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called ‘large p, small n’ problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that excess of co-segregating markers can lead to map expansion, but has little effect on markers order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers. With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase making map expansion unavoidable. Therefore, we suggest developers improve linkage mapping algorithms for efficient analysis of high-throughput data. This study outlines a practical strategy to estimate the IF due to the proportion of co-segregating markers and outlines a method to scale the length of the map accordingly. PMID:28878789
An integrated genetic linkage map of watermelon and genetic diversity based on single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers

USDA-ARS?s Scientific Manuscript database

Watermelon (Citrullus lanatus var. lanatus) is an important vegetable fruit throughout the world. A high number of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers should provide large coverage of the watermelon genome and high phylogenetic resolution of germplasm acces...
Extent of linkage disequilibrium, consistency of gametic phase, and imputation accuracy within and across Canadian dairy breeds.

PubMed

Larmer, S G; Sargolzaei, M; Schenkel, F S

2014-05-01

Genomic selection requires a large reference population to accurately estimate single nucleotide polymorphism (SNP) effects. In some Canadian dairy breeds, the available reference populations are not large enough for accurate estimation of SNP effects for traits of interest. If marker phase is highly consistent across multiple breeds, it is theoretically possible to increase the accuracy of genomic prediction for one or all breeds by pooling several breeds into a common reference population. This study investigated the extent of linkage disequilibrium (LD) in 5 major dairy breeds using a 50,000 (50K) SNP panel and 3 of the same breeds using the 777,000 (777K) SNP panel. Correlation of pair-wise SNP phase was also investigated on both panels. The level of LD was measured using the squared correlation of alleles at 2 loci (r(2)), and the consistency of SNP gametic phases was correlated using the signed square root of these values. Because of the high cost of the 777K panel, the accuracy of imputation from lower density marker panels [6,000 (6K) or 50K] was examined both within breed and using a multi-breed reference population in Holstein, Ayrshire, and Guernsey. Imputation was carried out using FImpute V2.2 and Beagle 3.3.2 software. Imputation accuracies were then calculated as both the proportion of correct SNP filled in (concordance rate) and allelic R(2). Computation time was also explored to determine the efficiency of the different algorithms for imputation. Analysis showed that LD values >0.2 were found in all breeds at distances at or shorter than the average adjacent pair-wise distance between SNP on the 50K panel. Correlations of r-values, however, did not reach high levels (<0.9) at these distances. High correlation values of SNP phase between breeds were observed (>0.94) when the average pair-wise distances using the 777K SNP panel were examined. High concordance rate (0.968-0.995) and allelic R(2) (0.946-0.991) were found for all breeds when imputation was carried out with FImpute from 50K to 777K. Imputation accuracy for Guernsey and Ayrshire was slightly lower when using the imputation method in Beagle. Computing time was significantly greater when using Beagle software, with all comparable procedures being 9 to 13 times less efficient, in terms of time, compared with FImpute. These findings suggest that use of a multi-breed reference population might increase prediction accuracy using the 777K SNP panel and that 777K genotypes can be efficiently and effectively imputed using the lower density 50K SNP panel. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Elucidation of the ‘Honeycrisp’ pedigree through haplotype analysis with a multi-family integrated SNP linkage map and a large apple (Malus×domestica) pedigree-connected SNP data set

PubMed Central

Howard, Nicholas P; van de Weg, Eric; Bedford, David S; Peace, Cameron P; Vanderzande, Stijn; Clark, Matthew D; Teh, Soon Li; Cai, Lichun; Luby, James J

2017-01-01

The apple (Malus×domestica) cultivar Honeycrisp has become important economically and as a breeding parent. An earlier study with SSR markers indicated the original recorded pedigree of ‘Honeycrisp’ was incorrect and ‘Keepsake’ was identified as one putative parent, the other being unknown. The objective of this study was to verify ‘Keepsake’ as a parent and identify and genetically describe the unknown parent and its grandparents. A multi-family based dense and high-quality integrated SNP map was created using the apple 8 K Illumina Infinium SNP array. This map was used alongside a large pedigree-connected data set from the RosBREED project to build extended SNP haplotypes and to identify pedigree relationships. ‘Keepsake’ was verified as one parent of ‘Honeycrisp’ and ‘Duchess of Oldenburg’ and ‘Golden Delicious’ were identified as grandparents through the unknown parent. Following this finding, siblings of ‘Honeycrisp’ were identified using the SNP data. Breeding records from several of these siblings suggested that the previously unreported parent is a University of Minnesota selection, MN1627. This selection is no longer available, but now is genetically described through imputed SNP haplotypes. We also present the mosaic grandparental composition of ‘Honeycrisp’ for each of its 17 chromosome pairs. This new pedigree and genetic information will be useful in future pedigree-based genetic studies to connect ‘Honeycrisp’ with other cultivars used widely in apple breeding programs. The created SNP linkage map will benefit future research using the data from the Illumina apple 8 and 20 K and Affymetrix 480 K SNP arrays. PMID:28243452
Development of a Medium Density Combined-Species SNP Array for Pacific and European Oysters (Crassostrea gigas and Ostrea edulis).

PubMed

Gutierrez, Alejandro P; Turner, Frances; Gharbi, Karim; Talbot, Richard; Lowe, Natalie R; Peñaloza, Carolina; McCullough, Mark; Prodöhl, Paulo A; Bean, Tim P; Houston, Ross D

2017-07-05

SNP arrays are enabling tools for high-resolution studies of the genetic basis of complex traits in farmed and wild animals. Oysters are of critical importance in many regions from both an ecological and economic perspective, and oyster aquaculture forms a key component of global food security. The aim of our study was to design a combined-species, medium density SNP array for Pacific oyster ( Crassostrea gigas ) and European flat oyster ( Ostrea edulis ), and to test the performance of this array on farmed and wild populations from multiple locations, with a focus on European populations. SNP discovery was carried out by whole-genome sequencing (WGS) of pooled genomic DNA samples from eight C. gigas populations, and restriction site-associated DNA sequencing (RAD-Seq) of 11 geographically diverse O. edulis populations. Nearly 12 million candidate SNPs were discovered and filtered based on several criteria, including preference for SNPs segregating in multiple populations and SNPs with monomorphic flanking regions. An Affymetrix Axiom Custom Array was created and tested on a diverse set of samples ( n = 219) showing ∼27 K high quality SNPs for C. gigas and ∼11 K high quality SNPs for O. edulis segregating in these populations. A high proportion of SNPs were segregating in each of the populations, and the array was used to detect population structure and levels of linkage disequilibrium (LD). Further testing of the array on three C. gigas nuclear families ( n = 165) revealed that the array can be used to clearly distinguish between both families based on identity-by-state (IBS) clustering parental assignment software. This medium density, combined-species array will be publicly available through Affymetrix, and will be applied for genome-wide association and evolutionary genetic studies, and for genomic selection in oyster breeding programs. Copyright © 2017 Gutierrez et al.

A gene-derived SNP-based high resolution linkage map of carrot including the location of QTL conditioning root and leaf anthocyanin pigmentation

USDA-ARS?s Scientific Manuscript database

Background: Purple carrots accumulate large quantities of anthocyanins in their roots and leaves. These flavonoid pigments possess antioxidant activity and are implicated in providing health benefits. The lack of informative and saturated linkage maps associated with well characterized populations s...
Quantitative trait loci underlying resistance to sudden death syndrome (SDS) in MD96-5722 by 'Spencer' recombinant inbred line population of soybean.

PubMed

Anderson, J; Akond, M; Kassem, M A; Meksem, K; Kantartzi, S K

2015-04-01

The best way to protect yield loss of soybean [Glycine max (L.) Merr.] due to sudden death syndrome (SDS), caused by Fusarium virguliforme (Aoki, O'Donnel, Homma & Lattanzi), is the development and use of resistant lines. Mapping quantitative trait loci (QTL) linked to SDS help developing resistant soybean germplasm through molecular marker-assisted selection strategy. QTL for SDS presented herein are from a high-density SNP-based genetic linkage map of MD 96-5722 (a.k.a 'Monocacy') by 'Spencer' recombinant inbred line using SoySNP6K Illumina Infinium BeadChip genotyping array. Ninety-four F 5:7 lines were evaluated for 2 years (2010 and 2011) at two locations (Carbondale and Valmeyer) in southern Illinois, USA to identify QTL controlling SDS resistance using disease index (DX). Composite interval mapping identified 19 SDS controlling QTL which were mapped on 11 separate linkage group (LG) or chromosomes (Chr) out of 20 LG or Chr of soybean genome. Many of these significant QTL identified in one environment/year were confirmed in another year or environment, which suggests a common genetic effects and modes of the pathogen. These new QTL are useful sources for SDS resistance studies in soybean breeding, complementing previously reported loci.
QTL meta-analysis of root traits in Brassica napus under contrasting phosphorus supply in two growth systems

PubMed Central

Zhang, Ying; Thomas, Catherine L.; Xiang, Jinxia; Long, Yan; Wang, Xiaohua; Zou, Jun; Luo, Ziliang; Ding, Guangda; Cai, Hongmei; Graham, Neil S.; Hammond, John P.; King, Graham J.; White, Philip J.; Xu, Fangsen; Broadley, Martin R.; Shi, Lei; Meng, Jinling

2016-01-01

A high-density SNP-based genetic linkage map was constructed and integrated with a previous map in the Tapidor x Ningyou7 (TNDH) Brassica napus population, giving a new map with a total of 2041 molecular markers and an average marker density which increased from 0.39 to 0.97 (0.82 SNP bin) per cM. Root and shoot traits were screened under low and ‘normal’ phosphate (Pi) supply using a ‘pouch and wick’ system, and had been screened previously in an agar based system. The P-efficient parent Ningyou7 had a shorter primary root length (PRL), greater lateral root density (LRD) and a greater shoot biomass than the P-inefficient parent Tapidor under both treatments and growth systems. Quantitative trait loci (QTL) analysis identified a total of 131 QTL, and QTL meta-analysis found four integrated QTL across the growth systems. Integration reduced the confidence interval by ~41%. QTL for root and shoot biomass were co-located on chromosome A3 and for lateral root emergence were co-located on chromosomes A4/C4 and C8/C9. There was a major QTL for LRD on chromosome C9 explaining ~18% of the phenotypic variation. QTL underlying an increased LRD may be a useful breeding target for P uptake efficiency in Brassica. PMID:27624881
Mapping a New Spontaneous Preterm Birth Susceptibility Gene, IGF1R, Using Linkage, Haplotype Sharing, and Association Analysis

PubMed Central

Luukkonen, Aino; Teramo, Kari; Puttonen, Hilkka; Ojaniemi, Marja; Varilo, Teppo; Chaudhari, Bimal P.; Plunkett, Jevon; Murray, Jeffrey C.; McCarroll, Steven A.; Muglia, Louis J.; Palotie, Aarno; Hallman, Mikko

2011-01-01

Preterm birth is the major cause of neonatal death and serious morbidity. Most preterm births are due to spontaneous onset of labor without a known cause or effective prevention. Both maternal and fetal genomes influence the predisposition to spontaneous preterm birth (SPTB), but the susceptibility loci remain to be defined. We utilized a combination of unique population structures, family-based linkage analysis, and subsequent case-control association to identify a susceptibility haplotype for SPTB. Clinically well-characterized SPTB families from northern Finland, a subisolate founded by a relatively small founder population that has subsequently experienced a number of bottlenecks, were selected for the initial discovery sample. Genome-wide linkage analysis using a high-density single-nucleotide polymorphism (SNP) array in seven large northern Finnish non-consanginous families identified a locus on 15q26.3 (HLOD 4.68). This region contains the IGF1R gene, which encodes the type 1 insulin-like growth factor receptor IGF-1R. Haplotype segregation analysis revealed that a 55 kb 12-SNP core segment within the IGF1R gene was shared identical-by-state (IBS) in five families. A follow-up case-control study in an independent sample representing the more general Finnish population showed an association of a 6-SNP IGF1R haplotype with SPTB in the fetuses, providing further evidence for IGF1R as a SPTB predisposition gene (frequency in cases versus controls 0.11 versus 0.05, P = 0.001, odds ratio 2.3). This study demonstrates the identification of a predisposing, low-frequency haplotype in a multifactorial trait using a well-characterized population and a combination of family and case-control designs. Our findings support the identification of the novel susceptibility gene IGF1R for predisposition by the fetal genome to being born preterm. PMID:21304894
Association of Single-Nucleotide Polymorphisms of the Tau Gene With Late-Onset Parkinson Disease

PubMed Central

Martin, Eden R.; Scott, William K.; Nance, Martha A.; Watts, Ray L.; Hubble, Jean P.; Koller, William C.; Lyons, Kelly; Pahwa, Rajesh; Stern, Matthew B.; Colcher, Amy; Hiner, Bradley C.; Jankovic, Joseph; Ondo, William G.; Allen, Fred H.; Goetz, Christopher G.; Small, Gary W.; Masterman, Donna; Mastaglia, Frank; Laing, Nigel G.; Stajich, Jeffrey M.; Ribble, Robert C.; Booze, Michael W.; Rogala, Allison; Hauser, Michael A.; Zhang, Fengyu; Gibson, Rachel A.; Middleton, Lefkos T.; Roses, Allen D.; Haines, Jonathan L.; Scott, Burton L.; Pericak-Vance, Margaret A.; Vance, Jeffery M.

2013-01-01

Context The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. Objective To investigate whether the tau gene is involved in idiopathic PD. Design, Setting, and Participants Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Main Outcome Measure Family-based tests of association, calculated using asymptotic distributions. Results Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P = .03; SNP 9i, P = .04; and SNP 11, P = .04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P = .11, and SNP 9iii, P = .87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P = .009) and a negative association with another haplotype (P = .007). Substantial linkage disequilibrium (P<.001) was detected between 4 of the 5 SNPs (SNPs 3,9i, 9ii, and 11). Conclusions This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD. PMID:11710889
A general model for likelihood computations of genetic marker data accounting for linkage, linkage disequilibrium, and mutations.

PubMed

Kling, Daniel; Tillmar, Andreas; Egeland, Thore; Mostad, Petter

2015-09-01

Several applications necessitate an unbiased determination of relatedness, be it in linkage or association studies or in a forensic setting. An appropriate model to compute the joint probability of some genetic data for a set of persons given some hypothesis about the pedigree structure is then required. The increasing number of markers available through high-density SNP microarray typing and NGS technologies intensifies the demand, where using a large number of markers may lead to biased results due to strong dependencies between closely located loci, both within pedigrees (linkage) and in the population (allelic association or linkage disequilibrium (LD)). We present a new general model, based on a Markov chain for inheritance patterns and another Markov chain for founder allele patterns, the latter allowing us to account for LD. We also demonstrate a specific implementation for X chromosomal markers that allows for computation of likelihoods based on hypotheses of alleged relationships and genetic marker data. The algorithm can simultaneously account for linkage, LD, and mutations. We demonstrate its feasibility using simulated examples. The algorithm is implemented in the software FamLinkX, providing a user-friendly GUI for Windows systems (FamLinkX, as well as further usage instructions, is freely available at www.famlink.se ). Our software provides the necessary means to solve cases where no previous implementation exists. In addition, the software has the possibility to perform simulations in order to further study the impact of linkage and LD on computed likelihoods for an arbitrary set of markers.
Genetic diversity, linkage disequilibrium, population structure and construction of a core collection of Prunus avium L. landraces and bred cultivars.

PubMed

Campoy, José Antonio; Lerigoleur-Balsemin, Emilie; Christmann, Hélène; Beauvieux, Rémi; Girollet, Nabil; Quero-García, José; Dirlewanger, Elisabeth; Barreneche, Teresa

2016-02-24

Depiction of the genetic diversity, linkage disequilibrium (LD) and population structure is essential for the efficient organization and exploitation of genetic resources. The objectives of this study were to (i) to evaluate the genetic diversity and to detect the patterns of LD, (ii) to estimate the levels of population structure and (iii) to identify a 'core collection' suitable for association genetic studies in sweet cherry. A total of 210 genotypes including modern cultivars and landraces from 16 countries were genotyped using the RosBREED cherry 6 K SNP array v1. Two groups, mainly bred cultivars and landraces, respectively, were first detected using STRUCTURE software and confirmed by Principal Coordinate Analysis (PCoA). Further analyses identified nine subgroups using STRUCTURE and Discriminant Analysis of Principal Components (DAPC). Several sub-groups correspond to different eco-geographic regions of landraces distribution. Linkage disequilibrium was evaluated showing lower values than in peach, the reference Prunus species. A 'core collection' containing 156 accessions was selected using the maximum length sub tree method. The present study constitutes the first population genetics analysis in cultivated sweet cherry using a medium-density SNP (single nucleotide polymorphism) marker array. We provided estimations of linkage disequilibrium, genetic structure and the definition of a first INRA's Sweet Cherry core collection useful for breeding programs, germplasm management and association genetics studies.
Genome-wide linkage mapping of QTL for black point reaction in bread wheat (Triticum aestivum L.).

PubMed

Liu, Jindong; He, Zhonghu; Wu, Ling; Bai, Bin; Wen, Weie; Xie, Chaojie; Xia, Xianchun

2016-11-01

Nine QTL for black point resistance in wheat were identified using a RIL population derived from a Linmai 2/Zhong 892 cross and 90K SNP assay. Black point, discoloration of the embryo end of the grain, downgrades wheat grain quality leading to significant economic losses to the wheat industry. The availability of molecular markers will accelerate improvement of black point resistance in wheat breeding. The aims of this study were to identify quantitative trait loci (QTL) for black point resistance and tightly linked molecular markers, and to search for candidate genes using a high-density genetic linkage map of wheat. A recombinant inbred line (RIL) population derived from the cross Linmai 2/Zhong 892 was evaluated for black point reaction during the 2011-2012, 2012-2013 and 2013-2014 cropping seasons, providing data for seven environments. A high-density linkage map was constructed by genotyping the RILs with the wheat 90K single nucleotide polymorphism (SNP) chip. Composite interval mapping detected nine QTL on chromosomes 2AL, 2BL, 3AL, 3BL, 5AS, 6A, 7AL (2) and 7BS, designated as QBp.caas-2AL, QBp.caas-2BL, QBp.caas-3AL, QBp.caas-3BL, QBp.caas-5AS, QBp.caas-6A, QBp.caas-7AL.1, QBp.caas-7AL.2 and QBp.caas-7BS, respectively. All resistance alleles, except for QBp.caas-7AL.1 from Linmai 2, were contributed by Zhong 892. QBp.caas-3BL, QBp.caas-5AS, QBp.caas-7AL.1, QBp.caas-7AL.2 and QBp.caas-7BS probably represent new loci for black point resistance. Sequences of tightly linked SNPs were used to survey wheat and related cereal genomes identifying three candidate genes for black point resistance. The tightly linked SNP markers can be used in marker-assisted breeding in combination with the kompetitive allele specific PCR technique to improve black point resistance.
New generation pharmacogenomic tools: a SNP linkage disequilibrium Map, validated SNP assay resource, and high-throughput instrumentation system for large-scale genetic studies.

PubMed

De La Vega, Francisco M; Dailey, David; Ziegle, Janet; Williams, Julie; Madden, Dawn; Gilbert, Dennis A

2002-06-01

Since public and private efforts announced the first draft of the human genome last year, researchers have reported great numbers of single nucleotide polymorphisms (SNPs). We believe that the availability of well-mapped, quality SNP markers constitutes the gateway to a revolution in genetics and personalized medicine that will lead to better diagnosis and treatment of common complex disorders. A new generation of tools and public SNP resources for pharmacogenomic and genetic studies--specifically for candidate-gene, candidate-region, and whole-genome association studies--will form part of the new scientific landscape. This will only be possible through the greater accessibility of SNP resources and superior high-throughput instrumentation-assay systems that enable affordable, highly productive large-scale genetic studies. We are contributing to this effort by developing a high-quality linkage disequilibrium SNP marker map and an accompanying set of ready-to-use, validated SNP assays across every gene in the human genome. This effort incorporates both the public sequence and SNP data sources, and Celera Genomics' human genome assembly and enormous resource ofphysically mapped SNPs (approximately 4,000,000 unique records). This article discusses our approach and methodology for designing the map, choosing quality SNPs, designing and validating these assays, and obtaining population frequency ofthe polymorphisms. We also discuss an advanced, high-performance SNP assay chemisty--a new generation of the TaqMan probe-based, 5' nuclease assay-and high-throughput instrumentation-software system for large-scale genotyping. We provide the new SNP map and validation information, validated SNP assays and reagents, and instrumentation systems as a novel resource for genetic discoveries.
High-density genetic map construction and QTLs identification for plant height in white jute (Corchorus capsularis L.) using specific locus amplified fragment (SLAF) sequencing.

PubMed

Tao, Aifen; Huang, Long; Wu, Guifen; Afshar, Reza Keshavarz; Qi, Jianmin; Xu, Jiantang; Fang, Pingping; Lin, Lihui; Zhang, Liwu; Lin, Peiqing

2017-05-08

Genetic mapping and quantitative trait locus (QTL) detection are powerful methodologies in plant improvement and breeding. White jute (Corchorus capsularis L.) is an important industrial raw material fiber crop because of its elite characteristics. However, construction of a high-density genetic map and identification of QTLs has been limited in white jute due to a lack of sufficient molecular markers. The specific locus amplified fragment sequencing (SLAF-seq) strategy combines locus-specific amplification and high-throughput sequencing to carry out de novo single nuclear polymorphism (SNP) discovery and large-scale genotyping. In this study, SLAF-seq was employed to obtain sufficient markers to construct a high-density genetic map for white jute. Moreover, with the development of abundant markers, genetic dissection of fiber yield traits such as plant height was also possible. Here, we present QTLs associated with plant height that were identified using our newly constructed genetic linkage groups. An F 8 population consisting of 100 lines was developed. In total, 69,446 high-quality SLAFs were detected of which 5,074 SLAFs were polymorphic; 913 polymorphic markers were used for the construction of a genetic map. The average coverage for each SLAF marker was 43-fold in the parents, and 9.8-fold in each F 8 individual. A linkage map was constructed that contained 913 SLAFs on 11 linkage groups (LGs) covering 1621.4 cM with an average density of 1.61 cM per locus. Among the 11 LGs, LG1 was the largest with 210 markers, a length of 406.34 cM, and an average distance of 1.93 cM between adjacent markers. LG11 was the smallest with only 25 markers, a length of 29.66 cM, and an average distance of 1.19 cM between adjacent markers. 'SNP_only' markers accounted for 85.54% and were the predominant markers on the map. QTL mapping based on the F 8 phenotypes detected 11 plant height QTLs including one major effect QTL across two cultivation locations, with each QTL accounting for 4.14-15.63% of the phenotypic variance. To our knowledge, the linkage map constructed here is the densest one available to date for white jute. This analysis also identified the first QTL in white jute. The results will provide an important platform for gene/QTL mapping, sequence assembly, genome comparisons, and marker-assisted selection breeding for white jute.
Genome-Wide Linkage and Positional Association Analyses Identify Associations of Novel AFF3 and NTM Genes with Triglycerides: The GenSalt Study

PubMed Central

Li, Changwei; Bazzano, Lydia A.L.; Rao, Dabeeru C.; Hixson, James E.; He, Jiang; Gu, Dongfeng; Gu, Charles C.; Shimmin, Lawrence C.; Jaquish, Cashell E.; Schwander, Karen; Liu, De-Pei; Huang, Jianfeng; Lu, Fanghong; Cao, Jie; Chong, Shen; Lu, Xiangfeng; Kelly, Tanika N.

2016-01-01

We conducted a genome-wide linkage scan and positional association study to identify genes and variants influencing blood lipid levels among participants of the Genetic Epidemiology Network of Salt-Sensitivity (GenSalt) study. The GenSalt study was conducted among 1906 participants from 633 Han Chinese families. Lipids were measured from overnight fasting blood samples using standard methods. Multipoint quantitative trait genome-wide linkage scans were performed on the high-density lipoprotein, low-density lipoprotein, and log-transformed triglyceride phenotypes. Using dense panels of single nucleotide polymorphisms (SNPs), single-marker and gene-based association analyses were conducted to follow-up on promising linkage signals. Additive associations between each SNP and lipid phenotypes were tested using mixed linear regression models. Gene-based analyses were performed by combining P-values from single-marker analyses within each gene using the truncated product method (TPM). Significant associations were assessed for replication among 777 Asian participants of the Multi-ethnic Study of Atherosclerosis (MESA). Bonferroni correction was used to adjust for multiple testing. In the GenSalt study, suggestive linkage signals were identified at 2p11.2–2q12.1 [maximum multipoint LOD score (MML) = 2.18 at 2q11.2] and 11q24.3–11q25 (MML = 2.29 at 11q25) for the log-transformed triglyceride phenotype. Follow-up analyses of these two regions revealed gene-based associations of charged multivesicular body protein 3 (CHMP3), ring finger protein 103 (RNF103), AF4/FMR2 family, member 3 (AFF3), and neurotrimin (NTM ) with triglycerides (P = 4 × 10−4, 1.00 × 10−5, 2.00 × 10−5, and 1.00 × 10−7, respectively). Both the AFF3 and NTM triglyceride associations were replicated among MESA study participants (P = 1.00 × 10−7 and 8.00 × 10−5, respectively). Furthermore, NTM explained the linkage signal on chromosome 11. In conclusion, we identified novel genes associated with lipid phenotypes in linkage regions on chromosomes 2 and 11. PMID:25819087
SNP Discovery in the Transcriptome of White Pacific Shrimp Litopenaeus vannamei by Next Generation Sequencing

PubMed Central

Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

2014-01-01

The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047
High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).

PubMed

Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C

2016-03-01

Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies. © 2015 John Wiley & Sons Ltd.
Genome-wide linkage mapping of yield-related traits in three Chinese bread wheat populations using high-density SNP markers.

PubMed

Li, Faji; Wen, Weie; He, Zhonghu; Liu, Jindong; Jin, Hui; Cao, Shuanghe; Geng, Hongwei; Yan, Jun; Zhang, Pingzhi; Wan, Yingxiu; Xia, Xianchun

2018-06-01

We identified 21 new and stable QTL, and 11 QTL clusters for yield-related traits in three bread wheat populations using the wheat 90 K SNP assay. Identification of quantitative trait loci (QTL) for yield-related traits and closely linked molecular markers is important in order to identify gene/QTL for marker-assisted selection (MAS) in wheat breeding. The objectives of the present study were to identify QTL for yield-related traits and dissect the relationships among different traits in three wheat recombinant inbred line (RIL) populations derived from crosses Doumai × Shi 4185 (D × S), Gaocheng 8901 × Zhoumai 16 (G × Z) and Linmai 2 × Zhong 892 (L × Z). Using the available high-density linkage maps previously constructed with the wheat 90 K iSelect single nucleotide polymorphism (SNP) array, 65, 46 and 53 QTL for 12 traits were identified in the three RIL populations, respectively. Among them, 34, 23 and 27 were likely to be new QTL. Eighteen common QTL were detected across two or three populations. Eleven QTL clusters harboring multiple QTL were detected in different populations, and the interval 15.5-32.3 cM around the Rht-B1 locus on chromosome 4BS harboring 20 QTL is an important region determining grain yield (GY). Thousand-kernel weight (TKW) is significantly affected by kernel width and plant height (PH), whereas flag leaf width can be used to select lines with large kernel number per spike. Eleven candidate genes were identified, including eight cloned genes for kernel, heading date (HD) and PH-related traits as well as predicted genes for TKW, spike length and HD. The closest SNP markers of stable QTL or QTL clusters can be used for MAS in wheat breeding using kompetitive allele-specific PCR or semi-thermal asymmetric reverse PCR assays for improvement of GY.
A consensus linkage map of lentil based on DArT markers from three RIL mapping populations.

PubMed

Ates, Duygu; Aldemir, Secil; Alsaleh, Ahmad; Erdogmus, Semih; Nemli, Seda; Kahriman, Abdullah; Ozkan, Hakan; Vandenberg, Albert; Tanyolac, Bahattin

2018-01-01

Lentil (Lens culinaris ssp. culinaris Medikus) is a diploid (2n = 2x = 14), self-pollinating grain legume with a haploid genome size of about 4 Gbp and is grown throughout the world with current annual production of 4.9 million tonnes. A consensus map of lentil (Lens culinaris ssp. culinaris Medikus) was constructed using three different lentils recombinant inbred line (RIL) populations, including "CDC Redberry" x "ILL7502" (LR8), "ILL8006" x "CDC Milestone" (LR11) and "PI320937" x "Eston" (LR39). The lentil consensus map was composed of 9,793 DArT markers, covered a total of 977.47 cM with an average distance of 0.10 cM between adjacent markers and constructed 7 linkage groups representing 7 chromosomes of the lentil genome. The consensus map had no gap larger than 12.67 cM and only 5 gaps were found to be between 12.67 cM and 6.0 cM (on LG3 and LG4). The localization of the SNP markers on the lentil consensus map were in general consistent with their localization on the three individual genetic linkage maps and the lentil consensus map has longer map length, higher marker density and shorter average distance between the adjacent markers compared to the component linkage maps. This high-density consensus map could provide insight into the lentil genome. The consensus map could also help to construct a physical map using a Bacterial Artificial Chromosome library and map based cloning studies. Sequence information of DArT may help localization of orientation scaffolds from Next Generation Sequencing data.
A consensus linkage map of lentil based on DArT markers from three RIL mapping populations

PubMed Central

Ates, Duygu; Aldemir, Secil; Alsaleh, Ahmad; Erdogmus, Semih; Nemli, Seda; Kahriman, Abdullah; Ozkan, Hakan; Vandenberg, Albert

2018-01-01

Background Lentil (Lens culinaris ssp. culinaris Medikus) is a diploid (2n = 2x = 14), self-pollinating grain legume with a haploid genome size of about 4 Gbp and is grown throughout the world with current annual production of 4.9 million tonnes. Materials and methods A consensus map of lentil (Lens culinaris ssp. culinaris Medikus) was constructed using three different lentils recombinant inbred line (RIL) populations, including “CDC Redberry” x “ILL7502” (LR8), “ILL8006” x “CDC Milestone” (LR11) and “PI320937” x “Eston” (LR39). Results The lentil consensus map was composed of 9,793 DArT markers, covered a total of 977.47 cM with an average distance of 0.10 cM between adjacent markers and constructed 7 linkage groups representing 7 chromosomes of the lentil genome. The consensus map had no gap larger than 12.67 cM and only 5 gaps were found to be between 12.67 cM and 6.0 cM (on LG3 and LG4). The localization of the SNP markers on the lentil consensus map were in general consistent with their localization on the three individual genetic linkage maps and the lentil consensus map has longer map length, higher marker density and shorter average distance between the adjacent markers compared to the component linkage maps. Conclusion This high-density consensus map could provide insight into the lentil genome. The consensus map could also help to construct a physical map using a Bacterial Artificial Chromosome library and map based cloning studies. Sequence information of DArT may help localization of orientation scaffolds from Next Generation Sequencing data. PMID:29351563
Identification, Characterization, and Mapping of a Novel SNP Associated with Body Color Transparency in Juvenile Red Sea Bream (Pagrus major).

PubMed

Sawayama, Eitaro; Noguchi, Daiki; Nakayama, Kei; Takagi, Motohiro

2018-03-23

We previously reported a body color deformity in juvenile red sea bream, which shows transparency in the juvenile stage because of delayed chromatophore development compared with normal individuals, and this finding suggested a genetic cause based on parentage assessments. To conduct marker-assisted selection to eliminate broodstock inheriting the causative gene, developing DNA markers associated with the phenotype was needed. We first conducted SNP mining based on AFLP analysis using bulked-DNA from normal and transparent individuals. One SNP was identified from a transparent-specific AFLP fragment, which significantly associated with transparent individuals. Two alleles (A/G) were observed in this locus, and the genotype G/G was dominantly observed in the transparent groups (97.1%) collected from several production lots produced from different broodstock populations. A few normal individuals inherited the G/G genotype (5.0%), but the A/A and A/G genotypes were dominantly observed in the normal groups. The homologs region of the SNP was searched using a medaka genome database, and intron 12 of the Nell2a gene (located on chromosome 6 of the medaka genome) was highly matched. We also mapped the red sea bream Nell2a gene on the previously developed linkage maps, and this gene was mapped on a male linkage group, LG4-M. The newly found SNP was useful in eliminating broodstock possessing the causative gene of the body color transparency observed in juvenile stage of red sea bream.
Identification of stable QTLs for seed oil content by combined linkage and association mapping in Brassica napus.

PubMed

Sun, Fengming; Liu, Jing; Hua, Wei; Sun, Xingchao; Wang, Xinfa; Wang, Hanzhong

2016-11-01

Seed oil content is an important agricultural trait in rapeseed breeding. Although numerous quantitative trait locus (QTL) have been identified, most of them cannot be applied in practical breeding mainly due to environmental instability or large confidence intervals. The purpose of this study was to identify and validate high quality and more stable QTLs by combining linkage mapping and genome-wide association study (GWAS). For linkage mapping, we constructed two F 2 populations from crosses of high-oil content (∼50%) lines 6F313 and 61616 with a low-oil content (∼40%) line 51070. Two high density linkage maps spanned 1987cM (1659 bins) and 1856cM (1746 bins), respectively. For GWAS, we developed more than 34,000 high-quality SNP markers based on 227 accessions. Finally, 40 QTLs and 29 associations were established by linkage and association mapping in different environments. After merging the results, 32 consensus QTLs were obtained and 7 of them were identified by both mapping methods. Seven overlapping QTLs covered an average confidence interval of 183kb and explained the phenotypic variation of 10.23 to 24.45%. We further developed allele-specific PCR primers to identify each of the seven QTLs. These stable QTLs should be useful in gene cloning and practical breeding application. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar).

PubMed

Houston, Ross D; Taggart, John B; Cézard, Timothé; Bekaert, Michaël; Lowe, Natalie R; Downing, Alison; Talbot, Richard; Bishop, Stephen C; Archibald, Alan L; Bron, James E; Penman, David J; Davassi, Alessandro; Brew, Fiona; Tinch, Alan E; Gharbi, Karim; Hamilton, Alastair

2014-02-06

Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection.
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar)

PubMed Central

2014-01-01

Background Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. Results SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. Conclusions This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection. PMID:24524230

Fast and Accurate Approximation to Significance Tests in Genome-Wide Association Studies

PubMed Central

Zhang, Yu; Liu, Jun S.

2011-01-01

Genome-wide association studies commonly involve simultaneous tests of millions of single nucleotide polymorphisms (SNP) for disease association. The SNPs in nearby genomic regions, however, are often highly correlated due to linkage disequilibrium (LD, a genetic term for correlation). Simple Bonferonni correction for multiple comparisons is therefore too conservative. Permutation tests, which are often employed in practice, are both computationally expensive for genome-wide studies and limited in their scopes. We present an accurate and computationally efficient method, based on Poisson de-clumping heuristics, for approximating genome-wide significance of SNP associations. Compared with permutation tests and other multiple comparison adjustment approaches, our method computes the most accurate and robust p-value adjustments for millions of correlated comparisons within seconds. We demonstrate analytically that the accuracy and the efficiency of our method are nearly independent of the sample size, the number of SNPs, and the scale of p-values to be adjusted. In addition, our method can be easily adopted to estimate false discovery rate. When applied to genome-wide SNP datasets, we observed highly variable p-value adjustment results evaluated from different genomic regions. The variation in adjustments along the genome, however, are well conserved between the European and the African populations. The p-value adjustments are significantly correlated with LD among SNPs, recombination rates, and SNP densities. Given the large variability of sequence features in the genome, we further discuss a novel approach of using SNP-specific (local) thresholds to detect genome-wide significant associations. This article has supplementary material online. PMID:22140288
LD2SNPing: linkage disequilibrium plotter and RFLP enzyme mining for tag SNPs

PubMed Central

Chang, Hsueh-Wei; Chuang, Li-Yeh; Chang, Yan-Jhu; Cheng, Yu-Huei; Hung, Yu-Chen; Chen, Hsiang-Chi; Yang, Cheng-Hong

2009-01-01

Background Linkage disequilibrium (LD) mapping is commonly used to evaluate markers for genome-wide association studies. Most types of LD software focus strictly on LD analysis and visualization, but lack supporting services for genotyping. Results We developed a freeware called LD2SNPing, which provides a complete package of mining tools for genotyping and LD analysis environments. The software provides SNP ID- and gene-centric online retrievals for SNP information and tag SNP selection from dbSNP/NCBI and HapMap, respectively. Restriction fragment length polymorphism (RFLP) enzyme information for SNP genotype is available to all SNP IDs and tag SNPs. Single and multiple SNP inputs are possible in order to perform LD analysis by online retrieval from HapMap and NCBI. An LD statistics section provides D, D', r2, δQ, ρ, and the P values of the Hardy-Weinberg Equilibrium for each SNP marker, and Chi-square and likelihood-ratio tests for the pair-wise association of two SNPs in LD calculation. Finally, 2D and 3D plots, as well as plain-text output of the results, can be selected. Conclusion LD2SNPing thus provides a novel visualization environment for multiple SNP input, which facilitates SNP association studies. The software, user manual, and tutorial are freely available at . PMID:19500380
A high-density genetic map of Arachis duranensis, a diploid ancestor of cultivated peanut

PubMed Central

2012-01-01

Background Cultivated peanut (Arachis hypogaea) is an allotetraploid species whose ancestral genomes are most likely derived from the A-genome species, A. duranensis, and the B-genome species, A. ipaensis. The very recent (several millennia) evolutionary origin of A. hypogaea has imposed a bottleneck for allelic and phenotypic diversity within the cultigen. However, wild diploid relatives are a rich source of alleles that could be used for crop improvement and their simpler genomes can be more easily analyzed while providing insight into the structure of the allotetraploid peanut genome. The objective of this research was to establish a high-density genetic map of the diploid species A. duranensis based on de novo generated EST databases. Arachis duranensis was chosen for mapping because it is the A-genome progenitor of cultivated peanut and also in order to circumvent the confounding effects of gene duplication associated with allopolyploidy in A. hypogaea. Results More than one million expressed sequence tag (EST) sequences generated from normalized cDNA libraries of A. duranensis were assembled into 81,116 unique transcripts. Mining this dataset, 1236 EST-SNP markers were developed between two A. duranensis accessions, PI 475887 and Grif 15036. An additional 300 SNP markers also were developed from genomic sequences representing conserved legume orthologs. Of the 1536 SNP markers, 1054 were placed on a genetic map. In addition, 598 EST-SSR markers identified in A. hypogaea assemblies were included in the map along with 37 disease resistance gene candidate (RGC) and 35 other previously published markers. In total, 1724 markers spanning 1081.3 cM over 10 linkage groups were mapped. Gene sequences that provided mapped markers were annotated using similarity searches in three different databases, and gene ontology descriptions were determined using the Medicago Gene Atlas and TAIR databases. Synteny analysis between A. duranensis, Medicago and Glycine revealed significant stretches of conserved gene clusters spread across the peanut genome. A higher level of colinearity was detected between A. duranensis and Glycine than with Medicago. Conclusions The first high-density, gene-based linkage map for A. duranensis was generated that can serve as a reference map for both wild and cultivated Arachis species. The markers developed here are valuable resources for the peanut, and more broadly, to the legume research community. The A-genome map will have utility for fine mapping in other peanut species and has already had application for mapping a nematode resistance gene that was introgressed into A. hypogaea from A. cardenasii. PMID:22967170
A Large Maize (Zea mays L.) SNP Genotyping Array: Development and Germplasm Genotyping, and Genetic Mapping to Compare with the B73 Reference Genome

PubMed Central

Ganal, Martin W.; Durstewitz, Gregor; Polley, Andreas; Bérard, Aurélie; Buckler, Edward S.; Charcosset, Alain; Clarke, Joseph D.; Graner, Eva-Maria; Hansen, Mark; Joets, Johann; Le Paslier, Marie-Christine; McMullen, Michael D.; Montalent, Pierre; Rose, Mark; Schön, Chris-Carolin; Sun, Qi; Walter, Hildrun; Martin, Olivier C.; Falque, Matthieu

2011-01-01

SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations – IBM (B73×Mo17) and LHRF (F2×F252) – were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding. PMID:22174790
Accuracy of direct genomic values in Holstein bulls and cows using subsets of SNP markers

PubMed Central

2010-01-01

Background At the current price, the use of high-density single nucleotide polymorphisms (SNP) genotyping assays in genomic selection of dairy cattle is limited to applications involving elite sires and dams. The objective of this study was to evaluate the use of low-density assays to predict direct genomic value (DGV) on five milk production traits, an overall conformation trait, a survival index, and two profit index traits (APR, ASI). Methods Dense SNP genotypes were available for 42,576 SNP for 2,114 Holstein bulls and 510 cows. A subset of 1,847 bulls born between 1955 and 2004 was used as a training set to fit models with various sets of pre-selected SNP. A group of 297 bulls born between 2001 and 2004 and all cows born between 1992 and 2004 were used to evaluate the accuracy of DGV prediction. Ridge regression (RR) and partial least squares regression (PLSR) were used to derive prediction equations and to rank SNP based on the absolute value of the regression coefficients. Four alternative strategies were applied to select subset of SNP, namely: subsets of the highest ranked SNP for each individual trait, or a single subset of evenly spaced SNP, where SNP were selected based on their rank for ASI, APR or minor allele frequency within intervals of approximately equal length. Results RR and PLSR performed very similarly to predict DGV, with PLSR performing better for low-density assays and RR for higher-density SNP sets. When using all SNP, DGV predictions for production traits, which have a higher heritability, were more accurate (0.52-0.64) than for survival (0.19-0.20), which has a low heritability. The gain in accuracy using subsets that included the highest ranked SNP for each trait was marginal (5-6%) over a common set of evenly spaced SNP when at least 3,000 SNP were used. Subsets containing 3,000 SNP provided more than 90% of the accuracy that could be achieved with a high-density assay for cows, and 80% of the high-density assay for young bulls. Conclusions Accurate genomic evaluation of the broader bull and cow population can be achieved with a single genotyping assays containing ~ 3,000 to 5,000 evenly spaced SNP. PMID:20950478
A combinatorial approach of comprehensive QTL-based comparative genome mapping and transcript profiling identified a seed weight-regulating candidate gene in chickpea

PubMed Central

Bajaj, Deepak; Upadhyaya, Hari D.; Khan, Yusuf; Das, Shouvik; Badoni, Saurabh; Shree, Tanima; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Singh, Sube; Sharma, Shivali; Tyagi, Akhilesh K.; Chattopdhyay, Debasis; Parida, Swarup K.

2015-01-01

High experimental validation/genotyping success rate (94–96%) and intra-specific polymorphic potential (82–96%) of 1536 SNP and 472 SSR markers showing in silico polymorphism between desi ICC 4958 and kabuli ICC 12968 chickpea was obtained in a 190 mapping population (ICC 4958 × ICC 12968) and 92 diverse desi and kabuli genotypes. A high-density 2001 marker-based intra-specific genetic linkage map comprising of eight LGs constructed is comparatively much saturated (mean map-density: 0.94 cM) in contrast to existing intra-specific genetic maps in chickpea. Fifteen robust QTLs (PVE: 8.8–25.8% with LOD: 7.0–13.8) associated with pod and seed number/plant (PN and SN) and 100 seed weight (SW) were identified and mapped on 10 major genomic regions of eight LGs. One of 126.8 kb major genomic region harbouring a strong SW-associated robust QTL (Caq'SW1.1: 169.1–171.3 cM) has been delineated by integrating high-resolution QTL mapping with comprehensive marker-based comparative genome mapping and differential expression profiling. This identified one potential regulatory SNP (G/A) in the cis-acting element of candidate ERF (ethylene responsive factor) TF (transcription factor) gene governing seed weight in chickpea. The functionally relevant molecular tags identified have potential to be utilized for marker-assisted genetic improvement of chickpea. PMID:25786576
Construction of a High-Density American Cranberry (Vaccinium macrocarpon Ait.) Composite Map Using Genotyping-by-Sequencing for Multi-pedigree Linkage Mapping

PubMed Central

Schlautman, Brandon; Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Iorizzo, Massimo; Polashock, James; Grygleski, Edward; Vorsa, Nicholi; Zalapa, Juan

2017-01-01

The American cranberry (Vaccinium macrocarpon Ait.) is a recently domesticated, economically important, fruit crop with limited molecular resources. New genetic resources could accelerate genetic gain in cranberry through characterization of its genomic structure and by enabling molecular-assisted breeding strategies. To increase the availability of cranberry genomic resources, genotyping-by-sequencing (GBS) was used to discover and genotype thousands of single nucleotide polymorphisms (SNPs) within three interrelated cranberry full-sib populations. Additional simple sequence repeat (SSR) loci were added to the SNP datasets and used to construct bin maps for the parents of the populations, which were then merged to create the first high-density cranberry composite map containing 6073 markers (5437 SNPs and 636 SSRs) on 12 linkage groups (LGs) spanning 1124 cM. Interestingly, higher rates of recombination were observed in maternal than paternal gametes. The large number of markers in common (mean of 57.3) and the high degree of observed collinearity (mean Pair-wise Spearman rank correlations >0.99) between the LGs of the parental maps demonstrates the utility of GBS in cranberry for identifying polymorphic SNP loci that are transferable between pedigrees and populations in future trait-association studies. Furthermore, the high-density of markers anchored within the component maps allowed identification of segregation distortion regions, placement of centromeres on each of the 12 LGs, and anchoring of genomic scaffolds. Collectively, the results represent an important contribution to the current understanding of cranberry genomic structure and to the availability of molecular tools for future genetic research and breeding efforts in cranberry. PMID:28250016
Whole genome sequences are required to fully resolve the linkage disequilibrium structure of human populations.

PubMed

Pengelly, Reuben J; Tapper, William; Gibson, Jane; Knut, Marcin; Tearle, Rick; Collins, Andrew; Ennis, Sarah

2015-09-03

An understanding of linkage disequilibrium (LD) structures in the human genome underpins much of medical genetics and provides a basis for disease gene mapping and investigating biological mechanisms such as recombination and selection. Whole genome sequencing (WGS) provides the opportunity to determine LD structures at maximal resolution. We compare LD maps constructed from WGS data with LD maps produced from the array-based HapMap dataset, for representative European and African populations. WGS provides up to 5.7-fold greater SNP density than array-based data and achieves much greater resolution of LD structure, allowing for identification of up to 2.8-fold more regions of intense recombination. The absence of ascertainment bias in variant genotyping improves the population representativeness of the WGS maps, and highlights the extent of uncaptured variation using array genotyping methodologies. The complete capture of LD patterns using WGS allows for higher genome-wide association study (GWAS) power compared to array-based GWAS, with WGS also allowing for the analysis of rare variation. The impact of marker ascertainment issues in arrays has been greatest for Sub-Saharan African populations where larger sample sizes and substantially higher marker densities are required to fully resolve the LD structure. WGS provides the best possible resource for LD mapping due to the maximal marker density and lack of ascertainment bias. WGS LD maps provide a rich resource for medical and population genetics studies. The increasing availability of WGS data for large populations will allow for improved research utilising LD, such as GWAS and recombination biology studies.
Identification of QTLs for 14 Agronomically Important Traits in Setaria italica Based on SNPs Generated from High-Throughput Sequencing

PubMed Central

Zhang, Kai; Fan, Guangyu; Zhang, Xinxin; Zhao, Fang; Wei, Wei; Du, Guohua; Feng, Xiaolei; Wang, Xiaoming; Wang, Feng; Song, Guoliang; Zou, Hongfeng; Zhang, Xiaolei; Li, Shuangdong; Ni, Xuemei; Zhang, Gengyun; Zhao, Zhihai

2017-01-01

Foxtail millet (Setaria italica) is an important crop possessing C4 photosynthesis capability. The S. italica genome was de novo sequenced in 2012, but the sequence lacked high-density genetic maps with agronomic and yield trait linkages. In the present study, we resequenced a foxtail millet population of 439 recombinant inbred lines (RILs) and developed high-resolution bin map and high-density SNP markers, which could provide an effective approach for gene identification. A total of 59 QTL for 14 agronomic traits in plants grown under long- and short-day photoperiods were identified. The phenotypic variation explained ranged from 4.9 to 43.94%. In addition, we suggested that there may be segregation distortion on chromosome 6 that is significantly distorted toward Zhang gu. The newly identified QTL will provide a platform for sequence-based research on the S. italica genome, and for molecular marker-assisted breeding. PMID:28364039
Identification of QTLs for 14 Agronomically Important Traits in Setaria italica Based on SNPs Generated from High-Throughput Sequencing.

PubMed

Zhang, Kai; Fan, Guangyu; Zhang, Xinxin; Zhao, Fang; Wei, Wei; Du, Guohua; Feng, Xiaolei; Wang, Xiaoming; Wang, Feng; Song, Guoliang; Zou, Hongfeng; Zhang, Xiaolei; Li, Shuangdong; Ni, Xuemei; Zhang, Gengyun; Zhao, Zhihai

2017-05-05

Foxtail millet ( Setaria italica ) is an important crop possessing C4 photosynthesis capability. The S. italica genome was de novo sequenced in 2012, but the sequence lacked high-density genetic maps with agronomic and yield trait linkages. In the present study, we resequenced a foxtail millet population of 439 recombinant inbred lines (RILs) and developed high-resolution bin map and high-density SNP markers, which could provide an effective approach for gene identification. A total of 59 QTL for 14 agronomic traits in plants grown under long- and short-day photoperiods were identified. The phenotypic variation explained ranged from 4.9 to 43.94%. In addition, we suggested that there may be segregation distortion on chromosome 6 that is significantly distorted toward Zhang gu. The newly identified QTL will provide a platform for sequence-based research on the S. italica genome, and for molecular marker-assisted breeding. Copyright © 2017 Zhang et al.
Empirical characteristics of family-based linkage to a complex trait: the ADIPOQ region and adiponectin levels.

PubMed

Hellwege, Jacklyn N; Palmer, Nicholette D; Mark Brown, W; Brown, Mark W; Ziegler, Julie T; Sandy An, S; An, Sandy S; Guo, Xiuqing; Ida Chen, Y-D; Chen, Ida Y-D; Taylor, Kent; Hawkins, Gregory A; Ng, Maggie C Y; Speliotes, Elizabeth K; Lorenzo, Carlos; Norris, Jill M; Rotter, Jerome I; Wagenknecht, Lynne E; Langefeld, Carl D; Bowden, Donald W

2015-02-01

We previously identified a low-frequency (1.1 %) coding variant (G45R; rs200573126) in the adiponectin gene (ADIPOQ) which was the basis for a multipoint microsatellite linkage signal (LOD = 8.2) for plasma adiponectin levels in Hispanic families. We have empirically evaluated the ability of data from targeted common variants, exome chip genotyping, and genome-wide association study data to detect linkage and association to adiponectin protein levels at this locus. Simple two-point linkage and association analyses were performed in 88 Hispanic families (1,150 individuals) using 10,958 SNPs on chromosome 3. Approaches were compared for their ability to map the functional variant, G45R, which was strongly linked (two-point LOD = 20.98) and powerfully associated (p value = 8.1 × 10(-50)). Over 450 SNPs within a broad 61 Mb interval around rs200573126 showed nominal evidence of linkage (LOD > 3) but only four other SNPs in this region were associated with p values < 1.0 × 10(-4). When G45R was accounted for, the maximum LOD score across the interval dropped to 4.39 and the best p value was 1.1 × 10(-5). Linked and/or associated variants ranged in frequency (0.0018-0.50) and type (coding, non-coding) and had little detectable linkage disequilibrium with rs200573126 (r (2) < 0.20). In addition, the two-point linkage approach empirically outperformed multipoint microsatellite and multipoint SNP analysis. In the absence of data for rs200573126, family-based linkage analysis using a moderately dense SNP dataset, including both common and low-frequency variants, resulted in stronger evidence for an adiponectin locus than association data alone. Thus, linkage analysis can be a useful tool to facilitate identification of high-impact genetic variants.
Linkage disequilibrium between STRPs and SNPs across the human genome.

PubMed

Payseur, Bret A; Place, Michael; Weber, James L

2008-05-01

Patterns of linkage disequilibrium (LD) reveal the action of evolutionary processes and provide crucial information for association mapping of disease genes. Although recent studies have described the landscape of LD among single nucleotide polymorphisms (SNPs) from across the human genome, associations involving other classes of molecular variation remain poorly understood. In addition to recombination and population history, mutation rate and process are expected to shape LD. To test this idea, we measured associations between short-tandem-repeat polymorphisms (STRPs), which can mutate rapidly and recurrently, and SNPs in 721 regions across the human genome. We directly compared STRP-SNP LD with SNP-SNP LD from the same genomic regions in the human HapMap populations. The intensity of STRP-SNP LD, measured by the average of D', was reduced, consistent with the action of recurrent mutation. Nevertheless, a higher fraction of STRP-SNP pairs than SNP-SNP pairs showed significant LD, on both short (up to 50 kb) and long (cM) scales. These results reveal the substantial effects of mutational processes on LD at STRPs and provide important measures of the potential of STRPs for association mapping of disease genes.
A high-density genetic map and growth related QTL mapping in bighead carp (Hypophthalmichthys nobilis)

PubMed Central

Fu, Beide; Liu, Haiyang; Yu, Xiaomu; Tong, Jingou

2016-01-01

Growth related traits in fish are controlled by quantitative trait loci (QTL), but no QTL for growth have been detected in bighead carp (Hypophthalmichthys nobilis) due to the lack of high-density genetic map. In this study, an ultra-high density genetic map was constructed with 3,121 SNP markers by sequencing 117 individuals in a F1 family using 2b-RAD technology. The total length of the map was 2341.27 cM, with an average marker interval of 0.75 cM. A high level of genomic synteny between our map and zebrafish was detected. Based on this genetic map, one genome-wide significant and 37 suggestive QTL for five growth-related traits were identified in 6 linkage groups (i.e. LG3, LG11, LG15, LG18, LG19, LG22). The phenotypic variance explained (PVE) by these QTL varied from 15.4% to 38.2%. Marker within the significant QTL region was surrounded by CRP1 and CRP2, which played an important role in muscle cell division. These high-density map and QTL information provided a solid base for QTL fine mapping and comparative genomics in bighead carp. PMID:27345016
POLYMORPHISMS NEAR SOCS3 ARE ASSOCIATED WITH OBESITY AND GLUCOSE HOMEOSTASIS TRAITS IN HISPANIC AMERICANS FROM THE INSULIN RESISTANCE ATHEROSCLEROSIS FAMILY STUDY

PubMed Central

Talbert, Matthew E; Langefeld, Carl D; Ziegler, Julie; Mychaleckyj, Josyf C; Haffner, Steven M; Norris, Jill M; Bowden, Donald W

2009-01-01

The SOCS3 gene product participates in the feedback inhibition of a range of cytokine signals. Most notably, SOCS3 inhibits the functioning of leptin and downstream steps in insulin signaling after being expressed by terminal transcription factors, such as STAT3 and c-fos. The SOCS3 gene is located in the chromosome region 17q24–17q25, previously linked to body mass index (BMI), visceral adipose tissue (VAT), and waist circumference (WAIST) in Hispanic families in the Insulin Resistance Atherosclerosis Family Study (IRASFS). A high density map of 1536 single nucleotide polymorphisms (SNPs) was constructed to cover a portion of the 17q linkage interval in DNA samples from 1425 Hispanic subjects from 90 extended families in IRASFS. Analysis of this dense SNP map data revealed evidence of association of rs9914220 (located 10 kb 5’ of the SOCS3 gene) with BMI, VAT, and WAIST (P-value ranging from 0 003 to 0.017). Using a tagging SNP approach, rs9914220 and 22 additional SOCS3 SNPs were genotyped for genetic association analysis with measures of adiposity and glucose homeostasis. The adiposity phenotypes utilized in association analyses included BMI, WAIST, waist to hip ratio (WHR), subcutaneous adipose tissue (SAT), VAT, and visceral to subcutaneous ratio (VSR). Linkage disequilibrium (LD) calculations revealed three haplotype blocks near SOCS3. Haplotype Block 1 (5’ of SOCS3) contained SNPs consistently associated with BMI, WAIST, WHR, and VAT (P-values ranging from 2.00x10−4 to .036). Haplotype Block 3 contained single-SNPs that were associated with most adiposity traits except for VSR (P-values ranging from 0.002 to 0.047). When trait associated SNPs were included in linkage analyses as covariates, a reduction of VAT LOD score from 1.26 to .76 above the SOCS3 locus (110 cM) was observed. Multi-SNP haplotype testing using the quantitative pedigree disequilibrium test (QPDT) was broadly consistent with the single-SNP associations. In conclusion, these results support a role for SOCS3 genetic variants in human obesity. PMID:19083014
Characterization of Insect Resistance Loci in the USDA Soybean Germplasm Collection Using Genome-Wide Association Studies

PubMed Central

Chang, Hao-Xun; Hartman, Glen L.

2017-01-01

Management of insects that cause economic damage to yields of soybean mainly rely on insecticide applications. Sources of resistance in soybean plant introductions (PIs) to different insect pests have been reported, and some of these sources, like for the soybean aphid (SBA), have been used to develop resistant soybean cultivars. With the availability of SoySNP50K and the statistical power of genome-wide association studies, we integrated phenotypic data for beet armyworm, Mexican bean beetle (MBB), potato leafhopper (PLH), SBA, soybean looper (SBL), velvetbean caterpillar (VBC), and chewing damage caused by unspecified insects for a comprehensive understanding of insect resistance in the United States Department of Agriculture Soybean Germplasm Collection. We identified significant single nucleotide (SNP) polymorphic markers for MBB, PLH, SBL, and VBC, and we highlighted several leucine-rich repeat-containing genes and myeloblastosis transcription factors within the high linkage disequilibrium region surrounding significant SNP markers. Specifically for soybean resistance to PLH, we found the PLH locus is close but distinct to a locus for soybean pubescence density on chromosome 12. The results provide genetic support that pubescence density may not directly link to PLH resistance. This study offers a novel insight of soybean resistance to four insect pests and reviews resistance mapping studies for major soybean insects. PMID:28555141
Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map.

PubMed

N'Diaye, Amidou; Haile, Jemanesh K; Cory, Aron T; Clarke, Fran R; Clarke, John M; Knox, Ron E; Pozniak, Curtis J

2017-01-01

Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat.
Toward allotetraploid cotton genome assembly: integration of a high-density molecular genetic linkage map with DNA sequence information

PubMed Central

2012-01-01

Background Cotton is the world’s most important natural textile fiber and a significant oilseed crop. Decoding cotton genomes will provide the ultimate reference and resource for research and utilization of the species. Integration of high-density genetic maps with genomic sequence information will largely accelerate the process of whole-genome assembly in cotton. Results In this paper, we update a high-density interspecific genetic linkage map of allotetraploid cultivated cotton. An additional 1,167 marker loci have been added to our previously published map of 2,247 loci. Three new marker types, InDel (insertion-deletion) and SNP (single nucleotide polymorphism) developed from gene information, and REMAP (retrotransposon-microsatellite amplified polymorphism), were used to increase map density. The updated map consists of 3,414 loci in 26 linkage groups covering 3,667.62 cM with an average inter-locus distance of 1.08 cM. Furthermore, genome-wide sequence analysis was finished using 3,324 informative sequence-based markers and publicly-available Gossypium DNA sequence information. A total of 413,113 EST and 195 BAC sequences were physically anchored and clustered by 3,324 sequence-based markers. Of these, 14,243 ESTs and 188 BACs from different species of Gossypium were clustered and specifically anchored to the high-density genetic map. A total of 2,748 candidate unigenes from 2,111 ESTs clusters and 63 BACs were mined for functional annotation and classification. The 337 ESTs/genes related to fiber quality traits were integrated with 132 previously reported cotton fiber quality quantitative trait loci, which demonstrated the important roles in fiber quality of these genes. Higher-level sequence conservation between different cotton species and between the A- and D-subgenomes in tetraploid cotton was found, indicating a common evolutionary origin for orthologous and paralogous loci in Gossypium. Conclusion This study will serve as a valuable genomic resource for tetraploid cotton genome assembly, for cloning genes related to superior agronomic traits, and for further comparative genomic analyses in Gossypium. PMID:23046547
Genetic diversity analysis of two commercial breeds of pigs using genomic and pedigree data.

PubMed

Zanella, Ricardo; Peixoto, Jane O; Cardoso, Fernando F; Cardoso, Leandro L; Biegelmeyer, Patrícia; Cantão, Maurício E; Otaviano, Antonio; Freitas, Marcelo S; Caetano, Alexandre R; Ledur, Mônica C

2016-03-30

Genetic improvement in livestock populations can be achieved without significantly affecting genetic diversity if mating systems and selection decisions take genetic relationships among individuals into consideration. The objective of this study was to examine the genetic diversity of two commercial breeds of pigs. Genotypes from 1168 Landrace (LA) and 1094 Large White (LW) animals from a commercial breeding program in Brazil were obtained using the Illumina PorcineSNP60 Beadchip. Inbreeding estimates based on pedigree (F x) and genomic information using runs of homozygosity (F ROH) and the single nucleotide polymorphisms (SNP) by SNP inbreeding coefficient (F SNP) were obtained. Linkage disequilibrium (LD), correlation of linkage phase (r) and effective population size (N e ) were also estimated. Estimates of inbreeding obtained with pedigree information were lower than those obtained with genomic data in both breeds. We observed that the extent of LD was slightly larger at shorter distances between SNPs in the LW population than in the LA population, which indicates that the LW population was derived from a smaller N e . Estimates of N e based on genomic data were equal to 53 and 40 for the current populations of LA and LW, respectively. The correlation of linkage phase between the two breeds was equal to 0.77 at distances up to 50 kb, which suggests that genome-wide association and selection should be performed within breed. Although selection intensities have been stronger in the LA breed than in the LW breed, levels of genomic and pedigree inbreeding were lower for the LA than for the LW breed. The use of genomic data to evaluate population diversity in livestock animals can provide new and more precise insights about the effects of intense selection for production traits. Resulting information and knowledge can be used to effectively increase response to selection by appropriately managing the rate of inbreeding, minimizing negative effects of inbreeding depression and therefore maintaining desirable levels of genetic diversity.
Haplotypes in the gene encoding protein kinase c-beta (PRKCB1) on chromosome 16 are associated with autism.

PubMed

Philippi, A; Roschmann, E; Tores, F; Lindenbaum, P; Benajou, A; Germain-Leclerc, L; Marcaillou, C; Fontaine, K; Vanpeene, M; Roy, S; Maillard, S; Decaulne, V; Saraiva, J P; Brooks, P; Rousseau, F; Hager, J

2005-10-01

Autism is a developmental disorder characterized by impairments in social interaction and communication associated with repetitive patterns of interest or behavior. Autism is highly influenced by genetic factors. Genome-wide linkage and candidate gene association approaches have been used to try and identify autism genes. A few loci have repeatedly been reported linked to autism. Several groups reported evidence for linkage to a region on chromosome 16p. We have applied a direct physical identity-by-descent (IBD) mapping approach to perform a high-density (0.85 megabases) genome-wide linkage scan in 116 families from the AGRE collection. Our results confirm linkage to a region on chromosome 16p with autism. High-resolution single-nucleotide polymorphism (SNP) genotyping and analysis of this region show that haplotypes in the protein kinase c-beta gene are strongly associated with autism. An independent replication of the association in a second set of 167 trio families with autism confirmed our initial findings. Overall, our data provide evidence that the PRKCB1 gene on chromosome 16p may be involved in the etiology of autism.
SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.

PubMed

Lee, Tae-Ho; Guo, Hui; Wang, Xiyin; Kim, Changsoo; Paterson, Andrew H

2014-02-26

Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.

A graphene-based platform for single nucleotide polymorphism (SNP) genotyping.

PubMed

Liu, Meng; Zhao, Huimin; Chen, Shuo; Yu, Hongtao; Zhang, Yaobin; Quan, Xie

2011-06-15

A facile, rapid, stable and sensitive approach for fluorescent detection of single nucleotide polymorphism (SNP) is designed based on DNA ligase reaction and π-stacking between the graphene and the nucleotide bases. In the presence of perfectly matched DNA, DNA ligase can catalyze the linkage of fluorescein amidite-labeled single-stranded DNA (ssDNA) and a phosphorylated ssDNA, and thus the formation of a stable duplex in high yield. However, the catalytic reaction cannot effectively carry out with one-base mismatched DNA target. In this case, we add graphene to the system in order to produce different quenching signals due to its different adsorption affinity for ssDNA and double-stranded DNA. Taking advantage of the unique surface property of graphene and the high discriminability of DNA ligase, the proposed protocol exhibits good performance in SNP genotyping. The results indicate that it is possible to accurately determine SNP with frequency as low as 2.6% within 40 min. Furthermore, the presented flexible strategy facilitates the development of other biosensing applications in the future. Copyright © 2011 Elsevier B.V. All rights reserved.
The extent of linkage disequilibrium in beef cattle breeds using high-density SNP genotypes.

PubMed

Porto-Neto, Laercio R; Kijas, James W; Reverter, Antonio

2014-03-24

The extent of linkage disequilibrium (LD) between molecular markers impacts genome-wide association studies and implementation of genomic selection. The availability of high-density single nucleotide polymorphism (SNP) genotyping platforms makes it possible to investigate LD at an unprecedented resolution. In this work, we characterised LD decay in breeds of beef cattle of taurine, indicine and composite origins and explored its variation across autosomes and the X chromosome. In each breed, LD decayed rapidly and r2 was less than 0.2 for marker pairs separated by 50 kb. The LD decay curves clustered into three groups of similar LD decay that distinguished the three main cattle types. At short distances between markers (<10 kb), taurine breeds showed higher LD (r2=0.45) than their indicine (r2=0.25) and composite (r2=0.32) counterparts. This higher LD in taurine breeds was attributed to a smaller effective population size and a stronger bottleneck during breed formation. Using all SNPs on only the X chromosome, the three cattle types could still be distinguished. However for taurine breeds, the LD decay on the X chromosome was much faster and the background level much lower than for indicine breeds and composite populations. When using only SNPs that were polymorphic in all breeds, the analysis of the X chromosome mimicked that of the autosomes. The pattern of LD mirrored some aspects of the history of breed populations and showed a sharp decay with increasing physical distance between markers. We conclude that the availability of the HD chip can be used to detect association signals that remained hidden when using lower density genotyping platforms, since LD dropped below 0.2 at distances of 50 kb.
Construction of a High-Density American Cranberry (Vaccinium macrocarpon Ait.) Composite Map Using Genotyping-by-Sequencing for Multi-pedigree Linkage Mapping.

PubMed

Schlautman, Brandon; Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Iorizzo, Massimo; Polashock, James; Grygleski, Edward; Vorsa, Nicholi; Zalapa, Juan

2017-04-03

The American cranberry ( Vaccinium macrocarpon Ait.) is a recently domesticated, economically important, fruit crop with limited molecular resources. New genetic resources could accelerate genetic gain in cranberry through characterization of its genomic structure and by enabling molecular-assisted breeding strategies. To increase the availability of cranberry genomic resources, genotyping-by-sequencing (GBS) was used to discover and genotype thousands of single nucleotide polymorphisms (SNPs) within three interrelated cranberry full-sib populations. Additional simple sequence repeat (SSR) loci were added to the SNP datasets and used to construct bin maps for the parents of the populations, which were then merged to create the first high-density cranberry composite map containing 6073 markers (5437 SNPs and 636 SSRs) on 12 linkage groups (LGs) spanning 1124 cM. Interestingly, higher rates of recombination were observed in maternal than paternal gametes. The large number of markers in common (mean of 57.3) and the high degree of observed collinearity (mean Pair-wise Spearman rank correlations >0.99) between the LGs of the parental maps demonstrates the utility of GBS in cranberry for identifying polymorphic SNP loci that are transferable between pedigrees and populations in future trait-association studies. Furthermore, the high-density of markers anchored within the component maps allowed identification of segregation distortion regions, placement of centromeres on each of the 12 LGs, and anchoring of genomic scaffolds. Collectively, the results represent an important contribution to the current understanding of cranberry genomic structure and to the availability of molecular tools for future genetic research and breeding efforts in cranberry. Copyright © 2017 Schlautman et al.
miRNA-Mediated Relationships between Cis-SNP Genotypes and Transcript Intensities in Lymphocyte Cell Lines

PubMed Central

Zhang, Wensheng; Edwards, Andrea; Zhu, Dongxiao; Flemington, Erik K.; Deininger, Prescott; Zhang, Kun

2012-01-01

In metazoans, miRNAs regulate gene expression primarily through binding to target sites in the 3′ UTRs (untranslated regions) of messenger RNAs (mRNAs). Cis-acting variants within, or close to, a gene are crucial in explaining the variability of gene expression measures. Single nucleotide polymorphisms (SNPs) in the 3′ UTRs of genes can affect the base-pairing between miRNAs and mRNAs, and hence disrupt existing target sites (in the reference sequence) or create novel target sites, suggesting a possible mechanism for cis regulation of gene expression. Moreover, because the alleles of different SNPs within a DNA sequence of limited length tend to be in strong linkage disequilibrium (LD), we hypothesize the variants of miRNA target sites caused by SNPs potentially function as bridges linking the documented cis-SNP markers to the expression of the associated genes. A large-scale analysis was herein performed to test this hypothesis. By systematically integrating multiple latest information sources, we found 21 significant gene-level SNP-involved miRNA-mediated post-transcriptional regulation modules (SNP-MPRMs) in the form of SNP-miRNA-mRNA triplets in lymphocyte cell lines for the CEU and YRI populations. Among the cognate genes, six including ALG8, DGKE, GNA12, KLF11, LRPAP1, and MMAB are related to multiple genetic diseases such as depressive disorder and Type-II diabetes. Furthermore, we found that ∼35% of the documented transcript intensity-related cis-SNPs (∼950) in a recent publication are identical to, or in significant linkage disequilibrium (LD) (p<0.01) with, one or multiple SNPs located in miRNA target sites. Based on these associations (or identities), 69 significant exon-level SNP-MPRMs and 12 disease genes were further determined for two populations. These results provide concrete in silico evidence for the proposed hypothesis. The discovered modules warrant additional follow-up in independent laboratory studies. PMID:22348086
Population-genetic nature of copy number variations in the human genome.

PubMed

Kato, Mamoru; Kawaguchi, Takahisa; Ishikawa, Shumpei; Umeda, Takayoshi; Nakamichi, Reiichiro; Shapero, Michael H; Jones, Keith W; Nakamura, Yusuke; Aburatani, Hiroyuki; Tsunoda, Tatsuhiko

2010-03-01

Copy number variations (CNVs) are universal genetic variations, and their association with disease has been increasingly recognized. We designed high-density microarrays for CNVs, and detected 3000-4000 CNVs (4-6% of the genomic sequence) per population that included CNVs previously missed because of smaller sizes and residing in segmental duplications. The patterns of CNVs across individuals were surprisingly simple at the kilo-base scale, suggesting the applicability of a simple genetic analysis for these genetic loci. We utilized the probabilistic theory to determine integer copy numbers of CNVs and employed a recently developed phasing tool to estimate the population frequencies of integer copy number alleles and CNV-SNP haplotypes. The results showed a tendency toward a lower frequency of CNV alleles and that most of our CNVs were explained only by zero-, one- and two-copy alleles. Using the estimated population frequencies, we found several CNV regions with exceptionally high population differentiation. Investigation of CNV-SNP linkage disequilibrium (LD) for 500-900 bi- and multi-allelic CNVs per population revealed that previous conflicting reports on bi-allelic LD were unexpectedly consistent and explained by an LD increase correlated with deletion-allele frequencies. Typically, the bi-allelic LD was lower than SNP-SNP LD, whereas the multi-allelic LD was somewhat stronger than the bi-allelic LD. After further investigation of tag SNPs for CNVs, we conclude that the customary tagging strategy for disease association studies can be applicable for common deletion CNVs, but direct interrogation is needed for other types of CNVs.
Technical note: Equivalent genomic models with a residual polygenic effect.

PubMed

Liu, Z; Goddard, M E; Hayes, B J; Reinhardt, F; Reents, R

2016-03-01

Routine genomic evaluations in animal breeding are usually based on either a BLUP with genomic relationship matrix (GBLUP) or single nucleotide polymorphism (SNP) BLUP model. For a multi-step genomic evaluation, these 2 alternative genomic models were proven to give equivalent predictions for genomic reference animals. The model equivalence was verified also for young genotyped animals without phenotypes. Due to incomplete linkage disequilibrium of SNP markers to genes or causal mutations responsible for genetic inheritance of quantitative traits, SNP markers cannot explain all the genetic variance. A residual polygenic effect is normally fitted in the genomic model to account for the incomplete linkage disequilibrium. In this study, we start by showing the proof that the multi-step GBLUP and SNP BLUP models are equivalent for the reference animals, when they have a residual polygenic effect included. Second, the equivalence of both multi-step genomic models with a residual polygenic effect was also verified for young genotyped animals without phenotypes. Additionally, we derived formulas to convert genomic estimated breeding values of the GBLUP model to its components, direct genomic values and residual polygenic effect. Third, we made a proof that the equivalence of these 2 genomic models with a residual polygenic effect holds also for single-step genomic evaluation. Both the single-step GBLUP and SNP BLUP models lead to equal prediction for genotyped animals with phenotypes (e.g., reference animals), as well as for (young) genotyped animals without phenotypes. Finally, these 2 single-step genomic models with a residual polygenic effect were proven to be equivalent for estimation of SNP effects, too. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Unlocking Diversity in Germplasm Collections via Genomic Selection: A Case Study Based on Quantitative Adult Plant Resistance to Stripe Rust in Spring Wheat.

PubMed

Muleta, Kebede T; Bulli, Peter; Zhang, Zhiwu; Chen, Xianming; Pumphrey, Michael

2017-11-01

Harnessing diversity from germplasm collections is more feasible today because of the development of lower-cost and higher-throughput genotyping methods. However, the cost of phenotyping is still generally high, so efficient methods of sampling and exploiting useful diversity are needed. Genomic selection (GS) has the potential to enhance the use of desirable genetic variation in germplasm collections through predicting the genomic estimated breeding values (GEBVs) for all traits that have been measured. Here, we evaluated the effects of various scenarios of population genetic properties and marker density on the accuracy of GEBVs in the context of applying GS for wheat ( L.) germplasm use. Empirical data for adult plant resistance to stripe rust ( f. sp. ) collected on 1163 spring wheat accessions and genotypic data based on the wheat 9K single nucleotide polymorphism (SNP) iSelect assay were used for various genomic prediction tests. Unsurprisingly, the results of the cross-validation tests demonstrated that prediction accuracy increased with an increase in training population size and marker density. It was evident that using all the available markers (5619) was unnecessary for capturing the trait variation in the germplasm collection, with no further gain in prediction accuracy beyond 1 SNP per 3.2 cM (∼1850 markers), which is close to the linkage disequilibrium decay rate in this population. Collectively, our results suggest that larger germplasm collections may be efficiently sampled via lower-density genotyping methods, whereas genetic relationships between the training and validation populations remain critical when exploiting GS to select from germplasm collections. Copyright © 2017 Crop Science Society of America.
Lack of Association of Bone Morphogenetic Protein 2 Gene Haplotypes with Bone Mineral Density, Bone Loss, or Risk of Fractures in Men

PubMed Central

Varanasi, Satya S.; Tuck, Stephen P.; Mastana, Sarabjit S.; Dennison, Elaine; Cooper, Cyrus; Vila, Josephine; Francis, Roger M.; Datta, Harish K.

2011-01-01

Introduction. The association of bone morphogenetic protein 2 (BMP2) with BMD and risk of fracture was suggested by a recent linkage study, but subsequent studies have been contradictory. We report the results of a study of the relationship between BMP2 genotypes and BMD, annual change in BMD, and risk of fracture in male subjects. Materials and Methods. We tested three single-nucleotide polymorphisms (SNPs) across the BMP2 gene, including Ser37Ala SNP, in 342 Caucasian Englishmen, comprising 224 control and 118 osteoporotic subjects. Results. BMP2 SNP1 (Ser37Ala) genotypes were found to have similar low frequency in control subjects and men with osteoporosis. The major informative polymorphism, BMP2 SNP3 (Arg190Ser), showed no statistically significant association with weight, height, BMD, change in BMD at hip or lumbar spine, and risk of fracture. Conclusion. There were no genotypic or haplotypic effects of the BMP2 candidate gene on BMD, change in BMD, or fracture risk identified in this cohort. PMID:22013543
A high-density SNP linkage scan with 142 combined subtype ADHD sib pairs identifies linkage regions on chromosomes 9 and 16.

PubMed

Asherson, P; Zhou, K; Anney, R J L; Franke, B; Buitelaar, J; Ebstein, R; Gill, M; Altink, M; Arnold, R; Boer, F; Brookes, K; Buschgens, C; Butler, L; Cambell, D; Chen, W; Christiansen, H; Feldman, L; Fleischman, K; Fliers, E; Howe-Forbes, R; Goldfarb, A; Heise, A; Gabriëls, I; Johansson, L; Lubetzki, I; Marco, R; Medad, S; Minderaa, R; Mulas, F; Müller, U; Mulligan, A; Neale, B; Rijsdijk, F; Rabin, K; Rommelse, N; Sethna, V; Sorohan, J; Uebel, H; Psychogiou, L; Weeks, A; Barrett, R; Xu, X; Banaschewski, T; Sonuga-Barke, E; Eisenberg, J; Manor, I; Miranda, A; Oades, R D; Roeyers, H; Rothenberger, A; Sergeant, J; Steinhausen, H-C; Taylor, E; Thompson, M; Faraone, S V

2008-05-01

As part of the International Multi-centre ADHD Genetics project we completed an affected sibling pair study of 142 narrowly defined Diagnostic and Statistical Manual of Mental Disorders, fourth edition combined type attention deficit hyperactivity disorder (ADHD) proband-sibling pairs. No linkage was observed on the most established ADHD-linked genomic regions of 5p and 17p. We found suggestive linkage signals on chromosomes 9 and 16, respectively, with the highest multipoint nonparametric linkage signal on chromosome 16q23 at 99 cM (log of the odds, LOD=3.1) overlapping data published from the previous UCLA (University of California, Los Angeles) (LOD>1, approximately 95 cM) and Dutch (LOD>1, approximately 100 cM) studies. The second highest peak in this study was on chromosome 9q22 at 90 cM (LOD=2.13); both the previous UCLA and German studies also found some evidence of linkage at almost the same location (UCLA LOD=1.45 at 93 cM; German LOD=0.68 at 100 cM). The overlap of these two main peaks with previous findings suggests that loci linked to ADHD may lie within these regions. Meta-analysis or reanalysis of the raw data of all the available ADHD linkage scan data may help to clarify whether these represent true linked loci.
Saturation of an Intra-Gene Pool Linkage Map: Towards a Unified Consensus Linkage Map for Fine Mapping and Synteny Analysis in Common Bean

PubMed Central

Galeano, Carlos H.; Fernandez, Andrea C.; Franco-Herrera, Natalia; Cichy, Karen A.; McClean, Phillip E.; Vanderleyden, Jos; Blair, Matthew W.

2011-01-01

Map-based cloning and fine mapping to find genes of interest and marker assisted selection (MAS) requires good genetic maps with reproducible markers. In this study, we saturated the linkage map of the intra-gene pool population of common bean DOR364×BAT477 (DB) by evaluating 2,706 molecular markers including SSR, SNP, and gene-based markers. On average the polymorphism rate was 7.7% due to the narrow genetic base between the parents. The DB linkage map consisted of 291 markers with a total map length of 1,788 cM. A consensus map was built using the core mapping populations derived from inter-gene pool crosses: DOR364×G19833 (DG) and BAT93×JALO EEP558 (BJ). The consensus map consisted of a total of 1,010 markers mapped, with a total map length of 2,041 cM across 11 linkage groups. On average, each linkage group on the consensus map contained 91 markers of which 83% were single copy markers. Finally, a synteny analysis was carried out using our highly saturated consensus maps compared with the soybean pseudo-chromosome assembly. A total of 772 marker sequences were compared with the soybean genome. A total of 44 syntenic blocks were identified. The linkage group Pv6 presented the most diverse pattern of synteny with seven syntenic blocks, and Pv9 showed the most consistent relations with soybean with just two syntenic blocks. Additionally, a co-linear analysis using common bean transcript map information against soybean coding sequences (CDS) revealed the relationship with 787 soybean genes. The common bean consensus map has allowed us to map a larger number of markers, to obtain a more complete coverage of the common bean genome. Our results, combined with synteny relationships provide tools to increase marker density in selected genomic regions to identify closely linked polymorphic markers for indirect selection, fine mapping or for positional cloning. PMID:22174773
Dosage Transmission Disequilibrium Test (dTDT) for Linkage and Association Detection

PubMed Central

Zhang, Zhehao; Wang, Jen-Chyong; Howells, William; Lin, Peng; Agrawal, Arpana; Edenberg, Howard J.; Tischfield, Jay A.; Schuckit, Marc A.; Bierut, Laura J.; Goate, Alison; Rice, John P.

2013-01-01

Both linkage and association studies have been successfully applied to identify disease susceptibility genes with genetic markers such as microsatellites and Single Nucleotide Polymorphisms (SNPs). As one of the traditional family-based studies, the Transmission/Disequilibrium Test (TDT) measures the over-transmission of an allele in a trio from its heterozygous parents to the affected offspring and can be potentially useful to identify genetic determinants for complex disorders. However, there is reduced information when complete trio information is unavailable. In this study, we developed a novel approach to “infer” the transmission of SNPs by combining both the linkage and association data, which uses microsatellite markers from families informative for linkage together with SNP markers from the offspring who are genotyped for both linkage and a Genome-Wide Association Study (GWAS). We generalized the traditional TDT to process these inferred dosage probabilities, which we name as the dosage-TDT (dTDT). For evaluation purpose, we developed a simulation procedure to assess its operating characteristics. We applied the dTDT to the simulated data and documented the power of the dTDT under a number of different realistic scenarios. Finally, we applied our methods to a family study of alcohol dependence (COGA) and performed individual genotyping on complete families for the top signals. One SNP (rs4903712 on chromosome 14) remained significant after correcting for multiple testing Methods developed in this study can be adapted to other platforms and will have widespread applicability in genomic research when case-control GWAS data are collected in families with existing linkage data. PMID:23691058
Uneven recombination rate and linkage disequilibrium across a reference SNP map for common bean (Phaseolus vulgaris L.)

PubMed Central

Farmer, Andrew D.; Huang, Wei; Ambachew, Daniel; Penmetsa, R. Varma; Carrasquilla-Garcia, Noelia; Assefa, Teshale; Cannon, Steven B.

2018-01-01

Recombination (R) rate and linkage disequilibrium (LD) analyses are the basis for plant breeding. These vary by breeding system, by generation of inbreeding or outcrossing and by region in the chromosome. Common bean (Phaseolus vulgaris L.) is a favored food legume with a small sequenced genome (514 Mb) and n = 11 chromosomes. The goal of this study was to describe R and LD in the common bean genome using a 768-marker array of single nucleotide polymorphisms (SNP) based on Trans-legume Orthologous Group (TOG) genes along with an advanced-generation Recombinant Inbred Line reference mapping population (BAT93 x Jalo EEP558) and an internationally available diversity panel. A whole genome genetic map was created that covered all eleven linkage groups (LG). The LGs were linked to the physical map by sequence data of the TOGs compared to each chromosome sequence of common bean. The genetic map length in total was smaller than for previous maps reflecting the precision of allele calling and mapping with SNP technology as well as the use of gene-based markers. A total of 91.4% of TOG markers had singleton hits with annotated Pv genes and all mapped outside of regions of resistance gene clusters. LD levels were found to be stronger within the Mesoamerican genepool and decay more rapidly within the Andean genepool. The recombination rate across the genome was 2.13 cM / Mb but R was found to be highly repressed around centromeres and frequent outside peri-centromeric regions. These results have important implications for association and genetic mapping or crop improvement in common bean. PMID:29522524
A Picea abies Linkage Map Based on SNP Markers Identifies QTLs for Four Aspects of Resistance to Heterobasidion parviporum Infection

PubMed Central

Lind, Mårten; Källman, Thomas; Chen, Jun; Ma, Xiao-Fei; Bousquet, Jean; Morgante, Michele; Zaina, Giusi; Karlsson, Bo; Elfstrand, Malin; Lascoux, Martin; Stenlid, Jan

2014-01-01

A consensus linkage map of Picea abies, an economically important conifer, was constructed based on the segregation of 686 SNP markers in a F1 progeny population consisting of 247 individuals. The total length of 1889.2 cM covered 96.5% of the estimated genome length and comprised 12 large linkage groups, corresponding to the number of haploid P. abies chromosomes. The sizes of the groups (from 5.9 to 9.9% of the total map length) correlated well with previous estimates of chromosome sizes (from 5.8 to 10.8% of total genome size). Any locus in the genome has a 97% probability to be within 10 cM from a mapped marker, which makes the map suited for QTL mapping. Infecting the progeny trees with the root rot pathogen Heterobasidion parviporum allowed for mapping of four different resistance traits: lesion length at the inoculation site, fungal spread within the sapwood, exclusion of the pathogen from the host after initial infection, and ability to prevent the infection from establishing at all. These four traits were associated with two, four, four and three QTL regions respectively of which none overlapped between the traits. Each QTL explained between 4.6 and 10.1% of the respective traits phenotypic variation. Although the QTL regions contain many more genes than the ones represented by the SNP markers, at least four markers within the confidence intervals originated from genes with known function in conifer defence; a leucoanthocyanidine reductase, which has previously been shown to upregulate during H. parviporum infection, and three intermediates of the lignification process; a hydroxycinnamoyl CoA shikimate/quinate hydroxycinnamoyltransferase, a 4-coumarate CoA ligase, and a R2R3-MYB transcription factor. PMID:25036209
SNP marker discovery, linkage map construction and identification of QTLs for enhanced salinity tolerance in field pea (Pisum sativum L.)

PubMed Central

2013-01-01

Background Field pea (Pisum sativum L.) is a self-pollinating, diploid, cool-season food legume. Crop production is constrained by multiple biotic and abiotic stress factors, including salinity, that cause reduced growth and yield. Recent advances in genomics have permitted the development of low-cost high-throughput genotyping systems, allowing the construction of saturated genetic linkage maps for identification of quantitative trait loci (QTLs) associated with traits of interest. Genetic markers in close linkage with the relevant genomic regions may then be implemented in varietal improvement programs. Results In this study, single nucleotide polymorphism (SNP) markers associated with expressed sequence tags (ESTs) were developed and used to generate comprehensive linkage maps for field pea. From a set of 36,188 variant nucleotide positions detected through in silico analysis, 768 were selected for genotyping of a recombinant inbred line (RIL) population. A total of 705 SNPs (91.7%) successfully detected segregating polymorphisms. In addition to SNPs, genomic and EST-derived simple sequence repeats (SSRs) were assigned to the genetic map in order to obtain an evenly distributed genome-wide coverage. Sequences associated with the mapped molecular markers were used for comparative genomic analysis with other legume species. Higher levels of conserved synteny were observed with the genomes of Medicago truncatula Gaertn. and chickpea (Cicer arietinum L.) than with soybean (Glycine max [L.] Merr.), Lotus japonicus L. and pigeon pea (Cajanus cajan [L.] Millsp.). Parents and RIL progeny were screened at the seedling growth stage for responses to salinity stress, imposed by addition of NaCl in the watering solution at a concentration of 18 dS m-1. Salinity-induced symptoms showed normal distribution, and the severity of the symptoms increased over time. QTLs for salinity tolerance were identified on linkage groups Ps III and VII, with flanking SNP markers suitable for selection of resistant cultivars. Comparison of sequences underpinning these SNP markers to the M. truncatula genome defined genomic regions containing candidate genes associated with saline stress tolerance. Conclusion The SNP assays and associated genetic linkage maps developed in this study permitted identification of salinity tolerance QTLs and candidate genes. This constitutes an important set of tools for marker-assisted selection (MAS) programs aimed at performance enhancement of field pea cultivars. PMID:24134188
SNP marker discovery, linkage map construction and identification of QTLs for enhanced salinity tolerance in field pea (Pisum sativum L.).

PubMed

Leonforte, Antonio; Sudheesh, Shimna; Cogan, Noel O I; Salisbury, Philip A; Nicolas, Marc E; Materne, Michael; Forster, John W; Kaur, Sukhjiwan

2013-10-17

Field pea (Pisum sativum L.) is a self-pollinating, diploid, cool-season food legume. Crop production is constrained by multiple biotic and abiotic stress factors, including salinity, that cause reduced growth and yield. Recent advances in genomics have permitted the development of low-cost high-throughput genotyping systems, allowing the construction of saturated genetic linkage maps for identification of quantitative trait loci (QTLs) associated with traits of interest. Genetic markers in close linkage with the relevant genomic regions may then be implemented in varietal improvement programs. In this study, single nucleotide polymorphism (SNP) markers associated with expressed sequence tags (ESTs) were developed and used to generate comprehensive linkage maps for field pea. From a set of 36,188 variant nucleotide positions detected through in silico analysis, 768 were selected for genotyping of a recombinant inbred line (RIL) population. A total of 705 SNPs (91.7%) successfully detected segregating polymorphisms. In addition to SNPs, genomic and EST-derived simple sequence repeats (SSRs) were assigned to the genetic map in order to obtain an evenly distributed genome-wide coverage. Sequences associated with the mapped molecular markers were used for comparative genomic analysis with other legume species. Higher levels of conserved synteny were observed with the genomes of Medicago truncatula Gaertn. and chickpea (Cicer arietinum L.) than with soybean (Glycine max [L.] Merr.), Lotus japonicus L. and pigeon pea (Cajanus cajan [L.] Millsp.). Parents and RIL progeny were screened at the seedling growth stage for responses to salinity stress, imposed by addition of NaCl in the watering solution at a concentration of 18 dS m-1. Salinity-induced symptoms showed normal distribution, and the severity of the symptoms increased over time. QTLs for salinity tolerance were identified on linkage groups Ps III and VII, with flanking SNP markers suitable for selection of resistant cultivars. Comparison of sequences underpinning these SNP markers to the M. truncatula genome defined genomic regions containing candidate genes associated with saline stress tolerance. The SNP assays and associated genetic linkage maps developed in this study permitted identification of salinity tolerance QTLs and candidate genes. This constitutes an important set of tools for marker-assisted selection (MAS) programs aimed at performance enhancement of field pea cultivars.
Linkage disequilibrium among commonly genotyped SNP and variants detected from bull sequence

USDA-ARS?s Scientific Manuscript database

Genomic prediction utilizing causal variants could increase selection accuracy above that achieved with SNP genotyped by commercial assays. A number of variants detected from sequencing influential sires are likely to be causal, but noticable improvements in prediction accuracy using imputed sequen...
Development and dissection of diagnostic SNP markers for the downy mildew resistance genes Pl Arg and Pl 8 and maker-assisted gene pyramiding in sunflower (Helianthus annuus L.).

PubMed

Qi, L L; Talukder, Z I; Hulke, B S; Foley, M E

2017-06-01

Diagnostic DNA markers are an invaluable resource in breeding programs for successful introgression and pyramiding of disease resistance genes. Resistance to downy mildew (DM) disease in sunflower is mediated by Pl genes which are known to be effective against the causal fungus, Plasmopara halstedii. Two DM resistance genes, Pl Arg and Pl 8 , are highly effective against P. halstedii races in the USA, and have been previously mapped to the sunflower linkage groups (LGs) 1 and 13, respectively, using simple sequence repeat (SSR) markers. In this study, we developed high-density single nucleotide polymorphism (SNP) maps encompassing the Pl arg and Pl 8 genes and identified diagnostic SNP markers closely linked to these genes. The specificity of the diagnostic markers was validated in a highly diverse panel of 548 sunflower lines. Dissection of a large marker cluster co-segregated with Pl Arg revealed that the closest SNP markers NSA_007595 and NSA_001835 delimited Pl Arg to an interval of 2.83 Mb on the LG1 physical map. The SNP markers SFW01497 and SFW06597 delimited Pl 8 to an interval of 2.85 Mb on the LG13 physical map. We also developed sunflower lines with homozygous, three gene pyramids carrying Pl Arg , Pl 8 , and the sunflower rust resistance gene R 12 using the linked SNP markers from a segregating F 2 population of RHA 340 (carrying Pl 8 )/RHA 464 (carrying Pl Arg and R 12 ). The high-throughput diagnostic SNP markers developed in this study will facilitate marker-assisted selection breeding, and the pyramided sunflower lines will provide durable resistance to downy mildew and rust diseases.
SNP-markers in Allium species to facilitate introgression breeding in onion.

PubMed

Scholten, Olga E; van Kaauwen, Martijn P W; Shahin, Arwa; Hendrickx, Patrick M; Keizer, L C Paul; Burger, Karin; van Heusden, Adriaan W; van der Linden, C Gerard; Vosman, Ben

2016-08-31

Within onion, Allium cepa L., the availability of disease resistance is limited. The identification of sources of resistance in related species, such as Allium roylei and Allium fistulosum, was a first step towards the improvement of onion cultivars by breeding. SNP markers linked to resistance and polymorphic between these related species and onion cultivars are a valuable tool to efficiently introgress disease resistance genes. In this paper we describe the identification and validation of SNP markers valuable for onion breeding. Transcriptome sequencing resulted in 192 million RNA seq reads from the interspecific F1 hybrid between A. roylei and A. fistulosum (RF) and nine onion cultivars. After assembly, reliable SNPs were discovered in about 36 % of the contigs. For genotyping of the interspecific three-way cross population, derived from a cross between an onion cultivar and the RF (CCxRF), 1100 SNPs that are polymorphic in RF and monomorphic in the onion cultivars (RF SNPs) were selected for the development of KASP assays. A molecular linkage map based on 667 RF-SNP markers was constructed for CCxRF. In addition, KASP assays were developed for 1600 onion-SNPs (SNPs polymorphic among onion cultivars). A second linkage map was constructed for an F2 of onion x A. roylei (F2(CxR)) that consisted of 182 onion-SNPs and 119 RF-SNPs, and 76 previously mapped markers. Markers co-segregating in both the F2(CxR) and the CCxRF population were used to assign the linkage groups of RF to onion chromosomes. To validate usefulness of these SNP markers, QTL mapping was applied in the CCxRF population that segregates for resistance to Botrytis squamosa and resulted in a QTL for resistance on chromosome 6 of A. roylei. Our research has more than doubled the publicly available marker sequences of expressed onion genes and two onion-related species. It resulted in a detailed genetic map for the interspecific CCxRF population. This is the first paper that reports the detection of a QTL for resistance to B. squamosa in A. roylei.
A high-density genetic map and QTL analysis of agronomic traits in foxtail millet [Setaria italica (L.) P. Beauv.] using RAD-seq.

PubMed

Wang, Jun; Wang, Zhilan; Du, Xiaofen; Yang, Huiqing; Han, Fang; Han, Yuanhuai; Yuan, Feng; Zhang, Linyi; Peng, Shuzhong; Guo, Erhu

2017-01-01

Foxtail millet (Setaria italica), a very important grain crop in China, has become a new model plant for cereal crops and biofuel grasses. Although its reference genome sequence was released recently, quantitative trait loci (QTLs) controlling complex agronomic traits remains limited. The development of massively parallel genotyping methods and next-generation sequencing technologies provides an excellent opportunity for developing single-nucleotide polymorphisms (SNPs) for linkage map construction and QTL analysis of complex quantitative traits. In this study, a high-throughput and cost-effective RAD-seq approach was employed to generate a high-density genetic map for foxtail millet. A total of 2,668,587 SNP loci were detected according to the reference genome sequence; meanwhile, 9,968 SNP markers were used to genotype 124 F2 progenies derived from the cross between Hongmiaozhangu and Changnong35; a high-density genetic map spanning 1648.8 cM, with an average distance of 0.17 cM between adjacent markers was constructed; 11 major QTLs for eight agronomic traits were identified; five co-dominant DNA markers were developed. These findings will be of value for the identification of candidate genes and marker-assisted selection in foxtail millet.
A high-density genetic map and QTL analysis of agronomic traits in foxtail millet [Setaria italica (L.) P. Beauv.] using RAD-seq

PubMed Central

Wang, Zhilan; Du, Xiaofen; Yang, Huiqing; Han, Fang; Han, Yuanhuai; Yuan, Feng; Zhang, Linyi; Peng, Shuzhong; Guo, Erhu

2017-01-01

Foxtail millet (Setaria italica), a very important grain crop in China, has become a new model plant for cereal crops and biofuel grasses. Although its reference genome sequence was released recently, quantitative trait loci (QTLs) controlling complex agronomic traits remains limited. The development of massively parallel genotyping methods and next-generation sequencing technologies provides an excellent opportunity for developing single-nucleotide polymorphisms (SNPs) for linkage map construction and QTL analysis of complex quantitative traits. In this study, a high-throughput and cost-effective RAD-seq approach was employed to generate a high-density genetic map for foxtail millet. A total of 2,668,587 SNP loci were detected according to the reference genome sequence; meanwhile, 9,968 SNP markers were used to genotype 124 F2 progenies derived from the cross between Hongmiaozhangu and Changnong35; a high-density genetic map spanning 1648.8 cM, with an average distance of 0.17 cM between adjacent markers was constructed; 11 major QTLs for eight agronomic traits were identified; five co-dominant DNA markers were developed. These findings will be of value for the identification of candidate genes and marker-assisted selection in foxtail millet. PMID:28644843

Linkage disequilibrium, persistence of phase, and effective population size in Spanish local beef cattle breeds assessed through a high-density single nucleotide polymorphism chip.

PubMed

Cañas-Álvarez, J J; Mouresan, E F; Varona, L; Díaz, C; Molina, A; Baro, J A; Altarriba, J; Carabaño, M J; Casellas, J; Piedrafita, J

2016-07-01

Linkage disequilibrium (LD) and persistence of phase are fundamental approaches for exploring the genetic basis of economically important traits in cattle, including the identification of QTL for genomic selection and the estimation of effective population size () to determine the size of the training populations. In this study, we have used the Illumina BovineHD chip in 168 trios of 7 Spanish beef cattle breeds to obtain an overview of the magnitude of LD and the persistence of LD phase through the physical distance between markers. Also, we estimated the time of divergence based on the persistence of the LD phase and calculated past from LD estimates using different alternatives to define the recombination rate. Estimates of average (as a measure of LD) for adjacent markers were close to 0.52 in the 7 breeds and decreased with the distance between markers, although in long distances, some LD still remained (0.07 and 0.05 for markers 200 kb and 1 Mb apart, respectively). A panel with a lower boundary of 38,000 SNP would be necessary to launch a successful within-breed genomic selection program. Persistence of phase, measured as the pairwise correlations between estimates of in 2 breeds at short distances (10 kb), was in the 0.89 to 0.94 range and decreased from 0.33 to 0.52 to a range of 0.01 to 0.08 when marker distance increased from 200 kb to 1 Mb, respectively. The magnitude of the persistence of phase between the Spanish beef breeds was similar to those found in dairy breeds. For across-breed genomic selection, the size of the SNP panels must be in the range of 50,000 to 83,000 SNP. Estimates of past showed values ranging from 26 to 31 for 1 generation ago in all breeds. The divergence among breeds occurred between 129 and 207 generations ago. The results of this study are relevant for the future implementation of within- and across-breed genomic selection programs in the Spanish beef cattle populations. Our results suggest that a reduced subset of the SNP panel would be enough to achieve an adequate precision of the genomic predictions.
A consensus genetic map of cowpea [Vigna unguiculata (L) Walp.] and synteny based on EST-derived SNPs.

PubMed

Muchero, Wellington; Diop, Ndeye N; Bhat, Prasanna R; Fenton, Raymond D; Wanamaker, Steve; Pottorff, Marti; Hearne, Sarah; Cisse, Ndiaga; Fatokun, Christian; Ehlers, Jeffrey D; Roberts, Philip A; Close, Timothy J

2009-10-27

Consensus genetic linkage maps provide a genomic framework for quantitative trait loci identification, map-based cloning, assessment of genetic diversity, association mapping, and applied breeding in marker-assisted selection schemes. Among "orphan crops" with limited genomic resources such as cowpea [Vigna unguiculata (L.) Walp.] (2n = 2x = 22), the use of transcript-derived SNPs in genetic maps provides opportunities for automated genotyping and estimation of genome structure based on synteny analysis. Here, we report the development and validation of a high-throughput EST-derived SNP assay for cowpea, its application in consensus map building, and determination of synteny to reference genomes. SNP mining from 183,118 ESTs sequenced from 17 cDNA libraries yielded approximately 10,000 high-confidence SNPs from which an Illumina 1,536-SNP GoldenGate genotyping array was developed and applied to 741 recombinant inbred lines from six mapping populations. Approximately 90% of the SNPs were technically successful, providing 1,375 dependable markers. Of these, 928 were incorporated into a consensus genetic map spanning 680 cM with 11 linkage groups and an average marker distance of 0.73 cM. Comparison of this cowpea genetic map to reference legumes, soybean (Glycine max) and Medicago truncatula, revealed extensive macrosynteny encompassing 85 and 82%, respectively, of the cowpea map. Regions of soybean genome duplication were evident relative to the simpler diploid cowpea. Comparison with Arabidopsis revealed extensive genomic rearrangement with some conserved microsynteny. These results support evolutionary closeness between cowpea and soybean and identify regions for synteny-based functional genomics studies in legumes.
A Brassica rapa Linkage Map of EST-based SNP Markers for Identification of Candidate Genes Controlling Flowering Time and Leaf Morphological Traits

PubMed Central

Li, Feng; Kitashiba, Hiroyasu; Inaba, Kiyofumi; Nishio, Takeshi

2009-01-01

For identification of genes responsible for varietal differences in flowering time and leaf morphological traits, we constructed a linkage map of Brassica rapa DNA markers including 170 EST-based markers, 12 SSR markers, and 59 BAC sequence-based markers, of which 151 are single nucleotide polymorphism (SNP) markers. By BLASTN, 223 markers were shown to have homologous regions in Arabidopsis thaliana, and these homologous loci covered nearly the whole genome of A. thaliana. Synteny analysis between B. rapa and A. thaliana revealed 33 large syntenic regions. Three quantitative trait loci (QTLs) for flowering time were detected. BrFLC1 and BrFLC2 were linked to the QTLs for bolting time, budding time, and flowering time. Three SNPs in the promoter, which may be the cause of low expression of BrFLC2 in the early-flowering parental line, were identified. For leaf lobe depth and leaf hairiness, one major QTL corresponding to a syntenic region containing GIBBERELLIN 20 OXIDASE 3 and one major QTL containing BrGL1, respectively, were detected. Analysis of nucleotide sequences and expression of these genes suggested possible involvement of these genes in leaf morphological traits. PMID:19884167
Exploiting genotyping by sequencing to characterize the genomic structure of the American cranberry through high-density linkage mapping.

PubMed

Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Schlautman, Brandon; Deutsch, Joseph; Salazar, Walter; Hernandez-Ochoa, Miguel; Grygleski, Edward; Steffan, Shawn; Iorizzo, Massimo; Polashock, James; Vorsa, Nicholi; Zalapa, Juan

2016-06-13

The application of genotyping by sequencing (GBS) approaches, combined with data imputation methodologies, is narrowing the genetic knowledge gap between major and understudied, minor crops. GBS is an excellent tool to characterize the genomic structure of recently domesticated (~200 years) and understudied species, such as cranberry (Vaccinium macrocarpon Ait.), by generating large numbers of markers for genomic studies such as genetic mapping. We identified 10842 potentially mappable single nucleotide polymorphisms (SNPs) in a cranberry pseudo-testcross population wherein 5477 SNPs and 211 short sequence repeats (SSRs) were used to construct a high density linkage map in cranberry of which a total of 4849 markers were mapped. Recombination frequency, linkage disequilibrium (LD), and segregation distortion at the genomic level in the parental and integrated linkage maps were characterized for first time in cranberry. SSR markers, used as the backbone in the map, revealed high collinearity with previously published linkage maps. The 4849 point map consisted of twelve linkage groups spanning 1112 cM, which anchored 2381 nuclear scaffolds accounting for ~13 Mb of the estimated 470 Mb cranberry genome. Bin mapping identified 592 and 672 unique bins in the parentals and a total of 1676 unique marker positions in the integrated map. Synteny analyses comparing the order of anchored cranberry scaffolds to their homologous positions in kiwifruit, grape, and coffee genomes provided initial evidence of homology between cranberry and closely related species. GBS data was used to rapidly saturate the cranberry genome with markers in a pseudo-testcross population. Collinearity between the present saturated genetic map and previous cranberry SSR maps suggests that the SNP locations represent accurate marker order and chromosome structure of the cranberry genome. SNPs greatly improved current marker genome coverage, which allowed for genome-wide structure investigations such as segregation distortion, recombination, linkage disequilibrium, and synteny analyses. In the future, GBS can be used to accelerate cranberry molecular breeding through QTL mapping and genome-wide association studies (GWAS).
Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map

PubMed Central

Haile, Jemanesh K.; Cory, Aron T.; Clarke, Fran R.; Clarke, John M.; Knox, Ron E.; Pozniak, Curtis J.

2017-01-01

Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat. PMID:28135299
Distinct contributions of replication and transcription to mutation rate variation of human genomes.

PubMed

Cui, Peng; Ding, Feng; Lin, Qiang; Zhang, Lingfang; Li, Ang; Zhang, Zhang; Hu, Songnian; Yu, Jun

2012-02-01

Here, we evaluate the contribution of two major biological processes--DNA replication and transcription--to mutation rate variation in human genomes. Based on analysis of the public human tissue transcriptomics data, high-resolution replicating map of Hela cells and dbSNP data, we present significant correlations between expression breadth, replication time in local regions and SNP density. SNP density of tissue-specific (TS) genes is significantly higher than that of housekeeping (HK) genes. TS genes tend to locate in late-replicating genomic regions and genes in such regions have a higher SNP density compared to those in early-replication regions. In addition, SNP density is found to be positively correlated with expression level among HK genes. We conclude that the process of DNA replication generates stronger mutational pressure than transcription-associated biological processes do, resulting in an increase of mutation rate in TS genes while having weaker effects on HK genes. In contrast, transcription-associated processes are mainly responsible for the accumulation of mutations in highly-expressed HK genes. Copyright © 2012 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.
Genotyping by sequencing for SNP-based linkage analysis and identification of QTLs linked to fruit quality traits in Japanese plum (Prunus salicina Lindl.)

USDA-ARS?s Scientific Manuscript database

Marker-assisted selection (MAS) in stone fruit (Prunus species) breeding is currently difficult to achieve due to the polygenic nature of themost relevant agronomic traits linked to fruit quality. Genotyping by sequencing (GBS), however, provides a large quantity of useful data suitable for finemapp...
Evolution of the Oat Genetic Road Map: From Tetraploid to Hexaploid

USDA-ARS?s Scientific Manuscript database

The development of a genetic linkage map for hexaploid oat (Avena sativa L. 2n = 6 x = 42) that defines all 21 chromosomes has been hindered due to the lack of oat-based markers and the size and complexity of the oat genome. Recent efforts in oat DArT, SSR, and SNP marker development should improve...
Detection of genetic association and functional polymorphisms of UGDH affecting milk production trait in Chinese Holstein cattle.

PubMed

Xu, Qing; Mei, Gui; Sun, Dongxiao; Zhang, Qin; Zhang, Yuan; Yin, Cengceng; Chen, Huiyong; Ding, Xiangdong; Liu, Jianfeng

2012-11-02

We previously localized a quantitative trait locus (QTL) on bovine chromosome 6 affecting milk production traits to a 1.5-Mb region between BMS483 and MNB-209 via genome scanning followed by fine mapping. Totally 15 genes were mapped within such linkage region through bioinformatic analysis of the cattle-human comparative map and bovine genome assembly. Of them, the UDP-glucose dehydrogenase (UGDH) was suggested as a potential positional candidate gene for milk production traits based on its corresponding physiological and biochemical functions and genetic effects. By sequencing all the coding exons and the untranslated regions in UGDH with pooled DNA of 8 sires represented the separated families detected in our previous studies, a total of ten SNPs were identified and genotyped in 1417 Holstein cows of 8 separation families. Individual SNP-based association analysis revealed 4 significant associations of SNP Ex1-1, SNP Int3-1, SNP Int5-1, and SNP Ex12-3 with milk yield (P < 0.05), and 2 significant associations of SNP Ex1-1 and SNP Ex12-3 with protein yield (P < 0.05). Furthermore, our haplotype-based association analyses indicated that haplotypes G-C-C, formed by SNP Ex12-2-SNP Int11-1-SNP Ex11-1, T-G, formed by SNP Int9-3-SNP Int9-2, and C-C, formed by SNP Int5-1-SNP Int3-1, are significantly associated with protein percentage (F=4.15; P=0.0418) and fat percentage (F=5.18~7.25; P=0.0072~0.0231). Finally, by using an in vitro expression assay, we demonstrated that the A allele of SNP Ex1-1 and T allele of SNP Ex11-1of UGDH significantly decreases the expression of UGDH by 68.0% at the RNA, and 50.1% at the protein level, suggesting that SNP Ex1-1 and Ex11-1 represent two functional polymorphisms affecting expression of UGDH and may partly contributed to the observed association of the gene with milk production traits in our samples. Taken together, our findings strongly indicate that UGDH gene could be involved in genetic variation underlying the QTL for milk production traits.
Genome-wide association study of acute post-surgical pain in humans

PubMed Central

Kim, Hyungsuk; Ramsay, Edward; Lee, Hyewon; Wahl, Sharon; Dionne, Raymond A

2009-01-01

Aims Testing a relatively small genomic region with a few hundred SNPs provides limited information. Genome-wide association studies (GWAS) provide an opportunity to overcome the limitation of candidate gene association studies. Here, we report the results of a GWAS for the responses to an NSAID analgesic. Materials & methods European Americans (60 females and 52 males) undergoing oral surgery were genotyped with Affymetrix 500K SNP assay. Additional SNP genotyping was performed from the gene in linkage disequilibrium with the candidate SNP revealed by the GWAS. Results GWAS revealed a candidate SNP (rs2562456) associated with analgesic onset, which is in linkage disequilibrium with a gene encoding a zinc finger protein. Additional SNP genotyping of ZNF429 confirmed the association with analgesic onset in humans (p = 1.8 × 10−10, degrees of freedom = 103, F = 28.3). We also found candidate loci for the maximum post-operative pain rating (rs17122021, p = 6.9 × 10−7) and post-operative pain onset time (rs6693882, p = 2.1 × 10−6), however, correcting for multiple comparisons did not sustain these genetic associations. Conclusion GWAS for acute clinical pain followed by additional SNP genotyping of a neighboring gene suggests that genetic variations in or near the loci encoding DNA binding proteins play a role in the individual variations in responses to analgesic drugs. PMID:19207018
A genetic map and germplasm diversity estimation of Mangifera indica (mango) with SNPs

USDA-ARS?s Scientific Manuscript database

Mango (Mangifera indica) is often referred to as the “King of Fruits”. As the first steps in developing a mango genomics project, we genotyped 582 individuals comprising six mapping populations with 1054 SNP markers. The resulting consensus map had 20 linkage groups defined by 726 SNP markers with...
Significant Locus and Metabolic Genetic Correlations Revealed in Genome-Wide Association Study of Anorexia Nervosa.

PubMed

Duncan, Laramie; Yilmaz, Zeynep; Gaspar, Helena; Walters, Raymond; Goldstein, Jackie; Anttila, Verneri; Bulik-Sullivan, Brendan; Ripke, Stephan; Thornton, Laura; Hinney, Anke; Daly, Mark; Sullivan, Patrick F; Zeggini, Eleftheria; Breen, Gerome; Bulik, Cynthia M

2017-09-01

The authors conducted a genome-wide association study of anorexia nervosa and calculated genetic correlations with a series of psychiatric, educational, and metabolic phenotypes. Following uniform quality control and imputation procedures using the 1000 Genomes Project (phase 3) in 12 case-control cohorts comprising 3,495 anorexia nervosa cases and 10,982 controls, the authors performed standard association analysis followed by a meta-analysis across cohorts. Linkage disequilibrium score regression was used to calculate genome-wide common variant heritability (single-nucleotide polymorphism [SNP]-based heritability [h 2 SNP ]), partitioned heritability, and genetic correlations (r g ) between anorexia nervosa and 159 other phenotypes. Results were obtained for 10,641,224 SNPs and insertion-deletion variants with minor allele frequencies >1% and imputation quality scores >0.6. The h 2 SNP of anorexia nervosa was 0.20 (SE=0.02), suggesting that a substantial fraction of the twin-based heritability arises from common genetic variation. The authors identified one genome-wide significant locus on chromosome 12 (rs4622308) in a region harboring a previously reported type 1 diabetes and autoimmune disorder locus. Significant positive genetic correlations were observed between anorexia nervosa and schizophrenia, neuroticism, educational attainment, and high-density lipoprotein cholesterol, and significant negative genetic correlations were observed between anorexia nervosa and body mass index, insulin, glucose, and lipid phenotypes. Anorexia nervosa is a complex heritable phenotype for which this study has uncovered the first genome-wide significant locus. Anorexia nervosa also has large and significant genetic correlations with both psychiatric phenotypes and metabolic traits. The study results encourage a reconceptualization of this frequently lethal disorder as one with both psychiatric and metabolic etiology.
A high-density SNP genetic map consisting of a complete set of homologous groups in autohexaploid sweetpotato (Ipomoea batatas)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro

Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
A high-density SNP genetic map consisting of a complete set of homologous groups in autohexaploid sweetpotato (Ipomoea batatas)

DOE PAGES

Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro; ...

2017-03-10

Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
Genome-wide association study identified three major QTL for carcass weight including the PLAG1-CHCHD7 QTN for stature in Japanese Black cattle

PubMed Central

2012-01-01

Background Significant quantitative trait loci (QTL) for carcass weight were previously mapped on several chromosomes in Japanese Black half-sib families. Two QTL, CW-1 and CW-2, were narrowed down to 1.1-Mb and 591-kb regions, respectively. Recent advances in genomic tools allowed us to perform a genome-wide association study (GWAS) in cattle to detect associations in a general population and estimate their effect size. Here, we performed a GWAS for carcass weight using 1156 Japanese Black steers. Results Bonferroni-corrected genome-wide significant associations were detected in three chromosomal regions on bovine chromosomes (BTA) 6, 8, and 14. The associated single nucleotide polymorphisms (SNP) on BTA 6 were in linkage disequilibrium with the SNP encoding NCAPG Ile442Met, which was previously identified as a candidate quantitative trait nucleotide for CW-2. In contrast, the most highly associated SNP on BTA 14 was located 2.3-Mb centromeric from the previously identified CW-1 region. Linkage disequilibrium mapping led to a revision of the CW-1 region within a 0.9-Mb interval around the associated SNP, and targeted resequencing followed by association analysis highlighted the quantitative trait nucleotides for bovine stature in the PLAG1-CHCHD7 intergenic region. The association on BTA 8 was accounted for by two SNP on the BovineSNP50 BeadChip and corresponded to CW-3, which was simultaneously detected by linkage analyses using half-sib families. The allele substitution effects of CW-1, CW-2, and CW-3 were 28.4, 35.3, and 35.0 kg per allele, respectively. Conclusion The GWAS revealed the genetic architecture underlying carcass weight variation in Japanese Black cattle in which three major QTL accounted for approximately one-third of the genetic variance. PMID:22607022
High-Density SNP Genotyping to Define β-Globin Locus Haplotypes

PubMed Central

Liu, Li; Muralidhar, Shalini; Singh, Manisha; Sylvan, Caprice; Kalra, Inderdeep S.; Quinn, Charles T.; Onyekwere, Onyinye C.; Pace, Betty S.

2014-01-01

Five major β-globin locus haplotypes have been established in individuals with sickle cell disease (SCD) from the Benin, Bantu, Senegal, Cameroon, and Arab-Indian populations. Historically, β-haplotypes were established using restriction fragment length polymorphism (RFLP) analysis across the β-locus, which consists of five functional β-like globin genes located on chromosome 11. Previous attempts to correlate these haplotypes as robust predictors of clinical phenotypes observed in SCD have not been successful. We speculate that the coverage and distribution of the RFLP sites located proximal to or within the globin genes are not sufficiently dense to accurately reflect the complexity of this region. To test our hypothesis, we performed RFLP analysis and high-density single nucleotide polymorphism (SNP) genotyping across the β-locus using DNA samples from either healthy African Americans with normal hemoglobin A (HbAA) or individuals with homozygous SS (HbSS) disease. Using the genotyping data from 88 SNPs and Haploview analysis, we generated a greater number of haplotypes than that observed with RFLP analysis alone. Furthermore, a unique pattern of long-range linkage disequilibrium between the locus control region and the β-like globin genes was observed in the HbSS group. Interestingly, we observed multiple SNPs within the HindIII restriction site located in the Gγ-globin intervening sequence II which produced the same RFLP pattern. These findings illustrated the inability of RFLP analysis to decipher the complexity of sequence variations that impacts genomic structure in this region. Our data suggest that high density SNP mapping may be required to accurately define β-haplotypes that correlate with the different clinical phenotypes observed in SCD. PMID:18829352
Population-Specific Patterns of Linkage Disequilibrium and SNP Variation in Spring and Winter Polyploid Wheat

USDA-ARS?s Scientific Manuscript database

Single nucleotide polymorphisms (SNPs) are ideally suited for the construction of high-resolution genetic maps, studying population evolutionary history and performing genome-wide association mapping experiments. Here we used a genome-wide set of 1536 SNPs to study linkage disequilibrium (LD) and po...
A second generation SNP and SSR integrated linkage map and QTL mapping for the Chinese mitten crab Eriocheir sinensis

PubMed Central

Qiu, Gao-Feng; Xiong, Liang-Wei; Han, Zhi-Ke; Liu, Zhi-Qiang; Feng, Jian-Bin; Wu, Xu-Gan; Yan, Yin-Long; Shen, Hong; Huang, Long; Chen, Li

2017-01-01

The Chinese mitten crab Eriocheir sinensis is the most economically important cultivated crab species in China, and its genome has a high number of chromosomes (2n = 146). To obtain sufficient markers for construction of a dense genetic map for this species, we employed the recently developed specific-locus amplified fragment sequencing (SLAF-seq) method for large-scale SNPs screening and genotyping in a F1 full-sib family of 149 individuals. SLAF-seq generated 127,677 polymorphic SNP markers, of which 20,803 valid markers were assigned into five segregation types and were used together with previous SSR markers for linkage map construction. The final integrated genetic map included 17,680 SNP and 629 SSR markers on the 73 linkage groups (LG), and spanned 14,894.9 cM with an average marker interval of 0.81 cM. QTL mapping localized three significant growth-related QTL to a 1.2 cM region in LG53 as well as 146 sex-linked markers in LG48. Genome-wide QTL-association analysis further identified four growth-related QTL genes named LNX2, PAK2, FMRFamide and octopamine receptors. These genes are involved in a variety of different signaling pathways including cell proliferation and growth. The map and SNP markers described here will be a valuable resource for the E. sinensis genome project and selective breeding programs. PMID:28045132
X-linked infantile spinal muscular atrophy: clinical definition and molecular mapping.

PubMed

Dressman, Devin; Ahearn, Mary Ellen; Yariz, Kemal O; Basterrecha, Hugo; Martínez, Francisco; Palau, Francesc; Barmada, M Michael; Clark, Robin Dawn; Meindl, Alfons; Wirth, Brunhilde; Hoffman, Eric P; Baumbach-Reardon, Lisa

2007-01-01

X-linked infantile spinal-muscular atrophy (XL-SMA) is a rare disorder, which presents with the clinical characteristics of hypotonia, areflexia, and multiple congenital contractures (arthrogryposis) associated with loss of anterior horn cells and death in infancy. We have previously reported a single family with XL-SMA that mapped to Xp11.3-q11.2. Here we report further clinical description of XL-SMA plus an additional seven unrelated (XL-SMA) families from North America and Europe that show linkage data consistent with the same region. We first investigated linkage to the candidate disease gene region using microsatellite repeat markers. We further saturated the candidate disease gene region using polymorphic microsatellite repeat markers and single nucleotide polymorphisms in an effort to narrow the critical region. Two-point and multipoint linkage analysis was performed using the Allegro software package. Linkage analysis of all XL-SMA families displayed linkage consistent with the original XL-SMA region. The addition of new families and new markers has narrowed the disease gene interval for a XL-SMA locus between SNP FLJ22843 near marker DXS 8080 and SNP ARHGEF9 which is near DXS7132 (Xp11.3-Xq11.1).
Clarifying sub-genomic positions of QTLs for flowering habit and fruit quality in U.S. strawberry (Fragaria×ananassa) breeding populations using pedigree-based QTL analysis

PubMed Central

Verma, Sujeet; Zurn, Jason D; Salinas, Natalia; Mathey, Megan M; Denoyes, Beatrice; Hancock, James F; Finn, Chad E; Bassil, Nahla V; Whitaker, Vance M

2017-01-01

The cultivated strawberry (Fragaria×ananassa) is consumed worldwide for its flavor and nutritional benefits. Genetic analysis of commercially important traits in strawberry are important for the development of breeding methods and tools for this species. Although several quantitative trait loci (QTL) have been previously detected for fruit quality and flowering traits using low-density genetic maps, clarity on the sub-genomic locations of these QTLs was missing. Recent discoveries in allo-octoploid strawberry genomics led to the development of the IStraw90 single-nucleotide polymorphism (SNP) array, enabling high-density genetic maps and finer resolution QTL analysis. In this study, breeder-specified traits were evaluated in the Eastern (Michigan) and Western (Oregon) United States for a common set of breeding populations during 2 years. Several QTLs were validated for soluble solids content (SSC), fruit weight (FWT), pH and titratable acidity (TA) using a pedigree-based QTL analysis approach. For fruit quality, a QTL for SSC on linkage group (LG) 6A, a QTL for FWT on LG 2BII, a QTL for pH on LG 4CII and two QTLs for TA on LGs 2A and 5B were detected. In addition, a large-effect QTL for flowering was detected at the distal end of LG 4A, coinciding with the FaPFRU locus. Marker haplotype analysis in the FaPFRU region indicated that the homozygous recessive genotype was highly predictive of seasonal flowering. SNP probes in the FaPFRU region may help facilitate marker-assisted selection for this trait. PMID:29138689

Clarifying sub-genomic positions of QTLs for flowering habit and fruit quality in U.S. strawberry (Fragaria×ananassa) breeding populations using pedigree-based QTL analysis.

PubMed

Verma, Sujeet; Zurn, Jason D; Salinas, Natalia; Mathey, Megan M; Denoyes, Beatrice; Hancock, James F; Finn, Chad E; Bassil, Nahla V; Whitaker, Vance M

2017-01-01

The cultivated strawberry ( Fragaria × ananassa ) is consumed worldwide for its flavor and nutritional benefits. Genetic analysis of commercially important traits in strawberry are important for the development of breeding methods and tools for this species. Although several quantitative trait loci (QTL) have been previously detected for fruit quality and flowering traits using low-density genetic maps, clarity on the sub-genomic locations of these QTLs was missing. Recent discoveries in allo-octoploid strawberry genomics led to the development of the IStraw90 single-nucleotide polymorphism (SNP) array, enabling high-density genetic maps and finer resolution QTL analysis. In this study, breeder-specified traits were evaluated in the Eastern (Michigan) and Western (Oregon) United States for a common set of breeding populations during 2 years. Several QTLs were validated for soluble solids content (SSC), fruit weight (FWT), pH and titratable acidity (TA) using a pedigree-based QTL analysis approach. For fruit quality, a QTL for SSC on linkage group (LG) 6A, a QTL for FWT on LG 2BII, a QTL for pH on LG 4CII and two QTLs for TA on LGs 2A and 5B were detected. In addition, a large-effect QTL for flowering was detected at the distal end of LG 4A, coinciding with the FaPFRU locus. Marker haplotype analysis in the FaPFRU region indicated that the homozygous recessive genotype was highly predictive of seasonal flowering. SNP probes in the FaPFRU region may help facilitate marker-assisted selection for this trait.
Fast and Accurate Construction of Ultra-Dense Consensus Genetic Maps Using Evolution Strategy Optimization

PubMed Central

Mester, David; Ronin, Yefim; Schnable, Patrick; Aluru, Srinivas; Korol, Abraham

2015-01-01

Our aim was to develop a fast and accurate algorithm for constructing consensus genetic maps for chip-based SNP genotyping data with a high proportion of shared markers between mapping populations. Chip-based genotyping of SNP markers allows producing high-density genetic maps with a relatively standardized set of marker loci for different mapping populations. The availability of a standard high-throughput mapping platform simplifies consensus analysis by ignoring unique markers at the stage of consensus mapping thereby reducing mathematical complicity of the problem and in turn analyzing bigger size mapping data using global optimization criteria instead of local ones. Our three-phase analytical scheme includes automatic selection of ~100-300 of the most informative (resolvable by recombination) markers per linkage group, building a stable skeletal marker order for each data set and its verification using jackknife re-sampling, and consensus mapping analysis based on global optimization criterion. A novel Evolution Strategy optimization algorithm with a global optimization criterion presented in this paper is able to generate high quality, ultra-dense consensus maps, with many thousands of markers per genome. This algorithm utilizes "potentially good orders" in the initial solution and in the new mutation procedures that generate trial solutions, enabling to obtain a consensus order in reasonable time. The developed algorithm, tested on a wide range of simulated data and real world data (Arabidopsis), outperformed two tested state-of-the-art algorithms by mapping accuracy and computation time. PMID:25867943
Genome-Wide Single-Nucleotide Polymorphisms Discovery and High-Density Genetic Map Construction in Cauliflower Using Specific-Locus Amplified Fragment Sequencing

PubMed Central

Zhao, Zhenqing; Gu, Honghui; Sheng, Xiaoguang; Yu, Huifang; Wang, Jiansheng; Huang, Long; Wang, Dan

2016-01-01

Molecular markers and genetic maps play an important role in plant genomics and breeding studies. Cauliflower is an important and distinctive vegetable; however, very few molecular resources have been reported for this species. In this study, a novel, specific-locus amplified fragment (SLAF) sequencing strategy was employed for large-scale single nucleotide polymorphism (SNP) discovery and high-density genetic map construction in a double-haploid, segregating population of cauliflower. A total of 12.47 Gb raw data containing 77.92 M pair-end reads were obtained after processing and 6815 polymorphic SLAFs between the two parents were detected. The average sequencing depths reached 52.66-fold for the female parent and 49.35-fold for the male parent. Subsequently, these polymorphic SLAFs were used to genotype the population and further filtered based on several criteria to construct a genetic linkage map of cauliflower. Finally, 1776 high-quality SLAF markers, including 2741 SNPs, constituted the linkage map with average data integrity of 95.68%. The final map spanned a total genetic length of 890.01 cM with an average marker interval of 0.50 cM, and covered 364.9 Mb of the reference genome. The markers and genetic map developed in this study could provide an important foundation not only for comparative genomics studies within Brassica oleracea species but also for quantitative trait loci identification and molecular breeding of cauliflower. PMID:27047515
A second generation genetic linkage map of Japanese flounder (Paralichthys olivaceus)

PubMed Central

2010-01-01

Background Japanese flounder (Paralichthys olivaceus) is one of the most economically important marine species in Northeast Asia. Information on genetic markers associated with quantitative trait loci (QTL) can be used in breeding programs to identify and select individuals carrying desired traits. Commercial production of Japanese flounder could be increased by developing disease-resistant fish and improving commercially important traits. Previous maps have been constructed with AFLP markers and a limited number of microsatellite markers. In this study, improved genetic linkage maps are presented. In contrast with previous studies, these maps were built mainly with a large number of codominant markers so they can potentially be used to analyze different families and populations. Results Sex-specific genetic linkage maps were constructed for the Japanese flounder including a total of 1,375 markers [1,268 microsatellites, 105 single nucleotide polymorphisms (SNPs) and two genes]; 1,167 markers are linked to the male map and 1,067 markers are linked to the female map. The lengths of the male and female maps are 1,147.7 cM and 833.8 cM, respectively. Based on estimations of map lengths, the female and male maps covered 79 and 82% of the genome, respectively. Recombination ratio in the new maps revealed F:M of 1:0.7. All linkage groups in the maps presented large differences in the location of sex-specific recombination hot-spots. Conclusions The improved genetic linkage maps are very useful for QTL analyses and marker-assisted selection (MAS) breeding programs for economically important traits in Japanese flounder. In addition, SNP flanking sequences were blasted against Tetraodon nigroviridis (puffer fish) and Danio rerio (zebrafish), and synteny analysis has been carried out. The ability to detect synteny among species or genera based on homology analysis of SNP flanking sequences may provide opportunities to complement initial QTL experiments with candidate gene approaches from homologous chromosomal locations identified in related model organisms. PMID:20937088
Quantitative analysis of low-density SNP data for parentage assignment and estimation of family contributions to pooled samples.

PubMed

Henshall, John M; Dierens, Leanne; Sellars, Melony J

2014-09-02

While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are sufficiently accurate to provide useful information for a breeding program. Treating genotypes as quantitative values is an alternative to perturbing genotypes using an assumed error distribution, but can produce very different results. An understanding of the distribution of the error is required for SNP genotyping platforms.
Proteasome modulator 9 gene SNPs, responsible for anti-depressant response, are in linkage with generalized anxiety disorder.

PubMed

Gragnoli, Claudia

2014-09-01

Proteasome modulator 9 (PSMD9) gene single nucleotide polymorphism (SNP) rs1043307/rs2514259 (E197G) is associated with significant clinical response to the anti-depressant desipramine. PSMD9 SNP rs74421874 [intervening sequence (IVS) 3 + nt460 G>A], rs3825172 (IVS3 + nt437 C>T) and rs1043307/rs2514259 (E197G A>G) are all linked to type 2 diabetes (T2D), maturity-onset-diabetes-of the young 3 (MODY3), obesity and waist circumference, hypertension, hypercholesterolemia, T2D-macrovascular and T2D-microvascular disease, T2D-neuropathy, T2D-carpal tunnel syndrome, T2D-nephropathy, T2D-retinopathy, non-diabetic retinopathy and depression. PSMD9 rs149556654 rare SNP (N166S A>G) and the variant S143G A>G also contribute to T2D. PSMD9 is located in the chromosome 12q24 locus, which per se is in linkage with depression, bipolar disorder and anxiety. In the present study, we wanted to determine whether PSMD9 is linked to general anxiety disorder in Italian T2D families. Two-hundred Italian T2D families were phenotyped for generalized anxiety disorder, using the diagnostic criteria of DSM-IV. When the diagnosis was unavailable or unclear, the trait was reported as unknown. The 200 Italians families were tested for the PSMD9 T2D risk SNPs rs74421874 (IVS3 + nt460 G>A), rs3825172 (IVS3 +nt437 T>C) and for the T2D risk and anti-depressant response SNP rs1043307/rs2514259 (E197G A>G) for evidence of linkage with generalized anxiety disorder. Non-parametric linkage analysis was executed via Merlin software. One-thousand simulation tests were performed to exclude results due to random chance. In our study, the PSMD9 gene SNPs rs74421874, rs3825172, and rs1043307/rs2514259 result in linkage to generalized anxiety disorder. This is the first report describing PSMD9 gene SNPs in linkage to generalized anxiety disorder in T2D families. © 2014 Wiley Periodicals, Inc.
Construction of a SNP and SSR linkage map in autotetraploid blueberry using genotyping by sequencing

USDA-ARS?s Scientific Manuscript database

A mapping population developed from a cross between two key highbush blueberry cultivars, Draper × Jewel (Vaccinium corymbosum), segregating for a number of important phenotypic traits, has been utilized to produce a genetic linkage map. Data on 233 single sequence repeat (SSR) markers and 1794 sing...
Nested association mapping for dissecting complex traits using Peanut 58K SNP array

USDA-ARS?s Scientific Manuscript database

Genome-wide association studies (GWAS) and linkage mapping have been the two most predominant strategies to dissect complex traits, but are limited by the occurrence of false positives reported for GWAS, and low resolution in the case of linkage analysis. This has led to the development of a joint a...
Ultra-low-density genotype panels for breed assignment of Angus and Hereford cattle.

PubMed

Judge, M M; Kelleher, M M; Kearney, J F; Sleator, R D; Berry, D P

2017-06-01

Angus and Hereford beef is marketed internationally for apparent superior meat quality attributes; DNA-based breed authenticity could be a useful instrument to ensure consumer confidence on premium meat products. The objective of this study was to develop an ultra-low-density genotype panel to accurately quantify the Angus and Hereford breed proportion in biological samples. Medium-density genotypes (13 306 single nucleotide polymorphisms (SNPs)) were available on 54 703 commercial and 4042 purebred animals. The breed proportion of the commercial animals was generated from the medium-density genotypes and this estimate was regarded as the gold-standard breed composition. Ten genotype panels (100 to 1000 SNPs) were developed from the medium-density genotypes; five methods were used to identify the most informative SNPs and these included the Delta statistic, the fixation (F st) statistic and an index of both. Breed assignment analyses were undertaken for each breed, panel density and SNP selection method separately with a programme to infer population structure using the entire 13 306 SNP panel (representing the gold-standard measure). Breed assignment was undertaken for all commercial animals (n=54 703), animals deemed to contain some proportion of Angus based on pedigree (n=5740) and animals deemed to contain some proportion of Hereford based on pedigree (n=5187). The predicted breed proportion of all animals from the lower density panels was then compared with the gold-standard breed prediction. Panel density, SNP selection method and breed all had a significant effect on the correlation of predicted and actual breed proportion. Regardless of breed, the Index method of SNP selection numerically (but not significantly) outperformed all other selection methods in accuracy (i.e. correlation and root mean square of prediction) when panel density was ⩾300 SNPs. The correlation between actual and predicted breed proportion increased as panel density increased. Using 300 SNPs (selected using the global index method), the correlation between predicted and actual breed proportion was 0.993 and 0.995 in the Angus and Hereford validation populations, respectively. When SNP panels optimised for breed prediction in one population were used to predict the breed proportion of a separate population, the correlation between predicted and actual breed proportion was 0.034 and 0.044 weaker in the Hereford and Angus populations, respectively (using the 300 SNP panel). It is necessary to include at least 300 to 400 SNPs (per breed) on genotype panels to accurately predict breed proportion from biological samples.
A combined genome-wide linkage and association approach to find susceptibility loci for platelet function phenotypes in European American and African American families with coronary artery disease

PubMed Central

2010-01-01

Background The inability of aspirin (ASA) to adequately suppress platelet aggregation is associated with future risk of coronary artery disease (CAD). Heritability studies of agonist-induced platelet function phenotypes suggest that genetic variation may be responsible for ASA responsiveness. In this study, we leverage independent information from genome-wide linkage and association data to determine loci controlling platelet phenotypes before and after treatment with ASA. Methods Clinical data on 37 agonist-induced platelet function phenotypes were evaluated before and after a 2-week trial of ASA (81 mg/day) in 1231 European American and 846 African American healthy subjects with a family history of premature CAD. Principal component analysis was performed to minimize the number of independent factors underlying the covariance of these various phenotypes. Multi-point sib-pair based linkage analysis was performed using a microsatellite marker set, and single-SNP association tests were performed using markers from the Illumina 1 M genotyping chip from deCODE Genetics, Inc. All analyses were performed separately within each ethnic group. Results Several genomic regions appear to be linked to ASA response factors: a 10 cM region in African Americans on chromosome 5q11.2 had several STRs with suggestive (p-value < 7 × 10-4) and significant (p-value < 2 × 10-5) linkage to post aspirin platelet response to ADP, and ten additional factors had suggestive evidence for linkage (p-value < 7 × 10-4) to thirteen genomic regions. All but one of these factors were aspirin response variables. While the strength of genome-wide SNP association signals for factors showing evidence for linkage is limited, especially at the strict thresholds of genome-wide criteria (N = 9 SNPs for 11 factors), more signals were considered significant when the association signal was weighted by evidence for linkage (N = 30 SNPs). Conclusions Our study supports the hypothesis that platelet phenotypes in response to ASA likely have genetic control and the combined approach of linkage and association offers an alternative approach to prioritizing regions of interest for subsequent follow-up. PMID:20529293
Linkage disequilibrium and signatures of positive selection around LINE-1 retrotransposons in the human genome.

PubMed

Kuhn, Alexandre; Ong, Yao Min; Cheng, Ching-Yu; Wong, Tien Yin; Quake, Stephen R; Burkholder, William F

2014-06-03

Insertions of the human-specific subfamily of LINE-1 (L1) retrotransposon are highly polymorphic across individuals and can critically influence the human transcriptome. We hypothesized that L1 insertions could represent genetic variants determining important human phenotypic traits, and performed an integrated analysis of L1 elements and single nucleotide polymorphisms (SNPs) in several human populations. We found that a large fraction of L1s were in high linkage disequilibrium with their surrounding genomic regions and that they were well tagged by SNPs. However, L1 variants were only partially captured by SNPs on standard SNP arrays, so that their potential phenotypic impact would be frequently missed by SNP array-based genome-wide association studies. We next identified potential phenotypic effects of L1s by looking for signatures of natural selection linked to L1 insertions; significant extended haplotype homozygosity was detected around several L1 insertions. This finding suggests that some of these L1 insertions may have been the target of recent positive selection.
Genetic mapping of Pinus flexilis major gene (Cr4) for resistance to white pine blister rust using transcriptome-based SNP genotyping

Treesearch

Jun-Jun Liu; Anna W. Schoettle; Richard A. Sniezko; Rona N. Sturrock; Arezoo Zamany; Holly Williams; Amanda Ha; Danelle Chan; Bob Danchok; Douglas P. Savin; Angelia Kegley

2016-01-01

Linkage of DNA markers with phenotypic traits provides essential information to dissect clustered genes with potential phenotypic contributions in a target genome region. Pinus flexilis E. James (limber pine) is a keystone five-needle pine species in mountain-top ecosystems of North America. White pine blister rust (WPBR), caused by a non-native fungal...
Uneven recombination and linkage disequilibrium across a reference SNP map for common bean (Phaseolus vulgaris L.)

USDA-ARS?s Scientific Manuscript database

Linkage disequilibrium (LD) and recombination (R) analyses are the basis for plant breeding. LD and R vary by breeding system, by generation of inbreeding or outcrossing and by region of the chromosome. Common bean (Phaseolus vulgaris L.) is a favored food legume with a small sequenced genome and n=...
Genetic Architecture of Capitate Glandular Trichome Density in Florets of Domesticated Sunflower (Helianthus annuus L.)

PubMed Central

Gao, Qing-Ming; Kane, Nolan C.; Hulke, Brent S.; Reinert, Stephan; Pogoda, Cloe S.; Tittes, Silas; Prasifka, Jarrad R.

2018-01-01

Capitate glandular trichomes (CGT), one type of glandular trichomes, are most common in Asteraceae species. CGT can produce various secondary metabolites such as sesquiterpene lactones (STLs) and provide durable resistance to insect pests. In sunflower, CGT-based host resistance is effective to combat the specialist pest, sunflower moth. However, the genetic basis of CGT density is not well understood in sunflower. In this study, we identified two major QTL controlling CGT density in sunflower florets by using a F4 mapping population derived from the cross HA 300 × RHA 464 with a genetic linkage map constructed from genotyping-by-sequencing data and composed of 2121 SNP markers. One major QTL is located on chromosome 5, which explained 11.61% of the observed phenotypic variation, and the second QTL is located on chromosome 6, which explained 14.06% of the observed phenotypic variation. The QTL effects and the association between CGT density and QTL support interval were confirmed in a validation population which included 39 sunflower inbred lines with diverse genetic backgrounds. We also identified two strong candidate genes in the QTL support intervals, and the functions of their orthologs in other plant species suggested their potential roles in regulating capitate glandular trichome density in sunflower. Our results provide valuable information to sunflower breeding community for developing host resistance to sunflower insect pests. PMID:29375602
Joint genome-wide association study for milk fatty acid traits in Chinese and Danish Holstein populations.

PubMed

Li, X; Buitenhuis, A J; Lund, M S; Li, C; Sun, D; Zhang, Q; Poulsen, N A; Su, G

2015-11-01

The identification of causal genes or genomic regions associated with fatty acids (FA) will enhance our understanding of the pathways underlying FA synthesis and provide opportunities for changing milk fat composition through a genetic approach. The linkage disequilibrium between adjacent markers is highly consistent between the Chinese and Danish Holstein populations, such that a joint genome-wide association study (GWAS) can be performed. In this study, a joint GWAS was performed for 16 milk FA traits based on data of 784 Chinese and 371 Danish Holstein cows genotyped by a high-density bovine single nucleotide polymorphism (SNP) array. A total of 486,464 SNP markers on 29 bovine autosomes were used. Bonferroni corrections were applied to adjust the significance thresholds for multiple testing at the genome- and chromosome-wide levels. According to the analysis of either the Chinese or Danish data individually, the total numbers of overlapping SNP that were significant at the chromosome level were 94 for C14:1, 208 for the C14 index, and 1 for C18:0. Joint analysis using the combined data of the 2 populations detected greater numbers of significant SNP compared with either of the individual populations alone for 7 and 10 traits at the genome- and chromosome-wide significance levels, respectively. Greater numbers of significant SNP were detected for C18:0 and the C18 index in the Chinese population compared with the joint analysis. Sixty-five significant SNP across all traits had significantly different effects in the 2 populations. Ten FA were influenced by a quantitative trait loci (QTL) region including DGAT1. Both C14:1 and the C14 index were influenced by a QTL region including SCD1 in the combined population. Other QTL regions also showed significant associations with the studied FA. A large region (14.9-24.9 Mbp) in BTA26 significantly influenced C14:1 and the C14 index in both populations, mostly likely due to the SNP in SCD1. A QTL region (69.97-73.69 Mbp) on BTA9 showed a significantly different effect on C18:0 between the 2 populations. Detection of these important SNP and the corresponding QTL regions will be helpful for follow-up studies to identify causal mutations and their interaction with environments for milk FA in dairy cattle. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Genes with stable DNA methylation levels show higher evolutionary conservation than genes with fluctuant DNA methylation levels.

PubMed

Zhang, Ruijie; Lv, Wenhua; Luan, Meiwei; Zheng, Jiajia; Shi, Miao; Zhu, Hongjie; Li, Jin; Lv, Hongchao; Zhang, Mingming; Shang, Zhenwei; Duan, Lian; Jiang, Yongshuai

2015-11-24

Different human genes often exhibit different degrees of stability in their DNA methylation levels between tissues, samples or cell types. This may be related to the evolution of human genome. Thus, we compared the evolutionary conservation between two types of genes: genes with stable DNA methylation levels (SM genes) and genes with fluctuant DNA methylation levels (FM genes). For long-term evolutionary characteristics between species, we compared the percentage of the orthologous genes, evolutionary rate dn/ds and protein sequence identity. We found that the SM genes had greater percentages of the orthologous genes, lower dn/ds, and higher protein sequence identities in all the 21 species. These results indicated that the SM genes were more evolutionarily conserved than the FM genes. For short-term evolutionary characteristics among human populations, we compared the single nucleotide polymorphism (SNP) density, and the linkage disequilibrium (LD) degree in HapMap populations and 1000 genomes project populations. We observed that the SM genes had lower SNP densities, and higher degrees of LD in all the 11 HapMap populations and 13 1000 genomes project populations. These results mean that the SM genes had more stable chromosome genetic structures, and were more conserved than the FM genes.
Genetics of recurrent early-onset major depression (GenRED): significant linkage on chromosome 15q25-q26 after fine mapping with single nucleotide polymorphism markers.

PubMed

Levinson, Douglas F; Evgrafov, Oleg V; Knowles, James A; Potash, James B; Weissman, Myrna M; Scheftner, William A; Depaulo, J Raymond; Crowe, Raymond R; Murphy-Eberenz, Kathleen; Marta, Diana H; McInnis, Melvin G; Adams, Philip; Gladis, Madeline; Miller, Erin B; Thomas, Jo; Holmans, Peter

2007-02-01

The authors studied a dense map of single nucleotide polymorphism (SNP) DNA markers on chromosome 15q25-q26 to maximize the informativeness of genetic linkage analyses in a region where they previously reported suggestive evidence for linkage of recurrent early-onset major depressive disorder. In 631 European-ancestry families with multiple cases of recurrent early-onset major depressive disorder, 88 SNPs were genotyped, and multipoint allele-sharing linkage analyses were carried out. Marker-marker linkage disequilibrium was minimized, and a simulation study with founder haplotypes from these families suggested that linkage scores were not inflated by linkage disequilibrium. The dense SNP map increased the information content of the analysis from around 0.7 to over 0.9. The maximum evidence for linkage was the Z likelihood ratio score statistic of Kong and Cox (Z(LR))=4.69 at 109.8 cM. The exact p value was below the genomewide significance threshold. By contrast, in the genome scan with microsatellite markers at 9 cM spacing, the maximum Z(LR) for European-ancestry families was 3.43 (106.53 cM). It was estimated that the linked locus or loci in this region might account for a 20% or less populationwide increase in risk to siblings of cases. This region has produced modestly positive evidence for linkage to depression and related traits in other studies. These results suggest that DNA sequence variations in one or more genes in the 15q25-q26 region can increase susceptibility to major depression and that efforts are warranted to identify these genes.
Combined genome-wide linkage and targeted association analysis of head circumference in autism spectrum disorder families.

PubMed

Woodbury-Smith, M; Bilder, D A; Morgan, J; Jerominski, L; Darlington, T; Dyer, T; Paterson, A D; Coon, H

2017-01-01

It has long been recognized that there is an association between enlarged head circumference (HC) and autism spectrum disorder (ASD), but the genetics of HC in ASD is not well understood. In order to investigate the genetic underpinning of HC in ASD, we undertook a genome-wide linkage study of HC followed by linkage signal targeted association among a sample of 67 extended pedigrees with ASD. HC measurements on members of 67 multiplex ASD extended pedigrees were used as a quantitative trait in a genome-wide linkage analysis. The Illumina 6K SNP linkage panel was used, and analyses were carried out using the SOLAR implemented variance components model. Loci identified in this way formed the target for subsequent association analysis using the Illumina OmniExpress chip and imputed genotypes. A modification of the qTDT was used as implemented in SOLAR. We identified a linkage signal spanning 6p21.31 to 6p22.2 (maximum LOD = 3.4). Although targeted association did not find evidence of association with any SNP overall, in one family with the strongest evidence of linkage, there was evidence for association (rs17586672, p = 1.72E-07). Although this region does not overlap with ASD linkage signals in these same samples, it has been associated with other psychiatric risk, including ADHD, developmental dyslexia, schizophrenia, specific language impairment, and juvenile bipolar disorder. The genome-wide significant linkage signal represents the first reported observation of a potential quantitative trait locus for HC in ASD and may be relevant in the context of complex multivariate risk likely leading to ASD.
High-throughput genotyping-by-sequencing facilitates molecular tagging of a novel rust resistance gene, R 15 , in sunflower (Helianthus annuus L.).

PubMed

Ma, G J; Song, Q J; Markell, S G; Qi, L L

2018-07-01

A novel rust resistance gene, R 15 , derived from the cultivated sunflower HA-R8 was assigned to linkage group 8 of the sunflower genome using a genotyping-by-sequencing approach. SNP markers closely linked to R 15 were identified, facilitating marker-assisted selection of resistance genes. The rust virulence gene is co-evolving with the resistance gene in sunflower, leading to the emergence of new physiologic pathotypes. This presents a continuous threat to the sunflower crop necessitating the development of resistant sunflower hybrids providing a more efficient, durable, and environmentally friendly host plant resistance. The inbred line HA-R8 carries a gene conferring resistance to all known races of the rust pathogen in North America and can be used as a broad-spectrum resistance resource. Based on phenotypic assessments of 140 F 2 individuals derived from a cross of HA 89 with HA-R8, rust resistance in the population was found to be conferred by a single dominant gene (R 15 ) originating from HA-R8. Genotypic analysis with the currently available SSR markers failed to find any association between rust resistance and any markers. Therefore, we used genotyping-by-sequencing (GBS) analysis to achieve better genomic coverage. The GBS data showed that R 15 was located at the top end of linkage group (LG) 8. Saturation with 71 previously mapped SNP markers selected within this region further showed that it was located in a resistance gene cluster on LG8, and mapped to a 1.0-cM region between three co-segregating SNP makers SFW01920, SFW00128, and SFW05824 as well as the NSA_008457 SNP marker. These closely linked markers will facilitate marker-assisted selection and breeding in sunflower.
A High Density SNP Array for the Domestic Horse and Extant Perissodactyla: Utility for Association Mapping, Genetic Diversity, and Phylogeny Studies

PubMed Central

McCue, Molly E.; Bannasch, Danika L.; Petersen, Jessica L.; Gurr, Jessica; Bailey, Ernie; Binns, Matthew M.; Distl, Ottmar; Guérin, Gérard; Hasegawa, Telhisa; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Penedo, M. Cecilia T.; Røed, Knut H.; Ryder, Oliver A.; Swinburne, June E.; Tozaki, Teruaki; Valberg, Stephanie J.; Vaudin, Mark; Lindblad-Toh, Kerstin

2012-01-01

An equine SNP genotyping array was developed and evaluated on a panel of samples representing 14 domestic horse breeds and 18 evolutionarily related species. More than 54,000 polymorphic SNPs provided an average inter-SNP spacing of ∼43 kb. The mean minor allele frequency across domestic horse breeds was 0.23, and the number of polymorphic SNPs within breeds ranged from 43,287 to 52,085. Genome-wide linkage disequilibrium (LD) in most breeds declined rapidly over the first 50–100 kb and reached background levels within 1–2 Mb. The extent of LD and the level of inbreeding were highest in the Thoroughbred and lowest in the Mongolian and Quarter Horse. Multidimensional scaling (MDS) analyses demonstrated the tight grouping of individuals within most breeds, close proximity of related breeds, and less tight grouping in admixed breeds. The close relationship between the Przewalski's Horse and the domestic horse was demonstrated by pair-wise genetic distance and MDS. Genotyping of other Perissodactyla (zebras, asses, tapirs, and rhinoceros) was variably successful, with call rates and the number of polymorphic loci varying across taxa. Parsimony analysis placed the modern horse as sister taxa to Equus przewalski. The utility of the SNP array in genome-wide association was confirmed by mapping the known recessive chestnut coat color locus (MC1R) and defining a conserved haplotype of ∼750 kb across all breeds. These results demonstrate the high quality of this SNP genotyping resource, its usefulness in diverse genome analyses of the horse, and potential use in related species. PMID:22253606

A high-resolution genetic linkage map and QTL fine mapping for growth-related traits and sex in the Yangtze River common carp (Cyprinus carpio haematopterus).

PubMed

Feng, Xiu; Yu, Xiaomu; Fu, Beide; Wang, Xinhua; Liu, Haiyang; Pang, Meixia; Tong, Jingou

2018-04-02

A high-density genetic linkage map is essential for QTL fine mapping, comparative genome analysis, identification of candidate genes and marker-assisted selection for economic traits in aquaculture species. The Yangtze River common carp (Cyprinus carpio haematopterus) is one of the most important aquacultured strains in China. However, quite limited genetics and genomics resources have been developed for genetic improvement of economic traits in such strain. A high-resolution genetic linkage map was constructed by using 7820 2b-RAD (2b-restriction site-associated DNA) and 295 microsatellite markers in a F2 family of the Yangtze River common carp (C. c. haematopterus). The length of the map was 4586.56 cM with an average marker interval of 0.57 cM. Comparative genome mapping revealed that a high proportion (70%) of markers with disagreed chromosome location was observed between C. c. haematopterus and another common carp strain (subspecies) C. c. carpio. A clear 2:1 relationship was observed between C. c. haematopterus linkage groups (LGs) and zebrafish (Danio rerio) chromosomes. Based on the genetic map, 21 QTLs for growth-related traits were detected on 12 LGs, and contributed values of phenotypic variance explained (PVE) ranging from 16.3 to 38.6%, with LOD scores ranging from 4.02 to 11.13. A genome-wide significant QTL (LOD = 10.83) and three chromosome-wide significant QTLs (mean LOD = 4.84) for sex were mapped on LG50 and LG24, respectively. A 1.4 cM confidence interval of QTL for all growth-related traits showed conserved synteny with a 2.06 M segment on chromosome 14 of D. rerio. Five potential candidate genes were identified by blast search in this genomic region, including a well-studied multi-functional growth related gene, Apelin. We mapped a set of suggestive and significant QTLs for growth-related traits and sex based on a high-density genetic linkage map using SNP and microsatellite markers for Yangtze River common carp. Several candidate growth genes were also identified from the QTL regions by comparative mapping. This genetic map would provide a basis for genome assembly and comparative genomics studies, and those QTL-derived candidate genes and genetic markers are useful genomic resources for marker-assisted selection (MAS) of growth-related traits in the Yangtze River common carp.
Porcine NAMPT gene: search for polymorphism, mapping and association studies.

PubMed

Cepica, S; Bartenschlager, H; Ovilo, C; Zrůstová, J; Masopust, M; Fernández, A; López, A; Knoll, A; Rohrer, G A; Snelling, W M; Geldermann, H

2010-12-01

NAMPT encodes an enzyme catalysing the rate-limiting step in NAD biosynthesis. The extracellular form of the enzyme is known as adipokine visfatin. We detected SNP AM999341:g.669T>C (referred to as 669T>C) in intron 9 and SNP FN392209:g.358A>G (referred to as 358A>G) in the promoter of the gene. RH mapping linked the gene to microsatellite SW944. Linkage analysis placed the gene on the current USDA – USMARC linkage map at position 92 cM on SSC9. Association analyses were performed in a wild boar × Meishan F2 family (W × M), with 45 traits recorded (growth and fattening, fat deposition, muscling, meat quality, stress resistance and other traits), and in a commercial Landrace × Chinese-European (LCE) synthetic population with records for 15 traits (growth, fat deposition, muscling, intramuscular fat, meat colour and backfat fatty acid content). In the W × M, SNP 669T>C was associated with muscling, fat deposition, growth and fattening, meat quality and other traits and in the LCE with muscling, meat quality and backfat fatty acid composition. In the W × M, SNP 358A>G was associated with muscling, fat deposition, growth and other traits. After correction for multiple testing, the NAMPT haplotypes were associated in the W × M with, in descending order, muscling (q = 0.0056), growth (q = 0.0056), fat deposition (q = 0.0109), fat-to-meat ratio (q = 0.0135), cooling losses (q = 0.0568) and longissimus pHU (q = 0.0695). The SNPs are hypothesized to be in linkage disequilibrium with a causative mutation affecting energy metabolism as a whole rather than fat metabolism alone.
Evaluation of a SNP map of 6q24-27 confirms diabetic nephropathy loci and identifies novel associations type 2 diabetes patients enriched with nephropathy from an African American population

PubMed Central

Leak, Tennille S.; Mychaleckyj, Josyf C.; Smith, Shelly G.; Keene, Keith L.; Gordon, Candace J.; Hicks, Pamela J.; Freedman, Barry I.; Bowden, Donald W.; Sale, Michèle M.

2009-01-01

Previously we performed a genome scan for type 2 diabetes (T2DM) using 638 African-American (AA) affected sibling pairs from 247 families; non-parametric linkage analysis suggested evidence of linkage at 6q24-27 (LOD 2.26). To comprehensively evaluate this region we performed a 2-stage association study by first constructing a SNP map of 754 SNPs selected from HapMap on the basis of linkage disequilibrium (LD) in 300 AAT2DM-ESRD subjects, 311 AA controls, 43 European American controls and 45 Yoruba Nigerian samples (Set 1). Replication analyses were conducted in an independent population of 283 AA T2DM-ESRD subjects and 282 AA controls (Set 2). In addition, we adjusted for the impact of admixture on association results by using ancestry informative markers (AIMs). In Stage 1, 137 (18.2%) SNPs showed nominal evidence of association (P<0.05) in one or more of tests of association: allelic (n=33), dominant (n=36), additive (n=29), or recessive (n=34) genotypic models, and 2- (n=47) and 3-SNP (n=43) haplotypic analyses. These SNPs were selected for follow-up genotyping. Stage 2 analyses confirmed association with a predicted 2-SNP “risk” haplotype in the PARK2 gene. Also, two intergenic SNPs showed consistent genotypic association with T2DM-ESRD: rs12197043 and rs4897081. Combined analysis of all subjects from both stages revealed nominal associations with 17 SNPs within genes; including suggestive associations in ESR1 and PARK2. This study confirms known diabetic nephropathy loci and identifies potentially novel susceptibility variants located within 6q24-27 in AA. PMID:18560894
Compression and fast retrieval of SNP data.

PubMed

Sambo, Francesco; Di Camillo, Barbara; Toffolo, Gianna; Cobelli, Claudio

2014-11-01

The increasing interest in rare genetic variants and epistatic genetic effects on complex phenotypic traits is currently pushing genome-wide association study design towards datasets of increasing size, both in the number of studied subjects and in the number of genotyped single nucleotide polymorphisms (SNPs). This, in turn, is leading to a compelling need for new methods for compression and fast retrieval of SNP data. We present a novel algorithm and file format for compressing and retrieving SNP data, specifically designed for large-scale association studies. Our algorithm is based on two main ideas: (i) compress linkage disequilibrium blocks in terms of differences with a reference SNP and (ii) compress reference SNPs exploiting information on their call rate and minor allele frequency. Tested on two SNP datasets and compared with several state-of-the-art software tools, our compression algorithm is shown to be competitive in terms of compression rate and to outperform all tools in terms of time to load compressed data. Our compression and decompression algorithms are implemented in a C++ library, are released under the GNU General Public License and are freely downloadable from http://www.dei.unipd.it/~sambofra/snpack.html. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Compression and fast retrieval of SNP data

PubMed Central

Sambo, Francesco; Di Camillo, Barbara; Toffolo, Gianna; Cobelli, Claudio

2014-01-01

Motivation: The increasing interest in rare genetic variants and epistatic genetic effects on complex phenotypic traits is currently pushing genome-wide association study design towards datasets of increasing size, both in the number of studied subjects and in the number of genotyped single nucleotide polymorphisms (SNPs). This, in turn, is leading to a compelling need for new methods for compression and fast retrieval of SNP data. Results: We present a novel algorithm and file format for compressing and retrieving SNP data, specifically designed for large-scale association studies. Our algorithm is based on two main ideas: (i) compress linkage disequilibrium blocks in terms of differences with a reference SNP and (ii) compress reference SNPs exploiting information on their call rate and minor allele frequency. Tested on two SNP datasets and compared with several state-of-the-art software tools, our compression algorithm is shown to be competitive in terms of compression rate and to outperform all tools in terms of time to load compressed data. Availability and implementation: Our compression and decompression algorithms are implemented in a C++ library, are released under the GNU General Public License and are freely downloadable from http://www.dei.unipd.it/~sambofra/snpack.html. Contact: sambofra@dei.unipd.it or cobelli@dei.unipd.it. PMID:25064564
BAC-End Sequence-Based SNP Mining in Allotetraploid Cotton (Gossypium) Utilizing Resequencing Data, Phylogenetic Inferences, and Perspectives for Genetic Mapping

PubMed Central

Hulse-Kemp, Amanda M.; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A.; Scheffler, Brian E.; Fang, David D.; Chen, Z. Jeffrey; Van Deynze, Allen; Stelly, David M.

2015-01-01

A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. PMID:25858960
A High-Density Integrated DArTseq SNP-Based Genetic Map of Pisum fulvum and Identification of QTLs Controlling Rust Resistance

PubMed Central

Barilli, Eleonora; Cobos, María J.; Carrillo, Estefanía; Kilian, Andrzej; Carling, Jason; Rubiales, Diego

2018-01-01

Pisum fulvum, a wild relative of pea is an important source of allelic diversity to improve the genetic resistance of cultivated species against fungal diseases of economic importance like the pea rust caused by Uromyces pisi. To unravel the genetic control underlying resistance to this fungal disease, a recombinant inbred line (RIL) population was generated from a cross between two P. fulvum accessions, IFPI3260 and IFPI3251, and genotyped using Diversity Arrays Technology. A total of 9,569 high-quality DArT-Seq and 8,514 SNPs markers were generated. Finally, a total of 12,058 markers were assembled into seven linkage groups, equivalent to the number of haploid chromosomes of P. fulvum and P. sativum. The newly constructed integrated genetic linkage map of P. fulvum covered an accumulated distance of 1,877.45 cM, an average density of 1.19 markers cM−1 and an average distance between adjacent markers of 1.85 cM. The composite interval mapping revealed three QTLs distributed over two linkage groups that were associated with the percentage of rust disease severity (DS%). QTLs UpDSII and UpDSIV were located in the LGs II and IV respectively and were consistently identified both in adult plants over 3 years at the field (Córdoba, Spain) and in seedling plants under controlled conditions. Whenever they were detected, their contribution to the total phenotypic variance varied between 19.8 and 29.2. A third QTL (UpDSIV.2) was also located in the LGIVand was environmentally specific as was only detected for DS % in seedlings under controlled conditions. It accounted more than 14% of the phenotypic variation studied. Taking together the data obtained in the study, it could be concluded that the expression of resistance to fungal diseases in P. fulvum originates from the resistant parent IFPI3260. PMID:29497430
Genome-wide linkage and association analysis of cardiometabolic phenotypes in Hispanic Americans.

PubMed

Hellwege, Jacklyn N; Palmer, Nicholette D; Dimitrov, Latchezar; Keaton, Jacob M; Tabb, Keri L; Sajuthi, Satria; Taylor, Kent D; Ng, Maggie C Y; Speliotes, Elizabeth K; Hawkins, Gregory A; Long, Jirong; Ida Chen, Yii-Der; Lorenzo, Carlos; Norris, Jill M; Rotter, Jerome I; Langefeld, Carl D; Wagenknecht, Lynne E; Bowden, Donald W

2017-02-01

Linkage studies of complex genetic diseases have been largely replaced by genome-wide association studies, due in part to limited success in complex trait discovery. However, recent interest in rare and low-frequency variants motivates re-examination of family-based methods. In this study, we investigated the performance of two-point linkage analysis for over 1.6 million single-nucleotide polymorphisms (SNPs) combined with single variant association analysis to identify high impact variants, which are both strongly linked and associated with cardiometabolic traits in up to 1414 Hispanics from the Insulin Resistance Atherosclerosis Family Study (IRASFS). Evaluation of all 50 phenotypes yielded 83 557 000 LOD (logarithm of the odds) scores, with 9214 LOD scores ⩾3.0, 845 ⩾4.0 and 89 ⩾5.0, with a maximal LOD score of 6.49 (rs12956744 in the LAMA1 gene for tumor necrosis factor-α (TNFα) receptor 2). Twenty-seven variants were associated with P<0.005 as well as having an LOD score >4, including variants in the NFIB gene under a linkage peak with TNFα receptor 2 levels on chromosome 9. Linkage regions of interest included a broad peak (31 Mb) on chromosome 1q with acute insulin response (max LOD=5.37). This region was previously documented with type 2 diabetes in family-based studies, providing support for the validity of these results. Overall, we have demonstrated the utility of two-point linkage and association in comprehensive genome-wide array-based SNP genotypes.
Genetic Mapping and Exome Sequencing Identify Variants Associated with Five Novel Diseases

PubMed Central

Puffenberger, Erik G.; Jinks, Robert N.; Sougnez, Carrie; Cibulskis, Kristian; Willert, Rebecca A.; Achilly, Nathan P.; Cassidy, Ryan P.; Fiorentini, Christopher J.; Heiken, Kory F.; Lawrence, Johnny J.; Mahoney, Molly H.; Miller, Christopher J.; Nair, Devika T.; Politi, Kristin A.; Worcester, Kimberly N.; Setton, Roni A.; DiPiazza, Rosa; Sherman, Eric A.; Eastman, James T.; Francklyn, Christopher; Robey-Bond, Susan; Rider, Nicholas L.; Gabriel, Stacey; Morton, D. Holmes; Strauss, Kevin A.

2012-01-01

The Clinic for Special Children (CSC) has integrated biochemical and molecular methods into a rural pediatric practice serving Old Order Amish and Mennonite (Plain) children. Among the Plain people, we have used single nucleotide polymorphism (SNP) microarrays to genetically map recessive disorders to large autozygous haplotype blocks (mean = 4.4 Mb) that contain many genes (mean = 79). For some, uninformative mapping or large gene lists preclude disease-gene identification by Sanger sequencing. Seven such conditions were selected for exome sequencing at the Broad Institute; all had been previously mapped at the CSC using low density SNP microarrays coupled with autozygosity and linkage analyses. Using between 1 and 5 patient samples per disorder, we identified sequence variants in the known disease-causing genes SLC6A3 and FLVCR1, and present evidence to strongly support the pathogenicity of variants identified in TUBGCP6, BRAT1, SNIP1, CRADD, and HARS. Our results reveal the power of coupling new genotyping technologies to population-specific genetic knowledge and robust clinical data. PMID:22279524
African-specific variability in the acetylcholine muscarinic receptor M4: association with cocaine and heroin addiction.

PubMed

Levran, Orna; Randesi, Matthew; Peles, Einat; Correa da Rosa, Joel; Ott, Jurg; Rotrosen, John; Adelson, Miriam; Kreek, Mary Jeanne

2016-06-01

This study was designed to determine whether polymorphisms in acetylcholine receptors contribute to opioid dependence and/or cocaine dependence. The sample (n = 1860) was divided by drug and ancestry, and 55 polymorphisms (nine genes) were analyzed. Of the 20 SNPs that showed nominally significant associations, the association of the African-specific CHRM4 SNP rs2229163 (Asn417=) with cocaine dependence survived correction for multiple testing (Pcorrected = 0.047). CHRM4 is located in a region of strong linkage disequilibrium on chromosome 11 that includes genes associated with schizophrenia. CHRM4 SNP rs2229163 is in strong linkage disequilibrium with several African-specific SNPs in DGKZ and AMBRA1. Cholinergic receptors' variants may contribute to drug addiction and have a potential role as pharmacogenetic markers.
A comprehensive screen for SNP associations on chromosome region 5q31-33 in Swedish/Norwegian celiac disease families.

PubMed

Amundsen, Silja Svanstrøm; Adamovic, Svetlana; Hellqvist, Asa; Nilsson, Staffan; Gudjónsdóttir, Audur H; Ascher, Henry; Ek, Johan; Larsson, Kristina; Wahlström, Jan; Lie, Benedicte A; Sollid, Ludvig M; Naluai, Asa Torinsson

2007-09-01

Celiac disease (CD) is a gluten-induced enteropathy, which results from the interplay between environmental and genetic factors. There is a strong human leukocyte antigen (HLA) association with the disease, and HLA-DQ alleles represent a major genetic risk factor. In addition to HLA-DQ, non-HLA genes appear to be crucial for CD development. Chromosomal region 5q31-33 has demonstrated linkage with CD in several genome-wide studies, including in our Swedish/Norwegian cohort. In a European meta-analysis 5q31-33 was the only region that reached a genome-wide level of significance except for the HLA region. To identify the genetic variant(s) responsible for this linkage signal, we performed a comprehensive single nucleotide polymorphism (SNP) association screen in 97 Swedish/Norwegian multiplex families who demonstrate linkage to the region. We selected tag SNPs from a 16 Mb region representing the 95% confidence interval of the linkage peak. A total of 1,404 SNPs were used for the association analysis. We identified several regions with SNPs demonstrating moderate single- or multipoint associations. However, the isolated association signals appeared insufficient to account for the linkage signal seen in our cohort. Collective effects of multiple risk genes within the region, incomplete genetic coverage or effects related to copy number variation are possible explanations for our findings.
Gene-Based Single Nucleotide Polymorphism Markers for Genetic and Association Mapping in Common Bean

PubMed Central

2012-01-01

Background In common bean, expressed sequence tags (ESTs) are an underestimated source of gene-based markers such as insertion-deletions (Indels) or single-nucleotide polymorphisms (SNPs). However, due to the nature of these conserved sequences, detection of markers is difficult and portrays low levels of polymorphism. Therefore, development of intron-spanning EST-SNP markers can be a valuable resource for genetic experiments such as genetic mapping and association studies. Results In this study, a total of 313 new gene-based markers were developed at target genes. Intronic variation was deeply explored in order to capture more polymorphism. Introns were putatively identified after comparing the common bean ESTs with the soybean genome, and the primers were designed over intron-flanking regions. The intronic regions were evaluated for parental polymorphisms using the single strand conformational polymorphism (SSCP) technique and Sequenom MassARRAY system. A total of 53 new marker loci were placed on an integrated molecular map in the DOR364 × G19833 recombinant inbred line (RIL) population. The new linkage map was used to build a consensus map, merging the linkage maps of the BAT93 × JALO EEP558 and DOR364 × BAT477 populations. A total of 1,060 markers were mapped, with a total map length of 2,041 cM across 11 linkage groups. As a second application of the generated resource, a diversity panel with 93 genotypes was evaluated with 173 SNP markers using the MassARRAY-platform and KASPar technology. These results were coupled with previous SSR evaluations and drought tolerance assays carried out on the same individuals. This agglomerative dataset was examined, in order to discover marker-trait associations, using general linear model (GLM) and mixed linear model (MLM). Some significant associations with yield components were identified, and were consistent with previous findings. Conclusions In short, this study illustrates the power of intron-based markers for linkage and association mapping in common bean. The utility of these markers is discussed in relation with the usefulness of microsatellites, the molecular markers by excellence in this crop. PMID:22734675
High-resolution genetic linkage mapping, high-temperature tolerance and growth-related quantitative trait locus (QTL) identification in Marsupenaeus japonicus.

PubMed

Lu, Xia; Luan, Sheng; Hu, Long Yang; Mao, Yong; Tao, Ye; Zhong, Sheng Ping; Kong, Jie

2016-06-01

The Kuruma prawn, Marsupenaeus japonicus, is one of the most promising marine invertebrates in the industry in Asia, Europe and Australia. However, the increasing global temperatures result in considerable economic losses in M. japonicus farming. In the present study, to select genetically improved animals for the sustainable development of the Kuruma prawn industry, a high-resolution genetic linkage map and quantitative trait locus (QTL) identification were performed using the RAD technology. The maternal map contained 5849 SNP markers and spanned 3127.23 cM, with an average marker interval of 0.535 cM. Instead, the paternal map contained 3927 SNP markers and spanned 3326.19 cM, with an average marker interval of 0.847 cM. The consensus map contained 9289 SNP markers and spanned 3610.90 cM, with an average marker interval of 0.388 cM and coverage of 99.06 % of the genome. The markers were grouped into 41 linkage groups in the maps. Significantly, negative correlation was detected between high-temperature tolerance (UTT) and body weight (BW). The QTL mapping revealed 129 significant QTL loci for UTT and four significant QTL loci for BW at the genome-wide significance threshold. Among these QTLs, 129 overlapped with linked SNPs, and the remaining four were located in regions between contiguous SNPs. They explained the total phenotypic variance ranging from 8.9 to 12.4 %. Because of a significantly negative correlation between growth and high-temperature tolerance, we demonstrate that this high-resolution linkage map and QTLs would be useful for further marker-assisted selection in the genetic improvement of M. japonicus.
SNP discovery and development of genetic markers for mapping innate immune response genes in common carp (Cyprinus carpio).

PubMed

Kongchum, Pawapol; Palti, Yniv; Hallerman, Eric M; Hulata, Gideon; David, Lior

2010-08-01

Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers for susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpesvirus 3 (CyHV-3) is highly contagious and virulent in common carp (Cyprinus carpio). With the aim to develop molecular tools for breeding CyHV-3-resistant carp, we have amplified and sequenced 11 candidate genes for viral disease resistance including TLR2, TLR3, TLR4ba, TLR7, TLR9, TLR21, TLR22, MyD88, TRAF6, type I IFN and IL-1beta. For each gene, we initially cloned and sequenced PCR amplicons from 8 to 12 fish (2-3 fish per strain) from the SNP discovery panel. We then identified and evaluated putative SNPs for their polymorphisms in the SNP discovery panel and validated their usefulness for linkage analysis in a full-sib family using the SNaPshot method. Our sequencing results and phylogenetic analyses suggested that TLR3, TLR7 and MyD88 genes are duplicated in the common carp genome. We, therefore, developed locus-specific PCR primers and SNP genotyping assays for the duplicated loci. A total of 48 SNP markers were developed from PCR fragments of the 13 loci (7 single-locus and 3 duplicated genes). Thirty-nine markers were polymorphic with estimated minor allele frequencies of more than 0.1. The utility of the SNP markers was evaluated in one full-sib family and revealed that 20 markers from 9 loci segregated in a disomic and Mendelian pattern and would be useful for linkage analysis. Published by Elsevier Ltd.
Sequence-Based Genotyping for Marker Discovery and Co-Dominant Scoring in Germplasm and Populations

PubMed Central

Truong, Hoa T.; Ramos, A. Marcos; Yalcin, Feyruz; de Ruiter, Marjo; van der Poel, Hein J. A.; Huvenaars, Koen H. J.; Hogers, René C. J.; van Enckevort, Leonora. J. G.; Janssen, Antoine; van Orsouw, Nathalie J.; van Eijk, Michiel J. T.

2012-01-01

Conventional marker-based genotyping platforms are widely available, but not without their limitations. In this context, we developed Sequence-Based Genotyping (SBG), a technology for simultaneous marker discovery and co-dominant scoring, using next-generation sequencing. SBG offers users several advantages including a generic sample preparation method, a highly robust genome complexity reduction strategy to facilitate de novo marker discovery across entire genomes, and a uniform bioinformatics workflow strategy to achieve genotyping goals tailored to individual species, regardless of the availability of a reference sequence. The most distinguishing features of this technology are the ability to genotype any population structure, regardless whether parental data is included, and the ability to co-dominantly score SNP markers segregating in populations. To demonstrate the capabilities of SBG, we performed marker discovery and genotyping in Arabidopsis thaliana and lettuce, two plant species of diverse genetic complexity and backgrounds. Initially we obtained 1,409 SNPs for arabidopsis, and 5,583 SNPs for lettuce. Further filtering of the SNP dataset produced over 1,000 high quality SNP markers for each species. We obtained a genotyping rate of 201.2 genotypes/SNP and 58.3 genotypes/SNP for arabidopsis (n = 222 samples) and lettuce (n = 87 samples), respectively. Linkage mapping using these SNPs resulted in stable map configurations. We have therefore shown that the SBG approach presented provides users with the utmost flexibility in garnering high quality markers that can be directly used for genotyping and downstream applications. Until advances and costs will allow for routine whole-genome sequencing of populations, we expect that sequence-based genotyping technologies such as SBG will be essential for genotyping of model and non-model genomes alike. PMID:22662172
Pl(17) is a novel gene independent of known downy mildew resistance genes in the cultivated sunflower (Helianthus annuus L.).

PubMed

Qi, L L; Long, Y M; Jan, C C; Ma, G J; Gulya, T J

2015-04-01

Pl 17, a novel downy mildew resistance gene independent of known downy mildew resistance genes in sunflowers, was genetically mapped to linkage group 4 of the sunflower genome. Downy mildew (DM), caused by Plasmopara halstedii (Farl.). Berl. et de Toni, is one of the serious sunflower diseases in the world due to its high virulence and the variability of the pathogen. DM resistance in the USDA inbred line, HA 458, has been shown to be effective against all virulent races of P. halstedii currently identified in the USA. To determine the chromosomal location of this resistance, 186 F 2:3 families derived from a cross of HA 458 with HA 234 were phenotyped for their resistance to race 734 of P. halstedii. The segregation ratio of the population supported that the resistance was controlled by a single dominant gene, Pl 17. Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) primers were used to identify molecular markers linked to Pl 17. Bulked segregant analysis using 849 SSR markers located Pl 17 to linkage group (LG) 4, which is the first DM gene discovered in this linkage group. An F2 population of 186 individuals was screened with polymorphic SSR and SNP primers from LG4. Two flanking markers, SNP SFW04052 and SSR ORS963, delineated Pl 17 in an interval of 3.0 cM. The markers linked to Pl 17 were validated in a BC3 population. A search for the physical location of flanking markers in sunflower genome sequences revealed that the Pl 17 region had a recombination frequency of 0.59 Mb/cM, which was a fourfold higher recombination rate relative to the genomic average. This region can be considered amenable to molecular manipulation for further map-based cloning of Pl 17.
Fine mapping and association studies of a high-density lipoprotein cholesterol linkage region on chromosome 16 in French-Canadian subjects

PubMed Central

Dastani, Zari; Pajukanta, Päivi; Marcil, Michel; Rudzicz, Nicholas; Ruel, Isabelle; Bailey, Swneke D; Lee, Jenny C; Lemire, Mathieu; Faith, Janet; Platko, Jill; Rioux, John; Hudson, Thomas J; Gaudet, Daniel; Engert, James C; Genest, Jacques

2010-01-01

Low levels of high-density lipoprotein cholesterol (HDL-C) are an independent risk factor for cardiovascular disease. To identify novel genetic variants that contribute to HDL-C, we performed genome-wide scans and quantitative association studies in two study samples: a Quebec-wide study consisting of 11 multigenerational families and a study of 61 families from the Saguenay–Lac St-Jean (SLSJ) region of Quebec. The heritability of HDL-C in these study samples was 0.73 and 0.49, respectively. Variance components linkage methods identified a LOD score of 2.61 at 98 cM near the marker D16S515 in Quebec-wide families and an LOD score of 2.96 at 86 cM near the marker D16S2624 in SLSJ families. In the Quebec-wide sample, four families showed segregation over a 25.5-cM (18 Mb) region, which was further reduced to 6.6 Mb with additional markers. The coding regions of all genes within this region were sequenced. A missense variant in CHST6 segregated in four families and, with additional families, we observed a P value of 0.015 for this variant. However, an association study of this single-nucleotide polymorphism (SNP) in unrelated Quebec-wide samples was not significant. We also identified an SNP (rs11646677) in the same region, which was significantly associated with a low HDL-C (P=0.016) in the SLSJ study sample. In addition, RT-PCR results from cultured cells showed a significant difference in the expression of CHST6 and KIAA1576, another gene in the region. Our data constitute additional evidence for a locus on chromosome 16q23-24 that affects HDL-C levels in two independent French-Canadian studies. PMID:19844255
Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.

PubMed

Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R; Taylor, Jeremy F; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

2016-01-01

Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNP selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal.
Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications

PubMed Central

Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R.; Taylor, Jeremy F.; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

2016-01-01

Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNP selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal. PMID:27583971
Mapping the sex determination locus in the hāpuku (Polyprion oxygeneios) using ddRAD sequencing.

PubMed

Brown, Jeremy K; Taggart, John B; Bekaert, Michaël; Wehner, Stefanie; Palaiokostas, Christos; Setiawan, Alvin N; Symonds, Jane E; Penman, David J

2016-06-10

Hāpuku (Polyprion oxygeneios) is a member of the wreckfish family (Polyprionidae) and is highly regarded as a food fish. Although adults grow relatively slowly, juveniles exhibit low feed conversion ratios and can reach market size in 1-2 years, making P. oxygeneios a strong candidate for aquaculture. However, they can take over 5 years to reach sexual maturity in captivity and are not externally sexually dimorphic, complicating many aspects of broodstock management. Understanding the sex determination system of P. oxygeneios and developing accurate assays to assign genetic sex will contribute significantly towards its full-scale commercialisation. DNA from parents and sexed offspring (n = 57) from a single family of captive bred P. oxygeneios was used as a template for double digestion Restriction-site Associated DNA (ddRAD) sequencing. Two libraries were constructed using SbfI - SphI and SbfI - NcoI restriction enzyme combinations, respectively. Two runs on an Illumina MiSeq platform generated 70,266,464 raw reads, identifying 19,669 RAD loci. A combined sex linkage map (1367 cM) was constructed based on 1575 Single Nucleotide Polymorphism (SNP) markers that resolved into 35 linkage groups. Sex-specific linkage maps were of similar size (1132 and 1168 cM for male and female maps respectively). A single major sex-determining locus, found to be heterogametic in males, was mapped to linkage group 14. Several markers were found to be in strong linkage disequilibrium with the sex-determining locus. Allele-specific PCR assays were developed for two of these markers, SphI6331 and SphI8298, and demonstrated to accurately differentiate sex in progeny within the same pedigree. Comparative genomic analyses indicated that many of the linkage groups within the P. oxygeneios map share a relatively high degree of homology with those published for the European seabass (Dicentrarchus labrax). P. oxygeneios has an XX/XY sex determination system. Evaluation of allele-specific PCR assays, based on the two SNP markers most closely associated with phenotypic sex, indicates that a simple molecular assay for sexing P. oxygeneios should be readily attainable. The high degree of synteny observed with D. labrax should aid further molecular genetic study and exploitation of hāpuku as a food fish.

Construction of a high-density genetic map using specific length amplified fragment markers and identification of a quantitative trait locus for anthracnose resistance in walnut (Juglans regia L.).

PubMed

Zhu, Yufeng; Yin, Yanfei; Yang, Keqiang; Li, Jihong; Sang, Yalin; Huang, Long; Fan, Shu

2015-08-18

Walnut (Juglans regia, 2n = 32, approximately 606 Mb per 1C genome) is an economically important tree crop. Resistance to anthracnose, caused by Colletotrichum gloeosporioides, is a major objective of walnut genetic improvement in China. The recently developed specific length amplified fragment sequencing (SLAF-seq) is an efficient strategy that can obtain large numbers of markers with sufficient sequence information to construct high-density genetic maps and permits detection of quantitative trait loci (QTLs) for molecular breeding. SLAF-seq generated 161.64 M paired-end reads. 153,820 SLAF markers were obtained, of which 49,174 were polymorphic. 13,635 polymorphic markers were sorted into five segregation types and 2,577 markers of them were used to construct genetic linkage maps: 2,395 of these fell into 16 linkage groups (LGs) for the female map, 448 markers for the male map, and 2,577 markers for the integrated map. Taking into account the size of all LGs, the marker coverage was 2,664.36 cM for the female map, 1,305.58 cM for the male map, and 2,457.82 cM for the integrated map. The average intervals between two adjacent mapped markers were 1.11 cM, 2.91 cM and 0.95 cM for three maps, respectively. 'SNP_only' markers accounted for 89.25% of the markers on the integrated map. Mapping markers contained 5,043 single nucleotide polymorphisms (SNPs) loci, which corresponded to two SNP loci per SLAF marker. According to the integrated map, we used interval mapping (Logarithm of odds, LOD > 3.0) to detect our quantitative trait. One QTL was detected for anthracnose resistance. The interval of this QTL ranged from 165.51 cM to 176.33 cM on LG14, and ten markers in this interval that were above the threshold value were considered to be linked markers to the anthracnose resistance trait. The phenotypic variance explained by each marker ranged from 16.2 to 19.9%, and their LOD scores varied from 3.22 to 4.04. High-density genetic maps for walnut containing 16 LGs were constructed using the SLAF-seq method with an F1 population. One QTL for walnut anthracnose resistance was identified based on the map. The results will aid molecular marker-assisted breeding and walnut resistance genes identification.
Genome-wide linkage disequilibrium and genetic diversity in five populations of Australian domestic sheep.

PubMed

Al-Mamun, Hawlader Abdullah; Clark, Samuel A; Kwan, Paul; Gondro, Cedric

2015-11-24

Knowledge of the genetic structure and overall diversity of livestock species is important to maximise the potential of genome-wide association studies and genomic prediction. Commonly used measures such as linkage disequilibrium (LD), effective population size (N e ), heterozygosity, fixation index (F ST) and runs of homozygosity (ROH) are widely used and help to improve our knowledge about genetic diversity in animal populations. The development of high-density single nucleotide polymorphism (SNP) arrays and the subsequent genotyping of large numbers of animals have greatly increased the accuracy of these population-based estimates. In this study, we used the Illumina OvineSNP50 BeadChip array to estimate and compare LD (measured by r (2) and D'), N e , heterozygosity, F ST and ROH in five Australian sheep populations: three pure breeds, i.e., Merino (MER), Border Leicester (BL), Poll Dorset (PD) and two crossbred populations i.e. F1 crosses of Merino and Border Leicester (MxB) and MxB crossed to Poll Dorset (MxBxP). Compared to other livestock species, the sheep populations that were analysed in this study had low levels of LD and high levels of genetic diversity. The rate of LD decay was greater in Merino than in the other pure breeds. Over short distances (<10 kb), the levels of LD were higher in BL and PD than in MER. Similarly, BL and PD had comparatively smaller N e than MER. Observed heterozygosity in the pure breeds ranged from 0.3 in BL to 0.38 in MER. Genetic distances between breeds were modest compared to other livestock species (highest F ST = 0.063) but the genetic diversity within breeds was high. Based on ROH, two chromosomal regions showed evidence of strong recent selection. This study shows that there is a large range of genome diversity in Australian sheep breeds, especially in Merino sheep. The observed range of diversity will influence the design of genome-wide association studies and the results that can be obtained from them. This knowledge will also be useful to design reference populations for genomic prediction of breeding values in sheep.
Genome-wide population structure and evolutionary history of the Frizarta dairy sheep.

PubMed

Kominakis, A; Hager-Theodorides, A L; Saridaki, A; Antonakos, G; Tsiamis, G

2017-10-01

In the present study, we used genomic data, generated with a medium density single nucleotide polymorphisms (SNP) array, to acquire more information on the population structure and evolutionary history of the synthetic Frizarta dairy sheep. First, two typical measures of linkage disequilibrium (LD) were estimated at various physical distances that were then used to make inferences on the effective population size at key past time points. Population structure was also assessed by both multidimensional scaling analysis and k-means clustering on the distance matrix obtained from the animals' genomic relationships. The Wright's fixation F ST index was also employed to assess herds' genetic homogeneity and to indirectly estimate past migration rates. The Wright's fixation F IS index and genomic inbreeding coefficients based on the genomic relationship matrix as well as on runs of homozygosity were also estimated. The Frizarta breed displays relatively low LD levels with r 2 and |D'| equal to 0.18 and 0.50, respectively, at an average inter-marker distance of 31 kb. Linkage disequilibrium decayed rapidly by distance and persisted over just a few thousand base pairs. Rate of LD decay (β) varied widely among the 26 autosomes with larger values estimated for shorter chromosomes (e.g. β=0.057, for OAR6) and smaller values for longer ones (e.g. β=0.022, for OAR2). The inferred effective population size at the beginning of the breed's formation was as high as 549, was then reduced to 463 in 1981 (end of the breed's formation) and further declined to 187, one generation ago. Multidimensional scaling analysis and k-means clustering suggested a genetically homogenous population, F ST estimates indicated relatively low genetic differentiation between herds, whereas a heat map of the animals' genomic kinship relationships revealed a stratified population, at a herd level. Estimates of genomic inbreeding coefficients suggested that most recent parental relatedness may have been a major determinant of the current effective population size. A denser than the 50k SNP panel may be more beneficial when performing genome wide association studies in the breed.
Genetic variation in C-reactive protein (CRP) gene may be associated with risk of systemic lupus erythematosus and CRP concentrations.

PubMed

Shih, P Betty; Manzi, Susan; Shaw, Penny; Kenney, Margaret; Kao, Amy H; Bontempo, Franklin; Barmada, M Michael; Kammerer, Candace; Kamboh, M Ilyas

2008-11-01

The gene coding for C-reactive protein (CRP) is located on chromosome 1q23.2, which falls within a linkage region thought to harbor a systemic lupus erythematosus (SLE) susceptibility gene. Recently, 2 single-nucleotide polymorphisms (SNP) in the CRP gene (+838, +2043) have been shown to be associated with CRP concentrations and/or SLE risk in a British family-based cohort. Our study was done to confirm the reported association in an independent population-based case-control cohort, and also to investigate the influence of 3 additional CRP tagSNP (-861, -390, +90) on SLE risk and serum CRP concentrations. DNA from 337 Caucasian women who met the American College of Rheumatology criteria for definite (n = 324) or probable (n = 13) SLE and 448 Caucasian healthy female controls was genotyped for 5 CRP tagSNP (-861, -390, +90, +838, +2043). Genotyping was performed using restriction fragment length polymorphism-polymerase chain reaction, pyrosequencing, or TaqMan assays. Serum CRP levels were measured using ELISA. Association studies were performed using the chi-squared distribution, Z-test, Fisher's exact test, and analysis of variance. Haplotype analysis was performed using EH software and the haplo.stats package in R 2.1.2. While none of the SNP were found to be associated with SLE risk individually, there was an association with the 5 SNP haplotypes (p < 0.001). Three SNP (-861, -390, +90) were found to significantly influence serum CRP level in SLE cases, both independently and as haplotypes. Our data suggest that unique haplotype combinations in the CRP gene may modify the risk of developing SLE and influence circulating CRP levels.
The First Genetic Map in Sweet Osmanthus (Osmanthus fragrans Lour.) Using Specific Locus Amplified Fragment Sequencing

PubMed Central

He, Yanxia; Yuan, Wangjun; Dong, Meifang; Han, Yuanji; Shang, Fude

2017-01-01

Osmanthus fragrans is an ornamental plant of substantial commercial value, and no genetic linkage maps of this species have previously been reported. Specific-locus amplified fragment sequencing (SLAF-seq) is a recently developed technology that allows massive single nucleotide polymorphisms (SNPs) to be identified and high-resolution genotyping. In our current research, we generated the first genetic map of O. fragrans using SLAF-seq, which is composed with 206.92 M paired-end reads and 173,537 SLAF markers. Among total 90,715 polymorphic SLAF markers, 15,317 polymorphic SLAFs could be used for genetic map construction. The integrated map contained 14,189 high quality SLAFs that were grouped in 23 genetic linkage groups, with a total length of 2962.46 cM and an average distance of 0.21 cM between two adjacent markers. In addition, 23,664 SNPs were identified from the mapped markers. As far as we know, this is the first of the genetic map of O. fragrans. Our results are further demonstrate that SLAF-seq is a very effective method for developing markers and constructing high-density linkage maps. The SNP markers and the genetic map reported in this study should be valuable resource in future research. PMID:29018460
Partial preferential chromosome pairing is genotype dependent in tetraploid rose.

PubMed

Bourke, Peter M; Arens, Paul; Voorrips, Roeland E; Esselink, G Danny; Koning-Boucoiran, Carole F S; Van't Westende, Wendy P C; Santos Leonardo, Tiago; Wissink, Patrick; Zheng, Chaozhi; van Geest, Geert; Visser, Richard G F; Krens, Frans A; Smulders, Marinus J M; Maliepaard, Chris

2017-04-01

It has long been recognised that polyploid species do not always neatly fall into the categories of auto- or allopolyploid, leading to the term 'segmental allopolyploid' to describe everything in between. The meiotic behaviour of such intermediate species is not fully understood, nor is there consensus as to how to model their inheritance patterns. In this study we used a tetraploid cut rose (Rosa hybrida) population, genotyped using the 68K WagRhSNP array, to construct an ultra-high-density linkage map of all homologous chromosomes using methods previously developed for autotetraploids. Using the predicted bivalent configurations in this population we quantified differences in pairing behaviour among and along homologous chromosomes, leading us to correct our estimates of recombination frequency to account for this behaviour. This resulted in the re-mapping of 25 695 SNP markers across all homologues of the seven rose chromosomes, tailored to the pairing behaviour of each chromosome in each parent. We confirmed the inferred differences in pairing behaviour among chromosomes by examining repulsion-phase linkage estimates, which also carry information about preferential pairing and recombination. Currently, the closest sequenced relative to rose is Fragaria vesca. Aligning the integrated ultra-dense rose map with the strawberry genome sequence provided a detailed picture of the synteny, confirming overall co-linearity but also revealing new genomic rearrangements. Our results suggest that pairing affinities may vary along chromosome arms, which broadens our current understanding of segmental allopolyploidy. © 2017 The Authors The Plant Journal published by John Wiley & Sons Ltd and Society for Experimental Biology.
Combined linkage and association analyses identify a novel locus for obesity near PROX1 in Asians.

PubMed

Kim, Hyun-Jin; Yoo, Yun Joo; Ju, Young Seok; Lee, Seungbok; Cho, Sung-Il; Sung, Joohon; Kim, Jong-Il; Seo, Jeong-Sun

2013-11-01

Although genome-wide association studies (GWAS) have substantially contributed to understanding the genetic architecture, unidentified variants for complex traits remain an issue. One of the efficient approaches is the improvement of the power of GWAS scan by weighting P values with prior linkage signals. Our objective was to identify the novel candidates for obesity in Asian populations by using genemapping strategies that combine linkage and association analyses. To obtain linkage information for body mass index (BMI) and waist circumference (WC), we performed a multipoint genome-wide linkage study in an isolated Mongolian sample of 1,049 individuals from 74 families. Next, a family-based GWAS, which integrates within- and between-family components, was performed using the genotype data of 756 individuals of the Mongolian sample, and P values for association were weighted using linkage information obtained previously. For both BMI (LOD = 3.3) and WC (LOD = 2.6), the highest linkage peak was discovered at chromosome 10q11.22. In family-based GWAS combined with linkage information, six single-nucleotide polymorphisms (SNPs) for BMI and five SNPs for WC reached a significant level of association (linkage weighted P < 1 × 10(-5) ). Of these, only one of the SNPs associated with WC (rs1704198) was replicated in 327 Korean families comprising 1,301 individuals. This SNP was located in the proximity of the prosperorelated homeobox 1 (PROX1) gene, the function of which was validated previously in a mouse model. Our powerful strategic analysis enabled the discovery of a novel candidate gene, PROX1, associated with WC in an Asian population. Copyright © 2012 The Obesity Society.
Discrimination of candidate subgenome-specific loci by linkage map construction with an S1 population of octoploid strawberry (Fragaria × ananassa).

PubMed

Nagano, Soichiro; Shirasawa, Kenta; Hirakawa, Hideki; Maeda, Fumi; Ishikawa, Masami; Isobe, Sachiko N

2017-05-12

The strawberry, Fragaria × ananassa, is an allo-octoploid (2n = 8x = 56) and outcrossing species. Although it is the most widely consumed berry crop in the world, its complex genome structure has hindered its genetic and genomic analysis, and thus discrimination of subgenome-specific loci among the homoeologous chromosomes is needed. In the present study, we identified candidate subgenome-specific single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) loci, and constructed a linkage map using an S 1 mapping population of the cultivar 'Reikou' with an IStraw90 Axiom® SNP array and previously published SSR markers. The 'Reikou' linkage map consisted of 11,574 loci (11,002 SNPs and 572 SSR loci) spanning 2816.5 cM of 31 linkage groups. The 11,574 loci were located on 4738 unique positions (bin) on the linkage map. Of the mapped loci, 8999 (8588 SNPs and 411 SSR loci) showed a 1:2:1 segregation ratio of AA:AB:BB allele, which suggested the possibility of deriving loci from candidate subgenome-specific sequences. In addition, 2575 loci (2414 SNPs and 161 SSR loci) showed a 3:1 segregation of AB:BB allele, indicating they were derived from homoeologous genomic sequences. Comparative analysis of the homoeologous linkage groups revealed differences in genome structure among the subgenomes. Our results suggest that candidate subgenome-specific loci are randomly located across the genomes, and that there are small- to large-scale structural variations among the subgenomes. The mapped SNPs and SSR loci on the linkage map are expected to be seed points for the construction of pseudomolecules in the octoploid strawberry.
A Genome-Wide Association Meta-Analysis of Attention-Deficit/Hyperactivity Disorder Symptoms in Population-Based Paediatric Cohorts

PubMed Central

Groen-Blokhuis, Maria M.; Pourcain, Beate St.; Greven, Corina U.; Pappa, Irene; Tiesler, Carla M.T.; Ang, Wei; Nolte, Ilja M.; Vilor-Tejedor, Natalia; Bacelis, Jonas; Ebejer, Jane L.; Zhao, Huiying; Davies, Gareth E.; Ehli, Erik A.; Evans, David M.; Fedko, Iryna O.; Guxens, Mònica; Hottenga, Jouke-Jan; Hudziak, James J.; Jugessur, Astanand; Kemp, John P.; Krapohl, Eva; Martin, Nicholas G.; Murcia, Mario; Myhre, Ronny; Ormel, Johan; Ring, Susan M.; Standl, Marie; Stergiakouli, Evie; Stoltenberg, Camilla; Thiering, Elisabeth; Timpson, Nicholas J.; Trzaskowski, Maciej; van der Most, Peter J.; Wang, Carol; Nyholt, Dale R.; Medland, Sarah E.; Neale, Benjamin; Jacobsson, Bo; Sunyer, Jordi; Hartman, Catharina A.; Whitehouse, Andrew J.O.; Pennell, Craig E.; Heinrich, Joachim; Plomin, Robert; Smith, George Davey; Tiemeier, Henning; Posthuma, Danielle; Boomsma, Dorret I.

2016-01-01

Objective To elucidate the influence of common genetic variants on childhood attention-deficit/hyperactivity disorder (ADHD) symptoms, to identify genetic variants that explain its high heritability, and to investigate the genetic overlap of ADHD symptom scores with ADHD diagnosis. Method Within the EArly Genetics and Lifecourse Epidemiology (EAGLE) consortium, genome-wide single nucleotide polymorphisms (SNPs) and ADHD symptom scores were available for 17,666 children (< 13 years) from nine population-based cohorts. SNP-based heritability was estimated in data from the three largest cohorts. Meta-analysis based on genome-wide association (GWA) analyses with SNPs was followed by gene-based association tests, and the overlap in results with a meta-analysis in the Psychiatric Genomics Consortium (PGC) case-control ADHD study was investigated. Results SNP-based heritability ranged from 5% to 34%, indicating that variation in common genetic variants influences ADHD symptom scores. The meta-analysis did not detect genome-wide significant SNPs, but three genes, lying close to each other with SNPs in high linkage disequilibrium (LD), showed a gene-wide significant association (p values between 1.46×10-6 and 2.66×10-6). One gene, WASL, is involved in neuronal development. Both SNP- and gene-based analyses indicated overlap with the PGC meta-analysis results with the genetic correlation estimated at 0.96. Conclusion The SNP-based heritability for ADHD symptom scores indicates a polygenic architecture and genes involved in neurite outgrowth are possibly involved. Continuous and dichotomous measures of ADHD appear to assess a genetically common phenotype. A next step is to combine data from population-based and case-control cohorts in genetic association studies to increase sample size and improve statistical power for identifying genetic variants. PMID:27663945
Genome-wide Linkage and Positional Association Study of Blood Pressure Response to Dietary Sodium Intervention

PubMed Central

Mei, Hao; Gu, Dongfeng; Hixson, James E.; Rice, Treva K.; Chen, Jing; Shimmin, Lawrence C.; Schwander, Karen; Kelly, Tanika N.; Liu, De-Pei; Chen, Shufeng; Huang, Jian-feng; Jaquish, Cashell E.; Rao, Dabeeru C.; He, Jiang

2012-01-01

The authors conducted a genome-wide linkage scan and positional association analysis to identify the genetic determinants of salt sensitivity of blood pressure (BP) in a large family-based, dietary-feeding study. The dietary intervention was conducted among 1,906 participants in rural China (2003–2005). A 7-day low-sodium intervention was followed by a 7-day high-sodium intervention. Salt sensitivity was defined as BP responses to low- and high-sodium interventions. Signals of the logarithm of the odds to the base 10 (LOD ≥ 3) were detected at 33–42 centimorgans of chromosome 2 (2p24.3-2p24.1), with a maximum LOD score of 3.33 for diastolic blood pressure responses to high-sodium intervention. LOD scores were 2.35–2.91 for mean arterial pressure (MAP) and 0.80–1.49 for systolic blood pressure responses in this region, respectively. Correcting for multiple tests, single nucleotide polymorphism (SNP) rs11674786 (2.7 kilobases upstream of the family with sequence similarity 84, member A, gene (FAM84A)) in the linkage region was significantly associated with diastolic blood pressure (P = 0.0007) and MAP responses (P = 0.0007), and SNP rs16983422 (2.8 kilobases upstream of the visinin-like 1 gene (VSNL1)) was marginally associated with diastolic blood pressure (P = 0.005) and MAP responses (P = 0.005). An additive interaction between SNPs rs11674786 and rs16983422 was observed, with P = 7.00 × 10−5 and P = 7.23 × 10−5 for diastolic blood pressure and MAP responses, respectively. The authors concluded that genetic region 2p24.3-2p24.1 might harbor functional variants for the salt sensitivity of BP. PMID:22865701
Investigation of genetic variation in scavenger receptor class B, member 1 (SCARB1) and association with serum carotenoids

PubMed Central

McKay, Gareth J; Loane, Edward; Nolan, John M; Patterson, Christopher C; Meyers, Kristin J; Mares, Julie A; Yonova-Doing, Ekaterina; Hammond, Christopher J; Beatty, Stephen; Silvestri, Giuliana

2013-01-01

Objective To investigate association of scavenger receptor class B, member 1 (SCARB1) genetic variants with serum carotenoid levels of lutein (L) and zeaxanthin (Z) and macular pigment optical density (MPOD). Design A cross-sectional study of healthy adults aged 20-70. Participants 302 participants recruited following local advertisement. Methods MPOD was measured by customized heterochromatic flicker photometry. Fasting blood samples were taken for serum L and Z measurement by HPLC and lipoprotein analysis by spectrophotometric assay. Forty-seven single nucleotide polymorphisms (SNPs) across SCARB1 were genotyped using Sequenom technology. Association analyses were performed using PLINK to compare allele and haplotype means, with adjustment for potential confounding and correction for multiple comparisons by permutation testing. Replication analysis was performed in the TwinsUK and CAREDS cohorts. Main outcome measures Odds ratios (ORs) for macular pigment optical density area, serum lutein and zeaxanthin concentrations associated with genetic variations in SCARB1 and interactions between SCARB1 and sex. Results Following multiple regression analysis with adjustment for age, body mass index, sex, high-density lipoprotein cholesterol (HDLc), low-density lipoprotein cholesterol (LDLc), triglycerides, smoking, dietary L and Z levels, 5 SNPs were significantly associated with serum L concentration and 1 SNP with MPOD (P<0.01). Only the association between rs11057841 and serum L withstood correction for multiple comparisons by permutation testing (P<0.01) and replicated in the TwinsUK cohort (P=0.014). Independent replication was also observed in the CAREDS cohort with rs10846744 (P=2×10−4), a SNP in high linkage disequilibrium with rs11057841 (r2=0.93). No significant interactions by sex were found. Haplotype analysis revealed no stronger association than obtained with single SNP analyses. Conclusions Our study has identified association between rs11057841 and serum L concentration (24% increase per T allele) in healthy subjects, independent of potential confounding factors. Our data supports further evaluation of the role for SCARB1 in the transport of macular pigment and the possible modulation of AMD risk through combating the effects of oxidative stress within the retina. PMID:23562302
High-resolution genetic mapping of allelic variants associated with cell wall chemistry in Populus.

PubMed

Muchero, Wellington; Guo, Jianjun; DiFazio, Stephen P; Chen, Jin-Gui; Ranjan, Priya; Slavov, Gancho T; Gunter, Lee E; Jawdy, Sara; Bryan, Anthony C; Sykes, Robert; Ziebell, Angela; Klápště, Jaroslav; Porth, Ilga; Skyba, Oleksandr; Unda, Faride; El-Kassaby, Yousry A; Douglas, Carl J; Mansfield, Shawn D; Martin, Joel; Schackwitz, Wendy; Evans, Luke M; Czarnecki, Olaf; Tuskan, Gerald A

2015-01-23

QTL cloning for the discovery of genes underlying polygenic traits has historically been cumbersome in long-lived perennial plants like Populus. Linkage disequilibrium-based association mapping has been proposed as a cloning tool, and recent advances in high-throughput genotyping and whole-genome resequencing enable marker saturation to levels sufficient for association mapping with no a priori candidate gene selection. Here, multiyear and multienvironment evaluation of cell wall phenotypes was conducted in an interspecific P. trichocarpa x P. deltoides pseudo-backcross mapping pedigree and two partially overlapping populations of unrelated P. trichocarpa genotypes using pyrolysis molecular beam mass spectrometry, saccharification, and/ or traditional wet chemistry. QTL mapping was conducted using a high-density genetic map with 3,568 SNP markers. As a fine-mapping approach, chromosome-wide association mapping targeting a QTL hot-spot on linkage group XIV was performed in the two P. trichocarpa populations. Both populations were genotyped using the 34 K Populus Infinium SNP array and whole-genome resequencing of one of the populations facilitated marker-saturation of candidate intervals for gene identification. Five QTLs ranging in size from 0.6 to 1.8 Mb were mapped on linkage group XIV for lignin content, syringyl to guaiacyl (S/G) ratio, 5- and 6-carbon sugars using the mapping pedigree. Six candidate loci exhibiting significant associations with phenotypes were identified within QTL intervals. These associations were reproducible across multiple environments, two independent genotyping platforms, and different plant growth stages. cDNA sequencing for allelic variants of three of the six loci identified polymorphisms leading to variable length poly glutamine (PolyQ) stretch in a transcription factor annotated as an ANGUSTIFOLIA C-terminus Binding Protein (CtBP) and premature stop codons in a KANADI transcription factor as well as a protein kinase. Results from protoplast transient expression assays suggested that each of the polymorphisms conferred allelic differences in the activation of cellulose, hemicelluloses, and lignin pathway marker genes. This study illustrates the utility of complementary QTL and association mapping as tools for gene discovery with no a priori candidate gene selection. This proof of concept in a perennial organism opens up opportunities for discovery of novel genetic determinants of economically important but complex traits in plants.
Extent of genome-wide linkage disequilibrium in Australian Holstein-Friesian cattle based on a high-density SNP panel.

PubMed

Khatkar, Mehar S; Nicholas, Frank W; Collins, Andrew R; Zenger, Kyall R; Cavanagh, Julie A L; Barris, Wes; Schnabel, Robert D; Taylor, Jeremy F; Raadsma, Herman W

2008-04-24

The extent of linkage disequilibrium (LD) within a population determines the number of markers that will be required for successful association mapping and marker-assisted selection. Most studies on LD in cattle reported to date are based on microsatellite markers or small numbers of single nucleotide polymorphisms (SNPs) covering one or only a few chromosomes. This is the first comprehensive study on the extent of LD in cattle by analyzing data on 1,546 Holstein-Friesian bulls genotyped for 15,036 SNP markers covering all regions of all autosomes. Furthermore, most studies in cattle have used relatively small sample sizes and, consequently, may have had biased estimates of measures commonly used to describe LD. We examine minimum sample sizes required to estimate LD without bias and loss in accuracy. Finally, relatively little information is available on comparative LD structures including other mammalian species such as human and mouse, and we compare LD structure in cattle with public-domain data from both human and mouse. We computed three LD estimates, D', Dvol and r2, for 1,566,890 syntenic SNP pairs and a sample of 365,400 non-syntenic pairs. Mean D' is 0.189 among syntenic SNPs, and 0.105 among non-syntenic SNPs; mean r2 is 0.024 among syntenic SNPs and 0.0032 among non-syntenic SNPs. All three measures of LD for syntenic pairs decline with distance; the decline is much steeper for r2 than for D' and Dvol. The value of D' and Dvol are quite similar. Significant LD in cattle extends to 40 kb (when estimated as r2) and 8.2 Mb (when estimated as D'). The mean values for LD at large physical distances are close to those for non-syntenic SNPs. Minor allelic frequency threshold affects the distribution and extent of LD. For unbiased and accurate estimates of LD across marker intervals spanning < 1 kb to > 50 Mb, minimum sample sizes of 400 (for D') and 75 (for r2) are required. The bias due to small samples sizes increases with inter-marker interval. LD in cattle is much less extensive than in a mouse population created from crossing inbred lines, and more extensive than in humans. For association mapping in Holstein-Friesian cattle, for a given design, at least one SNP is required for each 40 kb, giving a total requirement of at least 75,000 SNPs for a low power whole-genome scan (median r2 > 0.19) and up to 300,000 markers at 10 kb intervals for a high power genome scan (median r2 > 0.62). For estimation of LD by D' and Dvol with sufficient precision, a sample size of at least 400 is required, whereas for r2 a minimum sample of 75 is adequate.
Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine

PubMed Central

2011-01-01

Background Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequenced Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map, made it possible to align the 12 linkage groups of both species. Conclusions Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using new generation sequencing technologies and will include SNPs from comparative orthologous sequences that were identified in the present study, providing a wider collection of anchor points for comparative genomics among the conifers. PMID:21767361
Analysis and visualization of chromosomal abnormalities in SNP data with SNPscan

PubMed Central

Ting, Jason C; Ye, Ying; Thomas, George H; Ruczinski, Ingo; Pevsner, Jonathan

2006-01-01

Background A variety of diseases are caused by chromosomal abnormalities such as aneuploidies (having an abnormal number of chromosomes), microdeletions, microduplications, and uniparental disomy. High density single nucleotide polymorphism (SNP) microarrays provide information on chromosomal copy number changes, as well as genotype (heterozygosity and homozygosity). SNP array studies generate multiple types of data for each SNP site, some with more than 100,000 SNPs represented on each array. The identification of different classes of anomalies within SNP data has been challenging. Results We have developed SNPscan, a web-accessible tool to analyze and visualize high density SNP data. It enables researchers (1) to visually and quantitatively assess the quality of user-generated SNP data relative to a benchmark data set derived from a control population, (2) to display SNP intensity and allelic call data in order to detect chromosomal copy number anomalies (duplications and deletions), (3) to display uniparental isodisomy based on loss of heterozygosity (LOH) across genomic regions, (4) to compare paired samples (e.g. tumor and normal), and (5) to generate a file type for viewing SNP data in the University of California, Santa Cruz (UCSC) Human Genome Browser. SNPscan accepts data exported from Affymetrix Copy Number Analysis Tool as its input. We validated SNPscan using data generated from patients with known deletions, duplications, and uniparental disomy. We also inspected previously generated SNP data from 90 apparently normal individuals from the Centre d'Étude du Polymorphisme Humain (CEPH) collection, and identified three cases of uniparental isodisomy, four females having an apparently mosaic X chromosome, two mislabelled SNP data sets, and one microdeletion on chromosome 2 with mosaicism from an apparently normal female. These previously unrecognized abnormalities were all detected using SNPscan. The microdeletion was independently confirmed by fluorescence in situ hybridization, and a region of homozygosity in a UPD case was confirmed by sequencing of genomic DNA. Conclusion SNPscan is useful to identify chromosomal abnormalities based on SNP intensity (such as chromosomal copy number changes) and heterozygosity data (including regions of LOH and some cases of UPD). The program and source code are available at the SNPscan website . PMID:16420694
Development of a Genetic Map for Onion (Allium cepa L.) Using Reference-Free Genotyping-by-Sequencing and SNP Assays

PubMed Central

Jo, Jinkwan; Purushotham, Preethi M.; Han, Koeun; Lee, Heung-Ryul; Nah, Gyoungju; Kang, Byoung-Cheorl

2017-01-01

Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs. PMID:28959273
Genetic Determinants of Lipid Traits in Diverse Populations from the Population Architecture using Genomics and Epidemiology (PAGE) Study

PubMed Central

Dumitrescu, Logan; Carty, Cara L.; Taylor, Kira; Schumacher, Fredrick R.; Hindorff, Lucia A.; Ambite, José L.; Anderson, Garnet; Best, Lyle G.; Brown-Gentry, Kristin; Bůžková, Petra; Carlson, Christopher S.; Cochran, Barbara; Cole, Shelley A.; Devereux, Richard B.; Duggan, Dave; Eaton, Charles B.; Fornage, Myriam; Franceschini, Nora; Haessler, Jeff; Howard, Barbara V.; Johnson, Karen C.; Laston, Sandra; Kolonel, Laurence N.; Lee, Elisa T.; MacCluer, Jean W.; Manolio, Teri A.; Pendergrass, Sarah A.; Quibrera, Miguel; Shohet, Ralph V.; Wilkens, Lynne R.; Haiman, Christopher A.; Le Marchand, Loïc; Buyske, Steven; Kooperberg, Charles; North, Kari E.; Crawford, Dana C.

2011-01-01

For the past five years, genome-wide association studies (GWAS) have identified hundreds of common variants associated with human diseases and traits, including high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), and triglyceride (TG) levels. Approximately 95 loci associated with lipid levels have been identified primarily among populations of European ancestry. The Population Architecture using Genomics and Epidemiology (PAGE) study was established in 2008 to characterize GWAS–identified variants in diverse population-based studies. We genotyped 49 GWAS–identified SNPs associated with one or more lipid traits in at least two PAGE studies and across six racial/ethnic groups. We performed a meta-analysis testing for SNP associations with fasting HDL-C, LDL-C, and ln(TG) levels in self-identified European American (∼20,000), African American (∼9,000), American Indian (∼6,000), Mexican American/Hispanic (∼2,500), Japanese/East Asian (∼690), and Pacific Islander/Native Hawaiian (∼175) adults, regardless of lipid-lowering medication use. We replicated 55 of 60 (92%) SNP associations tested in European Americans at p<0.05. Despite sufficient power, we were unable to replicate ABCA1 rs4149268 and rs1883025, CETP rs1864163, and TTC39B rs471364 previously associated with HDL-C and MAFB rs6102059 previously associated with LDL-C. Based on significance (p<0.05) and consistent direction of effect, a majority of replicated genotype-phentoype associations for HDL-C, LDL-C, and ln(TG) in European Americans generalized to African Americans (48%, 61%, and 57%), American Indians (45%, 64%, and 77%), and Mexican Americans/Hispanics (57%, 56%, and 86%). Overall, 16 associations generalized across all three populations. For the associations that did not generalize, differences in effect sizes, allele frequencies, and linkage disequilibrium offer clues to the next generation of association studies for these traits. PMID:21738485
Association of Phosphodiesterase 4D with ischemic stroke: a population-based case-control study.

PubMed

Woo, Daniel; Kaushal, Ritesh; Kissela, Brett; Sekar, Padmini; Wolujewicz, Michael; Pal, Prodipto; Alwell, Kathleen; Haverbusch, Mary; Ewing, Irene; Miller, Rosie; Kleindorfer, Dawn; Flaherty, Matthew; Chakraborty, Ranajit; Deka, Ranjan; Broderick, Joseph

2006-02-01

The Phosphodiesterase 4D (PDE4D) gene was reported recently to be associated with ischemic stroke in an Icelandic population. The association was found predominately with large vessel and cardioembolic stroke. However, 2 recent reports were unable to confirm this association, although a trend toward association with cardioembolic stroke was reported. None of the reports included significant proportions of blacks. We tested for genotype and haplotype association of polymorphisms of the PDE4D gene with ischemic stroke in a population-based, biracial, case-control study. A total of 357 cases of ischemic stroke and 482 stroke-free controls from the same community were examined. Single nucleotide polymorphisms (SNPs) were chosen based on significant associations reported previously. Linkage disequilibrium (LD), SNP, and haplotype association analysis was performed using PHASE 2.0 and Haploview 3.2. Although several univariate associations were identified, only 1 SNP (rs2910829) was found to be significantly associated with cardioembolic stroke among both whites and blacks. The rs152312 SNP was associated with cardioembolic stroke among whites after multiple comparison corrections. The same SNP was not associated with cardioembolic stroke among blacks. However, significant haplotype association was identified for both whites and blacks for all ischemic stroke, cardioembolic stroke, and stroke of unknown origin. Haplotype association was identified for small vessel stroke among whites. PDE4D is a risk factor for ischemic stroke and, in particular, for cardioembolic stroke, among whites and blacks. Further study of this gene is warranted.
SNPHunter: a bioinformatic software for single nucleotide polymorphism data acquisition and management.

PubMed

Wang, Lin; Liu, Simin; Niu, Tianhua; Xu, Xin

2005-03-18

Single nucleotide polymorphisms (SNPs) provide an important tool in pinpointing susceptibility genes for complex diseases and in unveiling human molecular evolution. Selection and retrieval of an optimal SNP set from publicly available databases have emerged as the foremost bottlenecks in designing large-scale linkage disequilibrium studies, particularly in case-control settings. We describe the architectural structure and implementations of a novel software program, SNPHunter, which allows for both ad hoc-mode and batch-mode SNP search, automatic SNP filtering, and retrieval of SNP data, including physical position, function class, flanking sequences at user-defined lengths, and heterozygosity from NCBI dbSNP. The SNP data extracted from dbSNP via SNPHunter can be exported and saved in plain text format for further down-stream analyses. As an illustration, we applied SNPHunter for selecting SNPs for 10 major candidate genes for type 2 diabetes, including CAPN10, FABP4, IL6, NOS3, PPARG, TNF, UCP2, CRP, ESR1, and AR. SNPHunter constitutes an efficient and user-friendly tool for SNP screening, selection, and acquisition. The executable and user's manual are available at http://www.hsph.harvard.edu/ppg/software.htm
Natural Allelic Diversity, Genetic Structure and Linkage Disequilibrium Pattern in Wild Chickpea

PubMed Central

Kujur, Alice; Das, Shouvik; Badoni, Saurabh; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.

2014-01-01

Characterization of natural allelic diversity and understanding the genetic structure and linkage disequilibrium (LD) pattern in wild germplasm accessions by large-scale genotyping of informative microsatellite and single nucleotide polymorphism (SNP) markers is requisite to facilitate chickpea genetic improvement. Large-scale validation and high-throughput genotyping of genome-wide physically mapped 478 genic and genomic microsatellite markers and 380 transcription factor gene-derived SNP markers using gel-based assay, fluorescent dye-labelled automated fragment analyser and matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass array have been performed. Outcome revealed their high genotyping success rate (97.5%) and existence of a high level of natural allelic diversity among 94 wild and cultivated Cicer accessions. High intra- and inter-specific polymorphic potential and wider molecular diversity (11–94%) along with a broader genetic base (13–78%) specifically in the functional genic regions of wild accessions was assayed by mapped markers. It suggested their utility in monitoring introgression and transferring target trait-specific genomic (gene) regions from wild to cultivated gene pool for the genetic enhancement. Distinct species/gene pool-wise differentiation, admixed domestication pattern, and differential genome-wide recombination and LD estimates/decay observed in a six structured population of wild and cultivated accessions using mapped markers further signifies their usefulness in chickpea genetics, genomics and breeding. PMID:25222488

Molecular Mapping of Restriction-Site Associated DNA Markers In Allotetraploid Upland Cotton.

PubMed

Wang, Yangkun; Ning, Zhiyuan; Hu, Yan; Chen, Jiedan; Zhao, Rui; Chen, Hong; Ai, Nijiang; Guo, Wangzhen; Zhang, Tianzhen

2015-01-01

Upland cotton (Gossypium hirsutum L., 2n = 52, AADD) is an allotetraploid, therefore the discovery of single nucleotide polymorphism (SNP) markers is difficult. The recent emergence of genome complexity reduction technologies based on the next-generation sequencing (NGS) platform has greatly expedited SNP discovery in crops with highly repetitive and complex genomes. Here we applied restriction-site associated DNA (RAD) sequencing technology for de novo SNP discovery in allotetraploid cotton. We identified 21,109 SNPs between the two parents and used these for genotyping of 161 recombinant inbred lines (RILs). Finally, a high dense linkage map comprising 4,153 loci over 3500-cM was developed based on the previous result. Using this map quantitative trait locus (QTLs) conferring fiber strength and Verticillium Wilt (VW) resistance were mapped to a more accurate region in comparison to the 1576-cM interval determined using the simple sequence repeat (SSR) genetic map. This suggests that the newly constructed map has more power and resolution than the previous SSR map. It will pave the way for the rapid identification of the marker-assisted selection in cotton breeding and cloning of QTL of interest traits.
Covariance Between Genotypic Effects and its Use for Genomic Inference in Half-Sib Families

PubMed Central

Wittenburg, Dörte; Teuscher, Friedrich; Klosa, Jan; Reinsch, Norbert

2016-01-01

In livestock, current statistical approaches utilize extensive molecular data, e.g., single nucleotide polymorphisms (SNPs), to improve the genetic evaluation of individuals. The number of model parameters increases with the number of SNPs, so the multicollinearity between covariates can affect the results obtained using whole genome regression methods. In this study, dependencies between SNPs due to linkage and linkage disequilibrium among the chromosome segments were explicitly considered in methods used to estimate the effects of SNPs. The population structure affects the extent of such dependencies, so the covariance among SNP genotypes was derived for half-sib families, which are typical in livestock populations. Conditional on the SNP haplotypes of the common parent (sire), the theoretical covariance was determined using the haplotype frequencies of the population from which the individual parent (dam) was derived. The resulting covariance matrix was included in a statistical model for a trait of interest, and this covariance matrix was then used to specify prior assumptions for SNP effects in a Bayesian framework. The approach was applied to one family in simulated scenarios (few and many quantitative trait loci) and using semireal data obtained from dairy cattle to identify genome segments that affect performance traits, as well as to investigate the impact on predictive ability. Compared with a method that does not explicitly consider any of the relationship among predictor variables, the accuracy of genetic value prediction was improved by 10–22%. The results show that the inclusion of dependence is particularly important for genomic inference based on small sample sizes. PMID:27402363
BAC-end sequence-based SNPs and Bin mapping for rapid integration of physical and genetic maps in apple.

PubMed

Han, Yuepeng; Chagné, David; Gasic, Ksenija; Rikkerink, Erik H A; Beever, Jonathan E; Gardiner, Susan E; Korban, Schuyler S

2009-03-01

A genome-wide BAC physical map of the apple, Malus x domestica Borkh., has been recently developed. Here, we report on integrating the physical and genetic maps of the apple using a SNP-based approach in conjunction with bin mapping. Briefly, BAC clones located at ends of BAC contigs were selected, and sequenced at both ends. The BAC end sequences (BESs) were used to identify candidate SNPs. Subsequently, these candidate SNPs were genetically mapped using a bin mapping strategy for the purpose of mapping the physical onto the genetic map. Using this approach, 52 (23%) out of 228 BESs tested were successfully exploited to develop SNPs. These SNPs anchored 51 contigs, spanning approximately 37 Mb in cumulative physical length, onto 14 linkage groups. The reliability of the integration of the physical and genetic maps using this SNP-based strategy is described, and the results confirm the feasibility of this approach to construct an integrated physical and genetic maps for apple.
Insights Into Upland Cotton (Gossypium hirsutum L.) Genetic Recombination Based on 3 High-Density Single-Nucleotide Polymorphism and a Consensus Map Developed Independently With Common Parents. Genomics Insights

USDA-ARS?s Scientific Manuscript database

High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement; high-density linkage maps are especially important in paleopolyploids with exce...
Systematic search for single nucleotide polymorphisms in a lymphoid tyrosine phosphatase gene (PTPN22): association between a promoter polymorphism and type 1 diabetes in Asian populations.

PubMed

Kawasaki, Eiji; Awata, Takuya; Ikegami, Hiroshi; Kobayashi, Tetsuro; Maruyama, Taro; Nakanishi, Koji; Shimada, Akira; Uga, Miho; Uga, Mho; Kurihara, Susumu; Kawabata, Yumiko; Tanaka, Shoichiro; Kanazawa, Yasuhiko; Lee, Inkyu; Eguchi, Katsumi

2006-03-15

The protein tyrosine phosphatase, nonreceptor 22 gene (PTPN22) maps to human chromosome 1p13.3-p13.1 and encodes an important negative regulator of T-cell activation, lymphoid-specific phosphatase (Lyp). Recently, the minor allele of a single-nucleotide polymorphism (SNP) at nucleotide position 1858 (rs2476601, +1858C > T) was found to be associated with type 1 diabetes. However, the degree of the association is variable among ethnic populations, suggesting the presence of other disease-associated variants in PTPN22. To examine this possibility, we carried out a systemic search for PTPN22 using direct sequencing of PCR-amplified products in the Japanese population. Association and linkage studies were also conducted in 1,690 Japanese samples, 180 Korean samples, and 472 Caucasian samples from 95 nuclear families. We identified five novel SNPs, but not the +1858C > T SNP. Of these two frequent SNPs, -1123G > C, and +2740C > T were in strong linkage disequilibrium (LD), and the -1123G > C promoter SNP was associated with acute-onset but not slow-onset type 1 diabetes in the Japanese population (odds ratio [OR] = 1.42, 95% CI = 1.07-1.89, P = 0.015). This association was observed also in Korean patients with type 1 diabetes (Mantel-Haenszel chi2= 6.543, P = 0.0105, combined OR = 1.41 95% CI = 1.09-1.82). Furthermore, the affected family-based control (AFBAC) association test and the transmission disequilibrium analysis of multiplex families of European descent from the British Diabetes Association (BDA) Warren Repository indicated that the association was stronger in -1123G > C compared to +1858C > T. In conclusion, the type 1 diabetes association with PTPN22 is confirmed, but it cannot be attributed solely to the +1858C > T variant. The promoter -1123G > C SNP is a more likely causative variant in PTPN22. 2006 Wiley-Liss, Inc.
A genome-wide linkage study of mammographic density, a risk factor for breast cancer

PubMed Central

2011-01-01

Introduction Mammographic breast density is a highly heritable (h2 > 0.6) and strong risk factor for breast cancer. We conducted a genome-wide linkage study to identify loci influencing mammographic breast density (MD). Methods Epidemiological data were assembled on 1,415 families from the Australia, Northern California and Ontario sites of the Breast Cancer Family Registry, and additional families recruited in Australia and Ontario. Families consisted of sister pairs with age-matched mammograms and data on factors known to influence MD. Single nucleotide polymorphism (SNP) genotyping was performed on 3,952 individuals using the Illumina Infinium 6K linkage panel. Results Using a variance components method, genome-wide linkage analysis was performed using quantitative traits obtained by adjusting MD measurements for known covariates. Our primary trait was formed by fitting a linear model to the square root of the percentage of the breast area that was dense (PMD), adjusting for age at mammogram, number of live births, menopausal status, weight, height, weight squared, and menopausal hormone therapy. The maximum logarithm of odds (LOD) score from the genome-wide scan was on chromosome 7p14.1-p13 (LOD = 2.69; 63.5 cM) for covariate-adjusted PMD, with a 1-LOD interval spanning 8.6 cM. A similar signal was seen for the covariate adjusted area of the breast that was dense (DA) phenotype. Simulations showed that the complete sample had adequate power to detect LOD scores of 3 or 3.5 for a locus accounting for 20% of phenotypic variance. A modest peak initially seen on chromosome 7q32.3-q34 increased in strength when only the 513 families with at least two sisters below 50 years of age were included in the analysis (LOD 3.2; 140.7 cM, 1-LOD interval spanning 9.6 cM). In a subgroup analysis, we also found a LOD score of 3.3 for DA phenotype on chromosome 12.11.22-q13.11 (60.8 cM, 1-LOD interval spanning 9.3 cM), overlapping a region identified in a previous study. Conclusions The suggestive peaks and the larger linkage signal seen in the subset of pedigrees with younger participants highlight regions of interest for further study to identify genes that determine MD, with the goal of understanding mammographic density and its involvement in susceptibility to breast cancer. PMID:22188651
BAC-End Sequence-Based SNP Mining in Allotetraploid Cotton (Gossypium) Utilizing Resequencing Data, Phylogenetic Inferences, and Perspectives for Genetic Mapping.

PubMed

Hulse-Kemp, Amanda M; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A; Scheffler, Brian E; Fang, David D; Chen, Z Jeffrey; Van Deynze, Allen; Stelly, David M

2015-04-09

A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. Copyright © 2015 Hulse-Kemp et al.
Methylphenidate side effect profile is influenced by genetic variation in the attention-deficit/hyperactivity disorder-associated CES1 gene.

PubMed

Johnson, Katherine A; Barry, Edwina; Lambert, David; Fitzgerald, Michael; McNicholas, Fiona; Kirley, Aiveen; Gill, Michael; Bellgrove, Mark A; Hawi, Ziarih

2013-12-01

A naturalistic, prospective study of the influence of genetic variation on dose prescribed, clinical response, and side effects related to stimulant medication in 77 children with attention-deficit/hyperactivity disorder (ADHD) was undertaken. The influence of genetic variation of the CES1 gene coding for carboxylesterase 1A1 (CES1A1), the major enzyme responsible for the first-pass, stereoselective metabolism of methylphenidate, was investigated. Parent- and teacher-rated behavioral questionnaires were collected at baseline when the children were medication naïve, and again at 6 weeks while they were on medication. Medication dose, prescribed at the discretion of the treating clinician, and side effects, were recorded at week 6. Blood and saliva samples were collected for genotyping. Single nucleotide polymorphisms (SNPs) were selected in the coding, non-coding and the 3' flanking region of the CES1 gene. Genetic association between CES1 variants and ADHD was investigated in an expanded sample of 265 Irish ADHD families. Analyses were conducted using analysis of covariance (ANCOVA) and logistic regression models. None of the CES1 gene variants were associated with the dose of methylphenidate provided or the clinical response recorded at the 6 week time point. An association between two CES1 SNP markers and the occurrence of sadness as a side effect of short-acting methylphenidate was found. The two associated CES1 markers were in linkage disequilibrium and were significantly associated with ADHD in a larger sample of ADHD trios. The associated CES1 markers were also in linkage disequilibrium with two SNP markers of the noradrenaline transporter gene (SLC6A2). This study found an association between two CES1 SNP markers and the occurrence of sadness as a side effect of short-acting methylphenidate. These markers were in linkage disequilibrium together and with two SNP markers of the noradrenaline transporter gene.
CIDR

Science.gov Websites

Initiation Application Schedule Service Information and Pricing Services Sample Requirements Pricing SNP Genotyping General Information Genome Wide Association Custom FFPE Sample Options Methylation Linkage Consortium Developed Mouse Whole Genome Sequencing General Information Whole Genome Whole Exome Custom
Genomic predictions can accelerate selection for resistance against Piscirickettsia salmonis in Atlantic salmon (Salmo salar).

PubMed

Bangera, Rama; Correa, Katharina; Lhorente, Jean P; Figueroa, René; Yáñez, José M

2017-01-31

Salmon Rickettsial Syndrome (SRS) caused by Piscirickettsia salmonis is a major disease affecting the Chilean salmon industry. Genomic selection (GS) is a method wherein genome-wide markers and phenotype information of full-sibs are used to predict genomic EBV (GEBV) of selection candidates and is expected to have increased accuracy and response to selection over traditional pedigree based Best Linear Unbiased Prediction (PBLUP). Widely used GS methods such as genomic BLUP (GBLUP), SNPBLUP, Bayes C and Bayesian Lasso may perform differently with respect to accuracy of GEBV prediction. Our aim was to compare the accuracy, in terms of reliability of genome-enabled prediction, from different GS methods with PBLUP for resistance to SRS in an Atlantic salmon breeding program. Number of days to death (DAYS), binary survival status (STATUS) phenotypes, and 50 K SNP array genotypes were obtained from 2601 smolts challenged with P. salmonis. The reliability of different GS methods at different SNP densities with and without pedigree were compared to PBLUP using a five-fold cross validation scheme. Heritability estimated from GS methods was significantly higher than PBLUP. Pearson's correlation between predicted GEBV from PBLUP and GS models ranged from 0.79 to 0.91 and 0.79-0.95 for DAYS and STATUS, respectively. The relative increase in reliability from different GS methods for DAYS and STATUS with 50 K SNP ranged from 8 to 25% and 27-30%, respectively. All GS methods outperformed PBLUP at all marker densities. DAYS and STATUS showed superior reliability over PBLUP even at the lowest marker density of 3 K and 500 SNP, respectively. 20 K SNP showed close to maximal reliability for both traits with little improvement using higher densities. These results indicate that genomic predictions can accelerate genetic progress for SRS resistance in Atlantic salmon and implementation of this approach will contribute to the control of SRS in Chile. We recommend GBLUP for routine GS evaluation because this method is computationally faster and the results are very similar with other GS methods. The use of lower density SNP or the combination of low density SNP and an imputation strategy may help to reduce genotyping costs without compromising gain in reliability.
Maximum likelihood estimation of linkage disequilibrium in half-sib families.

PubMed

Gomez-Raya, L

2012-05-01

Maximum likelihood methods for the estimation of linkage disequilibrium between biallelic DNA-markers in half-sib families (half-sib method) are developed for single and multifamily situations. Monte Carlo computer simulations were carried out for a variety of scenarios regarding sire genotypes, linkage disequilibrium, recombination fraction, family size, and number of families. A double heterozygote sire was simulated with recombination fraction of 0.00, linkage disequilibrium among dams of δ=0.10, and alleles at both markers segregating at intermediate frequencies for a family size of 500. The average estimates of δ were 0.17, 0.25, and 0.10 for Excoffier and Slatkin (1995), maternal informative haplotypes, and the half-sib method, respectively. A multifamily EM algorithm was tested at intermediate frequencies by computer simulation. The range of the absolute difference between estimated and simulated δ was between 0.000 and 0.008. A cattle half-sib family was genotyped with the Illumina 50K BeadChip. There were 314,730 SNP pairs for which the sire was a homo-heterozygote with average estimates of r2 of 0.115, 0.067, and 0.111 for half-sib, Excoffier and Slatkin (1995), and maternal informative haplotypes methods, respectively. There were 208,872 SNP pairs for which the sire was double heterozygote with average estimates of r2 across the genome of 0.100, 0.267, and 0.925 for half-sib, Excoffier and Slatkin (1995), and maternal informative haplotypes methods, respectively. Genome analyses for all possible sire genotypes with 829,042 tests showed that ignoring half-sib family structure leads to upward biased estimates of linkage disequilibrium. Published inferences on population structure and evolution of cattle should be revisited after accommodating existing half-sib family structure in the estimation of linkage disequilibrium.
Maximum Likelihood Estimation of Linkage Disequilibrium in Half-Sib Families

PubMed Central

Gomez-Raya, L.

2012-01-01

Maximum likelihood methods for the estimation of linkage disequilibrium between biallelic DNA-markers in half-sib families (half-sib method) are developed for single and multifamily situations. Monte Carlo computer simulations were carried out for a variety of scenarios regarding sire genotypes, linkage disequilibrium, recombination fraction, family size, and number of families. A double heterozygote sire was simulated with recombination fraction of 0.00, linkage disequilibrium among dams of δ = 0.10, and alleles at both markers segregating at intermediate frequencies for a family size of 500. The average estimates of δ were 0.17, 0.25, and 0.10 for Excoffier and Slatkin (1995), maternal informative haplotypes, and the half-sib method, respectively. A multifamily EM algorithm was tested at intermediate frequencies by computer simulation. The range of the absolute difference between estimated and simulated δ was between 0.000 and 0.008. A cattle half-sib family was genotyped with the Illumina 50K BeadChip. There were 314,730 SNP pairs for which the sire was a homo-heterozygote with average estimates of r2 of 0.115, 0.067, and 0.111 for half-sib, Excoffier and Slatkin (1995), and maternal informative haplotypes methods, respectively. There were 208,872 SNP pairs for which the sire was double heterozygote with average estimates of r2 across the genome of 0.100, 0.267, and 0.925 for half-sib, Excoffier and Slatkin (1995), and maternal informative haplotypes methods, respectively. Genome analyses for all possible sire genotypes with 829,042 tests showed that ignoring half-sib family structure leads to upward biased estimates of linkage disequilibrium. Published inferences on population structure and evolution of cattle should be revisited after accommodating existing half-sib family structure in the estimation of linkage disequilibrium. PMID:22377635
Detection of quantitative trait loci in Bos indicus and Bos taurus cattle using genome-wide association studies

PubMed Central

2013-01-01

Background The apparent effect of a single nucleotide polymorphism (SNP) on phenotype depends on the linkage disequilibrium (LD) between the SNP and a quantitative trait locus (QTL). However, the phase of LD between a SNP and a QTL may differ between Bos indicus and Bos taurus because they diverged at least one hundred thousand years ago. Here, we test the hypothesis that the apparent effect of a SNP on a quantitative trait depends on whether the SNP allele is inherited from a Bos taurus or Bos indicus ancestor. Methods Phenotype data on one or more traits and SNP genotype data for 10 181 cattle from Bos taurus, Bos indicus and composite breeds were used. All animals had genotypes for 729 068 SNPs (real or imputed). Chromosome segments were classified as originating from B. indicus or B. taurus on the basis of the haplotype of SNP alleles they contained. Consequently, SNP alleles were classified according to their sub-species origin. Three models were used for the association study: (1) conventional GWAS (genome-wide association study), fitting a single SNP effect regardless of subspecies origin, (2) interaction GWAS, fitting an interaction between SNP and subspecies-origin, and (3) best variable GWAS, fitting the most significant combination of SNP and sub-species origin. Results Fitting an interaction between SNP and subspecies origin resulted in more significant SNPs (i.e. more power) than a conventional GWAS. Thus, the effect of a SNP depends on the subspecies that the allele originates from. Also, most QTL segregated in only one subspecies, suggesting that many mutations that affect the traits studied occurred after divergence of the subspecies or the mutation became fixed or was lost in one of the subspecies. Conclusions The results imply that GWAS and genomic selection could gain power by distinguishing SNP alleles based on their subspecies origin, and that only few QTL segregate in both B. indicus and B. taurus cattle. Thus, the QTL that segregate in current populations likely resulted from mutations that occurred in one of the subspecies and can have both positive and negative effects on the traits. There was no evidence that selection has increased the frequency of alleles that increase body weight. PMID:24168700
Does probabilistic modelling of linkage disequilibrium evolution improve the accuracy of QTL location in animal pedigree?

PubMed

Cierco-Ayrolles, Christine; Dejean, Sébastien; Legarra, Andrés; Gilbert, Hélène; Druet, Tom; Ytournel, Florence; Estivals, Delphine; Oumouhou, Naïma; Mangin, Brigitte

2010-10-22

Since 2001, the use of more and more dense maps has made researchers aware that combining linkage and linkage disequilibrium enhances the feasibility of fine-mapping genes of interest. So, various method types have been derived to include concepts of population genetics in the analyses. One major drawback of many of these methods is their computational cost, which is very significant when many markers are considered. Recent advances in technology, such as SNP genotyping, have made it possible to deal with huge amount of data. Thus the challenge that remains is to find accurate and efficient methods that are not too time consuming. The study reported here specifically focuses on the half-sib family animal design. Our objective was to determine whether modelling of linkage disequilibrium evolution improved the mapping accuracy of a quantitative trait locus of agricultural interest in these populations. We compared two methods of fine-mapping. The first one was an association analysis. In this method, we did not model linkage disequilibrium evolution. Therefore, the modelling of the evolution of linkage disequilibrium was a deterministic process; it was complete at time 0 and remained complete during the following generations. In the second method, the modelling of the evolution of population allele frequencies was derived from a Wright-Fisher model. We simulated a wide range of scenarios adapted to animal populations and compared these two methods for each scenario. Our results indicated that the improvement produced by probabilistic modelling of linkage disequilibrium evolution was not significant. Both methods led to similar results concerning the location accuracy of quantitative trait loci which appeared to be mainly improved by using four flanking markers instead of two. Therefore, in animal half-sib designs, modelling linkage disequilibrium evolution using a Wright-Fisher model does not significantly improve the accuracy of the QTL location when compared to a simpler method assuming complete and constant linkage between the QTL and the marker alleles. Finally, given the high marker density available nowadays, the simpler method should be preferred as it gives accurate results in a reasonable computing time.
Development of cleaved amplified polymorphic sequence markers and a CAPS-based genetic linkage map in watermelon (Citrullus lanatus [Thunb.] Matsum. and Nakai) constructed using whole-genome re-sequencing data

PubMed Central

Liu, Shi; Gao, Peng; Zhu, Qianglong; Luan, Feishi; Davis, Angela R.; Wang, Xiaolu

2016-01-01

Cleaved amplified polymorphic sequence (CAPS) markers are useful tools for detecting single nucleotide polymorphisms (SNPs). This study detected and converted SNP sites into CAPS markers based on high-throughput re-sequencing data in watermelon, for linkage map construction and quantitative trait locus (QTL) analysis. Two inbred lines, Cream of Saskatchewan (COS) and LSW-177 had been re-sequenced and analyzed by Perl self-compiled script for CAPS marker development. 88.7% and 78.5% of the assembled sequences of the two parental materials could map to the reference watermelon genome, respectively. Comparative assembled genome data analysis provided 225,693 and 19,268 SNPs and indels between the two materials. 532 pairs of CAPS markers were designed with 16 restriction enzymes, among which 271 pairs of primers gave distinct bands of the expected length and polymorphic bands, via PCR and enzyme digestion, with a polymorphic rate of 50.94%. Using the new CAPS markers, an initial CAPS-based genetic linkage map was constructed with the F2 population, spanning 1836.51 cM with 11 linkage groups and 301 markers. 12 QTLs were detected related to fruit flesh color, length, width, shape index, and brix content. These newly CAPS markers will be a valuable resource for breeding programs and genetic studies of watermelon. PMID:27162496
A first linkage map and downy mildew resistance QTL discovery for sweet basil (Ocimum basilicum) facilitated by double digestion restriction site associated DNA sequencing (ddRADseq).

PubMed

Pyne, Robert; Honig, Josh; Vaiciunas, Jennifer; Koroch, Adolfina; Wyenandt, Christian; Bonos, Stacy; Simon, James

2017-01-01

Limited understanding of sweet basil (Ocimum basilicum L.) genetics and genome structure has reduced efficiency of breeding strategies. This is evidenced by the rapid, worldwide dissemination of basil downy mildew (Peronospora belbahrii) in the absence of resistant cultivars. In an effort to improve available genetic resources, expressed sequence tag simple sequence repeat (EST-SSR) and single nucleotide polymorphism (SNP) markers were developed and used to genotype the MRI x SB22 F2 mapping population, which segregates for response to downy mildew. SNP markers were generated from genomic sequences derived from double digestion restriction site associated DNA sequencing (ddRADseq). Disomic segregation was observed in both SNP and EST-SSR markers providing evidence of an O. basilicum allotetraploid genome structure and allowing for subsequent analysis of the mapping population as a diploid intercross. A dense linkage map was constructed using 42 EST-SSR and 1,847 SNP markers spanning 3,030.9 cM. Multiple quantitative trait loci (QTL) model (MQM) analysis identified three QTL that explained 37-55% of phenotypic variance associated with downy mildew response across three environments. A single major QTL, dm11.1 explained 21-28% of phenotypic variance and demonstrated dominant gene action. Two minor QTL dm9.1 and dm14.1 explained 5-16% and 4-18% of phenotypic variance, respectively. Evidence is provided for an additive effect between the two minor QTL and the major QTL dm11.1 increasing downy mildew susceptibility. Results indicate that ddRADseq-facilitated SNP and SSR marker genotyping is an effective approach for mapping the sweet basil genome.
A first linkage map and downy mildew resistance QTL discovery for sweet basil (Ocimum basilicum) facilitated by double digestion restriction site associated DNA sequencing (ddRADseq)

PubMed Central

Honig, Josh; Vaiciunas, Jennifer; Koroch, Adolfina; Wyenandt, Christian; Bonos, Stacy; Simon, James

2017-01-01

Limited understanding of sweet basil (Ocimum basilicum L.) genetics and genome structure has reduced efficiency of breeding strategies. This is evidenced by the rapid, worldwide dissemination of basil downy mildew (Peronospora belbahrii) in the absence of resistant cultivars. In an effort to improve available genetic resources, expressed sequence tag simple sequence repeat (EST-SSR) and single nucleotide polymorphism (SNP) markers were developed and used to genotype the MRI x SB22 F2 mapping population, which segregates for response to downy mildew. SNP markers were generated from genomic sequences derived from double digestion restriction site associated DNA sequencing (ddRADseq). Disomic segregation was observed in both SNP and EST-SSR markers providing evidence of an O. basilicum allotetraploid genome structure and allowing for subsequent analysis of the mapping population as a diploid intercross. A dense linkage map was constructed using 42 EST-SSR and 1,847 SNP markers spanning 3,030.9 cM. Multiple quantitative trait loci (QTL) model (MQM) analysis identified three QTL that explained 37–55% of phenotypic variance associated with downy mildew response across three environments. A single major QTL, dm11.1 explained 21–28% of phenotypic variance and demonstrated dominant gene action. Two minor QTL dm9.1 and dm14.1 explained 5–16% and 4–18% of phenotypic variance, respectively. Evidence is provided for an additive effect between the two minor QTL and the major QTL dm11.1 increasing downy mildew susceptibility. Results indicate that ddRADseq-facilitated SNP and SSR marker genotyping is an effective approach for mapping the sweet basil genome. PMID:28922359
A single nucleotide polymorphism in MGEA5 encoding O-GlcNAc-selective N-acetyl-beta-D glucosaminidase is associated with type 2 diabetes in Mexican Americans.

PubMed

Lehman, Donna M; Fu, Dong-Jing; Freeman, Angela B; Hunt, Kelly J; Leach, Robin J; Johnson-Pais, Teresa; Hamlington, Jeanette; Dyer, Thomas D; Arya, Rector; Abboud, Hanna; Göring, Harald H H; Duggirala, Ravindranath; Blangero, John; Konrad, Robert J; Stern, Michael P

2005-04-01

Excess O-glycosylation of proteins by O-linked beta-N-acetylglucosamine (O-GlcNAc) may be involved in the pathogenesis of type 2 diabetes. The enzyme O-GlcNAc-selective N-acetyl-beta-d glucosaminidase (O-GlcNAcase) encoded by MGEA5 on 10q24.1-q24.3 reverses this modification by catalyzing the removal of O-GlcNAc. We have previously reported the linkage of type 2 diabetes and age at diabetes onset to an overlapping region on chromosome 10q in the San Antonio Family Diabetes Study (SAFADS). In this study, we investigated menangioma-expressed antigen-5 (MGEA5) as a positional candidate gene. Twenty-four single nucleotide polymorphisms (SNPs), identified by sequencing 44 SAFADS subjects, were genotyped in 436 individuals from 27 families whose data were used in the original linkage report. Association tests indicated significant association of a novel SNP with the traits diabetes (P = 0.0128, relative risk = 2.77) and age at diabetes onset (P = 0.0017). The associated SNP is located in intron 10, which contains an alternate stop codon and may lead to decreased expression of the 130-kDa isoform, the isoform predicted to contain the O-GlcNAcase activity. We investigated whether this variant was responsible for the original linkage signal. The variance attributed to this SNP accounted for approximately 25% of the logarithm of odds. These results suggest that this variant within the MGEA5 gene may increase diabetes risk in Mexican Americans.
Construction of a high-density high-resolution genetic map and its integration with BAC-based physical map in channel catfish

USDA-ARS?s Scientific Manuscript database

Construction of genetic linkage map is essential for genetic and genomic studies. Recent advances in sequencing and genotyping technologies made it possible to generate high-density and high-resolution genetic linkage maps, especially for the organisms lacking extensive genomic resources. In the pre...
Breeding and Genetics Symposium: networks and pathways to guide genomic selection.

PubMed

Snelling, W M; Cushman, R A; Keele, J W; Maltecca, C; Thomas, M G; Fortes, M R S; Reverter, A

2013-02-01

Many traits affecting profitability and sustainability of meat, milk, and fiber production are polygenic, with no single gene having an overwhelming influence on observed variation. No knowledge of the specific genes controlling these traits has been needed to make substantial improvement through selection. Significant gains have been made through phenotypic selection enhanced by pedigree relationships and continually improving statistical methodology. Genomic selection, recently enabled by assays for dense SNP located throughout the genome, promises to increase selection accuracy and accelerate genetic improvement by emphasizing the SNP most strongly correlated to phenotype although the genes and sequence variants affecting phenotype remain largely unknown. These genomic predictions theoretically rely on linkage disequilibrium (LD) between genotyped SNP and unknown functional variants, but familial linkage may increase effectiveness when predicting individuals related to those in the training data. Genomic selection with functional SNP genotypes should be less reliant on LD patterns shared by training and target populations, possibly allowing robust prediction across unrelated populations. Although the specific variants causing polygenic variation may never be known with certainty, a number of tools and resources can be used to identify those most likely to affect phenotype. Associations of dense SNP genotypes with phenotype provide a 1-dimensional approach for identifying genes affecting specific traits; in contrast, associations with multiple traits allow defining networks of genes interacting to affect correlated traits. Such networks are especially compelling when corroborated by existing functional annotation and established molecular pathways. The SNP occurring within network genes, obtained from public databases or derived from genome and transcriptome sequences, may be classified according to expected effects on gene products. As illustrated by functionally informed genomic predictions being more accurate than naive whole-genome predictions of beef tenderness, coupling evidence from livestock genotypes, phenotypes, gene expression, and genomic variants with existing knowledge of gene functions and interactions may provide greater insight into the genes and genomic mechanisms affecting polygenic traits and facilitate functional genomic selection for economically important traits.

Linkage Disequilibrium and Inversion-Typing of the Drosophila melanogaster Genome Reference Panel

PubMed Central

Houle, David; Márquez, Eladio J.

2015-01-01

We calculated the linkage disequilibrium between all pairs of variants in the Drosophila Genome Reference Panel with minor allele count ≥5. We used r2 ≥ 0.5 as the cutoff for a highly correlated SNP. We make available the list of all highly correlated SNPs for use in association studies. Seventy-six percent of variant SNPs are highly correlated with at least one other SNP, and the mean number of highly correlated SNPs per variant over the whole genome is 83.9. Disequilibrium between distant SNPs is also common when minor allele frequency (MAF) is low: 37% of SNPs with MAF < 0.1 are highly correlated with SNPs more than 100 kb distant. Although SNPs within regions with polymorphic inversions are highly correlated with somewhat larger numbers of SNPs, and these correlated SNPs are on average farther away, the probability that a SNP in such regions is highly correlated with at least one other SNP is very similar to SNPs outside inversions. Previous karyotyping of the DGRP lines has been inconsistent, and we used LD and genotype to investigate these discrepancies. When previous studies agreed on inversion karyotype, our analysis was almost perfectly concordant with those assignments. In discordant cases, and for inversion heterozygotes, our results suggest errors in two previous analyses or discordance between genotype and karyotype. Heterozygosities of chromosome arms are, in many cases, surprisingly highly correlated, suggesting strong epsistatic selection during the inbreeding and maintenance of the DGRP lines. PMID:26068573
Linkage Disequilibrium and Inversion-Typing of the Drosophila melanogaster Genome Reference Panel.

PubMed

Houle, David; Márquez, Eladio J

2015-06-10

We calculated the linkage disequilibrium between all pairs of variants in the Drosophila Genome Reference Panel with minor allele count ≥5. We used r(2) ≥ 0.5 as the cutoff for a highly correlated SNP. We make available the list of all highly correlated SNPs for use in association studies. Seventy-six percent of variant SNPs are highly correlated with at least one other SNP, and the mean number of highly correlated SNPs per variant over the whole genome is 83.9. Disequilibrium between distant SNPs is also common when minor allele frequency (MAF) is low: 37% of SNPs with MAF < 0.1 are highly correlated with SNPs more than 100 kb distant. Although SNPs within regions with polymorphic inversions are highly correlated with somewhat larger numbers of SNPs, and these correlated SNPs are on average farther away, the probability that a SNP in such regions is highly correlated with at least one other SNP is very similar to SNPs outside inversions. Previous karyotyping of the DGRP lines has been inconsistent, and we used LD and genotype to investigate these discrepancies. When previous studies agreed on inversion karyotype, our analysis was almost perfectly concordant with those assignments. In discordant cases, and for inversion heterozygotes, our results suggest errors in two previous analyses or discordance between genotype and karyotype. Heterozygosities of chromosome arms are, in many cases, surprisingly highly correlated, suggesting strong epsistatic selection during the inbreeding and maintenance of the DGRP lines. Copyright © 2015 Houle and Márquez.
A Genome Scan Conducted in a Multigenerational Pedigree with Convergent Strabismus Supports a Complex Genetic Determinism

PubMed Central

Georges, Anouk; Cambisano, Nadine; Ahariz, Naïma; Karim, Latifa; Georges, Michel

2013-01-01

A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, respectively on chromosomes 8 and 18. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree. PMID:24376720
A genome scan conducted in a multigenerational pedigree with convergent strabismus supports a complex genetic determinism.

PubMed

Georges, Anouk; Cambisano, Nadine; Ahariz, Naïma; Karim, Latifa; Georges, Michel

2013-01-01

A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, respectively on chromosomes 8 and 18. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree.
Genetic studies on the APOA1-C3-A5 gene cluster in Asian Indians with premature coronary artery disease

PubMed Central

Shanker, Jayashree; Perumal, Ganapathy; Rao, Veena S; Khadrinarasimhiah, Natesha B; John, Shibu; Hebbagodi, Sridhara; Mukherjee, Manjari; Kakkar, Vijay V

2008-01-01

Background The APOA1-C3-A5 gene cluster plays an important role in the regulation of lipids. Asian Indians have an increased tendency for abnormal lipid levels and high risk of Coronary Artery Disease (CAD). Therefore, the present study aimed to elucidate the relationship of four single nucleotide polymorphisms (SNPs) in the Apo11q cluster, namely the -75G>A, +83C>T SNPs in the APOA1 gene, the Sac1 SNP in the APOC3 gene and the S19W variant in the APOA5 gene to plasma lipids and CAD in 190 affected sibling pairs (ASPs) belonging to Asian Indian families with a strong CAD history. Methods & results Genotyping and lipid assays were carried out using standard protocols. Plasma lipids showed a strong heritability (h2 48% – 70%; P < 0.0001). A subset of 77 ASPs with positive sign of Logarithm of Odds (LOD) score showed significant linkage to CAD trait by multi-point analysis (LOD score 7.42, P < 0.001) and to Sac1 (LOD score 4.49) and -75G>A (LOD score 2.77) SNPs by single-point analysis (P < 0.001). There was significant proportion of mean allele sharing (pi) for the Sac1 (pi 0.59), -75G>A (pi 0.56) and +83C>T (pi 0.52) (P < 0.001) SNPs, respectively. QTL analysis showed suggestive evidence of linkage of the Sac1 SNP to Total Cholesterol (TC), High Density Lipoprotein-cholesterol (HDL-C) and Apolipoprotein B (ApoB) with LOD scores of 1.42, 1.72 and 1.19, respectively (P < 0.01). The Sac1 and -75G>A SNPs along with hypertension showed maximized correlations with TC, TG and Apo B by association analysis. Conclusion The APOC3-Sac1 SNP is an important genetic variant that is associated with CAD through its interaction with plasma lipids and other standard risk factors among Asian Indians. PMID:18801202
Mapping autism risk loci using genetic linkage and chromosomal rearrangements

PubMed Central

Szatmari, Peter; Paterson, Andrew; Zwaigenbaum, Lonnie; Roberts, Wendy; Brian, Jessica; Liu, Xiao-Qing; Vincent, John; Skaug, Jennifer; Thompson, Ann; Senman, Lili; Feuk, Lars; Qian, Cheng; Bryson, Susan; Jones, Marshall; Marshall, Christian; Scherer, Stephen; Vieland, Veronica; Bartlett, Christopher; Mangin, La Vonne; Goedken, Rhinda; Segre, Alberto; Pericak-Vance, Margaret; Cuccaro, Michael; Gilbert, John; Wright, Harry; Abramson, Ruth; Betancur, Catalina; Bourgeron, Thomas; Gillberg, Christopher; Leboyer, Marion; Buxbaum, Joseph; Davis, Kenneth; Hollander, Eric; Silverman, Jeremy; Hallmayer, Joachim; Lotspeich, Linda; Sutcliffe, James; Haines, Jonathan; Folstein, Susan; Piven, Joseph; Wassink, Thomas; Sheffield, Val; Geschwind, Daniel; Bucan, Maja; Brown, Ted; Cantor, Rita; Constantino, John; Gilliam, Conrad; Herbert, Martha; Lajonchere, Clara; Ledbetter, David; Lese-Martin, Christa; Miller, Janet; Nelson, Stan; Samango-Sprouse, Carol; Spence, Sarah; State, Matthew; Tanzi, Rudolph; Coon, Hilary; Dawson, Geraldine; Devlin, Bernie; Estes, Annette; Flodman, Pamela; Klei, Lambertus; Mcmahon, William; Minshew, Nancy; Munson, Jeff; Korvatska, Elena; Rodier, Patricia; Schellenberg, Gerard; Smith, Moyra; Spence, Anne; Stodgell, Chris; Tepper, Ping Guo; Wijsman, Ellen; Yu, Chang-En; Rogé, Bernadette; Mantoulan, Carine; Wittemeyer, Kerstin; Poustka, Annemarie; Felder, Bärbel; Klauck, Sabine; Schuster, Claudia; Poustka, Fritz; Bölte, Sven; Feineis-Matthews, Sabine; Herbrecht, Evelyn; Schmötzer, Gabi; Tsiantis, John; Papanikolaou, Katerina; Maestrini, Elena; Bacchelli, Elena; Blasi, Francesca; Carone, Simona; Toma, Claudio; Van Engeland, Herman; De Jonge, Maretha; Kemner, Chantal; Koop, Frederieke; Langemeijer, Marjolein; Hijmans, Channa; Staal, Wouter; Baird, Gillian; Bolton, Patrick; Rutter, Michael; Weisblatt, Emma; Green, Jonathan; Aldred, Catherine; Wilkinson, Julie-Anne; Pickles, Andrew; Le Couteur, Ann; Berney, Tom; Mcconachie, Helen; Bailey, Anthony; Francis, Kostas; Honeyman, Gemma; Hutchinson, Aislinn; Parr, Jeremy; Wallace, Simon; Monaco, Anthony; Barnby, Gabrielle; Kobayashi, Kazuhiro; Lamb, Janine; Sousa, Ines; Sykes, Nuala; Cook, Edwin; Guter, Stephen; Leventhal, Bennett; Salt, Jeff; Lord, Catherine; Corsello, Christina; Hus, Vanessa; Weeks, Daniel; Volkmar, Fred; Tauber, Maïté; Fombonne, Eric; Shih, Andy; Meyer, Kacie

2007-01-01

Autism spectrum disorders (ASD) are common, heritable neurodevelopmental conditions. The genetic architecture of ASD is complex, requiring large samples to overcome heterogeneity. Here we broaden coverage and sample size relative to other studies of ASD by using Affymetrix 10K single nucleotide polymorphism (SNP) arrays and 1168 families with ≥ 2 affected individuals to perform the largest linkage scan to date, while also analyzing copy number variation (CNV) in these families. Linkage and CNV analyses implicate chromosome 11p12-p13 and neurexins, respectively, amongst other candidate loci. Neurexins team with previously-implicated neuroligins for glutamatergic synaptogenesis, highlighting glutamate-related genes as promising candidates for ASD. PMID:17322880
Comprehensive replication of the relationship between myopia-related genes and refractive errors in a large Japanese cohort.

PubMed

Yoshikawa, Munemitsu; Yamashiro, Kenji; Miyake, Masahiro; Oishi, Maho; Akagi-Kurashige, Yumiko; Kumagai, Kyoko; Nakata, Isao; Nakanishi, Hideo; Oishi, Akio; Gotoh, Norimoto; Yamada, Ryo; Matsuda, Fumihiko; Yoshimura, Nagahisa

2014-10-21

We investigated the association between refractive error in a Japanese population and myopia-related genes identified in two recent large-scale genome-wide association studies. Single-nucleotide polymorphisms (SNPs) in 51 genes that were reported by the Consortium for Refractive Error and Myopia and/or the 23andMe database were genotyped in 3712 healthy Japanese volunteers from the Nagahama Study using HumanHap610K Quad, HumanOmni2.5M, and/or HumanExome Arrays. To evaluate the association between refractive error and recently identified myopia-related genes, we used three approaches to perform quantitative trait locus analyses of mean refractive error in both eyes of the participants: per-SNP, gene-based top-SNP, and gene-based all-SNP analyses. Association plots of successfully replicated genes also were investigated. In our per-SNP analysis, eight myopia gene associations were replicated successfully: GJD2, RASGRF1, BICC1, KCNQ5, CD55, CYP26A1, LRRC4C, and B4GALNT2.Seven additional gene associations were replicated in our gene-based analyses: GRIA4, BMP2, QKI, BMP4, SFRP1, SH3GL2, and EHBP1L1. The signal strength of the reported SNPs and their tagging SNPs increased after considering different linkage disequilibrium patterns across ethnicities. Although two previous studies suggested strong associations between PRSS56, LAMA2, TOX, and RDH5 and myopia, we could not replicate these results. Our results confirmed the significance of the myopia-related genes reported previously and suggested that gene-based replication analyses are more effective than per-SNP analyses. Our comparison with two previous studies suggested that BMP3 SNPs cause myopia primarily in Caucasian populations, while they may exhibit protective effects in Asian populations. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.
SiNoPsis: Single Nucleotide Polymorphisms selection and promoter profiling.

PubMed

Boloc, Daniel; Rodríguez, Natalia; Gassó, Patricia; Abril, Josep F; Bernardo, Miquel; Lafuente, Amalia; Mas, Sergi

2017-09-14

The selection of a Single Nucleotide Polymorphism (SNP) using bibliographic methods can be a very time-consuming task. Moreover, a SNP selected in this way may not be easily visualized in its genomic context by a standard user hoping to correlate it with other valuable information. Here we propose a web form built on top of Circos that can assist SNP-centred screening, based on their location in the genome and the regulatory modules they can disrupt. Its use may allow researchers to prioritize SNPs in genotyping and disease studies. SiNoPsis is bundled as a web portal. It focuses on the different structures involved in the genomic expression of a gene, especially those found in the core promoter upstream region. These structures include transcription factor binding sites (for promoter and enhancer signals), histones, and promoter flanking regions. Additionally, the tool provides eQTL and linkage disequilibrium (LD) properties for a given SNP query, yielding further clues about other indirectly associated SNPs. Possible disruptions of the aforementioned structures affecting gene transcription are reported using multiple resource databases. SiNoPsis has a simple user-friendly interface, which allows single queries by gene symbol, genomic coordinates, Ensembl gene identifiers, RefSeq transcript identifiers and SNPs. It is the only portal providing useful SNP selection based on regulatory modules and LD with functional variants in both textual and graphic modes (by properly defining the arguments and parameters needed to run Circos). SiNoPsis is freely available at https://compgen.bio.ub.edu/SiNoPsis /. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Mapping the non-darkening trait from 'Wit-rood boontje' in bean (Phaseolus vulgaris).

PubMed

Erfatpour, M; Navabi, A; Pauls, K P

2018-06-01

A QTL for non-darkening seed coat from 'Wit-rood boontje' was mapped in pinto bean population on chromosome Pv10, comprising 40 candidate genes. The seed coat colour darkens with age in some market classes of dry beans (Phaseolus vulgaris), including pinto bean. Beans with darkened seed coats are discounted in the market place, since they are believed to be associated with lower nutritional quality, increased cooking time, and decreased palatability. The objective of this research was to map a non-darkening gene from a cranberry-like bean 'Wit-rood boontje' using a recombinant inbred line population, derived from a cross between 'Wit-rood boontje' and a slow-darkening pinto bean (1533-15). The population was characterized for seed phenotype and genotyped with an Illumina BeadChip. A genetic linkage map was constructed with 1327 informative SNP markers plus an STS marker (OL4S 500 ) and an SSR marker (Pvsd-0028), previously associated with the J gene and Sd gene, respectively, as well as non-darkening and slow-darkening phenotypes. The linkage map spanned 1253.2 cM over 11 chromosomes. A major QTL for the non-darkening trait was flanked by SNP 715646341 and SNP 715646348 on chromosome Pv10. The region, which spanned 13.2 cM, explained 48% of the phenotypic variation for seed coat darkening. Forty candidate genes were identified in the QTL interval. This information can be used to develop a gene-based marker to facilitate breeding non-darkening pinto beans and may lead to a better understanding of the molecular mechanism for the postharvest darkening phenomenon in pinto bean.
Genome-wide association mapping reveals a rich genetic architecture of stripe rust resistance loci in emmer wheat (Triticum turgidum ssp. dicoccum).

PubMed

Liu, Weizhen; Maccaferri, Marco; Chen, Xianming; Laghetti, Gaetano; Pignone, Domenico; Pumphrey, Michael; Tuberosa, Roberto

2017-11-01

SNP-based genome scanning in worldwide domesticated emmer germplasm showed high genetic diversity, rapid linkage disequilibrium decay and 51 loci for stripe rust resistance, a large proportion of which were novel. Cultivated emmer wheat (Triticum turgidum ssp. dicoccum), one of the oldest domesticated crops in the world, is a potentially rich reservoir of variation for improvement of resistance/tolerance to biotic and abiotic stresses in wheat. Resistance to stripe rust (Puccinia striiformis f. sp. tritici) in emmer wheat has been under-investigated. Here, we employed genome-wide association (GWAS) mapping with a mixed linear model to dissect effective stripe rust resistance loci in a worldwide collection of 176 cultivated emmer wheat accessions. Adult plants were tested in six environments and seedlings were evaluated with five races from the United States and one from Italy under greenhouse conditions. Five accessions were resistant across all experiments. The panel was genotyped with the wheat 90,000 Illumina iSelect single nucleotide polymorphism (SNP) array and 5106 polymorphic SNP markers with mapped positions were obtained. A high level of genetic diversity and fast linkage disequilibrium decay were observed. In total, we identified 14 loci associated with field resistance in multiple environments. Thirty-seven loci were significantly associated with all-stage (seedling) resistance and six of them were effective against multiple races. Of the 51 total loci, 29 were mapped distantly from previously reported stripe rust resistance genes or quantitative trait loci and represent newly discovered resistance loci. Our results suggest that GWAS is an effective method for characterizing genes in cultivated emmer wheat and confirm that emmer wheat is a rich source of stripe rust resistance loci that can be used for wheat improvement.
Linkage analysis of autopsy-confirmed familial Alzheimer disease supports an Alzheimer disease locus in 8q24.

PubMed

Sillén, Anna; Brohede, Jesper; Forsell, Charlotte; Lilius, Lena; Andrade, Jorge; Odeberg, Jacob; Kimura, Toru; Winblad, Bengt; Graff, Caroline

2011-01-01

We have previously reported the results of an extended genome-wide scan of Swedish Alzheimer disease (AD)-affected families; in this paper, we analyzed a subset of these families with autopsy-confirmed AD. We report the fine-mapping, using both microsatellite markers and single-nucleotide polymorphisms (SNPs), in the observed maximum logarithm of the odds (LOD)-2 unit (LOD(max)-2) region under the identified linkage peak, linkage analysis of the fine-mapping data with additionally analyzed pedigrees, and association analysis of SNPs selected from candidate genes in the linked interval. The subset was made on the criterion of at least one autopsy-confirmed AD case per family, resulting in 24 families. Linkage analysis of a family subset having at least one autopsy-confirmed AD case showed a significant nonparametric single-point LOD score of 4.4 in 8q24. Fine-mapping under the linkage peak with 10 microsatellite markers yielded an increase in the multipoint (mpt) LOD score from 2.1 to 3.0. SNP genotyping was performed on 21 selected candidate transcripts of the LOD(max)-2 region. Both family-based association and linkage analysis were performed on extended material from 30 families, resulting in a suggestive linkage at peak marker rs6577853 (mpt LOD score = 2.4). The 8q24 region has been implicated to be involved in AD etiology. Copyright © 2011 S. Karger AG, Basel.
High-density linkage mapping in a pine tree reveals a genomic region associated with inbreeding depression and provides clues to the extent and distribution of meiotic recombination

PubMed Central

2013-01-01

Background The availability of a large expressed sequence tags (EST) resource and recent advances in high-throughput genotyping technology have made it possible to develop highly multiplexed SNP arrays for multi-objective genetic applications, including the construction of meiotic maps. Such approaches are particularly useful in species with a large genome size, precluding the use of whole-genome shotgun assembly with current technologies. Results In this study, a 12 k-SNP genotyping array was developed for maritime pine from an extensive EST resource assembled into a unigene set. The offspring of three-generation outbred and inbred mapping pedigrees were then genotyped. The inbred pedigree consisted of a classical F2 population resulting from the selfing of a single inter-provenance (Landes x Corsica) hybrid tree, whereas the outbred pedigree (G2) resulted from a controlled cross of two intra-provenance (Landes x Landes) hybrid trees. This resulted in the generation of three linkage maps based on SNP markers: one from the parental genotype of the F2 population (1,131 markers in 1,708 centimorgan (cM)), and one for each parent of the G2 population (1,015 and 1,110 markers in 1,447 and 1,425 cM for the female and male parents, respectively). A comparison of segregation patterns in the progeny obtained from the two types of mating (inbreeding and outbreeding) led to the identification of a chromosomal region carrying an embryo viability locus with a semi-lethal allele. Following selfing and segregation, zygote mortality resulted in a deficit of Corsican homozygous genotypes in the F2 population. This dataset was also used to study the extent and distribution of meiotic recombination along the length of the chromosomes and the effect of sex and/or genetic background on recombination. The genetic background of trees in which meiotic recombination occurred was found to have a significant effect on the frequency of recombination. Furthermore, only a small proportion of the recombination hot- and cold-spots were common to all three genotypes, suggesting that the spatial pattern of recombination was genetically variable. Conclusion This study led to the development of classical genomic tools for this ecologically and economically important species. It also identified a chromosomal region bearing a semi-lethal recessive allele and demonstrated the genetic variability of recombination rate over the genome. PMID:23597128
SNPchiMp: a database to disentangle the SNPchip jungle in bovine livestock.

PubMed

Nicolazzi, Ezequiel Luis; Picciolini, Matteo; Strozzi, Francesco; Schnabel, Robert David; Lawley, Cindy; Pirani, Ali; Brew, Fiona; Stella, Alessandra

2014-02-11

Currently, six commercial whole-genome SNP chips are available for cattle genotyping, produced by two different genotyping platforms. Technical issues need to be addressed to combine data that originates from the different platforms, or different versions of the same array generated by the manufacturer. For example: i) genome coordinates for SNPs may refer to different genome assemblies; ii) reference genome sequences are updated over time changing the positions, or even removing sequences which contain SNPs; iii) not all commercial SNP ID's are searchable within public databases; iv) SNPs can be coded using different formats and referencing different strands (e.g. A/B or A/C/T/G alleles, referencing forward/reverse, top/bottom or plus/minus strand); v) Due to new information being discovered, higher density chips do not necessarily include all the SNPs present in the lower density chips; and, vi) SNP IDs may not be consistent across chips and platforms. Most researchers and breed associations manage SNP data in real-time and thus require tools to standardise data in a user-friendly manner. Here we present SNPchiMp, a MySQL database linked to an open access web-based interface. Features of this interface include, but are not limited to, the following functions: 1) referencing the SNP mapping information to the latest genome assembly, 2) extraction of information contained in dbSNP for SNPs present in all commercially available bovine chips, and 3) identification of SNPs in common between two or more bovine chips (e.g. for SNP imputation from lower to higher density). In addition, SNPchiMp can retrieve this information on subsets of SNPs, accessing such data either via physical position on a supported assembly, or by a list of SNP IDs, rs or ss identifiers. This tool combines many different sources of information, that otherwise are time consuming to obtain and difficult to integrate. The SNPchiMp not only provides the information in a user-friendly format, but also enables researchers to perform a large number of operations with a few clicks of the mouse. This significantly reduces the time needed to execute the large number of operations required to manage SNP data.
Construction of the model for the Genetic Analysis Workshop 14 simulated data: genotype-phenotype relationships, gene interaction, linkage, association, disequilibrium, and ascertainment effects for a complex phenotype.

PubMed

Greenberg, David A; Zhang, Junying; Shmulewitz, Dvora; Strug, Lisa J; Zimmerman, Regina; Singh, Veena; Marathe, Sudhir

2005-12-30

The Genetic Analysis Workshop 14 simulated dataset was designed 1) To test the ability to find genes related to a complex disease (such as alcoholism). Such a disease may be given a variety of definitions by different investigators, have associated endophenotypes that are common in the general population, and is likely to be not one disease but a heterogeneous collection of clinically similar, but genetically distinct, entities. 2) To observe the effect on genetic analysis and gene discovery of a complex set of gene x gene interactions. 3) To allow comparison of microsatellite vs. large-scale single-nucleotide polymorphism (SNP) data. 4) To allow testing of association to identify the disease gene and the effect of moderate marker x marker linkage disequilibrium. 5) To observe the effect of different ascertainment/disease definition schemes on the analysis. Data was distributed in two forms. Data distributed to participants contained about 1,000 SNPs and 400 microsatellite markers. Internet-obtainable data consisted of a finer 10,000 SNP map, which also contained data on controls. While disease characteristics and parameters were constant, four "studies" used varying ascertainment schemes based on differing beliefs about disease characteristics. One of the studies contained multiplex two- and three-generation pedigrees with at least four affected members. The simulated disease was a psychiatric condition with many associated behaviors (endophenotypes), almost all of which were genetic in origin. The underlying disease model contained four major genes and two modifier genes. The four major genes interacted with each other to produce three different phenotypes, which were themselves heterogeneous. The population parameters were calibrated so that the major genes could be discovered by linkage analysis in most datasets. The association evidence was more difficult to calibrate but was designed to find statistically significant association in 50% of datasets. We also simulated some marker x marker linkage disequilibrium around some of the genes and also in areas without disease genes. We tried two different methods to simulate the linkage disequilibrium.
A medium density genetic map and QTL for behavioral and production traits in Japanese quail.

PubMed

Recoquillay, Julien; Pitel, Frédérique; Arnould, Cécile; Leroux, Sophie; Dehais, Patrice; Moréno, Carole; Calandreau, Ludovic; Bertin, Aline; Gourichon, David; Bouchez, Olivier; Vignal, Alain; Fariello, Maria Ines; Minvielle, Francis; Beaumont, Catherine; Leterrier, Christine; Le Bihan-Duval, Elisabeth

2015-01-22

Behavioral traits such as sociability, emotional reactivity and aggressiveness are major factors in animal adaptation to breeding conditions. In order to investigate the genetic control of these traits as well as their relationships with production traits, a study was undertaken on a large second generation cross (F2) between two lines of Japanese Quail divergently selected on their social reinstatement behavior. All the birds were measured for several social behaviors (social reinstatement, response to social isolation, sexual motivation, aggression), behaviors measuring the emotional reactivity of the birds (reaction to an unknown object, tonic immobility reaction), and production traits (body weight and egg production). We report the results of the first genome-wide QTL detection based on a medium density SNP panel obtained from whole genome sequencing of a pool of individuals from each divergent line. A genetic map was constructed using 2145 markers among which 1479 could be positioned on 28 different linkage groups. The sex-averaged linkage map spanned a total of 3057 cM with an average marker spacing of 2.1 cM. With the exception of a few regions, the marker order was the same in Japanese Quail and the chicken, which confirmed a well conserved synteny between the two species. The linkage analyses performed using QTLMAP software revealed a total of 45 QTLs related either to behavioral (23) or production (22) traits. The most numerous QTLs (15) concerned social motivation traits. Interestingly, our results pinpointed putative pleiotropic regions which controlled emotional reactivity and body-weight of birds (on CJA5 and CJA8) or their social motivation and the onset of egg laying (on CJA19). This study identified several QTL regions for social and emotional behaviors in the Quail. Further research will be needed to refine the QTL and confirm or refute the role of candidate genes, which were suggested by bioinformatics analysis. It can be hoped that the identification of genes and polymorphisms related to behavioral traits in the quail will have further applications for other poultry species (especially the chicken) and will contribute to solving animal welfare issues in poultry production.
Analysis of whole exome sequencing with cardiometabolic traits using family-based linkage and association in the IRAS Family Study

PubMed Central

Tabb, Keri L.; Hellwege, Jacklyn N.; Palmer, Nicholette D.; Dimitrov, Latchezar; Sajuthi, Satria; Taylor, Kent D.; NG, Maggie C.Y.; Hawkins, Gregory A.; Chen, Yii-Der Ida; Brown, W. Mark; McWilliams, David; Williams, Adrienne; Lorenzo, Carlos; Norris, Jill M.; Long, Jirong; Rotter, Jerome I.; Curran, Joanne E.; Blangero, John; Wagenknecht, Lynne E.; Langefeld, Carl D.; Bowden, Donald W.

2017-01-01

Summary Family-based methods are a potentially powerful tool to identify trait-defining genetic variants in extended families, particularly when used to complement conventional association analysis. We utilized two-point linkage analysis and single variant association analysis to evaluate whole exome sequencing (WES) data from 1,205 Hispanic Americans (78 families) from the Insulin Resistance Atherosclerosis Family Study. WES identified 211,612 variants above the minor allele frequency threshold of ≥0.005. These variants were tested for linkage and/or association with 50 cardiometabolic traits after quality control checks. Two-point linkage analysis yielded 10,580,600 LOD scores with 1,148 LOD scores ≥3, 183 LOD scores ≥4, and 29 LOD scores ≥5. The maximal novel LOD score was 5.50 for rs2289043:T>C, in UNC5C with subcutaneous adipose tissue volume. Association analysis identified 13 variants attaining genome-wide significance (p<5×10-08), with the strongest association between rs651821:C>T in APOA5, and triglyceride levels (p=3.67×10-10). Overall, there was a 5.2-fold increase in the number of informative variants detected by WES compared to exome chip analysis in this population, nearly 30% of which were novel variants relative to dbSNP build 138. Thus, integration of results from two-point linkage and single-variant association analysis from WES data enabled identification of novel signals potentially contributing to cardiometabolic traits. PMID:28067407
Genetic mapping and legume synteny of aphid resistance in African cowpea (Vigna unguiculata L. Walp.) grown in California.

PubMed

Huynh, Bao-Lam; Ehlers, Jeffrey D; Ndeve, Arsenio; Wanamaker, Steve; Lucas, Mitchell R; Close, Timothy J; Roberts, Philip A

The cowpea aphid Aphis craccivora Koch (CPA) is a destructive insect pest of cowpea, a staple legume crop in Sub-Saharan Africa and other semiarid warm tropics and subtropics. In California, CPA causes damage on all local cultivars from early vegetative to pod development growth stages. Sources of CPA resistance are available in African cowpea germplasm. However, their utilization in breeding is limited by the lack of information on inheritance, genomic location and marker linkage associations of the resistance determinants. In the research reported here, a recombinant inbred line (RIL) population derived from a cross between a susceptible California blackeye cultivar (CB27) and a resistant African breeding line (IT97K-556-6) was genotyped with 1,536 SNP markers. The RILs and parents were phenotyped for CPA resistance using field-based screenings during two main crop seasons in a 'hotspot' location for this pest within the primary growing region of the Central Valley of California. One minor and one major quantitative trait locus (QTL) were consistently mapped on linkage groups 1 and 7, respectively, both with favorable alleles contributed from IT97K-556-6. The major QTL appeared dominant based on a validation test in a related F2 population. SNP markers flanking each QTL were positioned in physical contigs carrying genes involved in plant defense based on synteny with related legumes. These markers could be used to introgress resistance alleles from IT97K-556-6 into susceptible local blackeye varieties by backcrossing.
A tool for selecting SNPs for association studies based on observed linkage disequilibrium patterns.

PubMed

De La Vega, Francisco M; Isaac, Hadar I; Scafe, Charles R

2006-01-01

The design of genetic association studies using single-nucleotide polymorphisms (SNPs) requires the selection of subsets of the variants providing high statistical power at a reasonable cost. SNPs must be selected to maximize the probability that a causative mutation is in linkage disequilibrium (LD) with at least one marker genotyped in the study. The HapMap project performed a genome-wide survey of genetic variation with about a million SNPs typed in four populations, providing a rich resource to inform the design of association studies. A number of strategies have been proposed for the selection of SNPs based on observed LD, including construction of metric LD maps and the selection of haplotype tagging SNPs. Power calculations are important at the study design stage to ensure successful results. Integrating these methods and annotations can be challenging: the algorithms required to implement these methods are complex to deploy, and all the necessary data and annotations are deposited in disparate databases. Here, we present the SNPbrowser Software, a freely available tool to assist in the LD-based selection of markers for association studies. This stand-alone application provides fast query capabilities and swift visualization of SNPs, gene annotations, power, haplotype blocks, and LD map coordinates. Wizards implement several common SNP selection workflows including the selection of optimal subsets of SNPs (e.g. tagging SNPs). Selected SNPs are screened for their conversion potential to either TaqMan SNP Genotyping Assays or the SNPlex Genotyping System, two commercially available genotyping platforms, expediting the set-up of genetic studies with an increased probability of success.
Anchoring Linkage Groups of the Rosa Genetic Map to Physical Chromosomes with Tyramide-FISH and EST-SNP Markers

PubMed Central

Kirov, Ilya; Van Laere, Katrijn; De Riek, Jan; De Keyser, Ellen; Van Roy, Nadine; Khrustaleva, Ludmila

2014-01-01

In order to anchor Rosa linkage groups to physical chromosomes, a combination of the Tyramide-FISH technology and the modern molecular marker system based on High Resolution Melting (HRM) is an efficient approach. Although, Tyramide-FISH is a very promising technique for the visualization of short DNA probes, it is very challenging for plant species with small chromosomes such as Rosa. In this study, we successfully applied the Tyramide-FISH technique for Rosa and compared different detection systems. An indirect detection system exploiting biotinylated tyramides was shown to be the most suitable technique for reliable signal detection. Three gene fragments with a size of 1100 pb–1700 bp (Phenylalanine Ammonia Lyase, Pyrroline-5-Carboxylate Synthase and Orcinol O-Methyl Transferase) have been physically mapped on chromosomes 7, 4 and 1, respectively, of Rosa wichurana. The signal frequency was between 25% and 40%. HRM markers of these 3 gene fragments were used to include the gene fragments on the existing genetic linkage map of Rosa wichurana. As a result, three linkage groups could be anchored to their physical chromosomes. The information was used to check for synteny between the Rosa chromosomes and Fragaria. PMID:24755945
Dissection of Genetic Factors underlying Wheat Kernel Shape and Size in an Elite × Nonadapted Cross using a High Density SNP Linkage Map.

PubMed

Kumar, Ajay; Mantovani, E E; Seetan, R; Soltani, A; Echeverry-Solarte, M; Jain, S; Simsek, S; Doehlert, D; Alamri, M S; Elias, E M; Kianian, S F; Mergoum, M

2016-03-01

Wheat kernel shape and size has been under selection since early domestication. Kernel morphology is a major consideration in wheat breeding, as it impacts grain yield and quality. A population of 160 recombinant inbred lines (RIL), developed using an elite (ND 705) and a nonadapted genotype (PI 414566), was extensively phenotyped in replicated field trials and genotyped using Infinium iSelect 90K assay to gain insight into the genetic architecture of kernel shape and size. A high density genetic map consisting of 10,172 single nucleotide polymorphism (SNP) markers, with an average marker density of 0.39 cM/marker, identified a total of 29 genomic regions associated with six grain shape and size traits; ∼80% of these regions were associated with multiple traits. The analyses showed that kernel length (KL) and width (KW) are genetically independent, while a large number (∼59%) of the quantitative trait loci (QTL) for kernel shape traits were in common with genomic regions associated with kernel size traits. The most significant QTL was identified on chromosome 4B, and could be an ortholog of major rice grain size and shape gene or . Major and stable loci also were identified on the homeologous regions of Group 5 chromosomes, and in the regions of (6A) and (7A) genes. Both parental genotypes contributed equivalent positive QTL alleles, suggesting that the nonadapted germplasm has a great potential for enhancing the gene pool for grain shape and size. This study provides new knowledge on the genetic dissection of kernel morphology, with a much higher resolution, which may aid further improvement in wheat yield and quality using genomic tools. Copyright © 2016 Crop Science Society of America.

Association, effects and validation of polymorphisms within the NCAPG - LCORL locus located on BTA6 with feed intake, gain, meat and carcass traits in beef cattle

USDA-ARS?s Scientific Manuscript database

Background: In a previously reported genome-wide association study based on a high-density bovine SNP genotyping array, 8 SNP were nominally associated (P
Global Phylogeny of Mycobacterium tuberculosis Based on Single Nucleotide Polymorphism (SNP) Analysis: Insights into Tuberculosis Evolution, Phylogenetic Accuracy of Other DNA Fingerprinting Systems, and Recommendations for a Minimal Standard SNP Set†

PubMed Central

Filliol, Ingrid; Motiwala, Alifiya S.; Cavatore, Magali; Qi, Weihong; Hazbón, Manzour Hernando; Bobadilla del Valle, Miriam; Fyfe, Janet; García-García, Lourdes; Rastogi, Nalin; Sola, Christophe; Zozio, Thierry; Guerrero, Marta Inírida; León, Clara Inés; Crabtree, Jonathan; Angiuoli, Sam; Eisenach, Kathleen D.; Durmaz, Riza; Joloba, Moses L.; Rendón, Adrian; Sifuentes-Osornio, José; Ponce de León, Alfredo; Cave, M. Donald; Fleischmann, Robert; Whittam, Thomas S.; Alland, David

2006-01-01

We analyzed a global collection of Mycobacterium tuberculosis strains using 212 single nucleotide polymorphism (SNP) markers. SNP nucleotide diversity was high (average across all SNPs, 0.19), and 96% of the SNP locus pairs were in complete linkage disequilibrium. Cluster analyses identified six deeply branching, phylogenetically distinct SNP cluster groups (SCGs) and five subgroups. The SCGs were strongly associated with the geographical origin of the M. tuberculosis samples and the birthplace of the human hosts. The most ancestral cluster (SCG-1) predominated in patients from the Indian subcontinent, while SCG-1 and another ancestral cluster (SCG-2) predominated in patients from East Asia, suggesting that M. tuberculosis first arose in the Indian subcontinent and spread worldwide through East Asia. Restricted SCG diversity and the prevalence of less ancestral SCGs in indigenous populations in Uganda and Mexico suggested a more recent introduction of M. tuberculosis into these regions. The East African Indian and Beijing spoligotypes were concordant with SCG-1 and SCG-2, respectively; X and Central Asian spoligotypes were also associated with one SCG or subgroup combination. Other clades had less consistent associations with SCGs. Mycobacterial interspersed repetitive unit (MIRU) analysis provided less robust phylogenetic information, and only 6 of the 12 MIRU microsatellite loci were highly differentiated between SCGs as measured by GST. Finally, an algorithm was devised to identify two minimal sets of either 45 or 6 SNPs that could be used in future investigations to enable global collaborations for studies on evolution, strain differentiation, and biological differences of M. tuberculosis. PMID:16385065
DOE Office of Scientific and Technical Information (OSTI.GOV)

SacconePhD, Scott F; Chesler, Elissa J; Bierut, Laura J

Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well representedmore » by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.« less
Use of modern tomato breeding germplasm for deciphering the genetic control of agronomical traits by Genome Wide Association study.

PubMed

Bauchet, Guillaume; Grenier, Stéphane; Samson, Nicolas; Bonnet, Julien; Grivet, Laurent; Causse, Mathilde

2017-05-01

A panel of 300 tomato accessions including breeding materials was built and characterized with >11,000 SNP. A population structure in six subgroups was identified. Strong heterogeneity in linkage disequilibrium and recombination landscape among groups and chromosomes was shown. GWAS identified several associations for fruit weight, earliness and plant growth. Genome-wide association studies (GWAS) have become a method of choice in quantitative trait dissection. First limited to highly polymorphic and outcrossing species, it is now applied in horticultural crops, notably in tomato. Until now GWAS in tomato has been performed on panels of heirloom and wild accessions. Using modern breeding materials would be of direct interest for breeding purpose. To implement GWAS on a large panel of 300 tomato accessions including 168 breeding lines, this study assessed the genetic diversity and linkage disequilibrium decay and revealed the population structure and performed GWA experiment. Genetic diversity and population structure analyses were based on molecular markers (>11,000 SNP) covering the whole genome. Six genetic subgroups were revealed and associated to traits of agronomical interest, such as fruit weight and disease resistance. Estimates of linkage disequilibrium highlighted the heterogeneity of its decay among genetic subgroups. Haplotype definition allowed a fine characterization of the groups and their recombination landscape revealing the patterns of admixture along the genome. Selection footprints showed results in congruence with introgressions. Taken together, all these elements refined our knowledge of the genetic material included in this panel and allowed the identification of several associations for fruit weight, plant growth and earliness, deciphering the genetic architecture of these complex traits and identifying several new loci useful for tomato breeding.
A Saturated Genetic Linkage Map of Autotetraploid Alfalfa (Medicago sativa L.) Developed Using Genotyping-by-Sequencing Is Highly Syntenous with the Medicago truncatula Genome

PubMed Central

Li, Xuehui; Wei, Yanling; Acharya, Ananta; Jiang, Qingzhen; Kang, Junmei; Brummer, E. Charles

2014-01-01

A genetic linkage map is a valuable tool for quantitative trait locus mapping, map-based gene cloning, comparative mapping, and whole-genome assembly. Alfalfa, one of the most important forage crops in the world, is autotetraploid, allogamous, and highly heterozygous, characteristics that have impeded the construction of a high-density linkage map using traditional genetic marker systems. Using genotyping-by-sequencing (GBS), we constructed low-cost, reasonably high-density linkage maps for both maternal and paternal parental genomes of an autotetraploid alfalfa F1 population. The resulting maps contain 3591 single-nucleotide polymorphism markers on 64 linkage groups across both parents, with an average density of one marker per 1.5 and 1.0 cM for the maternal and paternal haplotype maps, respectively. Chromosome assignments were made based on homology of markers to the M. truncatula genome. Four linkage groups representing the four haplotypes of each alfalfa chromosome were assigned to each of the eight Medicago chromosomes in both the maternal and paternal parents. The alfalfa linkage groups were highly syntenous with M. truncatula, and clearly identified the known translocation between Chromosomes 4 and 8. In addition, a small inversion on Chromosome 1 was identified between M. truncatula and M. sativa. GBS enabled us to develop a saturated linkage map for alfalfa that greatly improved genome coverage relative to previous maps and that will facilitate investigation of genome structure. GBS could be used in breeding populations to accelerate molecular breeding in alfalfa. PMID:25147192
A saturated genetic linkage map of autotetraploid alfalfa (Medicago sativa L.) developed using genotyping-by-sequencing is highly syntenous with the Medicago truncatula genome.

PubMed

Li, Xuehui; Wei, Yanling; Acharya, Ananta; Jiang, Qingzhen; Kang, Junmei; Brummer, E Charles

2014-08-21

A genetic linkage map is a valuable tool for quantitative trait locus mapping, map-based gene cloning, comparative mapping, and whole-genome assembly. Alfalfa, one of the most important forage crops in the world, is autotetraploid, allogamous, and highly heterozygous, characteristics that have impeded the construction of a high-density linkage map using traditional genetic marker systems. Using genotyping-by-sequencing (GBS), we constructed low-cost, reasonably high-density linkage maps for both maternal and paternal parental genomes of an autotetraploid alfalfa F1 population. The resulting maps contain 3591 single-nucleotide polymorphism markers on 64 linkage groups across both parents, with an average density of one marker per 1.5 and 1.0 cM for the maternal and paternal haplotype maps, respectively. Chromosome assignments were made based on homology of markers to the M. truncatula genome. Four linkage groups representing the four haplotypes of each alfalfa chromosome were assigned to each of the eight Medicago chromosomes in both the maternal and paternal parents. The alfalfa linkage groups were highly syntenous with M. truncatula, and clearly identified the known translocation between Chromosomes 4 and 8. In addition, a small inversion on Chromosome 1 was identified between M. truncatula and M. sativa. GBS enabled us to develop a saturated linkage map for alfalfa that greatly improved genome coverage relative to previous maps and that will facilitate investigation of genome structure. GBS could be used in breeding populations to accelerate molecular breeding in alfalfa. Copyright © 2014 Li et al.
A Bayesian antedependence model for whole genome prediction.

PubMed

Yang, Wenzhao; Tempelman, Robert J

2012-04-01

Hierarchical mixed effects models have been demonstrated to be powerful for predicting genomic merit of livestock and plants, on the basis of high-density single-nucleotide polymorphism (SNP) marker panels, and their use is being increasingly advocated for genomic predictions in human health. Two particularly popular approaches, labeled BayesA and BayesB, are based on specifying all SNP-associated effects to be independent of each other. BayesB extends BayesA by allowing a large proportion of SNP markers to be associated with null effects. We further extend these two models to specify SNP effects as being spatially correlated due to the chromosomally proximal effects of causal variants. These two models, that we respectively dub as ante-BayesA and ante-BayesB, are based on a first-order nonstationary antedependence specification between SNP effects. In a simulation study involving 20 replicate data sets, each analyzed at six different SNP marker densities with average LD levels ranging from r(2) = 0.15 to 0.31, the antedependence methods had significantly (P < 0.01) higher accuracies than their corresponding classical counterparts at higher LD levels (r(2) > 0. 24) with differences exceeding 3%. A cross-validation study was also conducted on the heterogeneous stock mice data resource (http://mus.well.ox.ac.uk/mouse/HS/) using 6-week body weights as the phenotype. The antedependence methods increased cross-validation prediction accuracies by up to 3.6% compared to their classical counterparts (P < 0.001). Finally, we applied our method to other benchmark data sets and demonstrated that the antedependence methods were more accurate than their classical counterparts for genomic predictions, even for individuals several generations beyond the training data.
High Density Linkage Map Construction and Mapping of Yield Trait QTLs in Maize (Zea mays) Using the Genotyping-by-Sequencing (GBS) Technology

PubMed Central

Su, Chengfu; Wang, Wei; Gong, Shunliang; Zuo, Jinghui; Li, Shujiang; Xu, Shizhong

2017-01-01

Increasing grain yield is the ultimate goal for maize breeding. High resolution quantitative trait loci (QTL) mapping can help us understand the molecular basis of phenotypic variation of yield and thus facilitate marker assisted breeding. The aim of this study is to use genotyping-by-sequencing (GBS) for large-scale SNP discovery and simultaneous genotyping of all F2 individuals from a cross between two varieties of maize that are in clear contrast in yield and related traits. A set of 199 F2 progeny derived from the cross of varieties SG-5 and SG-7 were generated and genotyped by GBS. A total of 1,046,524,604 reads with an average of 5,258,918 reads per F2 individual were generated. This number of reads represents an approximately 0.36-fold coverage of the maize reference genome Zea_mays.AGPv3.29 for each F2 individual. A total of 68,882 raw SNPs were discovered in the F2 population, which, after stringent filtering, led to a total of 29,927 high quality SNPs. Comparative analysis using these physically mapped marker loci revealed a higher degree of synteny with the reference genome. The SNP genotype data were utilized to construct an intra-specific genetic linkage map of maize consisting of 3,305 bins on 10 linkage groups spanning 2,236.66 cM at an average distance of 0.68 cM between consecutive markers. From this map, we identified 28 QTLs associated with yield traits (100-kernel weight, ear length, ear diameter, cob diameter, kernel row number, corn grains per row, ear weight, and grain weight per plant) using the composite interval mapping (CIM) method and 29 QTLs using the least absolute shrinkage selection operator (LASSO) method. QTLs identified by the CIM method account for 6.4% to 19.7% of the phenotypic variation. Small intervals of three QTLs (qCGR-1, qKW-2, and qGWP-4) contain several genes, including one gene (GRMZM2G139872) encoding the F-box protein, three genes (GRMZM2G180811, GRMZM5G828139, and GRMZM5G873194) encoding the WD40-repeat protein, and one gene (GRMZM2G019183) encoding the UDP-Glycosyltransferase. The work will not only help to understand the mechanisms that control yield traits of maize, but also provide a basis for marker-assisted selection and map-based cloning in further studies. PMID:28533786
Breeding value prediction for production traits in layer chickens using pedigree or genomic relationships in a reduced animal model.

PubMed

Wolc, Anna; Stricker, Chris; Arango, Jesus; Settar, Petek; Fulton, Janet E; O'Sullivan, Neil P; Preisinger, Rudolf; Habier, David; Fernando, Rohan; Garrick, Dorian J; Lamont, Susan J; Dekkers, Jack C M

2011-01-21

Genomic selection involves breeding value estimation of selection candidates based on high-density SNP genotypes. To quantify the potential benefit of genomic selection, accuracies of estimated breeding values (EBV) obtained with different methods using pedigree or high-density SNP genotypes were evaluated and compared in a commercial layer chicken breeding line. The following traits were analyzed: egg production, egg weight, egg color, shell strength, age at sexual maturity, body weight, albumen height, and yolk weight. Predictions appropriate for early or late selection were compared. A total of 2,708 birds were genotyped for 23,356 segregating SNP, including 1,563 females with records. Phenotypes on relatives without genotypes were incorporated in the analysis (in total 13,049 production records).The data were analyzed with a Reduced Animal Model using a relationship matrix based on pedigree data or on marker genotypes and with a Bayesian method using model averaging. Using a validation set that consisted of individuals from the generation following training, these methods were compared by correlating EBV with phenotypes corrected for fixed effects, selecting the top 30 individuals based on EBV and evaluating their mean phenotype, and by regressing phenotypes on EBV. Using high-density SNP genotypes increased accuracies of EBV up to two-fold for selection at an early age and by up to 88% for selection at a later age. Accuracy increases at an early age can be mostly attributed to improved estimates of parental EBV for shell quality and egg production, while for other egg quality traits it is mostly due to improved estimates of Mendelian sampling effects. A relatively small number of markers was sufficient to explain most of the genetic variation for egg weight and body weight.
QTL mapping of potato chip color and tuber traits within an autotetraploid family

USDA-ARS?s Scientific Manuscript database

Cultivated potato (Solanum tuberosum L.) is a highly heterozygous autotetraploid crop species, and this presents challenges for traditional line development and molecular breeding. Recent availability of a single nucleotide polymorphism (SNP) array with 8303 features and software packages for linkag...
Linkage and association analysis of obesity traits reveals novel loci and interactions with dietary n-3 fatty acids in an Alaska Native (Yup’ik) population

PubMed Central

Vaughan, Laura Kelly; Wiener, Howard W.; Aslibekyan, Stella; Allison, David B.; Havel, Peter J.; Stanhope, Kimber L.; O’Brien, Diane M.; Hopkins, Scarlett E.; Lemas, Dominick J.; Boyer, Bert B.; Tiwari, Hemant K.

2015-01-01

Objective To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup’ik people. Material and Methods We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. Results We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 for waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). Conclusions This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup’ik people. PMID:25772781
Linkage and association analysis of obesity traits reveals novel loci and interactions with dietary n-3 fatty acids in an Alaska Native (Yup'ik) population.

PubMed

Vaughan, Laura Kelly; Wiener, Howard W; Aslibekyan, Stella; Allison, David B; Havel, Peter J; Stanhope, Kimber L; O'Brien, Diane M; Hopkins, Scarlett E; Lemas, Dominick J; Boyer, Bert B; Tiwari, Hemant K

2015-06-01

To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup'ik people. We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 for waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup'ik people. Copyright © 2015 Elsevier Inc. All rights reserved.
Efficiency of multi-breed genomic selection for dairy cattle breeds with different sizes of reference population.

PubMed

Hozé, C; Fritz, S; Phocas, F; Boichard, D; Ducrocq, V; Croiseau, P

2014-01-01

Single-breed genomic selection (GS) based on medium single nucleotide polymorphism (SNP) density (~50,000; 50K) is now routinely implemented in several large cattle breeds. However, building large enough reference populations remains a challenge for many medium or small breeds. The high-density BovineHD BeadChip (HD chip; Illumina Inc., San Diego, CA) containing 777,609 SNP developed in 2010 is characterized by short-distance linkage disequilibrium expected to be maintained across breeds. Therefore, combining reference populations can be envisioned. A population of 1,869 influential ancestors from 3 dairy breeds (Holstein, Montbéliarde, and Normande) was genotyped with the HD chip. Using this sample, 50K genotypes were imputed within breed to high-density genotypes, leading to a large HD reference population. This population was used to develop a multi-breed genomic evaluation. The goal of this paper was to investigate the gain of multi-breed genomic evaluation for a small breed. The advantage of using a large breed (Normande in the present study) to mimic a small breed is the large potential validation population to compare alternative genomic selection approaches more reliably. In the Normande breed, 3 training sets were defined with 1,597, 404, and 198 bulls, and a unique validation set included the 394 youngest bulls. For each training set, estimated breeding values (EBV) were computed using pedigree-based BLUP, single-breed BayesC, or multi-breed BayesC for which the reference population was formed by any of the Normande training data sets and 4,989 Holstein and 1,788 Montbéliarde bulls. Phenotypes were standardized by within-breed genetic standard deviation, the proportion of polygenic variance was set to 30%, and the estimated number of SNP with a nonzero effect was about 7,000. The 2 genomic selection (GS) approaches were performed using either the 50K or HD genotypes. The correlations between EBV and observed daughter yield deviations (DYD) were computed for 6 traits and using the different prediction approaches. Compared with pedigree-based BLUP, the average gain in accuracy with GS in small populations was 0.057 for the single-breed and 0.086 for multi-breed approach. This gain was up to 0.193 and 0.209, respectively, with the large reference population. Improvement of EBV prediction due to the multi-breed evaluation was higher for animals not closely related to the reference population. In the case of a breed with a small reference population size, the increase in correlation due to multi-breed GS was 0.141 for bulls without their sire in reference population compared with 0.016 for bulls with their sire in reference population. These results demonstrate that multi-breed GS can contribute to increase genomic evaluation accuracy in small breeds. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
High-throughput SNP genotyping for breeding applications in rice using the BeadXpress platform

USDA-ARS?s Scientific Manuscript database

Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...
Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design

PubMed Central

Goonetilleke, Shashi N.; March, Timothy J.; Wirthensohn, Michelle G.; Arús, Pere; Walker, Amanda R.; Mather, Diane E.

2017-01-01

In crop plant genetics, linkage maps provide the basis for the mapping of loci that affect important traits and for the selection of markers to be applied in crop improvement. In outcrossing species such as almond (Prunus dulcis Mill. D. A. Webb), application of a double pseudotestcross mapping approach to the F1 progeny of a biparental cross leads to the construction of a linkage map for each parent. Here, we report on the application of genotyping by sequencing to discover and map single nucleotide polymorphisms in the almond cultivars “Nonpareil” and “Lauranne.” Allele-specific marker assays were developed for 309 tag pairs. Application of these assays to 231 Nonpareil × Lauranne F1 progeny provided robust linkage maps for each parent. Analysis of phenotypic data for shell hardness demonstrated the utility of these maps for quantitative trait locus mapping. Comparison of these maps to the peach genome assembly confirmed high synteny and collinearity between the peach and almond genomes. The marker assays were applied to progeny from several other Nonpareil crosses, providing the basis for a composite linkage map of Nonpareil. Applications of the assays to a panel of almond clones and a panel of rootstocks used for almond production demonstrated the broad applicability of the markers and provide subsets of markers that could be used to discriminate among accessions. The sequence-based linkage maps and single nucleotide polymorphism assays presented here could be useful resources for the genetic analysis and genetic improvement of almond. PMID:29141988
Genome-wide linkage scan for contraction velocity characteristics of knee musculature in the Leuven Genes for Muscular Strength Study.

PubMed

De Mars, Gunther; Windelinckx, An; Huygens, Wim; Peeters, Maarten W; Beunen, Gaston P; Aerssens, Jeroen; Vlietinck, Robert; Thomis, Martine A I

2008-09-17

The torque-velocity relationship is known to be affected by ageing, decreasing its protective role in the prevention of falls. Interindividual variability in this torque-velocity relationship is partly determined by genetic factors (h(2): 44-67%). As a first attempt, this genome-wide linkage study aimed to identify chromosomal regions linked to the torque-velocity relationship of the knee flexors and extensors. A selection of 283 informative male siblings (17-36 yr), belonging to 105 families, was used to conduct a genome-wide SNP-based (Illumina Linkage IVb panel) multipoint linkage analysis for the torque-velocity relationship of the knee flexors and extensors. The strongest evidence for linkage was found at 15q23 for the torque-velocity slope of the knee extensors (TVSE). Other interesting linkage regions with LOD scores >2 were found at 7p12.3 [logarithm of the odds ratio (LOD) = 2.03, P = 0.0011] for the torque-velocity ratio of the knee flexors (TVRF), at 2q14.3 (LOD = 2.25, P = 0.0006) for TVSE, and at 4p14 and 18q23 for the torque-velocity ratio of the knee extensors TVRE (LOD = 2.23 and 2.08; P = 0.0007 and 0.001, respectively). We conclude that many small contributing genes are involved in causing variation in the torque-velocity relationship of the knee flexor and extensor muscles. Several earlier reported candidate genes for muscle strength and muscle mass and new candidates are harbored within or in close vicinity of the linkage regions reported in the present study.
Random forest estimation of genomic breeding values for disease susceptibility over different disease incidences and genomic architectures in simulated cow calibration groups.

PubMed

Naderi, S; Yin, T; König, S

2016-09-01

A simulation study was conducted to investigate the performance of random forest (RF) and genomic BLUP (GBLUP) for genomic predictions of binary disease traits based on cow calibration groups. Training and testing sets were modified in different scenarios according to disease incidence, the quantitative-genetic background of the trait (h(2)=0.30 and h(2)=0.10), and the genomic architecture [725 quantitative trait loci (QTL) and 290 QTL, populations with high and low levels of linkage disequilibrium (LD)]. For all scenarios, 10,005 SNP (depicting a low-density 10K SNP chip) and 50,025 SNP (depicting a 50K SNP chip) were evenly spaced along 29 chromosomes. Training and testing sets included 20,000 cows (4,000 sick, 16,000 healthy, disease incidence 20%) from the last 2 generations. Initially, 4,000 sick cows were assigned to the testing set, and the remaining 16,000 healthy cows represented the training set. In the ongoing allocation schemes, the number of sick cows in the training set increased stepwise by moving 10% of the sick animals from the testing set to the training set, and vice versa. The size of the training and testing sets was kept constant. Evaluation criteria for both GBLUP and RF were the correlations between genomic breeding values and true breeding values (prediction accuracy), and the area under the receiving operating characteristic curve (AUROC). Prediction accuracy and AUROC increased for both methods and all scenarios as increasing percentages of sick cows were allocated to the training set. Highest prediction accuracies were observed for disease incidences in training sets that reflected the population disease incidence of 0.20. For this allocation scheme, the largest prediction accuracies of 0.53 for RF and of 0.51 for GBLUP, and the largest AUROC of 0.66 for RF and of 0.64 for GBLUP, were achieved using 50,025 SNP, a heritability of 0.30, and 725 QTL. Heritability decreases from 0.30 to 0.10 and QTL reduction from 725 to 290 were associated with decreasing prediction accuracy and decreasing AUROC for all scenarios. This decrease was more pronounced for RF. Also, the increase of LD had stronger effect on RF results than on GBLUP results. The highest prediction accuracy from the low LD scenario was 0.30 from RF and 0.36 from GBLUP, and increased to 0.39 for both methods in the high LD population. Random forest successfully identified important SNP in close map distance to QTL explaining a high proportion of the phenotypic trait variations. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses

PubMed Central

2010-01-01

Background Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. Results A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [Punadj. = 6.96 × 10-6]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (Punadj. = 1.61 × 10-9; PBonf. = 6.58 × 10-5). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r2 = 0.86). Conclusions Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 × 10-10; BIEC2-417495, Punadj. = 1.61 × 10-9). Functional investigations will be required to determine whether this polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds. PMID:20932346
A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses.

PubMed

Hill, Emmeline W; McGivney, Beatrice A; Gu, Jingjing; Whiston, Ronan; Machugh, David E

2010-10-11

Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [P(unadj.) = 6.96 x 10⁻⁶]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (P(unadj.) = 1.61 x 10⁻⁹; P(Bonf.) = 6.58 x 10⁻⁵). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r² = 0.86). Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 x 10⁻¹⁰; BIEC2-417495, P(unadj.) = 1.61 x 10⁻⁹). Functional investigations will be required to determine whether this polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds.
Preselection statistics and Random Forest classification identify population informative single nucleotide polymorphisms in cosmopolitan and autochthonous cattle breeds.

PubMed

Bertolini, F; Galimberti, G; Schiavo, G; Mastrangelo, S; Di Gerlando, R; Strillacci, M G; Bagnato, A; Portolano, B; Fontanesi, L

2018-01-01

Commercial single nucleotide polymorphism (SNP) arrays have been recently developed for several species and can be used to identify informative markers to differentiate breeds or populations for several downstream applications. To identify the most discriminating genetic markers among thousands of genotyped SNPs, a few statistical approaches have been proposed. In this work, we compared several methods of SNPs preselection (Delta, F st and principal component analyses (PCA)) in addition to Random Forest classifications to analyse SNP data from six dairy cattle breeds, including cosmopolitan (Holstein, Brown and Simmental) and autochthonous Italian breeds raised in two different regions and subjected to limited or no breeding programmes (Cinisara, Modicana, raised only in Sicily and Reggiana, raised only in Emilia Romagna). From these classifications, two panels of 96 and 48 SNPs that contain the most discriminant SNPs were created for each preselection method. These panels were evaluated in terms of the ability to discriminate as a whole and breed-by-breed, as well as linkage disequilibrium within each panel. The obtained results showed that for the 48-SNP panel, the error rate increased mainly for autochthonous breeds, probably as a consequence of their admixed origin lower selection pressure and by ascertaining bias in the construction of the SNP chip. The 96-SNP panels were generally more able to discriminate all breeds. The panel derived by PCA-chrom (obtained by a preselection chromosome by chromosome) could identify informative SNPs that were particularly useful for the assignment of minor breeds that reached the lowest value of Out Of Bag error even in the Cinisara, whose value was quite high in all other panels. Moreover, this panel contained also the lowest number of SNPs in linkage disequilibrium. Several selected SNPs are located nearby genes affecting breed-specific phenotypic traits (coat colour and stature) or associated with production traits. In general, our results demonstrated the usefulness of Random Forest in combination to other reduction techniques to identify population informative SNPs.

High-density marker imputation accuracy in sixteen French cattle breeds.

PubMed

Hozé, Chris; Fouilloux, Marie-Noëlle; Venot, Eric; Guillaume, François; Dassonneville, Romain; Fritz, Sébastien; Ducrocq, Vincent; Phocas, Florence; Boichard, Didier; Croiseau, Pascal

2013-09-03

Genotyping with the medium-density Bovine SNP50 BeadChip® (50K) is now standard in cattle. The high-density BovineHD BeadChip®, which contains 777,609 single nucleotide polymorphisms (SNPs), was developed in 2010. Increasing marker density increases the level of linkage disequilibrium between quantitative trait loci (QTL) and SNPs and the accuracy of QTL localization and genomic selection. However, re-genotyping all animals with the high-density chip is not economically feasible. An alternative strategy is to genotype part of the animals with the high-density chip and to impute high-density genotypes for animals already genotyped with the 50K chip. Thus, it is necessary to investigate the error rate when imputing from the 50K to the high-density chip. Five thousand one hundred and fifty three animals from 16 breeds (89 to 788 per breed) were genotyped with the high-density chip. Imputation error rates from the 50K to the high-density chip were computed for each breed with a validation set that included the 20% youngest animals. Marker genotypes were masked for animals in the validation population in order to mimic 50K genotypes. Imputation was carried out using the Beagle 3.3.0 software. Mean allele imputation error rates ranged from 0.31% to 2.41% depending on the breed. In total, 1980 SNPs had high imputation error rates in several breeds, which is probably due to genome assembly errors, and we recommend to discard these in future studies. Differences in imputation accuracy between breeds were related to the high-density-genotyped sample size and to the genetic relationship between reference and validation populations, whereas differences in effective population size and level of linkage disequilibrium showed limited effects. Accordingly, imputation accuracy was higher in breeds with large populations and in dairy breeds than in beef breeds. More than 99% of the alleles were correctly imputed if more than 300 animals were genotyped at high-density. No improvement was observed when multi-breed imputation was performed. In all breeds, imputation accuracy was higher than 97%, which indicates that imputation to the high-density chip was accurate. Imputation accuracy depends mainly on the size of the reference population and the relationship between reference and target populations.
High-density marker imputation accuracy in sixteen French cattle breeds

PubMed Central

2013-01-01

Background Genotyping with the medium-density Bovine SNP50 BeadChip® (50K) is now standard in cattle. The high-density BovineHD BeadChip®, which contains 777 609 single nucleotide polymorphisms (SNPs), was developed in 2010. Increasing marker density increases the level of linkage disequilibrium between quantitative trait loci (QTL) and SNPs and the accuracy of QTL localization and genomic selection. However, re-genotyping all animals with the high-density chip is not economically feasible. An alternative strategy is to genotype part of the animals with the high-density chip and to impute high-density genotypes for animals already genotyped with the 50K chip. Thus, it is necessary to investigate the error rate when imputing from the 50K to the high-density chip. Methods Five thousand one hundred and fifty three animals from 16 breeds (89 to 788 per breed) were genotyped with the high-density chip. Imputation error rates from the 50K to the high-density chip were computed for each breed with a validation set that included the 20% youngest animals. Marker genotypes were masked for animals in the validation population in order to mimic 50K genotypes. Imputation was carried out using the Beagle 3.3.0 software. Results Mean allele imputation error rates ranged from 0.31% to 2.41% depending on the breed. In total, 1980 SNPs had high imputation error rates in several breeds, which is probably due to genome assembly errors, and we recommend to discard these in future studies. Differences in imputation accuracy between breeds were related to the high-density-genotyped sample size and to the genetic relationship between reference and validation populations, whereas differences in effective population size and level of linkage disequilibrium showed limited effects. Accordingly, imputation accuracy was higher in breeds with large populations and in dairy breeds than in beef breeds. More than 99% of the alleles were correctly imputed if more than 300 animals were genotyped at high-density. No improvement was observed when multi-breed imputation was performed. Conclusion In all breeds, imputation accuracy was higher than 97%, which indicates that imputation to the high-density chip was accurate. Imputation accuracy depends mainly on the size of the reference population and the relationship between reference and target populations. PMID:24004563
Genome-wide high-density SNP linkage search for glioma susceptibility loci: results from the Gliogene Consortium

PubMed Central

Shete, Sanjay; Lau, Ching C; Houlston, Richard S; Claus, Elizabeth B; Barnholtz-Sloan, Jill; Lai, Rose; Il’yasova, Dora; Schildkraut, Joellen; Sadetzki, Siegal; Johansen, Christoffer; Bernstein, Jonine L; Olson, Sara H; Jenkins, Robert B; Yang, Ping; Vick, Nicholas A; Wrensch, Margaret; Davis, Faith G; McCarthy, Bridget J; Leung, Eastwood Hon-chiu; Davis, Caleb; Cheng, Rita; Hosking, Fay J; Armstrong, Georgina N; Liu, Yanhong; Yu, Robert K; Henriksson, Roger; Consortium, The Gliogene; Melin, Beatrice S; Bondy, Melissa L

2011-01-01

Gliomas, which generally have a poor prognosis, are the most common primary malignant brain tumors in adults. Recent genome-wide association studies have demonstrated that inherited susceptibility plays a role in the development of glioma. Although first-degree relatives of patients exhibit a two-fold increased risk of glioma, the search for susceptibility loci in familial forms of the disease has been challenging because the disease is relatively rare, fatal, and heterogeneous, making it difficult to collect sufficient biosamples from families for statistical power. To address this challenge, the Genetic Epidemiology of Glioma International Consortium (Gliogene) was formed to collect DNA samples from families with two or more cases of histologically confirmed glioma. In this study, we present results obtained from 46 U.S. families in which multipoint linkage analyses were undertaken using nonparametric (model-free) methods. After removal of high linkage disequilibrium SNPs, we obtained a maximum nonparametric linkage score (NPL) of 3.39 (P=0.0005) at 17q12–21.32 and the Z-score of 4.20 (P=0.000007). To replicate our findings, we genotyped 29 independent U.S. families and obtained a maximum NPL score of 1.26 (P=0.008) and the Z-score of 1.47 (P=0.035). Accounting for the genetic heterogeneity using the ordered subset analysis approach, the combined analyses of 75 families resulted in a maximum NPL score of 3.81 (P=0.00001). The genomic regions we have implicated in this study may offer novel insights into glioma susceptibility, focusing future work to identify genes that cause familial glioma. PMID:22037877
Genetic assessment of additional endophenotypes from the Consortium on the Genetics of Schizophrenia Family Study.

PubMed

Greenwood, Tiffany A; Lazzeroni, Laura C; Calkins, Monica E; Freedman, Robert; Green, Michael F; Gur, Raquel E; Gur, Ruben C; Light, Gregory A; Nuechterlein, Keith H; Olincy, Ann; Radant, Allen D; Seidman, Larry J; Siever, Larry J; Silverman, Jeremy M; Stone, William S; Sugar, Catherine A; Swerdlow, Neal R; Tsuang, Debby W; Tsuang, Ming T; Turetsky, Bruce I; Braff, David L

2016-01-01

The Consortium on the Genetics of Schizophrenia Family Study (COGS-1) has previously reported our efforts to characterize the genetic architecture of 12 primary endophenotypes for schizophrenia. We now report the characterization of 13 additional measures derived from the same endophenotype test paradigms in the COGS-1 families. Nine of the measures were found to discriminate between schizophrenia patients and controls, were significantly heritable (31 to 62%), and were sufficiently independent of previously assessed endophenotypes, demonstrating utility as additional endophenotypes. Genotyping via a custom array of 1536 SNPs from 94 candidate genes identified associations for CTNNA2, ERBB4, GRID1, GRID2, GRIK3, GRIK4, GRIN2B, NOS1AP, NRG1, and RELN across multiple endophenotypes. An experiment-wide p value of 0.003 suggested that the associations across all SNPs and endophenotypes collectively exceeded chance. Linkage analyses performed using a genome-wide SNP array further identified significant or suggestive linkage for six of the candidate endophenotypes, with several genes of interest located beneath the linkage peaks (e.g., CSMD1, DISC1, DLGAP2, GRIK2, GRIN3A, and SLC6A3). While the partial convergence of the association and linkage likely reflects differences in density of gene coverage provided by the distinct genotyping platforms, it is also likely an indication of the differential contribution of rare and common variants for some genes and methodological differences in detection ability. Still, many of the genes implicated by COGS through endophenotypes have been identified by independent studies of common, rare, and de novo variation in schizophrenia, all converging on a functional genetic network related to glutamatergic neurotransmission that warrants further investigation. Copyright © 2015 Elsevier B.V. All rights reserved.
The search for loci under selection: trends, biases and progress.

PubMed

Ahrens, Collin W; Rymer, Paul D; Stow, Adam; Bragg, Jason; Dillon, Shannon; Umbers, Kate D L; Dudaniec, Rachael Y

2018-03-01

Detecting genetic variants under selection using F ST outlier analysis (OA) and environmental association analyses (EAAs) are popular approaches that provide insight into the genetic basis of local adaptation. Despite the frequent use of OA and EAA approaches and their increasing attractiveness for detecting signatures of selection, their application to field-based empirical data have not been synthesized. Here, we review 66 empirical studies that use Single Nucleotide Polymorphisms (SNPs) in OA and EAA. We report trends and biases across biological systems, sequencing methods, approaches, parameters, environmental variables and their influence on detecting signatures of selection. We found striking variability in both the use and reporting of environmental data and statistical parameters. For example, linkage disequilibrium among SNPs and numbers of unique SNP associations identified with EAA were rarely reported. The proportion of putatively adaptive SNPs detected varied widely among studies, and decreased with the number of SNPs analysed. We found that genomic sampling effort had a greater impact than biological sampling effort on the proportion of identified SNPs under selection. OA identified a higher proportion of outliers when more individuals were sampled, but this was not the case for EAA. To facilitate repeatability, interpretation and synthesis of studies detecting selection, we recommend that future studies consistently report geographical coordinates, environmental data, model parameters, linkage disequilibrium, and measures of genetic structure. Identifying standards for how OA and EAA studies are designed and reported will aid future transparency and comparability of SNP-based selection studies and help to progress landscape and evolutionary genomics. © 2018 John Wiley & Sons Ltd.
Linkage disequilibrium matches forensic genetic records to disjoint genomic marker sets.

PubMed

Edge, Michael D; Algee-Hewitt, Bridget F B; Pemberton, Trevor J; Li, Jun Z; Rosenberg, Noah A

2017-05-30

Combining genotypes across datasets is central in facilitating advances in genetics. Data aggregation efforts often face the challenge of record matching-the identification of dataset entries that represent the same individual. We show that records can be matched across genotype datasets that have no shared markers based on linkage disequilibrium between loci appearing in different datasets. Using two datasets for the same 872 people-one with 642,563 genome-wide SNPs and the other with 13 short tandem repeats (STRs) used in forensic applications-we find that 90-98% of forensic STR records can be connected to corresponding SNP records and vice versa. Accuracy increases to 99-100% when ∼30 STRs are used. Our method expands the potential of data aggregation, but it also suggests privacy risks intrinsic in maintenance of databases containing even small numbers of markers-including databases of forensic significance.
A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing.

PubMed

Gao, Guangtu; Nome, Torfinn; Pearse, Devon E; Moen, Thomas; Naish, Kerry A; Thorgaard, Gary H; Lien, Sigbjørn; Palti, Yniv

2018-01-01

Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout ( Oncorhynchus mykiss ), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup , followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity within each population. We also provide functional annotation based on the genome position of each SNP and evaluate the use of clonal lines for filtering of PSVs and MSVs. These SNPs form a new database, which provides an important resource for a new high density SNP array design and for other SNP genotyping platforms used for genetic and genomics studies of this iconic salmonid fish species.
Using SNP markers to dissect linkage disequilibrium at a major quantitative trait locus for resistance to the potato cyst nematode Globodera pallida on potato chromosome V.

PubMed

Achenbach, Ute; Paulo, Joao; Ilarionova, Evgenyia; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Gebhardt, Christiane

2009-02-01

The damage caused by the parasitic root cyst nematode Globodera pallida is a major yield-limiting factor in potato cultivation . Breeding for resistance is facilitated by the PCR-based marker 'HC', which is diagnostic for an allele conferring high resistance against G. pallida pathotype Pa2/3 that has been introgressed from the wild potato species Solanum vernei into the Solanum tuberosum tetraploid breeding pool. The major quantitative trait locus (QTL) controlling this nematode resistance maps on potato chromosome V in a hot spot for resistance to various pathogens including nematodes and the oomycete Phytophthora infestans. An unstructured sample of 79 tetraploid, highly heterozygous varieties and breeding clones was selected based on presence (41 genotypes) or absence (38 genotypes) of the HC marker. Testing the clones for resistance to G. pallida confirmed the diagnostic power of the HC marker. The 79 individuals were genotyped for 100 single nucleotide polymorphisms (SNPs) at 10 loci distributed over 38 cM on chromosome V. Forty-five SNPs at six loci spanning 2 cM in the interval between markers GP21-GP179 were associated with resistance to G. pallida. Based on linkage disequilibrium (LD) between SNP markers, six LD groups comprising between 2 and 18 SNPs were identified. The LD groups indicated the existence of multiple alleles at a single resistance locus or at several, physically linked resistance loci. LD group C comprising 18 SNPs corresponded to the 'HC' marker. LD group E included 16 SNPs and showed an association peak, which positioned one nematode resistance locus physically close to the R1 gene family.
Genome-wide association study of body weight in Australian Merino sheep reveals an orthologous region on OAR6 to human and bovine genomic regions affecting height and weight.

PubMed

Al-Mamun, Hawlader A; Kwan, Paul; Clark, Samuel A; Ferdosi, Mohammad H; Tellam, Ross; Gondro, Cedric

2015-08-14

Body weight (BW) is an important trait for meat production in sheep. Although over the past few years, numerous quantitative trait loci (QTL) have been detected for production traits in cattle, few QTL studies have been reported for sheep, with even fewer on meat production traits. Our objective was to perform a genome-wide association study (GWAS) with the medium-density Illumina Ovine SNP50 BeadChip to identify genomic regions and corresponding haplotypes associated with BW in Australian Merino sheep. A total of 1781 Australian Merino sheep were genotyped using the medium-density Illumina Ovine SNP50 BeadChip. Among the 53 862 single nucleotide polymorphisms (SNPs) on this array, 48 640 were used to perform a GWAS using a linear mixed model approach. Genotypes were phased with hsphase; to estimate SNP haplotype effects, linkage disequilibrium blocks were identified in the detected QTL region. Thirty-nine SNPs were associated with BW at a Bonferroni-corrected genome-wide significance threshold of 1 %. One region on sheep (Ovis aries) chromosome 6 (OAR6) between 36.15 and 38.56 Mb, included 13 significant SNPs that were associated with BW; the most significant SNP was OAR6_41936490.1 (P = 2.37 × 10(-16)) at 37.69 Mb with an allele substitution effect of 2.12 kg, which corresponds to 0.248 phenotypic standard deviations for BW. The region that surrounds this association signal on OAR6 contains three genes: leucine aminopeptidase 3 (LAP3), which is involved in the processing of the oxytocin precursor; NCAPG non-SMC condensin I complex, subunit G (NCAPG), which is associated with foetal growth and carcass size in cattle; and ligand dependent nuclear receptor corepressor-like (LCORL), which is associated with height in humans and cattle. The GWAS analysis detected 39 SNPs associated with BW in sheep and a major QTL region was identified on OAR6. In several other mammalian species, regions that are syntenic with this region have been found to be associated with body size traits, which may reflect that the underlying biological mechanisms share a common ancestry. These findings should facilitate the discovery of causative variants for BW and contribute to marker-assisted selection.
MiSNPDb: a web-based genomic resources of tropical ecology fruit mango (Mangifera indica L.) for phylogeography and varietal differentiation.

PubMed

Iquebal, M A; Jaiswal, Sarika; Mahato, Ajay Kumar; Jayaswal, Pawan K; Angadi, U B; Kumar, Neeraj; Sharma, Nimisha; Singh, Anand K; Srivastav, Manish; Prakash, Jai; Singh, S K; Khan, Kasim; Mishra, Rupesh K; Rajan, Shailendra; Bajpai, Anju; Sandhya, B S; Nischita, Puttaraju; Ravishankar, K V; Dinesh, M R; Rai, Anil; Kumar, Dinesh; Sharma, Tilak R; Singh, Nagendra K

2017-11-02

Mango is one of the most important fruits of tropical ecological region of the world, well known for its nutritive value, aroma and taste. Its world production is >45MT worth >200 billion US dollars. Genomic resources are required for improvement in productivity and management of mango germplasm. There is no web-based genomic resources available for mango. Hence rapid and cost-effective high throughput putative marker discovery is required to develop such resources. RAD-based marker discovery can cater this urgent need till whole genome sequence of mango becomes available. Using a panel of 84 mango varieties, a total of 28.6 Gb data was generated by ddRAD-Seq approach on Illumina HiSeq 2000 platform. A total of 1.25 million SNPs were discovered. Phylogenetic tree using 749 common SNPs across these varieties revealed three major lineages which was compared with geographical locations. A web genomic resources MiSNPDb, available at http://webtom.cabgrid.res.in/mangosnps/ is based on 3-tier architecture, developed using PHP, MySQL and Javascript. This web genomic resources can be of immense use in the development of high density linkage map, QTL discovery, varietal differentiation, traceability, genome finishing and SNP chip development for future GWAS in genomic selection program. We report here world's first web-based genomic resources for genetic improvement and germplasm management of mango.
Results of a SNP genome screen in a large Costa Rican pedigree segregating for severe bipolar disorder.

PubMed

Service, Susan; Molina, Julio; Deyoung, Joseph; Jawaheer, Damini; Aldana, Ileana; Vu, Thuy; Araya, Carmen; Araya, Xinia; Bejarano, Julio; Fournier, Eduardo; Ramirez, Magui; Mathews, Carol A; Davanzo, Pablo; Macaya, Gabriel; Sandkuijl, Lodewijk; Sabatti, Chiara; Reus, Victor; Freimer, Nelson

2006-06-05

We have ascertained in the Central Valley of Costa Rica a new kindred (CR201) segregating for severe bipolar disorder (BP-I). The family was identified by tracing genealogical connections among eight persons initially independently ascertained for a genome wide association study of BP-I. For the genome screen in CR201, we trimmed the family down to 168 persons (82 of whom are genotyped), containing 25 individuals with a best-estimate diagnosis of BP-I. A total of 4,690 SNP markers were genotyped. Analysis of the data was hampered by the size and complexity of the pedigree, which prohibited using exact multipoint methods on the entire kindred. Two-point parametric linkage analysis, using a conservative model of transmission, produced a maximum LOD score of 2.78 on chromosome 6, and a total of 39 loci with LOD scores >1.0. Multipoint parametric and non-parametric linkage analysis was performed separately on four sections of CR201, and interesting (nominal P-value from either analysis <0.01), although not statistically significant, regions were highlighted on chromosomes 1, 2, 3, 12, 16, 19, and 22, in at least one section of the pedigree, or when considering all sections together. The difficulties of analyzing genome wide SNP data for complex disorders in large, potentially informative, kindreds are discussed.
Quantitative trait loci mapping for flowering time in a switchgrass pseudo-F2 population

USDA-ARS?s Scientific Manuscript database

Flowering is an important developmental event in switchgrass (Panicum virgatum) because the onset of flowering causes the cessation of vegetative growth and biomass accumulation. The objective of this study was to generate a linkage map using single nucleotide polymorphism (SNP) markers to identify ...
Linkage Disequilibrium And Genome-Wide Association Studies In O. sativa

USDA-ARS?s Scientific Manuscript database

There is increasing evidence that genome-wide association studies provide a powerful approach to find the genetic basis of complex phenotypic variation in all kinds of species. For this purpose, we developed the first generation 44K Affymetrix SNP array in rice (see Tung et al. poster). We genotyped...
Population genetics related to adaptation in elite oat germplasm

USDA-ARS?s Scientific Manuscript database

Six hundred thirty five oat lines and 2,635 SNP loci were used to evaluate population structure, linkage disequilibrium (LD) and genotype-phenotype association with heading date. The first five principal components (PC) accounted for 25.3% of genetic variation. Neither the eigenvalues of the first 2...
Developing a new nonbinary SNP fluorescent multiplex detection system for forensic application in China.

PubMed

Liu, Yanfang; Liao, Huidan; Liu, Ying; Guo, Juanjuan; Sun, Yi; Fu, Xiaoliang; Xiao, Ding; Cai, Jifeng; Lan, Lingmei; Xie, Pingli; Zha, Lagabaiyila

2017-04-01

Nonbinary single-nucleotide polymorphisms (SNPs) are potential forensic genetic markers because their discrimination power is greater than that of normal binary SNPs, and that they can detect highly degraded samples. We previously developed a nonbinary SNP multiplex typing assay. In this study, we selected additional 20 nonbinary SNPs from the NCBI SNP database and verified them through pyrosequencing. These 20 nonbinary SNPs were analyzed using the fluorescent-labeled SNaPshot multiplex SNP typing method. The allele frequencies and genetic parameters of these 20 nonbinary SNPs were determined among 314 unrelated individuals from Han populations from China. The total power of discrimination was 0.9999999999994, and the cumulative probability of exclusion was 0.9986. Moreover, the result of the combination of this 20 nonbinary SNP assay with the 20 nonbinary SNP assay we previously developed demonstrated that the cumulative probability of exclusion of the 40 nonbinary SNPs was 0.999991 and that no significant linkage disequilibrium was observed in all 40 nonbinary SNPs. Thus, we concluded that this new system consisting of new 20 nonbinary SNPs could provide highly informative polymorphic data which would be further used in forensic application and would serve as a potentially valuable supplement to forensic DNA analysis. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A journey from a SSR-based low density map to a SNP-based high density map for identification of disease resistance quantitative trait loci in peanut

USDA-ARS?s Scientific Manuscript database

Mapping and identification of quantitative trait loci (QTLs) are important for efficient marker-assisted breeding. Diseases such as leaf spots and Tomato spotted wilt virus (TSWV) cause significant loses to peanut growers. The U.S. Peanut Genome Initiative (PGI) was launched in 2004, and expanded to...
Measuring firm size distribution with semi-nonparametric densities

NASA Astrophysics Data System (ADS)

Cortés, Lina M.; Mora-Valencia, Andrés; Perote, Javier

2017-11-01

In this article, we propose a new methodology based on a (log) semi-nonparametric (log-SNP) distribution that nests the lognormal and enables better fits in the upper tail of the distribution through the introduction of new parameters. We test the performance of the lognormal and log-SNP distributions capturing firm size, measured through a sample of US firms in 2004-2015. Taking different levels of aggregation by type of economic activity, our study shows that the log-SNP provides a better fit of the firm size distribution. We also formally introduce the multivariate log-SNP distribution, which encompasses the multivariate lognormal, to analyze the estimation of the joint distribution of the value of the firm's assets and sales. The results suggest that sales are a better firm size measure, as indicated by other studies in the literature.
A High Density Genetic Map Derived from RAD Sequencing and Its Application in QTL Analysis of Yield-Related Traits in Vigna unguiculata

PubMed Central

Pan, Lei; Wang, Nian; Wu, Zhihua; Guo, Rui; Yu, Xiaolu; Zheng, Yu; Xia, Qiuju; Gui, Songtao; Chen, Chanyou

2017-01-01

Cowpea [Vigna unguiculata (L.) Walp.] is an annual legume of economic importance and widely grown in the semi-arid tropics. However, high-density genetic maps of cowpea are still lacking. Here, we identified 34,868 SNPs (single nucleotide polymorphisms) that were distributed in the cowpea genome based on the RAD sequencing (restriction-site associated DNA sequencing) technique using a population of 170 individuals (two cowpea parents and 168 F2:3 progenies). Of these, 17,996 reliable SNPs were allotted to 11 consensus linkage groups (LGs). The length of the genetic map was 1,194.25 cM in total with a mean distance of 0.066 cM/SNP marker locus. Using this map and the F2:3 population, combined with the CIM (composite interval mapping) method, eleven quantitative trait loci (QTL) of yield-related trait were detected on seven LGs (LG4, 5, 6, 7, 9, 10, and 11) in cowpea. These QTL explained 0.05–17.32% of the total phenotypic variation. Among these, four QTL were for pod length, four QTL for thousand-grain weight (TGW), two QTL for grain number per pod, and one QTL for carpopodium length. Our results will provide a foundation for understanding genes related to grain yield in the cowpea and genus Vigna. PMID:28936219
High-Density Genetic Map Construction and Stem Total Polysaccharide Content-Related QTL Exploration for Chinese Endemic Dendrobium (Orchidaceae)

PubMed Central

Lu, Jiangjie; Liu, Yuyang; Xu, Jing; Mei, Ziwei; Shi, Yujun; Liu, Pengli; He, Jianbo; Wang, Xiaotong; Meng, Yijun; Feng, Shangguo; Shen, Chenjia; Wang, Huizhong

2018-01-01

Plants of the Dendrobium genus are orchids with not only ornamental value but also high medicinal value. To understand the genetic basis of variations in active ingredients of the stem total polysaccharide contents (STPCs) among different Dendrobium species, it is of paramount importance to understand the mechanism of STPC formation and identify genes affecting its process at the whole genome level. Here, we report the first high-density single-nucleotide polymorphism (SNP) integrated genetic map with a good genome coverage of Dendrobium. The specific-locus amplified fragment sequencing (SLAF-seq) technology led to identification of 7,013,400 SNPs from 1,503,626 high-quality SLAF markers from two parents (Dendrobium moniliforme ♀ × Dendrobium officinale ♂) and their interspecific F1 hybrid population. The final genetic map contained 8, 573 SLAF markers, covering 19 linkage groups (LGs). This genetic map spanned a length of 2,737.49 cM, where the average distance between markers is 0.32 cM. In total, 5 quantitative trait loci (QTL) related to STPC were identified, 3 of which have candidate genes within the confidence intervals of these stable QTLs based on the D. officinale genome sequence. This study will build a foundation up for the mapping of other medicinal-related traits and provide an important reference for the molecular breeding of these Chinese herb. PMID:29636767
Association Analysis of the Ephrin-B2 Gene in African-Americans with End-Stage Renal Disease

PubMed Central

Hicks, Pamela J.; Staten, Jennifer L.; Palmer, Nicholette D.; Langefeld, Carl D.; Ziegler, Julie T.; Keene, Keith L.; Sale, Michele M.; Bowden, Donald W.; Freedman, Barry I.

2008-01-01

Background Genome scans in African-Americans with end-stage renal disease (ESRD) identified linkage on chromosome 13q33 in the region containing the ephrin-B2 ligand (EFNB2) genes. Interactions between the ephrin-B2 receptor and ephrin-B2 ligand play essential roles in renal angiogenesis, blood vessel maturation, and kidney disease. Methods The EFNB2 gene was evaluated as a positional candidate for non-diabetic and diabetic ESRD susceptibility in 1,071 unrelated African-American subjects; 316 with non-diabetic etiologies of ESRD, 394 with type 2 diabetes-associated ESRD and 361 healthy controls. Single nucleotide polymorphism (SNP) genotyping was performed on the Sequenom Mass Array System. Statistical analyses were computed using Dandelion version 1.26, Snpaddmix version 1.4 and Haploview version 3.32. Results Twenty-eight HapMap tag SNPs were genotyped spanning the 39 kilobases (kb) of the EFNB2 coding region, with average spacing of 1.43 kb. Analysis of 710 ESRD patient samples and 361 controls provided no evidence of single SNP associations in either diabetic or non-diabetic ESRD; although nominal evidence of association with all-cause ESRD was observed with a two SNP (p = 0.022) and three SNP (p = 0.023) haplotype, both containing SNPs rs7490924 and rs2391335 in intron 1. Conclusions Although an attractive positional candidate gene, polymorphisms in the EFNB2 gene do not appear to contribute in a substantial way to non-diabetic, diabetic or all-cause ESRD susceptibility in African-Americans. Additional genes within the chromosome 13q33 linkage interval are likely contributors to African-American non-diabetic ESRD. PMID:18580054

Association of methionine synthase gene polymorphisms with wool production and quality traits in Chinese Merino population.

PubMed

Rong, E G; Yang, H; Zhang, Z W; Wang, Z P; Yan, X H; Li, H; Wang, N

2015-10-01

Methionine synthase (MTR) plays a crucial role in maintaining homeostasis of intracellular methionine, folate, and homocysteine, and its activity correlates with DNA methylation in many mammalian tissues. Our previous genomewide association study identified that 1 SNP located in the gene was associated with several wool production and quality traits in Chinese Merino. To confirm the potential involvement of the gene in sheep wool production and quality traits, we performed sheep tissue expression profiling, SNP detection, and association analysis with sheep wool production and quality traits. The semiquantitative reverse transcription PCR analysis showed that the gene was differentially expressed in skin from Merino and Kazak sheep. The sequencing analysis identified a total of 13 SNP in the gene from Chinese Merino sheep. Comparison of the allele frequencies revealed that these 13 identified SNP were significantly different among the 6 tested Chinese Merino strains ( < 0.001). Linkage disequilibrium analysis showed that SNP 3 to 11 were strongly linked in a single haplotype block in the tested population. Association analysis showed that SNP 2 to 11 were significantly associated with the average wool fiber diameter and the fineness SD and that SNP 4 to 11 were significantly associated with the CV of fiber diameter trait ( < 0.05). Single nucleotide polymorphism 2 and SNP 5 to 12 were weakly associated with wool crimp. Similarly, the haplotypes derived from these 13 identified SNP were also significantly associated with the average wool fiber diameter, fineness SD, and the CV of fiber diameter ( < 0.05). Our results suggest that is a candidate gene for sheep wool production and quality traits, and the identified SNP might be used in sheep breeding.
High-density genetic map using whole-genome resequencing for fine mapping and candidate gene discovery for disease resistance in peanut.

PubMed

Agarwal, Gaurav; Clevenger, Josh; Pandey, Manish K; Wang, Hui; Shasidhar, Yaduru; Chu, Ye; Fountain, Jake C; Choudhary, Divya; Culbreath, Albert K; Liu, Xin; Huang, Guodong; Wang, Xingjun; Deshmukh, Rupesh; Holbrook, C Corley; Bertioli, David J; Ozias-Akins, Peggy; Jackson, Scott A; Varshney, Rajeev K; Guo, Baozhu

2018-04-10

Whole-genome resequencing (WGRS) of mapping populations has facilitated development of high-density genetic maps essential for fine mapping and candidate gene discovery for traits of interest in crop species. Leaf spots, including early leaf spot (ELS) and late leaf spot (LLS), and Tomato spotted wilt virus (TSWV) are devastating diseases in peanut causing significant yield loss. We generated WGRS data on a recombinant inbred line population, developed a SNP-based high-density genetic map, and conducted fine mapping, candidate gene discovery and marker validation for ELS, LLS and TSWV. The first sequence-based high-density map was constructed with 8869 SNPs assigned to 20 linkage groups, representing 20 chromosomes, for the 'T' population (Tifrunner × GT-C20) with a map length of 3120 cM and an average distance of 1.45 cM. The quantitative trait locus (QTL) analysis using high-density genetic map and multiple season phenotyping data identified 35 main-effect QTLs with phenotypic variation explained (PVE) from 6.32% to 47.63%. Among major-effect QTLs mapped, there were two QTLs for ELS on B05 with 47.42% PVE and B03 with 47.38% PVE, two QTLs for LLS on A05 with 47.63% and B03 with 34.03% PVE and one QTL for TSWV on B09 with 40.71% PVE. The epistasis and environment interaction analyses identified significant environmental effects on these traits. The identified QTL regions had disease resistance genes including R-genes and transcription factors. KASP markers were developed for major QTLs and validated in the population and are ready for further deployment in genomics-assisted breeding in peanut. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Genomic Prediction of Resistance to Pasteurellosis in Gilthead Sea Bream (Sparus aurata) Using 2b-RAD Sequencing

PubMed Central

Palaiokostas, Christos; Ferraresso, Serena; Franch, Rafaella; Houston, Ross D.; Bargelloni, Luca

2016-01-01

Gilthead sea bream (Sparus aurata) is a species of paramount importance to the Mediterranean aquaculture industry, with an annual production exceeding 140,000 metric tons. Pasteurellosis due to the Gram-negative bacterium Photobacterium damselae subsp. piscicida (Phdp) causes significant mortality, especially during larval and juvenile stages, and poses a serious threat to bream production. Selective breeding for improved resistance to pasteurellosis is a promising avenue for disease control, and the use of genetic markers to predict breeding values can improve the accuracy of selection, and allow accurate calculation of estimated breeding values of nonchallenged animals. In the current study, a population of 825 sea bream juveniles, originating from a factorial cross between 67 broodfish (32 sires, 35 dams), were challenged by 30 min immersion with 1 × 105 CFU virulent Phdp. Mortalities and survivors were recorded and sampled for genotyping by sequencing. The restriction-site associated DNA sequencing approach, 2b-RAD, was used to generate genome-wide single nucleotide polymorphism (SNP) genotypes for all samples. A high-density linkage map containing 12,085 SNPs grouped into 24 linkage groups (consistent with the karyotype) was constructed. The heritability of surviving days (censored data) was 0.22 (95% highest density interval: 0.11–0.36) and 0.28 (95% highest density interval: 0.17–0.4) using the pedigree and the genomic relationship matrix respectively. A genome-wide association study did not reveal individual SNPs significantly associated with resistance at a genome-wide significance level. Genomic prediction approaches were tested to investigate the potential of the SNPs obtained by 2b-RAD for estimating breeding values for resistance. The accuracy of the genomic prediction models (r = 0.38–0.46) outperformed the traditional BLUP approach based on pedigree records (r = 0.30). Overall results suggest that major quantitative trait loci affecting resistance to pasteurellosis were not present in this population, but highlight the effectiveness of 2b-RAD genotyping by sequencing for genomic selection in a mass spawning fish species. PMID:27652890
High Density Linkage Map Construction and QTL Detection for Three Silique-Related Traits in Orychophragmus violaceus Derived Brassica napus Population.

PubMed

Yang, Yi; Shen, Yusen; Li, Shunda; Ge, Xianhong; Li, Zaiyun

2017-01-01

Seeds per silique (SS), seed weight (SW), and silique length (SL) are important determinant traits of seed yield potential in rapeseed ( Brassica napus L.), and are controlled by naturally occurring quantitative trait loci (QTLs). Mapping QTLs to narrow chromosomal regions provides an effective means of characterizing the genetic basis of these complex traits. Orychophragmus violaceus is a crucifer with long siliques, many SS, and heavy seeds. A novel B. napus introgression line with many SS was previously selected from multiple crosses ( B. rapa ssp. chinesis × O. violaceus ) × B. napus . In present study, a doubled haploid (DH) population with 167 lines was established from a cross between the introgression line and a line with far fewer SS, in order to detect QTLs for silique-related traits. By screening with a Brassica 60K single nucleotide polymorphism (SNP) array, a high-density linkage map consisting of 1,153 bins and spanning a cumulative length of 2,209.1 cM was constructed, using 12,602 high-quality polymorphic SNPs in the DH population. The average recombination bin densities of the A and C subgenomes were 1.7 and 2.4 cM, respectively. 45 QTLs were identified for the three traits in all, which explained 4.0-34.4% of the total phenotypic variation; 20 of them were integrated into three unique QTLs by meta-analysis. These unique QTLs revealed a significant positive correlation between SS and SL and a significant negative correlation between SW and SS, and were mapped onto the linkage groups A05, C08, and C09. A trait-by-trait meta-analysis revealed eight, four, and seven consensus QTLs for SS, SW, and SL, respectively, and five major QTLs ( cqSS.A09b, cqSS.C09, cqSW.A05, cqSW.C09 , and cqSL.C09 ) were identified. Five, three, and four QTLs for SS, SW, and SL, respectively, might be novel QTLs because of the existence of alien genetic loci for these traits in the alien introgression. Thirty-eight candidate genes underlying nine QTLs for silique-related traits were identified.
Optimal design of low-density SNP arrays for genomic prediction: algorithm and applications

USDA-ARS?s Scientific Manuscript database

Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for their optimal design. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optim...
Conclusive evidence for hexasomic inheritance in chrysanthemum based on analysis of a 183 k SNP array.

PubMed

van Geest, Geert; Voorrips, Roeland E; Esselink, Danny; Post, Aike; Visser, Richard Gf; Arens, Paul

2017-08-07

Cultivated chrysanthemum is an outcrossing hexaploid (2n = 6× = 54) with a disputed mode of inheritance. In this paper, we present a single nucleotide polymorphism (SNP) selection pipeline that was used to design an Affymetrix Axiom array with 183 k SNPs from RNA sequencing data (1). With this array, we genotyped four bi-parental populations (with sizes of 405, 53, 76 and 37 offspring plants respectively), and a cultivar panel of 63 genotypes. Further, we present a method for dosage scoring in hexaploids from signal intensities of the array based on mixture models (2) and validation of selection steps in the SNP selection pipeline (3). The resulting genotypic data is used to draw conclusions on the mode of inheritance in chrysanthemum (4), and to make an inference on allelic expression bias (5). With use of the mixture model approach, we successfully called the dosage of 73,936 out of 183,130 SNPs (40.4%) that segregated in any of the bi-parental populations. To investigate the mode of inheritance, we analysed markers that segregated in the large bi-parental population (n = 405). Analysis of segregation of duplex x nulliplex SNPs resulted in evidence for genome-wide hexasomic inheritance. This evidence was substantiated by the absence of strong linkage between markers in repulsion, which indicated absence of full disomic inheritance. We present the success rate of SNP discovery out of RNA sequencing data as affected by different selection steps, among which SNP coverage over genotypes and use of different types of sequence read mapping software. Genomic dosage highly correlated with relative allele coverage from the RNA sequencing data, indicating that most alleles are expressed according to their genomic dosage. The large population, genotyped with a very large number of markers, is a unique framework for extensive genetic analyses in hexaploid chrysanthemum. As starting point, we show conclusive evidence for genome-wide hexasomic inheritance.
A novel non-coding RNA within an intron of CDH2 and association of its SNP with non-syndromic cleft lip and palate.

PubMed

Kumari, Priyanka; Singh, Subodh Kumar; Raman, Rajiva

2018-06-05

Genome-wide linkage analysis and whole genome sequencing in a Van der Woude syndrome (VWS) family revealed that the SNP, rs539075, within intron 2 of the cadherin 2 gene (CDH2) co-segregated with the disease phenotype. A study with nonsyndromic cleft lip with or without cleft palate (NSCL ± P) cases (N = 292) and controls (N = 287) established association of this SNP with NSCL ± P as a risk factor. RT-PCR based expression analysis of the SNP-harbouring region of intron 2 of CDH2 in the clefted lip and/or palate tissues of 16 patients revealed that the mutant allele expressed in all those individuals having it (hetero-/homozygous), whereas the wild type allele expressed in <50% of the samples in which it was present. The intronic transcript was also present in the prospective lip and palate region of 13.5 dpc mouse embryo, detected by RNA in situ hybridization and RT-PCR. These results including the in silico, characterization of the ~200 nt-intronic transcript showed that conformationally it fits best with noncoding small RNA, possibly a precursor of miRNA. Its function in the orofacial organogenesis remains to be elucidated which will enable us to define the role of this mutant ncRNA in the clefting of lip and palate. Copyright © 2018 Elsevier B.V. All rights reserved.
A framework linkage map of perennial ryegrass based on SSR markers

Treesearch

G.P. Gill; P.L. Wilcox; D.J. Whittaker; R.A. Winz; P. Bickerstaff; Craig E. Echt; J. Kent; M.O. Humphreys; K.M. Elborough; R.C. Gardner

2006-01-01

A moderate-density linkage map for Lolium perenne L. has been constructed based on 376 simple sequence repeat (SSR) markers. Approximately one third ( 124) of the SSR markers were developed from GeneThresher libraries that preferentially select genomic DNA clones from the gene-rich unmethylated portion of the genome. The remaining SSR marker loci...
In Vitro vs In Silico Detected SNPs for the Development of a Genotyping Array: What Can We Learn from a Non-Model Species?

PubMed Central

Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe

2010-01-01

Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome. PMID:20543950
Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design.

PubMed

Goonetilleke, Shashi N; March, Timothy J; Wirthensohn, Michelle G; Arús, Pere; Walker, Amanda R; Mather, Diane E

2018-01-04

In crop plant genetics, linkage maps provide the basis for the mapping of loci that affect important traits and for the selection of markers to be applied in crop improvement. In outcrossing species such as almond ( Prunus dulcis Mill. D. A. Webb), application of a double pseudotestcross mapping approach to the F 1 progeny of a biparental cross leads to the construction of a linkage map for each parent. Here, we report on the application of genotyping by sequencing to discover and map single nucleotide polymorphisms in the almond cultivars "Nonpareil" and "Lauranne." Allele-specific marker assays were developed for 309 tag pairs. Application of these assays to 231 Nonpareil × Lauranne F 1 progeny provided robust linkage maps for each parent. Analysis of phenotypic data for shell hardness demonstrated the utility of these maps for quantitative trait locus mapping. Comparison of these maps to the peach genome assembly confirmed high synteny and collinearity between the peach and almond genomes. The marker assays were applied to progeny from several other Nonpareil crosses, providing the basis for a composite linkage map of Nonpareil. Applications of the assays to a panel of almond clones and a panel of rootstocks used for almond production demonstrated the broad applicability of the markers and provide subsets of markers that could be used to discriminate among accessions. The sequence-based linkage maps and single nucleotide polymorphism assays presented here could be useful resources for the genetic analysis and genetic improvement of almond. Copyright © 2018 Goonetilleke et al.
Genotyping-by-sequencing enables linkage mapping in three octoploid cultivated strawberry families

PubMed Central

Salinas, Natalia; Tennessen, Jacob A.; Zurn, Jason D.; Sargent, Daniel James; Hancock, James; Bassil, Nahla V.

2017-01-01

Genotyping-by-sequencing (GBS) was used to survey genome-wide single-nucleotide polymorphisms (SNPs) in three biparental strawberry (Fragaria × ananassa) populations with the goal of evaluating this technique in a species with a complex octoploid genome. GBS sequence data were aligned to the F. vesca ‘Fvb’ reference genome in order to call SNPs. Numbers of polymorphic SNPs per population ranged from 1,163 to 3,190. Linkage maps consisting of 30–65 linkage groups were produced from the SNP sets derived from each parent. The linkage groups covered 99% of the Fvb reference genome, with three to seven linkage groups from a given parent aligned to any particular chromosome. A phylogenetic analysis performed using the POLiMAPS pipeline revealed linkage groups that were most similar to ancestral species F. vesca for each chromosome. Linkage groups that were most similar to a second ancestral species, F. iinumae, were only resolved for Fvb 4. The quantity of missing data and heterogeneity in genome coverage inherent in GBS complicated the analysis, but POLiMAPS resolved F. × ananassa chromosomal regions derived from diploid ancestor F. vesca. PMID:28875078
Increasing the number of single nucleotide polymorphisms used in genomic evaluations of dairy cattle

USDA-ARS?s Scientific Manuscript database

A small increase in the accuracy of genomic evaluations of dairy cattle was achieved by increasing the number of SNP used to 61,013. All the 45,195 SNP used previously were retained, and 15,818 SNP were selected from higher density genotyping chips if the magnitude of the SNP effect was among the to...
[Hereditary motor and sensory neuropathy with proximal dominant involvement (HMSN-P) is caused by a mutation in TFG].

PubMed

Ishiura, Hiroyuki; Tsuji, Shoji

2013-01-01

Hereditary motor and sensory neuropathy with proximal dominant involvement (HMSN-P) is an autosomal dominant neurodegenerative disease characterized by proximal predominant weakness and muscle atrophy accompanied by distal sensory disturbance. Linkage analysis using 4 families identified a region on chromosome 3 showing a LOD score exceeding 4. Further refinement of candidate region was performed by haplotype analysis using high-density SNP data, resulting in a minimum candidate region spanning 3.3 Mb. Exome analysis of an HMSN-P patient revealed a mutation (c.854C>T, p.Pro285Leu) in TRK-fused gene (TFG). The identical mutation was found in the four families, which cosegregated with the disease. The mutation was neither found in Japanese control subjects nor public databases. Detailed haplotype analysis suggested two independent origins of the mutation. These findings indicate that the mutation in TFG causes HMSN-P.
Mapping of the Gynoecy in Bitter Gourd (Momordica charantia) Using RAD-Seq Analysis

PubMed Central

Matsumura, Hideo; Miyagi, Norimichi; Taniai, Naoki; Fukushima, Mai; Tarora, Kazuhiko; Shudo, Ayano; Urasaki, Naoya

2014-01-01

Momordica charantia is a monoecious plant of the Cucurbitaceae family that has both male and female unisexual flowers. Its unique gynoecious line, OHB61-5, is essential as a maternal parent in the production of F1 cultivars. To identify the DNA markers for this gynoecy, a RAD-seq (restriction-associated DNA tag sequencing) analysis was employed to reveal genome-wide DNA polymorphisms and to genotype the F2 progeny from a cross between OHB61-5 and a monoecious line. Based on a RAD-seq analysis of F2 individuals, a linkage map was constructed using 552 co-dominant markers. In addition, after analyzing the pooled genomic DNA from monoecious or gynoecious F2 plants, several SNP loci that are genetically linked to gynoecy were identified. GTFL-1, the closest SNP locus to the putative gynoecious locus, was converted to a conventional DNA marker using invader assay technology, which is applicable to the marker-assisted selection of gynoecy in M. charantia breeding. PMID:24498029
A powerful tool for genome analysis in maize: development and evaluation of the high density 600 k SNP genotyping array.

PubMed

Unterseer, Sandra; Bauer, Eva; Haberer, Georg; Seidel, Michael; Knaak, Carsten; Ouzunova, Milena; Meitinger, Thomas; Strom, Tim M; Fries, Ruedi; Pausch, Hubert; Bertani, Christofer; Davassi, Alessandro; Mayer, Klaus Fx; Schön, Chris-Carolin

2014-09-29

High density genotyping data are indispensable for genomic analyses of complex traits in animal and crop species. Maize is one of the most important crop plants worldwide, however a high density SNP genotyping array for analysis of its large and highly dynamic genome was not available so far. We developed a high density maize SNP array composed of 616,201 variants (SNPs and small indels). Initially, 57 M variants were discovered by sequencing 30 representative temperate maize lines and then stringently filtered for sequence quality scores and predicted conversion performance on the array resulting in the selection of 1.2 M polymorphic variants assayed on two screening arrays. To identify high-confidence variants, 285 DNA samples from a broad genetic diversity panel of worldwide maize lines including the samples used for sequencing, important founder lines for European maize breeding, hybrids, and proprietary samples with European, US, semi-tropical, and tropical origin were used for experimental validation. We selected 616 k variants according to their performance during validation, support of genotype calls through sequencing data, and physical distribution for further analysis and for the design of the commercially available Affymetrix® Axiom® Maize Genotyping Array. This array is composed of 609,442 SNPs and 6,759 indels. Among these are 116,224 variants in coding regions and 45,655 SNPs of the Illumina® MaizeSNP50 BeadChip for study comparison. In a subset of 45,974 variants, apart from the target SNP additional off-target variants are detected, which show only a minor bias towards intermediate allele frequencies. We performed principal coordinate and admixture analyses to determine the ability of the array to detect and resolve population structure and investigated the extent of LD within a worldwide validation panel. The high density Affymetrix® Axiom® Maize Genotyping Array is optimized for European and American temperate maize and was developed based on a diverse sample panel by applying stringent quality filter criteria to ensure its suitability for a broad range of applications. With 600 k variants it is the largest currently publically available genotyping array in crop species.
Identification of Pyrus single nucleotide polymorphisms (SNPs) and evaluation for genetic mapping in European pear and interspecific Pyrus hybrids.

PubMed

Montanari, Sara; Saeed, Munazza; Knäbel, Mareike; Kim, YoonKyeong; Troggio, Michela; Malnoy, Mickael; Velasco, Riccardo; Fontana, Paolo; Won, KyungHo; Durel, Charles-Eric; Perchepied, Laure; Schaffer, Robert; Wiedow, Claudia; Bus, Vincent; Brewer, Lester; Gardiner, Susan E; Crowhurst, Ross N; Chagné, David

2013-01-01

We have used new generation sequencing (NGS) technologies to identify single nucleotide polymorphism (SNP) markers from three European pear (Pyrus communis L.) cultivars and subsequently developed a subset of 1096 pear SNPs into high throughput markers by combining them with the set of 7692 apple SNPs on the IRSC apple Infinium® II 8K array. We then evaluated this apple and pear Infinium® II 9K SNP array for large-scale genotyping in pear across several species, using both pear and apple SNPs. The segregating populations employed for array validation included a segregating population of European pear ('Old Home'×'Louise Bon Jersey') and four interspecific breeding families derived from Asian (P. pyrifolia Nakai and P. bretschneideri Rehd.) and European pear pedigrees. In total, we mapped 857 polymorphic pear markers to construct the first SNP-based genetic maps for pear, comprising 78% of the total pear SNPs included in the array. In addition, 1031 SNP markers derived from apple (13% of the total apple SNPs included in the array) were polymorphic and were mapped in one or more of the pear populations. These results are the first to demonstrate SNP transferability across the genera Malus and Pyrus. Our construction of high density SNP-based and gene-based genetic maps in pear represents an important step towards the identification of chromosomal regions associated with a range of horticultural characters, such as pest and disease resistance, orchard yield and fruit quality.
Systems genetics of obesity in an F2 pig model by genome-wide association, genetic network, and pathway analyses

PubMed Central

Kogelman, Lisette J. A.; Pant, Sameer D.; Fredholm, Merete; Kadarmideen, Haja N.

2014-01-01

Obesity is a complex condition with world-wide exponentially rising prevalence rates, linked with severe diseases like Type 2 Diabetes. Economic and welfare consequences have led to a raised interest in a better understanding of the biological and genetic background. To date, whole genome investigations focusing on single genetic variants have achieved limited success, and the importance of including genetic interactions is becoming evident. Here, the aim was to perform an integrative genomic analysis in an F2 pig resource population that was constructed with an aim to maximize genetic variation of obesity-related phenotypes and genotyped using the 60K SNP chip. Firstly, Genome Wide Association (GWA) analysis was performed on the Obesity Index to locate candidate genomic regions that were further validated using combined Linkage Disequilibrium Linkage Analysis and investigated by evaluation of haplotype blocks. We built Weighted Interaction SNP Hub (WISH) and differentially wired (DW) networks using genotypic correlations amongst obesity-associated SNPs resulting from GWA analysis. GWA results and SNP modules detected by WISH and DW analyses were further investigated by functional enrichment analyses. The functional annotation of SNPs revealed several genes associated with obesity, e.g., NPC2 and OR4D10. Moreover, gene enrichment analyses identified several significantly associated pathways, over and above the GWA study results, that may influence obesity and obesity related diseases, e.g., metabolic processes. WISH networks based on genotypic correlations allowed further identification of various gene ontology terms and pathways related to obesity and related traits, which were not identified by the GWA study. In conclusion, this is the first study to develop a (genetic) obesity index and employ systems genetics in a porcine model to provide important insights into the complex genetic architecture associated with obesity and many biological pathways that underlie it. PMID:25071839
Multiple loci on 8q24 associated with prostate cancer susceptibility.

PubMed

Al Olama, Ali Amin; Kote-Jarai, Zsofia; Giles, Graham G; Guy, Michelle; Morrison, Jonathan; Severi, Gianluca; Leongamornlert, Daniel A; Tymrakiewicz, Malgorzata; Jhavar, Sameer; Saunders, Ed; Hopper, John L; Southey, Melissa C; Muir, Kenneth R; English, Dallas R; Dearnaley, David P; Ardern-Jones, Audrey T; Hall, Amanda L; O'Brien, Lynne T; Wilkinson, Rosemary A; Sawyer, Emma; Lophatananon, Artitaya; Horwich, Alan; Huddart, Robert A; Khoo, Vincent S; Parker, Christopher C; Woodhouse, Christopher J; Thompson, Alan; Christmas, Tim; Ogden, Chris; Cooper, Colin; Donovan, Jenny L; Hamdy, Freddie C; Neal, David E; Eeles, Rosalind A; Easton, Douglas F

2009-10-01

Previous studies have identified multiple loci on 8q24 associated with prostate cancer risk. We performed a comprehensive analysis of SNP associations across 8q24 by genotyping tag SNPs in 5,504 prostate cancer cases and 5,834 controls. We confirmed associations at three previously reported loci and identified additional loci in two other linkage disequilibrium blocks (rs1006908: per-allele OR = 0.87, P = 7.9 x 10(-8); rs620861: OR = 0.90, P = 4.8 x 10(-8)). Eight SNPs in five linkage disequilibrium blocks were independently associated with prostate cancer susceptibility.
Genetic variation in the human vitamin D receptor is associated with muscle strength, fat mass and body weight in Swedish women.

PubMed

Grundberg, Elin; Brändström, Helena; Ribom, Eva L; Ljunggren, Osten; Mallmin, Hans; Kindmark, Andreas

2004-03-01

Bone mineral density (BMD) is under strong genetic control and a number of candidate genes have been associated with BMD. Both muscle strength and body weight are considered to be important predictors of BMD but far less is known about the genes affecting muscle strength and fat mass. The purpose of this study was to investigate the poly adenosine (A) repeat and the BsmI SNP in the vitamin D receptor (VDR) in relation to muscle strength and body composition in healthy women. A population-based study of 175 healthy women aged 20-39 years was used. The polymorphic regions in the VDR gene (the poly A repeat and the BsmI SNP) were amplified by PCR. Body mass measurements (fat mass, lean mass, body weight and body mass index) and muscle strength (quadriceps, hamstring and grip strength) were evaluated. Individuals with shorter poly A repeat, ss and/or absence of the linked BsmI restriction site (BB) have higher hamstring strength (ss vs LL, P=0.02), body weight (ss vs LL, P=0.049) and fat mass (ss vs LL, P=0.04) compared with women with a longer poly A repeat (LL) and/or the presence of the linked BsmI restriction site (bb). Genetic variation in the VDR is correlated with muscle strength, fat mass and body weight in premenopausal women. Further functional studies on the poly A microsatellite are needed to elucidate whether this is the functionally relevant locus or if the polymorphism is in linkage disequilibrium with a functional variant in a closely situated gene further downstream of the VDR 3'UTR.
GStream: Improving SNP and CNV Coverage on Genome-Wide Association Studies

PubMed Central

Alonso, Arnald; Marsal, Sara; Tortosa, Raül; Canela-Xandri, Oriol; Julià, Antonio

2013-01-01

We present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves the results obtained by previous state-of-the-art methods and yields an accuracy that is close to that obtained by purely CNV-oriented technologies like Comparative Genomic Hybridization (CGH). We demonstrate the superior performance of GStream using microarray data generated from HapMap samples. Using the reference CNV calls generated by the 1000 Genomes Project (1KGP) and well-known studies on whole genome CNV characterization based either on CGH or genotyping microarray technologies, we show that GStream can increase the number of reliably detected variants up to 25% compared to previously developed methods. Furthermore, the increased genome coverage provided by GStream allows the discovery of CNVs in close linkage disequilibrium with SNPs, previously associated with disease risk in published Genome-Wide Association Studies (GWAS). These results could provide important insights into the biological mechanism underlying the detected disease risk association. With GStream, large-scale GWAS will not only benefit from the combined genotyping of SNPs and CNVs at an unprecedented accuracy, but will also take advantage of the computational efficiency of the method. PMID:23844243

Development of 101 novel EST-derived single nucleotide polymorphism markers for Zhikong scallop ( Chlamys farreri)

NASA Astrophysics Data System (ADS)

Li, Jiqin; Bao, Zhenmin; Li, Ling; Wang, Xiaojian; Wang, Shi; Hu, Xiaoli

2013-09-01

Zhikong scallop ( Chlamys farreri) is an important maricultured species in China. Many researches on this species, such as population genetics and QTL fine-mapping, need a large number of molecular markers. In this study, based on the expressed sequence tags (EST), a total of 300 putative single nucleotide polymorphisms (SNPs) were selected and validated using high resolution melting (HRM) technology with unlabeled probe. Of them, 101 (33.7%) were found to be polymorphic in 48 individuals from 4 populations. Further evaluation with 48 individuals from Qingdao population showed that all the polymorphic loci had two alleles with the minor allele frequency ranged from 0.046 to 0.500. The observed and expected heterozygosities ranged from 0.000 to 0.925 and from 0.089 to 0.505, respectively. Fifteen loci deviated significantly from Hardy-Weinberg equilibrium and significant linkage disequilibrate was detected in one pair of markers. BLASTx gave significant hits for 72 of the 101 polymorphic SNP-containing ESTs. Thirty four polymorphic SNP loci were predicted to be non-synonymous substitutions as they caused either the change of codons (33 SNPs) or pretermination of translation (1 SNP). The markers developed can be used for the population studies and genetic improvement on Zhikong scallop.
Association analysis of 528 intra-genic SNPs in a region of chromosome 10 linked to late onset Alzheimer's disease.

PubMed

Morgan, A R; Hamilton, G; Turic, D; Jehu, L; Harold, D; Abraham, R; Hollingworth, P; Moskvina, V; Brayne, C; Rubinsztein, D C; Lynch, A; Lawlor, B; Gill, M; O'Donovan, M; Powell, J; Lovestone, S; Williams, J; Owen, M J

2008-09-05

Late-onset Alzheimer's disease (LOAD) is a genetically complex neurodegenerative disorder. Currently, only the epsilon4 allele of the Apolipoprotein E gene has been identified unequivocally as a genetic susceptibility factor for LOAD. Others remain to be found. In 2002 we observed genome-wide significant evidence of linkage to a region on chromosome 10q11.23-q21.3 [Myers et al. (2002) Am J Med Genet 114:235-244]. Our objective in this study was to test every gene within the maximum LOD-1 linkage region, for association with LOAD. We obtained results for 528 SNPs from 67 genes, with an average density of 1 SNP every 10 kb within the genes. We demonstrated nominally significant association with LOAD for 4 SNPs: rs1881747 near DKK1 (P = 0.011, OR = 1.24), rs2279420 in ANK3 (P = 0.022, OR = 0.79), rs2306402 in CTNNA3 (P = 0.024, OR = 1.18), and rs5030882 in CXXC6 (P = 0.046, OR = 1.29) in 1,160 cases and 1,389 controls. These results would not survive correction for multiple testing but warrant attempts at confirmation in independent samples. 2007 Wiley-Liss, Inc.
Genomic selection and complex trait prediction using a fast EM algorithm applied to genome-wide markers

PubMed Central

2010-01-01

Background The information provided by dense genome-wide markers using high throughput technology is of considerable potential in human disease studies and livestock breeding programs. Genome-wide association studies relate individual single nucleotide polymorphisms (SNP) from dense SNP panels to individual measurements of complex traits, with the underlying assumption being that any association is caused by linkage disequilibrium (LD) between SNP and quantitative trait loci (QTL) affecting the trait. Often SNP are in genomic regions of no trait variation. Whole genome Bayesian models are an effective way of incorporating this and other important prior information into modelling. However a full Bayesian analysis is often not feasible due to the large computational time involved. Results This article proposes an expectation-maximization (EM) algorithm called emBayesB which allows only a proportion of SNP to be in LD with QTL and incorporates prior information about the distribution of SNP effects. The posterior probability of being in LD with at least one QTL is calculated for each SNP along with estimates of the hyperparameters for the mixture prior. A simulated example of genomic selection from an international workshop is used to demonstrate the features of the EM algorithm. The accuracy of prediction is comparable to a full Bayesian analysis but the EM algorithm is considerably faster. The EM algorithm was accurate in locating QTL which explained more than 1% of the total genetic variation. A computational algorithm for very large SNP panels is described. Conclusions emBayesB is a fast and accurate EM algorithm for implementing genomic selection and predicting complex traits by mapping QTL in genome-wide dense SNP marker data. Its accuracy is similar to Bayesian methods but it takes only a fraction of the time. PMID:20969788
The SNP g.1311T>C associated with the absence of β-casein in goat milk influences CSN2 promoter activity.

PubMed

Cosenza, G; Iannaccone, M; Pico, B A; Ramunno, L; Capparelli, R

2016-10-01

Quantitative individual differences in the amount of β-casein in goat milk are determined by at least nine alleles. In particular, two alleles (CSN2(0) and CSN2(01) ) are associated with an undetectable amount of this protein in milk. The CSN2(01) allele is characterized by a single nucleotide substitution at position 373 of the seventh exon (AJ011018:g.8915C>T), responsible for the formation of a premature stop codon at the 182 position. Herein, we report the contribution of the SNP g.1311T>C, which demonstrates a linkage with the SNP AJ011018:g.8915C>T, to the promoter transcriptional activity. Particularly, we indicate that the nucleotide C at position 1311 negatively affects the promoter activity of the CSN2 gene. © 2016 Stichting International Foundation for Animal Genetics.
Predictive ability of direct genomic values for lifetime net merit of Holstein sires using selected subsets of single nucleotide polymorphism markers.

PubMed

Weigel, K A; de los Campos, G; González-Recio, O; Naya, H; Wu, X L; Long, N; Rosa, G J M; Gianola, D

2009-10-01

The objective of the present study was to assess the predictive ability of subsets of single nucleotide polymorphism (SNP) markers for development of low-cost, low-density genotyping assays in dairy cattle. Dense SNP genotypes of 4,703 Holstein bulls were provided by the USDA Agricultural Research Service. A subset of 3,305 bulls born from 1952 to 1998 was used to fit various models (training set), and a subset of 1,398 bulls born from 1999 to 2002 was used to evaluate their predictive ability (testing set). After editing, data included genotypes for 32,518 SNP and August 2003 and April 2008 predicted transmitting abilities (PTA) for lifetime net merit (LNM$), the latter resulting from progeny testing. The Bayesian least absolute shrinkage and selection operator method was used to regress August 2003 PTA on marker covariates in the training set to arrive at estimates of marker effects and direct genomic PTA. The coefficient of determination (R(2)) from regressing the April 2008 progeny test PTA of bulls in the testing set on their August 2003 direct genomic PTA was 0.375. Subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP were created by choosing equally spaced and highly ranked SNP, with the latter based on the absolute value of their estimated effects obtained from the training set. The SNP effects were re-estimated from the training set for each subset of SNP, and the 2008 progeny test PTA of bulls in the testing set were regressed on corresponding direct genomic PTA. The R(2) values for subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP with largest effects (evenly spaced SNP) were 0.184 (0.064), 0.236 (0.111), 0.269 (0.190), 0.289 (0.179), 0.307 (0.228), 0.313 (0.268), and 0.322 (0.291), respectively. These results indicate that a low-density assay comprising selected SNP could be a cost-effective alternative for selection decisions and that significant gains in predictive ability may be achieved by increasing the number of SNP allocated to such an assay from 300 or fewer to 1,000 or more.
Development and Applications of a Bovine 50,000 SNP Chip

USDA-ARS?s Scientific Manuscript database

To develop an Illumina iSelect high density single nucleotide polymorphism (SNP) assay for cattle, the collaborative iBMC (Illumina, USDA ARS Beltsville, University of Missouri, USDA ARS Clay Center) Consortium first performed a de novo SNP discovery project in which genomic reduced representation l...
Joint effect of unlinked genotypes: application to type 2 diabetes in the EPIC-Potsdam case-cohort study.

PubMed

Knüppel, Sven; Meidtner, Karina; Arregui, Maria; Holzhütter, Hermann-Georg; Boeing, Heiner

2015-07-01

Analyzing multiple single nucleotide polymorphisms (SNPs) is a promising approach to finding genetic effects beyond single-locus associations. We proposed the use of multilocus stepwise regression (MSR) to screen for allele combinations as a method to model joint effects, and compared the results with the often used genetic risk score (GRS), conventional stepwise selection, and the shrinkage method LASSO. In contrast to MSR, the GRS, conventional stepwise selection, and LASSO model each genotype by the risk allele doses. We reanalyzed 20 unlinked SNPs related to type 2 diabetes (T2D) in the EPIC-Potsdam case-cohort study (760 cases, 2193 noncases). No SNP-SNP interactions and no nonlinear effects were found. Two SNP combinations selected by MSR (Nagelkerke's R² = 0.050 and 0.048) included eight SNPs with mean allele combination frequency of 2%. GRS and stepwise selection selected nearly the same SNP combinations consisting of 12 and 13 SNPs (Nagelkerke's R² ranged from 0.020 to 0.029). LASSO showed similar results. The MSR method showed the best model fit measured by Nagelkerke's R² suggesting that further improvement may render this method a useful tool in genetic research. However, our comparison suggests that the GRS is a simple way to model genetic effects since it does not consider linkage, SNP-SNP interactions, and no non-linear effects. © 2015 John Wiley & Sons Ltd/University College London.
Development and validation of a low-density SNP panel related to prolificacy in sheep

USDA-ARS?s Scientific Manuscript database

High-density SNP panels (e.g., 50,000 and 600,000 markers) have been used in exploratory population genetic studies with commercial and minor breeds of sheep. However, routine genetic diversity evaluations of large numbers of samples with large panels are in general cost-prohibitive for gene banks. ...
Chromosome 17q12 variants contribute to risk of early-onset prostate cancer

PubMed Central

Levin, Albert M.; Machiela, Mitchell J.; Zuhlke, Kimberly A.; Ray, Anna M.; Cooney, Kathleen A.; Douglas, Julie A.

2008-01-01

In a recent genome-wide association study by Gudmundsson et al. (2007), two prostate cancer susceptibility loci were identified on chromosome 17q. The first locus, at 17q12, was distinguished by two intronic single nucleotide polymorphisms (SNPs) in the TCF2 gene (rs4430796 and rs7501939). The second locus was in a gene-poor region of 17q24, where the strongest evidence of association was for SNP rs1859962. To determine if these loci were also associated with hereditary prostate cancer, we genotyped them in a family-based association sample of 403 non-Hispanic white families, including 1,015 men with and without prostate cancer. SNPs rs4430796 and rs7501939, which were in strong linkage disequilibrium (r2=0.68), showed the strongest evidence of prostate cancer association. Using a family-based association test, the “A” allele of SNP rs4430796 was over-transmitted to affected men (p=0.006), with an odds ratio of 1.40 (95%CI=1.09–1.81) under an additive genetic model. Notably, rs4430796 was significantly associated with prostate cancer among men diagnosed at an early (<50 years) but not later age (p=0.006 versus p=0.118). Our results confirm the prostate cancer association with SNPs on chromosome 17q12 initially reported by Gudmundsson et al. In addition, our results suggest that the increased risk associated with these SNPs is approximately doubled in individuals predisposed to develop early onset disease. Importantly, these SNPs do not account for a significant portion of our prior prostate cancer linkage evidence on chromosome 17. Thus, there likely exist one or more additional independent prostate cancer susceptibility loci in this region. PMID:18701471
Exploring and Harnessing Haplotype Diversity to Improve Yield Stability in Crops.

PubMed

Qian, Lunwen; Hickey, Lee T; Stahl, Andreas; Werner, Christian R; Hayes, Ben; Snowdon, Rod J; Voss-Fels, Kai P

2017-01-01

In order to meet future food, feed, fiber, and bioenergy demands, global yields of all major crops need to be increased significantly. At the same time, the increasing frequency of extreme weather events such as heat and drought necessitates improvements in the environmental resilience of modern crop cultivars. Achieving sustainably increase yields implies rapid improvement of quantitative traits with a very complex genetic architecture and strong environmental interaction. Latest advances in genome analysis technologies today provide molecular information at an ultrahigh resolution, revolutionizing crop genomic research, and paving the way for advanced quantitative genetic approaches. These include highly detailed assessment of population structure and genotypic diversity, facilitating the identification of selective sweeps and signatures of directional selection, dissection of genetic variants that underlie important agronomic traits, and genomic selection (GS) strategies that not only consider major-effect genes. Single-nucleotide polymorphism (SNP) markers today represent the genotyping system of choice for crop genetic studies because they occur abundantly in plant genomes and are easy to detect. SNPs are typically biallelic, however, hence their information content compared to multiallelic markers is low, limiting the resolution at which SNP-trait relationships can be delineated. An efficient way to overcome this limitation is to construct haplotypes based on linkage disequilibrium, one of the most important features influencing genetic analyses of crop genomes. Here, we give an overview of the latest advances in genomics-based haplotype analyses in crops, highlighting their importance in the context of polyploidy and genome evolution, linkage drag, and co-selection. We provide examples of how haplotype analyses can complement well-established quantitative genetics frameworks, such as quantitative trait analysis and GS, ultimately providing an effective tool to equip modern crops with environment-tailored characteristics.
High-throughput single nucleotide polymorphism genotyping for breeding applications in rice using the BeadXpress platform

USDA-ARS?s Scientific Manuscript database

Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...
Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction.

PubMed

Brøndum, R F; Su, G; Janss, L; Sahana, G; Guldbrandtsen, B; Boichard, D; Lund, M S

2015-06-01

This study investigated the effect on the reliability of genomic prediction when a small number of significant variants from single marker analysis based on whole genome sequence data were added to the regular 54k single nucleotide polymorphism (SNP) array data. The extra markers were selected with the aim of augmenting the custom low-density Illumina BovineLD SNP chip (San Diego, CA) used in the Nordic countries. The single-marker analysis was done breed-wise on all 16 index traits included in the breeding goals for Nordic Holstein, Danish Jersey, and Nordic Red cattle plus the total merit index itself. Depending on the trait's economic weight, 15, 10, or 5 quantitative trait loci (QTL) were selected per trait per breed and 3 to 5 markers were selected to tag each QTL. After removing duplicate markers (same marker selected for more than one trait or breed) and filtering for high pairwise linkage disequilibrium and assaying performance on the array, a total of 1,623 QTL markers were selected for inclusion on the custom chip. Genomic prediction analyses were performed for Nordic and French Holstein and Nordic Red animals using either a genomic BLUP or a Bayesian variable selection model. When using the genomic BLUP model including the QTL markers in the analysis, reliability was increased by up to 4 percentage points for production traits in Nordic Holstein animals, up to 3 percentage points for Nordic Reds, and up to 5 percentage points for French Holstein. Smaller gains of up to 1 percentage point was observed for mastitis, but only a 0.5 percentage point increase was seen for fertility. When using a Bayesian model accuracies were generally higher with only 54k data compared with the genomic BLUP approach, but increases in reliability were relatively smaller when QTL markers were included. Results from this study indicate that the reliability of genomic prediction can be increased by including markers significant in genome-wide association studies on whole genome sequence data alongside the 54k SNP set. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
UGbS-Flex, a novel bioinformatics pipeline for imputation-free SNP discovery in polyploids without a reference genome: finger millet as a case study.

PubMed

Qi, Peng; Gimode, Davis; Saha, Dipnarayan; Schröder, Stephan; Chakraborty, Debkanta; Wang, Xuewen; Dida, Mathews M; Malmberg, Russell L; Devos, Katrien M

2018-06-15

Research on orphan crops is often hindered by a lack of genomic resources. With the advent of affordable sequencing technologies, genotyping an entire genome or, for large-genome species, a representative fraction of the genome has become feasible for any crop. Nevertheless, most genotyping-by-sequencing (GBS) methods are geared towards obtaining large numbers of markers at low sequence depth, which excludes their application in heterozygous individuals. Furthermore, bioinformatics pipelines often lack the flexibility to deal with paired-end reads or to be applied in polyploid species. UGbS-Flex combines publicly available software with in-house python and perl scripts to efficiently call SNPs from genotyping-by-sequencing reads irrespective of the species' ploidy level, breeding system and availability of a reference genome. Noteworthy features of the UGbS-Flex pipeline are an ability to use paired-end reads as input, an effective approach to cluster reads across samples with enhanced outputs, and maximization of SNP calling. We demonstrate use of the pipeline for the identification of several thousand high-confidence SNPs with high representation across samples in an F 3 -derived F 2 population in the allotetraploid finger millet. Robust high-density genetic maps were constructed using the time-tested mapping program MAPMAKER which we upgraded to run efficiently and in a semi-automated manner in a Windows Command Prompt Environment. We exploited comparative GBS with one of the diploid ancestors of finger millet to assign linkage groups to subgenomes and demonstrate the presence of chromosomal rearrangements. The paper combines GBS protocol modifications, a novel flexible GBS analysis pipeline, UGbS-Flex, recommendations to maximize SNP identification, updated genetic mapping software, and the first high-density maps of finger millet. The modules used in the UGbS-Flex pipeline and for genetic mapping were applied to finger millet, an allotetraploid selfing species without a reference genome, as a case study. The UGbS-Flex modules, which can be run independently, are easily transferable to species with other breeding systems or ploidy levels.
Impact of QTL minor allele frequency on genomic evaluation using real genotype data and simulated phenotypes in Japanese Black cattle.

PubMed

Uemoto, Yoshinobu; Sasaki, Shinji; Kojima, Takatoshi; Sugimoto, Yoshikazu; Watanabe, Toshio

2015-11-19

Genetic variance that is not captured by single nucleotide polymorphisms (SNPs) is due to imperfect linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTLs), and the extent of LD between SNPs and QTLs depends on different minor allele frequencies (MAF) between them. To evaluate the impact of MAF of QTLs on genomic evaluation, we performed a simulation study using real cattle genotype data. In total, 1368 Japanese Black cattle and 592,034 SNPs (Illumina BovineHD BeadChip) were used. We simulated phenotypes using real genotypes under different scenarios, varying the MAF categories, QTL heritability, number of QTLs, and distribution of QTL effect. After generating true breeding values and phenotypes, QTL heritability was estimated and the prediction accuracy of genomic estimated breeding value (GEBV) was assessed under different SNP densities, prediction models, and population size by a reference-test validation design. The extent of LD between SNPs and QTLs in this population was higher in the QTLs with high MAF than in those with low MAF. The effect of MAF of QTLs depended on the genetic architecture, evaluation strategy, and population size in genomic evaluation. In genetic architecture, genomic evaluation was affected by the MAF of QTLs combined with the QTL heritability and the distribution of QTL effect. The number of QTL was not affected on genomic evaluation if the number of QTL was more than 50. In the evaluation strategy, we showed that different SNP densities and prediction models affect the heritability estimation and genomic prediction and that this depends on the MAF of QTLs. In addition, accurate QTL heritability and GEBV were obtained using denser SNP information and the prediction model accounted for the SNPs with low and high MAFs. In population size, a large sample size is needed to increase the accuracy of GEBV. The MAF of QTL had an impact on heritability estimation and prediction accuracy. Most genetic variance can be captured using denser SNPs and the prediction model accounted for MAF, but a large sample size is needed to increase the accuracy of GEBV under all QTL MAF categories.
Mapping Genes that Contribute to Daunorubicin-Induced Cytotoxicity

PubMed Central

Duan, Shiwei; Bleibel, Wasim K.; Huang, Rong Stephanie; Shukla, Sunita J.; Wu, Xiaolin; Badner, Judith A.; Dolan, M. Eileen

2009-01-01

Daunorubicin is an anthracycline antibiotic agent used in the treatment of hematopoietic malignancies. Toxicities associated with this agent include myelosuppression and cardiotoxicity; however, the genes or genetic determinants that contribute to these toxicities are unknown. We present an unbiased genome-wide approach that incorporates heritability, whole-genome linkage analysis, and linkage-directed association to uncover genetic variants contributing to the sensitivity to daunorubicin-induced cytotoxicity. Cell growth inhibition in 324 Centre d’ Etude du Polymorphisme Humain lymphoblastoid cell lines (24 pedigrees) was evaluated following treatment with daunorubicin for 72 h. Heritability analysis showed a significant genetic component contributing to the cytotoxic phenotypes (h2 = 0.18–0.63at 0.0125, 0.025, 0.05, 0.1, 0.2, and 1.0 µmol/L daunorubicin and at the IC50, the dose required to inhibit 50% cell growth). Whole-genome linkage scans at all drug concentrations and IC50 uncovered 11 regions with moderate peak LOD scores (>1.5), including 4q28.2 to 4q32.3 with a maximum LOD score of 3.18. The quantitative transmission disequilibrium tests were done using 31,312 high-frequency single-nucleotide polymorphisms (SNP) located in the 1 LOD confidence interval of these 11 regions. Thirty genes were identified as significantly associated with daunorubicin-induced cytotoxicity (P ≤ 2.0 × 10−4, false discovery rate ≤ 0.1). Pathway and functional gene ontology analysis showed that these genes were overrepresented in the phosphatidylinositol signaling system, axon guidance pathway, and GPI-anchored proteins family. Our findings suggest that a proportion of susceptibility to daunorubicin-induced cytotoxicity may be controlled by genetic determinants and that analysis using linkage-directed association studies with dense SNP markers can be used to identify the genetic variants contributing to cytotoxicity. PMID:17545624
SNP Data Quality Control in a National Beef and Dairy Cattle System and Highly Accurate SNP Based Parentage Verification and Identification

PubMed Central

McClure, Matthew C.; McCarthy, John; Flynn, Paul; McClure, Jennifer C.; Dair, Emma; O'Connell, D. K.; Kearney, John F.

2018-01-01

A major use of genetic data is parentage verification and identification as inaccurate pedigrees negatively affect genetic gain. Since 2012 the international standard for single nucleotide polymorphism (SNP) verification in Bos taurus cattle has been the ISAG SNP panels. While these ISAG panels provide an increased level of parentage accuracy over microsatellite markers (MS), they can validate the wrong parent at ≤1% misconcordance rate levels, indicating that more SNP are needed if a more accurate pedigree is required. With rapidly increasing numbers of cattle being genotyped in Ireland that represent 61 B. taurus breeds from a wide range of farm types: beef/dairy, AI/pedigree/commercial, purebred/crossbred, and large to small herd size the Irish Cattle Breeding Federation (ICBF) analyzed different SNP densities to determine that at a minimum ≥500 SNP are needed to consistently predict only one set of parents at a ≤1% misconcordance rate. For parentage validation and prediction ICBF uses 800 SNP (ICBF800) selected based on SNP clustering quality, ISAG200 inclusion, call rate (CR), and minor allele frequency (MAF) in the Irish cattle population. Large datasets require sample and SNP quality control (QC). Most publications only deal with SNP QC via CR, MAF, parent-progeny conflicts, and Hardy-Weinberg deviation, but not sample QC. We report here parentage, SNP QC, and a genomic sample QC pipelines to deal with the unique challenges of >1 million genotypes from a national herd such as SNP genotype errors from mis-tagging of animals, lab errors, farm errors, and multiple other issues that can arise. We divide the pipeline into two parts: a Genotype QC and an Animal QC pipeline. The Genotype QC identifies samples with low call rate, missing or mixed genotype classes (no BB genotype or ABTG alleles present), and low genotype frequencies. The Animal QC handles situations where the genotype might not belong to the listed individual by identifying: >1 non-matching genotypes per animal, SNP duplicates, sex and breed prediction mismatches, parentage and progeny validation results, and other situations. The Animal QC pipeline make use of ICBF800 SNP set where appropriate to identify errors in a computationally efficient yet still highly accurate method. PMID:29599798
Genome-Wide Association Mapping of Correlated Traits in Cassava: Dry Matter and Total Carotenoid Content.

PubMed

Rabbi, Ismail Y; Udoh, Lovina I; Wolfe, Marnin; Parkes, Elizabeth Y; Gedil, Melaku A; Dixon, Alfred; Ramu, Punna; Jannink, Jean-Luc; Kulakow, Peter

2017-11-01

Cassava is a starchy root crop cultivated in the tropics for fresh consumption and commercial processing. Primary selection objectives in cassava breeding include dry matter content and micronutrient density, particularly provitamin A carotenoids. These traits are negatively correlated in the African germplasm. This study aimed at identifying genetic markers associated with these traits and uncovering whether linkage and/or pleiotropy were responsible for observed negative correlation. A genome-wide association mapping using 672 clones genotyped at 72,279 single nucleotide polymorphism (SNP) loci was performed. Root yellowness was used indirectly to assess variation in carotenoid content. Two major loci for root yellowness were identified on chromosome 1 at positions 24.1 and 30.5 Mbp. A single locus for dry matter content that colocated with the 24.1 Mbp peak for carotenoids was identified. Haplotypes at these loci explained 70 and 37% of the phenotypic variability for root yellowness and dry matter content, respectively. Evidence of megabase-scale linkage disequilibrium (LD) around the major loci of the two traits and detection of the major dry matter locus in independent analysis for the white- and yellow-root subpopulations suggests that physical linkage rather that pleiotropy is more likely to be the cause of the negative correlation between the target traits. Moreover, candidate genes for carotenoid () and starch biosynthesis ( and ) occurred in the vicinity of the identified locus at 24.1 Mbp. These findings elucidate the genetic architecture of carotenoids and dry matter in cassava and provide an opportunity to accelerate breeding of these traits. Copyright © 2017 Crop Science Society of America.
Assessing the expected response to genomic selection of individuals and families in Eucalyptus breeding with an additive-dominant model.

PubMed

Resende, R T; Resende, M D V; Silva, F F; Azevedo, C F; Takahashi, E K; Silva-Junior, O B; Grattapaglia, D

2017-10-01

We report a genomic selection (GS) study of growth and wood quality traits in an outbred F 2 hybrid Eucalyptus population (n=768) using high-density single-nucleotide polymorphism (SNP) genotyping. Going beyond previous reports in forest trees, models were developed for different selection targets, namely, families, individuals within families and individuals across the entire population using a genomic model including dominance. To provide a more breeder-intelligible assessment of the performance of GS we calculated the expected response as the percentage gain over the population average expected genetic value (EGV) for different proportions of genomically selected individuals, using a rigorous cross-validation (CV) scheme that removed relatedness between training and validation sets. Predictive abilities (PAs) were 0.40-0.57 for individual selection and 0.56-0.75 for family selection. PAs under an additive+dominance model improved predictions by 5 to 14% for growth depending on the selection target, but no improvement was seen for wood traits. The good performance of GS with no relatedness in CV suggested that our average SNP density (~25 kb) captured some short-range linkage disequilibrium. Truncation GS successfully selected individuals with an average EGV significantly higher than the population average. Response to GS on a per year basis was ~100% more efficient than by phenotypic selection and more so with higher selection intensities. These results contribute further experimental data supporting the positive prospects of GS in forest trees. Because generation times are long, traits are complex and costs of DNA genotyping are plummeting, genomic prediction has good perspectives of adoption in tree breeding practice.
A cross-ethnic survey of CFB and SLC44A4, Indian ulcerative colitis GWAS hits, underscores their potential role in disease susceptibility

PubMed Central

Gupta, Aditi; Juyal, Garima; Sood, Ajit; Midha, Vandana; Yamazaki, Keiko; Vich Vila, Arnau; Esaki, Motohiro; Matsui, Toshiyuki; Takahashi, Atsushi; Kubo, Michiaki; Weersma, Rinse K; Thelma, B K

2017-01-01

The first ever genome-wide association study (GWAS) of ulcerative colitis in genetically distinct north Indian population identified two novel genes namely CFB and SLC44A4. Considering their biological relevance, we investigated allelic/genetic heterogeneity in these genes among ulcerative colitis cohorts of north Indian, Japanese and Dutch origin using high-density ImmunoChip case–control genotype data. Comparative linkage disequilibrium profiling and test of association were performed. Of the 28 CFB SNPs, similar strength of association was observed for rs4151657 (novel ulcerative colitis GWAS SNP) in north Indians (P=1.73 × 10−10) and Japanese (P=2.02 × 10−12) but not in the Dutch. Further, a three-marker haplotype was shared between north Indians and Japanese (P<10−8), but a different five-marker haplotype was associated (P=2.07 × 10−6) in the Dutch. Of the 22 SLC44A4 SNPs, rs2736428 (novel ulcerative colitis GWAS SNP) was found significantly associated in north Indians (P=4.94 × 10−10) and Japanese (P=3.37 × 10−9), but not among the Dutch. These results suggest (i) apparent allelic heterogeneity in CFB and genetic heterogeneity in SLC44A4 across different ethnic groups; (ii) shared ulcerative colitis genetic etiological factors among Asians; and finally (iii) re-exploration of GWAS findings together with high-density genotyping/sequencing and trans-ethnic fine mapping approaches may help identify shared and population-specific risk variants and enable to explain missing disease heritability. PMID:27759029
Salmonid Chromosome Evolution as Revealed by a Novel Method for Comparing RADseq Linkage Maps

PubMed Central

Gosselin, Thierry; Normandeau, Eric; Lamothe, Manuel; Isabel, Nathalie; Audet, Céline; Bernatchez, Louis

2016-01-01

Whole genome duplication (WGD) can provide material for evolutionary innovation. Family Salmonidae is ideal for studying the effects of WGD as the ancestral salmonid underwent WGD relatively recently, ∼65 Ma, then rediploidized and diversified. Extensive synteny between homologous chromosome arms occurs in extant salmonids, but each species has both conserved and unique chromosome arm fusions and fissions. Assembly of large, outbred eukaryotic genomes can be difficult, but structural rearrangements within such taxa can be investigated using linkage maps. RAD sequencing provides unprecedented ability to generate high-density linkage maps for nonmodel species, but can result in low numbers of homologous markers between species due to phylogenetic distance or differences in library preparation. Here, we generate a high-density linkage map (3,826 markers) for the Salvelinus genera (Brook Charr S. fontinalis), and then identify corresponding chromosome arms among the other available salmonid high-density linkage maps, including six species of Oncorhynchus, and one species for each of Salmo, Coregonus, and the nonduplicated sister group for the salmonids, Northern Pike Esox lucius for identifying post-duplicated homeologs. To facilitate this process, we developed MapComp to identify identical and proximate (i.e. nearby) markers between linkage maps using a reference genome of a related species as an intermediate, increasing the number of comparable markers between linkage maps by 5-fold. This enabled a characterization of the most likely history of retained chromosomal rearrangements post-WGD, and several conserved chromosomal inversions. Analyses of RADseq-based linkage maps from other taxa will also benefit from MapComp, available at: https://github.com/enormandeau/mapcomp/ PMID:28173098

Extensive population structure in San, Khoe, and mixed ancestry populations from southern Africa revealed by 44 short 5-SNP haplotypes.

PubMed

Schlebusch, Carina M; Soodyall, Himlya

2012-12-01

The San and Khoe people currently represent remnant groups of a much larger and widely distributed population of hunter-gatherers and pastoralists who had exclusive occupation of southern Africa before the arrival of Bantu-speaking groups in the past 1,200 years and sea-borne immigrants within the last 350 years. Genetic studies [mitochondrial deoxyribonucleic acid (DNA) and Y-chromosome] conducted on San and Khoe groups revealed that they harbor some of the most divergent lineages found in living peoples throughout the world. Recently, high-density, autosomal, single-nucleotide polymorphism (SNP)-array studies confirmed the early divergence of Khoe-San population groups from all other human populations. The present study made use of 220 autosomal SNP markers (in the format of both haplotypes and genotypes) to examine the population structure of various San and Khoe groups and their relationship to other neighboring groups. Whereas analyses based on the genotypic SNP data only supported the division of the included populations into three main groups-Khoe-San, Bantu-speakers, and non-African populations-haplotype analyses revealed finer structure within Khoe-San populations. By the use of only 44 short SNP haplotypes (compiled from a total of 220 SNPs), most of the Khoe-San groups could be resolved as separate groups by applying STRUCTURE analyses. Therefore, by carefully selecting a few SNPs and combining them into haplotypes, we were able to achieve the same level of population distinction that was achieved previously in high-density SNP studies on the same population groups. Using haplotypes proved to be a very efficient and cost-effective way to study population structure. Copyright © 2013 Wayne State University Press, Detroit, Michigan 48201-1309.
Acute chest syndrome is associated with single nucleotide polymorphism-defined beta globin cluster haplotype in children with sickle cell anaemia

PubMed Central

Bean, Christopher J.; Boulet, Sheree L.; Yang, Genyan; Payne, Amanda B.; Ghaji, Nafisa; Pyle, Meredith E.; Hooper, W. Craig; Bhatnagar, Pallav; Keefer, Jeffrey; Barron-Casella, Emily A.; Casella, James F.; DeBaun, Michael R.

2013-01-01

Summary Genetic diversity at the human β-globin locus has been implicated as a modifier of sickle cell anaemia (SCA) severity. However, haplotypes defined by restriction fragment length polymorphism sites across the β-globin locus have not been consistently associated with clinical phenotypes. To define the genetic structure at the β-globin locus more thoroughly, we performed high-density single nucleotide polymorphism (SNP) mapping in 820 children who were homozygous for the sickle cell mutation (HbSS). Genotyping results revealed very high linkage disequilibrium across a large region spanning the locus control region and the HBB (β-globin gene) cluster. We identified three predominant haplotypes accounting for 96% of the βS-carrying chromosomes in this population that could be distinguished using a minimal set of common SNPs. Consistent with previous studies, fetal haemoglobin level was significantly associated with βS-haplotypes. After controlling for covariates, an association was detected between haplotype and rate of hospitalization for acute chest syndrome (ACS) (incidence rate ratio 0.51, 95% confidence interval 0.29–0.89) but not incidence rate of vaso-occlusive pain or presence of silent cerebral infarct (SCI). Our results suggest that these SNP-defined βS-haplotypes may be associated with ACS, but not pain or SCI in a study population of children with SCA. PMID:23952145
QTL Mapping for Resistance to Iridovirus in Asian Seabass Using Genotyping-by-Sequencing.

PubMed

Wang, Le; Bai, Bin; Huang, Shuqing; Liu, Peng; Wan, Zi Yi; Ye, Baoqing; Wu, Jinlu; Yue, Gen Hua

2017-10-01

Identifying quantitative trait loci (QTL) for viral disease resistance is of particular importance in selective breeding programs of fish species. Genetic markers linked to QTL can be useful in marker-assisted selection (MAS) for elites resistant to specific pathogens. Here, we conducted a genome scan for QTL associated with Singapore grouper iridovirus (SGIV) resistance in an Asian seabass (Lates calcarifer) family, using a high-density linkage map generated with genotyping-by-sequencing. One genome-wide significant and three suggestive QTL were detected at LG21, LG6, LG13, and LG15, respectively. The phenotypic variation explained (PVE) by the four QTL ranged from 7.5 to 15.6%. The position of the most significant QTL at LG21 was located between 31.88 and 36.81 cM. The SNP marker (SNP130416) nearest to the peak of this QTL was significantly associated with SGIV resistance in an unrelated multifamily population. One candidate gene, MECOM, close to the peak of this QTL region, was predicted. Evidence of alternative splicing was observed for MECOM and one specific category of splicing variants was differentially expressed at 5 days post-SGIV infection. The QTL detected in this study are valuable resources and can be used in the selective breeding programs of Asian seabass with regard to resistance to SGIV.
Role of six single nucleotide polymorphisms, risk factors in coronary disease, in OLR1 alternative splicing

PubMed Central

Tejedor, J. Ramón; Tilgner, Hagen; Iannone, Camilla; Guigó, Roderic; Valcárcel, Juan

2015-01-01

The OLR1 gene encodes the oxidized low-density lipoprotein receptor (LOX-1), which is responsible for the cellular uptake of oxidized LDL (Ox-LDL), foam cell formation in atheroma plaques and atherosclerotic plaque rupture. Alternative splicing (AS) of OLR1 exon 5 generates two protein isoforms with antagonistic functions in Ox-LDL uptake. Previous work identified six single nucleotide polymorphisms (SNPs) in linkage disequilibrium that influence the inclusion levels of OLR1 exon 5 and correlate with the risk of cardiovascular disease. Here we use minigenes to recapitulate the effects of two allelic series (Low- and High-Risk) on OLR1 AS and identify one SNP in intron 4 (rs3736234) as the main contributor to the differences in exon 5 inclusion, while the other SNPs in the allelic series attenuate the drastic effects of this key SNP. Bioinformatic, proteomic, mutational and functional high-throughput analyses allowed us to define regulatory sequence motifs and identify SR protein family members (SRSF1, SRSF2) and HMGA1 as factors involved in the regulation of OLR1 AS. Our results suggest that antagonism between SRSF1 and SRSF2/HMGA1, and differential recognition of their regulatory motifs depending on the identity of the rs3736234 polymorphism, influence OLR1 exon 5 inclusion and the efficiency of Ox-LDL uptake, with potential implications for atherosclerosis and coronary disease. PMID:25904137
Selection for genetic markers in beef cattle reveals complex associations of thyroglobulin and casein1-s1 with carcass and meat traits.

PubMed

Bennett, G L; Shackelford, S D; Wheeler, T L; King, D A; Casas, E; Smith, T P L

2013-02-01

Genetic markers in casein (CSN1S1) and thyroglobulin (TG) genes have previously been associated with fat distribution in cattle. Determining the nature of these genetic associations (additive, recessive, or dominant) has been difficult, because both markers have small minor allele frequencies in most beef cattle populations. This results in few animals homozygous for the minor alleles. selection to increase the frequencies of the minor alleles for 2 SNP markers in these genes was undertaken in a composite population. The objective was to obtain better estimates of genetic effects associated with these markers and determine if there were epistatic interactions. Selection increased the frequencies of minor alleles for both SNP from <0.30 to 0.45. Bulls (n = 24) heterozygous for both SNP were used in 3 yr to produce 204 steer progeny harvested at an average age of 474 d. The combined effect of the 9 CSN1S1 × TG genotypes was associated with carcass-adjusted fat thickness (P < 0.06) and meat tenderness predicted at the abattoir by visible and near-infrared reflectance spectroscopy (P < 0.04). Genotype did not affect BW from birth through harvest, ribeye area, marbling score, slice shear force, or image-based yield grade (P > 0.10). Additive, dominance, and epistatic SNP association effects were estimated from genotypic effects for adjusted fat thickness and predicted meat tenderness. Adjusted fat thickness showed a dominance association with TG SNP (P < 0.06) and an epistatic additive CSN1S1 × additive TG association (P < 0.03). For predicted meat tenderness, heterozygous TG meat was more tender than meat from either homozygote (P < 0.002). Dominance and epistatic associations can result in different SNP allele substitution effects in populations where SNP have the same linkage disequilibrium with causal mutations but have different frequencies. Although the complex associations estimated in this study would contribute little to within-population selection response, they could be important for marker-assisted management or reciprocal selection schemes.
Variants in the ATP-binding cassette transporter (ABCA7), apolipoprotein E ϵ4,and the risk of late-onset Alzheimer disease in African Americans.

PubMed

Reitz, Christiane; Jun, Gyungah; Naj, Adam; Rajbhandary, Ruchita; Vardarajan, Badri Narayan; Wang, Li-San; Valladares, Otto; Lin, Chiao-Feng; Larson, Eric B; Graff-Radford, Neill R; Evans, Denis; De Jager, Philip L; Crane, Paul K; Buxbaum, Joseph D; Murrell, Jill R; Raj, Towfique; Ertekin-Taner, Nilufer; Logue, Mark; Baldwin, Clinton T; Green, Robert C; Barnes, Lisa L; Cantwell, Laura B; Fallin, M Daniele; Go, Rodney C P; Griffith, Patrick; Obisesan, Thomas O; Manly, Jennifer J; Lunetta, Kathryn L; Kamboh, M Ilyas; Lopez, Oscar L; Bennett, David A; Hendrie, Hugh; Hall, Kathleen S; Goate, Alison M; Byrd, Goldie S; Kukull, Walter A; Foroud, Tatiana M; Haines, Jonathan L; Farrer, Lindsay A; Pericak-Vance, Margaret A; Schellenberg, Gerard D; Mayeux, Richard

2013-04-10

Genetic variants associated with susceptibility to late-onset Alzheimer disease are known for individuals of European ancestry, but whether the same or different variants account for the genetic risk of Alzheimer disease in African American individuals is unknown. Identification of disease-associated variants helps identify targets for genetic testing, prevention, and treatment. To identify genetic loci associated with late-onset Alzheimer disease in African Americans. The Alzheimer Disease Genetics Consortium (ADGC) assembled multiple data sets representing a total of 5896 African Americans (1968 case participants, 3928 control participants) 60 years or older that were collected between 1989 and 2011 at multiple sites. The association of Alzheimer disease with genotyped and imputed single-nucleotide polymorphisms (SNPs) was assessed in case-control and in family-based data sets. Results from individual data sets were combined to perform an inverse variance-weighted meta-analysis, first with genome-wide analyses and subsequently with gene-based tests for previously reported loci. Presence of Alzheimer disease according to standardized criteria. Genome-wide significance in fully adjusted models (sex, age, APOE genotype, population stratification) was observed for a SNP in ABCA7 (rs115550680, allele = G; frequency, 0.09 cases and 0.06 controls; odds ratio [OR], 1.79 [95% CI, 1.47-2.12]; P = 2.2 × 10(-9)), which is in linkage disequilibrium with SNPs previously associated with Alzheimer disease in Europeans (0.8 < D' < 0.9). The effect size for the SNP in ABCA7 was comparable with that of the APOE ϵ4-determining SNP rs429358 (allele = C; frequency, 0.30 cases and 0.18 controls; OR, 2.31 [95% CI, 2.19-2.42]; P = 5.5 × 10(-47)). Several loci previously associated with Alzheimer disease but not reaching significance in genome-wide analyses were replicated in gene-based analyses accounting for linkage disequilibrium between markers and correcting for number of tests performed per gene (CR1, BIN1, EPHA1, CD33; 0.0005 < empirical P < .001). In this meta-analysis of data from African American participants, Alzheimer disease was significantly associated with variants in ABCA7 and with other genes that have been associated with Alzheimer disease in individuals of European ancestry. Replication and functional validation of this finding is needed before this information is used in clinical settings.
High-resolution melt analysis to identify and map sequence-tagged site anchor points onto linkage maps: a white lupin (Lupinus albus) map as an exemplar.

PubMed

Croxford, Adam E; Rogers, Tom; Caligari, Peter D S; Wilkinson, Michael J

2008-01-01

* The provision of sequence-tagged site (STS) anchor points allows meaningful comparisons between mapping studies but can be a time-consuming process for nonmodel species or orphan crops. * Here, the first use of high-resolution melt analysis (HRM) to generate STS markers for use in linkage mapping is described. This strategy is rapid and low-cost, and circumvents the need for labelled primers or amplicon fractionation. * Using white lupin (Lupinus albus, x = 25) as a case study, HRM analysis was applied to identify 91 polymorphic markers from expressed sequence tag (EST)-derived and genomic libraries. Of these, 77 generated STS anchor points in the first fully resolved linkage map of the species. The map also included 230 amplified fragment length polymorphisms (AFLP) loci, spanned 1916 cM (84.2% coverage) and divided into the expected 25 linkage groups. * Quantitative trait loci (QTL) analyses performed on the population revealed genomic regions associated with several traits, including the agronomically important time to flowering (tf), alkaloid synthesis and stem height (Ph). Use of HRM-STS markers also allowed us to make direct comparisons between our map and that of the related crop, Lupinus angustifolius, based on the conversion of RFLP, microsatellite and single nucleotide polymorphism (SNP) markers into HRM markers.
Use of single nucleotide polymorphisms (SNP) to fine-map quantitative trait loci (QTL) in swine

USDA-ARS?s Scientific Manuscript database

Mapping quantitative trait loci (QTL) in swine at the US Meat Animal Research Center has relied heavily on linkage mapping in either F2 or Backcross families. QTL identified in the initial scans typically have very broad confidence intervals and further refinement of the QTL’s position is needed bef...
Genome-wide SNP identification, linkage map construction and QTL mapping for mineral nutrient concentrations and contents in pea (Pisum sativum L.)

USDA-ARS?s Scientific Manuscript database

Marker-assisted breeding is now routinely used in major crops to facilitate more efficient cultivar improvement. This has been significantly enabled by the use of next-generation sequencing technology to identify loci and markers associated with traits of interest. While rich in a variety of nutriti...
Detection of genetic variation affecting milk coagulation properties in Danish Holstein dairy cattle by analyses of pooled whole-genome sequences from phenotypically extreme samples (pool-seq).

PubMed

Bertelsen, H P; Gregersen, V R; Poulsen, N; Nielsen, R O; Das, A; Madsen, L B; Buitenhuis, A J; Holm, L-E; Panitz, F; Larsen, L B; Bendixen, C

2016-04-01

Rennet-induced milk coagulation is an important trait for cheese production. Recent studies have reported an alarming frequency of cows producing poorly coagulating milk unsuitable for cheese production. Several genetic factors are known to affect milk coagulation, including variation in the major milk proteins; however, recent association studies indicate genetic effects from other genomic regions as well. The aim of this study was to detect genetic variation affecting milk coagulation properties, measured as curd-firming rate (CFR) and milk pH. This was achieved by examining allele frequency differences between pooled whole-genome sequences of phenotypically extreme samples (pool-seq).. Curd-firming rate and raw milk pH were measured for 415 Danish Holstein cows, and each animal was sequenced at low coverage. Pools were created containing whole genome sequence reads from samples with "extreme" values (high or low) for both phenotypic traits. A total of 6,992,186 and 5,295,501 SNP were assessed in relation to CFR and milk pH, respectively. Allele frequency differences were calculated between pools and 32 significantly different SNP were detected, 1 for milk pH and 31 for CFR, of which 19 are located on chromosome 6. A total of 9 significant SNP, which were selected based on the possible function of proximal candidate genes, were genotyped in the entire sample set ( = 415) to test for an association. The most significant SNP was located proximal to , explaining 33% of the phenotypic variance. , coding for κ-casein, is the most studied in relation to milk coagulation due to its position on the surface of the casein micelles and the direct involvement in milk coagulation. Three additional SNP located on chromosome 6 showed significant associations explaining 7, 3.6, and 1.3% of the phenotypic variance of CFR. The significant SNP on chromosome 6 were shown to be in linkage disequilibrium with the SNP peaking proximal to ; however, after accounting for the genotype of the peak SNP within this QTL, significant effects (-value < 0.1) could still be detected for 2 of the SNP accounting for 2 and 1% of the phenotypic variance. These 2 interesting SNP were located within introns or proximal to the candidate genes-solute carrier family 4 (sodium bicarbonate cotransporter), member 4 () and LIM and calponin homology domains 1 (), respectively-making them interesting targets for further analysis.
BAT2 and BAT3 polymorphisms as novel genetic risk factors for rejection after HLA-related SCT.

PubMed

Piras, Ignazio Stefano; Angius, Andrea; Andreani, Marco; Testi, Manuela; Lucarelli, Guido; Floris, Matteo; Marktel, Sarah; Ciceri, Fabio; La Nasa, Giorgio; Fleischhauer, Katharina; Roncarolo, Maria Grazia; Bulfone, Alessandro; Gregori, Silvia; Bacchetta, Rosa

2014-11-01

The genetic background of donor and recipient is an important factor determining the outcome of allogeneic hematopoietic SCT (allo-HSCT). We applied whole-genome analysis to investigate genetic variants-other than HLA class I and II-associated with negative outcome after HLA-identical sibling allo-HSCT in a cohort of 110 β-Thalassemic patients. We identified two single-nucleotide polymorphisms (SNPs) in BAT2 (A/G) and BAT3 (T/C) genes, SNP rs11538264 and SNP rs10484558, both located in the HLA class III region, in strong linkage disequilibrium between each other (R(2)=0.92). When considered as single SNP, none of them reached a significant association with graft rejection (nominal P<0.00001 for BAT2 SNP rs11538264, and P<0.0001 for BAT3 SNP rs10484558), whereas the BAT2/BAT3 A/C haplotype was present at significantly higher frequency in patients who rejected as compared to those with functional graft (30.0% vs 2.6%, nominal P=1.15 × 10(-8); and adjusted P=0.0071). The BAT2/BAT3 polymorphisms and specifically the A/C haplotype may represent a novel immunogenetic factor associated with graft rejection in patients undergoing allo-HSCT.
BAT2 and BAT3 polymorphisms as novel genetic risk factors for rejection after HLA-related stem cell transplantation

PubMed Central

Piras, Ignazio Stefano; Angius, Andrea; Andreani, Marco; Testi, Manuela; Lucarelli, Guido; Floris, Matteo; Marktel, Sarah; Ciceri, Fabio; La Nasa, Giorgio; Fleischhauer, Katharina; Roncarolo, Maria Grazia; Bulfone, Alessandro

2014-01-01

The genetic background of donor and recipient is an important factor determining the outcome of allogeneic hematopoietic stem cell transplantation (allo-HSCT). We applied a whole genome analysis to investigate genetic variants - other than HLA class I and II - associated with negative outcome after HLA-identical sibling allo-HSCT in a cohort of 110 β-Thalassemic patients. We identified two single nucleotide polymorphisms in BAT2 (A/G) and BAT3 (T/C) genes, SNP rs11538264 and SNP rs10484558, both located in the HLA class III region, in strong Linkage Disequilibrium between each other (R2=0.92). When considered as single SNP, none of them reached a significant association with graft rejection (nominal P < 0.00001 for BAT2 SNP rs11538264, and P < 0.0001 for BAT3 SNP rs10484558). Whereas, the BAT2/BAT3 A/C haplotype was present at significantly higher frequency in patients who rejected as compared to those with functional graft (30.0% vs. 2.6%, nominal P = 1.15×10−8; and adjusted P = 0.0071). The BAT2/BAT3 polymorphisms and specifically the A/C haplotype may represent novel immunogenetic factor associated with graft rejection in patients undergoing allo-HSCT. PMID:25111513
Revealing the role of glutathione S-transferase omega in age-at-onset of Alzheimer and Parkinson diseases.

PubMed

Li, Yi-Ju; Scott, William K; Zhang, Ling; Lin, Ping-I; Oliveira, Sofia A; Skelly, Tara; Doraiswamy, Maurali P; Welsh-Bohmer, Kathleen A; Martin, Eden R; Haines, Jonathan L; Pericak-Vance, Margaret A; Vance, Jeffery M

2006-08-01

We previously reported a linkage region on chromosome 10q for age-at-onset (AAO) of Alzheimer (AD) and Parkinson (PD) diseases. Glutathione S-transferase, omega-1 (GSTO1) and the adjacent gene GSTO2, located in this linkage region, were then reported to associate with AAO of AD and PD. To examine whether GSTO1 and GSTO2 (hereafter referred to as GSTO1h) are responsible for the linkage evidence, we identified 39 families in AD that lead to our previous linkage and association findings. The evidence of linkage and association was markedly diminished after removing these 39 families from the analyses, thus providing support that GSTO1h drives the original linkage results. The maximum average AAO delayed by GSTO1h SNP 7-1 (rs4825, A nucleotide) was 6.8 (+/-4.41) years for AD and 8.6(+/-5.71) for PD, respectively. This is comparable to the magnitude of AAO difference by APOE-4 in these same AD and PD families. These findings suggest the presence of genetic heterogeneity for GSTO1h's effect on AAO, and support GSTO1h's role in modifying AAO in these two disorders.
Identification of Pyrus Single Nucleotide Polymorphisms (SNPs) and Evaluation for Genetic Mapping in European Pear and Interspecific Pyrus Hybrids

PubMed Central

Troggio, Michela; Malnoy, Mickael; Velasco, Riccardo; Fontana, Paolo; Won, KyungHo; Durel, Charles-Eric; Perchepied, Laure; Schaffer, Robert; Wiedow, Claudia; Bus, Vincent; Brewer, Lester; Gardiner, Susan E.; Crowhurst, Ross N.; Chagné, David

2013-01-01

We have used new generation sequencing (NGS) technologies to identify single nucleotide polymorphism (SNP) markers from three European pear (Pyrus communis L.) cultivars and subsequently developed a subset of 1096 pear SNPs into high throughput markers by combining them with the set of 7692 apple SNPs on the IRSC apple Infinium® II 8K array. We then evaluated this apple and pear Infinium® II 9K SNP array for large-scale genotyping in pear across several species, using both pear and apple SNPs. The segregating populations employed for array validation included a segregating population of European pear (‘Old Home’×‘Louise Bon Jersey’) and four interspecific breeding families derived from Asian (P. pyrifolia Nakai and P. bretschneideri Rehd.) and European pear pedigrees. In total, we mapped 857 polymorphic pear markers to construct the first SNP-based genetic maps for pear, comprising 78% of the total pear SNPs included in the array. In addition, 1031 SNP markers derived from apple (13% of the total apple SNPs included in the array) were polymorphic and were mapped in one or more of the pear populations. These results are the first to demonstrate SNP transferability across the genera Malus and Pyrus. Our construction of high density SNP-based and gene-based genetic maps in pear represents an important step towards the identification of chromosomal regions associated with a range of horticultural characters, such as pest and disease resistance, orchard yield and fruit quality. PMID:24155917
Discrimination of relationships with the same degree of kinship using chromosomal sharing patterns estimated from high-density SNPs.

PubMed

Morimoto, Chie; Manabe, Sho; Fujimoto, Shuntaro; Hamano, Yuya; Tamaki, Keiji

2018-03-01

Distinguishing relationships with the same degree of kinship (e.g., uncle-nephew and grandfather-grandson) is generally difficult in forensic genetics by using the commonly employed short tandem repeat loci. In this study, we developed a new method for discerning such relationships between two individuals by examining the number of chromosomal shared segments estimated from high-density single nucleotide polymorphisms (SNPs). We computationally generated second-degree kinships (i.e., uncle-nephew and grandfather-grandson) and third-degree kinships (i.e., first cousins and great-grandfather-great-grandson) for 174,254 autosomal SNPs considering the effect of linkage disequilibrium and recombination for each SNP. We investigated shared chromosomal segments between two individuals that were estimated based on identity by state regions. We then counted the number of segments in each pair. Based on our results, the number of shared chromosomal segments in collateral relationships was larger than that in lineal relationships with both the second-degree and third-degree kinships. This was probably caused by differences involving chromosomal transitions and recombination between relationships. As we probabilistically evaluated the relationships between simulated pairs based on the number of shared segments using logistic regression, we could determine accurate relationships in >90% of second-degree relatives and >70% of third-degree relatives, using a probability criterion for the relationship ≥0.9. Furthermore, we could judge the true relationships of actual sample pairs from volunteers, as well as simulated data. Therefore, this method can be useful for discerning relationships between two individuals with the same degree of kinship. Copyright © 2017 Elsevier B.V. All rights reserved.
Rapid genotyping with DNA micro-arrays for high-density linkage mapping and QTL mapping in common buckwheat (Fagopyrum esculentum Moench)

PubMed Central

Yabe, Shiori; Hara, Takashi; Ueno, Mariko; Enoki, Hiroyuki; Kimura, Tatsuro; Nishimura, Satoru; Yasui, Yasuo; Ohsawa, Ryo; Iwata, Hiroyoshi

2014-01-01

For genetic studies and genomics-assisted breeding, particularly of minor crops, a genotyping system that does not require a priori genomic information is preferable. Here, we demonstrated the potential of a novel array-based genotyping system for the rapid construction of high-density linkage map and quantitative trait loci (QTL) mapping. By using the system, we successfully constructed an accurate, high-density linkage map for common buckwheat (Fagopyrum esculentum Moench); the map was composed of 756 loci and included 8,884 markers. The number of linkage groups converged to eight, which is the basic number of chromosomes in common buckwheat. The sizes of the linkage groups of the P1 and P2 maps were 773.8 and 800.4 cM, respectively. The average interval between adjacent loci was 2.13 cM. The linkage map constructed here will be useful for the analysis of other common buckwheat populations. We also performed QTL mapping for main stem length and detected four QTL. It took 37 days to process 178 samples from DNA extraction to genotyping, indicating the system enables genotyping of genome-wide markers for a few hundred buckwheat plants before the plants mature. The novel system will be useful for genomics-assisted breeding in minor crops without a priori genomic information. PMID:25914583
Rapid genotyping with DNA micro-arrays for high-density linkage mapping and QTL mapping in common buckwheat (Fagopyrum esculentum Moench).

PubMed

Yabe, Shiori; Hara, Takashi; Ueno, Mariko; Enoki, Hiroyuki; Kimura, Tatsuro; Nishimura, Satoru; Yasui, Yasuo; Ohsawa, Ryo; Iwata, Hiroyoshi

2014-12-01

For genetic studies and genomics-assisted breeding, particularly of minor crops, a genotyping system that does not require a priori genomic information is preferable. Here, we demonstrated the potential of a novel array-based genotyping system for the rapid construction of high-density linkage map and quantitative trait loci (QTL) mapping. By using the system, we successfully constructed an accurate, high-density linkage map for common buckwheat (Fagopyrum esculentum Moench); the map was composed of 756 loci and included 8,884 markers. The number of linkage groups converged to eight, which is the basic number of chromosomes in common buckwheat. The sizes of the linkage groups of the P1 and P2 maps were 773.8 and 800.4 cM, respectively. The average interval between adjacent loci was 2.13 cM. The linkage map constructed here will be useful for the analysis of other common buckwheat populations. We also performed QTL mapping for main stem length and detected four QTL. It took 37 days to process 178 samples from DNA extraction to genotyping, indicating the system enables genotyping of genome-wide markers for a few hundred buckwheat plants before the plants mature. The novel system will be useful for genomics-assisted breeding in minor crops without a priori genomic information.
Caucasian Families Exhibit Significant Linkage of Myopia to Chromosome 11p.

PubMed

Musolf, Anthony M; Simpson, Claire L; Moiz, Bilal A; Long, Kyle A; Portas, Laura; Murgia, Federico; Ciner, Elise B; Stambolian, Dwight; Bailey-Wilson, Joan E

2017-07-01

Myopia is a common visual disorder caused by eye overgrowth, resulting in blurry vision. It affects one in four Americans, and its prevalence is increasing. The genetic mechanisms that underpin myopia are not completely understood. Here, we use genotype data and linkage analyses to identify high-risk genetic loci that are significantly linked to myopia. Individuals from 56 Caucasian families with a history of myopia were genotyped on an exome-based array, and the single nucleotide polymorphism (SNP) data were merged with microsatellite genotype data. Refractive error measures on the samples were converted into binary phenotypes consisting of affected, unaffected, or unknown myopia status. Parametric linkage analyses assuming an autosomal dominant model with 90% penetrance and 10% phenocopy rate were performed. Single variant two-point analyses yielded three significantly linked SNPs at 11p14.1 and 11p11.2; a further 45 SNPs at 11p were found to be suggestive. No other chromosome had any significant SNPs or more than seven suggestive linkages. Two of the significant SNPs were located in BBOX1-AS1 and one in the intergenic region between ORA47 and TRIM49B. Collapsed haplotype pattern two-point analysis and multipoint analyses also yielded multiple suggestively linked genes at 11p. Multipoint analysis also identified suggestive evidence of linkage on 20q13. We identified three genome-wide significant linked variants on 11p for myopia in Caucasians. Although the novel specific signals still need to be replicated, 11p is a promising region that has been identified by other linkage studies with a number of potentially interesting candidate genes. We hope that the identification of these regions on 11p as potential causal regions for myopia will lead to more focus on these regions and maybe possible replication of our specific linkage peaks in other studies. We further plan targeted sequencing on 11p for our most highly linked families to more clearly understand the source of the linkage in this region.
Imputation of microsatellite alleles from dense SNP genotypes for parentage verification across multiple Bos taurus and Bos indicus breeds

PubMed Central

McClure, Matthew C.; Sonstegard, Tad S.; Wiggans, George R.; Van Eenennaam, Alison L.; Weber, Kristina L.; Penedo, Cecilia T.; Berry, Donagh P.; Flynn, John; Garcia, Jose F.; Carmo, Adriana S.; Regitano, Luciana C. A.; Albuquerque, Milla; Silva, Marcos V. G. B.; Machado, Marco A.; Coffey, Mike; Moore, Kirsty; Boscher, Marie-Yvonne; Genestout, Lucie; Mazza, Raffaele; Taylor, Jeremy F.; Schnabel, Robert D.; Simpson, Barry; Marques, Elisa; McEwan, John C.; Cromie, Andrew; Coutinho, Luiz L.; Kuehn, Larry A.; Keele, John W.; Piper, Emily K.; Cook, Jim; Williams, Robert; Van Tassell, Curtis P.

2013-01-01

To assist cattle producers transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could be accurately be imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistancies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes, by MS genotyping animals whose imputed MS alleles fail parentage verification, and then incorporating those animals into the reference dataset. PMID:24065982
Genome-wide linkage scan for maximum and length-dependent knee muscle strength in young men: significant evidence for linkage at chromosome 14q24.3.

PubMed

De Mars, G; Windelinckx, A; Huygens, W; Peeters, M W; Beunen, G P; Aerssens, J; Vlietinck, R; Thomis, M A I

2008-05-01

Maintenance of high muscular fitness is positively related to bone health, functionality in daily life and increasing insulin sensitivity, and negatively related to falls and fractures, morbidity and mortality. Heritability of muscle strength phenotypes ranges between 31% and 95%, but little is known about the identity of the genes underlying this complex trait. As a first attempt, this genome-wide linkage study aimed to identify chromosomal regions linked to muscle and bone cross-sectional area, isometric knee flexion and extension torque, and torque-length relationship for knee flexors and extensors. In total, 283 informative male siblings (17-36 years old), belonging to 105 families, were used to conduct a genome-wide SNP-based multipoint linkage analysis. The strongest evidence for linkage was found for the torque-length relationship of the knee flexors at 14q24.3 (LOD = 4.09; p<10(-5)). Suggestive evidence for linkage was found at 14q32.2 (LOD = 3.00; P = 0.005) for muscle and bone cross-sectional area, at 2p24.2 (LOD = 2.57; p = 0.01) for isometric knee torque at 30 degrees flexion, at 1q21.3, 2p23.3 and 18q11.2 (LOD = 2.33, 2.69 and 2.21; p<10(-4) for all) for the torque-length relationship of the knee extensors and at 18p11.31 (LOD = 2.39; p = 0.0004) for muscle-mass adjusted isometric knee extension torque. We conclude that many small contributing genes rather than a few important genes are involved in causing variation in different underlying phenotypes of muscle strength. Furthermore, some overlap in promising genomic regions were identified among different strength phenotypes.

Genetic Variation in TLR Genes in Ugandan and South African Populations and Comparison with HapMap Data

PubMed Central

Randhawa, April Kaur; Horne, David J.; Adams, Mark D.; Shey, Muki; Barnholtz-Sloan, Jill; Mayanja-Kizza, Harriet; Kaplan, Gilla; Hanekom, Willem A.; Boom, W. Henry; Hawn, Thomas R.; Stein, Catherine M.

2012-01-01

Genetic epidemiological studies of complex diseases often rely on data from the International HapMap Consortium for identification of single nucleotide polymorphisms (SNPs), particularly those that tag haplotypes. However, little is known about the relevance of the African populations used to collect HapMap data for study populations conducted elsewhere in Africa. Toll-like receptor (TLR) genes play a key role in susceptibility to various infectious diseases, including tuberculosis. We conducted full-exon sequencing in samples obtained from Uganda (n = 48) and South Africa (n = 48), in four genes in the TLR pathway: TLR2, TLR4, TLR6, and TIRAP. We identified one novel TIRAP SNP (with minor allele frequency [MAF] 3.2%) and a novel TLR6 SNP (MAF 8%) in the Ugandan population, and a TLR6 SNP that is unique to the South African population (MAF 14%). These SNPs were also not present in the 1000 Genomes data. Genotype and haplotype frequencies and linkage disequilibrium patterns in Uganda and South Africa were similar to African populations in the HapMap datasets. Multidimensional scaling analysis of polymorphisms in all four genes suggested broad overlap of all of the examined African populations. Based on these data, we propose that there is enough similarity among African populations represented in the HapMap database to justify initial SNP selection for genetic epidemiological studies in Uganda and South Africa. We also discovered three novel polymorphisms that appear to be population-specific and would only be detected by sequencing efforts. PMID:23112821
High-Density Linkage Map Construction and Mapping of Salt-Tolerant QTLs at Seedling Stage in Upland Cotton Using Genotyping by Sequencing (GBS).

PubMed

Diouf, Latyr; Pan, Zhaoe; He, Shou-Pu; Gong, Wen-Fang; Jia, Yin Hua; Magwanga, Richard Odongo; Romy, Kimbembe Romesh Eric; Or Rashid, Harun; Kirungu, Joy Nyangasi; Du, Xiongming

2017-12-05

Over 6% of agricultural land is affected by salinity. It is becoming obligatory to use saline soils, so growing salt-tolerant plants is a priority. To gain an understanding of the genetic basis of upland cotton tolerance to salinity at seedling stage, an intra-specific cross was developed from CCRI35, tolerant to salinity, as female with Nan Dan (NH), sensitive to salinity, as the male. A genetic map of 5178 SNP markers was developed from 277 F 2:3 populations. The map spanned 4768.098 cM, with an average distance of 0.92 cM. A total of 66 QTLs for 10 traits related to salinity were detected in three environments (0, 110, and 150 mM salt treatment). Only 14 QTLs were consistent, accounting for 2.72% to 9.87% of phenotypic variation. Parental contributions were found to be in the ratio of 3:1, 10 QTLs from the sensitive and four QTLs from the resistant parent. Five QTLs were located in A t and nine QTLs in the D t sub-genome. Moreover, eight clusters were identified, in which 12 putative key genes were found to be related to salinity. The GBS-SNPs-based genetic map developed is the first high-density genetic map that has the potential to provide deeper insights into upland cotton salinity tolerance. The 12 key genes found in this study could be used for QTL fine mapping and cloning for further studies.
In-depth genome characterization of a Brazilian common bean core collection using DArTseq high-density SNP genotyping.

PubMed

Valdisser, Paula A M R; Pereira, Wendell J; Almeida Filho, Jâneo E; Müller, Bárbara S F; Coelho, Gesimária R C; de Menezes, Ivandilson P P; Vianna, João P G; Zucchi, Maria I; Lanna, Anna C; Coelho, Alexandre S G; de Oliveira, Jaison P; Moraes, Alessandra da Cunha; Brondani, Claudio; Vianello, Rosana P

2017-05-30

Common bean is a legume of social and nutritional importance as a food crop, cultivated worldwide especially in developing countries, accounting for an important source of income for small farmers. The availability of the complete sequences of the two common bean genomes has dramatically accelerated and has enabled new experimental strategies to be applied for genetic research. DArTseq has been widely used as a method of SNP genotyping allowing comprehensive genome coverage with genetic applications in common bean breeding programs. Using this technology, 6286 SNPs (1 SNP/86.5 Kbp) were genotyped in genic (43.3%) and non-genic regions (56.7%). Genetic subdivision associated to the common bean gene pools (K = 2) and related to grain types (K = 3 and K = 5) were reported. A total of 83% and 91% of all SNPs were polymorphic within the Andean and Mesoamerican gene pools, respectively, and 26% were able to differentiate the gene pools. Genetic diversity analysis revealed an average H E of 0.442 for the whole collection, 0.102 for Andean and 0.168 for Mesoamerican gene pools (F ST = 0.747 between gene pools), 0.440 for the group of cultivars and lines, and 0.448 for the group of landrace accessions (F ST = 0.002 between cultivar/line and landrace groups). The SNP effects were predicted with predominance of impact on non-coding regions (77.8%). SNPs under selection were identified within gene pools comparing landrace and cultivar/line germplasm groups (Andean: 18; Mesoamerican: 69) and between the gene pools (59 SNPs), predominantly on chromosomes 1 and 9. The LD extension estimate corrected for population structure and relatedness (r 2 SV ) was ~ 88 kbp, while for the Andean gene pool was ~ 395 kbp, and for the Mesoamerican was ~ 130 kbp. For common bean, DArTseq provides an efficient and cost-effective strategy of generating SNPs for large-scale genome-wide studies. The DArTseq resulted in an operational panel of 560 polymorphic SNPs in linkage equilibrium, providing high genome coverage. This SNP set could be used in genotyping platforms with many applications, such as population genetics, phylogeny relation between common bean varieties and support to molecular breeding approaches.
PRKCA: A Positional Candidate Gene for Body Mass Index and Asthma

PubMed Central

Murphy, Amy; Tantisira, Kelan G.; Soto-Quirós, Manuel E.; Avila, Lydiana; Klanderman, Barbara J.; Lake, Stephen; Weiss, Scott T.; Celedón, Juan C.

2009-01-01

Asthma incidence and prevalence are higher in obese individuals. A potential mechanistic basis for this relationship is pleiotropy. We hypothesized that significant linkage and candidate-gene association would be found for body mass index (BMI) in a population ascertained on asthma affection status. Linkage analysis for BMI was performed on 657 subjects in eight Costa Rican families enrolled in a study of asthma. Family-based association studies were conducted for BMI with SNPs within a positional candidate gene, PRKCA. SNPs within PRKCA were also tested for association with asthma. Association studies were conducted in 415 Costa Rican parent-child trios and 493 trios participating in the Childhood Asthma Management Program (CAMP). Although only modest evidence of linkage for BMI was obtained for the whole cohort, significant linkage was noted for BMI in females on chromosome 17q (peak LOD = 3.39). Four SNPs in a candidate gene in this region (PRKCA) had unadjusted association p values < 0.05 for BMI in both cohorts, with the joint p value for two SNPs remaining significant after adjustment for multiple comparisons (rs228883 and rs1005651, joint p values = 9.5 × 10−5 and 5.6 × 10−5). Similarly, eight SNPs had unadjusted association p values < 0.05 for asthma in both populations, with one SNP remaining significant after adjustment for multiple comparisons (rs11079657, joint p value = 2.6 × 10−5). PRKCA is a pleiotropic locus that is associated with both BMI and asthma and that has been identified via linkage analysis of BMI in a population ascertained on asthma. PMID:19576566
Draft Genome Sequence, and a Sequence-Defined Genetic Linkage Map of the Legume Crop Species Lupinus angustifolius L

PubMed Central

Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W.; Howieson, John G.; Li, Chengdao

2013-01-01

Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species. PMID:23734219
Draft genome sequence, and a sequence-defined genetic linkage map of the legume crop species Lupinus angustifolius L.

PubMed

Yang, Huaan; Tao, Ye; Zheng, Zequn; Zhang, Qisen; Zhou, Gaofeng; Sweetingham, Mark W; Howieson, John G; Li, Chengdao

2013-01-01

Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species.
Rapid genotyping by low-coverage resequencing to construct genetic linkage maps of fungi: a case study in Lentinula edodes

PubMed Central

2013-01-01

Background Genetic linkage maps are important tools in breeding programmes and quantitative trait analyses. Traditional molecular markers used for genotyping are limited in throughput and efficiency. The advent of next-generation sequencing technologies has facilitated progeny genotyping and genetic linkage map construction in the major grains. However, the applicability of the approach remains untested in the fungal system. Findings Shiitake mushroom, Lentinula edodes, is a basidiomycetous fungus that represents one of the most popular cultivated edible mushrooms. Here, we developed a rapid genotyping method based on low-coverage (~0.5 to 1.5-fold) whole-genome resequencing. We used the approach to genotype 20 single-spore isolates derived from L. edodes strain L54 and constructed the first high-density sequence-based genetic linkage map of L. edodes. The accuracy of the proposed genotyping method was verified experimentally with results from mating compatibility tests and PCR-single-strand conformation polymorphism on a few known genes. The linkage map spanned a total genetic distance of 637.1 cM and contained 13 linkage groups. Two hundred sequence-based markers were placed on the map, with an average marker spacing of 3.4 cM. The accuracy of the map was confirmed by comparing with previous maps the locations of known genes such as matA and matB. Conclusions We used the shiitake mushroom as an example to provide a proof-of-principle that low-coverage resequencing could allow rapid genotyping of basidiospore-derived progenies, which could in turn facilitate the construction of high-density genetic linkage maps of basidiomycetous fungi for quantitative trait analyses and improvement of genome assembly. PMID:23915543
Insights into the genetic architecture of morphological traits in two passerine bird species.

PubMed

Silva, C N S; McFarlane, S E; Hagen, I J; Rönnegård, L; Billing, A M; Kvalnes, T; Kemppainen, P; Rønning, B; Ringsby, T H; Sæther, B-E; Qvarnström, A; Ellegren, H; Jensen, H; Husby, A

2017-09-01

Knowledge about the underlying genetic architecture of phenotypic traits is needed to understand and predict evolutionary dynamics. The number of causal loci, magnitude of the effects and location in the genome are, however, still largely unknown. Here, we use genome-wide single-nucleotide polymorphism (SNP) data from two large-scale data sets on house sparrows and collared flycatchers to examine the genetic architecture of different morphological traits (tarsus length, wing length, body mass, bill depth, bill length, total and visible badge size and white wing patches). Genomic heritabilities were estimated using relatedness calculated from SNPs. The proportion of variance captured by the SNPs (SNP-based heritability) was lower in house sparrows compared with collared flycatchers, as expected given marker density (6348 SNPs in house sparrows versus 38 689 SNPs in collared flycatchers). Indeed, after downsampling to similar SNP density and sample size, this estimate was no longer markedly different between species. Chromosome-partitioning analyses demonstrated that the proportion of variance explained by each chromosome was significantly positively related to the chromosome size for some traits and, generally, that larger chromosomes tended to explain proportionally more variation than smaller chromosomes. Finally, we found two genome-wide significant associations with very small-effect sizes. One SNP on chromosome 20 was associated with bill length in house sparrows and explained 1.2% of phenotypic variation (V P ), and one SNP on chromosome 4 was associated with tarsus length in collared flycatchers (3% of V P ). Although we cannot exclude the possibility of undetected large-effect loci, our results indicate a polygenic basis for morphological traits.
The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies

PubMed Central

Barnett, Ian; Mukherjee, Rajarshi; Lin, Xihong

2017-01-01

It is of substantial interest to study the effects of genes, genetic pathways, and networks on the risk of complex diseases. These genetic constructs each contain multiple SNPs, which are often correlated and function jointly, and might be large in number. However, only a sparse subset of SNPs in a genetic construct is generally associated with the disease of interest. In this article, we propose the generalized higher criticism (GHC) to test for the association between an SNP set and a disease outcome. The higher criticism is a test traditionally used in high-dimensional signal detection settings when marginal test statistics are independent and the number of parameters is very large. However, these assumptions do not always hold in genetic association studies, due to linkage disequilibrium among SNPs and the finite number of SNPs in an SNP set in each genetic construct. The proposed GHC overcomes the limitations of the higher criticism by allowing for arbitrary correlation structures among the SNPs in an SNP-set, while performing accurate analytic p-value calculations for any finite number of SNPs in the SNP-set. We obtain the detection boundary of the GHC test. We compared empirically using simulations the power of the GHC method with existing SNP-set tests over a range of genetic regions with varied correlation structures and signal sparsity. We apply the proposed methods to analyze the CGEM breast cancer genome-wide association study. Supplementary materials for this article are available online. PMID:28736464
Genetics and mapping of a novel downy mildew resistance gene, Pl(18), introgressed from wild Helianthus argophyllus into cultivated sunflower (Helianthus annuus L.).

PubMed

Qi, L L; Foley, M E; Cai, X W; Gulya, T J

2016-04-01

A novel downy mildew resistance gene, Pl(18), was introgressed from wild Helianthus argophyllus into cultivated sunflower and genetically mapped to linkage group 2 of the sunflower genome. The new germplasm, HA-DM1, carrying Pl(18) has been released to the public. Sunflower downy mildew (DM) is considered to be the most destructive foliar disease that has spread to every major sunflower-growing country of the world, except Australia. A new dominant downy mildew resistance gene (Pl 18) transferred from wild Helianthus argophyllus (PI 494573) into cultivated sunflower was mapped to linkage group (LG) 2 of the sunflower genome using bulked segregant analysis with 869 simple sequence repeat (SSR) markers. Phenotyping 142 BC1F2:3 families derived from the cross of HA 89 and H. argophyllus confirmed the single gene inheritance of resistance. Since no other Pl gene has been mapped to LG2, this gene was novel and designated as Pl (18). SSR markers CRT214 and ORS203 flanked Pl(18) at a genetic distance of 1.1 and 0.4 cM, respectively. Forty-six single nucleotide polymorphism (SNP) markers that cover the Pl(18) region were surveyed for saturation mapping of the region. Six co-segregating SNP markers were 1.2 cM distal to Pl(18), and another four co-segregating SNP markers were 0.9 cM proximal to Pl(18). The new BC2F4-derived germplasm, HA-DM1, carrying Pl(18) has been released to the public. This new line is highly resistant to all Plasmopara halstedii races identified in the USA providing breeders with an effective new source of resistance against downy mildew in sunflower. The molecular markers that were developed will be especially useful in marker-assisted selection and pyramiding of Pl resistance genes because of their close proximity to the gene and the availability of high-throughput SNP detection assays.
Genome-wide differentiation of various melon horticultural groups for use in genome wide association study for fruit firmness and construction of a high resolution genetic map

USDA-ARS?s Scientific Manuscript database

We generated 13,789 single nucleotide plymorphism (SNP) markers from 97 melon accessions using genotyping by sequencing and anchored them to chromosomes to understand genome-wide fixation index between various melon morphotypes and linkage disequilibrium (LD) decay for inodorus and cantalupensis, th...
Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Development of Advanced Classification Algorithm for Genome-Wide Single Nucleotide Polymorphism (SNP) Data Analysis

DTIC Science & Technology

2011-04-01

critical. 5. REFERENCES Almasy, L, Blangero, J. (2009) “Human QTL linkage mapping.” Genetica 136:333-340. Amos, CI. (2007) “Successful...quantitative trait loci.” Genetica 136:237-243. Ward, JH, Hook, ME. “A Hierarchical Grouping Procedure Applied to a Problem of Grouping Profiles
Confirmation of Single-Locus Sex Determination and Female Heterogamety in Willow Based on Linkage Analysis.

PubMed

Chen, Yingnan; Wang, Tiantian; Fang, Lecheng; Li, Xiaoping; Yin, Tongming

2016-01-01

In this study, we constructed high-density genetic maps of Salix suchowensis and mapped the gender locus with an F1 pedigree. Genetic maps were separately constructed for the maternal and paternal parents by using amplified fragment length polymorphism (AFLP) markers and the pseudo-testcross strategy. The maternal map consisted of 20 linkage groups that spanned a genetic distance of 2333.3 cM; whereas the paternal map contained 21 linkage groups that covered 2260 cM. Based on the established genetic maps, it was found that the gender of willow was determined by a single locus on linkage group LG_03, and the female was the heterogametic gender. Aligned with mapped SSR markers, linkage group LG_03 was found to be associated with chromosome XV in willow. It is noteworthy that marker density in the vicinity of the gender locus was significantly higher than that expected by chance alone, which indicates severe recombination suppression around the gender locus. In conclusion, this study confirmed the findings on the single-locus sex determination and female heterogamety in willow. It also provided additional evidence that validated the previous studies, which found that different autosomes evolved into sex chromosomes between the sister genera of Salix (willow) and Populus (poplar).
Confirmation of Single-Locus Sex Determination and Female Heterogamety in Willow Based on Linkage Analysis

PubMed Central

Fang, Lecheng; Li, Xiaoping; Yin, Tongming

2016-01-01

In this study, we constructed high-density genetic maps of Salix suchowensis and mapped the gender locus with an F1 pedigree. Genetic maps were separately constructed for the maternal and paternal parents by using amplified fragment length polymorphism (AFLP) markers and the pseudo-testcross strategy. The maternal map consisted of 20 linkage groups that spanned a genetic distance of 2333.3 cM; whereas the paternal map contained 21 linkage groups that covered 2260 cM. Based on the established genetic maps, it was found that the gender of willow was determined by a single locus on linkage group LG_03, and the female was the heterogametic gender. Aligned with mapped SSR markers, linkage group LG_03 was found to be associated with chromosome XV in willow. It is noteworthy that marker density in the vicinity of the gender locus was significantly higher than that expected by chance alone, which indicates severe recombination suppression around the gender locus. In conclusion, this study confirmed the findings on the single-locus sex determination and female heterogamety in willow. It also provided additional evidence that validated the previous studies, which found that different autosomes evolved into sex chromosomes between the sister genera of Salix (willow) and Populus (poplar). PMID:26828940
Irish study of high-density Schizophrenia families: Field methods and power to detect linkage

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kendler, K.S.; Straub, R.E.; MacLean, C.J.

Large samples of multiplex pedigrees will probably be needed to detect susceptibility loci for schizophrenia by linkage analysis. Standardized ascertainment of such pedigrees from culturally and ethnically homogeneous populations may improve the probability of detection and replication of linkage. The Irish Study of High-Density Schizophrenia Families (ISHDSF) was formed from standardized ascertainment of multiplex schizophrenia families in 39 psychiatric facilities covering over 90% of the population in Ireland and Northern Ireland. We here describe a phenotypic sample and a subset thereof, the linkage sample. Individuals were included in the phenotypic sample if adequate diagnostic information, based on personal interview and/ormore » hospital record, was available. Only individuals with available DNA were included in the linkage sample. Inclusion of a pedigree into the phenotypic sample required at least two first, second, or third degree relatives with non-affective psychosis (NAP), one of whom had schizophrenia (S) or poor-outcome schizoaffective disorder (PO-SAD). Entry into the linkage sample required DNA samples on at least two individuals with NAP, of whom at least one had S or PO-SAD. Affection was defined by narrow, intermediate, and broad criteria. 75 refs., 6 tabs.« less
LSCC SNP variant regulates SOX2 modulation of VDAC3.

PubMed

Chyr, Jacqueline; Guo, Dongmin; Zhou, Xiaobo

2018-04-27

Lung squamous cell carcinoma (LSCC) is a genomically complex malignancy with no effective treatments. Recent studies have found a large number of DNA alterations such as SOX2 amplification in LSCC patients. As a stem cell transcription factor, SOX2 is important for the maintenance of pluripotent cells and may play a role in cancer. To study the downstream mechanisms of SOX2, we employed expression quantitative trait loci (eQTLs) technology to investigate how the presence of SOX2 affects the expression of target genes. We discovered unique eQTLs, such as rs798827-VDAC3 (FDR p -value = 0.0034), that are only found in SOX2-active patients but not in SOX2-inactive patients. SNP rs798827 is within strong linkage disequilibrium ( r 2 = 1) to rs58163073, where rs58163073 [T] allele increases the binding affinity of SOX2 and allele [TA] decreases it. In our analysis, SOX2 silencing downregulates VDAC3 in two LSCC cell lines. Chromatin conformation capturing data indicates that this SNP is located within the same Topologically Associating Domain (TAD) of VDAC3, further suggesting SOX2's role in the regulation of VDAC3 through the binding of rs58163073. By first subgrouping patients based on SOX2 activity, we made more relevant eQTL discoveries and our analysis can be applied to other diseases.
Polymorphisms and haplotypes in the bovine neuropeptide Y, growth hormone receptor, ghrelin, insulin-like growth factor 2, and uncoupling proteins 2 and 3 genes and their associations with measures of growth, performance, feed efficiency, and carcass merit in beef cattle.

PubMed

Sherman, E L; Nkrumah, J D; Murdoch, B M; Li, C; Wang, Z; Fu, A; Moore, S S

2008-01-01

Genes that regulate metabolism and energy partitioning have the potential to influence economically important traits in farm animals, as do polymorphisms within these genes. In the current study, SNP in the bovine neuropeptide Y (NPY), growth hormone receptor (GHR), ghrelin (GHRL), uncoupling proteins 2 and 3 (UCP2 and UCP3), IGF2, corticotrophin-releasing hormone (CRH), cocaine and amphetamine regulated transcript (CART), melanocortin-4 receptor (MC4R), proopiomelanocortin (POMC), and GH genes were evaluated for associations with growth, feed efficiency, and carcass merit in beef steers. In total, 24 SNP were evaluated for associations with these traits and haplotypes were constructed within each gene when 2 or more SNP showed significant associations. An A/G SNP located in intron 4 of the GHR gene had the largest effects on BW of the animals (dominance effect P < 0.01) and feed efficiency (allele substitution effect P < 0.05). Another A/G SNP located in the promoter region of GHR had similar effects but the haplotypes of these 2 SNP reduced the effects of the SNP located in intron 4. Three SNP in the NPY gene showed associations to marbling (P < 0.001) as well as with ADG, BW, and feed conversion ratio (FCR; P < 0.05). The combination of these 3 SNP into haplotypes generally improved the association or had a similar scale of association as each single SNP. Only 1 SNP in UCP3, an A/G SNP in intron 3, was associated with ADG (P = 0.025), partial efficiency of growth, and FCR (P < 0.01). Three SNP in UCP2 gene were in almost complete linkage disequilibrium and showed associations with lean meat yield, yield grade, DMI, and BW (P < 0.05). Haplo-types between the SNP in UCP3 and UCP2 generally reduced the associations seen individually in each SNP. An A/G SNP in the GHRL gene tended to show effects on residual feed intake, FCR, and partial efficiency of growth (P < 0.10). The IGF2 SNP most strongly affected LM area (P < 0.01), back fat, ADG, and FCR (P < 0.05). The SNP in the CART, MC4R, POMC, GH, and CRH genes did not show associations at P < 0.05 with any of the traits. Although most of the SNP that showed associations do not cause amino acid changes, these SNP could be linked to other yet to be detected causative mutations or nearby QTL. It will be very important to verify these results in other cattle populations.
Construction and Comparative Analyses of Highly Dense Linkage Maps of Two Sweet Cherry Intra-Specific Progenies of Commercial Cultivars

PubMed Central

Quero-García, José; Guzmán, Alejandra; Mansur, Levi; Gratacós, Eduardo; Silva, Herman; Rosyara, Umesh R.; Iezzoni, Amy; Meisel, Lee A.; Dirlewanger, Elisabeth

2013-01-01

Despite the agronomical importance and high synteny with other Prunus species, breeding improvements for cherry have been slow compared to other temperate fruits, such as apple or peach. However, the recent release of the peach genome v1.0 by the International Peach Genome Initiative and the sequencing of cherry accessions to identify Single Nucleotide Polymorphisms (SNPs) provide an excellent basis for the advancement of cherry genetic and genomic studies. The availability of dense genetic linkage maps in phenotyped segregating progenies would be a valuable tool for breeders and geneticists. Using two sweet cherry (Prunus avium L.) intra-specific progenies derived from crosses between ‘Black Tartarian’ × ‘Kordia’ (BT×K) and ‘Regina’ × ‘Lapins’(R×L), high-density genetic maps of the four parental lines and the two segregating populations were constructed. For BT×K and R×L, 89 and 121 F1 plants were used for linkage mapping, respectively. A total of 5,696 SNP markers were tested in each progeny. As a result of these analyses, 723 and 687 markers were mapped into eight linkage groups (LGs) in BT×K and R×L, respectively. The resulting maps spanned 752.9 and 639.9 cM with an average distance of 1.1 and 0.9 cM between adjacent markers in BT×K and R×L, respectively. The maps displayed high synteny and co-linearity between each other, with the Prunus bin map, and with the peach genome v1.0 for all eight LGs (LG1–LG8). These maps provide a useful tool for investigating traits of interest in sweet cherry and represent a qualitative advance in the understanding of the cherry genome and its synteny with other members of the Rosaceae family. PMID:23382953
De novo assembly and transcriptome analysis of the rubber tree (Hevea brasiliensis) and SNP markers development for rubber biosynthesis pathways.

PubMed

Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira

2014-01-01

Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection.
Polymorphisms in AKT3, FIGF, PRKAG3, and TGF-β genes are associated with myofiber characteristics in chickens.

PubMed

Chen, Sirui; An, Jianyong; Lian, Ling; Qu, Lujiang; Zheng, Jiangxia; Xu, Guiyun; Yang, Ning

2013-02-01

Muscle characteristics such as myofiber diameter, density, and total number are important traits in broiler breeding and production. In the present study, 19 SNP of 13 major genes, which are located in the vicinity of quantitative trait loci affecting breast muscle weight, including INS, IGF2, PIK3C2A, AKT3, PRKAB2, PRKAG3, VEGFA, RPS6KA2/3, FIGF, and TGF-β1/2/3, were chosen to be genotyped by high-throughput matrix-assisted laser desorption/ionization time-of-flight mass spectrometry in a broiler population. One hundred twenty birds were slaughtered at 6 wk of age. Body weight, breast muscle weight, myofiber diameter, density, and total number were determined for each bird. Six SNP with a very low minor allele frequency (<1%) were excluded for further analysis. The remaining 13 SNP were used for the association study with muscle characteristics. The results showed that SNP in TGF-β1/2/3 had significant effects on myofiber diameter. A SNP in PRKAG3 had a significant effect on myofiber density (P < 0.05). A C > G mutation in FIGF was strongly associated with total fiber number (P < 0.05). Additionally, birds with the GG genotype of the C > G mutation in AKT3 had significantly larger myofiber numbers (P < 0.05) than birds with the CC or GC genotype. The SNP identified in the present study might be used as potential markers in broiler breeding.

Assumption-free estimation of the genetic contribution to refractive error across childhood.

PubMed

Guggenheim, Jeremy A; St Pourcain, Beate; McMahon, George; Timpson, Nicholas J; Evans, David M; Williams, Cathy

2015-01-01

Studies in relatives have generally yielded high heritability estimates for refractive error: twins 75-90%, families 15-70%. However, because related individuals often share a common environment, these estimates are inflated (via misallocation of unique/common environment variance). We calculated a lower-bound heritability estimate for refractive error free from such bias. Between the ages 7 and 15 years, participants in the Avon Longitudinal Study of Parents and Children (ALSPAC) underwent non-cycloplegic autorefraction at regular research clinics. At each age, an estimate of the variance in refractive error explained by single nucleotide polymorphism (SNP) genetic variants was calculated using genome-wide complex trait analysis (GCTA) using high-density genome-wide SNP genotype information (minimum N at each age=3,404). The variance in refractive error explained by the SNPs ("SNP heritability") was stable over childhood: Across age 7-15 years, SNP heritability averaged 0.28 (SE=0.08, p<0.001). The genetic correlation for refractive error between visits varied from 0.77 to 1.00 (all p<0.001) demonstrating that a common set of SNPs was responsible for the genetic contribution to refractive error across this period of childhood. Simulations suggested lack of cycloplegia during autorefraction led to a small underestimation of SNP heritability (adjusted SNP heritability=0.35; SE=0.09). To put these results in context, the variance in refractive error explained (or predicted) by the time participants spent outdoors was <0.005 and by the time spent reading was <0.01, based on a parental questionnaire completed when the child was aged 8-9 years old. Genetic variation captured by common SNPs explained approximately 35% of the variation in refractive error between unrelated subjects. This value sets an upper limit for predicting refractive error using existing SNP genotyping arrays, although higher-density genotyping in larger samples and inclusion of interaction effects is expected to raise this figure toward twin- and family-based heritability estimates. The same SNPs influenced refractive error across much of childhood. Notwithstanding the strong evidence of association between time outdoors and myopia, and time reading and myopia, less than 1% of the variance in myopia at age 15 was explained by crude measures of these two risk factors, indicating that their effects may be limited, at least when averaged over the whole population.
Whole Genome Sequence Typing to Investigate the Apophysomyces Outbreak following a Tornado in Joplin, Missouri, 2011

PubMed Central

Etienne, Kizee A.; Gillece, John; Hilsabeck, Remy; Schupp, Jim M.; Colman, Rebecca; Lockhart, Shawn R.; Gade, Lalitha; Thompson, Elizabeth H.; Sutton, Deanna A.; Neblett-Fanfair, Robyn; Park, Benjamin J.; Turabelidze, George; Keim, Paul; Brandt, Mary E.; Deak, Eszter; Engelthaler, David M.

2012-01-01

Case reports of Apophysomyces spp. in immunocompetent hosts have been a result of traumatic deep implantation of Apophysomyces spp. spore-contaminated soil or debris. On May 22, 2011 a tornado occurred in Joplin, MO, leaving 13 tornado victims with Apophysomyces trapeziformis infections as a result of lacerations from airborne material. We used whole genome sequence typing (WGST) for high-resolution phylogenetic SNP analysis of 17 outbreak Apophysomyces isolates and five additional temporally and spatially diverse Apophysomyces control isolates (three A. trapeziformis and two A. variabilis isolates). Whole genome SNP phylogenetic analysis revealed three clusters of genotypically related or identical A. trapeziformis isolates and multiple distinct isolates among the Joplin group; this indicated multiple genotypes from a single or multiple sources. Though no linkage between genotype and location of exposure was observed, WGST analysis determined that the Joplin isolates were more closely related to each other than to the control isolates, suggesting local population structure. Additionally, species delineation based on WGST demonstrated the need to reassess currently accepted taxonomic classifications of phylogenetic species within the genus Apophysomyces. PMID:23209631
Whole genome sequence typing to investigate the Apophysomyces outbreak following a tornado in Joplin, Missouri, 2011.

PubMed

Etienne, Kizee A; Gillece, John; Hilsabeck, Remy; Schupp, Jim M; Colman, Rebecca; Lockhart, Shawn R; Gade, Lalitha; Thompson, Elizabeth H; Sutton, Deanna A; Neblett-Fanfair, Robyn; Park, Benjamin J; Turabelidze, George; Keim, Paul; Brandt, Mary E; Deak, Eszter; Engelthaler, David M

2012-01-01

Case reports of Apophysomyces spp. in immunocompetent hosts have been a result of traumatic deep implantation of Apophysomyces spp. spore-contaminated soil or debris. On May 22, 2011 a tornado occurred in Joplin, MO, leaving 13 tornado victims with Apophysomyces trapeziformis infections as a result of lacerations from airborne material. We used whole genome sequence typing (WGST) for high-resolution phylogenetic SNP analysis of 17 outbreak Apophysomyces isolates and five additional temporally and spatially diverse Apophysomyces control isolates (three A. trapeziformis and two A. variabilis isolates). Whole genome SNP phylogenetic analysis revealed three clusters of genotypically related or identical A. trapeziformis isolates and multiple distinct isolates among the Joplin group; this indicated multiple genotypes from a single or multiple sources. Though no linkage between genotype and location of exposure was observed, WGST analysis determined that the Joplin isolates were more closely related to each other than to the control isolates, suggesting local population structure. Additionally, species delineation based on WGST demonstrated the need to reassess currently accepted taxonomic classifications of phylogenetic species within the genus Apophysomyces.
PREST-plus identifies pedigree errors and cryptic relatedness in the GAW18 sample using genome-wide SNP data.

PubMed

Sun, Lei; Dimitromanolakis, Apostolos

2014-01-01

Pedigree errors and cryptic relatedness often appear in families or population samples collected for genetic studies. If not identified, these issues can lead to either increased false negatives or false positives in both linkage and association analyses. To identify pedigree errors and cryptic relatedness among individuals from the 20 San Antonio Family Studies (SAFS) families and cryptic relatedness among the 157 putatively unrelated individuals, we apply PREST-plus to the genome-wide single-nucleotide polymorphism (SNP) data and analyze estimated identity-by-descent (IBD) distributions for all pairs of genotyped individuals. Based on the given pedigrees alone, PREST-plus identifies the following putative pairs: 1091 full-sib, 162 half-sib, 360 grandparent-grandchild, 2269 avuncular, 2717 first cousin, 402 half-avuncular, 559 half-first cousin, 2 half-sib+first cousin, 957 parent-offspring and 440,546 unrelated. Using the genotype data, PREST-plus detects 7 mis-specified relative pairs, with their IBD estimates clearly deviating from the null expectations, and it identifies 4 cryptic related pairs involving 7 individuals from 6 families.
SNiPlay: a web-based tool for detection, management and analysis of SNPs. Application to grapevine diversity projects.

PubMed

Dereeper, Alexis; Nicolas, Stéphane; Le Cunff, Loïc; Bacilieri, Roberto; Doligez, Agnès; Peros, Jean-Pierre; Ruiz, Manuel; This, Patrice

2011-05-05

High-throughput re-sequencing, new genotyping technologies and the availability of reference genomes allow the extensive characterization of Single Nucleotide Polymorphisms (SNPs) and insertion/deletion events (indels) in many plant species. The rapidly increasing amount of re-sequencing and genotyping data generated by large-scale genetic diversity projects requires the development of integrated bioinformatics tools able to efficiently manage, analyze, and combine these genetic data with genome structure and external data. In this context, we developed SNiPlay, a flexible, user-friendly and integrative web-based tool dedicated to polymorphism discovery and analysis. It integrates:1) a pipeline, freely accessible through the internet, combining existing softwares with new tools to detect SNPs and to compute different types of statistical indices and graphical layouts for SNP data. From standard sequence alignments, genotyping data or Sanger sequencing traces given as input, SNiPlay detects SNPs and indels events and outputs submission files for the design of Illumina's SNP chips. Subsequently, it sends sequences and genotyping data into a series of modules in charge of various processes: physical mapping to a reference genome, annotation (genomic position, intron/exon location, synonymous/non-synonymous substitutions), SNP frequency determination in user-defined groups, haplotype reconstruction and network, linkage disequilibrium evaluation, and diversity analysis (Pi, Watterson's Theta, Tajima's D).Furthermore, the pipeline allows the use of external data (such as phenotype, geographic origin, taxa, stratification) to define groups and compare statistical indices.2) a database storing polymorphisms, genotyping data and grapevine sequences released by public and private projects. It allows the user to retrieve SNPs using various filters (such as genomic position, missing data, polymorphism type, allele frequency), to compare SNP patterns between populations, and to export genotyping data or sequences in various formats. Our experiments on grapevine genetic projects showed that SNiPlay allows geneticists to rapidly obtain advanced results in several key research areas of plant genetic diversity. Both the management and treatment of large amounts of SNP data are rendered considerably easier for end-users through automation and integration. Current developments are taking into account new advances in high-throughput technologies.SNiPlay is available at: http://sniplay.cirad.fr/.
Fine mapping of copy number variations on two cattle genome assemblies using high density SNP array

USDA-ARS?s Scientific Manuscript database

Btau_4.0 and UMD3.1 are two distinct cattle reference genome assemblies. In our previous study using the low density BovineSNP50 array, we reported a copy number variation (CNV) analysis on Btau_4.0 with 521 animals of 21 cattle breeds, yielding 682 CNV regions with a total length of 139.8 megabases...
Detection, breakpoint identification and detailed characterisation of a CNV at the FRA16D site using SNP assays.

PubMed

Winchester, L; Newbury, D F; Monaco, A P; Ragoussis, J

2008-01-01

Copy Number Variants (CNV) and other submicroscopic structural changes are now recognised to be widespread across the human genome. We show that SNP data generated for association study can be utilised for the identification of deletion CNVs. During analysis of data for an SNP association study for Specific Language Impairment (SLI) a deletion was identified. SLI adversely affects the language development of children in the absence of any obvious cause. Previous studies have found linkage to a region on chromosome 16. The deletion was located in a known fragile site FRA16D in intron 5-6 of the WWOX gene (also known as FOR). Changes in the FRA16D site have been previously linked to cancer and are often characterised in cell lines. A long-range PCR assay was used to confirm the existence of the deletion. We also show the breakpoint identification and large-scale characterisation of this CNV in a normal human sample set. Copyright 2009 S. Karger AG, Basel.
High-throughput SNP genotyping in Cucurbita pepo for map construction and quantitative trait loci mapping

PubMed Central

2012-01-01

Background Cucurbita pepo is a member of the Cucurbitaceae family, the second- most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). Results We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Conclusion Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most of these markers are located in the coding regions of genes involved in different physiological processes. The platform will also be useful for future mapping and diversity studies, and will be essential in order to accelerate the process of breeding new and better-adapted squash varieties. PMID:22356647
Leveraging Genomic Annotations and Pleiotropic Enrichment for Improved Replication Rates in Schizophrenia GWAS

PubMed Central

Wang, Yunpeng; Thompson, Wesley K.; Schork, Andrew J.; Holland, Dominic; Chen, Chi-Hua; Bettella, Francesco; Desikan, Rahul S.; Li, Wen; Witoelar, Aree; Zuber, Verena; Devor, Anna; Nöthen, Markus M.; Rietschel, Marcella; Chen, Qiang; Werge, Thomas; Cichon, Sven; Weinberger, Daniel R.; Djurovic, Srdjan; O’Donovan, Michael; Visscher, Peter M.; Andreassen, Ole A.; Dale, Anders M.

2016-01-01

Most of the genetic architecture of schizophrenia (SCZ) has not yet been identified. Here, we apply a novel statistical algorithm called Covariate-Modulated Mixture Modeling (CM3), which incorporates auxiliary information (heterozygosity, total linkage disequilibrium, genomic annotations, pleiotropy) for each single nucleotide polymorphism (SNP) to enable more accurate estimation of replication probabilities, conditional on the observed test statistic (“z-score”) of the SNP. We use a multiple logistic regression on z-scores to combine information from auxiliary information to derive a “relative enrichment score” for each SNP. For each stratum of these relative enrichment scores, we obtain nonparametric estimates of posterior expected test statistics and replication probabilities as a function of discovery z-scores, using a resampling-based approach that repeatedly and randomly partitions meta-analysis sub-studies into training and replication samples. We fit a scale mixture of two Gaussians model to each stratum, obtaining parameter estimates that minimize the sum of squared differences of the scale-mixture model with the stratified nonparametric estimates. We apply this approach to the recent genome-wide association study (GWAS) of SCZ (n = 82,315), obtaining a good fit between the model-based and observed effect sizes and replication probabilities. We observed that SNPs with low enrichment scores replicate with a lower probability than SNPs with high enrichment scores even when both they are genome-wide significant (p < 5x10-8). There were 693 and 219 independent loci with model-based replication rates ≥80% and ≥90%, respectively. Compared to analyses not incorporating relative enrichment scores, CM3 increased out-of-sample yield for SNPs that replicate at a given rate. This demonstrates that replication probabilities can be more accurately estimated using prior enrichment information with CM3. PMID:26808560
Association between FTO polymorphism in exon 3 with carcass and meat quality traits in crossbred ducks.

PubMed

Gan, W; Song, Q; Zhang, N N; Xiong, X P; Wang, D M C; Li, L

2015-06-18

The fat mass and obesity-associated gene (FTO) is an excellent candidate gene that affects energy metabolism. Single nucleotide polymorphisms (SNPs) in FTO are associated with carcass and meat quality traits in pigs, cattle, and rabbits. The aim of this study was to investigate the association between novel SNPs in the FTO coding region and carcass and meat quality traits in 95 crossbred ducks, using DNA sequencing. We found two transitions G/A (SNP 387 and 473) within exon 3. SNP 387 was a synonymous mutation, whereas SNP 473 was a missense mutation. Association analysis suggested that SNP g.387G>A was significantly associated with all of the carcass traits measured, the intramuscular fat content (IMF), cooking yield (CY), pH values 45 min after slaughter (pH45m), drip losses from the breast muscle, and the leg muscle (P < 0.05). For SNP g.473G>A, the genotype AA exhibited greater leg muscle weight than the genotypes GG or AG (P < 0.05). The D value suggested that the two SNPs exhibited strong linkage disequilibrium. Three haplotypes (G1G2, G1A2, and A1A2) were significantly associated with IMF, CY, the a* value, and all of the carcass traits measured (P < 0.05). The results suggest that FTO is a candidate locus that affects carcass and meat quality traits in ducks.
Multicapillary gel electrophoresis based analysis of genetic variants in the WFS1 gene.

PubMed

Elek, Zsuzsanna; Dénes, Réka; Prokop, Susanne; Somogyi, Anikó; Yowanto, Handy; Luo, Jane; Souquet, Manfred; Guttman, András; Rónai, Zsolt

2016-09-01

The WFS1 gene is one of the thoroughly investigated targets in diabetes research, variants of the gene were suggested to be the genetic components of the common forms (type 1 and type 2) of diabetes. Our project focused on the analysis of polymorphisms (rs4689388, rs148797429, rs4273545) localized in the WFS1 promoter region. Although submarine gel electrophoresis based approaches were also employed in the genetic tests, it was demonstrated that multicapillary electrophoresis offers a state of the art approach for reliable high-throughput SNP and VNTR analysis. Association studies were carried out in a case-control setup. Luciferase reporter assay was employed to test the effect of the investigated loci on the activity of gene expression in vitro. Significant association could be demonstrated between all three polymorphisms and type 2 diabetes in both allele- and genotype-wise settings even using Bonferroni correction. It is notable; however, that the three loci were in strong linkage disequilibrium, thus the observed associations cannot be considered as separate effects. Molecular analyses showed that the rs4273545 GT SNP played a role in the regulation of transcription in vitro. However, this effect took place only in the presence of the region including the rs148797429 site, although this latter locus did not have its own impact on the regulation of gene expression. The paper provides genotyping protocols readily applicable in any multiplex SNP and VNTR analyses, moreover confirms and extends previous results about the role of WFS1 polymorphisms in the genetic risk of diabetes mellitus. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The easy road to genome-wide medium density SNP screening in a non-model species: development and application of a 10 K SNP-chip for the house sparrow (Passer domesticus).

PubMed

Hagen, Ingerid J; Billing, Anna M; Rønning, Bernt; Pedersen, Sindre A; Pärn, Henrik; Slate, Jon; Jensen, Henrik

2013-05-01

With the advent of next generation sequencing, new avenues have opened to study genomics in wild populations of non-model species. Here, we describe a successful approach to a genome-wide medium density Single Nucleotide Polymorphism (SNP) panel in a non-model species, the house sparrow (Passer domesticus), through the development of a 10 K Illumina iSelect HD BeadChip. Genomic DNA and cDNA derived from six individuals were sequenced on a 454 GS FLX system and generated a total of 1.2 million sequences, in which SNPs were detected. As no reference genome exists for the house sparrow, we used the zebra finch (Taeniopygia guttata) reference genome to determine the most likely position of each SNP. The 10 000 SNPs on the SNP-chip were selected to be distributed evenly across 31 chromosomes, giving on average one SNP per 100 000 bp. The SNP-chip was screened across 1968 individual house sparrows from four island populations. Of the original 10 000 SNPs, 7413 were found to be variable, and 99% of these SNPs were successfully called in at least 93% of all individuals. We used the SNP-chip to demonstrate the ability of such genome-wide marker data to detect population sub-division, and compared these results to similar analyses using microsatellites. The SNP-chip will be used to map Quantitative Trait Loci (QTL) for fitness-related phenotypic traits in natural populations. © 2013 Blackwell Publishing Ltd.
Genome-Wide Association Study for Identification and Validation of Novel SNP Markers for Sr6 Stem Rust Resistance Gene in Bread Wheat.

PubMed

Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S

2018-01-01

Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.), is a major disease in wheat ( Triticum aestivium L.). However, in recent years it occurs rarely in Nebraska due to weather and the effective selection and gene pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied genome-wide association study (GWAS) on a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers, which were significantly (Bonferroni corrected P < 0.05) associated with the resistance on chromosome 2D. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of Sr6 gene which was expected in these genotypes based on pedigree information. A highly significant linkage disequilibrium (LD, r 2 ) was found between the significant SNPs and the specific SSR marker for the Sr6 gene ( Xcfd43 ). This suggests the significant SNP markers are tagging Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All the 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection, genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) marker for the Sr6 gene. Novel SNPs for Sr6 gene, an important stem rust resistant gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.
A New Metazoan Recombination Rate Record and Consistently High Recombination Rates in the Honey Bee Genus Apis Accompanied by Frequent Inversions but Not Translocations.

PubMed

Rueppell, Olav; Kuster, Ryan; Miller, Katelyn; Fouks, Bertrand; Rubio Correa, Sara; Collazo, Juan; Phaincharoen, Mananya; Tingek, Salim; Koeniger, Nikolaus

2016-12-01

Western honey bees (Apis mellifera) far exceed the commonly observed 1–2 meiotic recombination events per chromosome and exhibit the highest Metazoan recombination rate (20 cM/Mb) described thus far. However, the reasons for this exceptional rate of recombination are not sufficiently understood. In a comparative study, we report on the newly constructed genomic linkage maps of Apis florea and Apis dorsata that represent the two honey bee lineages without recombination rate estimates so far. Each linkage map was generated de novo, based on SNP genotypes of haploid male offspring of a single female. The A. florea map spans 4,782 cM with 1,279 markers in 16 linkage groups. The A. dorsata map is 5,762 cM long and contains 1,189 markers in 16 linkage groups. Respectively, these map sizes result in average recombination rate estimates of 20.8 and 25.1 cM/Mb. Synteny analyses indicate that frequent intra-chromosomal rearrangements but no translocations among chromosomes accompany the high rates of recombination during the independent evolution of the three major honey bee lineages. Our results imply a common cause for the evolution of very high recombination rates in Apis. Our findings also suggest that frequent homologous recombination during meiosis might increase ectopic recombination and rearrangements within but not between chromosomes. It remains to be investigated whether the resulting inversions may have been important in the evolutionary differentiation between honey bee species.
Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus).

PubMed

Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

2015-07-27

Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1-8) were identified and genotyped via direct sequencing covering most of the coding region and 3'UTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3'UTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs.
Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus)

PubMed Central

Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

2015-01-01

Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1–8) were identified and genotyped via direct sequencing covering most of the coding region and 3ʹUTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3ʹUTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs. PMID:26225956
Construction of a high-density genetic map and the X/Y sex-determining gene mapping in spinach based on large-scale markers developed by specific-locus amplified fragment sequencing (SLAF-seq).

PubMed

Qian, Wei; Fan, Guiyan; Liu, Dandan; Zhang, Helong; Wang, Xiaowu; Wu, Jian; Xu, Zhaosheng

2017-04-04

Cultivated spinach (Spinacia oleracea L.) is one of the most widely cultivated types of leafy vegetable in the world, and it has a high nutritional value. Spinach is also an ideal plant for investigating the mechanism of sex determination because it is a dioecious species with separate male and female plants. Some reports on the sex labeling and localization of spinach in the study of molecular markers have surfaced. However, there have only been two reports completed on the genetic map of spinach. The lack of rich and reliable molecular markers and the shortage of high-density linkage maps are important constraints in spinach research work. In this study, a high-density genetic map of spinach based on the Specific-locus Amplified Fragment Sequencing (SLAF-seq) technique was constructed; the sex-determining gene was also finely mapped. Through bio-information analysis, 50.75 Gb of data in total was obtained, including 207.58 million paired-end reads. Finally, 145,456 high-quality SLAF markers were obtained, with 27,800 polymorphic markers and 4080 SLAF markers were finally mapped onto the genetic map after linkage analysis. The map spanned 1,125.97 cM with an average distance of 0.31 cM between the adjacent marker loci. It was divided into 6 linkage groups corresponding to the number of spinach chromosomes. Besides, the combination of Bulked Segregation Analysis (BSA) with SLAF-seq technology(super-BSA) was employed to generate the linkage markers with the sex-determining gene. Combined with the high-density genetic map of spinach, the sex-determining gene X/Y was located at the position of the linkage group (LG) 4 (66.98 cM-69.72 cM and 75.48 cM-92.96 cM), which may be the ideal region for the sex-determining gene. A high-density genetic map of spinach based on the SLAF-seq technique was constructed with a backcross (BC 1 ) population (which is the highest density genetic map of spinach reported at present). At the same time, the sex-determining gene X/Y was mapped to LG4 with super-BSA. This map will offer a suitable basis for further study of spinach, such as gene mapping, map-based cloning of Specific genes, quantitative trait locus (QTL) mapping and marker-assisted selection (MAS). It will also provide an efficient reference for studies on the mechanism of sex determination in other dioecious plants.
A 34K SNP genotyping array for Populus trichocarpa: design, application to the study of natural populations and transferability to other Populus species

DOE Office of Scientific and Technical Information (OSTI.GOV)

Geraldes, Armando; Hannemann, Jan; Grassa, Chris

2013-01-01

Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. Despite the declining costs of genotyping by sequencing, for most studies, the use of large SNP genotyping arrays still offers the most cost-effective solution for large-scale targeted genotyping. Here we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species range. Due to the rapid decay of linkage disequilibrium in P. trichocarpa we adopted a candidate gene approach to the arraymore » design that resulted in the selection of 34,131 SNPs, the majority of which are located in, or within 2 kb, of 3,543 candidate genes. A subset of the SNPs (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%, indicating that high-quality data are generated with this array. We demonstrate that even among small numbers of samples (n=10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that due to ascertainment bias the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca (P. balsamifera and P. angustifolia). Finally, we provide evidence for the utility of the array for intraspecific studies of genetic differentiation and for species assignment and the detection of natural hybrids.« less
Association Mapping of the High-Grade Myopia MYP3 Locus Reveals Novel Candidates UHRF1BP1L, PTPRR, and PPFIA2

PubMed Central

Hawthorne, Felicia; Feng, Sheng; Metlapally, Ravikanth; Li, Yi-Ju; Tran-Viet, Khanh-Nhat; Guggenheim, Jeremy A.; Malecaze, Francois; Calvas, Patrick; Rosenberg, Thomas; Mackey, David A.; Venturini, Cristina; Hysi, Pirro G.; Hammond, Christopher J.; Young, Terri L.

2013-01-01

Purpose. Myopia, or nearsightedness, is a common ocular genetic disease for which over 20 candidate genomic loci have been identified. The high-grade myopia locus, MYP3, has been reported on chromosome 12q21–23 by four independent linkage studies. Methods. We performed a genetic association study of the MYP3 locus in a family-based high-grade myopia cohort (n = 82) by genotyping 768 single-nucleotide polymorphisms (SNPs) within the linkage region. Qualitative testing for high-grade myopia (sphere ≤ −5 D affected, > −0.5 D unaffected) and quantitative testing on the average dioptric sphere were performed. Results. Several genetic markers were nominally significantly associated with high-grade myopia in qualitative testing, including rs3803036, a missense mutation in PTPRR (P = 9.1 × 10−4) and rs4764971, an intronic SNP in UHRF1BP1L (P = 6.1 × 10−4). Quantitative testing determined statistically significant SNPs rs4764971, also found by qualitative testing (P = 3.1 × 10−6); rs7134216, in the 3′ untranslated region (UTR) of DEPDC4 (P = 5.4 × 10−7); and rs17306116, an intronic SNP within PPFIA2 (P < 9 × 10−4). Independently conducted whole genome expression array analyses identified protein tyrosine phosphatase genes PTPRR and PPFIA2, which are in the same gene family, as differentially expressed in normal rapidly growing fetal relative to normal adult ocular tissue (confirmed by RT-qPCR). Conclusions. In an independent high-grade myopia cohort, an intronic SNP in UHRF1BP1L, rs4764971, was validated for quantitative association, and SNPs within PTPRR (quantitative) and PPFIA2 (qualitative and quantitative) approached significance. Three genes identified by our association study and supported by ocular expression and/or replication, UHRF1BP1L, PTPRR, and PPFIA2, are novel candidates for myopic development within the MYP3 locus that should be further studied. PMID:23422819
Haplotypes of CYP3A4 and their close linkage with CYP3A5 haplotypes in a Japanese population.

PubMed

Fukushima-Uesaka, Hiromi; Saito, Yoshiro; Watanabe, Hidemi; Shiseki, Kisho; Saeki, Mayumi; Nakamura, Takahiro; Kurose, Kouichi; Sai, Kimie; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Kitakaze, Masafumi; Hanai, Sotaro; Nakajima, Toshiharu; Matsumoto, Kenji; Saito, Hirohisa; Goto, Yu-ichi; Kimura, Hideo; Katoh, Masaaki; Sugai, Kenji; Minami, Narihiro; Shirao, Kuniaki; Tamura, Tomohide; Yamamoto, Noboru; Minami, Hironobu; Ohtsu, Atsushi; Yoshida, Teruhiko; Saijo, Nagahiro; Kitamura, Yutaka; Kamatani, Naoyuki; Ozawa, Shogo; Sawada, Jun-ichi

2004-01-01

In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A4 in a Japanese population, the distal enhancer and proximal promoter regions, all exons, and the surrounding introns were sequenced from genomic DNA of 416 Japanese subjects. We found 24 SNPs, including 17 novel ones: two in the distal enhancer, four in the proximal promoter, one in the 5'-untranslated region (UTR), seven in the introns, and three in the 3'-UTR. The most common SNP was c.1026+12G>A (IVS10+12G>A), with a 0.249 frequency. Four non-synonymous SNPs, c.554C>G (p.T185S, CYP3A4(*)16), c.830_831insA (p.E277fsX8, (*)6), c.878T>C (p.L293P, (*)18), and c.1088 C>T (p.T363M, (*)11) were found with frequencies of 0.014, 0.001, 0.028, and 0.002, respectively. No SNP was found in the known nuclear transcriptional factor-binding sites in the enhancer and promoter regions. Using these 24 SNPs, 16 haplotypes were unambiguously identified, and nine haplotypes were inferred by aid of an expectation-maximization-based program. In addition, using data from 186 subjects enabled a close linkage to be found between CYP3A4 and CYP3A5 SNPs, especially among the SNPs at c.1026+12 in CYP3A4 and c.219-237 (IVS3-237, a key SNP site for CYP3A5(*)3), c.865+77 (IVS9+77) and c.1523 in CYP3A5. This result suggested that CYP3A4 and CYP3A5 are within the same gene block. Haplotype analysis between CYP3A4 and CYP3A5 revealed several major haplotype combinations in the CYP3A4-CYP3A5 block. Our findings provide fundamental and useful information for genotyping CYP3A4 (and CYP3A5) in the Japanese, and probably Asian populations. Copyright 2003 Wiley-Liss, Inc.

Genetic tests for estimating dairy breed proportion and parentage assignment in East African crossbred cattle.

PubMed

Strucken, Eva M; Al-Mamun, Hawlader A; Esquivelzeta-Rabell, Cecilia; Gondro, Cedric; Mwai, Okeyo A; Gibson, John P

2017-09-12

Smallholder dairy farming in much of the developing world is based on the use of crossbred cows that combine local adaptation traits of indigenous breeds with high milk yield potential of exotic dairy breeds. Pedigree recording is rare in such systems which means that it is impossible to make informed breeding decisions. High-density single nucleotide polymorphism (SNP) assays allow accurate estimation of breed composition and parentage assignment but are too expensive for routine application. Our aim was to determine the level of accuracy achieved with low-density SNP assays. We constructed subsets of 100 to 1500 SNPs from the 735k-SNP Illumina panel by selecting: (a) on high minor allele frequencies (MAF) in a crossbred population; (b) on large differences in allele frequency between ancestral breeds; (c) at random; or (d) with a differential evolution algorithm. These panels were tested on a dataset of 1933 crossbred dairy cattle from Kenya/Uganda and on crossbred populations from Ethiopia (N = 545) and Tanzania (N = 462). Dairy breed proportions were estimated by using the ADMIXTURE program, a regression approach, and SNP-best linear unbiased prediction, and tested against estimates obtained by ADMIXTURE based on the 735k-SNP panel. Performance for parentage assignment was based on opposing homozygotes which were used to calculate the separation value (sv) between true and false assignments. Panels of SNPs based on the largest differences in allele frequency between European dairy breeds and a combined Nelore/N'Dama population gave the best predictions of dairy breed proportion (r 2 = 0.962 to 0.994 for 100 to 1500 SNPs) with an average absolute bias of 0.026. Panels of SNPs based on the highest MAF in the crossbred population (Kenya/Uganda) gave the most accurate parentage assignments (sv = -1 to 15 for 100 to 1500 SNPs). Due to the different required properties of SNPs, panels that did well for breed composition did poorly for parentage assignment and vice versa. A combined panel of 400 SNPs was not able to assign parentages correctly, thus we recommend the use of 200 SNPs either for breed proportion prediction or parentage assignment, independently.
A transcriptome-snp-derived linkage map of Apios americana (potato bean) provides insights about genome re-organization and synteny conservation in the phaseolid legumes

USDA-ARS?s Scientific Manuscript database

Apios (Apios americana; “apios”), a tuberous perennial legume in the Phaseoleae tribe, was widely used as a food by Native Americans. Work in the last 40 years has led to several improved breeding lines. Aspects of the pollination biology (complex floral structure and tripping mechanism) have made c...
High-resolution genetic map for understanding the effect of genome-wide recombination rate, selection sweep and linkage disequilibrium on nucleotide diversity in watermelon

USDA-ARS?s Scientific Manuscript database

Genotyping by sequencing (GBS) technology was used to identify a set of 9,933 single nucleotide polymorphism (SNP) markers for constructing a high-resolution genetic map of 1,087 cM for watermelon. The genome-wide variation of recombination rate (GWRR) across the map was evaluated and a positive co...
Dissecting genomic hotspots underlying seed protein, oil, and sucrose content in an interspecific mapping population of soybean using high-density linkage mapping.

PubMed

Patil, Gunvant; Vuong, Tri D; Kale, Sandip; Valliyodan, Babu; Deshmukh, Rupesh; Zhu, Chengsong; Wu, Xiaolei; Bai, Yonghe; Yungbluth, Dennis; Lu, Fang; Kumpatla, Siva; Shannon, J Grover; Varshney, Rajeev K; Nguyen, Henry T

2018-04-04

The cultivated [Glycine max (L) Merr.] and wild [Glycine soja Siebold & Zucc.] soybean species comprise wide variation in seed composition traits. Compared to wild soybean, cultivated soybean contains low protein, high oil, and high sucrose. In this study, an interspecific population was derived from a cross between G. max (Williams 82) and G. soja (PI 483460B). This recombinant inbred line (RIL) population of 188 lines was sequenced at 0.3× depth. Based on 91 342 single nucleotide polymorphisms (SNPs), recombination events in RILs were defined, and a high-resolution bin map was developed (4070 bins). In addition to bin mapping, quantitative trait loci (QTL) analysis for protein, oil, and sucrose was performed using 3343 polymorphic SNPs (3K-SNP), derived from Illumina Infinium BeadChip sequencing platform. The QTL regions from both platforms were compared, and a significant concordance was observed between bin and 3K-SNP markers. Importantly, the bin map derived from next-generation sequencing technology enhanced mapping resolution (from 1325 to 50 Kb). A total of five, nine, and four QTLs were identified for protein, oil, and sucrose content, respectively, and some of the QTLs coincided with soybean domestication-related genomic loci. The major QTL for protein and oil were mapped on Chr. 20 (qPro_20) and suggested negative correlation between oil and protein. In terms of sucrose content, a novel and major QTL were identified on Chr. 8 (qSuc_08) and harbours putative genes involved in sugar transport. In addition, genome-wide association using 91 342 SNPs confirmed the genomic loci derived from QTL mapping. A QTL-based haplotype using whole-genome resequencing of 106 diverse soybean lines identified unique allelic variation in wild soybean that could be utilized to widen the genetic base in cultivated soybean. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Inheritance of Virulence, Construction of a Linkage Map, and Mapping Dominant Virulence Genes in Puccinia striiformis f. sp. tritici Through Characterization of a Sexual Population with Genotyping-by-Sequencing.

PubMed

Yuan, Congying; Wang, Meinan; Skinner, Danniel Z; See, Deven R; Xia, Chongjing; Guo, Xinhong; Chen, Xianming

2018-01-01

Puccinia striiformis f. sp. tritici, the wheat stripe rust pathogen, is a dikaryotic, biotrophic, and macrocyclic fungus. Genetic study of P. striiformis f. sp. tritici virulence was not possible until the recent discovery of Berberis spp. and Mahonia spp. as alternate hosts. To determine inheritance of virulence and map virulence genes, a segregating population of 119 isolates was developed by self-fertilizing P. striiformis f. sp. tritici isolate 08-220 (race PSTv-11) on barberry leaves under controlled greenhouse conditions. The progeny isolates were phenotyped on a set of 29 wheat lines with single genes for race-specific resistance and genotyped with simple sequence repeat (SSR) markers, single nucleotide polymorphism (SNP) markers derived from secreted protein genes, and SNP markers from genotyping-by-sequencing (GBS). Using the GBS technique, 10,163 polymorphic GBS-SNP markers were identified. Clustering and principal component analysis grouped these markers into six genetic groups, and a genetic map, consisting of six linkage groups, was constructed with 805 markers. The six clusters or linkage groups resulting from these analyses indicated a haploid chromosome number of six in P. striiformis f. sp. tritici. Through virulence testing of the progeny isolates, the parental isolate was found to be homozygous for the avirulence loci corresponding to resistance genes Yr5, Yr10, Yr15, Yr24, Yr32, YrSP, YrTr1, Yr45, and Yr53 and homozygous for the virulence locus corresponding to resistance gene Yr41. Segregation was observed for virulence phenotypes in response to the remaining 19 single-gene lines. A single dominant gene or two dominant genes with different nonallelic gene interactions were identified for each of the segregating virulence phenotypes. Of 27 dominant virulence genes identified, 17 were mapped to two chromosomes. Markers tightly linked to some of the virulence loci may facilitate further studies to clone these genes. The virulence genes and their inheritance information are useful for understanding the host-pathogen interactions and for selecting effective resistance genes or gene combinations for developing stripe rust resistant wheat cultivars.
Genomic signatures reveal geographic adaption and human selection in cattle

USDA-ARS?s Scientific Manuscript database

We investigated geographic adaptation and human selection using high-density SNP data of five diverse cattle breeds. Based on allele frequency differences, we detected hundreds of candidate regions under positive selection across Holstein, Angus, Charolais, Brahman, and N'Dama. In addition to well-k...
SNP Marker Integration and QTL Analysis of 12 Agronomic and Morphological Traits in F8 RILs of Pepper (Capsicum annuum L.)

PubMed Central

Lu, Fu-Hao; Kwon, Soon-Wook; Yoon, Min-Young; Kim, Ki-Taek; Cho, Myeong-Cheoul; Yoon, Moo-Kyung; Park, Yong-Jin

2012-01-01

Red pepper, Capsicum annuum L., has been attracting geneticists’ and breeders’ attention as one of the important agronomic crops. This study was to integrate 41 SNP markers newly developed from comparative transcriptomes into a previous linkage map, and map 12 agronomic and morphological traits into the integrated map. A total of 39 markers found precise position and were assigned to 13 linkage groups (LGs) as well as the unassigned LGe, leading to total 458 molecular markers present in this genetic map. Linkage mapping was supported by the physical mapping to tomato and potato genomes using BLAST retrieving, revealing at least two-thirds of the markers mapped to the corresponding LGs. A sum of 23 quantitative trait loci from 11 traits was detected using the composite interval mapping algorithm. A consistent interval between a035_1 and a170_1 on LG5 was detected as a main-effect locus among the resistance QTLs to Phytophthora capsici at high-, intermediate- and low-level tests, and interactions between the QTLs for high-level resistance test were found. Considering the epistatic effect, those QTLs could explain up to 98.25% of the phenotype variations of resistance. Moreover, 17 QTLs for another eight traits were found to locate on LG3, 4, and 12 mostly with varying phenotypic contribution. Furthermore, the locus for corolla color was mapped to LG10 as a marker. The integrated map and the QTLs identified would be helpful for current genetics research and crop breeding, especially in the Solanaceae family. PMID:22684870
Estimation of linkage disequilibrium and interspecific gene flow in Ficedula flycatchers by a newly developed 50k single-nucleotide polymorphism array

PubMed Central

Kawakami, Takeshi; Backström, Niclas; Burri, Reto; Husby, Arild; Olason, Pall; Rice, Amber M; Ålund, Murielle; Qvarnström, Anna; Ellegren, Hans

2014-01-01

With the access to draft genome sequence assemblies and whole-genome resequencing data from population samples, molecular ecology studies will be able to take truly genome-wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million single-nucleotide polymorphism (SNP)s in population resequencing of this species and its sister species, pied flycatcher. Here, we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. Using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. Linkage disequilibrium declined quickly to the background level at an average distance of 17 kb, but the extent of LD varied markedly within the genome and was more than 10-fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later-generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from much fewer number of markers. With an estimated divergence time as recently as <1 Ma, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system. PMID:24784959
SNP Discovery by Illumina-Based Transcriptome Sequencing of the Olive and the Genetic Characterization of Turkish Olive Genotypes Revealed by AFLP, SSR and SNP Markers

PubMed Central

Kaya, Hilal Betul; Cetin, Oznur; Kaya, Hulya; Sahin, Mustafa; Sefer, Filiz; Kahraman, Abdullah; Tanyolac, Bahattin

2013-01-01

Background The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. Methodology/Principal Findings Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. Conclusions/Significance This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. High levels of genetic variation among Turkish olive genotypes revealed by SNPs, AFLPs and SSRs allowed us to characterize the Turkish olive genotype. PMID:24058483
Development and preliminary evaluation of a 90 K Axiom® SNP array for the allo-octoploid cultivated strawberry Fragaria × ananassa.

PubMed

Bassil, Nahla V; Davis, Thomas M; Zhang, Hailong; Ficklin, Stephen; Mittmann, Mike; Webster, Teresa; Mahoney, Lise; Wood, David; Alperin, Elisabeth S; Rosyara, Umesh R; Koehorst-Vanc Putten, Herma; Monfort, Amparo; Sargent, Daniel J; Amaya, Iraida; Denoyes, Beatrice; Bianco, Luca; van Dijk, Thijs; Pirani, Ali; Iezzoni, Amy; Main, Dorrie; Peace, Cameron; Yang, Yilong; Whitaker, Vance; Verma, Sujeet; Bellon, Laurent; Brew, Fiona; Herrera, Raul; van de Weg, Eric

2015-03-07

A high-throughput genotyping platform is needed to enable marker-assisted breeding in the allo-octoploid cultivated strawberry Fragaria × ananassa. Short-read sequences from one diploid and 19 octoploid accessions were aligned to the diploid Fragaria vesca 'Hawaii 4' reference genome to identify single nucleotide polymorphisms (SNPs) and indels for incorporation into a 90 K Affymetrix® Axiom® array. We report the development and preliminary evaluation of this array. About 36 million sequence variants were identified in a 19 member, octoploid germplasm panel. Strategies and filtering pipelines were developed to identify and incorporate markers of several types: di-allelic SNPs (66.6%), multi-allelic SNPs (1.8%), indels (10.1%), and ploidy-reducing "haploSNPs" (11.7%). The remaining SNPs included those discovered in the diploid progenitor F. iinumae (3.9%), and speculative "codon-based" SNPs (5.9%). In genotyping 306 octoploid accessions, SNPs were assigned to six classes with Affymetrix's "SNPolisher" R package. The highest quality classes, PolyHigh Resolution (PHR), No Minor Homozygote (NMH), and Off-Target Variant (OTV) comprised 25%, 38%, and 1% of array markers, respectively. These markers were suitable for genetic studies as demonstrated in the full-sib family 'Holiday' × 'Korona' with the generation of a genetic linkage map consisting of 6,594 PHR SNPs evenly distributed across 28 chromosomes with an average density of approximately one marker per 0.5 cM, thus exceeding our goal of one marker per cM. The Affymetrix IStraw90 Axiom array is the first high-throughput genotyping platform for cultivated strawberry and is commercially available to the worldwide scientific community. The array's high success rate is likely driven by the presence of naturally occurring variation in ploidy level within the nominally octoploid genome, and by effectiveness of the employed array design and ploidy-reducing strategies. This array enables genetic analyses including generation of high-density linkage maps, identification of quantitative trait loci for economically important traits, and genome-wide association studies, thus providing a basis for marker-assisted breeding in this high value crop.
A low-density SNP array for analyzing differential selection in freshwater and marine populations of threespine stickleback (Gasterosteus aculeatus).

PubMed

Ferchaud, Anne-Laure; Pedersen, Susanne H; Bekkevold, Dorte; Jian, Jianbo; Niu, Yongchao; Hansen, Michael M

2014-10-06

The threespine stickleback (Gasterosteus aculeatus) has become an important model species for studying both contemporary and parallel evolution. In particular, differential adaptation to freshwater and marine environments has led to high differentiation between freshwater and marine stickleback populations at the phenotypic trait of lateral plate morphology and the underlying candidate gene Ectodysplacin (EDA). Many studies have focused on this trait and candidate gene, although other genes involved in marine-freshwater adaptation may be equally important. In order to develop a resource for rapid and cost efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. RAD (Restriction site Associated DNA) sequencing of sixty individuals representing two freshwater and one marine population led to the identification of 33,993 SNP markers. Ninety-six of these were chosen for the low-density SNP array, among which 70 represented SNPs under putatively directional selection in freshwater vs. marine environments, whereas 26 SNPs were assumed to be neutral. Annotation of these regions revealed several genes that are candidates for affecting stickleback phenotypic variation, some of which have been observed in previous studies whereas others are new. We have developed a cost-efficient low-density SNP array that allows for rapid screening of polymorphisms in threespine stickleback. The array provides a valuable tool for analyzing adaptive divergence between freshwater and marine stickleback populations beyond the well-established candidate gene Ectodysplacin (EDA).
Linkage Analysis in Autoimmune Addison's Disease: NFATC1 as a Potential Novel Susceptibility Locus.

PubMed

Mitchell, Anna L; Bøe Wolff, Anette; MacArthur, Katie; Weaver, Jolanta U; Vaidya, Bijay; Erichsen, Martina M; Darlay, Rebecca; Husebye, Eystein S; Cordell, Heather J; Pearce, Simon H S

2015-01-01

Autoimmune Addison's disease (AAD) is a rare, highly heritable autoimmune endocrinopathy. It is possible that there may be some highly penetrant variants which confer disease susceptibility that have yet to be discovered. DNA samples from 23 multiplex AAD pedigrees from the UK and Norway (50 cases, 67 controls) were genotyped on the Affymetrix SNP 6.0 array. Linkage analysis was performed using Merlin. EMMAX was used to carry out a genome-wide association analysis comparing the familial AAD cases to 2706 UK WTCCC controls. To explore some of the linkage findings further, a replication study was performed by genotyping 64 SNPs in two of the four linked regions (chromosomes 7 and 18), on the Sequenom iPlex platform in three European AAD case-control cohorts (1097 cases, 1117 controls). The data were analysed using a meta-analysis approach. In a parametric analysis, applying a rare dominant model, loci on chromosomes 7, 9 and 18 had LOD scores >2.8. In a non-parametric analysis, a locus corresponding to the HLA region on chromosome 6, known to be associated with AAD, had a LOD score >3.0. In the genome-wide association analysis, a SNP cluster on chromosome 2 and a pair of SNPs on chromosome 6 were associated with AAD (P <5x10-7). A meta-analysis of the replication study data demonstrated that three chromosome 18 SNPs were associated with AAD, including a non-synonymous variant in the NFATC1 gene. This linkage study has implicated a number of novel chromosomal regions in the pathogenesis of AAD in multiplex AAD families and adds further support to the role of HLA in AAD. The genome-wide association analysis has also identified a region of interest on chromosome 2. A replication study has demonstrated that the NFATC1 gene is worthy of future investigation, however each of the regions identified require further, systematic analysis.
Association analysis of the vitamin D receptor gene, the type I collagen gene COL1A1, and the estrogen receptor gene in idiopathic osteoarthritis.

PubMed

Loughlin, J; Sinsheimer, J S; Mustafa, Z; Carr, A J; Clipsham, K; Bloomfield, V A; Chitnavis, J; Bailey, A; Sykes, B; Chapman, K

2000-03-01

Evidence has accumulated supporting a role for genes in the etiology of osteoarthritis (OA). Several candidates have been targeted as potential susceptibility loci including genes that are involved in the regulation of bone density. Genetic association analysis has suggested a role for the vitamin D receptor gene (VDR) and the estrogen receptor gene (ER) in susceptibility. Such findings must be tested in additional independent cohorts. We tested for association of these 2 genes, plus a third gene implicated in bone density, COL1A1, with idiopathic OA. A case-control cohort of 371 affected probands and 369 unaffected spouses was used. Association was tested using 4 intragenic single nucleotide polymorphisms (SNP), one each for the VDR and COL1A1 genes, and 2 for the ER gene. The VDR and ER SNP are the same SNP that have been associated with OA. All 4 SNP affect restriction enzyme sites and were genotyped using polymerase chain reaction and enzyme digestion. Allele and genotype distributions for each SNP were compared between cases and controls and analyzed using Fisher's exact test. There was no evidence of association of the VDR or the ER gene SNP to OA. There was weak evidence of association of the COL1A1 SNP in female cases (p = 0.017), reflected by a difference in the distribution of genotypes at this SNP between female cases and controls (p = 0.027). However, when corrected for multiple testing, these results were not significant. If the VDR, ER, or COL1A1 genes do encode predisposition to OA then the 4 SNP tested are not associated with major susceptibility alleles at these 3 loci.
Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers

PubMed Central

Jiang, Yong; Schmidt, Renate H.; Reif, Jochen C.

2018-01-01

Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. PMID:29549092
Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers.

PubMed

Jiang, Yong; Schmidt, Renate H; Reif, Jochen C

2018-05-04

Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. Copyright © 2018 Jiang et al.
Variants in the ATP-Binding Cassette Transporter (ABCA7), Apolipoprotein E ε4, and the Risk of Late-Onset Alzheimer Disease in African Americans

PubMed Central

Reitz, Christiane; Jun, Gyungah; Naj, Adam; Rajbhandary, Ruchita; Vardarajan, Badri Narayan; Wang, Li-San; Valladares, Otto; Lin, Chiao-Feng; Larson, Eric B.; Graff-Radford, Neill R.; Evans, Denis; De Jager, Philip L.; Crane, Paul K.; Buxbaum, Joseph D.; Murrell, Jill R.; Raj, Towfique; Ertekin-Taner, Nilufer; Logue, Mark; Baldwin, Clinton T.; Green, Robert C.; Barnes, Lisa L.; Cantwell, Laura B.; Fallin, M. Daniele; Go, Rodney C. P.; Griffith, Patrick; Obisesan, Thomas O.; Manly, Jennifer J.; Lunetta, Kathryn L.; Kamboh, M. Ilyas; Lopez, Oscar L.; Bennett, David A.; Hendrie, Hugh; Hall, Kathleen S.; Goate, Alison M.; Byrd, Goldie S.; Kukull, Walter A.; Foroud, Tatiana M.; Haines, Jonathan L.; Farrer, Lindsay A.; Pericak-Vance, Margaret A.; Schellenberg, Gerard D.; Mayeux, Richard

2013-01-01

Importance Genetic variants associated with susceptibility to late-onset Alzheimer disease are known for individuals of European ancestry, but whether the same or different variants account for the genetic risk of Alzheimer disease in African American individuals is unknown. Identification of disease-associated variants helps identify targets for genetic testing, prevention, and treatment. Objective To identify genetic loci associated with late-onset Alzheimer disease in African Americans. Design, Setting, and Participants The Alzheimer Disease Genetics Consortium (ADGC) assembled multiple data sets representing a total of 5896 African Americans (1968 case participants, 3928 control participants) 60 years or older that were collected between 1989 and 2011 at multiple sites. The association of Alzheimer disease with genotyped and imputed single-nucleotide polymorphisms (SNPs) was assessed in case-control and in family-based data sets. Results from individual data sets were combined to perform an inverse variance–weighted meta-analysis, first with genome-wide analyses and subsequently with gene-based tests for previously reported loci. Main Outcomes and Measures Presence of Alzheimer disease according to standardized criteria. Results Genome-wide significance in fully adjusted models (sex, age, APOE genotype, population stratification) was observed for a SNP in ABCA7 (rs115550680, allele = G; frequency, 0.09 cases and 0.06 controls; odds ratio [OR], 1.79 [95% CI, 1.47-2.12]; P = 2.2 × 10–9), which is in linkage disequilibrium with SNPs previously associated with Alzheimer disease in Europeans (0.8
Advances in Maize Genomics and Their Value for Enhancing Genetic Gains from Breeding

PubMed Central

Xu, Yunbi; Skinner, Debra J.; Wu, Huixia; Palacios-Rojas, Natalia; Araus, Jose Luis; Yan, Jianbing; Gao, Shibin; Warburton, Marilyn L.; Crouch, Jonathan H.

2009-01-01

Maize is an important crop for food, feed, forage, and fuel across tropical and temperate areas of the world. Diversity studies at genetic, molecular, and functional levels have revealed that, tropical maize germplasm, landraces, and wild relatives harbor a significantly wider range of genetic variation. Among all types of markers, SNP markers are increasingly the marker-of-choice for all genomics applications in maize breeding. Genetic mapping has been developed through conventional linkage mapping and more recently through linkage disequilibrium-based association analyses. Maize genome sequencing, initially focused on gene-rich regions, now aims for the availability of complete genome sequence. Conventional insertion mutation-based cloning has been complemented recently by EST- and map-based cloning. Transgenics and nutritional genomics are rapidly advancing fields targeting important agronomic traits including pest resistance and grain quality. Substantial advances have been made in methodologies for genomics-assisted breeding, enhancing progress in yield as well as abiotic and biotic stress resistances. Various genomic databases and informatics tools have been developed, among which MaizeGDB is the most developed and widely used by the maize research community. In the future, more emphasis should be given to the development of tools and strategic germplasm resources for more effective molecular breeding of tropical maize products. PMID:19688107
Nucleotide variability and linkage disequilibrium patterns in the porcine MUC4 gene.

PubMed

Yang, Ming; Yang, Bin; Yan, Xueming; Ouyang, Jing; Zeng, Weihong; Ai, Huashui; Ren, Jun; Huang, Lusheng

2012-07-13

MUC4 is a type of membrane anchored glycoprotein and serves as the major constituent of mucus that covers epithelial surfaces of many tissues such as trachea, colon and cervix. MUC4 plays important roles in the lubrication and protection of the surface epithelium, cell proliferation and differentiation, immune response, cell adhesion and cancer development. To gain insights into the evolution of the porcine MUC4 gene, we surveyed the nucleotide variability and linkage disequilibrium (LD) within this gene in Chinese indigenous breeds and Western commercial breeds. A total of 53 SNPs covering the MUC4 gene were genotyped on 5 wild boars and 307 domestic pigs representing 11 Chinese breeds and 3 Western breeds. The nucleotide variability, haplotype phylogeny and LD extent of MUC4 were analyzed in these breeds. Both Chinese and Western breeds had considerable nucleotide diversity at the MUC4 locus. Western pig breeds like Duroc and Large White have comparable nucleotide diversity as many of Chinese breeds, thus artificial selection for lean pork production have not reduced the genetic variability of MUC4 in Western commercial breeds. Haplotype phylogeny analyses indicated that MUC4 had evolved divergently in Chinese and Western pigs. The dendrogram of genetic differentiation between breeds generally reflected demographic history and geographical distribution of these breeds. LD patterns were unexpectedly similar between Chinese and Western breeds, in which LD usually extended less than 20 kb. This is different from the presumed high LD extent (more than 100 kb) in Western commercial breeds. The significant positive Tajima'D, and Fu and Li's D statistics in a few Chinese and Western breeds implied that MUC4 might undergo balancing selection in domestic breeds. Nevertheless, we cautioned that the significant statistics could be upward biased by SNP ascertainment process. Chinese and Western breeds have similar nucleotide diversity but evolve divergently in the MUC4 region. Western breeds exhibited unusual low LD extent at the MUC4 locus, reflecting the complexity of nucleotide variability of pig genome. The finding suggests that high density (e.g. 1SNP/10 kb) markers are required to capture the underlying causal variants at such regions.
Association of the oxytocin receptor (OXTR) gene polymorphisms with autism spectrum disorder (ASD) in the Japanese population.

PubMed

Liu, Xiaoxi; Kawamura, Yoshiya; Shimada, Takafumi; Otowa, Takeshi; Koishi, Shinko; Sugiyama, Toshiro; Nishida, Hisami; Hashimoto, Ohiko; Nakagami, Ryoichi; Tochigi, Mamoru; Umekage, Tadashi; Kano, Yukiko; Miyagawa, Taku; Kato, Nobumasa; Tokunaga, Katsushi; Sasaki, Tsukasa

2010-03-01

The oxytocin receptor (OXTR) gene, which is located on chromosome 3p25.3, has been implicated as a candidate gene for susceptibility of autism spectrum disorder (ASD). Positive associations between OXTR and ASD have been reported in earlier studies. However, the results were inconsistent and demand further studies. In this study, we investigated the associations between OXTR and ASD in a Japanese population by analyzing 11 single-nucleotide polymorphisms (SNPs) using both family-based association test (FBAT) and population-based case-control test. No significant signal was detected in the FBAT test. However, significant differences were observed in allelic frequencies of four SNPs, including rs2254298 between patients and controls. The risk allele of rs2254298 was 'A', which was consistent with the previous study in Chinese, and not with the observations in Caucasian. The difference in the risk allele of this SNP in previous studies might be attributable to an ethnic difference in the linkage disequilibrium structure between the Asians and Caucasians. In addition, haplotype analysis exhibits a significant association between a five-SNP haplotype and ASD, including rs22542898. In conclusion, our study might support that OXTR has a significant role in conferring the risk of ASD in the Japanese population.
Interval mapping for red/green skin color in Asian pears using a modified QTL-seq method

PubMed Central

Xue, Huabai; Shi, Ting; Wang, Fangfang; Zhou, Huangkai; Yang, Jian; Wang, Long; Wang, Suke; Su, Yanli; Zhang, Zhen; Qiao, Yushan; Li, Xiugen

2017-01-01

Pears with red skin are attractive to consumers and provide additional health benefits. Identification of the gene(s) responsible for skin coloration can benefit cultivar selection and breeding. The use of QTL-seq, a bulked segregant analysis method, can be problematic when heterozygous parents are involved. The present study modified the QTL-seq method by introducing a |Δ(SNP-index)| parameter to improve the accuracy of mapping the red skin trait in a group of highly heterozygous Asian pears. The analyses were based on mixed DNA pools composed of 28 red-skinned and 27 green-skinned pear lines derived from a cross between the ‘Mantianhong’ and ‘Hongxiangsu’ red-skinned cultivars. The ‘Dangshansuli’ cultivar genome was used as reference for sequence alignment. An average single-nucleotide polymorphism (SNP) index was calculated using a sliding window approach (200-kb windows, 20-kb increments). Nine scaffolds within the candidate QTL interval were in the fifth linkage group from 111.9 to 177.1 cM. There was a significant linkage between the insertions/deletions and simple sequence repeat markers designed from the candidate intervals and the red/green skin (R/G) locus, which was in a 582.5-kb candidate interval that contained 81 predicted protein-coding gene models and was composed of two subintervals at the bottom of the fifth chromosome. The ZFRI 130-16, In2130-12 and In2130-16 markers located near the R/G locus could potentially be used to identify the red skin trait in Asian pear populations. This study provides new insights into the genetics controlling the red skin phenotype in this fruit. PMID:29118994

Interval mapping for red/green skin color in Asian pears using a modified QTL-seq method.

PubMed

Xue, Huabai; Shi, Ting; Wang, Fangfang; Zhou, Huangkai; Yang, Jian; Wang, Long; Wang, Suke; Su, Yanli; Zhang, Zhen; Qiao, Yushan; Li, Xiugen

2017-01-01

Pears with red skin are attractive to consumers and provide additional health benefits. Identification of the gene(s) responsible for skin coloration can benefit cultivar selection and breeding. The use of QTL-seq, a bulked segregant analysis method, can be problematic when heterozygous parents are involved. The present study modified the QTL-seq method by introducing a |Δ(SNP-index)| parameter to improve the accuracy of mapping the red skin trait in a group of highly heterozygous Asian pears. The analyses were based on mixed DNA pools composed of 28 red-skinned and 27 green-skinned pear lines derived from a cross between the 'Mantianhong' and 'Hongxiangsu' red-skinned cultivars. The 'Dangshansuli' cultivar genome was used as reference for sequence alignment. An average single-nucleotide polymorphism (SNP) index was calculated using a sliding window approach (200-kb windows, 20-kb increments). Nine scaffolds within the candidate QTL interval were in the fifth linkage group from 111.9 to 177.1 cM. There was a significant linkage between the insertions/deletions and simple sequence repeat markers designed from the candidate intervals and the red/green skin (R/G) locus, which was in a 582.5-kb candidate interval that contained 81 predicted protein-coding gene models and was composed of two subintervals at the bottom of the fifth chromosome. The ZFRI 130-16, In2130-12 and In2130-16 markers located near the R/G locus could potentially be used to identify the red skin trait in Asian pear populations. This study provides new insights into the genetics controlling the red skin phenotype in this fruit.
A major QTL controlling apple skin russeting maps on the linkage group 12 of 'Renetta Grigia di Torriana'.

PubMed

Falginella, Luigi; Cipriani, Guido; Monte, Corinne; Gregori, Roberto; Testolin, Raffaele; Velasco, Riccardo; Troggio, Michela; Tartarini, Stefano

2015-06-19

Russeting is a disorder developed by apple fruits that consists of cuticle cracking followed by the replacement of the epidermis by a corky layer that protects the fruit surface from water loss and pathogens. Although influenced by many environmental conditions and orchard management practices, russeting is under genetic control. The difficulty in classifying offspring and consequent variable segregation ratios have led several authors to conclude that more than one genetic determinant could be involved, although some evidence favours a major gene (Ru). In this study we report the mapping of a major genetic russeting determinant on linkage group 12 of apple as inferred from the phenotypic observation in a segregating progeny derived from 'Renetta Grigia di Torriana', the construction of a 20 K Illumina SNP chip based genetic map, and QTL analysis. Recombination analysis in two mapping populations restricted the region of interest to approximately 400 Kb. Of the 58 genes predicted from the Golden Delicious sequence, a putative ABCG family transporter has been identified. Within a small set of russeted cultivars tested with markers of the region, only six showed the same haplotype of 'Renetta Grigia di Torriana'. A major determinant (Ru_RGT) for russeting development putatively involved in cuticle organization is proposed as a candidate for controlling the trait. SNP and SSR markers tightly co-segregating with the Ru_RGT locus may assist the breeder selection. The observed segregations and the analysis of the 'Renetta Grigia di Torriana' haplotypic region in a panel of russeted and non-russeted cultivars may suggest the presence of other determinants for russeting in apple.
De Novo Assembly and Transcriptome Analysis of the Rubber Tree (Hevea brasiliensis) and SNP Markers Development for Rubber Biosynthesis Pathways

PubMed Central

Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira

2014-01-01

Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection. PMID:25048025
The effect of using genealogy-based haplotypes for genomic prediction

PubMed Central

2013-01-01

Background Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. Methods A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. Results About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Conclusions Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy. PMID:23496971
The effect of using genealogy-based haplotypes for genomic prediction.

PubMed

Edriss, Vahid; Fernando, Rohan L; Su, Guosheng; Lund, Mogens S; Guldbrandtsen, Bernt

2013-03-06

Genomic prediction uses two sources of information: linkage disequilibrium between markers and quantitative trait loci, and additive genetic relationships between individuals. One way to increase the accuracy of genomic prediction is to capture more linkage disequilibrium by regression on haplotypes instead of regression on individual markers. The aim of this study was to investigate the accuracy of genomic prediction using haplotypes based on local genealogy information. A total of 4429 Danish Holstein bulls were genotyped with the 50K SNP chip. Haplotypes were constructed using local genealogical trees. Effects of haplotype covariates were estimated with two types of prediction models: (1) assuming that effects had the same distribution for all haplotype covariates, i.e. the GBLUP method and (2) assuming that a large proportion (π) of the haplotype covariates had zero effect, i.e. a Bayesian mixture method. About 7.5 times more covariate effects were estimated when fitting haplotypes based on local genealogical trees compared to fitting individuals markers. Genealogy-based haplotype clustering slightly increased the accuracy of genomic prediction and, in some cases, decreased the bias of prediction. With the Bayesian method, accuracy of prediction was less sensitive to parameter π when fitting haplotypes compared to fitting markers. Use of haplotypes based on genealogy can slightly increase the accuracy of genomic prediction. Improved methods to cluster the haplotypes constructed from local genealogy could lead to additional gains in accuracy.
Tin phosphide-based anodes for sodium-ion batteries: synthesis via solvothermal transformation of Sn metal and phase-dependent Na storage performance

PubMed Central

Shin, Hyun-Seop; Jung, Kyu-Nam; Jo, Yong Nam; Park, Min-Sik; Kim, Hansung; Lee, Jong-Won

2016-01-01

There is a great deal of current interest in the development of rechargeable sodium (Na)-ion batteries (SIBs) for low-cost, large-scale stationary energy storage systems. For the commercial success of this technology, significant progress should be made in developing robust anode (negative electrode) materials with high capacity and long cycle life. Sn-P compounds are considered promising anode materials that have considerable potential to meet the required performance of SIBs, and they have been typically prepared by high-energy mechanical milling. Here, we report Sn-P-based anodes synthesised through solvothermal transformation of Sn metal and their electrochemical Na storage properties. The temperature and time period used for solvothermal treatment play a crucial role in determining the phase, microstructure, and composition of the Sn-P compound and thus its electrochemical performance. The Sn-P compound prepared under an optimised solvothermal condition shows excellent electrochemical performance as an SIB anode, as evidenced by a high reversible capacity of ~560 mAh g−1 at a current density of 100 mA g−1 and cycling stability for 100 cycles. The solvothermal route provides an effective approach to synthesising Sn-P anodes with controlled phases and compositions, thus tailoring their Na storage behaviour. PMID:27189834
Construction of Ultradense Linkage Maps with Lep-MAP2: Stickleback F2 Recombinant Crosses as an Example

PubMed Central

Rastas, Pasi; Calboli, Federico C. F.; Guo, Baocheng; Shikano, Takahito; Merilä, Juha

2016-01-01

High-density linkage maps are important tools for genome biology and evolutionary genetics by quantifying the extent of recombination, linkage disequilibrium, and chromosomal rearrangements across chromosomes, sexes, and populations. They provide one of the best ways to validate and refine de novo genome assemblies, with the power to identify errors in assemblies increasing with marker density. However, assembly of high-density linkage maps is still challenging due to software limitations. We describe Lep-MAP2, a software for ultradense genome-wide linkage map construction. Lep-MAP2 can handle various family structures and can account for achiasmatic meiosis to gain linkage map accuracy. Simulations show that Lep-MAP2 outperforms other available mapping software both in computational efficiency and accuracy. When applied to two large F2-generation recombinant crosses between two nine-spined stickleback (Pungitius pungitius) populations, it produced two high-density (∼6 markers/cM) linkage maps containing 18,691 and 20,054 single nucleotide polymorphisms. The two maps showed a high degree of synteny, but female maps were 1.5–2 times longer than male maps in all linkage groups, suggesting genome-wide recombination suppression in males. Comparison with the genome sequence of the three-spined stickleback (Gasterosteus aculeatus) revealed a high degree of interspecific synteny with a low frequency (<5%) of interchromosomal rearrangements. However, a fairly large (ca. 10 Mb) translocation from autosome to sex chromosome was detected in both maps. These results illustrate the utility and novel features of Lep-MAP2 in assembling high-density linkage maps, and their usefulness in revealing evolutionarily interesting properties of genomes, such as strong genome-wide sex bias in recombination rates. PMID:26668116
Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis

DTIC Science & Technology

2011-09-01

Almasy, L, Blangero, J. (2009) Human QTL linkage mapping. Genetica 136:333-340. Amos, CI. (2007) Successful design and conduct of genome-wide...quantitative trait loci. Genetica 136:237-243. Skol AD, Scott LJ, Abecasis GR, Boehnke M. (2006) Joint analysis is more efficient than replication
Association Analyses of RANKL/RANK/OPG Gene Polymorphisms with Femoral Neck Compression Strength Index Variation in Caucasians

PubMed Central

Dong, Shan-Shan; Liu, Xiao-Gang; Chen, Yuan; Guo, Yan; Wang, Liang; Zhao, Jian; Xiong, Dong-Hai; Xu, Xiang-Hong; Recker, Robert R.

2010-01-01

Femoral neck compression strength index (fCSI), a novel phenotypic parameter that integrates bone density, bone size, and body size, has significant potential to improve hip fracture risk assessment. The genetic factors underlying variations in fCSI, however, remain largely unknown. Given the important roles of the receptor activator of the nuclear factor-κB ligand/receptor activator of the nuclear factor-κB/osteoprotegerin (RANKL/RANK/OPG) pathway in the regulation of bone remodeling, we tested the associations between RANKL/RANK/OPG polymorphisms and variations in fCSI as well as its components (femoral neck bone mineral density [fBMD], femoral neck width [FNW], and weight). This was accomplished with a sample comprising 1873 subjects from 405 Caucasian nuclear families. Of the 37 total SNPs studied in these three genes, 3 SNPs, namely, rs12585014, rs7988338, and rs2148073, of RANKL were significantly associated with fCSI (P = 0.0007, 0.0007, and 0.0005, respectively) after conservative Bonferroni correction. Moreover, the three SNPs were approximately in complete linkage disequilibrium. Haplotype-based association tests corroborated the single-SNP results since haplotype 1 of block 1 of the RANKL gene achieved an even more significant association with fCSI (P = 0.0003) than any of the individual SNPs. However, we did not detect any significant associations of these genes with fBMD, FNW, or weight. In summary, our findings suggest that the RANKL gene may play an important role in variation in fCSI, independent of fBMD and non-fBMD components. PMID:19458885
Association analyses of RANKL/RANK/OPG gene polymorphisms with femoral neck compression strength index variation in Caucasians.

PubMed

Dong, Shan-Shan; Liu, Xiao-Gang; Chen, Yuan; Guo, Yan; Wang, Liang; Zhao, Jian; Xiong, Dong-Hai; Xu, Xiang-Hong; Recker, Robert R; Deng, Hong-Wen

2009-08-01

Femoral neck compression strength index (fCSI), a novel phenotypic parameter that integrates bone density, bone size, and body size, has significant potential to improve hip fracture risk assessment. The genetic factors underlying variations in fCSI, however, remain largely unknown. Given the important roles of the receptor activator of the nuclear factor-kappaB ligand/receptor activator of the nuclear factor-kappaB/osteoprotegerin (RANKL/RANK/OPG) pathway in the regulation of bone remodeling, we tested the associations between RANKL/RANK/OPG polymorphisms and variations in fCSI as well as its components (femoral neck bone mineral density [fBMD], femoral neck width [FNW], and weight). This was accomplished with a sample comprising 1873 subjects from 405 Caucasian nuclear families. Of the 37 total SNPs studied in these three genes, 3 SNPs, namely, rs12585014, rs7988338, and rs2148073, of RANKL were significantly associated with fCSI (P = 0.0007, 0.0007, and 0.0005, respectively) after conservative Bonferroni correction. Moreover, the three SNPs were approximately in complete linkage disequilibrium. Haplotype-based association tests corroborated the single-SNP results since haplotype 1 of block 1 of the RANKL gene achieved an even more significant association with fCSI (P = 0.0003) than any of the individual SNPs. However, we did not detect any significant associations of these genes with fBMD, FNW, or weight. In summary, our findings suggest that the RANKL gene may play an important role in variation in fCSI, independent of fBMD and non-fBMD components.
Maximization of Markers Linked in Coupling for Tetraploid Potatoes via Monoparental Haploids

PubMed Central

Bartkiewicz, Annette M.; Chilla, Friederike; Terefe-Ayana, Diro; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Linde, Marcus; Debener, Thomas

2018-01-01

Haploid potato populations derived from a single tetraploid donor constitute an efficient strategy to analyze markers segregating from a single donor genotype. Analysis of marker segregation in populations derived from crosses between polysomic tetraploids is complicated by a maximum of eight segregating alleles, multiple dosages of the markers and problems related to linkage analysis of marker segregation in repulsion. Here, we present data on two monoparental haploid populations generated by prickle pollination of two tetraploid cultivars with Solanum phureja and genotyped with the 12.8 k SolCAP single nucleotide polymorphism (SNP) array. We show that in a population of monoparental haploids, the number of biallelic SNP markers segregating in linkage to loci from the tetraploid donor genotype is much larger than in putative crosses of this genotype to a diverse selection of 125 tetraploid cultivars. Although this strategy is more laborious than conventional breeding, the generation of haploid progeny for efficient marker analysis is straightforward if morphological markers and flow cytometry are utilized to select true haploid progeny. The level of introgressed fragments from S. phureja, the haploid inducer, is very low, supporting its suitability for genetic analysis. Mapping with single-dose markers allowed the analysis of quantitative trait loci (QTL) for four phenotypic traits. PMID:29868076
The versican gene and the risk of intracranial aneurysms.

PubMed

Ruigrok, Ynte M; Rinkel, Gabriël J E; Wijmenga, Cisca

2006-09-01

The proteoglycan versican is an excellent candidate gene for intracranial aneurysms (IAs) because it plays an important role in extracellular matrix assembly and is localized in a previously implicated locus for IAs on chromosome 5q. We analyzed all the common variations using 16-tag single nucleotide polymorphisms (SNPs) and haplotypes in the versican gene using a 2-stage genotyping approach. For stage 1, 16 SNPs were genotyped in 307 cases and 639 controls. For stage 2, the two SNPs yielding the most significant associations (P<0.01) were genotyped in a second independent cohort of 310 cases for confirmation of the associations. In stage 1, we found several SNPs in strong linkage disequilibrium and haplotypes constituting these SNPs associated with IAs in the Dutch population (strongest SNP association for rs173686 with odds ratio=1.34, 95% CI=1.09 to 1.65, P=0.004). In stage 2, we confirmed association for the 2 SNPs with the most significant associations (strongest SNP association for rs173686 with odds ratio=1.36, 95% CI=1.11 to 1.67, P=0.003). SNPs in strong linkage disequilibrium and haplotypes constituting these SNPs in the versican gene are associated with IAs suggesting that variation in or near the versican gene plays a role in susceptibility to IAs.
Genotyping by Sequencing for SNP-Based Linkage Analysis and Identification of QTLs Linked to Fruit Quality Traits in Japanese Plum (Prunus salicina Lindl.).

PubMed

Salazar, Juan A; Pacheco, Igor; Shinya, Paulina; Zapata, Patricio; Silva, Claudia; Aradhya, Mallikarjuna; Velasco, Dianne; Ruiz, David; Martínez-Gómez, Pedro; Infante, Rodrigo

2017-01-01

Marker-assisted selection (MAS) in stone fruit ( Prunus species) breeding is currently difficult to achieve due to the polygenic nature of the most relevant agronomic traits linked to fruit quality. Genotyping by sequencing (GBS), however, provides a large quantity of useful data suitable for fine mapping using Single Nucleotide Polymorphisms (SNPs) from a reference genome. In this study, GBS was used to genotype 272 seedlings of three F1 Japanese plum ( Prunus salicina Lindl) progenies derived from crossing "98-99" (as a common female parent) with "Angeleno," "September King," and "September Queen" as male parents. Raw sequences were aligned to the Peach genome v1, and 42,909 filtered SNPs were obtained after sequence alignment. In addition, 153 seedlings from the "98-99" × "Angeleno" cross were used to develop a genetic map for each parent. A total of 981 SNPs were mapped (479 for "98-99" and 502 for "Angeleno"), covering a genetic distance of 688.8 and 647.03 cM, respectively. Fifty five seedlings from this progeny were phenotyped for different fruit quality traits including ripening time, fruit weight, fruit shape, chlorophyll index, skin color, flesh color, over color, firmness, and soluble solids content in the years 2015 and 2016. Linkage-based QTL analysis allowed the identification of genomic regions significantly associated with ripening time (LG4 of both parents and both phenotyping years), fruit skin color (LG3 and LG4 of both parents and both years), chlorophyll degradation index (LG3 of both parents in 2015) and fruit weight (LG7 of both parents in 2016). These results represent a promising situation for GBS in the identification of SNP variants associated to fruit quality traits, potentially applicable in breeding programs through MAS, in a highly heterozygous crop species such as Japanese plum.
Next-generation transcriptome sequencing, SNP discovery and validation in four market classes of peanut, Arachis hypogaea L.

PubMed

Chopra, Ratan; Burow, Gloria; Farmer, Andrew; Mudge, Joann; Simpson, Charles E; Wilkins, Thea A; Baring, Michael R; Puppala, Naveen; Chamberlin, Kelly D; Burow, Mark D

2015-06-01

Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker-trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. CopyDNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, compromising 43,108 contigs; 263,840 SNP and indel variants were identified among four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.
A method for detecting IBD regions simultaneously in multiple individuals—with applications to disease genetics

PubMed Central

Moltke, Ida; Albrechtsen, Anders; Hansen, Thomas v.O.; Nielsen, Finn C.; Nielsen, Rasmus

2011-01-01

All individuals in a finite population are related if traced back long enough and will, therefore, share regions of their genomes identical by descent (IBD). Detection of such regions has several important applications—from answering questions about human evolution to locating regions in the human genome containing disease-causing variants. However, IBD regions can be difficult to detect, especially in the common case where no pedigree information is available. In particular, all existing non-pedigree based methods can only infer IBD sharing between two individuals. Here, we present a new Markov Chain Monte Carlo method for detection of IBD regions, which does not rely on any pedigree information. It is based on a probabilistic model applicable to unphased SNP data. It can take inbreeding, allele frequencies, genotyping errors, and genomic distances into account. And most importantly, it can simultaneously infer IBD sharing among multiple individuals. Through simulations, we show that the simultaneous modeling of multiple individuals makes the method more powerful and accurate than several other non-pedigree based methods. We illustrate the potential of the method by applying it to data from individuals with breast and/or ovarian cancer, and show that a known disease-causing mutation can be mapped to a 2.2-Mb region using SNP data from only five seemingly unrelated affected individuals. This would not be possible using classical linkage mapping or association mapping. PMID:21493780
Construction of a High-Density Genetic Map from RNA-Seq Data for an Arabidopsis Bay-0 × Shahdara RIL Population

PubMed Central

Serin, Elise A. R.; Snoek, L. B.; Nijveen, Harm; Willems, Leo A. J.; Jiménez-Gómez, Jose M.; Hilhorst, Henk W. M.; Ligterink, Wilco

2017-01-01

High-density genetic maps are essential for high resolution mapping of quantitative traits. Here, we present a new genetic map for an Arabidopsis Bayreuth × Shahdara recombinant inbred line (RIL) population, built on RNA-seq data. RNA-seq analysis on 160 RILs of this population identified 30,049 single-nucleotide polymorphisms (SNPs) covering the whole genome. Based on a 100-kbp window SNP binning method, 1059 bin-markers were identified, physically anchored on the genome. The total length of the RNA-seq genetic map spans 471.70 centimorgans (cM) with an average marker distance of 0.45 cM and a maximum marker distance of 4.81 cM. This high resolution genotyping revealed new recombination breakpoints in the population. To highlight the advantages of such high-density map, we compared it to two publicly available genetic maps for the same population, comprising 69 PCR-based markers and 497 gene expression markers derived from microarray data, respectively. In this study, we show that SNP markers can effectively be derived from RNA-seq data. The new RNA-seq map closes many existing gaps in marker coverage, saturating the previously available genetic maps. Quantitative trait locus (QTL) analysis for published phenotypes using the available genetic maps showed increased QTL mapping resolution and reduced QTL confidence interval using the RNA-seq map. The new high-density map is a valuable resource that facilitates the identification of candidate genes and map-based cloning approaches. PMID:29259624
Genome-wide linkage scan for maximum and length-dependent knee muscle strength in young men: significant evidence for linkage at chromosome 14q24.3

PubMed Central

De Mars, G; Windelinckx, A; Huygens, W; Peeters, M W; Beunen, G P; Aerssens, J; Vlietinck, R; Thomis, M A I

2008-01-01

Background: Maintenance of high muscular fitness is positively related to bone health, functionality in daily life and increasing insulin sensitivity, and negatively related to falls and fractures, morbidity and mortality. Heritability of muscle strength phenotypes ranges between 31% and 95%, but little is known about the identity of the genes underlying this complex trait. As a first attempt, this genome-wide linkage study aimed to identify chromosomal regions linked to muscle and bone cross-sectional area, isometric knee flexion and extension torque, and torque–length relationship for knee flexors and extensors. Methods: In total, 283 informative male siblings (17–36 years old), belonging to 105 families, were used to conduct a genome-wide SNP-based multipoint linkage analysis. Results: The strongest evidence for linkage was found for the torque–length relationship of the knee flexors at 14q24.3 (LOD = 4.09; p<10−5). Suggestive evidence for linkage was found at 14q32.2 (LOD = 3.00; P = 0.005) for muscle and bone cross-sectional area, at 2p24.2 (LOD = 2.57; p = 0.01) for isometric knee torque at 30° flexion, at 1q21.3, 2p23.3 and 18q11.2 (LOD = 2.33, 2.69 and 2.21; p<10−4 for all) for the torque–length relationship of the knee extensors and at 18p11.31 (LOD = 2.39; p = 0.0004) for muscle-mass adjusted isometric knee extension torque. Conclusions: We conclude that many small contributing genes rather than a few important genes are involved in causing variation in different underlying phenotypes of muscle strength. Furthermore, some overlap in promising genomic regions were identified among different strength phenotypes. PMID:18178634
A gene-based SNP resource and linkage map for the copepod Tigriopus californicus

PubMed Central

2011-01-01

Background As yet, few genomic resources have been developed in crustaceans. This lack is particularly evident in Copepoda, given the extraordinary numerical abundance, and taxonomic and ecological diversity of this group. Tigriopus californicus is ideally suited to serve as a genetic model copepod and has been the subject of extensive work in environmental stress and reproductive isolation. Accordingly, we set out to develop a broadly-useful panel of genetic markers and to construct a linkage map dense enough for quantitative trait locus detection in an interval mapping framework for T. californicus--a first for copepods. Results One hundred and ninety Single Nucleotide Polymorphisms (SNPs) were used to genotype our mapping population of 250 F2 larvae. We were able to construct a linkage map with an average intermarker distance of 1.8 cM, and a maximum intermarker distance of 10.3 cM. All markers were assembled into linkage groups, and the 12 linkage groups corresponded to the 12 known chromosomes of T. californicus. We estimate a total genome size of 401.0 cM, and a total coverage of 73.7%. Seventy five percent of the mapped markers were detected in 9 additional populations of T. californicus. Of available model arthropod genomes, we were able to show more colocalized pairs of homologues between T. californicus and the honeybee Apis mellifera, than expected by chance, suggesting preserved macrosynteny between Hymenoptera and Copepoda. Conclusions Our study provides an abundance of linked markers spanning all chromosomes. Many of these markers are also found in multiple populations of T. californicus, and in two other species in the genus. The genomic resource we have developed will enable mapping throughout the geographical range of this species and in closely related species. This linkage map will facilitate genome sequencing, mapping and assembly in an ecologically and taxonomically interesting group for which genomic resources are currently under development. PMID:22103327
Dissecting tocopherols content in maize (Zea mays L.), using two segregating populations and high-density single nucleotide polymorphism markers

PubMed Central

2012-01-01

Background Tocopherols, which are vitamin E compounds, play an important role in maintaining human health. Compared with other staple foods, maize grains contain high level of tocopherols. Results Two F2 populations (K22/CI7 and K22/Dan340, referred to as POP-1 and POP-2, respectively), which share a common parent (K22), were developed and genotyped using a GoldenGate assay containing 1,536 single nucleotide polymorphism (SNP) markers. An integrated genetic linkage map was constructed using 619 SNP markers, spanning a total of 1649.03 cM of the maize genome with an average interval of 2.67 cM. Seventeen quantitative trait loci (QTLs) for all the traits were detected in the first map and 13 in the second. In these two maps, QTLs for different traits were localized to the same genomic regions and some were co-located with candidate genes in the tocopherol biosynthesis pathway. Single QTL was responsible for 3.03% to 52.75% of the phenotypic variation and the QTLs in sum explained23.4% to 66.52% of the total phenotypic variation. A major QTL (qc5-1/qd5-1) affecting α-tocopherol (αT) was identified on chromosome 5 between the PZA03161.1 and PZA02068.1 in the POP-2. The QTL region was narrowed down from 18.7 Mb to 5.4 Mb by estimating the recombination using high-density markers of the QTL region. This allowed the identification of the candidate gene VTE4 which encodes γ-tocopherol methyltransferase, an enzyme that transforms γ-tocopherol (γT)to αT. Conclusions These results demonstrate that a few QTLs with major effects and several QTLs with medium to minor effects might contribute to the natural variation of tocopherols in maize grain. The high-density markers will help to fine map and identify the QTLs with major effects even in the preliminary segregating populations. Furthermore, this study provides a simple guide line for the breeders to improve traits that minimize the risk of malnutrition, especially in developing countries. PMID:23122295
SNP Identification from RNA Sequencing and Linkage Map Construction of Rubber Tree for Anchoring the Draft Genome

PubMed Central

Shearman, Jeremy R.; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke

2015-01-01

Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly. PMID:25831195

SNP identification from RNA sequencing and linkage map construction of rubber tree for anchoring the draft genome.

PubMed

Shearman, Jeremy R; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-Areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke

2015-01-01

Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly.
Genomic association for sexual precocity in beef heifers using pre-selection of genes and haplotype reconstruction

PubMed Central

Barbero, Marina M. D.; Oliveira, Henrique N.; de Camargo, Gregório M. F.; Fernandes Júnior, Gerardo A.; Aspilcueta-Borquis, Rusbel R.; Souza, Fabio R. P.; Boligon, Arione A.; Melo, Thaise P.; Regatieri, Inaê C.; Feitosa, Fabieli L. B.; Fonseca, Larissa F. S.; Magalhães, Ana F. B.; Costa, Raphael B.; Albuquerque, Lucia G.

2018-01-01

Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNP) located in the region of the candidate genes at a distance of up to 5 kb from the boundaries of each gene, were selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on r2 statistics. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A gene (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin genes (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs. PMID:29293544
Genomic association for sexual precocity in beef heifers using pre-selection of genes and haplotype reconstruction.

PubMed

Takada, Luciana; Barbero, Marina M D; Oliveira, Henrique N; de Camargo, Gregório M F; Fernandes Júnior, Gerardo A; Aspilcueta-Borquis, Rusbel R; Souza, Fabio R P; Boligon, Arione A; Melo, Thaise P; Regatieri, Inaê C; Feitosa, Fabieli L B; Fonseca, Larissa F S; Magalhães, Ana F B; Costa, Raphael B; Albuquerque, Lucia G

2018-01-01

Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNP) located in the region of the candidate genes at a distance of up to 5 kb from the boundaries of each gene, were selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on r2 statistics. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A gene (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin genes (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs.
Hormone-Related Pathways and Risk of Breast Cancer Subtypes in African American Women

PubMed Central

Haddad, Stephen A.; Lunetta, Kathryn L.; Ruiz-Narváez, Edward A.; Bensen, Jeannette T.; Hong, Chi-Chen; Sucheston-Campbell, Lara E.; Yao, Song; Bandera, Elisa V.; Rosenberg, Lynn; Haiman, Christopher A.; Troester, Melissa A.; Ambrosone, Christine B.; Palmer, Julie R.

2016-01-01

Purpose We sought to investigate genetic variation in hormone pathways in relation to risk of overall and subtype-specific breast cancer in women of African ancestry (AA). Methods Genotyping and imputation yielded data on 143,934 SNPs in 308 hormone-related genes for 3663 breast cancer cases (1098 ER-, 1983 ER+, 582 ER unknown) and 4687 controls from the African American Breast Cancer Epidemiology and Risk (AMBER) Consortium. AMBER includes data from four large studies of AA women: the Carolina Breast Cancer Study, the Women's Circle of Health Study, the Black Women's Health Study, and the Multiethnic Cohort Study. Pathway- and gene-based analyses were conducted, and single SNP tests were run for the top genes. Results There were no strong associations at the pathway level. The most significantly associated genes were GHRH, CALM2, CETP, and AKR1C1 for overall breast cancer (gene-based nominal p ≤0.01); NR0B1, IGF2R, CALM2, CYP1B1, and GRB2 for ER+ breast cancer (p ≤0.02); and PGR, MAPK3, MAP3K1, and LHCGR for ER- disease (p ≤0.02). Single-SNP tests for SNPs with pairwise linkage disequilibrium r2 <0.8 in the top genes identified 12 common SNPs (in CALM2, CETP, NR0B1, IGF2R, CYP1B1, PGR, MAPK3, and MAP3K1) associated with overall or subtype-specific breast cancer after gene-level correction for multiple testing. Rs11571215 in PGR (progesterone receptor) was the SNP most strongly associated with ER- disease. Conclusion We identified eight genes in hormone pathways that contain common variants associated with breast cancer in AA women after gene-level correction for multiple testing. PMID:26458823
Association of melanocortin-4 receptor gene polymorphisms with obesity-related parameters in Malaysian Malays.

PubMed

Apalasamy, Yamunah Devi; Ming, Moy Foong; Rampal, Sanjay; Bulgiba, Awang; Mohamed, Zahurin

2013-01-01

Melanocortin-4 receptor (MC4R) is an important regulator of body weight and energy intake. Genetic polymorphisms of the MC4R gene have been found to be linked to obesity in many recent studies across the globe. This study aimed to examine the effects of MC4R polymorphisms on obesity parameters, Linkage disequilibrium (LD) pattern and haplotypes in Malaysian Malays. The study subjects were 652 Malaysian Malays. Genomic DNA was extracted from buccal swabs. Genotyping was performed using Sequenom MassARRAY® iPLEX platform. Anthropometric and blood lipid profiles were measured. MC4R rs571312 SNP was associated with logBMI (p = 0.008) and systolic blood pressure (p = 0.005), while MC4R rs2229616 SNP was associated with total cholesterol (TC) levels (p = 0.016). The MC4R rs7227255 SNP did not show any association with obesity parameters. The strength of LD of the MC4R gene region is low and the haplotypes were not associated with obesity in Malaysian Malays.
Genome wide association study (GWAS) for grain yield in rice cultivated under water deficit.

PubMed

Pantalião, Gabriel Feresin; Narciso, Marcelo; Guimarães, Cléber; Castro, Adriano; Colombari, José Manoel; Breseghello, Flavio; Rodrigues, Luana; Vianello, Rosana Pereira; Borba, Tereza Oliveira; Brondani, Claudio

2016-12-01

The identification of rice drought tolerant materials is crucial for the development of best performing cultivars for the upland cultivation system. This study aimed to identify markers and candidate genes associated with drought tolerance by Genome Wide Association Study analysis, in order to develop tools for use in rice breeding programs. This analysis was made with 175 upland rice accessions (Oryza sativa), evaluated in experiments with and without water restriction, and 150,325 SNPs. Thirteen SNP markers associated with yield under drought conditions were identified. Through stepwise regression analysis, eight SNP markers were selected and validated in silico, and when tested by PCR, two out of the eight SNP markers were able to identify a group of rice genotypes with higher productivity under drought. These results are encouraging for deriving markers for the routine analysis of marker assisted selection. From the drought experiment, including the genes inherited in linkage blocks, 50 genes were identified, from which 30 were annotated, and 10 were previously related to drought and/or abiotic stress tolerance, such as the transcription factors WRKY and Apetala2, and protein kinases.
Genomic Heritability of Beef Cattle Growth

USDA-ARS?s Scientific Manuscript database

Calf weights were examined to determine association between high-density SNP genotypes and growth, in order to estimate additive genetic variation explained by SNP. Data taken from Cycle VII of the U.S. Meat Animal Research Center Germplasm Evaluation Project included birth weight (BWT), 205-d adju...
Mapping a major QTL responsible for dwarf architecture in Brassica napus using a single-nucleotide polymorphism marker approach.

PubMed

Wang, Yankun; Chen, Wenjing; Chu, Pu; Wan, Shubei; Yang, Mao; Wang, Mingming; Guan, Rongzhan

2016-08-18

Key genes related to plant type traits have played very important roles in the "green revolution" by increasing lodging resistance and elevating the harvest indices of crop cultivars. Although there have been numerous achievements in the development of dwarfism and plant type in Brassica napus breeding, exploring new materials conferring oilseed rape with efficient plant types that provide higher yields is still of significance in breeding, as well as in elucidating the mechanisms underlying plant development. Here, we report a new dwarf architecture with down-curved leaf mutant (Bndwf/dcl1) isolated from an ethyl methanesulphonate (EMS)-mutagenized B. napus line, together with its inheritance and gene mapping, and pleiotropic effects of the mapped locus on plant-type traits. We constructed a high-density single-nucleotide polymorphism (SNP) map using a backcross population derived from the Bndwf/dcl1 mutant and the canola cultivar 'zhongshuang11' ('ZS11') and mapped the dwarf architecture with the down-curved leaf dominant locus, BnDWF/DCL1, in a 6.58-cM interval between SNP marker bins M46180 and M49962 on the linkage group (LG) C05 of B. napus. Further mapping with other materials derived from Bndwf/dcl1 narrowed the interval harbouring BnDWF/DCL1 to 175 kb in length and this interval contained 16 annotated genes. Quantitative trait locus (QTL) mappings with the backcross population for plant type traits, including plant height, branching height, main raceme length and average branching interval, indicated that the mapped QTLs for plant type traits were located at the same position as the BnDWF/DCL1 locus. This study suggests that the BnDWF/DCL1 locus is a major pleiotropic locus/QTL in B. napus, which may reduce plant height, alter plant type traits and change leaf shape, and thus may lead to compact plant architecture. Accordingly, this locus may have substantial breeding potential for increasing planting density.
ALOX12 polymorphisms are associated with fat mass but not peak bone mineral density in Chinese nuclear families.

PubMed

Xiao, W-J; He, J-W; Zhang, H; Hu, W-W; Gu, J-M; Yue, H; Gao, G; Yu, J-B; Wang, C; Ke, Y-H; Fu, W-Z; Zhang, Z-L

2011-03-01

Arachidonate 12-lipoxygenase (ALOX12) is a member of the lipoxygenase superfamily, which catalyzes the incorporation of molecular oxygen into polyunsaturated fatty acids. The products of ALOX12 reactions serve as endogenous ligands for peroxisome proliferator-activated receptor γ (PPARG). The activation of the PPARG pathway in marrow-derived mesenchymal progenitors stimulates adipogenesis and inhibits osteoblastogenesis. Our objective was to determine whether polymorphisms in the ALOX12 gene were associated with variations in peak bone mineral density (BMD) and obesity phenotypes in young Chinese men. All six tagging single-nucleotide polymorphisms (SNPs) in the ALOX12 gene were genotyped in a total of 1215 subjects from 400 Chinese nuclear families by allele-specific polymerase chain reaction. The BMD at the lumbar spine and hip, total fat mass (TFM) and total lean mass (TLM) were measured using dual-energy X-ray absorptiometry. The pairwise linkage disequilibrium among SNPs was measured, and the haplotype blocks were inferred. Both the individual SNP markers and the haplotypes were tested for an association with the peak BMD, body mass index, TFM, TLM and percentage fat mass (PFM) using the quantitative transmission disequilibrium test (QTDT). Using the QTDT, significant within-family association was found between the rs2073438 polymorphism in the ALOX12 gene and the TFM and PFM (P=0.007 and 0.012, respectively). Haplotype analyses were combined with our individual SNP results and remained significant even after correction for multiple testing. However, we failed to find significant within-family associations between ALOX12 SNPs and the BMD at any bone site in young Chinese men. Our present results suggest that the rs2073438 polymorphism of ALOX12 contributes to the variation of obesity phenotypes in young Chinese men, although we failed to replicate the association with the peak BMD variation in this sample. Further independent studies are needed to confirm our findings.
Loss-of-function DNA sequence variant in the CLCNKA chloride channel implicates the cardio-renal axis in interindividual heart failure risk variation.

PubMed

Cappola, Thomas P; Matkovich, Scot J; Wang, Wei; van Booven, Derek; Li, Mingyao; Wang, Xuexia; Qu, Liming; Sweitzer, Nancy K; Fang, James C; Reilly, Muredach P; Hakonarson, Hakon; Nerbonne, Jeanne M; Dorn, Gerald W

2011-02-08

Common heart failure has a strong undefined heritable component. Two recent independent cardiovascular SNP array studies identified a common SNP at 1p36 in intron 2 of the HSPB7 gene as being associated with heart failure. HSPB7 resequencing identified other risk alleles but no functional gene variants. Here, we further show no effect of the HSPB7 SNP on cardiac HSPB7 mRNA levels or splicing, suggesting that the SNP marks the position of a functional variant in another gene. Accordingly, we used massively parallel platforms to resequence all coding exons of the adjacent CLCNKA gene, which encodes the K(a) renal chloride channel (ClC-K(a)). Of 51 exonic CLCNKA variants identified, one SNP (rs10927887, encoding Arg83Gly) was common, in linkage disequilibrium with the heart failure risk SNP in HSPB7, and associated with heart failure in two independent Caucasian referral populations (n = 2,606 and 1,168; combined P = 2.25 × 10(-6)). Individual genotyping of rs10927887 in the two study populations and a third independent heart failure cohort (combined n = 5,489) revealed an additive allele effect on heart failure risk that is independent of age, sex, and prior hypertension (odds ratio = 1.27 per allele copy; P = 8.3 × 10(-7)). Functional characterization of recombinant wild-type Arg83 and variant Gly83 ClC-K(a) chloride channel currents revealed ≈ 50% loss-of-function of the variant channel. These findings identify a common, functionally significant genetic risk factor for Caucasian heart failure. The variant CLCNKA risk allele, telegraphed by linked variants in the adjacent HSPB7 gene, uncovers a previously overlooked genetic mechanism affecting the cardio-renal axis.
Associations and interactions between SNPs in the alcohol metabolizing genes and alcoholism phenotypes in European Americans.

PubMed

Sherva, Richard; Rice, John P; Neuman, Rosalind J; Rochberg, Nanette; Saccone, Nancy L; Bierut, Laura J

2009-05-01

Alcohol dependence is a major cause of morbidity and mortality worldwide and has a strong familial component. Several linkage and association studies have identified chromosomal regions and/or genes that affect alcohol consumption, notably in genes involved in the 2-stage pathway of alcohol metabolism. Here, we use multiple regression models to test for associations and interactions between 2 alcohol-related phenotypes and SNPs in 17 genes involved in alcohol metabolism in a sample of 1,588 European American subjects. The strongest evidence for association after correcting for multiple testing was between rs1229984, a nonsynonymous coding SNP in ADH1B, and DSM-IV symptom count (p = 0.0003). This SNP was also associated with maximum number of drinks in 24 hours (p = 0.0004). Each minor allele at this SNP predicts 45% fewer DSM-IV symptoms and 18% fewer max drinks. Another SNP in a splice site in ALDH1A1 (rs8187974) showed evidence for association with both phenotypes as well (p = 0.02 and 0.004, respectively), but neither association was significant after accounting for multiple testing. Minor alleles at this SNP predict greater alcohol consumption. In addition, pairwise interactions were observed between SNPs in several genes (p = 0.00002). We replicated the large effect of rs1229984 on alcohol behavior, and although not common (MAF = 4%), this polymorphism may be highly relevant from a public health perspective in European Americans. Another SNP, rs8187974, may also affect alcohol behavior but requires replication. Also, interactions between polymorphisms in genes involved in alcohol metabolism are likely determinants of the parameters that ultimately affect alcohol consumption.
Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library.

PubMed

Sánchez, Cecilia Castaño; Smith, Timothy P L; Wiedmann, Ralph T; Vallejo, Roger L; Salem, Mohamed; Yao, Jianbo; Rexroad, Caird E

2009-11-25

To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population. The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme HaeIII; sequencing produced 2,000,000 reads providing an average 6 fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends). Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183) of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In addition, 2% of the sequences from the validated markers were associated with rainbow trout transcripts. The use of reduced representation libraries and pyrosequencing technology proved to be an effective strategy for the discovery of a high number of putative SNPs in rainbow trout; however, modifications to the technique to decrease the false discovery rate resulting from the evolutionary recent genome duplication would be desirable.
Association of Adenylate Cyclase 10 (ADCY10) Polymorphisms and Bone Mineral Density in Healthy Adults

PubMed Central

Ichikawa, Shoji; Koller, Daniel L.; Curry, Leah R.; Lai, Dongbing; Xuei, Xiaoling; Edenberg, Howard J.; Hui, Siu L.; Peacock, Munro; Foroud, Tatiana; Econs, Michael J.

2010-01-01

Phenotypic variation in bone mineral density (BMD) among healthy adults is influenced by both genetic and environmental factors. Genetic sequence variations in the adenylate cyclase 10 (ADCY10) gene, which is also called soluble adenylate cyclase, have previously been reported to be associated with low spinal BMD in hypercalciuric patients. Since ADCY10 is located in the region linked to spinal BMD in our previous linkage analysis, we tested whether polymorphisms in this gene are also associated with normal BMD variation in healthy adults. Sixteen single nucleotide polymorphisms (SNPs) distributed throughout ADCY10 were genotyped in two healthy groups of American whites: 1,692 premenopausal women and 715 men. Statistical analyses were performed in the two groups to test for association between these SNPs and femoral neck and lumbar spine areal BMD. We observed significant evidence of association (p<0.01) with one SNP each in men and women. Genotypes at these SNPs accounted for less than 1% of hip BMD variation in men, but 1.5% of spinal BMD in women. However, adjacent SNPs did not corroborate the association in either males or females. In conclusion, we found a modest association between an ADCY10 polymorphism and spinal areal BMD in premenopausal white women. PMID:19093065
Genome-Wide Association Mapping of Barley Yellow Dwarf Virus Tolerance in Spring Oat (Avena sativa L.)

PubMed Central

Foresman, Bradley J.; Oliver, Rebekah E.; Jackson, Eric W.; Chao, Shiaoman; Arruda, Marcio P.; Kolb, Frederic L.

2016-01-01

Barley yellow dwarf viruses (BYDVs) are responsible for the disease barley yellow dwarf (BYD) and affect many cereals including oat (Avena sativa L.). Until recently, the molecular marker technology in oat has not allowed for many marker-trait association studies to determine the genetic mechanisms for tolerance. A genome-wide association study (GWAS) was performed on 428 spring oat lines using a recently developed high-density oat single nucleotide polymorphism (SNP) array as well as a SNP-based consensus map. Marker-trait associations were performed using a Q-K mixed model approach to control for population structure and relatedness. Six significant SNP-trait associations representing two QTL were found on chromosomes 3C (Mrg17) and 18D (Mrg04). This is the first report of BYDV tolerance QTL on chromosome 3C (Mrg17) and 18D (Mrg04). Haplotypes using the two QTL were evaluated and distinct classes for tolerance were identified based on the number of favorable alleles. A large number of lines carrying both favorable alleles were observed in the panel. PMID:27175781
Construction of a reference genetic linkage map for carnation (Dianthus caryophyllus L.)

PubMed Central

2013-01-01

Background Genetic linkage maps are important tools for many genetic applications including mapping of quantitative trait loci (QTLs), identifying DNA markers for fingerprinting, and map-based gene cloning. Carnation (Dianthus caryophyllus L.) is an important ornamental flower worldwide. We previously reported a random amplified polymorphic DNA (RAPD)-based genetic linkage map derived from Dianthus capitatus ssp. andrezejowskianus and a simple sequence repeat (SSR)-based genetic linkage map constructed using data from intraspecific F2 populations; however, the number of markers was insufficient, and so the number of linkage groups (LGs) did not coincide with the number of chromosomes (x = 15). Therefore, we aimed to produce a high-density genetic map to improve its usefulness for breeding purposes and genetic research. Results We improved the SSR-based genetic linkage map using SSR markers derived from a genomic library, expression sequence tags, and RNA-seq data. Linkage analysis revealed that 412 SSR loci (including 234 newly developed SSR loci) could be mapped to 17 linkage groups (LGs) covering 969.6 cM. Comparison of five minor LGs covering less than 50 cM with LGs in our previous RAPD-based genetic map suggested that four LGs could be integrated into two LGs by anchoring common SSR loci. Consequently, the number of LGs corresponded to the number of chromosomes (x = 15). We added 192 new SSRs, eight RAPD, and two sequence-tagged site loci to refine the RAPD-based genetic linkage map, which comprised 15 LGs consisting of 348 loci covering 978.3 cM. The two maps had 125 SSR loci in common, and most of the positions of markers were conserved between them. We identified 635 loci in carnation using the two linkage maps. We also mapped QTLs for two traits (bacterial wilt resistance and anthocyanin pigmentation in the flower) and a phenotypic locus for flower-type by analyzing previously reported genotype and phenotype data. Conclusions The improved genetic linkage maps and SSR markers developed in this study will serve as reference genetic linkage maps for members of the genus Dianthus, including carnation, and will be useful for mapping QTLs associated with various traits, and for improving carnation breeding programs. PMID:24160306
Construction of a reference genetic linkage map for carnation (Dianthus caryophyllus L.).

PubMed

Yagi, Masafumi; Yamamoto, Toshiya; Isobe, Sachiko; Hirakawa, Hideki; Tabata, Satoshi; Tanase, Koji; Yamaguchi, Hiroyasu; Onozaki, Takashi

2013-10-26

Genetic linkage maps are important tools for many genetic applications including mapping of quantitative trait loci (QTLs), identifying DNA markers for fingerprinting, and map-based gene cloning. Carnation (Dianthus caryophyllus L.) is an important ornamental flower worldwide. We previously reported a random amplified polymorphic DNA (RAPD)-based genetic linkage map derived from Dianthus capitatus ssp. andrezejowskianus and a simple sequence repeat (SSR)-based genetic linkage map constructed using data from intraspecific F2 populations; however, the number of markers was insufficient, and so the number of linkage groups (LGs) did not coincide with the number of chromosomes (x = 15). Therefore, we aimed to produce a high-density genetic map to improve its usefulness for breeding purposes and genetic research. We improved the SSR-based genetic linkage map using SSR markers derived from a genomic library, expression sequence tags, and RNA-seq data. Linkage analysis revealed that 412 SSR loci (including 234 newly developed SSR loci) could be mapped to 17 linkage groups (LGs) covering 969.6 cM. Comparison of five minor LGs covering less than 50 cM with LGs in our previous RAPD-based genetic map suggested that four LGs could be integrated into two LGs by anchoring common SSR loci. Consequently, the number of LGs corresponded to the number of chromosomes (x = 15). We added 192 new SSRs, eight RAPD, and two sequence-tagged site loci to refine the RAPD-based genetic linkage map, which comprised 15 LGs consisting of 348 loci covering 978.3 cM. The two maps had 125 SSR loci in common, and most of the positions of markers were conserved between them. We identified 635 loci in carnation using the two linkage maps. We also mapped QTLs for two traits (bacterial wilt resistance and anthocyanin pigmentation in the flower) and a phenotypic locus for flower-type by analyzing previously reported genotype and phenotype data. The improved genetic linkage maps and SSR markers developed in this study will serve as reference genetic linkage maps for members of the genus Dianthus, including carnation, and will be useful for mapping QTLs associated with various traits, and for improving carnation breeding programs.
ICSNPathway: identify candidate causal SNPs and pathways from genome-wide association study by one analytical framework.

PubMed

Zhang, Kunlin; Chang, Suhua; Cui, Sijia; Guo, Liyuan; Zhang, Liuyan; Wang, Jing

2011-07-01

Genome-wide association study (GWAS) is widely utilized to identify genes involved in human complex disease or some other trait. One key challenge for GWAS data interpretation is to identify causal SNPs and provide profound evidence on how they affect the trait. Currently, researches are focusing on identification of candidate causal variants from the most significant SNPs of GWAS, while there is lack of support on biological mechanisms as represented by pathways. Although pathway-based analysis (PBA) has been designed to identify disease-related pathways by analyzing the full list of SNPs from GWAS, it does not emphasize on interpreting causal SNPs. To our knowledge, so far there is no web server available to solve the challenge for GWAS data interpretation within one analytical framework. ICSNPathway is developed to identify candidate causal SNPs and their corresponding candidate causal pathways from GWAS by integrating linkage disequilibrium (LD) analysis, functional SNP annotation and PBA. ICSNPathway provides a feasible solution to bridge the gap between GWAS and disease mechanism study by generating hypothesis of SNP → gene → pathway(s). The ICSNPathway server is freely available at http://icsnpathway.psych.ac.cn/.
Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin.

PubMed

Troggio, Michela; Surbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James

2013-01-01

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
Genetic analysis of QTL for eye cross and eye diameter in common carp (Cyprinus carpio L.) using microsatellites and SNPs.

PubMed

Jin, S B; Zhang, X F; Lu, J G; Fu, H T; Jia, Z Y; Sun, X W

2015-04-17

A group of 107 F1 hybrid common carp was used to construct a linkage map using JoinMap 4.0. A total of 4877 microsatellite and single nucleotide polymorphism (SNP) markers isolated from a genomic library (978 microsatellite and 3899 SNP markers) were assigned to construct the genetic map, which comprised 50 linkage groups. The total length of the linkage map for the common carp was 4775.90 cM with an average distance between markers of 0.98 cM. Ten quantitative trait loci (QTL) were associated with eye diameter, corresponding to 10.5-57.2% of the total phenotypic variation. Twenty QTL were related to eye cross, contributing to 10.8-36.9% of the total phenotypic variation. Two QTL for eye diameter and four QTL for eye cross each accounted for more than 20% of the total phenotypic variation and were considered to be major QTL. One growth factor related to eye diameter was observed on LG10 of the common carp genome, and three growth factors related to eye cross were observed on LG10, LG35, and LG44 of the common carp genome. The significant positive relationship of eye cross and eye diameter with other commercial traits suggests that eye diameter and eye cross can be used to assist in indirect selection for many commercial traits, particularly body weight. Thus, the growth factor for eye cross may also contribute to the growth of body weight, implying that aggregate breeding could have multiple effects. These findings provide information for future genetic studies and breeding of common carp.
Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology

PubMed Central

Ramos, Antonio M.; Crooijmans, Richard P. M. A.; Affara, Nabeel A.; Amaral, Andreia J.; Archibald, Alan L.; Beever, Jonathan E.; Bendixen, Christian; Churcher, Carol; Clark, Richard; Dehais, Patrick; Hansen, Mark S.; Hedegaard, Jakob; Hu, Zhi-Liang; Kerstens, Hindrik H.; Law, Andy S.; Megens, Hendrik-Jan; Milan, Denis; Nonneman, Danny J.; Rohrer, Gary A.; Rothschild, Max F.; Smith, Tim P. L.; Schnabel, Robert D.; Van Tassell, Curt P.; Taylor, Jeremy F.; Wiedmann, Ralph T.; Schook, Lawrence B.; Groenen, Martien A. M.

2009-01-01

Background The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illumina's Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pigs. PMID:19654876

Autosomal dominant spastic paraplegia with peripheral neuropathy maps to chr12q23-24.

PubMed

Schüle, R; Bonin, M; Dürr, A; Forlani, S; Sperfeld, A D; Klimpe, S; Mueller, J C; Seibel, A; van de Warrenburg, B P; Bauer, P; Schöls, L

2009-06-02

Hereditary spastic paraplegias (HSP) are genetically exceedingly heterogeneous. To date, 37 genetic loci for HSP have been described (SPG1-41), among them 16 loci for autosomal dominant disease. Notwithstanding, further genetic heterogeneity is to be expected in HSP, as various HSP families do not link to any of the known HSP loci. In this study, we aimed to map the disease locus in a German family segregating autosomal dominant complicated HSP. A genome-wide linkage analysis was performed using the GeneChip Mapping 10Kv2.0 Xba Array containing 10,204 SNP markers. Suggestive loci were further analyzed by mapping of microsatellite markers. One locus on chromosome 12q23-24, termed SPG36, was confirmed by high density microsatellite fine mapping with a significant LOD score of 3.2. SPG36 is flanked by markers D12S318 and D12S79. Linkage to SPG36 was excluded in >20 additional autosomal dominant HSP families. Candidate genes were selected and sequenced. No disease-causing mutations were identified in the coding regions of ATXN2, HSPB8, IFT81, Myo1H, UBE3B, and VPS29. SPG36 is complicated by a sensory and motor neuropathy; it is therefore the eighth autosomal dominant subtype of complicated HSP. We report mapping of a new locus for autosomal dominant hereditary spastic paraplegia (HSP) (SPG36) on chromosome 12q23-24 in a German family with autosomal dominant HSP complicated by peripheral neuropathy.
A 200K SNP chip reveals a novel Pacific salmon louse genotype linked to differential efficacy of emamectin benzoate.

PubMed

Messmer, Amber M; Leong, Jong S; Rondeau, Eric B; Mueller, Anita; Despins, Cody A; Minkley, David R; Kent, Matthew P; Lien, Sigbjørn; Boyce, Brad; Morrison, Diane; Fast, Mark D; Norman, Joseph D; Danzmann, Roy G; Koop, Ben F

2018-04-16

Antiparasitic drugs such as emamectin benzoate (EMB) are relied upon to reduce the parasite load, particularly of the sea louse Lepeophtheirus salmonis, on farmed salmon. The decline in EMB treatment efficacy for this purpose is an important issue for salmon producers around the world, and particularly for those in the Atlantic Ocean where widespread EMB tolerance in sea lice is recognized as a significant problem. Salmon farms in the Northeast Pacific Ocean have not historically experienced the same issues with treatment efficacy, possibly due to the relatively large population of endemic salmonid hosts that serve to both redistribute surviving lice and dilute populations potentially under selection by introducing naïve lice to farms. Frequent migration of lice among farmed and wild hosts should limit the effect of farm-specific selection pressures on changes to the overall allele frequencies of sea lice in the Pacific Ocean. A previous study using microsatellites examined L. salmonis oncorhynchi from 10 Pacific locations from wild and farmed hosts and found no population structure. Recently however, a farm population of sea lice was detected where EMB bioassay exposure tolerance was abnormally elevated. In response, we have developed a Pacific louse draft genome that complements the previously-released Atlantic louse sequence. These genomes were combined with whole-genome re-sequencing data to design a highly sensitive 201,279 marker SNP array applicable for both subspecies (90,827 validated Pacific loci; 153,569 validated Atlantic loci). Notably, kmer spectrum analysis of the re-sequenced samples indicated that Pacific lice exhibit a large within-individual heterozygosity rate (average of 1 in every 72 bases) that is markedly higher than that of Atlantic individuals (1 in every 173 bases). The SNP chip was used to produce a high-density map for Atlantic sea louse linkage group 5 that was previously shown to be associated with EMB tolerance in Atlantic lice. Additionally, 478 Pacific louse samples from farmed and wild hosts obtained between 2005 and 2014 were also genotyped on the array. Clustering analysis allowed us to detect the apparent emergence of an otherwise rare genotype at a high frequency among the lice collected from two farms in 2013 that had reported elevated EMB tolerance. This genotype was not observed in louse samples collected from the same farm in 2010, nor in any lice sampled from other locations prior to 2013. However, this genotype was detected at low frequencies in louse samples from farms in two locations reporting elevated EMB tolerance in 2014. These results suggest that a rare genotype present in Pacific lice may be locally expanded in farms after EMB treatment. Supporting this hypothesis, 437 SNPs associated with this genotype were found to be in a region of linkage group 5 that overlaps the region associated with EMB resistance in Atlantic lice. Finally, five of the top diagnostic SNPs within this region were used to screen lice that had been subjected to an EMB survival assay, revealing a significant association between these SNPs and EMB treatment outcome. To our knowledge this work is the first report to identify a genetic link to altered EMB efficacy in L. salmonis in the Pacific Ocean. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
The use of genomic information increases the accuracy of breeding value predictions for sea louse (Caligus rogercresseyi) resistance in Atlantic salmon (Salmo salar).

PubMed

Correa, Katharina; Bangera, Rama; Figueroa, René; Lhorente, Jean P; Yáñez, José M

2017-01-31

Sea lice infestations caused by Caligus rogercresseyi are a main concern to the salmon farming industry due to associated economic losses. Resistance to this parasite was shown to have low to moderate genetic variation and its genetic architecture was suggested to be polygenic. The aim of this study was to compare accuracies of breeding value predictions obtained with pedigree-based best linear unbiased prediction (P-BLUP) methodology against different genomic prediction approaches: genomic BLUP (G-BLUP), Bayesian Lasso, and Bayes C. To achieve this, 2404 individuals from 118 families were measured for C. rogercresseyi count after a challenge and genotyped using 37 K single nucleotide polymorphisms. Accuracies were assessed using fivefold cross-validation and SNP densities of 0.5, 1, 5, 10, 25 and 37 K. Accuracy of genomic predictions increased with increasing SNP density and was higher than pedigree-based BLUP predictions by up to 22%. Both Bayesian and G-BLUP methods can predict breeding values with higher accuracies than pedigree-based BLUP, however, G-BLUP may be the preferred method because of reduced computation time and ease of implementation. A relatively low marker density (i.e. 10 K) is sufficient for maximal increase in accuracy when using G-BLUP or Bayesian methods for genomic prediction of C. rogercresseyi resistance in Atlantic salmon.
Genes, age, and alcoholism: analysis of GAW14 data.

PubMed

Apprey, Victor; Afful, Joseph; Harrell, Jules P; Taylor, Robert E; Bonney, George E

2005-12-30

A genetic analysis of age of onset of alcoholism was performed on the Collaborative Study on the Genetics of Alcoholism data released for Genetic Analysis Workshop 14. Our study illustrates an application of the log-normal age of onset model in our software Genetic Epidemiology Models (GEMs). The phenotype ALDX1 of alcoholism was studied. The analysis strategy was to first find the markers of the Affymetrix SNP dataset with significant association with age of onset, and then to perform linkage analysis on them. ALDX1 revealed strong evidence of linkage for marker tsc0041591 on chromosome 2 and suggestive linkage for marker tsc0894042 on chromosome 3. The largest separation in mean ages of onset of ALDX1 was 19.76 and 24.41 between male smokers who are carriers of the risk allele of tsc0041591 and the non-carriers, respectively. Hence, male smokers who are carriers of marker tsc0041591 on chromosome 2 have an average onset of ALDX1 almost 5 years earlier than non-carriers.
SNP discovery by high-throughput sequencing in soybean

PubMed Central

2010-01-01

Background With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD), have been widely applied to identify genome-wide sequence variations. However, it is still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology. Results A total of 39,022 putative SNPs were identified by the Illumina/Solexa sequencing system using a reduced representation DNA library of two parental lines of a mapping population. The validation rates of these putative SNPs predicted with low and high stringency were 72% and 85%, respectively. One hundred sixty four SNP markers resulted from the validation of putative SNPs and have been selectively chosen to target a known QTL, thereby increasing the marker density of the targeted region to one marker per 42 K bp. Conclusions We have demonstrated how to quickly identify large numbers of SNPs for fine mapping of QTL regions by applying massively parallel sequencing combined with genome complexity reduction techniques. This SNP discovery approach is more efficient for targeting multiple QTL regions in a same genetic population, which can be applied to other crops. PMID:20701770
Genome-wide SNP identification and QTL mapping for black rot resistance in cabbage.

PubMed

Lee, Jonghoon; Izzah, Nur Kholilatul; Jayakodi, Murukarthick; Perumal, Sampath; Joh, Ho Jun; Lee, Hyeon Ju; Lee, Sang-Choon; Park, Jee Young; Yang, Ki-Woung; Nou, Il-Sup; Seo, Joodeok; Yoo, Jaeheung; Suh, Youngdeok; Ahn, Kyounggu; Lee, Ji Hyun; Choi, Gyung Ja; Yu, Yeisoo; Kim, Heebal; Yang, Tae-Jin

2015-02-03

Black rot is a destructive bacterial disease causing large yield and quality losses in Brassica oleracea. To detect quantitative trait loci (QTL) for black rot resistance, we performed whole-genome resequencing of two cabbage parental lines and genome-wide SNP identification using the recently published B. oleracea genome sequences as reference. Approximately 11.5 Gb of sequencing data was produced from each parental line. Reference genome-guided mapping and SNP calling revealed 674,521 SNPs between the two cabbage lines, with an average of one SNP per 662.5 bp. Among 167 dCAPS markers derived from candidate SNPs, 117 (70.1%) were validated as bona fide SNPs showing polymorphism between the parental lines. We then improved the resolution of a previous genetic map by adding 103 markers including 87 SNP-based dCAPS markers. The new map composed of 368 markers and covers 1467.3 cM with an average interval of 3.88 cM between adjacent markers. We evaluated black rot resistance in the mapping population in three independent inoculation tests using F2:3 progenies and identified one major QTL and three minor QTLs. We report successful utilization of whole-genome resequencing for large-scale SNP identification and development of molecular markers for genetic map construction. In addition, we identified novel QTLs for black rot resistance. The high-density genetic map will promote QTL analysis for other important agricultural traits and marker-assisted breeding of B. oleracea.
High-density linkage mapping revealed suppression of recombination at the sex determination locus in papaya.

PubMed Central

Ma, Hao; Moore, Paul H; Liu, Zhiyong; Kim, Minna S; Yu, Qingyi; Fitch, Maureen M M; Sekioka, Terry; Paterson, Andrew H; Ming, Ray

2004-01-01

A high-density genetic map of papaya (Carica papaya L.) was constructed using 54 F(2) plants derived from cultivars Kapoho and SunUp with 1501 markers, including 1498 amplified fragment length polymorphism (AFLP) markers, the papaya ringspot virus coat protein marker, morphological sex type, and fruit flesh color. These markers were mapped into 12 linkage groups at a LOD score of 5.0 and recombination frequency of 0.25. The 12 major linkage groups covered a total length of 3294.2 cM, with an average distance of 2.2 cM between adjacent markers. This map revealed severe suppression of recombination around the sex determination locus with a total of 225 markers cosegregating with sex types. The cytosine bases were highly methylated in this region on the basis of the distribution of methylation-sensitive and -insensitive markers. This high-density genetic map is essential for cloning of specific genes of interest such as the sex determination gene and for the integration of genetic and physical maps of papaya. PMID:15020433
Using SNP genetic markers to elucidate the linkage of the Co-34/Phg-3 anthracnose and angular leaf spot resistance gene cluster with the Ur-14 resistance gene

USDA-ARS?s Scientific Manuscript database

The Ouro Negro common bean cultivar contains the Co-34/Phg-3 gene cluster that confers resistance to the anthracnose (ANT) and angular leaf spot (ALS) pathogens. These genes are tightly linked on chromosome 4. Ouro Negro also has the Ur-14 rust resistance gene, reportedly in the vicinity of Co- 34; ...
IL6R Variation Asp358Ala Is a Potential Modifier of Lung Function in Asthma

PubMed Central

Hawkins, Gregory A; Robinson, Mac B; Hastie, Annette T; Li, Xingnan; Li, Huashi; Moore, Wendy C; Howard, Timothy D; Busse, William W.; Erzurum, Serpil C.; Wenzel, Sally E.; Peters, Stephen P; Meyers, Deborah A; Bleecker, Eugene R

2012-01-01

Background The IL6R SNP rs4129267 has recently been identified as an asthma susceptibility locus in subjects of European ancestry but has not been characterized with respect to asthma severity. The SNP rs4129267 is in linkage disequilibrium (r2=1) with the IL6R coding SNP rs2228145 (Asp358Ala). This IL6R coding change increases IL6 receptor shedding and promotes IL6 transsignaling. Objectives To evaluate the IL6R SNP rs2228145 with respect to asthma severity phenotypes. Methods The IL6R SNP rs2228145 was evaluated in subjects of European ancestry with asthma from the Severe Asthma Research Program (SARP). Lung function associations were replicated in the Collaborative Study on the Genetics of Asthma (CSGA) cohort. Serum soluble IL6 receptor (sIL6R) levels were measured in subjects from SARP. Immunohistochemistry was used to qualitatively evaluate IL6R protein expression in BAL cells and endobronchial biopsies. Results The minor C allele of IL6R SNP rs2228145 was associated with lower ppFEV1 in the SARP cohort (p=0.005), the CSGA cohort (0.008), and in combined cohort analysis (p=0.003). Additional associations with ppFVC, FEV1/FVC, and PC20 were observed. The rs2228145 C allele (Ala358) was more frequent in severe asthma phenotypic clusters. Elevated serum sIL6R was associated with lower ppFEV1 (p=0.02) and lower ppFVC (p=0.008) (N=146). IL6R protein expression was observed in BAL macrophages, airway epithelium, vascular endothelium, and airway smooth muscle. Conclusions The IL6R coding SNP rs2228145 (Asp358Ala) is a potential modifier of lung function in asthma and may identify subjects at risk for more severe asthma. IL6 transsignaling may have a pathogenic role in the lung. PMID:22554704
An Expressed Sequence Tag (EST)-enriched genetic map of turbot (Scophthalmus maximus): a useful framework for comparative genomics across model and farmed teleosts

PubMed Central

2012-01-01

Background The turbot (Scophthalmus maximus) is a relevant species in European aquaculture. The small turbot genome provides a source for genomics strategies to use in order to understand the genetic basis of productive traits, particularly those related to sex, growth and pathogen resistance. Genetic maps represent essential genomic screening tools allowing to localize quantitative trait loci (QTL) and to identify candidate genes through comparative mapping. This information is the backbone to develop marker-assisted selection (MAS) programs in aquaculture. Expressed sequenced tag (EST) resources have largely increased in turbot, thus supplying numerous type I markers suitable for extending the previous linkage map, which was mostly based on anonymous loci. The aim of this study was to construct a higher-resolution turbot genetic map using EST-linked markers, which will turn out to be useful for comparative mapping studies. Results A consensus gene-enriched genetic map of the turbot was constructed using 463 SNP and microsatellite markers in nine reference families. This map contains 438 markers, 180 EST-linked, clustered at 24 linkage groups. Linkage and comparative genomics evidences suggested additional linkage group fusions toward the consolidation of turbot map according to karyotype information. The linkage map showed a total length of 1402.7 cM with low average intermarker distance (3.7 cM; ~2 Mb). A global 1.6:1 female-to-male recombination frequency (RF) ratio was observed, although largely variable among linkage groups and chromosome regions. Comparative sequence analysis revealed large macrosyntenic patterns against model teleost genomes, significant hits decreasing from stickleback (54%) to zebrafish (20%). Comparative mapping supported particular chromosome rearrangements within Acanthopterygii and aided to assign unallocated markers to specific turbot linkage groups. Conclusions The new gene-enriched high-resolution turbot map represents a useful genomic tool for QTL identification, positional cloning strategies, and future genome assembling. This map showed large synteny conservation against model teleost genomes. Comparative genomics and data mining from landmarks will provide straightforward access to candidate genes, which will be the basis for genetic breeding programs and evolutionary studies in this species. PMID:22747677
CIDR

Science.gov Websites

NIH CIDR Program Studies For whole exome sequencing projects, we pretest all samples using a high -density SNP array (>200,000 markers). For custom targeted sequencing, we pretest all samples using a 96 pretest samples using a 96 SNP GoldenGate assay. This extensive pretesting allows us to unambiguously tie
Genome-wide association study (GWAS) for growth rate and age at sexual maturation in Atlantic salmon (Salmo salar).

PubMed

Gutierrez, Alejandro P; Yáñez, José M; Fukui, Steve; Swift, Bruce; Davidson, William S

2015-01-01

Early sexual maturation is considered a serious drawback for Atlantic salmon aquaculture as it retards growth, increases production times and affects flesh quality. Although both growth and sexual maturation are thought to be complex processes controlled by several genetic and environmental factors, selection for these traits has been continuously accomplished since the beginning of Atlantic salmon selective breeding programs. In this genome-wide association study (GWAS) we used a 6.5K single-nucleotide polymorphism (SNP) array to genotype ∼ 480 individuals from the Cermaq Canada broodstock program and search for SNPs associated with growth and age at sexual maturation. Using a mixed model approach we identified markers showing a significant association with growth, grilsing (early sexual maturation) and late sexual maturation. The most significant associations were found for grilsing, with markers located in Ssa10, Ssa02, Ssa13, Ssa25 and Ssa12, and for late maturation with markers located in Ssa28, Ssa01 and Ssa21. A lower level of association was detected with growth on Ssa13. Candidate genes, which were linked to these genetic markers, were identified and some of them show a direct relationship with developmental processes, especially for those in association with sexual maturation. However, the relatively low power to detect genetic markers associated with growth (days to 5 kg) in this GWAS indicates the need to use a higher density SNP array in order to overcome the low levels of linkage disequilibrium observed in Atlantic salmon before the information can be incorporated into a selective breeding program.
Genome-wide SNP identification, linkage map construction and QTL mapping for seed mineral concentrations and contents in pea (Pisum sativum L.).

PubMed

Ma, Yu; Coyne, Clarice J; Grusak, Michael A; Mazourek, Michael; Cheng, Peng; Main, Dorrie; McGee, Rebecca J

2017-02-13

Marker-assisted breeding is now routinely used in major crops to facilitate more efficient cultivar improvement. This has been significantly enabled by the use of next-generation sequencing technology to identify loci and markers associated with traits of interest. While rich in a range of nutritional components, such as protein, mineral nutrients, carbohydrates and several vitamins, pea (Pisum sativum L.), one of the oldest domesticated crops in the world, remains behind many other crops in the availability of genomic and genetic resources. To further improve mineral nutrient levels in pea seeds requires the development of genome-wide tools. The objectives of this research were to develop these tools by: identifying genome-wide single nucleotide polymorphisms (SNPs) using genotyping by sequencing (GBS); constructing a high-density linkage map and comparative maps with other legumes, and identifying quantitative trait loci (QTL) for levels of boron, calcium, iron, potassium, magnesium, manganese, molybdenum, phosphorous, sulfur, and zinc in the seed, as well as for seed weight. In this study, 1609 high quality SNPs were found to be polymorphic between 'Kiflica' and 'Aragorn', two parents of an F 6 -derived recombinant inbred line (RIL) population. Mapping 1683 markers including 75 previously published markers and 1608 SNPs developed from the present study generated a linkage map of size 1310.1 cM. Comparative mapping with other legumes demonstrated that the highest level of synteny was observed between pea and the genome of Medicago truncatula. QTL analysis of the RIL population across two locations revealed at least one QTL for each of the mineral nutrient traits. In total, 46 seed mineral concentration QTLs, 37 seed mineral content QTLs, and 6 seed weight QTLs were discovered. The QTLs explained from 2.4% to 43.3% of the phenotypic variance. The genome-wide SNPs and the genetic linkage map developed in this study permitted QTL identification for pea seed mineral nutrients that will serve as important resources to enable marker-assisted selection (MAS) for nutritional quality traits in pea breeding programs.
Biopolymer protected silver nanoparticles on the support of carbon nanotube as interface for electrocatalytic applications

NASA Astrophysics Data System (ADS)

Satyanarayana, M.; Kumar, V. Sunil; Gobi, K. Vengatajalabathy

2016-04-01

In this research, silver nanoparticles (SNPs) are prepared on the surface of carbon nanotubes via chitosan, a biopolymer linkage. Here chitosan act as stabilizing agent for nanoparticles and forms a network on the surface of carbon nanotubes. Synthesized silver nanoparticles-MWCNT hybrid composite is characterized by UV-Visible spectroscopy, XRD analysis, and FESEM with EDS to evaluate the structural and chemical properties of the nanocomposite. The electrocatalytic activity of the fabricated SNP-MWCNT hybrid modified glassy carbon electrode has been evaluated by cyclic voltammetry and electrochemical impedance analysis. The silver nanoparticles are of size ˜35 nm and are well distributed on the surface of carbon nanotubes with chitosan linkage. The prepared nanocomposite shows efficient electrocatalytic properties with high active surface area and excellent electron transfer behaviour.
Relevance of genetic relationship in GWAS and genomic prediction.

PubMed

Pereira, Helcio Duarte; Soriano Viana, José Marcelo; Andrade, Andréa Carla Bastos; Fonseca E Silva, Fabyano; Paes, Geísa Pinheiro

2018-02-01

The objective of this study was to analyze the relevance of relationship information on the identification of low heritability quantitative trait loci (QTLs) from a genome-wide association study (GWAS) and on the genomic prediction of complex traits in human, animal and cross-pollinating populations. The simulation-based data sets included 50 samples of 1000 individuals of seven populations derived from a common population with linkage disequilibrium. The populations had non-inbred and inbred progeny structure (50 to 200) with varying number of members (5 to 20). The individuals were genotyped for 10,000 single nucleotide polymorphisms (SNPs) and phenotyped for a quantitative trait controlled by 10 QTLs and 90 minor genes showing dominance. The SNP density was 0.1 cM and the narrow sense heritability was 25%. The QTL heritabilities ranged from 1.1 to 2.9%. We applied mixed model approaches for both GWAS and genomic prediction using pedigree-based and genomic relationship matrices. For GWAS, the observed false discovery rate was kept below the significance level of 5%, the power of detection for the low heritability QTLs ranged from 14 to 50%, and the average bias between significant SNPs and a QTL ranged from less than 0.01 to 0.23 cM. The QTL detection power was consistently higher using genomic relationship matrix. Regardless of population and training set size, genomic prediction provided higher prediction accuracy of complex trait when compared to pedigree-based prediction. The accuracy of genomic prediction when there is relatedness between individuals in the training set and the reference population is much higher than the value for unrelated individuals.
Population genomic structure and linkage disequilibrium analysis of South African goat breeds using genome-wide SNP data.

PubMed

Mdladla, K; Dzomba, E F; Huson, H J; Muchadeyi, F C

2016-08-01

The sustainability of goat farming in marginal areas of southern Africa depends on local breeds that are adapted to specific agro-ecological conditions. Unimproved non-descript goats are the main genetic resources used for the development of commercial meat-type breeds of South Africa. Little is known about genetic diversity and the genetics of adaptation of these indigenous goat populations. This study investigated the genetic diversity, population structure and breed relations, linkage disequilibrium, effective population size and persistence of gametic phase in goat populations of South Africa. Three locally developed meat-type breeds of the Boer (n = 33), Savanna (n = 31), Kalahari Red (n = 40), a feral breed of Tankwa (n = 25) and unimproved non-descript village ecotypes (n = 110) from four goat-producing provinces of the Eastern Cape, KwaZulu-Natal, Limpopo and North West were assessed using the Illumina Goat 50K SNP Bead Chip assay. The proportion of SNPs with minor allele frequencies >0.05 ranged from 84.22% in the Tankwa to 97.58% in the Xhosa ecotype, with a mean of 0.32 ± 0.13 across populations. Principal components analysis, admixture and pairwise FST identified Tankwa as a genetically distinct population and supported clustering of the populations according to their historical origins. Genome-wide FST identified 101 markers potentially under positive selection in the Tankwa. Average linkage disequilibrium was highest in the Tankwa (r(2) = 0.25 ± 0.26) and lowest in the village ecotypes (r(2) range = 0.09 ± 0.12 to 0.11 ± 0.14). We observed an effective population size of <150 for all populations 13 generations ago. The estimated correlations for all breed pairs were lower than 0.80 at marker distances >100 kb with the exception of those in Savanna and Tswana populations. This study highlights the high level of genetic diversity in South African indigenous goats as well as the utility of the genome-wide SNP marker panels in genetic studies of these populations. © 2016 Stichting International Foundation for Animal Genetics.
Genetic dissection of Al tolerance QTLs in the maize genome by high density SNP scan

USDA-ARS?s Scientific Manuscript database

Aluminum (Al) toxicity is an important limitation to food security in the tropical and subtropical regions. High Al saturation in acid soils limits root development and its ability to uptake water and nutrients. In this study, we present a genome scan for Al tolerance loci with over 50,000 GBS-based...
Common genetic variation and novel loci associated with volumetric mammographic density.

PubMed

Brand, Judith S; Humphreys, Keith; Li, Jingmei; Karlsson, Robert; Hall, Per; Czene, Kamila

2018-04-17

Mammographic density (MD) is a strong and heritable intermediate phenotype of breast cancer, but much of its genetic variation remains unexplained. We conducted a genetic association study of volumetric MD in a Swedish mammography screening cohort (n = 9498) to identify novel MD loci. Associations with volumetric MD phenotypes (percent dense volume, absolute dense volume, and absolute nondense volume) were estimated using linear regression adjusting for age, body mass index, menopausal status, and six principal components. We also estimated the proportion of MD variance explained by additive contributions from single-nucleotide polymorphisms (SNP-based heritability [h 2 SNP ]) in 4948 participants of the cohort. In total, three novel MD loci were identified (at P < 5 × 10 - 8 ): one for percent dense volume (HABP2) and two for the absolute dense volume (INHBB, LINC01483). INHBB is an established locus for ER-negative breast cancer, and HABP2 and LINC01483 represent putative new breast cancer susceptibility loci, because both loci were associated with breast cancer in available meta-analysis data including 122,977 breast cancer cases and 105,974 control subjects (P < 0.05). h 2 SNP (SE) estimates for percent dense, absolute dense, and nondense volume were 0.29 (0.07), 0.31 (0.07), and 0.25 (0.07), respectively. Corresponding ratios of h 2 SNP to previously observed narrow-sense h 2 estimates in the same cohort were 0.46, 0.72, and 0.41, respectively. These findings provide new insights into the genetic basis of MD and biological mechanisms linking MD to breast cancer risk. Apart from identifying three novel loci, we demonstrate that at least 25% of the MD variance is explained by common genetic variation with h 2 SNP /h 2 ratios varying between dense and nondense MD components.
High throughput SNP discovery and genotyping in grapevine (Vitis vinifera L.) by combining a re-sequencing approach and SNPlex technology

PubMed Central

Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M

2007-01-01

Background Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, what represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provide a powerful tool for grapevine genetic analysis. PMID:18021442
Association of the polymorphisms 292 C>T and 1304 G>A in the SLC38A4 gene with hyperglycaemia.

PubMed

González-Renteria, Siblie Marbey; Loera-Castañeda, Verónica; Chairez-Hernández, Isaías; Sosa-Macias, Martha; Paniagua-Castro, Norma; Lares-Aseff, Ismael; Rodríguez-Moran, Martha; Guerrero-Romero, Fernando; Galaviz-Hernández, Carlos

2013-01-01

The SLC38A4 gene is related to system 'A' activity, which seems to be related to impaired gluconeogenesis. The objective of this study was to determine whether the 292 C>T and 1304 G>A polymorphisms of SLC38A4 gene are associated with hyperglycaemia in humans. A total of 227 individuals were enrolled in a case-control study, in which hyperglycaemia was defined by plasma glucose levels ≥95 mg/dL. Genotyping was carried out by using real-time polymerase chain reaction. The frequency of mutant alleles of SLC38A4 gene for single-nucleotide polymorphism (SNP) 1304 G>A was 23.6% and 30.2% for SNP 292 C>T. The frequency of allele T for the SNP 292 C>T in the case and control groups did not show significant differences, whereas the frequency of allele A for the SNP 1304 G>A was significantly higher in the case group than in the control group (p = 0.04). In the logistic regression analysis, the SNP 1304 G>A [odds ratio (OR) 1.78; 95%CI 1.04-3.05, p = 0.03] but not SNP 292 C>T (OR 1.41; 95%CI 0.80-2.47, p = 0.23) showed a significant association with hyperglycaemia. After adjusting by body mass index, waist circumference and triglycerides, the SNP 1304 G>A remained significantly associated with hyperglycaemia (OR 2.13; 95%CI 1.18-3.83, p = 0.03). Pair wise linkage disequilibrium showed correlation (D' > 0.82) between 292 C>T and 1304 G>A SNPs. Haplotype association with hyperglycaemia also showed significant association between both homozygous mutant alleles (A/T) and hyperglycaemia (OR 1.68; 95%CI 1.01-2.79, p = 0.048). Our results suggest that mutant allele A for SNP 1304 G>A of SLC38A4 gene is associated with hyperglycaemia. Copyright © 2012 John Wiley & Sons, Ltd.

A sequencing-based linkage map of cucumber

USDA-ARS?s Scientific Manuscript database

Genetic maps are important tools for molecular breeding, gene cloning, and study of meiotic recombination. In cucumber (Cucumis sativus L.), the marker density, resolution and genome coverage of previously developed genetic maps using PCR-based molecular markers are relatively low. In this study we ...
Implication of Genes for the N-Methyl-D-Aspartate (NMDA) Receptor in Substance Addictions.

PubMed

Chen, Jiali; Ma, Yunlong; Fan, Rongli; Yang, Zhongli; Li, Ming D

2018-02-10

Drug dependence is a chronic brain disease with harmful consequences for both individual users and society. Glutamate is a primary excitatory neurotransmitter in the brain, and both in vivo and in vitro experiments have implicated N-methyl-D-aspartate (NMDA) receptor, a glutamate receptor, as an element in various types of addiction. Recent findings from genetics-based approaches such as genome-wide linkage, candidate gene association, genome-wide association (GWA), and next-generation sequencing have demonstrated the significant association of NMDA receptor subunit genes such as GluN3A, GluN2B, and GluN2A with various addiction-related phenotypes. Of these genes, GluN3A has been the most studied, and it has been revealed to play crucial roles in the etiology of addictions. In this communication, we provide an updated view of the genetic effects of NMDA receptor subunit genes and their functions in the etiology of addictions based on the findings from investigation of both common and rare variants as well as SNP-SNP interactions. To better understand the molecular mechanisms underlying addiction-related behaviors and to promote the development of specific medicines for the prevention and treatment of addictions, current efforts aim not only to identify more causal variants in NMDA receptor subunits by using large independent samples but also to reveal the molecular functions of these variants in addictions.
A genome-wide SNP scan accelerates trait-regulatory genomic loci identification in chickpea

PubMed Central

Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

2015-01-01

We identified 44844 high-quality SNPs by sequencing 92 diverse chickpea accessions belonging to a seed and pod trait-specific association panel using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays. A GWAS (genome-wide association study) in an association panel of 211, including the 92 sequenced accessions, identified 22 major genomic loci showing significant association (explaining 23–47% phenotypic variation) with pod and seed number/plant and 100-seed weight. Eighteen trait-regulatory major genomic loci underlying 13 robust QTLs were validated and mapped on an intra-specific genetic linkage map by QTL mapping. A combinatorial approach of GWAS, QTL mapping and gene haplotype-specific LD mapping and transcript profiling uncovered one superior haplotype and favourable natural allelic variants in the upstream regulatory region of a CesA-type cellulose synthase (Ca_Kabuli_CesA3) gene regulating high pod and seed number/plant (explaining 47% phenotypic variation) in chickpea. The up-regulation of this superior gene haplotype correlated with increased transcript expression of Ca_Kabuli_CesA3 gene in the pollen and pod of high pod/seed number accession, resulting in higher cellulose accumulation for normal pollen and pollen tube growth. A rapid combinatorial genome-wide SNP genotyping-based approach has potential to dissect complex quantitative agronomic traits and delineate trait-regulatory genomic loci (candidate genes) for genetic enhancement in crop plants, including chickpea. PMID:26058368
Short communication: Improving the accuracy of genomic prediction of body conformation traits in Chinese Holsteins using markers derived from high-density marker panels.

PubMed

Song, H; Li, L; Ma, P; Zhang, S; Su, G; Lund, M S; Zhang, Q; Ding, X

2018-06-01

This study investigated the efficiency of genomic prediction with adding the markers identified by genome-wide association study (GWAS) using a data set of imputed high-density (HD) markers from 54K markers in Chinese Holsteins. Among 3,056 Chinese Holsteins with imputed HD data, 2,401 individuals born before October 1, 2009, were used for GWAS and a reference population for genomic prediction, and the 220 younger cows were used as a validation population. In total, 1,403, 1,536, and 1,383 significant single nucleotide polymorphisms (SNP; false discovery rate at 0.05) associated with conformation final score, mammary system, and feet and legs were identified, respectively. About 2 to 3% genetic variance of 3 traits was explained by these significant SNP. Only a very small proportion of significant SNP identified by GWAS was included in the 54K marker panel. Three new marker sets (54K+) were herein produced by adding significant SNP obtained by linear mixed model for each trait into the 54K marker panel. Genomic breeding values were predicted using a Bayesian variable selection (BVS) model. The accuracies of genomic breeding value by BVS based on the 54K+ data were 2.0 to 5.2% higher than those based on the 54K data. The imputed HD markers yielded 1.4% higher accuracy on average (BVS) than the 54K data. Both the 54K+ and HD data generated lower bias of genomic prediction, and the 54K+ data yielded the lowest bias in all situations. Our results show that the imputed HD data were not very useful for improving the accuracy of genomic prediction and that adding the significant markers derived from the imputed HD marker panel could improve the accuracy of genomic prediction and decrease the bias of genomic prediction. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Construction of an ultra-high density consensus genetic map, and enhancement of the physical map from genome sequencing in Lupinus angustifolius.

PubMed

Zhou, Gaofeng; Jian, Jianbo; Wang, Penghao; Li, Chengdao; Tao, Ye; Li, Xuan; Renshaw, Daniel; Clements, Jonathan; Sweetingham, Mark; Yang, Huaan

2018-01-01

An ultra-high density genetic map containing 34,574 sequence-defined markers was developed in Lupinus angustifolius. Markers closely linked to nine genes of agronomic traits were identified. A physical map was improved to cover 560.5 Mb genome sequence. Lupin (Lupinus angustifolius L.) is a recently domesticated legume grain crop. In this study, we applied the restriction-site associated DNA sequencing (RADseq) method to genotype an F 9 recombinant inbred line population derived from a wild type × domesticated cultivar (W × D) cross. A high density linkage map was developed based on the W × D population. By integrating sequence-defined DNA markers reported in previous mapping studies, we established an ultra-high density consensus genetic map, which contains 34,574 markers consisting of 3508 loci covering 2399 cM on 20 linkage groups. The largest gap in the entire consensus map was 4.73 cM. The high density W × D map and the consensus map were used to develop an improved physical map, which covered 560.5 Mb of genome sequence data. The ultra-high density consensus linkage map, the improved physical map and the markers linked to genes of breeding interest reported in this study provide a common tool for genome sequence assembly, structural genomics, comparative genomics, functional genomics, QTL mapping, and molecular plant breeding in lupin.
Estimation of genomic breeding values for milk yield in UK dairy goats.

PubMed

Mucha, S; Mrode, R; MacLaren-Lee, I; Coffey, M; Conington, J

2015-11-01

The objective of this study was to estimate genomic breeding values for milk yield in crossbred dairy goats. The research was based on data provided by 2 commercial goat farms in the UK comprising 590,409 milk yield records on 14,453 dairy goats kidding between 1987 and 2013. The population was created by crossing 3 breeds: Alpine, Saanen, and Toggenburg. In each generation the best performing animals were selected for breeding, and as a result, a synthetic breed was created. The pedigree file contained 30,139 individuals, of which 2,799 were founders. The data set contained test-day records of milk yield, lactation number, farm, age at kidding, and year and season of kidding. Data on milk composition was unavailable. In total 1,960 animals were genotyped with the Illumina 50K caprine chip. Two methods for estimation of genomic breeding value were compared-BLUP at the single nucleotide polymorphism level (BLUP-SNP) and single-step BLUP. The highest accuracy of 0.61 was obtained with single-step BLUP, and the lowest (0.36) with BLUP-SNP. Linkage disequilibrium (r(2), the squared correlation of the alleles at 2 loci) at 50 kb (distance between 2 SNP) was 0.18. This is the first attempt to implement genomic selection in UK dairy goats. Results indicate that the single-step method provides the highest accuracy for populations with a small number of genotyped individuals, where the number of genotyped males is low and females are predominant in the reference population. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Polymorphisms in the promoter region of the bovine lactoferrin gene influence milk somatic cell score and milk production traits in Chinese Holstein cows.

PubMed

Mao, Yongjiang; Zhu, Xiaorui; Xing, Shiyu; Zhang, Meirong; Zhang, Huimin; Wang, Xiaolong; Karrow, Niel; Yang, Liguo; Yang, Zhangping

2015-12-01

Lactoferrin is an iron-binding protein found in cow's milk that plays an important role in preventing mastitis caused by intramammary infection. In this study, 20 Chinese Holstein cows were selected randomly for PCR amplification and sequencing of the bovine lactoferrin gene promoter region and used for SNP discovery in the region between nucleotide positions -461 to -132. Three SNPs (-270T>C, -190G>A and -156A>G) were identified in bovine lactoferrin, then Chinese Holstein cows (n=866) were genotyped using Sequenom MassARRAY (Sequenom Inc., San Diego, CA) based on the previous SNP information in this study, and the associations between SNPs or haplotype and milk somatic cell score (SCS) and production traits were analyzed by the least squares method in the GLM procedure of SAS. SNPs -270T>C and -156A>G showed close linkage disequilibrium (r(2)=0.76). The SNP -190G>A showed a significant association with SCS, and individuals with genotype GG had higher SCS than genotypes AG and AA. Associations were found between the SNPs -270T>C and -190G>A with SCS and the milk composition. The software MatInspector revealed that these SNPs were located within several potential transcription factor binding sites, including NF-κB p50, KLF7 and SP1, and may alter gene expression, but further investigation will be required to elucidate the biological and practical relevance of these SNPs. Copyright © 2015 Elsevier Ltd. All rights reserved.
Linkage disequilibrium compared between five populations of domestic sheep.

PubMed

Meadows, Jennifer R S; Chan, Eva K F; Kijas, James W

2008-09-30

The success of genome-wide scans depends on the strength and magnitude of linkage disequilibrium (LD) present within the populations under investigation. High density SNP arrays are currently in development for the sheep genome, however little is known about the behaviour of LD in this livestock species. This study examined the behaviour of LD within five sheep populations using two LD metrics, D' and x2'. Four economically important Australian sheep flocks, three pure breeds (White Faced Suffolk, Poll Dorset, Merino) and a crossbred population (Merino x Border Leicester), along with an inbred Australian Merino museum flock were analysed. Short range LD (0 - 5 cM) was observed in all five populations, however the persistence with increasing distance and magnitude of LD varied considerably between populations. Average LD (x2') for markers spaced up to 20 cM exceeded the non-syntenic average within the White Faced Suffolk, Poll Dorset and Macarthur Merino. LD decayed faster within the Merino and Merino x Border Leicester, with LD below or consistent with observed background levels. Using marker-marker LD as a guide to the behaviour of marker-QTL LD, estimates of minimum marker spacing were made. For a 95% probability of detecting QTL, a microsatellite marker would be required every 0.1 - 2.5 centimorgans, depending on the population used. Sheep populations were selected which were inbred (Macarthur Merino), highly heterogeneous (Merino) or intermediate between these two extremes. This facilitated analysis and comparison of LD (x2') between populations. The strength and magnitude of LD was found to differ markedly between breeds and aligned closely with both observed levels of genetic diversity and expectations based on breed history. This confirmed that breed specific information is likely to be important for genome wide selection and during the design of successful genome scans where tens of thousands of markers will be required.
selectSNP – An R package for selecting SNPs optimal for genetic evaluation

USDA-ARS?s Scientific Manuscript database

There has been a huge increase in the number of SNPs in the public repositories. This has made it a challenge to design low and medium density SNP panels, which requires careful selection of available SNPs considering many criteria, such as map position, allelic frequency, possible biological functi...
Mapping of a major QTL for salt tolerance of mature field-grown maize plants based on SNP markers.

PubMed

Luo, Meijie; Zhao, Yanxin; Zhang, Ruyang; Xing, Jinfeng; Duan, Minxiao; Li, Jingna; Wang, Naishun; Wang, Wenguang; Zhang, Shasha; Chen, Zhihui; Zhang, Huasheng; Shi, Zi; Song, Wei; Zhao, Jiuran

2017-08-15

Salt stress significantly restricts plant growth and production. Maize is an important food and economic crop but is also a salt sensitive crop. Identification of the genetic architecture controlling salt tolerance facilitates breeders to select salt tolerant lines. However, the critical quantitative trait loci (QTLs) responsible for the salt tolerance of field-grown maize plants are still unknown. To map the main genetic factors contributing to salt tolerance in mature maize, a double haploid population (240 individuals) and 1317 single nucleotide polymorphism (SNP) markers were employed to produce a genetic linkage map covering 1462.05 cM. Plant height of mature maize cultivated in the saline field (SPH) and plant height-based salt tolerance index (ratio of plant height between saline and control fields, PHI) were used to evaluate salt tolerance of mature maize plants. A major QTL for SPH was detected on Chromosome 1 with the LOD score of 22.4, which explained 31.2% of the phenotypic variation. In addition, the major QTL conditioning PHI was also mapped at the same position on Chromosome 1, and two candidate genes involving in ion homeostasis were identified within the confidence interval of this QTL. The detection of the major QTL in adult maize plant establishes the basis for the map-based cloning of genes associated with salt tolerance and provides a potential target for marker assisted selection in developing maize varieties with salt tolerance.
A Genetic Variant Ameliorates β-Thalassemia Severity by Epigenetic-Mediated Elevation of Human Fetal Hemoglobin Expression.

PubMed

Chen, Diyu; Zuo, Yangjin; Zhang, Xinhua; Ye, Yuhua; Bao, Xiuqin; Huang, Haiyan; Tepakhan, Wanicha; Wang, Lijuan; Ju, Junyi; Chen, Guangfu; Zheng, Mincui; Liu, Dun; Huang, Shuodan; Zong, Lu; Li, Changgang; Chen, Yajun; Zheng, Chenguang; Shi, Lihong; Zhao, Quan; Wu, Qiang; Fucharoen, Supan; Zhao, Cunyou; Xu, Xiangmin

2017-07-06

A delayed fetal-to-adult hemoglobin (Hb) switch ameliorates the severity of β-thalassemia and sickle cell disease. The molecular mechanism underlying the epigenetic dysregulation of the switch is unclear. To explore the potential cis-variants responsible for the Hb switching, we systematically analyzed an 80-kb region spanning the β-globin cluster using capture-based next-generation sequencing of 1142 Chinese β-thalassemia persons and identified 31 fetal hemoglobin (HbF)-associated haplotypes of the selected 28 tag regulatory single-nucleotide polymorphisms (rSNPs) in seven linkage disequilibrium (LD) blocks. A Ly1 antibody reactive (LYAR)-binding motif disruptive rSNP rs368698783 (G/A) from LD block 5 in the proximal promoter of hemoglobin subunit gamma 1 (HBG1) was found to be a significant predictor for β-thalassemia clinical severity by epigenetic-mediated variant-dependent HbF elevation. We found this rSNP accounted for 41.6% of β-hemoglobinopathy individuals as an ameliorating factor in a total of 2,738 individuals from southern China and Thailand. We uncovered that the minor allele of the rSNP triggers the attenuation of LYAR and two repressive epigenetic regulators DNA methyltransferase 3 alpha (DNMT3A) and protein arginine methyltransferase 5 (PRMT5) from the HBG promoters, mediating allele-biased γ-globin elevation by facilitating demethylation of HBG core promoter CpG sites in erythroid progenitor cells from β-thalassemia persons. The present study demonstrates that this common rSNP in the proximal A γ-promoter is a major genetic modifier capable of ameliorating the severity of thalassemia major through the epigenetic-mediated regulation of the delayed fetal-to-adult Hb switch and provides potential targets for the treatment of β-hemoglobinopathy. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
CREB1 is a strong genetic predictor of the variation in exercise heart rate response to regular exercise: the HERITAGE Family Study.

PubMed

Rankinen, Tuomo; Argyropoulos, George; Rice, Treva; Rao, D C; Bouchard, Claude

2010-06-01

A genome-wide linkage scan identified a quantitative trait locus for exercise training-induced changes in submaximal exercise (50 W) heart rate (DeltaHR50) on chromosome 2q33.3-q34 in the HERITAGE Family Study (n=472). To fine-map the region, 1450 tag SNPs were genotyped between 205 and 215 Mb on chromosome 2. The strongest evidence of association with DeltaHR50 was observed with 2 single-nucleotide polymorphisms (SNPs) located in the 5' region of the cAMP-responsive element-binding protein 1 (CREB1) gene (rs2253206: P=1.6x10(-5) and rs2360969: P=4.3x10(-5)). The associations remained significant (P=0.01 and P=0.023, respectively) after accounting for multiple testing. Regression modeling of the 39 most significant SNPs in the single-SNP analysis identified 9 SNPs that collectively explained 20% of the DeltaHR50 variance. CREB1 SNP rs2253206 had the strongest effect (5.45% of variance), followed by SNPs in the FASTKD2 (3.1%), MAP2 (2.6%), SPAG16 (2.1%), ERBB4 (3 SNPs approximately 1.4% each), IKZF2 (1.4%), and PARD3B (1.0%) loci. In conditional linkage analysis, 6 SNPs from the final regression model (CREB1, FASTKD2, MAP2, ERBB4, IKZF2, and PARD3B) accounted for the original linkage signal: The log of the odds score dropped from 2.10 to 0.41 after adjusting for all 6 SNPs. Functional studies revealed that the common allele of rs2253206 exhibits significantly (P<0.05) lower promoter activity than the minor allele. Our data suggest that functional DNA sequence variation in the CREB1 locus is strongly associated with DeltaHR50 and explains a considerable proportion of the quantitative trait locus variance. However, at least 5 additional SNPs seem to be required to fully account for the original linkage signal.
CREB1 is a strong genetic predictor of the variation in exercise heart rate response to regular exercise: the HERITAGE Family Study

PubMed Central

Rankinen, Tuomo; Argyropoulos, George; Rice, Treva; Rao, D.C.; Bouchard, Claude

2011-01-01

Background A genome-wide linkage scan identified a quantitative trait locus (QTL) for exercise training-induced changes in submaximal exercise (50W) heart rate (ΔHR50) on chromosome 2q33.3-q34 in the HERITAGE Family Study (N=472). Methods and Results To fine map the region, 1,450 tagSNPs were genotyped between 205 and 215 Mb on chromosome 2. The strongest evidence of association with ΔHR50 was observed with two SNPs located in the 5′ region of the cAMP responsive element binding protein 1 (CREB1) gene (rs2253206: p=1.6×10−5 and rs2360969: p=4.3×10−5). The associations remained significant (p=0.01 and p=0.023, respectively) after accounting for multiple testing. Regression modeling of the 39 most significant SNPs in the single-SNP analyses identified nine SNPs that collectively explained 20% of the ΔHR50 variance. CREB1 SNP rs2253206 had the strongest effect (5.45% of variance), followed by SNPs in the FASTKD2 (3.1%), MAP2 (2.6%), SPAG16 (2.1%), ERBB4 (3 SNPs ~1.4% each), IKZF2 (1.4%), and PARD3B (1.0%) loci. In conditional linkage analysis, six SNPs from the final regression model (CREB1, FASTKD2, MAP2, ERBB4, IKZF2, and PARD3B) accounted for the original linkage signal: the LOD score dropped from 2.10 to 0.41 after adjusting for all six SNPs. Functional studies revealed that the common allele of rs2253206 exhibits significantly (p<0.05) lower promoter activity than the minor allele. Conclusions Our data suggest that functional DNA sequence variation in the CREB1 locus is strongly associated with ΔHR50 and explains considerable proportion of the QTL variance. However, at least five additional SNPs seem to be required to fully account for the original linkage signal. PMID:20407090
Comparing strategies for selection of low-density SNPs for imputation-mediated genomic prediction in U. S. Holsteins.

PubMed

He, Jun; Xu, Jiaqi; Wu, Xiao-Lin; Bauck, Stewart; Lee, Jungjae; Morota, Gota; Kachman, Stephen D; Spangler, Matthew L

2018-04-01

SNP chips are commonly used for genotyping animals in genomic selection but strategies for selecting low-density (LD) SNPs for imputation-mediated genomic selection have not been addressed adequately. The main purpose of the present study was to compare the performance of eight LD (6K) SNP panels, each selected by a different strategy exploiting a combination of three major factors: evenly-spaced SNPs, increased minor allele frequencies, and SNP-trait associations either for single traits independently or for all the three traits jointly. The imputation accuracies from 6K to 80K SNP genotypes were between 96.2 and 98.2%. Genomic prediction accuracies obtained using imputed 80K genotypes were between 0.817 and 0.821 for daughter pregnancy rate, between 0.838 and 0.844 for fat yield, and between 0.850 and 0.863 for milk yield. The two SNP panels optimized on the three major factors had the highest genomic prediction accuracy (0.821-0.863), and these accuracies were very close to those obtained using observed 80K genotypes (0.825-0.868). Further exploration of the underlying relationships showed that genomic prediction accuracies did not respond linearly to imputation accuracies, but were significantly affected by genotype (imputation) errors of SNPs in association with the traits to be predicted. SNPs optimal for map coverage and MAF were favorable for obtaining accurate imputation of genotypes whereas trait-associated SNPs improved genomic prediction accuracies. Thus, optimal LD SNP panels were the ones that combined both strengths. The present results have practical implications on the design of LD SNP chips for imputation-enabled genomic prediction.
Uncoupling protein 2 gene polymorphisms are associated with obesity

PubMed Central

2012-01-01

Background Uncoupling protein 2 (UCP2) gene polymorphisms have been reported as genetic risk factors for obesity and type 2 diabetes mellitus (T2DM). We examined the association of commonly observed UCP2 G(−866)A (rs659366) and Ala55Val (C > T) (rs660339) single nucleotide polymorphisms (SNPs) with obesity, high fasting plasma glucose, and serum lipids in a Balinese population. Methods A total of 603 participants (278 urban and 325 rural subjects) were recruited from Bali Island, Indonesia. Fasting plasma glucose (FPG), triglyceride (TG), high density lipoprotein cholesterol (HDL-C), low density lipoprotein cholesterol (LDL-C) and total cholesterol (TC) were measured. Obesity was determined based on WHO classifications for adult Asians. Participants were genotyped for G(−866)A and Ala55Val polymorphisms of the UCP2 gene. Results Obesity prevalence was higher in urban subjects (51%) as compared to rural subjects (23%). The genotype, minor allele (MAF), and heterozygosity frequencies were similar between urban and rural subjects for both SNPs. All genotype frequencies were in Hardy-Weinberg equilibrium. A combined analysis of genotypes and environment revealed that the urban subjects carrying the A/A genotype of the G(−866)A SNP have higher BMI than the rural subjects with the same genotype. Since the two SNPs showed strong linkage disequilibrium (D’ = 0.946, r2 = 0.657), a haplotype analysis was performed. We found that the AT haplotype was associated with high BMI only when the urban environment was taken into account. Conclusions We have demonstrated the importance of environmental settings in studying the influence of the common UCP2 gene polymorphisms in the development of obesity in a Balinese population. PMID:22533685
Heuristic aspect of the lateral root initiation index: A case study of the role of nitric oxide in root branching.

PubMed

Lira-Ruan, Verónica; Mendivil, Selene Napsucialy; Dubrovsky, Joseph G

2013-10-01

Lateral root (LR) initiation (LRI) is a central process in root branching. Based on LR and/or LR primordium densities, it has been shown that nitric oxide (NO) promotes LRI. However, because NO inhibits primary root growth, we hypothesized that NO may have an opposite effect if the analysis is performed on a cellular basis. Using a previously proposed parameter, the LRI index (which measures how many LRI events take place along a root portion equivalent to the length of a single file of 100 cortical cells of average length), we addressed this hypothesis and illustrate here that the LRI index provides a researcher with a tool to uncover hidden but important information about root initiation. • Arabidopsis thaliana roots were treated with an NO donor (sodium nitroprusside [SNP]) and/or an NO scavenger (2-(4-carboxyphenyl)-4,4,5,5-tetramethylimidazoline-1-oxyl-3-oxide [cPTIO]). LRI was analyzed separately in the root portions formed before and during the treatment. In the latter, SNP caused root growth inhibition and an increase in the LR density accompanied by a decrease in LRI index, indicating overall inhibitory outcome of the NO donor on branching. The inhibitory effect of SNP was reversed by cPTIO, showing the NO-specific action of SNP on LRI. • Analysis of the LRI index permits the discovery of otherwise unknown modes of action of a substance on the root system formation. NO has a dual action on root branching, slightly promoting it in the root portion formed before the treatment and strongly inhibiting it in the root portion formed during the treatment.
Linkage and association study of late-onset Alzheimer disease families linked to 9p21.3.

PubMed

Züchner, S; Gilbert, J R; Martin, E R; Leon-Guerrero, C R; Xu, P-T; Browning, C; Bronson, P G; Whitehead, P; Schmechel, D E; Haines, J L; Pericak-Vance, M A

2008-11-01

A chromosomal locus for late-onset Alzheimer disease (LOAD) has previously been mapped to 9p21.3. The most significant results were reported in a sample of autopsy-confirmed families. Linkage to this locus has been independently confirmed in AD families from a consanguineous Israeli-Arab community. In the present study we analyzed an expanded clinical sample of 674 late-onset AD families, independently ascertained by three different consortia. Sample subsets were stratified by site and autopsy-confirmation. Linkage analysis of a dense array of SNPs across the chromosomal locus revealed the most significant results in the 166 autopsy-confirmed families of the NIMH sample. Peak HLOD scores of 4.95 at D9S741 and 2.81 at the nearby SNP rs2772677 were obtained in a dominant model. The linked region included the cyclin-dependent kinase inhibitor 2A gene (CDKN2A), which has been suggested as an AD candidate gene. By re-sequencing all exons in the vicinity of CDKN2A in 48 AD cases, we identified and genotyped four novel SNPs, including a non-synonymous, a synonymous, and two variations located in untranslated RNA sequences. Family-based allelic and genotypic association analysis yielded significant results in CDKN2A (rs11515: PDT p = 0.003, genotype-PDT p = 0.014). We conclude that CDKN2A is a promising new candidate gene potentially contributing to AD susceptibility on chromosome 9p.
Polymorphism at the TRIB1 gene modulates plasma lipid levels: insight from the Spanish familial hypercholesterolemia cohort study

USDA-ARS?s Scientific Manuscript database

rs17321515 SNP has been associated with variation in LDL-C, high density lipoprotein cholesterol and triglycerides concentrations. This effect has never been studied in patients with severe hypercholesterolemia. Therefore, our aims were to assess the association of the rs17321515 (TRIB1) SNP with pl...
Genotyping-by-sequencing targeting of a novel downy mildew resistance gene Pl 20 from wild Helianthus argophyllus for sunflower (Helianthus annuus L.).

PubMed

Ma, G J; Markell, S G; Song, Q J; Qi, L L

2017-07-01

Genotyping-by-sequencing revealed a new downy mildew resistance gene, Pl 20 , from wild Helianthus argophyllus located on linkage group 8 of the sunflower genome and closely linked to SNP markers that facilitate the marker-assisted selection of resistance genes. Downy mildew (DM), caused by Plasmopara halstedii, is one of the most devastating and yield-limiting diseases of sunflower. Downy mildew resistance identified in wild Helianthus argophyllus accession PI 494578 was determined to be effective against the predominant and virulent races of P. halstedii occurring in the United States. The evaluation of 114 BC 1 F 2:3 families derived from the cross between HA 89 and PI 494578 against P. halstedii race 734 revealed that single dominant gene controls downy mildew resistance in the population. Genotyping-by-sequencing analysis conducted in the BC 1 F 2 population indicated that the DM resistance gene derived from wild H. argophyllus PI 494578 is located on the upper end of the linkage group (LG) 8 of the sunflower genome, as was determined single nucleotide polymorphism (SNP) markers associated with DM resistance. Analysis of 11 additional SNP markers previously mapped to this region revealed that the resistance gene, named Pl 20 , co-segregated with four markers, SFW02745, SFW09076, S8_11272025, and S8_11272046, and is flanked by SFW04358 and S8_100385559 at an interval of 1.8 cM. The newly discovered P. halstedii resistance gene has been introgressed from wild species into cultivated sunflower to provide a novel gene with DM resistance. The homozygous resistant individuals were selected from BC 2 F 2 progenies with the use of markers linked to the Pl 20 gene, and these lines should benefit the sunflower community for Helianthus improvement.
Tomato breeding in the genomics era: insights from a SNP array.

PubMed

Víquez-Zamora, Marcela; Vosman, Ben; van de Geest, Henri; Bovy, Arnaud; Visser, Richard G F; Finkers, Richard; van Heusden, Adriaan W

2013-05-27

The major bottle neck in genetic and linkage studies in tomato has been the lack of a sufficient number of molecular markers. This has radically changed with the application of next generation sequencing and high throughput genotyping. A set of 6000 SNPs was identified and 5528 of them were used to evaluate tomato germplasm at the level of species, varieties and segregating populations. From the 5528 SNPs, 1980 originated from 454-sequencing, 3495 from Illumina Solexa sequencing and 53 were additional known markers. Genotyping different tomato samples allowed the evaluation of the level of heterozygosity and introgressions among commercial varieties. Cherry tomatoes were especially different from round/beefs in chromosomes 4, 5 and 12. We were able to identify a set of 750 unique markers distinguishing S. lycopersicum 'Moneymaker' from all its distantly related wild relatives. Clustering and neighbour joining analysis among varieties and species showed expected grouping patterns, with S. pimpinellifolium as the most closely related to commercial tomatoes earlier results. Our results show that a SNP search in only a few breeding lines already provides generally applicable markers in tomato and its wild relatives. It also shows that the Illumina bead array generated data are highly reproducible. Our SNPs can roughly be divided in two categories: SNPs of which both forms are present in the wild relatives and in domesticated tomatoes (originating from common ancestors) and SNPs unique for the domesticated tomato (originating from after the domestication event). The SNPs can be used for genotyping, identification of varieties, comparison of genetic and physical linkage maps and to confirm (phylogenetic) relations. In the SNPs used for the array there is hardly any overlap with the SolCAP array and it is strongly recommended to combine both SNP sets and to select a core collection of robust SNPs completely covering the entire tomato genome.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.