Wang, Minghui; Londo, Jason P.; Acharya, Charlotte B.; Mitchell, Sharon E.; Sun, Qi; Reisch, Bruce; Cadle-Davidson, Lance
2015-01-01
Genotyping by sequencing (GBS) provides opportunities to generate high-resolution genetic maps at a low genotyping cost, but for highly heterozygous species, missing data and heterozygote undercalling complicate the creation of GBS genetic maps. To overcome these issues, we developed a publicly available, modular approach called HetMappS, which functions independently of parental genotypes and corrects for genotyping errors associated with heterozygosity. For linkage group formation, HetMappS includes both a reference-guided synteny pipeline and a reference-independent de novo pipeline. The de novo pipeline can be utilized for under-characterized or high diversity families that lack an appropriate reference. We applied both HetMappS pipelines in five half-sib F1 families involving genetically diverse Vitis spp. Starting with at least 116,466 putative SNPs per family, the HetMappS pipelines identified 10,440 to 17,267 phased pseudo-testcross (Pt) markers and generated high-confidence maps. Pt marker density exceeded crossover resolution in all cases; up to 5,560 non-redundant markers were used to generate parental maps ranging from 1,047 cM to 1,696 cM. The number of markers used was strongly correlated with family size in both de novo and synteny maps (r = 0.92 and 0.91, respectively). Comparisons between allele and tag frequencies suggested that many markers were in tandem repeats and mapped as single loci, while markers in regions of more than two repeats were removed during map curation. Both pipelines generated similar genetic maps, and genetic order was strongly correlated with the reference genome physical order in all cases. Independently created genetic maps from shared parents exhibited nearly identical results. Flower sex was mapped in three families and correctly localized to the known sex locus in all cases. The HetMappS pipeline could have wide application for genetic mapping in highly heterozygous species, and its modularity provides opportunities to adapt portions of the pipeline to other family types, genotyping technologies or applications. PMID:26244767
2012-01-01
Background Most modern citrus cultivars have an interspecific origin. As a foundational step towards deciphering the interspecific genome structures, a reference whole genome sequence was produced by the International Citrus Genome Consortium from a haploid derived from Clementine mandarin. The availability of a saturated genetic map of Clementine was identified as an essential prerequisite to assist the whole genome sequence assembly. Clementine is believed to be a ‘Mediterranean’ mandarin × sweet orange hybrid, and sweet orange likely arose from interspecific hybridizations between mandarin and pummelo gene pools. The primary goals of the present study were to establish a Clementine reference map using codominant markers, and to perform comparative mapping of pummelo, sweet orange, and Clementine. Results Five parental genetic maps were established from three segregating populations, which were genotyped with Single Nucleotide Polymorphism (SNP), Simple Sequence Repeats (SSR) and Insertion-Deletion (Indel) markers. An initial medium density reference map (961 markers for 1084.1 cM) of the Clementine was established by combining male and female Clementine segregation data. This Clementine map was compared with two pummelo maps and a sweet orange map. The linear order of markers was highly conserved in the different species. However, significant differences in map size were observed, which suggests a variation in the recombination rates. Skewed segregations were much higher in the male than female Clementine mapping data. The mapping data confirmed that Clementine arose from hybridization between ‘Mediterranean’ mandarin and sweet orange. The results identified nine recombination break points for the sweet orange gamete that contributed to the Clementine genome. Conclusions A reference genetic map of citrus, used to facilitate the chromosome assembly of the first citrus reference genome sequence, was established. The high conservation of marker order observed at the interspecific level should allow reasonable inferences of most citrus genome sequences by mapping next-generation sequencing (NGS) data in the reference genome sequence. The genome of the haploid Clementine used to establish the citrus reference genome sequence appears to have been inherited primarily from the ‘Mediterranean’ mandarin. The high frequency of skewed allelic segregations in the male Clementine data underline the probable extent of deviation from Mendelian segregation for characters controlled by heterozygous loci in male parents. PMID:23126659
2013-01-01
Background Faba bean (Vicia faba L.) is among the earliest domesticated crops from the Near East. Today this legume is a key protein feed and food worldwide and continues to serve an important role in culinary traditions throughout Middle East, Mediterranean region, China and Ethiopia. Adapted to a wide range of soil types, the main faba bean breeding objectives are to improve yield, resistance to biotic and abiotic stresses, seed quality and other agronomic traits. Genomic approaches aimed at enhancing faba bean breeding programs require high-quality genetic linkage maps to facilitate quantitative trait locus analysis and gene tagging for use in a marker-assisted selection. The objective of this study was to construct a reference consensus map in faba bean by joining the information from the most relevant maps reported so far in this crop. Results A combination of two approaches, increasing the number of anchor loci in diverse mapping populations and joining the corresponding genetic maps, was used to develop a reference consensus map in faba bean. The map was constructed from three main recombinant inbreed populations derived from four parental lines, incorporates 729 markers and is based on 69 common loci. It spans 4,602 cM with a range from 323 to 1041 loci in six main linkage groups or chromosomes, and an average marker density of one locus every 6 cM. Locus order is generally well maintained between the consensus map and the individual maps. Conclusion We have constructed a reliable and fairly dense consensus genetic linkage map that will serve as a basis for genomic approaches in faba bean research and breeding. The core map contains a larger number of markers than any previous individual map, covers existing gaps and achieves a wider coverage of the large faba bean genome as a whole. This tool can be used as a reference resource for studies in different genetic backgrounds, and provides a framework for transferring genetic information when using different marker technologies. Combined with syntenic approaches, the consensus map will increase marker density in selected genomic regions and will be useful for future faba bean molecular breeding applications. PMID:24377374
A reference linkage map for Eucalyptus
2012-01-01
Background Genetic linkage maps are invaluable resources in plant research. They provide a key tool for many genetic applications including: mapping quantitative trait loci (QTL); comparative mapping; identifying unlinked (i.e. independent) DNA markers for fingerprinting, population genetics and phylogenetics; assisting genome sequence assembly; relating physical and recombination distances along the genome and map-based cloning of genes. Eucalypts are the dominant tree species in most Australian ecosystems and of economic importance globally as plantation trees. The genome sequence of E. grandis has recently been released providing unprecedented opportunities for genetic and genomic research in the genus. A robust reference linkage map containing sequence-based molecular markers is needed to capitalise on this resource. Several high density linkage maps have recently been constructed for the main commercial forestry species in the genus (E. grandis, E. urophylla and E. globulus) using sequenced Diversity Arrays Technology (DArT) and microsatellite markers. To provide a single reference linkage map for eucalypts a composite map was produced through the integration of data from seven independent mapping experiments (1950 individuals) using a marker-merging method. Results The composite map totalled 1107 cM and contained 4101 markers; comprising 3880 DArT, 213 microsatellite and eight candidate genes. Eighty-one DArT markers were mapped to two or more linkage groups, resulting in the 4101 markers being mapped to 4191 map positions. Approximately 13% of DArT markers mapped to identical map positions, thus the composite map contained 3634 unique loci at an average interval of 0.31 cM. Conclusion The composite map represents the most saturated linkage map yet produced in Eucalyptus. As the majority of DArT markers contained on the map have been sequenced, the map provides a direct link to the E. grandis genome sequence and will serve as an important reference for progressing eucalypt research. PMID:22702473
Construction of a reference genetic linkage map for carnation (Dianthus caryophyllus L.)
2013-01-01
Background Genetic linkage maps are important tools for many genetic applications including mapping of quantitative trait loci (QTLs), identifying DNA markers for fingerprinting, and map-based gene cloning. Carnation (Dianthus caryophyllus L.) is an important ornamental flower worldwide. We previously reported a random amplified polymorphic DNA (RAPD)-based genetic linkage map derived from Dianthus capitatus ssp. andrezejowskianus and a simple sequence repeat (SSR)-based genetic linkage map constructed using data from intraspecific F2 populations; however, the number of markers was insufficient, and so the number of linkage groups (LGs) did not coincide with the number of chromosomes (x = 15). Therefore, we aimed to produce a high-density genetic map to improve its usefulness for breeding purposes and genetic research. Results We improved the SSR-based genetic linkage map using SSR markers derived from a genomic library, expression sequence tags, and RNA-seq data. Linkage analysis revealed that 412 SSR loci (including 234 newly developed SSR loci) could be mapped to 17 linkage groups (LGs) covering 969.6 cM. Comparison of five minor LGs covering less than 50 cM with LGs in our previous RAPD-based genetic map suggested that four LGs could be integrated into two LGs by anchoring common SSR loci. Consequently, the number of LGs corresponded to the number of chromosomes (x = 15). We added 192 new SSRs, eight RAPD, and two sequence-tagged site loci to refine the RAPD-based genetic linkage map, which comprised 15 LGs consisting of 348 loci covering 978.3 cM. The two maps had 125 SSR loci in common, and most of the positions of markers were conserved between them. We identified 635 loci in carnation using the two linkage maps. We also mapped QTLs for two traits (bacterial wilt resistance and anthocyanin pigmentation in the flower) and a phenotypic locus for flower-type by analyzing previously reported genotype and phenotype data. Conclusions The improved genetic linkage maps and SSR markers developed in this study will serve as reference genetic linkage maps for members of the genus Dianthus, including carnation, and will be useful for mapping QTLs associated with various traits, and for improving carnation breeding programs. PMID:24160306
Construction of a reference genetic linkage map for carnation (Dianthus caryophyllus L.).
Yagi, Masafumi; Yamamoto, Toshiya; Isobe, Sachiko; Hirakawa, Hideki; Tabata, Satoshi; Tanase, Koji; Yamaguchi, Hiroyasu; Onozaki, Takashi
2013-10-26
Genetic linkage maps are important tools for many genetic applications including mapping of quantitative trait loci (QTLs), identifying DNA markers for fingerprinting, and map-based gene cloning. Carnation (Dianthus caryophyllus L.) is an important ornamental flower worldwide. We previously reported a random amplified polymorphic DNA (RAPD)-based genetic linkage map derived from Dianthus capitatus ssp. andrezejowskianus and a simple sequence repeat (SSR)-based genetic linkage map constructed using data from intraspecific F2 populations; however, the number of markers was insufficient, and so the number of linkage groups (LGs) did not coincide with the number of chromosomes (x = 15). Therefore, we aimed to produce a high-density genetic map to improve its usefulness for breeding purposes and genetic research. We improved the SSR-based genetic linkage map using SSR markers derived from a genomic library, expression sequence tags, and RNA-seq data. Linkage analysis revealed that 412 SSR loci (including 234 newly developed SSR loci) could be mapped to 17 linkage groups (LGs) covering 969.6 cM. Comparison of five minor LGs covering less than 50 cM with LGs in our previous RAPD-based genetic map suggested that four LGs could be integrated into two LGs by anchoring common SSR loci. Consequently, the number of LGs corresponded to the number of chromosomes (x = 15). We added 192 new SSRs, eight RAPD, and two sequence-tagged site loci to refine the RAPD-based genetic linkage map, which comprised 15 LGs consisting of 348 loci covering 978.3 cM. The two maps had 125 SSR loci in common, and most of the positions of markers were conserved between them. We identified 635 loci in carnation using the two linkage maps. We also mapped QTLs for two traits (bacterial wilt resistance and anthocyanin pigmentation in the flower) and a phenotypic locus for flower-type by analyzing previously reported genotype and phenotype data. The improved genetic linkage maps and SSR markers developed in this study will serve as reference genetic linkage maps for members of the genus Dianthus, including carnation, and will be useful for mapping QTLs associated with various traits, and for improving carnation breeding programs.
Sharma, Sanjeev Kumar; Bolser, Daniel; de Boer, Jan; Sønderkær, Mads; Amoros, Walter; Carboni, Martin Federico; D’Ambrosio, Juan Martín; de la Cruz, German; Di Genova, Alex; Douches, David S.; Eguiluz, Maria; Guo, Xiao; Guzman, Frank; Hackett, Christine A.; Hamilton, John P.; Li, Guangcun; Li, Ying; Lozano, Roberto; Maass, Alejandro; Marshall, David; Martinez, Diana; McLean, Karen; Mejía, Nilo; Milne, Linda; Munive, Susan; Nagy, Istvan; Ponce, Olga; Ramirez, Manuel; Simon, Reinhard; Thomson, Susan J.; Torres, Yerisf; Waugh, Robbie; Zhang, Zhonghua; Huang, Sanwen; Visser, Richard G. F.; Bachem, Christian W. B.; Sagredo, Boris; Feingold, Sergio E.; Orjeda, Gisella; Veilleux, Richard E.; Bonierbale, Merideth; Jacobs, Jeanne M. E.; Milbourne, Dan; Martin, David Michael Alan; Bryan, Glenn J.
2013-01-01
The genome of potato, a major global food crop, was recently sequenced. The work presented here details the integration of the potato reference genome (DM) with a new sequence-tagged site marker−based linkage map and other physical and genetic maps of potato and the closely related species tomato. Primary anchoring of the DM genome assembly was accomplished by the use of a diploid segregating population, which was genotyped with several types of molecular genetic markers to construct a new ~936 cM linkage map comprising 2469 marker loci. In silico anchoring approaches used genetic and physical maps from the diploid potato genotype RH89-039-16 (RH) and tomato. This combined approach has allowed 951 superscaffolds to be ordered into pseudomolecules corresponding to the 12 potato chromosomes. These pseudomolecules represent 674 Mb (~93%) of the 723 Mb genome assembly and 37,482 (~96%) of the 39,031 predicted genes. The superscaffold order and orientation within the pseudomolecules are closely collinear with independently constructed high density linkage maps. Comparisons between marker distribution and physical location reveal regions of greater and lesser recombination, as well as regions exhibiting significant segregation distortion. The work presented here has led to a greatly improved ordering of the potato reference genome superscaffolds into chromosomal “pseudomolecules”. PMID:24062527
USDA-ARS?s Scientific Manuscript database
A DArT marker platform is developed for the cotton genome to evaluate the use of DArT markers compared to AFLPs in mapping, and transferability across the mapping populations. We used a reference genetic map of tetraploid Gossypium that already contained ~5000 loci which coalesced into 26 chromosom...
USDA-ARS?s Scientific Manuscript database
We will present an ultra-dense genetic linkage map for the octoploid, cultivated strawberry (Fragaria x ananassa) consisting of over 13K Axiom® based SNP markers and 150 previously mapped reference SSR loci. The high quality of the map is demonstrated by the short sizes of each of the 28 linkage gro...
USDA-ARS?s Scientific Manuscript database
Next generation sequencing offers new ways to identify the genetic mechanisms that underlie mutant phenotypes. The release of a reference diploid Gossypium raimondii (D5) genome and bioinformatics tools to sort tetraploid reads into subgenomes has brought cotton genetic mapping into the genomics er...
Muchero, Wellington; Diop, Ndeye N; Bhat, Prasanna R; Fenton, Raymond D; Wanamaker, Steve; Pottorff, Marti; Hearne, Sarah; Cisse, Ndiaga; Fatokun, Christian; Ehlers, Jeffrey D; Roberts, Philip A; Close, Timothy J
2009-10-27
Consensus genetic linkage maps provide a genomic framework for quantitative trait loci identification, map-based cloning, assessment of genetic diversity, association mapping, and applied breeding in marker-assisted selection schemes. Among "orphan crops" with limited genomic resources such as cowpea [Vigna unguiculata (L.) Walp.] (2n = 2x = 22), the use of transcript-derived SNPs in genetic maps provides opportunities for automated genotyping and estimation of genome structure based on synteny analysis. Here, we report the development and validation of a high-throughput EST-derived SNP assay for cowpea, its application in consensus map building, and determination of synteny to reference genomes. SNP mining from 183,118 ESTs sequenced from 17 cDNA libraries yielded approximately 10,000 high-confidence SNPs from which an Illumina 1,536-SNP GoldenGate genotyping array was developed and applied to 741 recombinant inbred lines from six mapping populations. Approximately 90% of the SNPs were technically successful, providing 1,375 dependable markers. Of these, 928 were incorporated into a consensus genetic map spanning 680 cM with 11 linkage groups and an average marker distance of 0.73 cM. Comparison of this cowpea genetic map to reference legumes, soybean (Glycine max) and Medicago truncatula, revealed extensive macrosynteny encompassing 85 and 82%, respectively, of the cowpea map. Regions of soybean genome duplication were evident relative to the simpler diploid cowpea. Comparison with Arabidopsis revealed extensive genomic rearrangement with some conserved microsynteny. These results support evolutionary closeness between cowpea and soybean and identify regions for synteny-based functional genomics studies in legumes.
Centromere Locations in Brassica A and C Genomes Revealed Through Half-Tetrad Analysis.
Mason, Annaliese S; Rousseau-Gueutin, Mathieu; Morice, Jérôme; Bayer, Philipp E; Besharat, Naghmeh; Cousin, Anouska; Pradhan, Aneeta; Parkin, Isobel A P; Chèvre, Anne-Marie; Batley, Jacqueline; Nelson, Matthew N
2016-02-01
Locating centromeres on genome sequences can be challenging. The high density of repetitive elements in these regions makes sequence assembly problematic, especially when using short-read sequencing technologies. It can also be difficult to distinguish between active and recently extinct centromeres through sequence analysis. An effective solution is to identify genetically active centromeres (functional in meiosis) by half-tetrad analysis. This genetic approach involves detecting heterozygosity along chromosomes in segregating populations derived from gametes (half-tetrads). Unreduced gametes produced by first division restitution mechanisms comprise complete sets of nonsister chromatids. Along these chromatids, heterozygosity is maximal at the centromeres, and homologous recombination events result in homozygosity toward the telomeres. We genotyped populations of half-tetrad-derived individuals (from Brassica interspecific hybrids) using a high-density array of physically anchored SNP markers (Illumina Brassica 60K Infinium array). Mapping the distribution of heterozygosity in these half-tetrad individuals allowed the genetic mapping of all 19 centromeres of the Brassica A and C genomes to the reference Brassica napus genome. Gene and transposable element density across the B. napus genome were also assessed and corresponded well to previously reported genetic map positions. Known centromere-specific sequences were located in the reference genome, but mostly matched unanchored sequences, suggesting that the core centromeric regions may not yet be assembled into the pseudochromosomes of the reference genome. The increasing availability of genetic markers physically anchored to reference genomes greatly simplifies the genetic and physical mapping of centromeres using half-tetrad analysis. We discuss possible applications of this approach, including in species where half-tetrads are currently difficult to isolate. Copyright © 2016 by the Genetics Society of America.
A fruit quality gene map of Prunus
2009-01-01
Background Prunus fruit development, growth, ripening, and senescence includes major biochemical and sensory changes in texture, color, and flavor. The genetic dissection of these complex processes has important applications in crop improvement, to facilitate maximizing and maintaining stone fruit quality from production and processing through to marketing and consumption. Here we present an integrated fruit quality gene map of Prunus containing 133 genes putatively involved in the determination of fruit texture, pigmentation, flavor, and chilling injury resistance. Results A genetic linkage map of 211 markers was constructed for an intraspecific peach (Prunus persica) progeny population, Pop-DG, derived from a canning peach cultivar 'Dr. Davis' and a fresh market cultivar 'Georgia Belle'. The Pop-DG map covered 818 cM of the peach genome and included three morphological markers, 11 ripening candidate genes, 13 cold-responsive genes, 21 novel EST-SSRs from the ChillPeach database, 58 previously reported SSRs, 40 RAFs, 23 SRAPs, 14 IMAs, and 28 accessory markers from candidate gene amplification. The Pop-DG map was co-linear with the Prunus reference T × E map, with 39 SSR markers in common to align the maps. A further 158 markers were bin-mapped to the reference map: 59 ripening candidate genes, 50 cold-responsive genes, and 50 novel EST-SSRs from ChillPeach, with deduced locations in Pop-DG via comparative mapping. Several candidate genes and EST-SSRs co-located with previously reported major trait loci and quantitative trait loci for chilling injury symptoms in Pop-DG. Conclusion The candidate gene approach combined with bin-mapping and availability of a community-recognized reference genetic map provides an efficient means of locating genes of interest in a target genome. We highlight the co-localization of fruit quality candidate genes with previously reported fruit quality QTLs. The fruit quality gene map developed here is a valuable tool for dissecting the genetic architecture of fruit quality traits in Prunus crops. PMID:19995417
Centromere Locations in Brassica A and C Genomes Revealed Through Half-Tetrad Analysis
Mason, Annaliese S.; Rousseau-Gueutin, Mathieu; Morice, Jérôme; Bayer, Philipp E.; Besharat, Naghmeh; Cousin, Anouska; Pradhan, Aneeta; Parkin, Isobel A. P.; Chèvre, Anne-Marie; Batley, Jacqueline; Nelson, Matthew N.
2016-01-01
Locating centromeres on genome sequences can be challenging. The high density of repetitive elements in these regions makes sequence assembly problematic, especially when using short-read sequencing technologies. It can also be difficult to distinguish between active and recently extinct centromeres through sequence analysis. An effective solution is to identify genetically active centromeres (functional in meiosis) by half-tetrad analysis. This genetic approach involves detecting heterozygosity along chromosomes in segregating populations derived from gametes (half-tetrads). Unreduced gametes produced by first division restitution mechanisms comprise complete sets of nonsister chromatids. Along these chromatids, heterozygosity is maximal at the centromeres, and homologous recombination events result in homozygosity toward the telomeres. We genotyped populations of half-tetrad-derived individuals (from Brassica interspecific hybrids) using a high-density array of physically anchored SNP markers (Illumina Brassica 60K Infinium array). Mapping the distribution of heterozygosity in these half-tetrad individuals allowed the genetic mapping of all 19 centromeres of the Brassica A and C genomes to the reference Brassica napus genome. Gene and transposable element density across the B. napus genome were also assessed and corresponded well to previously reported genetic map positions. Known centromere-specific sequences were located in the reference genome, but mostly matched unanchored sequences, suggesting that the core centromeric regions may not yet be assembled into the pseudochromosomes of the reference genome. The increasing availability of genetic markers physically anchored to reference genomes greatly simplifies the genetic and physical mapping of centromeres using half-tetrad analysis. We discuss possible applications of this approach, including in species where half-tetrads are currently difficult to isolate. PMID:26614742
Genetics Home Reference: Feingold syndrome
... Brunner HG. Feingold syndrome: clinical review and genetic mapping. Am J Med Genet A. 2003 Nov 1; ... Brunner HG. MYCN haploinsufficiency is associated with reduced brain size and intestinal atresias in Feingold syndrome. Nat ...
Genetics Home Reference: craniometaphyseal dysplasia
... Passos-Bueno MR. Mapping of the autosomal recessive (AR) craniometaphyseal dysplasia locus to chromosome region 6q21-22 and confirmation of genetic heterogeneity for mild AR spondylocostal dysplasia. Am J Med Genet. 2000 Dec ...
Saxena, Rachit K.; Varma Penmetsa, R.; Upadhyaya, Hari D.; Kumar, Ashish; Carrasquilla-Garcia, Noelia; Schlueter, Jessica A.; Farmer, Andrew; Whaley, Adam M.; Sarma, Birinchi K.; May, Gregory D.; Cook, Douglas R.; Varshney, Rajeev K.
2012-01-01
Single-nucleotide polymorphisms (SNPs, >2000) were discovered by using RNA-seq and allele-specific sequencing approaches in pigeonpea (Cajanus cajan). For making the SNP genotyping cost-effective, successful competitive allele-specific polymerase chain reaction (KASPar) assays were developed for 1616 SNPs and referred to as PKAMs (pigeonpea KASPar assay markers). Screening of PKAMs on 24 genotypes [23 from cultivated species and 1 wild species (Cajanus scarabaeoides)] defined a set of 1154 polymorphic markers (77.4%) with a polymorphism information content (PIC) value from 0.04 to 0.38. One thousand and ninety-four PKAMs showed polymorphisms between parental lines of the reference mapping population (C. cajan ICP 28 × C. scarabaeoides ICPW 94). By using high-quality marker genotyping data on 167 F2 lines from the population, a comprehensive genetic map comprising 875 PKAMs with an average inter-marker distance of 1.11 cM was developed. Previously mapped 35 simple sequence repeat markers were integrated into the PKAM map and an integrated genetic map of 996.21 cM was constructed. Mapped PKAMs showed a higher degree of synteny with the genome of Glycine max followed by Medicago truncatula and Lotus japonicus and least with Vigna unguiculata. These PKAMs will be useful for genetics research and breeding applications in pigeonpea and for utilizing genome information from other legume species. PMID:23103470
Tsai, Hsin Y; Robledo, Diego; Lowe, Natalie R; Bekaert, Michael; Taggart, John B; Bron, James E; Houston, Ross D
2016-07-07
High density linkage maps are useful tools for fine-scale mapping of quantitative trait loci, and characterization of the recombination landscape of a species' genome. Genomic resources for Atlantic salmon (Salmo salar) include a well-assembled reference genome, and high density single nucleotide polymorphism (SNP) arrays. Our aim was to create a high density linkage map, and to align it with the reference genome assembly. Over 96,000 SNPs were mapped and ordered on the 29 salmon linkage groups using a pedigreed population comprising 622 fish from 60 nuclear families, all genotyped with the 'ssalar01' high density SNP array. The number of SNPs per group showed a high positive correlation with physical chromosome length (r = 0.95). While the order of markers on the genetic and physical maps was generally consistent, areas of discrepancy were identified. Approximately 6.5% of the previously unmapped reference genome sequence was assigned to chromosomes using the linkage map. Male recombination rate was lower than females across the vast majority of the genome, but with a notable peak in subtelomeric regions. Finally, using RNA-Seq data to annotate the reference genome, the mapped SNPs were categorized according to their predicted function, including annotation of ∼2500 putative nonsynonymous variants. The highest density SNP linkage map for any salmonid species has been created, annotated, and integrated with the Atlantic salmon reference genome assembly. This map highlights the marked heterochiasmy of salmon, and provides a useful resource for salmonid genetics and genomics research. Copyright © 2016 Tsai et al.
Cat-Map: putting cataract on the map
Bennett, Thomas M.; Hejtmancik, J. Fielding
2010-01-01
Lens opacities, or cataract(s), may be inherited as a classic Mendelian disorder usually with early-onset or, more commonly, acquired with age as a multi-factorial or complex trait. Many genetic forms of cataract have been described in mice and other animal models. Considerable progress has been made in mapping and identifying the genes and mutations responsible for inherited forms of cataract, and genetic determinants of age-related cataract are beginning to be discovered. To provide a convenient and accurate summary of current information focused on the increasing genetic complexity of Mendelian and age-related cataract we have created an online chromosome map and reference database for cataract in humans and mice (Cat-Map). PMID:21042563
Ma, Chun-Lei; Jin, Ji-Qiang; Li, Chun-Fang; Wang, Rong-Kai; Zheng, Hong-Kun; Yao, Ming-Zhe; Chen, Liang
2015-01-01
Genetic maps are important tools in plant genomics and breeding. The present study reports the large-scale discovery of single nucleotide polymorphisms (SNPs) for genetic map construction in tea plant. We developed a total of 6,042 valid SNP markers using specific-locus amplified fragment sequencing (SLAF-seq), and subsequently mapped them into the previous framework map. The final map contained 6,448 molecular markers, distributing on fifteen linkage groups corresponding to the number of tea plant chromosomes. The total map length was 3,965 cM, with an average inter-locus distance of 1.0 cM. This map is the first SNP-based reference map of tea plant, as well as the most saturated one developed to date. The SNP markers and map resources generated in this study provide a wealth of genetic information that can serve as a foundation for downstream genetic analyses, such as the fine mapping of quantitative trait loci (QTL), map-based cloning, marker-assisted selection, and anchoring of scaffolds to facilitate the process of whole genome sequencing projects for tea plant. PMID:26035838
de Miguel, Marina; de Maria, Nuria; Guevara, M Angeles; Diaz, Luis; Sáez-Laguna, Enrique; Sánchez-Gómez, David; Chancerel, Emilie; Aranda, Ismael; Collada, Carmen; Plomion, Christophe; Cabezas, José-Antonio; Cervera, María-Teresa
2012-10-04
Pinus pinaster Ait. is a major resin producing species in Spain. Genetic linkage mapping can facilitate marker-assisted selection (MAS) through the identification of Quantitative Trait Loci and selection of allelic variants of interest in breeding populations. In this study, we report annotated genetic linkage maps for two individuals (C14 and C15) belonging to a breeding program aiming to increase resin production. We use different types of DNA markers, including last-generation molecular markers. We obtained 13 and 14 linkage groups for C14 and C15 maps, respectively. A total of 211 and 215 markers were positioned on each map and estimated genome length was between 1,870 and 2,166 cM respectively, which represents near 65% of genome coverage. Comparative mapping with previously developed genetic linkage maps for P. pinaster based on about 60 common markers enabled aligning linkage groups to this reference map. The comparison of our annotated linkage maps and linkage maps reporting QTL information revealed 11 annotated SNPs in candidate genes that co-localized with previously reported QTLs for wood properties and water use efficiency. This study provides genetic linkage maps from a Spanish population that shows high levels of genetic divergence with French populations from which segregating progenies have been previously mapped. These genetic maps will be of interest to construct a reliable consensus linkage map for the species. The importance of developing functional genetic linkage maps is highlighted, especially when working with breeding populations for its future application in MAS for traits of interest.
2012-01-01
Background Pinus pinaster Ait. is a major resin producing species in Spain. Genetic linkage mapping can facilitate marker-assisted selection (MAS) through the identification of Quantitative Trait Loci and selection of allelic variants of interest in breeding populations. In this study, we report annotated genetic linkage maps for two individuals (C14 and C15) belonging to a breeding program aiming to increase resin production. We use different types of DNA markers, including last-generation molecular markers. Results We obtained 13 and 14 linkage groups for C14 and C15 maps, respectively. A total of 211 and 215 markers were positioned on each map and estimated genome length was between 1,870 and 2,166 cM respectively, which represents near 65% of genome coverage. Comparative mapping with previously developed genetic linkage maps for P. pinaster based on about 60 common markers enabled aligning linkage groups to this reference map. The comparison of our annotated linkage maps and linkage maps reporting QTL information revealed 11 annotated SNPs in candidate genes that co-localized with previously reported QTLs for wood properties and water use efficiency. Conclusions This study provides genetic linkage maps from a Spanish population that shows high levels of genetic divergence with French populations from which segregating progenies have been previously mapped. These genetic maps will be of interest to construct a reliable consensus linkage map for the species. The importance of developing functional genetic linkage maps is highlighted, especially when working with breeding populations for its future application in MAS for traits of interest. PMID:23036012
2011-01-01
Background A number of molecular marker linkage maps have been developed for melon (Cucumis melo L.) over the last two decades. However, these maps were constructed using different marker sets, thus, making comparative analysis among maps difficult. In order to solve this problem, a consensus genetic map in melon was constructed using primarily highly transferable anchor markers that have broad potential use for mapping, synteny, and comparative quantitative trait loci (QTL) analysis, increasing breeding effectiveness and efficiency via marker-assisted selection (MAS). Results Under the framework of the International Cucurbit Genomics Initiative (ICuGI, http://www.icugi.org), an integrated genetic map has been constructed by merging data from eight independent mapping experiments using a genetically diverse array of parental lines. The consensus map spans 1150 cM across the 12 melon linkage groups and is composed of 1592 markers (640 SSRs, 330 SNPs, 252 AFLPs, 239 RFLPs, 89 RAPDs, 15 IMAs, 16 indels and 11 morphological traits) with a mean marker density of 0.72 cM/marker. One hundred and ninety-six of these markers (157 SSRs, 32 SNPs, 6 indels and 1 RAPD) were newly developed, mapped or provided by industry representatives as released markers, including 27 SNPs and 5 indels from genes involved in the organic acid metabolism and transport, and 58 EST-SSRs. Additionally, 85 of 822 SSR markers contributed by Syngenta Seeds were included in the integrated map. In addition, 370 QTL controlling 62 traits from 18 previously reported mapping experiments using genetically diverse parental genotypes were also integrated into the consensus map. Some QTL associated with economically important traits detected in separate studies mapped to similar genomic positions. For example, independently identified QTL controlling fruit shape were mapped on similar genomic positions, suggesting that such QTL are possibly responsible for the phenotypic variability observed for this trait in a broad array of melon germplasm. Conclusions Even though relatively unsaturated genetic maps in a diverse set of melon market types have been published, the integrated saturated map presented herein should be considered the initial reference map for melon. Most of the mapped markers contained in the reference map are polymorphic in diverse collection of germplasm, and thus are potentially transferrable to a broad array of genetic experimentation (e.g., integration of physical and genetic maps, colinearity analysis, map-based gene cloning, epistasis dissection, and marker-assisted selection). PMID:21797998
Diaz, Aurora; Fergany, Mohamed; Formisano, Gelsomina; Ziarsolo, Peio; Blanca, José; Fei, Zhanjun; Staub, Jack E; Zalapa, Juan E; Cuevas, Hugo E; Dace, Gayle; Oliver, Marc; Boissot, Nathalie; Dogimont, Catherine; Pitrat, Michel; Hofstede, René; van Koert, Paul; Harel-Beja, Rotem; Tzuri, Galil; Portnoy, Vitaly; Cohen, Shahar; Schaffer, Arthur; Katzir, Nurit; Xu, Yong; Zhang, Haiying; Fukino, Nobuko; Matsumoto, Satoru; Garcia-Mas, Jordi; Monforte, Antonio J
2011-07-28
A number of molecular marker linkage maps have been developed for melon (Cucumis melo L.) over the last two decades. However, these maps were constructed using different marker sets, thus, making comparative analysis among maps difficult. In order to solve this problem, a consensus genetic map in melon was constructed using primarily highly transferable anchor markers that have broad potential use for mapping, synteny, and comparative quantitative trait loci (QTL) analysis, increasing breeding effectiveness and efficiency via marker-assisted selection (MAS). Under the framework of the International Cucurbit Genomics Initiative (ICuGI, http://www.icugi.org), an integrated genetic map has been constructed by merging data from eight independent mapping experiments using a genetically diverse array of parental lines. The consensus map spans 1150 cM across the 12 melon linkage groups and is composed of 1592 markers (640 SSRs, 330 SNPs, 252 AFLPs, 239 RFLPs, 89 RAPDs, 15 IMAs, 16 indels and 11 morphological traits) with a mean marker density of 0.72 cM/marker. One hundred and ninety-six of these markers (157 SSRs, 32 SNPs, 6 indels and 1 RAPD) were newly developed, mapped or provided by industry representatives as released markers, including 27 SNPs and 5 indels from genes involved in the organic acid metabolism and transport, and 58 EST-SSRs. Additionally, 85 of 822 SSR markers contributed by Syngenta Seeds were included in the integrated map. In addition, 370 QTL controlling 62 traits from 18 previously reported mapping experiments using genetically diverse parental genotypes were also integrated into the consensus map. Some QTL associated with economically important traits detected in separate studies mapped to similar genomic positions. For example, independently identified QTL controlling fruit shape were mapped on similar genomic positions, suggesting that such QTL are possibly responsible for the phenotypic variability observed for this trait in a broad array of melon germplasm. Even though relatively unsaturated genetic maps in a diverse set of melon market types have been published, the integrated saturated map presented herein should be considered the initial reference map for melon. Most of the mapped markers contained in the reference map are polymorphic in diverse collection of germplasm, and thus are potentially transferrable to a broad array of genetic experimentation (e.g., integration of physical and genetic maps, colinearity analysis, map-based gene cloning, epistasis dissection, and marker-assisted selection).
Genetics Home Reference: sialuria
... inheritance of sialuria, an inborn error of feedback inhibition. Am J Hum Genet. 2001 Jun;68(6): ... Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & Players ...
Dissection of Host Susceptibility to Bacterial Infections and Its Toxins.
Nashef, Aysar; Agbaria, Mahmoud; Shusterman, Ariel; Lorè, Nicola Ivan; Bragonzi, Alessandra; Wiess, Ervin; Houri-Haddad, Yael; Iraqi, Fuad A
2017-01-01
Infection is one of the leading causes of human mortality and morbidity. Exposure to microbial agents is obviously required. However, also non-microbial environmental and host factors play a key role in the onset, development and outcome of infectious disease, resulting in large of clinical variability between individuals in a population infected with the same microbe. Controlled and standardized investigations of the genetics of susceptibility to infectious disease are almost impossible to perform in humans whereas mouse models allow application of powerful genomic techniques to identify and validate causative genes underlying human diseases with complex etiologies. Most of current animal models used in complex traits diseases genetic mapping have limited genetic diversity. This limitation impedes the ability to create incorporated network using genetic interactions, epigenetics, environmental factors, microbiota, and other phenotypes. A novel mouse genetic reference population for high-resolution mapping and subsequently identifying genes underlying the QTL, namely the Collaborative Cross (CC) mouse genetic reference population (GRP) was recently developed. In this chapter, we discuss a variety of approaches using CC mice for mapping genes underlying quantitative trait loci (QTL) to dissect the host response to polygenic traits, including infectious disease caused by bacterial agents and its toxins.
Genetics Home Reference: prolidase deficiency
... mutations as a tool to investigate structure-function relationship. J Hum Genet. 2004;49(9):500-6. ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: cri-du-chat syndrome
... Pinkel D. High-resolution mapping of genotype-phenotype relationships in cri du chat syndrome using array comparative ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Kantarski, Traci; Larson, Steve; Zhang, Xiaofei; DeHaan, Lee; Borevitz, Justin; Anderson, James; Poland, Jesse
2017-01-01
Development of the first consensus genetic map of intermediate wheatgrass gives insight into the genome and tools for molecular breeding. Intermediate wheatgrass (Thinopyrum intermedium) has been identified as a candidate for domestication and improvement as a perennial grain, forage, and biofuel crop and is actively being improved by several breeding programs. To accelerate this process using genomics-assisted breeding, efficient genotyping methods and genetic marker reference maps are needed. We present here the first consensus genetic map for intermediate wheatgrass (IWG), which confirms the species' allohexaploid nature (2n = 6x = 42) and homology to Triticeae genomes. Genotyping-by-sequencing was used to identify markers that fit expected segregation ratios and construct genetic maps for 13 heterogeneous parents of seven full-sib families. These maps were then integrated using a linear programming method to produce a consensus map with 21 linkage groups containing 10,029 markers, 3601 of which were present in at least two populations. Each of the 21 linkage groups contained between 237 and 683 markers, cumulatively covering 5061 cM (2891 cM--Kosambi) with an average distance of 0.5 cM between each pair of markers. Through mapping the sequence tags to the diploid (2n = 2x = 14) barley reference genome, we observed high colinearity and synteny between these genomes, with three homoeologous IWG chromosomes corresponding to each of the seven barley chromosomes, and mapped translocations that are known in the Triticeae. The consensus map is a valuable tool for wheat breeders to map important disease-resistance genes within intermediate wheatgrass. These genomic tools can help lead to rapid improvement of IWG and development of high-yielding cultivars of this perennial grain that would facilitate the sustainable intensification of agricultural systems.
Genetics Home Reference: Y chromosome infertility
... deletions" of the human Y chromosome and their relationship with male infertility. J Genet Genomics. 2008 Apr; ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: spina bifida
... PubMed or Free article on PubMed Central Bassuk AG, Kibar Z. Genetic basis of neural tube defects. ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: anencephaly
... PubMed or Free article on PubMed Central Bassuk AG, Kibar Z. Genetic basis of neural tube defects. ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: CDKL5 deficiency disorder
... Recurrent mutations in the CDKL5 gene: genotype-phenotype relationships. Am J Med Genet A. 2012 Jul;158A( ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: Ghosal hematodiaphyseal dysplasia
... A, Le Merrer M, Cormier-Daire V. A gene responsible for Ghosal hemato-diaphyseal dysplasia maps to chromosome 7q33-34. Hum Genet. ... are genome editing and CRISPR-Cas9? What is precision medicine? What ...
Genetics Home Reference: complement factor I deficiency
... F, Zelazko M, Marquart H, Muller K, Sjöholm AG, Truedsson L, Villoutreix BO, Blom AM. Genetic, molecular ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Bohra, Abhishek; Saxena, Rachit K; Gnanesh, B N; Saxena, Kulbhushan; Byregowda, M; Rathore, Abhishek; Kavikishor, P B; Cook, Douglas R; Varshney, Rajeev K
2012-10-01
Pigeonpea (Cajanus cajan L.) is an important food legume crop of rainfed agriculture. Owing to exposure of the crop to a number of biotic and abiotic stresses, the crop productivity has remained stagnant for almost last five decades at ca. 750 kg/ha. The availability of a cytoplasmic male sterility (CMS) system has facilitated the development and release of hybrids which are expected to enhance the productivity of pigeonpea. Recent advances in genomics and molecular breeding such as marker-assisted selection (MAS) offer the possibility to accelerate hybrid breeding. Molecular markers and genetic maps are pre-requisites for deploying MAS in breeding. However, in the case of pigeonpea, only one inter- and two intra-specific genetic maps are available so far. Here, four new intra-specific genetic maps comprising 59-140 simple sequence repeat (SSR) loci with map lengths ranging from 586.9 to 881.6 cM have been constructed. Using these four genetic maps together with two recently published intra-specific genetic maps, a consensus map was constructed, comprising of 339 SSR loci spanning a distance of 1,059 cM. Furthermore, quantitative trait loci (QTL) analysis for fertility restoration (Rf) conducted in three mapping populations identified four major QTLs explaining phenotypic variances up to 24 %. To the best of our knowledge, this is the first report on construction of a consensus genetic map in pigeonpea and on the identification of QTLs for fertility restoration. The developed consensus genetic map should serve as a reference for developing new genetic maps as well as correlating with the physical map in pigeonpea to be developed in near future. The availability of more informative markers in the bins harbouring QTLs for sterility mosaic disease (SMD) and Rf will facilitate the selection of the most suitable markers for genetic analysis and molecular breeding applications in pigeonpea.
High-Resolution Maps of Mouse Reference Populations
Simecek, Petr; Forejt, Jiri; Williams, Robert W.; Shiroishi, Toshihiko; Takada, Toyoyuki; Lu, Lu; Johnson, Thomas E.; Bennett, Beth; Deschepper, Christian F.; Scott-Boyer, Marie-Pier; Pardo-Manuel de Villena, Fernando; Churchill, Gary A.
2017-01-01
Genetic reference panels are widely used to map complex, quantitative traits in model organisms. We have generated new high-resolution genetic maps of 259 mouse inbred strains from recombinant inbred strain panels (C57BL/6J × DBA/2J, ILS/IbgTejJ × ISS/IbgTejJ, and C57BL/6J × A/J) and chromosome substitution strain panels (C57BL/6J-Chr#, C57BL/6J-Chr#
High-Resolution Maps of Mouse Reference Populations.
Simecek, Petr; Forejt, Jiri; Williams, Robert W; Shiroishi, Toshihiko; Takada, Toyoyuki; Lu, Lu; Johnson, Thomas E; Bennett, Beth; Deschepper, Christian F; Scott-Boyer, Marie-Pier; Pardo-Manuel de Villena, Fernando; Churchill, Gary A
2017-10-05
Genetic reference panels are widely used to map complex, quantitative traits in model organisms. We have generated new high-resolution genetic maps of 259 mouse inbred strains from recombinant inbred strain panels (C57BL/6J × DBA/2J, ILS/IbgTejJ × ISS/IbgTejJ, and C57BL/6J × A/J) and chromosome substitution strain panels (C57BL/6J-Chr#, C57BL/6J-Chr#
Ganal, Martin W.; Durstewitz, Gregor; Polley, Andreas; Bérard, Aurélie; Buckler, Edward S.; Charcosset, Alain; Clarke, Joseph D.; Graner, Eva-Maria; Hansen, Mark; Joets, Johann; Le Paslier, Marie-Christine; McMullen, Michael D.; Montalent, Pierre; Rose, Mark; Schön, Chris-Carolin; Sun, Qi; Walter, Hildrun; Martin, Olivier C.; Falque, Matthieu
2011-01-01
SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations – IBM (B73×Mo17) and LHRF (F2×F252) – were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding. PMID:22174790
Kujur, Alice; Upadhyaya, Hari D.; Shree, Tanima; Bajaj, Deepak; Das, Shouvik; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We discovered 26785 and 16573 high-quality SNPs differentiating two parental genotypes of a RIL mapping population using reference desi and kabuli genome-based GBS assay. Of these, 3625 and 2177 SNPs have been integrated into eight desi and kabuli chromosomes, respectively in order to construct ultra-high density (0.20–0.37 cM) intra-specific chickpea genetic linkage maps. One of these constructed high-resolution genetic map has potential to identify 33 major genomic regions harbouring 35 robust QTLs (PVE: 17.9–39.7%) associated with three agronomic traits, which were mapped within <1 cM mean marker intervals on desi chromosomes. The extended LD (linkage disequilibrium) decay (~15 cM) in chromosomes of genetic maps have encouraged us to use a rapid integrated approach (comparative QTL mapping, QTL-region specific haplotype/LD-based trait association analysis, expression profiling and gene haplotype-based association mapping) rather than a traditional QTL map-based cloning method to narrow-down one major seed weight (SW) robust QTL region. It delineated favourable natural allelic variants and superior haplotype-containing one seed-specific candidate embryo defective gene regulating SW in chickpea. The ultra-high-resolution genetic maps, QTLs/genes and alleles/haplotypes-related genomic information generated and integrated strategy for rapid QTL/gene identification developed have potential to expedite genomics-assisted breeding applications in crop plants, including chickpea for their genetic enhancement. PMID:25942004
A genetic map and germplasm diversity estimation of Mangifera indica (mango) with SNPs
USDA-ARS?s Scientific Manuscript database
Mango (Mangifera indica) is often referred to as the “King of Fruits”. As the first steps in developing a mango genomics project, we genotyped 582 individuals comprising six mapping populations with 1054 SNP markers. The resulting consensus map had 20 linkage groups defined by 726 SNP markers with...
Jo, Jinkwan; Purushotham, Preethi M.; Han, Koeun; Lee, Heung-Ryul; Nah, Gyoungju; Kang, Byoung-Cheorl
2017-01-01
Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs. PMID:28959273
Yu, Yang; Zhang, Xiaojun; Yuan, Jianbo; Li, Fuhua; Chen, Xiaohan; Zhao, Yongzhen; Huang, Long; Zheng, Hongkun; Xiang, Jianhai
2015-01-01
The Pacific white shrimp Litopenaeus vannamei is the dominant crustacean species in global seafood mariculture. Understanding the genome and genetic architecture is useful for deciphering complex traits and accelerating the breeding program in shrimp. In this study, a genome survey was conducted and a high-density linkage map was constructed using a next-generation sequencing approach. The genome survey was used to identify preliminary genome characteristics and to generate a rough reference for linkage map construction. De novo SNP discovery resulted in 25,140 polymorphic markers. A total of 6,359 high-quality markers were selected for linkage map construction based on marker coverage among individuals and read depths. For the linkage map, a total of 6,146 markers spanning 4,271.43 cM were mapped to 44 sex-averaged linkage groups, with an average marker distance of 0.7 cM. An integration analysis linked 5,885 genome scaffolds and 1,504 BAC clones to the linkage map. Based on the high-density linkage map, several QTLs for body weight and body length were detected. This high-density genetic linkage map reveals basic genomic architecture and will be useful for comparative genomics research, genome assembly and genetic improvement of L. vannamei and other penaeid shrimp species. PMID:26503227
Yuan, Shuai; Johnston, H. Richard; Zhang, Guosheng; Li, Yun; Hu, Yi-Juan; Qin, Zhaohui S.
2015-01-01
With rapid decline of the sequencing cost, researchers today rush to embrace whole genome sequencing (WGS), or whole exome sequencing (WES) approach as the next powerful tool for relating genetic variants to human diseases and phenotypes. A fundamental step in analyzing WGS and WES data is mapping short sequencing reads back to the reference genome. This is an important issue because incorrectly mapped reads affect the downstream variant discovery, genotype calling and association analysis. Although many read mapping algorithms have been developed, the majority of them uses the universal reference genome and do not take sequence variants into consideration. Given that genetic variants are ubiquitous, it is highly desirable if they can be factored into the read mapping procedure. In this work, we developed a novel strategy that utilizes genotypes obtained a priori to customize the universal haploid reference genome into a personalized diploid reference genome. The new strategy is implemented in a program named RefEditor. When applying RefEditor to real data, we achieved encouraging improvements in read mapping, variant discovery and genotype calling. Compared to standard approaches, RefEditor can significantly increase genotype calling consistency (from 43% to 61% at 4X coverage; from 82% to 92% at 20X coverage) and reduce Mendelian inconsistency across various sequencing depths. Because many WGS and WES studies are conducted on cohorts that have been genotyped using array-based genotyping platforms previously or concurrently, we believe the proposed strategy will be of high value in practice, which can also be applied to the scenario where multiple NGS experiments are conducted on the same cohort. The RefEditor sources are available at https://github.com/superyuan/refeditor. PMID:26267278
High-resolution genetic maps of Eucalyptus improve Eucalyptus grandis genome assembly.
Bartholomé, Jérôme; Mandrou, Eric; Mabiala, André; Jenkins, Jerry; Nabihoudine, Ibouniyamine; Klopp, Christophe; Schmutz, Jeremy; Plomion, Christophe; Gion, Jean-Marc
2015-06-01
Genetic maps are key tools in genetic research as they constitute the framework for many applications, such as quantitative trait locus analysis, and support the assembly of genome sequences. The resequencing of the two parents of a cross between Eucalyptus urophylla and Eucalyptus grandis was used to design a single nucleotide polymorphism (SNP) array of 6000 markers evenly distributed along the E. grandis genome. The genotyping of 1025 offspring enabled the construction of two high-resolution genetic maps containing 1832 and 1773 markers with an average marker interval of 0.45 and 0.5 cM for E. grandis and E. urophylla, respectively. The comparison between genetic maps and the reference genome highlighted 85% of collinear regions. A total of 43 noncollinear regions and 13 nonsynthetic regions were detected and corrected in the new genome assembly. This improved version contains 4943 scaffolds totalling 691.3 Mb of which 88.6% were captured by the 11 chromosomes. The mapping data were also used to investigate the effect of population size and number of markers on linkage mapping accuracy. This study provides the most reliable linkage maps for Eucalyptus and version 2.0 of the E. grandis genome. © 2014 CIRAD. New Phytologist © 2014 New Phytologist Trust.
Genome-wide SNP identification and QTL mapping for black rot resistance in cabbage.
Lee, Jonghoon; Izzah, Nur Kholilatul; Jayakodi, Murukarthick; Perumal, Sampath; Joh, Ho Jun; Lee, Hyeon Ju; Lee, Sang-Choon; Park, Jee Young; Yang, Ki-Woung; Nou, Il-Sup; Seo, Joodeok; Yoo, Jaeheung; Suh, Youngdeok; Ahn, Kyounggu; Lee, Ji Hyun; Choi, Gyung Ja; Yu, Yeisoo; Kim, Heebal; Yang, Tae-Jin
2015-02-03
Black rot is a destructive bacterial disease causing large yield and quality losses in Brassica oleracea. To detect quantitative trait loci (QTL) for black rot resistance, we performed whole-genome resequencing of two cabbage parental lines and genome-wide SNP identification using the recently published B. oleracea genome sequences as reference. Approximately 11.5 Gb of sequencing data was produced from each parental line. Reference genome-guided mapping and SNP calling revealed 674,521 SNPs between the two cabbage lines, with an average of one SNP per 662.5 bp. Among 167 dCAPS markers derived from candidate SNPs, 117 (70.1%) were validated as bona fide SNPs showing polymorphism between the parental lines. We then improved the resolution of a previous genetic map by adding 103 markers including 87 SNP-based dCAPS markers. The new map composed of 368 markers and covers 1467.3 cM with an average interval of 3.88 cM between adjacent markers. We evaluated black rot resistance in the mapping population in three independent inoculation tests using F2:3 progenies and identified one major QTL and three minor QTLs. We report successful utilization of whole-genome resequencing for large-scale SNP identification and development of molecular markers for genetic map construction. In addition, we identified novel QTLs for black rot resistance. The high-density genetic map will promote QTL analysis for other important agricultural traits and marker-assisted breeding of B. oleracea.
A presentation of the differences between the sheep and goat genetic maps
2005-01-01
The current autosomal version (4.2) of the sheep genetic map comprises 1175 loci and spans ~3540 cM. This corresponds to almost complete coverage of the sheep genome. Each chromosome is represented by a single linkage group, with the largest gap between adjacent loci being 19.8 cM. In contrast the 1998 goat genetic map (the most recently published) is much less well developed spanning 2737 cM and comprising only 307 loci. Only one of the goat chromosomes appears to have complete coverage (chromosome 27), and 16 of the chromosomes are comprised of two or more linkage groups, or a linkage group and one or more unlinked markers. The two maps share 218 loci, and the maps have been aligned using the shared loci as reference points. Overall there is good agreement between the maps in terms of homologous loci mapping to equivalent chromosomes in the two species, with only four markers mapping to non-equivalent chromosomes. However, there are lots of inversions in locus order between the sheep and goat chromosomes. Whilst some of these differences in locus order may be genuine, the majority are likely to be a consequence of the paucity of genetic information for the goat map. PMID:15601590
Fine genetic mapping of spot blotch resistance gene Sb3 in wheat (Triticum aestivum).
Lu, Ping; Liang, Yong; Li, Delin; Wang, Zhengzhong; Li, Wenbin; Wang, Guoxin; Wang, Yong; Zhou, Shenghui; Wu, Qiuhong; Xie, Jingzhong; Zhang, Deyun; Chen, Yongxing; Li, Miaomiao; Zhang, Yan; Sun, Qixin; Han, Chenggui; Liu, Zhiyong
2016-03-01
Spot blotch disease resistance gene Sb3 was mapped to a 0.15 centimorgan (cM) genetic interval spanning a 602 kb physical genomic region on chromosome 3BS. Wheat spot blotch disease, caused by B. sorokiniana, is a devastating disease that can cause severe yield losses. Although inoculum levels can be reduced by planting disease-free seed, treatment of plants with fungicides and crop rotation, genetic resistance is likely to be a robust, economical and environmentally friendly tool in the control of spot blotch. The winter wheat line 621-7-1 confers immune resistance against B. sorokiniana. Genetic analysis indicates that the spot blotch resistance of 621-7-1 is controlled by a single dominant gene, provisionally designated Sb3. Bulked segregant analysis (BSA) and simple sequence repeat (SSR) mapping showed that Sb3 is located on chromosome arm 3BS linked with markers Xbarc133 and Xbarc147. Seven and twelve new polymorphic markers were developed from the Chinese Spring 3BS shotgun survey sequence contigs and 3BS reference sequences, respectively. Finally, Sb3 was mapped in a 0.15 cM genetic interval spanning a 602 kb physical genomic region of Chinese Spring chromosome 3BS. The genetic and physical maps of Sb3 provide a framework for map-based cloning and marker-assisted selection (MAS) of the spot blotch resistance.
Farmer, Andrew D.; Huang, Wei; Ambachew, Daniel; Penmetsa, R. Varma; Carrasquilla-Garcia, Noelia; Assefa, Teshale; Cannon, Steven B.
2018-01-01
Recombination (R) rate and linkage disequilibrium (LD) analyses are the basis for plant breeding. These vary by breeding system, by generation of inbreeding or outcrossing and by region in the chromosome. Common bean (Phaseolus vulgaris L.) is a favored food legume with a small sequenced genome (514 Mb) and n = 11 chromosomes. The goal of this study was to describe R and LD in the common bean genome using a 768-marker array of single nucleotide polymorphisms (SNP) based on Trans-legume Orthologous Group (TOG) genes along with an advanced-generation Recombinant Inbred Line reference mapping population (BAT93 x Jalo EEP558) and an internationally available diversity panel. A whole genome genetic map was created that covered all eleven linkage groups (LG). The LGs were linked to the physical map by sequence data of the TOGs compared to each chromosome sequence of common bean. The genetic map length in total was smaller than for previous maps reflecting the precision of allele calling and mapping with SNP technology as well as the use of gene-based markers. A total of 91.4% of TOG markers had singleton hits with annotated Pv genes and all mapped outside of regions of resistance gene clusters. LD levels were found to be stronger within the Mesoamerican genepool and decay more rapidly within the Andean genepool. The recombination rate across the genome was 2.13 cM / Mb but R was found to be highly repressed around centromeres and frequent outside peri-centromeric regions. These results have important implications for association and genetic mapping or crop improvement in common bean. PMID:29522524
Zhao, Zhenqing; Gu, Honghui; Sheng, Xiaoguang; Yu, Huifang; Wang, Jiansheng; Huang, Long; Wang, Dan
2016-01-01
Molecular markers and genetic maps play an important role in plant genomics and breeding studies. Cauliflower is an important and distinctive vegetable; however, very few molecular resources have been reported for this species. In this study, a novel, specific-locus amplified fragment (SLAF) sequencing strategy was employed for large-scale single nucleotide polymorphism (SNP) discovery and high-density genetic map construction in a double-haploid, segregating population of cauliflower. A total of 12.47 Gb raw data containing 77.92 M pair-end reads were obtained after processing and 6815 polymorphic SLAFs between the two parents were detected. The average sequencing depths reached 52.66-fold for the female parent and 49.35-fold for the male parent. Subsequently, these polymorphic SLAFs were used to genotype the population and further filtered based on several criteria to construct a genetic linkage map of cauliflower. Finally, 1776 high-quality SLAF markers, including 2741 SNPs, constituted the linkage map with average data integrity of 95.68%. The final map spanned a total genetic length of 890.01 cM with an average marker interval of 0.50 cM, and covered 364.9 Mb of the reference genome. The markers and genetic map developed in this study could provide an important foundation not only for comparative genomics studies within Brassica oleracea species but also for quantitative trait loci identification and molecular breeding of cauliflower. PMID:27047515
Improved maize reference genome with single-molecule technologies.
Jiao, Yinping; Peluso, Paul; Shi, Jinghua; Liang, Tiffany; Stitzer, Michelle C; Wang, Bo; Campbell, Michael S; Stein, Joshua C; Wei, Xuehong; Chin, Chen-Shan; Guill, Katherine; Regulski, Michael; Kumari, Sunita; Olson, Andrew; Gent, Jonathan; Schneider, Kevin L; Wolfgruber, Thomas K; May, Michael R; Springer, Nathan M; Antoniou, Eric; McCombie, W Richard; Presting, Gernot G; McMullen, Michael; Ross-Ibarra, Jeffrey; Dawe, R Kelly; Hastie, Alex; Rank, David R; Ware, Doreen
2017-06-22
Complete and accurate reference genomes and annotations provide fundamental tools for characterization of genetic and functional variation. These resources facilitate the determination of biological processes and support translation of research findings into improved and sustainable agricultural technologies. Many reference genomes for crop plants have been generated over the past decade, but these genomes are often fragmented and missing complex repeat regions. Here we report the assembly and annotation of a reference genome of maize, a genetic and agricultural model species, using single-molecule real-time sequencing and high-resolution optical mapping. Relative to the previous reference genome, our assembly features a 52-fold increase in contig length and notable improvements in the assembly of intergenic spaces and centromeres. Characterization of the repetitive portion of the genome revealed more than 130,000 intact transposable elements, allowing us to identify transposable element lineage expansions that are unique to maize. Gene annotations were updated using 111,000 full-length transcripts obtained by single-molecule real-time sequencing. In addition, comparative optical mapping of two other inbred maize lines revealed a prevalence of deletions in regions of low gene density and maize lineage-specific genes.
Wang, Jun; Wang, Zhilan; Du, Xiaofen; Yang, Huiqing; Han, Fang; Han, Yuanhuai; Yuan, Feng; Zhang, Linyi; Peng, Shuzhong; Guo, Erhu
2017-01-01
Foxtail millet (Setaria italica), a very important grain crop in China, has become a new model plant for cereal crops and biofuel grasses. Although its reference genome sequence was released recently, quantitative trait loci (QTLs) controlling complex agronomic traits remains limited. The development of massively parallel genotyping methods and next-generation sequencing technologies provides an excellent opportunity for developing single-nucleotide polymorphisms (SNPs) for linkage map construction and QTL analysis of complex quantitative traits. In this study, a high-throughput and cost-effective RAD-seq approach was employed to generate a high-density genetic map for foxtail millet. A total of 2,668,587 SNP loci were detected according to the reference genome sequence; meanwhile, 9,968 SNP markers were used to genotype 124 F2 progenies derived from the cross between Hongmiaozhangu and Changnong35; a high-density genetic map spanning 1648.8 cM, with an average distance of 0.17 cM between adjacent markers was constructed; 11 major QTLs for eight agronomic traits were identified; five co-dominant DNA markers were developed. These findings will be of value for the identification of candidate genes and marker-assisted selection in foxtail millet.
Wang, Zhilan; Du, Xiaofen; Yang, Huiqing; Han, Fang; Han, Yuanhuai; Yuan, Feng; Zhang, Linyi; Peng, Shuzhong; Guo, Erhu
2017-01-01
Foxtail millet (Setaria italica), a very important grain crop in China, has become a new model plant for cereal crops and biofuel grasses. Although its reference genome sequence was released recently, quantitative trait loci (QTLs) controlling complex agronomic traits remains limited. The development of massively parallel genotyping methods and next-generation sequencing technologies provides an excellent opportunity for developing single-nucleotide polymorphisms (SNPs) for linkage map construction and QTL analysis of complex quantitative traits. In this study, a high-throughput and cost-effective RAD-seq approach was employed to generate a high-density genetic map for foxtail millet. A total of 2,668,587 SNP loci were detected according to the reference genome sequence; meanwhile, 9,968 SNP markers were used to genotype 124 F2 progenies derived from the cross between Hongmiaozhangu and Changnong35; a high-density genetic map spanning 1648.8 cM, with an average distance of 0.17 cM between adjacent markers was constructed; 11 major QTLs for eight agronomic traits were identified; five co-dominant DNA markers were developed. These findings will be of value for the identification of candidate genes and marker-assisted selection in foxtail millet. PMID:28644843
Construction of an almond linkage map in an Australian population Nonpareil × Lauranne
2010-01-01
Background Despite a high genetic similarity to peach, almonds (Prunus dulcis) have a fleshless fruit and edible kernel, produced as a crop for human consumption. While the release of peach genome v1.0 provides an excellent opportunity for almond genetic and genomic studies, well-assessed segregating populations and the respective saturated genetic linkage maps lay the foundation for such studies to be completed in almond. Results Using an almond intraspecific cross between 'Nonpareil' and 'Lauranne' (N × L), we constructed a moderately saturated map with SSRs, SNPs, ISSRs and RAPDs. The N × L map covered 591.4 cM of the genome with 157 loci. The average marker distance of the map was 4.0 cM. The map displayed high synteny and colinearity with the Prunus T × E reference map in all eight linkage groups (G1-G8). The positions of 14 mapped gene-anchored SNPs corresponded approximately with the positions of homologous sequences in the peach genome v1.0. Analysis of Mendelian segregation ratios showed that 17.9% of markers had significantly skewed genotype ratios at the level of P < 0.05. Due to the large number of skewed markers in the linkage group 7, the potential existence of deleterious gene(s) was assessed in the group. Integrated maps produced by two different mapping methods using JoinMap® 3 were compared, and their high degree of similarity was evident despite the positional inconsistency of a few markers. Conclusions We presented a moderately saturated Australian almond map, which is highly syntenic and collinear with the Prunus reference map and peach genome V1.0. Therefore, the well-assessed almond population reported here can be used to investigate the traits of interest under Australian growing conditions, and provides more information on the almond genome for the international community. PMID:20932335
Population substructure in Cache County, Utah: the Cache County study
2014-01-01
Background Population stratification is a key concern for genetic association analyses. In addition, extreme homogeneity of ethnic origins of a population can make it difficult to interpret how genetic associations in that population may translate into other populations. Here we have evaluated the genetic substructure of samples from the Cache County study relative to the HapMap Reference populations and data from the Alzheimer's Disease Neuroimaging Initiative (ADNI). Results Our findings show that the Cache County study is similar in ethnic diversity to the self-reported "Whites" in the ADNI sample and less homogenous than the HapMap CEU population. Conclusions We conclude that the Cache County study is genetically representative of the general European American population in the USA and is an appropriate population for conducting broadly applicable genetic studies. PMID:25078123
Peng, Wenzhu; Xu, Jian; Zhang, Yan; Feng, Jianxin; Dong, Chuanju; Jiang, Likun; Feng, Jingyan; Chen, Baohua; Gong, Yiwen; Chen, Lin; Xu, Peng
2016-01-01
High density genetic linkage maps are essential for QTL fine mapping, comparative genomics and high quality genome sequence assembly. In this study, we constructed a high-density and high-resolution genetic linkage map with 28,194 SNP markers on 14,146 distinct loci for common carp based on high-throughput genotyping with the carp 250 K single nucleotide polymorphism (SNP) array in a mapping family. The genetic length of the consensus map was 10,595.94 cM with an average locus interval of 0.75 cM and an average marker interval of 0.38 cM. Comparative genomic analysis revealed high level of conserved syntenies between common carp and the closely related model species zebrafish and medaka. The genome scaffolds were anchored to the high-density linkage map, spanning 1,357 Mb of common carp reference genome. QTL mapping and association analysis identified 22 QTLs for growth-related traits and 7 QTLs for sex dimorphism. Candidate genes underlying growth-related traits were identified, including important regulators such as KISS2, IGF1, SMTLB, NPFFR1 and CPE. Candidate genes associated with sex dimorphism were also identified including 3KSR and DMRT2b. The high-density and high-resolution genetic linkage map provides an important tool for QTL fine mapping and positional cloning of economically important traits, and improving common carp genome assembly. PMID:27225429
Quantitative trait loci that control the oil content variation of rapeseed (Brassica napus L.).
Jiang, Congcong; Shi, Jiaqin; Li, Ruiyuan; Long, Yan; Wang, Hao; Li, Dianrong; Zhao, Jianyi; Meng, Jinling
2014-04-01
This report describes an integrative analysis of seed-oil-content quantitative trait loci (QTL) in Brassica napus , using a high-density genetic map to align QTL among different populations. Rapeseed (Brassica napus) is an important source of edible oil and sustainable energy. Given the challenge involved in using only a few genes to substantially increase the oil content of rapeseed without affecting the fatty acid composition, exploitation of a greater number of genetic loci that regulate the oil content variation among rapeseed germplasm is of fundamental importance. In this study, we investigated variation in the seed-oil content among two related genetic populations of Brassica napus, the TN double-haploid population and its derivative reconstructed-F2 population. Each population was grown in multiple experiments under different environmental conditions. Mapping of quantitative trait loci (QTL) identified 41 QTL in the TN populations. Furthermore, of the 20 pairs of epistatic interaction loci detected, approximately one-third were located within the QTL intervals. The use of common markers on different genetic maps and the TN genetic map as a reference enabled us to project QTL from an additional three genetic populations onto the TN genetic map. In summary, we used the TN genetic map of the B. napus genome to identify 46 distinct QTL regions that control seed-oil content on 16 of the 19 linkage groups of B. napus. Of these, 18 were each detected in multiple populations. The present results are of value for ongoing efforts to breed rapeseed with high oil content, and alignment of the QTL makes an important contribution to the development of an integrative system for genetic studies of rapeseed.
Nelson, Matthew N.; Moolhuijzen, Paula M.; Boersma, Jeffrey G.; Chudy, Magdalena; Lesniewska, Karolina; Bellgard, Matthew; Oliver, Richard P.; Święcicki, Wojciech; Wolko, Bogdan; Cowling, Wallace A.; Ellwood, Simon R.
2010-01-01
We have developed a dense reference genetic map of Lupinus angustifolius (2n = 40) based on a set of 106 publicly available recombinant inbred lines derived from a cross between domesticated and wild parental lines. The map comprised 1090 loci in 20 linkage groups and three small clusters, drawing together data from several previous mapping publications plus almost 200 new markers, of which 63 were gene-based markers. A total of 171 mainly gene-based, sequence-tagged site loci served as bridging points for comparing the Lu. angustifolius genome with the genome sequence of the model legume, Lotus japonicus via BLASTn homology searching. Comparative analysis indicated that the genomes of Lu. angustifolius and Lo. japonicus are highly diverged structurally but with significant regions of conserved synteny including the region of the Lu. angustifolius genome containing the pod-shatter resistance gene, lentus. We discuss the potential of synteny analysis for identifying candidate genes for domestication traits in Lu. angustifolius and in improving our understanding of Fabaceae genome evolution. PMID:20133394
A proteome-scale map of the human interactome network
Rolland, Thomas; Taşan, Murat; Charloteaux, Benoit; Pevzner, Samuel J.; Zhong, Quan; Sahni, Nidhi; Yi, Song; Lemmens, Irma; Fontanillo, Celia; Mosca, Roberto; Kamburov, Atanas; Ghiassian, Susan D.; Yang, Xinping; Ghamsari, Lila; Balcha, Dawit; Begg, Bridget E.; Braun, Pascal; Brehme, Marc; Broly, Martin P.; Carvunis, Anne-Ruxandra; Convery-Zupan, Dan; Corominas, Roser; Coulombe-Huntington, Jasmin; Dann, Elizabeth; Dreze, Matija; Dricot, Amélie; Fan, Changyu; Franzosa, Eric; Gebreab, Fana; Gutierrez, Bryan J.; Hardy, Madeleine F.; Jin, Mike; Kang, Shuli; Kiros, Ruth; Lin, Guan Ning; Luck, Katja; MacWilliams, Andrew; Menche, Jörg; Murray, Ryan R.; Palagi, Alexandre; Poulin, Matthew M.; Rambout, Xavier; Rasla, John; Reichert, Patrick; Romero, Viviana; Ruyssinck, Elien; Sahalie, Julie M.; Scholz, Annemarie; Shah, Akash A.; Sharma, Amitabh; Shen, Yun; Spirohn, Kerstin; Tam, Stanley; Tejeda, Alexander O.; Trigg, Shelly A.; Twizere, Jean-Claude; Vega, Kerwin; Walsh, Jennifer; Cusick, Michael E.; Xia, Yu; Barabási, Albert-László; Iakoucheva, Lilia M.; Aloy, Patrick; De Las Rivas, Javier; Tavernier, Jan; Calderwood, Michael A.; Hill, David E.; Hao, Tong; Roth, Frederick P.; Vidal, Marc
2014-01-01
SUMMARY Just as reference genome sequences revolutionized human genetics, reference maps of interactome networks will be critical to fully understand genotype-phenotype relationships. Here, we describe a systematic map of ~14,000 high-quality human binary protein-protein interactions. At equal quality, this map is ~30% larger than what is available from small-scale studies published in the literature in the last few decades. While currently available information is highly biased and only covers a relatively small portion of the proteome, our systematic map appears strikingly more homogeneous, revealing a “broader” human interactome network than currently appreciated. The map also uncovers significant inter-connectivity between known and candidate cancer gene products, providing unbiased evidence for an expanded functional cancer landscape, while demonstrating how high quality interactome models will help “connect the dots” of the genomic revolution. PMID:25416956
ERIC Educational Resources Information Center
Kuna, Jason
2001-01-01
This article explores the impact of the mapping work of the Human Genome Project on individuals with mental retardation and the negative effects of genetic testing. The potential to identify disabilities and the concept of eugenics are discussed, along with ethical issues surrounding potential genetic therapies. (Contains references.) (CR)
Konganti, Kranti; Ehrlich, Andre; Rusyn, Ivan; Threadgill, David W
2018-06-07
Multi-parental recombinant inbred populations, such as the Collaborative Cross (CC) mouse genetic reference population, are increasingly being used for analysis of quantitative trait loci (QTL). However specialized analytic software for these complex populations is typically built in R that works only on command-line, which limits the utility of these powerful resources for many users. To overcome analytic limitations, we developed gQTL, a web accessible, simple graphical user interface application based on the DOQTL platform in R to perform QTL mapping using data from CC mice. Copyright © 2018, G3: Genes, Genomes, Genetics.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro
Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro; ...
2017-03-10
Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
Genetics Home Reference: pilomatricoma
... F, Palacios J. beta-catenin expression in pilomatrixomas. Relationship with beta-catenin gene mutations and comparison with ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: trichothiodystrophy
... trichothiodystrophy and Cockayne syndrome: a complex genotype-phenotype relationship. Neuroscience. 2007 Apr 14;145(4):1388-96. ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: microphthalmia
... CR, Ye M, Garcha K, Bigot K, Perera AG, Staehling-Hampton K, Mema SC, Chanda B, Mushegian ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: Crohn disease
... or indirectly, to abnormal inflammation. However, the exact relationship between these factors and Crohn disease risk remains ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: Cowden syndrome
... MS, Eng C. A clinical scoring system for selection of patients for PTEN mutation testing is proposed ... should consult with a qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map ...
Genetics Home Reference: Laron syndrome
... AL. Obesity, diabetes and cancer: insight into the relationship from a cohort with growth hormone receptor deficiency. ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: xeroderma pigmentosum
... trichothiodystrophy and Cockayne syndrome: a complex genotype-phenotype relationship. Neuroscience. 2007 Apr 14;145(4):1388-96. ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: Miyoshi myopathy
... Itoyama Y. Dysferlin mutations in Japanese Miyoshi myopathy: relationship to phenotype. Neurology. 2003 Jun 10;60(11): ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: erythromelalgia
... Bennett DL, Wood JN, Kinali M. Novel mutations mapping to the fourth sodium channel domain of Nav1. ... and late-onset inherited erythromelalgia: genotype-phenotype correlation. Brain. 2009 Jul;132(Pt 7):1711-22. doi: ...
Genetics Home Reference: Lowe syndrome
... inheritance is that fathers cannot pass X-linked traits to their sons. In some cases of Lowe ... should consult with a qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map ...
Genetics Home Reference: Danon disease
... inheritance is that fathers cannot pass X-linked traits to their sons. Related Information What does it ... should consult with a qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map ...
Genetics Home Reference: frontometaphyseal dysplasia
... inheritance is that fathers cannot pass X-linked traits to their sons. Related Information What does it ... should consult with a qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map ...
Genetics Home Reference: Tourette syndrome
... Rasin MR, Gunel M, Davis NR, Ercan-Sencicek AG, Guez DH, Spertus JA, Leckman JF, Dure LS ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: bradyopsia
... 75-8. Citation on PubMed Vincent A, Robson AG, Holder GE. Pathognomonic (diagnostic) ERGs. A review and ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: Clouston syndrome
... M, Nakamura M, Farooq M, Fujikawa H, Kibbi AG, Ito M, Dahdah M, Matta M, Diab H, ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) are capable of providing the highest level of genome coverage for genomic and genetic analysis because of their abundance and relatively even distribution in the genome. Such a capacity, however, cannot be achieved without an efficient genotyping platform such ...
Srivastava, Rishi; Singh, Mohar; Bajaj, Deepak; Parida, Swarup K.
2016-01-01
Development and large-scale genotyping of user-friendly informative genome/gene-derived InDel markers in natural and mapping populations is vital for accelerating genomics-assisted breeding applications of chickpea with minimal resource expenses. The present investigation employed a high-throughput whole genome next-generation resequencing strategy in low and high pod number parental accessions and homozygous individuals constituting the bulks from each of two inter-specific mapping populations [(Pusa 1103 × ILWC 46) and (Pusa 256 × ILWC 46)] to develop non-erroneous InDel markers at a genome-wide scale. Comparing these high-quality genomic sequences, 82,360 InDel markers with reference to kabuli genome and 13,891 InDel markers exhibiting differentiation between low and high pod number parental accessions and bulks of aforementioned mapping populations were developed. These informative markers were structurally and functionally annotated in diverse coding and non-coding sequence components of genome/genes of kabuli chickpea. The functional significance of regulatory and coding (frameshift and large-effect mutations) InDel markers for establishing marker-trait linkages through association/genetic mapping was apparent. The markers detected a greater amplification (97%) and intra-specific polymorphic potential (58–87%) among a diverse panel of cultivated desi, kabuli, and wild accessions even by using a simpler cost-efficient agarose gel-based assay implicating their utility in large-scale genetic analysis especially in domesticated chickpea with narrow genetic base. Two high-density inter-specific genetic linkage maps generated using aforesaid mapping populations were integrated to construct a consensus 1479 InDel markers-anchored high-resolution (inter-marker distance: 0.66 cM) genetic map for efficient molecular mapping of major QTLs governing pod number and seed yield per plant in chickpea. Utilizing these high-density genetic maps as anchors, three major genomic regions harboring each of pod number and seed yield robust QTLs (15–28% phenotypic variation explained) were identified on chromosomes 2, 4, and 6. The integration of genetic and physical maps at these QTLs mapped on chromosomes scaled-down the long major QTL intervals into high-resolution short pod number and seed yield robust QTL physical intervals (0.89–2.94 Mb) which were essentially got validated in multiple genetic backgrounds of two chickpea mapping populations. The genome-wide InDel markers including natural allelic variants and genomic loci/genes delineated at major six especially in one colocalized novel congruent robust pod number and seed yield robust QTLs mapped on a high-density consensus genetic map were found most promising in chickpea. These functionally relevant molecular tags can drive marker-assisted genetic enhancement to develop high-yielding cultivars with increased seed/pod number and yield in chickpea. PMID:27695461
Construction of a map-based reference genome sequence for barley, Hordeum vulgare L.
Beier, Sebastian; Himmelbach, Axel; Colmsee, Christian; Zhang, Xiao-Qi; Barrero, Roberto A.; Zhang, Qisen; Li, Lin; Bayer, Micha; Bolser, Daniel; Taudien, Stefan; Groth, Marco; Felder, Marius; Hastie, Alex; Šimková, Hana; Staňková, Helena; Vrána, Jan; Chan, Saki; Muñoz-Amatriaín, María; Ounit, Rachid; Wanamaker, Steve; Schmutzer, Thomas; Aliyeva-Schnorr, Lala; Grasso, Stefano; Tanskanen, Jaakko; Sampath, Dharanya; Heavens, Darren; Cao, Sujie; Chapman, Brett; Dai, Fei; Han, Yong; Li, Hua; Li, Xuan; Lin, Chongyun; McCooke, John K.; Tan, Cong; Wang, Songbo; Yin, Shuya; Zhou, Gaofeng; Poland, Jesse A.; Bellgard, Matthew I.; Houben, Andreas; Doležel, Jaroslav; Ayling, Sarah; Lonardi, Stefano; Langridge, Peter; Muehlbauer, Gary J.; Kersey, Paul; Clark, Matthew D.; Caccamo, Mario; Schulman, Alan H.; Platzer, Matthias; Close, Timothy J.; Hansson, Mats; Zhang, Guoping; Braumann, Ilka; Li, Chengdao; Waugh, Robbie; Scholz, Uwe; Stein, Nils; Mascher, Martin
2017-01-01
Barley (Hordeum vulgare L.) is a cereal grass mainly used as animal fodder and raw material for the malting industry. The map-based reference genome sequence of barley cv. ‘Morex’ was constructed by the International Barley Genome Sequencing Consortium (IBSC) using hierarchical shotgun sequencing. Here, we report the experimental and computational procedures to (i) sequence and assemble more than 80,000 bacterial artificial chromosome (BAC) clones along the minimum tiling path of a genome-wide physical map, (ii) find and validate overlaps between adjacent BACs, (iii) construct 4,265 non-redundant sequence scaffolds representing clusters of overlapping BACs, and (iv) order and orient these BAC clusters along the seven barley chromosomes using positional information provided by dense genetic maps, an optical map and chromosome conformation capture sequencing (Hi-C). Integrative access to these sequence and mapping resources is provided by the barley genome explorer (BARLEX). PMID:28448065
The cancer transcriptome is shaped by genetic changes, variation in gene transcription, mRNA processing, editing and stability, and the cancer microbiome. Deciphering this variation and understanding its implications on tumorigenesis requires sophisticated computational analyses. Most RNA-Seq analyses rely on methods that first map short reads to a reference genome, and then compare them to annotated transcripts or assemble them. However, this strategy can be limited when the cancer genome is substantially different than the reference or for detecting sequences from the cancer microbiome.
Genetics Home Reference: fish-eye disease
... levels of HDL cholesterol and atherosclerosis, a variable relationship--a review of LCAT deficiency. Vasc Health Risk ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: X-linked juvenile retinoschisis
... juvenile retinoschisis (XLRS): a review of genotype-phenotype relationships. Semin Ophthalmol. 2013 Sep-Nov;28(5-6): ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: Fukuyama congenital muscular dystrophy
... Fujii T, Aiba H, Toda T. Seizure-genotype relationship in Fukuyama-type congenital muscular dystrophy. Brain Dev. ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: glutaric acidemia type II
... E, Bross P, Skovby F, Gregersen N. Clear relationship between ETF/ETFDH genotype and phenotype in patients ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: complete LCAT deficiency
... levels of HDL cholesterol and atherosclerosis, a variable relationship--a review of LCAT deficiency. Vasc Health Risk ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: neurohypophyseal diabetes insipidus
... G, Colao A. Central diabetes insipidus and autoimmunity: relationship between the occurrence of antibodies to arginine vasopressin- ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: mucopolysaccharidosis type II
... inheritance is that fathers cannot pass X-linked traits to their sons. Related Information What does it ... should consult with a qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map ...
Genetics Home Reference: familial dilated cardiomyopathy
... inheritance is that fathers cannot pass X-linked traits to their sons. Related Information What does it ... should consult with a qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map ...
Genetics Home Reference: phosphoglycerate kinase deficiency
... inheritance is that fathers cannot pass X-linked traits to their sons. Related Information What does it ... should consult with a qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map ...
Genetics Home Reference: Melnick-Needles syndrome
... inheritance is that fathers cannot pass X-linked traits to their sons. Related Information What does it ... should consult with a qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map ...
Genetics Home Reference: encephalocraniocutaneous lipomatosis
... PubMed or Free article on PubMed Central Hunter AG. Oculocerebrocutaneous and encephalocraniocutaneous lipomatosis syndromes: blind men and ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: Dandy-Walker malformation
... KA, Lehmann OJ, Hudgins L, Chizhikov VV, Bassuk AG, Ades LC, Krantz ID, Dobyns WB, Millen KJ. ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: McLeod neuroacanthocytosis syndrome
... W, Watt JM, Corbett AJ, Hamdalla HH, Marshall AG, Sutton I, Dotti MT, Malandrini A, Walker RH, ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: vitiligo
... PubMed or Free article on PubMed Central Smith AG, Sturm RA. Multiple genes and locus interactions in ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: Knobloch syndrome
... for This Page Fukai N, Eklund L, Marneros AG, Oh SP, Keene DR, Tamarkin L, Niemelä M, ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: Coffin-Lowry syndrome
... 27(1):85-9. Citation on PubMed Hunter AG. Coffin-Lowry syndrome: a 20-year follow-up ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: essential pentosuria
... MacCoss MJ, Levy-Lahad E, King MC, Motulsky AG. Garrod's fourth inborn error of metabolism solved by ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: Klippel-Trenaunay syndrome
... 6):291-8. Review. Citation on PubMed Jacob AG, Driscoll DJ, Shaughnessy WJ, Stanson AW, Clay RP, ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: age-related macular degeneration
... A, Tosakulwong N, Truitt BJ, Tsironi EE, Uitterlinden AG, van Duijn CM, Vijaya L, Vingerling JR, Vithana ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: pseudohypoaldosteronism type 2
... Thameem F, Al-Shahrouri HZ, Radhakrishnan J, Gharavi AG, Goilav B, Lifton RP. Mutations in kelch-like ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: autosomal dominant vitreoretinochoroidopathy
... RE, Davidson AE, Urquhart JE, Holder GE, Robson AG, Moore AT, Keefe RO, Black GC, Manson FD. ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: Liddle syndrome
... Free article on PubMed Central Lifton RP, Gharavi AG, Geller DS. Molecular mechanisms of human hypertension. Cell. ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: actin-accumulation myopathy
... F, Sewry C, Hughes I, Sutphen R, Lacson AG, Swoboda KJ, Vigneron J, Wallgren-Pettersson C, Beggs ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: Dent disease
... 20. Review. Citation on PubMed Wrong OM, Norden AG, Feest TG. Dent's disease; a familial proximal renal ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Molecular mapping of chromosomes 17 and X. Progress report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barker, D.F.
1989-12-31
The basic aims of this project are the construction of high density genetic maps of chromosomes 17 and X and the utilization of these maps for the subsequent isolation of a set of physically overlapping DNA segment clones. The strategy depends on the utilization of chromosome specific libraries of small (1--15 kb) segments from each of the two chromosomes. Since the time of submission of our previous progress report, we have refined the genetic map of markers which we had previously isolated for chromosome 17. We have completed our genetic mapping in CEPH reference and NF1 families of 15 markersmore » in the pericentric region of chromosome 17. Physical mapping results with three probes, were shown be in very close genetic proximity to the NF1 gene, with respect to two translocation breakpoints which disrupt the activity of the gene. All three of the probes were found to lie between the centromere and the most proximal translocation breakpoint, providing important genetic markers proximal to the NF1 gene. Our primary focus has shifted to the X chromosome. We have isolated an additional 30 polymorphic markers, bringing the total number we have isolated to over 80. We have invested substantial effort in characterizing the polymorphisms at each of these loci and constructed plasmid subclones which reveal the polymorphisms for nearly all of the loci. These subclones are of practical value in that they produce simpler and stronger patterns on human genomic Southern blots, thus improving the efficiency of the genetic mapping experiments. These subclones may also be of value for deriving DNA sequence information at each locus, necessary for establishing polymerase chain reaction primers specific for each locus. Such information would allow the use of each locus as a sequence tagged site.« less
Molecular mapping of chromosomes 17 and X
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barker, D.F.
1989-01-01
The basic aims of this project are the construction of high density genetic maps of chromosomes 17 and X and the utilization of these maps for the subsequent isolation of a set of physically overlapping DNA segment clones. The strategy depends on the utilization of chromosome specific libraries of small (1--15 kb) segments from each of the two chromosomes. Since the time of submission of our previous progress report, we have refined the genetic map of markers which we had previously isolated for chromosome 17. We have completed our genetic mapping in CEPH reference and NF1 families of 15 markersmore » in the pericentric region of chromosome 17. Physical mapping results with three probes, were shown be in very close genetic proximity to the NF1 gene, with respect to two translocation breakpoints which disrupt the activity of the gene. All three of the probes were found to lie between the centromere and the most proximal translocation breakpoint, providing important genetic markers proximal to the NF1 gene. Our primary focus has shifted to the X chromosome. We have isolated an additional 30 polymorphic markers, bringing the total number we have isolated to over 80. We have invested substantial effort in characterizing the polymorphisms at each of these loci and constructed plasmid subclones which reveal the polymorphisms for nearly all of the loci. These subclones are of practical value in that they produce simpler and stronger patterns on human genomic Southern blots, thus improving the efficiency of the genetic mapping experiments. These subclones may also be of value for deriving DNA sequence information at each locus, necessary for establishing polymerase chain reaction primers specific for each locus. Such information would allow the use of each locus as a sequence tagged site.« less
Holtz, Yan; Ardisson, Morgane; Ranwez, Vincent; Besnard, Alban; Leroy, Philippe; Poux, Gérard; Roumet, Pierre; Viader, Véronique; Santoni, Sylvain; David, Jacques
2016-01-01
Targeted sequence capture is a promising technology which helps reduce costs for sequencing and genotyping numerous genomic regions in large sets of individuals. Bait sequences are designed to capture specific alleles previously discovered in parents or reference populations. We studied a set of 135 RILs originating from a cross between an emmer cultivar (Dic2) and a recent durum elite cultivar (Silur). Six thousand sequence baits were designed to target Dic2 vs. Silur polymorphisms discovered in a previous RNAseq study. These baits were exposed to genomic DNA of the RIL population. Eighty percent of the targeted SNPs were recovered, 65% of which were of high quality and coverage. The final high density genetic map consisted of more than 3,000 markers, whose genetic and physical mapping were consistent with those obtained with large arrays. PMID:27171472
A draft physical map of a D-genome cotton species (Gossypium raimondii)
2010-01-01
Background Genetically anchored physical maps of large eukaryotic genomes have proven useful both for their intrinsic merit and as an adjunct to genome sequencing. Cultivated tetraploid cottons, Gossypium hirsutum and G. barbadense, share a common ancestor formed by a merger of the A and D genomes about 1-2 million years ago. Toward the long-term goal of characterizing the spectrum of diversity among cotton genomes, the worldwide cotton community has prioritized the D genome progenitor Gossypium raimondii for complete sequencing. Results A whole genome physical map of G. raimondii, the putative D genome ancestral species of tetraploid cottons was assembled, integrating genetically-anchored overgo hybridization probes, agarose based fingerprints and 'high information content fingerprinting' (HICF). A total of 13,662 BAC-end sequences and 2,828 DNA probes were used in genetically anchoring 1585 contigs to a cotton consensus genetic map, and 370 and 438 contigs, respectively to Arabidopsis thaliana (AT) and Vitis vinifera (VV) whole genome sequences. Conclusion Several lines of evidence suggest that the G. raimondii genome is comprised of two qualitatively different components. Much of the gene rich component is aligned to the Arabidopsis and Vitis vinifera genomes and shows promise for utilizing translational genomic approaches in understanding this important genome and its resident genes. The integrated genetic-physical map is of value both in assembling and validating a planned reference sequence. PMID:20569427
Lu, Jiangjie; Liu, Yuyang; Xu, Jing; Mei, Ziwei; Shi, Yujun; Liu, Pengli; He, Jianbo; Wang, Xiaotong; Meng, Yijun; Feng, Shangguo; Shen, Chenjia; Wang, Huizhong
2018-01-01
Plants of the Dendrobium genus are orchids with not only ornamental value but also high medicinal value. To understand the genetic basis of variations in active ingredients of the stem total polysaccharide contents (STPCs) among different Dendrobium species, it is of paramount importance to understand the mechanism of STPC formation and identify genes affecting its process at the whole genome level. Here, we report the first high-density single-nucleotide polymorphism (SNP) integrated genetic map with a good genome coverage of Dendrobium. The specific-locus amplified fragment sequencing (SLAF-seq) technology led to identification of 7,013,400 SNPs from 1,503,626 high-quality SLAF markers from two parents (Dendrobium moniliforme ♀ × Dendrobium officinale ♂) and their interspecific F1 hybrid population. The final genetic map contained 8, 573 SLAF markers, covering 19 linkage groups (LGs). This genetic map spanned a length of 2,737.49 cM, where the average distance between markers is 0.32 cM. In total, 5 quantitative trait loci (QTL) related to STPC were identified, 3 of which have candidate genes within the confidence intervals of these stable QTLs based on the D. officinale genome sequence. This study will build a foundation up for the mapping of other medicinal-related traits and provide an important reference for the molecular breeding of these Chinese herb. PMID:29636767
Construction of an integrated genetic map for Capsicum baccatum L.
Moulin, M M; Rodrigues, R; Ramos, H C C; Bento, C S; Sudré, C P; Gonçalves, L S A; Viana, A P
2015-06-18
Capsicum baccatum L. is one of the five Capsicum domesticated species and has multiple uses in the food, pharmaceutical and cosmetic industries. This species is also a valuable source of genes for chili pepper breeding, especially genes for disease resistance and fruit quality. However, knowledge of the genetic structure of C. baccatum is limited. A reference map for C. baccatum (2n = 2x = 24) based on 42 microsatellite, 85 inter-simple sequence repeat, and 56 random amplified polymorphic DNA markers was constructed using an F2 population consisting of 203 individuals. The map was generated using the JoinMap software (version 4.0) and the linkage groups were formed and ordered using a LOD score of 3.0 and maximum of 40% recombination. The genetic map consisted of 12 major and four minor linkage groups covering a total genome distance of 2547.5 cM with an average distance of 14.25 cM between markers. Of the 152 pairs of microsatellite markers available for Capsicum annuum, 62 were successfully transferred to C. baccatum, generating polymorphism. Forty-two of these markers were mapped, allowing the introduction of C. baccatum in synteny studies with other species of the genus Capsicum.
2012-01-01
Background The turbot (Scophthalmus maximus) is a relevant species in European aquaculture. The small turbot genome provides a source for genomics strategies to use in order to understand the genetic basis of productive traits, particularly those related to sex, growth and pathogen resistance. Genetic maps represent essential genomic screening tools allowing to localize quantitative trait loci (QTL) and to identify candidate genes through comparative mapping. This information is the backbone to develop marker-assisted selection (MAS) programs in aquaculture. Expressed sequenced tag (EST) resources have largely increased in turbot, thus supplying numerous type I markers suitable for extending the previous linkage map, which was mostly based on anonymous loci. The aim of this study was to construct a higher-resolution turbot genetic map using EST-linked markers, which will turn out to be useful for comparative mapping studies. Results A consensus gene-enriched genetic map of the turbot was constructed using 463 SNP and microsatellite markers in nine reference families. This map contains 438 markers, 180 EST-linked, clustered at 24 linkage groups. Linkage and comparative genomics evidences suggested additional linkage group fusions toward the consolidation of turbot map according to karyotype information. The linkage map showed a total length of 1402.7 cM with low average intermarker distance (3.7 cM; ~2 Mb). A global 1.6:1 female-to-male recombination frequency (RF) ratio was observed, although largely variable among linkage groups and chromosome regions. Comparative sequence analysis revealed large macrosyntenic patterns against model teleost genomes, significant hits decreasing from stickleback (54%) to zebrafish (20%). Comparative mapping supported particular chromosome rearrangements within Acanthopterygii and aided to assign unallocated markers to specific turbot linkage groups. Conclusions The new gene-enriched high-resolution turbot map represents a useful genomic tool for QTL identification, positional cloning strategies, and future genome assembling. This map showed large synteny conservation against model teleost genomes. Comparative genomics and data mining from landmarks will provide straightforward access to candidate genes, which will be the basis for genetic breeding programs and evolutionary studies in this species. PMID:22747677
Fang, Xiaomei; Dong, Kongjun; Wang, Xiaoqin; Liu, Tianpeng; He, Jihong; Ren, Ruiyu; Zhang, Lei; Liu, Rui; Liu, Xueying; Li, Man; Huang, Mengzhu; Zhang, Zhengsheng; Yang, Tianyu
2016-05-04
Foxtail millet [Setaria italica (L.) P. Beauv.], a crop of historical importance in China, has been adopted as a model crop for studying C-4 photosynthesis, stress biology and biofuel traits. Construction of a high density genetic map and identification of stable quantitative trait loci (QTL) lay the foundation for marker-assisted selection for agronomic traits and yield improvement. A total of 10598 SSR markers were developed according to the reference genome sequence of foxtail millet cultivar 'Yugu1'. A total of 1013 SSR markers showing polymorphism between Yugu1 and Longgu7 were used to genotype 167 individuals from a Yugu1 × Longgu7 F2 population, and a high density genetic map was constructed. The genetic map contained 1035 loci and spanned 1318.8 cM with an average distance of 1.27 cM between adjacent markers. Based on agronomic and yield traits identified in 2 years, 29 QTL were identified for 11 traits with combined analysis and single environment analysis. These QTL explained from 7.0 to 14.3 % of phenotypic variation. Favorable QTL alleles for peduncle length originated from Longgu7 whereas favorable alleles for the other traits originated from Yugu1 except for qLMS6.1. New SSR markers, a high density genetic map and QTL identified for agronomic and yield traits lay the ground work for functional gene mapping, map-based cloning and marker-assisted selection in foxtail millet.
USDA-ARS?s Scientific Manuscript database
Ongoing developments and cost decreases in next-generation sequencing (NGS) technologies have led to an increase in their application, which has greatly enhanced the fields of genetics and genomics. Mapping sequence reads onto a reference genome is a fundamental step in the analysis of NGS data. Eff...
Genetics Home Reference: tarsal-carpal coalition syndrome
... Belmonte JC, Choe S. Structural basis of BMP signalling inhibition by the cystine knot protein Noggin. Nature. 2002 ... Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & Players ...
Genetics Home Reference: bare lymphocyte syndrome type I
... R. ABC proteins in antigen translocation and viral inhibition. Nat Chem Biol. 2010 Aug;6(8):572- ... Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & Players ...
Genetics Home Reference: familial adenomatous polyposis
... Järvinen HJ, Peltomäki P. The complex genotype-phenotype relationship in familial adenomatous polyposis. Eur J Gastroenterol Hepatol. ... for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & ...
Genetics Home Reference: Ohdo syndrome, Maat-Kievit-Brunner type
... inheritance is that fathers cannot pass X-linked traits to their sons. Related Information What does it ... should consult with a qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map ...
Genetics Home Reference: action myoclonus-renal failure syndrome
... Vears DF, Franceschetti S, Canafoglia L, Wallace R, Bassuk AG, Power DA, Tassinari CA, Andermann E, Lehesjoki AE, ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: cone-rod dystrophy
... Citation on PubMed Sergouniotis PI, McKibbin M, Robson AG, Bolz HJ, De Baere E, Müller PL, Heller ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: lysosomal acid lipase deficiency
... Cegielska J, Whitley CB, Eckert S, Valayannopoulos V, Quinn AG. Clinical Features of Lysosomal Acid Lipase Deficiency. J ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: scalp-ear-nipple syndrome
... ear nipple syndrome Sources for This Page Marneros AG, Beck AE, Turner EH, McMillin MJ, Edwards MJ, ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: hyperphosphatemic familial tumoral calcinosis
... PubMed Central Ichikawa S, Baujat G, Seyahi A, Garoufali AG, Imel EA, Padgett LR, Austin AM, Sorenson AH, ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: SYNGAP1-related intellectual disability
... TK, Ozkan ED, Shi Y, Reish NJ, Almonte AG, Miller BH, Wiltgen BJ, Miller CA, Xu X, ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: leptin receptor deficiency
... Ferraz-Amaro I, Dattani MT, Ercan O, Myhre AG, Retterstol L, Stanhope R, Edge JA, McKenzie S, Lessan ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: proximal 18q deletion syndrome
... Feenstra I, Vissers LE, Orsel M, van Kessel AG, Brunner HG, Veltman JA, van Ravenswaaij-Arts CM. ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: spastic paraplegia type 4
... 2005 Jul 7. Review. Citation on PubMed Yip AG, Dürr A, Marchuk DA, Ashley-Koch A, Hentati ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Liu, Xiao-Ping; Gao, Bao-Zhen; Han, Feng-Qing; Fang, Zhi-Yuan; Yang, Li-Mei; Zhuang, Mu; Lv, Hong-Hao; Liu, Yu-Mei; Li, Zhan-Sheng; Cai, Cheng-Cheng; Yu, Hai-Long; Li, Zhi-Yuan; Zhang, Yang-Yong
2017-03-14
Due to its variegated and colorful leaves, ornamental kale (Brassica oleracea L. var. acephala) has become a popular ornamental plant. In this study, we report the fine mapping and analysis of a candidate purple leaf gene using a backcross population and an F 2 population derived from two parental lines: W1827 (with white leaves) and P1835 (with purple leaves). Genetic analysis indicated that the purple leaf trait is controlled by a single dominant gene, which we named BoPr. Using markers developed based on the reference genome '02-12', the BoPr gene was preliminarily mapped to a 280-kb interval of chromosome C09, with flanking markers M17 and BoID4714 at genetic distances of 4.3 cM and 1.5 cM, respectively. The recombination rate within this interval is almost 12 times higher than the usual level, which could be caused by assembly error for reference genome '02-12' at this interval. Primers were designed based on 'TO1000', another B. oleracea reference genome. Among the newly designed InDel markers, BRID485 and BRID490 were found to be the closest to BoPr, flanking the gene at genetic distances of 0.1 cM and 0.2 cM, respectively; the interval between the two markers is 44.8 kb (reference genome 'TO1000'). Seven annotated genes are located within the 44.8 kb genomic region, of which only Bo9g058630 shows high homology to AT5G42800 (dihydroflavonol reductase), which was identified as a candidate gene for BoPr. Blast analysis revealed that this 44.8 kb interval is located on an unanchored scaffold (Scaffold000035_P2) of '02-12', confirming the existence of assembly error at the interval between M17 and BoID4714 for reference genome '02-12'. This study identified a candidate gene for BoPr and lays a foundation for the cloning and functional analysis of this gene.
Chen, Zhangguo; Gowan, Katherine; Leach, Sonia M; Viboolsittiseri, Sawanee S; Mishra, Ameet K; Kadoishi, Tanya; Diener, Katrina; Gao, Bifeng; Jones, Kenneth; Wang, Jing H
2016-10-21
Whole genome next generation sequencing (NGS) is increasingly employed to detect genomic rearrangements in cancer genomes, especially in lymphoid malignancies. We recently established a unique mouse model by specifically deleting a key non-homologous end-joining DNA repair gene, Xrcc4, and a cell cycle checkpoint gene, Trp53, in germinal center B cells. This mouse model spontaneously develops mature B cell lymphomas (termed G1XP lymphomas). Here, we attempt to employ whole genome NGS to identify novel structural rearrangements, in particular inter-chromosomal translocations (CTXs), in these G1XP lymphomas. We sequenced six lymphoma samples, aligned our NGS data with mouse reference genome (in C57BL/6J (B6) background) and identified CTXs using CREST algorithm. Surprisingly, we detected widespread CTXs in both lymphomas and wildtype control samples, majority of which were false positive and attributable to different genetic backgrounds. In addition, we validated our NGS pipeline by sequencing multiple control samples from distinct tissues of different genetic backgrounds of mouse (B6 vs non-B6). Lastly, our studies showed that widespread false positive CTXs can be generated by simply aligning sequences from different genetic backgrounds of mouse. We conclude that mapping and alignment with reference genome might not be a preferred method for analyzing whole-genome NGS data obtained from a genetic background different from reference genome. Given the complex genetic background of different mouse strains or the heterogeneity of cancer genomes in human patients, in order to minimize such systematic artifacts and uncover novel CTXs, a preferred method might be de novo assembly of personalized normal control genome and cancer cell genome, instead of mapping and aligning NGS data to mouse or human reference genome. Thus, our studies have critical impact on the manner of data analysis for cancer genomics.
Genetics Home Reference: Pallister-Killian mosaic syndrome
... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & Players U.S. Department of Health & Human Services National Institutes of Health National Library of ...
Genetics Home Reference: distal 18q deletion syndrome
... Veltman JA, van Ravenswaaij-Arts CM. Genotype-phenotype mapping of chromosome 18q deletions by high-resolution array ... L, Pihko H. 18q deletions: clinical, molecular, and brain MRI findings of 14 individuals. Am J Med ...
Genetics Home Reference: STING-associated vasculopathy with onset in infancy
... Free article on PubMed Central Paludan SR, Bowie AG. Immune sensing of DNA. Immunity. 2013 May 23; ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Genetics Home Reference: myoclonic epilepsy with ragged-red fibers
... Rahman S, Poulton J, Marchington DR, Landon DN, Debono AG, Morgan-Hughes JA, Hanna MG. New phenotypic diversity ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Qian, Wei; Fan, Guiyan; Liu, Dandan; Zhang, Helong; Wang, Xiaowu; Wu, Jian; Xu, Zhaosheng
2017-04-04
Cultivated spinach (Spinacia oleracea L.) is one of the most widely cultivated types of leafy vegetable in the world, and it has a high nutritional value. Spinach is also an ideal plant for investigating the mechanism of sex determination because it is a dioecious species with separate male and female plants. Some reports on the sex labeling and localization of spinach in the study of molecular markers have surfaced. However, there have only been two reports completed on the genetic map of spinach. The lack of rich and reliable molecular markers and the shortage of high-density linkage maps are important constraints in spinach research work. In this study, a high-density genetic map of spinach based on the Specific-locus Amplified Fragment Sequencing (SLAF-seq) technique was constructed; the sex-determining gene was also finely mapped. Through bio-information analysis, 50.75 Gb of data in total was obtained, including 207.58 million paired-end reads. Finally, 145,456 high-quality SLAF markers were obtained, with 27,800 polymorphic markers and 4080 SLAF markers were finally mapped onto the genetic map after linkage analysis. The map spanned 1,125.97 cM with an average distance of 0.31 cM between the adjacent marker loci. It was divided into 6 linkage groups corresponding to the number of spinach chromosomes. Besides, the combination of Bulked Segregation Analysis (BSA) with SLAF-seq technology(super-BSA) was employed to generate the linkage markers with the sex-determining gene. Combined with the high-density genetic map of spinach, the sex-determining gene X/Y was located at the position of the linkage group (LG) 4 (66.98 cM-69.72 cM and 75.48 cM-92.96 cM), which may be the ideal region for the sex-determining gene. A high-density genetic map of spinach based on the SLAF-seq technique was constructed with a backcross (BC 1 ) population (which is the highest density genetic map of spinach reported at present). At the same time, the sex-determining gene X/Y was mapped to LG4 with super-BSA. This map will offer a suitable basis for further study of spinach, such as gene mapping, map-based cloning of Specific genes, quantitative trait locus (QTL) mapping and marker-assisted selection (MAS). It will also provide an efficient reference for studies on the mechanism of sex determination in other dioecious plants.
Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders.
Hamosh, Ada; Scott, Alan F; Amberger, Joanna S; Bocchini, Carol A; McKusick, Victor A
2005-01-01
Online Mendelian Inheritance in Man (OMIM) is a comprehensive, authoritative and timely knowledgebase of human genes and genetic disorders compiled to support human genetics research and education and the practice of clinical genetics. Started by Dr Victor A. McKusick as the definitive reference Mendelian Inheritance in Man, OMIM (http://www.ncbi.nlm.nih.gov/omim/) is now distributed electronically by the National Center for Biotechnology Information, where it is integrated with the Entrez suite of databases. Derived from the biomedical literature, OMIM is written and edited at Johns Hopkins University with input from scientists and physicians around the world. Each OMIM entry has a full-text summary of a genetically determined phenotype and/or gene and has numerous links to other genetic databases such as DNA and protein sequence, PubMed references, general and locus-specific mutation databases, HUGO nomenclature, MapViewer, GeneTests, patient support groups and many others. OMIM is an easy and straightforward portal to the burgeoning information in human genetics.
Completed Genome Sequences of Strains from 36 Serotypes of Salmonella
Robertson, James; Yoshida, Catherine; Gurnik, Simone; Rankin, Marisa
2018-01-01
ABSTRACT We report here the completed closed genome sequences of strains representing 36 serotypes of Salmonella. These genome sequences will provide useful references for understanding the genetic variation between serotypes, particularly as references for mapping of raw reads or to create assemblies of higher quality, as well as to aid in studies of comparative genomics of Salmonella. PMID:29348347
Integrating common and rare genetic variation in diverse human populations.
Altshuler, David M; Gibbs, Richard A; Peltonen, Leena; Altshuler, David M; Gibbs, Richard A; Peltonen, Leena; Dermitzakis, Emmanouil; Schaffner, Stephen F; Yu, Fuli; Peltonen, Leena; Dermitzakis, Emmanouil; Bonnen, Penelope E; Altshuler, David M; Gibbs, Richard A; de Bakker, Paul I W; Deloukas, Panos; Gabriel, Stacey B; Gwilliam, Rhian; Hunt, Sarah; Inouye, Michael; Jia, Xiaoming; Palotie, Aarno; Parkin, Melissa; Whittaker, Pamela; Yu, Fuli; Chang, Kyle; Hawes, Alicia; Lewis, Lora R; Ren, Yanru; Wheeler, David; Gibbs, Richard A; Muzny, Donna Marie; Barnes, Chris; Darvishi, Katayoon; Hurles, Matthew; Korn, Joshua M; Kristiansson, Kati; Lee, Charles; McCarrol, Steven A; Nemesh, James; Dermitzakis, Emmanouil; Keinan, Alon; Montgomery, Stephen B; Pollack, Samuela; Price, Alkes L; Soranzo, Nicole; Bonnen, Penelope E; Gibbs, Richard A; Gonzaga-Jauregui, Claudia; Keinan, Alon; Price, Alkes L; Yu, Fuli; Anttila, Verneri; Brodeur, Wendy; Daly, Mark J; Leslie, Stephen; McVean, Gil; Moutsianas, Loukas; Nguyen, Huy; Schaffner, Stephen F; Zhang, Qingrun; Ghori, Mohammed J R; McGinnis, Ralph; McLaren, William; Pollack, Samuela; Price, Alkes L; Schaffner, Stephen F; Takeuchi, Fumihiko; Grossman, Sharon R; Shlyakhter, Ilya; Hostetter, Elizabeth B; Sabeti, Pardis C; Adebamowo, Clement A; Foster, Morris W; Gordon, Deborah R; Licinio, Julio; Manca, Maria Cristina; Marshall, Patricia A; Matsuda, Ichiro; Ngare, Duncan; Wang, Vivian Ota; Reddy, Deepa; Rotimi, Charles N; Royal, Charmaine D; Sharp, Richard R; Zeng, Changqing; Brooks, Lisa D; McEwen, Jean E
2010-09-02
Despite great progress in identifying genetic variants that influence human disease, most inherited risk remains unexplained. A more complete understanding requires genome-wide studies that fully examine less common alleles in populations with a wide range of ancestry. To inform the design and interpretation of such studies, we genotyped 1.6 million common single nucleotide polymorphisms (SNPs) in 1,184 reference individuals from 11 global populations, and sequenced ten 100-kilobase regions in 692 of these individuals. This integrated data set of common and rare alleles, called 'HapMap 3', includes both SNPs and copy number polymorphisms (CNPs). We characterized population-specific differences among low-frequency variants, measured the improvement in imputation accuracy afforded by the larger reference panel, especially in imputing SNPs with a minor allele frequency of
Loci Contributing to Boric Acid Toxicity in Two Reference Populations of Drosophila melanogaster
Najarro, Michael A.; Hackett, Jennifer L.; Macdonald, Stuart J.
2017-01-01
Populations maintain considerable segregating variation in the response to toxic, xenobiotic compounds. To identify variants associated with resistance to boric acid, a commonly-used household insecticide with a poorly understood mechanism of action, we assayed thousands of individuals from hundreds of strains. Using the Drosophila Synthetic Population Resource (DSPR), a multi-parental population (MPP) of inbred genotypes, we mapped six QTL to short genomic regions containing few protein-coding genes (3–188), allowing us to identify plausible candidate genes underlying resistance to boric acid toxicity. One interval contains multiple genes from the cytochrome P450 family, and we show that ubiquitous RNAi of one of these genes, Cyp9b2, markedly reduces resistance to the toxin. Resistance to boric acid is positively correlated with caffeine resistance. The two phenotypes additionally share a pair of QTL, potentially suggesting a degree of pleiotropy in the genetic control of resistance to these two distinct xenobiotics. Finally, we screened the Drosophila Genetic Reference Panel (DGRP) in an attempt to identify sequence variants within mapped QTL that are associated with boric acid resistance. The approach was largely unsuccessful, with only one QTL showing any associations at QTL-specific 20% False Discovery Rate (FDR) thresholds. Nonetheless, these associations point to a potential candidate gene that can be targeted in future validation efforts. Although the mapping data resulting from the two reference populations do not clearly overlap, our work provides a starting point for further genetic dissection of the processes underlying boric acid toxicity in insects. PMID:28592646
Loci Contributing to Boric Acid Toxicity in Two Reference Populations of Drosophila melanogaster.
Najarro, Michael A; Hackett, Jennifer L; Macdonald, Stuart J
2017-06-07
Populations maintain considerable segregating variation in the response to toxic, xenobiotic compounds. To identify variants associated with resistance to boric acid, a commonly-used household insecticide with a poorly understood mechanism of action, we assayed thousands of individuals from hundreds of strains. Using the Drosophila Synthetic Population Resource (DSPR), a multi-parental population (MPP) of inbred genotypes, we mapped six QTL to short genomic regions containing few protein-coding genes (3-188), allowing us to identify plausible candidate genes underlying resistance to boric acid toxicity. One interval contains multiple genes from the cytochrome P450 family, and we show that ubiquitous RNAi of one of these genes, Cyp9b2 , markedly reduces resistance to the toxin. Resistance to boric acid is positively correlated with caffeine resistance. The two phenotypes additionally share a pair of QTL, potentially suggesting a degree of pleiotropy in the genetic control of resistance to these two distinct xenobiotics. Finally, we screened the Drosophila Genetic Reference Panel (DGRP) in an attempt to identify sequence variants within mapped QTL that are associated with boric acid resistance. The approach was largely unsuccessful, with only one QTL showing any associations at QTL-specific 20% False Discovery Rate (FDR) thresholds. Nonetheless, these associations point to a potential candidate gene that can be targeted in future validation efforts. Although the mapping data resulting from the two reference populations do not clearly overlap, our work provides a starting point for further genetic dissection of the processes underlying boric acid toxicity in insects. Copyright © 2017 Najarro et al.
Molecular mapping of chromosomes 17 and X
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barker, D.F.
1991-01-15
Progress toward the construction of high density genetic maps of chromosomes 17 and X has been made by isolating and characterizing a relatively large set of polymorphic probes for each chromosome and using these probes to construct genetic maps. We have mapped the same polymorphic probes against a series of chromosome breakpoints on X and 17. The probes could be assigned to over 30 physical intervals on the X chromosome and 7 intervals on 17. In many cases, this process resulted in improved characterization of the relative locations of the breakpoints with respect to each other and the definition ofmore » new physical intervals. The strategy for isolation of the polymorphic clones utilized chromosome specific libraries of 1--15 kb segments from each of the two chromosomes. From these libraries, clones were screened for those detecting restriction fragment length polymorphisms. The markers were further characterized, the chromosomal assignments confirmed and in most cases segments of the original probes were subcloned into plasmids to produce probes with improved signal to noise ratios for use in the genetic marker studies. The linkage studies utilize the CEPH reference families and other well-characterized families in our collection which have been used for genetic disease linkage work. Preliminary maps and maps of portions of specific regions of 17 and X are provided. We have nearly completed a map of the 1 megabase Mycoplasma arthritidis genome by applying these techniques to a lambda phage library of its genome. We have found bit mapping to be an efficient means to organize a contiguous set of overlapping clones from a larger genome.« less
Molecular mapping of chromosomes 17 and X. Progress report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Barker, D.F.
1991-01-15
Progress toward the construction of high density genetic maps of chromosomes 17 and X has been made by isolating and characterizing a relatively large set of polymorphic probes for each chromosome and using these probes to construct genetic maps. We have mapped the same polymorphic probes against a series of chromosome breakpoints on X and 17. The probes could be assigned to over 30 physical intervals on the X chromosome and 7 intervals on 17. In many cases, this process resulted in improved characterization of the relative locations of the breakpoints with respect to each other and the definition ofmore » new physical intervals. The strategy for isolation of the polymorphic clones utilized chromosome specific libraries of 1--15 kb segments from each of the two chromosomes. From these libraries, clones were screened for those detecting restriction fragment length polymorphisms. The markers were further characterized, the chromosomal assignments confirmed and in most cases segments of the original probes were subcloned into plasmids to produce probes with improved signal to noise ratios for use in the genetic marker studies. The linkage studies utilize the CEPH reference families and other well-characterized families in our collection which have been used for genetic disease linkage work. Preliminary maps and maps of portions of specific regions of 17 and X are provided. We have nearly completed a map of the 1 megabase Mycoplasma arthritidis genome by applying these techniques to a lambda phage library of its genome. We have found bit mapping to be an efficient means to organize a contiguous set of overlapping@ clones from a larger genome.« less
Applications of the 1000 Genomes Project resources
Zheng-Bradley, Xiangqun
2017-01-01
Abstract The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies. PMID:27436001
Genetics Home Reference: dilated cardiomyopathy with ataxia syndrome
... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA.gov Copyright Privacy Accessibility FOIA Viewers & Players U.S. Department of Health & Human Services National Institutes of Health National Library of ...
Genetics Home Reference: alveolar capillary dysplasia with misalignment of pulmonary veins
... K, Schultz R, Hallam L, McRae D, Nicholson AG, Newbury R, Durham-O'Donnell J, Knight G, ... qualified healthcare professional . About Selection Criteria for Links Data Files & API Site Map Subscribe Customer Support USA. ...
Munger, Steven C.; Raghupathy, Narayanan; Choi, Kwangbom; Simons, Allen K.; Gatti, Daniel M.; Hinerfeld, Douglas A.; Svenson, Karen L.; Keller, Mark P.; Attie, Alan D.; Hibbs, Matthew A.; Graber, Joel H.; Chesler, Elissa J.; Churchill, Gary A.
2014-01-01
Massively parallel RNA sequencing (RNA-seq) has yielded a wealth of new insights into transcriptional regulation. A first step in the analysis of RNA-seq data is the alignment of short sequence reads to a common reference genome or transcriptome. Genetic variants that distinguish individual genomes from the reference sequence can cause reads to be misaligned, resulting in biased estimates of transcript abundance. Fine-tuning of read alignment algorithms does not correct this problem. We have developed Seqnature software to construct individualized diploid genomes and transcriptomes for multiparent populations and have implemented a complete analysis pipeline that incorporates other existing software tools. We demonstrate in simulated and real data sets that alignment to individualized transcriptomes increases read mapping accuracy, improves estimation of transcript abundance, and enables the direct estimation of allele-specific expression. Moreover, when applied to expression QTL mapping we find that our individualized alignment strategy corrects false-positive linkage signals and unmasks hidden associations. We recommend the use of individualized diploid genomes over reference sequence alignment for all applications of high-throughput sequencing technology in genetically diverse populations. PMID:25236449
Li, Min; Li, Yujuan; Wang, Ying; Ma, Xiangjian; Zhang, Yuan; Tan, Feng; Wu, Rongling
2016-01-01
As a salt-tolerant arbor tree species, Salix matsudana plays an important role in afforestation and greening in the coastal areas of China. To select superior Salix varieties that adapt to wide saline areas, it is of paramount importance to understand and identify the mechanisms of salt-tolerance at the level of the whole genome. Here, we describe a high-density genetic linkage map of S. matsudana that represents a good coverage of the Salix genome. An intraspecific F1 hybrid population was established by crossing the salt-sensitive “Yanjiang” variety as the female parent with the salt-tolerant “9901” variety as the male parent. This population, along with its parents, was genotyped by specific length amplified fragment sequencing (SLAF-seq), leading to 277,333 high-quality SLAF markers. By marker analysis, we found that both the parents and offspring were tetraploid. The mean sequencing depth was 53.20-fold for “Yanjiang”, 47.41-fold for “9901”, and 11.02-fold for the offspring. Of the SLAF markers detected, 42,321 are polymorphic with sufficient quality for map construction. The final genetic map was constructed using 6,737 SLAF markers, covering 38 linkage groups (LGs). The genetic map spanned 5,497.45 cM in length, with an average distance of 0.82 cM. As a first high-density genetic map of S. matsudana constructed from salt tolerance-varying varieties, this study will provide a foundation for mapping quantitative trait loci that modulate salt tolerance and resistance in Salix and provide important references for molecular breeding of this important forest tree. PMID:27327501
Picotti, Paola; Clement-Ziza, Mathieu; Lam, Henry; Campbell, David S.; Schmidt, Alexander; Deutsch, Eric W.; Röst, Hannes; Sun, Zhi; Rinner, Oliver; Reiter, Lukas; Shen, Qin; Michaelson, Jacob J.; Frei, Andreas; Alberti, Simon; Kusebauch, Ulrike; Wollscheid, Bernd; Moritz, Robert; Beyer, Andreas; Aebersold, Ruedi
2013-01-01
Complete reference maps or datasets, like the genomic map of an organism, are highly beneficial tools for biological and biomedical research. Attempts to generate such reference datasets for a proteome so far failed to reach complete proteome coverage, with saturation apparent at approximately two thirds of the proteomes tested, even for the most thoroughly characterized proteomes. Here, we used a strategy based on high-throughput peptide synthesis and mass spectrometry to generate a close to complete reference map (97% of the genome-predicted proteins) of the S. cerevisiae proteome. We generated two versions of this mass spectrometric map one supporting discovery- (shotgun) and the other hypothesis-driven (targeted) proteomic measurements. The two versions of the map, therefore, constitute a complete set of proteomic assays to support most studies performed with contemporary proteomic technologies. The reference libraries can be browsed via a web-based repository and associated navigation tools. To demonstrate the utility of the reference libraries we applied them to a protein quantitative trait locus (pQTL) analysis, which requires measurement of the same peptides over a large number of samples with high precision. Protein measurements over a set of 78 S. cerevisiae strains revealed a complex relationship between independent genetic loci, impacting on the levels of related proteins. Our results suggest that selective pressure favors the acquisition of sets of polymorphisms that maintain the stoichiometry of protein complexes and pathways. PMID:23334424
Design of microarray experiments for genetical genomics studies.
Bueno Filho, Júlio S S; Gilmour, Steven G; Rosa, Guilherme J M
2006-10-01
Microarray experiments have been used recently in genetical genomics studies, as an additional tool to understand the genetic mechanisms governing variation in complex traits, such as for estimating heritabilities of mRNA transcript abundances, for mapping expression quantitative trait loci, and for inferring regulatory networks controlling gene expression. Several articles on the design of microarray experiments discuss situations in which treatment effects are assumed fixed and without any structure. In the case of two-color microarray platforms, several authors have studied reference and circular designs. Here, we discuss the optimal design of microarray experiments whose goals refer to specific genetic questions. Some examples are used to illustrate the choice of a design for comparing fixed, structured treatments, such as genotypic groups. Experiments targeting single genes or chromosomic regions (such as with transgene research) or multiple epistatic loci (such as within a selective phenotyping context) are discussed. In addition, microarray experiments in which treatments refer to families or to subjects (within family structures or complex pedigrees) are presented. In these cases treatments are more appropriately considered to be random effects, with specific covariance structures, in which the genetic goals relate to the estimation of genetic variances and the heritability of transcriptional abundances.
A first generation BAC-based physical map of the rainbow trout genome
Palti, Yniv; Luo, Ming-Cheng; Hu, Yuqin; Genet, Carine; You, Frank M; Vallejo, Roger L; Thorgaard, Gary H; Wheeler, Paul A; Rexroad, Caird E
2009-01-01
Background Rainbow trout (Oncorhynchus mykiss) are the most-widely cultivated cold freshwater fish in the world and an important model species for many research areas. Coupling great interest in this species as a research model with the need for genetic improvement of aquaculture production efficiency traits justifies the continued development of genomics research resources. Many quantitative trait loci (QTL) have been identified for production and life-history traits in rainbow trout. A bacterial artificial chromosome (BAC) physical map is needed to facilitate fine mapping of QTL and the selection of positional candidate genes for incorporation in marker-assisted selection (MAS) for improving rainbow trout aquaculture production. This resource will also facilitate efforts to obtain and assemble a whole-genome reference sequence for this species. Results The physical map was constructed from DNA fingerprinting of 192,096 BAC clones using the 4-color high-information content fingerprinting (HICF) method. The clones were assembled into physical map contigs using the finger-printing contig (FPC) program. The map is composed of 4,173 contigs and 9,379 singletons. The total number of unique fingerprinting fragments (consensus bands) in contigs is 1,185,157, which corresponds to an estimated physical length of 2.0 Gb. The map assembly was validated by 1) comparison with probe hybridization results and agarose gel fingerprinting contigs; and 2) anchoring large contigs to the microsatellite-based genetic linkage map. Conclusion The production and validation of the first BAC physical map of the rainbow trout genome is described in this paper. We are currently integrating this map with the NCCCWA genetic map using more than 200 microsatellites isolated from BAC end sequences and by identifying BACs that harbor more than 300 previously mapped markers. The availability of an integrated physical and genetic map will enable detailed comparative genome analyses, fine mapping of QTL, positional cloning, selection of positional candidate genes for economically important traits and the incorporation of MAS into rainbow trout breeding programs. PMID:19814815
2012-01-01
Background Cultivated peanut or groundnut (Arachis hypogaea L.) is an important oilseed crop with an allotetraploid genome (AABB, 2n = 4x = 40). Both the low level of genetic variation within the cultivated gene pool and its polyploid nature limit the utilization of molecular markers to explore genome structure and facilitate genetic improvement. Nevertheless, a wealth of genetic diversity exists in diploid Arachis species (2n = 2x = 20), which represent a valuable gene pool for cultivated peanut improvement. Interspecific populations have been used widely for genetic mapping in diploid species of Arachis. However, an intraspecific mapping strategy was essential to detect chromosomal rearrangements among species that could be obscured by mapping in interspecific populations. To develop intraspecific reference linkage maps and gain insights into karyotypic evolution within the genus, we comparatively mapped the A- and B-genome diploid species using intraspecific F2 populations. Exploring genome organization among diploid peanut species by comparative mapping will enhance our understanding of the cultivated tetraploid peanut genome. Moreover, new sources of molecular markers that are highly transferable between species and developed from expressed genes will be required to construct saturated genetic maps for peanut. Results A total of 2,138 EST-SSR (expressed sequence tag-simple sequence repeat) markers were developed by mining a tetraploid peanut EST assembly including 101,132 unigenes (37,916 contigs and 63,216 singletons) derived from 70,771 long-read (Sanger) and 270,957 short-read (454) sequences. A set of 97 SSR markers were also developed by mining 9,517 genomic survey sequences of Arachis. An SSR-based intraspecific linkage map was constructed using an F2 population derived from a cross between K 9484 (PI 298639) and GKBSPSc 30081 (PI 468327) in the B-genome species A. batizocoi. A high degree of macrosynteny was observed when comparing the homoeologous linkage groups between A (A. duranensis) and B (A. batizocoi) genomes. Comparison of the A- and B-genome genetic linkage maps also showed a total of five inversions and one major reciprocal translocation between two pairs of chromosomes under our current mapping resolution. Conclusions Our findings will contribute to understanding tetraploid peanut genome origin and evolution and eventually promote its genetic improvement. The newly developed EST-SSR markers will enrich current molecular marker resources in peanut. PMID:23140574
Zhang, Ning; Zhang, Linan; Tao, Ye; Guo, Li; Sun, Juan; Li, Xia; Zhao, Nan; Peng, Jie; Li, Xiaojie; Zeng, Liang; Chen, Jinsa; Yang, Guanpin
2015-03-15
Kelp (Saccharina japonica) has been intensively cultured in China for almost a century. Its genetic improvement is comparable with that of rice. However, the development of its molecular tools is extremely limited, thus its genes, genetics and genomics. Kelp performs an alternative life cycle during which sporophyte generation alternates with gametophyte generation. The gametophytes of kelp can be cloned and crossed. Due to these characteristics, kelp may serve as a reference for the biological and genetic studies of Volvox, mosses and ferns. We constructed a high density single nucleotide polymorphism (SNP) linkage map for kelp by restriction site associated DNA (RAD) sequencing. In total, 4,994 SNP-containing physical (tag-defined) RAD loci were mapped on 31 linkage groups. The map expanded a total genetic distance of 1,782.75 cM, covering 98.66% of the expected (1,806.94 cM). The length of RAD tags (85 bp) was extended to 400-500 bp with Miseq method, offering us an easiness of developing SNP chips and shifting SNP genotyping to a high throughput track. The number of linkage groups was in accordance with the documented with cytological methods. In addition, we identified a set of microsatellites (99 in total) from the extended RAD tags. A gametophyte sex determining locus was mapped on linkage group 2 in a window about 9.0 cM in width, which was 2.66 cM up to marker_40567 and 6.42 cM down to marker_23595. A high density SNP linkage map was constructed for kelp, an intensively cultured brown alga in China. The RAD tags were also extended so that a SNP chip could be developed. In addition, a set of microsatellites were identified among mapped loci, and a gametophyte sex determining locus was mapped. This map will facilitate the genetic studies of kelp including for example the evaluation of germplasm and the decipherment of the genetic bases of economic traits.
Genomic analyses of the CAM plant pineapple.
Zhang, Jisen; Liu, Juan; Ming, Ray
2014-07-01
The innovation of crassulacean acid metabolism (CAM) photosynthesis in arid and/or low CO2 conditions is a remarkable case of adaptation in flowering plants. As the most important crop that utilizes CAM photosynthesis, the genetic and genomic resources of pineapple have been developed over many years. Genetic diversity studies using various types of DNA markers led to the reclassification of the two genera Ananas and Pseudananas and nine species into one genus Ananas and two species, A. comosus and A. macrodontes with five botanical varieties in A. comosus. Five genetic maps have been constructed using F1 or F2 populations, and high-density genetic maps generated by genotype sequencing are essential resources for sequencing and assembling the pineapple genome and for marker-assisted selection. There are abundant expression sequence tag resources but limited genomic sequences in pineapple. Genes involved in the CAM pathway has been analysed in several CAM plants but only a few of them are from pineapple. A reference genome of pineapple is being generated and will accelerate genetic and genomic research in this major CAM crop. This reference genome of pineapple provides the foundation for studying the origin and regulatory mechanism of CAM photosynthesis, and the opportunity to evaluate the classification of Ananas species and botanical cultivars. © The Author 2014. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Genetic mapping and predictive testing for multiple endocrine neoplasia type 1 (MEN1)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pandit, S.D.; Read, C.; Liu, L.
1994-09-01
Multiple endocrine neoplasia type 1 (MEN1) is an autosomal dominant disorder with an estimated prevalance of 20-200 per million persons. It is characterized by the combined occurence of tumors involving two or more endocrine glands, namely the parathyroid glands, the endocrine pancreas and the anterior pituitary. This disorder affects virtually all age groups with an average range of 20-60 years. Linkage analysis mapped the MEN1 locus to 11q13 near the human muscle glycogen phosphorylase (PYGM) locus. Additional genetic mapping and deletion analysis studies have refined the region containing the MEN1 locus to a 3 cM interval flanked by markers PYGMmore » and D11S146/D11S97, a physical distance of approximately 1.5 Mb. We have identified 8 large families segregating MEN1 (71 affected from a population of 389 individuals). A high resolution reference map for the 11q13 region has been constructed using four new microsatellite markers, the CEPH reference (40 family) pedigree resource, and the CRI-MAP program package. Subsequent analyses using the LINKAGE program package and 8 MEN 1 families placed the MEN1 locus within the context of the microsatellite map. This map was used to develop a linkage-based predictive test. These markers have also been used to further refine the interval containing the MEN1 locus from the study of chromosome deletions (loss of heterozygosity, LOH studies) in paired sets of tumor and germline DNA from 87 MEN 1 affected individuals.« less
Efficient high-throughput sequencing of a laser microdissected chromosome arm
2013-01-01
Background Genomic sequence assemblies are key tools for a broad range of gene function and evolutionary studies. The diploid amphibian Xenopus tropicalis plays a pivotal role in these fields due to its combination of experimental flexibility, diploid genome, and early-branching tetrapod taxonomic position, having diverged from the amniote lineage ~360 million years ago. A genome assembly and a genetic linkage map have recently been made available. Unfortunately, large gaps in the linkage map attenuate long-range integrity of the genome assembly. Results We laser dissected the short arm of X. tropicalis chromosome 7 for next generation sequencing and computational mapping to the reference genome. This arm is of particular interest as it encodes the sex determination locus, but its genetic map contains large gaps which undermine available genome assemblies. Whole genome amplification of 15 laser-microdissected 7p arms followed by next generation sequencing yielded ~35 million reads, over four million of which uniquely mapped to the X. tropicalis genome. Our analysis placed more than 200 previously unmapped scaffolds on the analyzed chromosome arm, providing valuable low-resolution physical map information for de novo genome assembly. Conclusion We present a new approach for improving and validating genetic maps and sequence assemblies. Whole genome amplification of 15 microdissected chromosome arms provided sufficient high-quality material for localizing previously unmapped scaffolds and genes as well as recognizing mislocalized scaffolds. PMID:23714049
Qi, Peng; Gimode, Davis; Saha, Dipnarayan; Schröder, Stephan; Chakraborty, Debkanta; Wang, Xuewen; Dida, Mathews M; Malmberg, Russell L; Devos, Katrien M
2018-06-15
Research on orphan crops is often hindered by a lack of genomic resources. With the advent of affordable sequencing technologies, genotyping an entire genome or, for large-genome species, a representative fraction of the genome has become feasible for any crop. Nevertheless, most genotyping-by-sequencing (GBS) methods are geared towards obtaining large numbers of markers at low sequence depth, which excludes their application in heterozygous individuals. Furthermore, bioinformatics pipelines often lack the flexibility to deal with paired-end reads or to be applied in polyploid species. UGbS-Flex combines publicly available software with in-house python and perl scripts to efficiently call SNPs from genotyping-by-sequencing reads irrespective of the species' ploidy level, breeding system and availability of a reference genome. Noteworthy features of the UGbS-Flex pipeline are an ability to use paired-end reads as input, an effective approach to cluster reads across samples with enhanced outputs, and maximization of SNP calling. We demonstrate use of the pipeline for the identification of several thousand high-confidence SNPs with high representation across samples in an F 3 -derived F 2 population in the allotetraploid finger millet. Robust high-density genetic maps were constructed using the time-tested mapping program MAPMAKER which we upgraded to run efficiently and in a semi-automated manner in a Windows Command Prompt Environment. We exploited comparative GBS with one of the diploid ancestors of finger millet to assign linkage groups to subgenomes and demonstrate the presence of chromosomal rearrangements. The paper combines GBS protocol modifications, a novel flexible GBS analysis pipeline, UGbS-Flex, recommendations to maximize SNP identification, updated genetic mapping software, and the first high-density maps of finger millet. The modules used in the UGbS-Flex pipeline and for genetic mapping were applied to finger millet, an allotetraploid selfing species without a reference genome, as a case study. The UGbS-Flex modules, which can be run independently, are easily transferable to species with other breeding systems or ploidy levels.
Portis, Ezio; Scaglione, Davide; Acquadro, Alberto; Mauromicale, Giovanni; Mauro, Rosario; Knapp, Steven J; Lanteri, Sergio
2012-05-23
The Asteraceae species Cynara cardunculus (2n = 2x = 34) includes the two fully cross-compatible domesticated taxa globe artichoke (var. scolymus L.) and cultivated cardoon (var. altilis DC). As both are out-pollinators and suffer from marked inbreeding depression, linkage analysis has focussed on the use of a two way pseudo-test cross approach. A set of 172 microsatellite (SSR) loci derived from expressed sequence tag DNA sequence were integrated into the reference C. cardunculus genetic maps, based on segregation among the F1 progeny of a cross between a globe artichoke and a cultivated cardoon. The resulting maps each detected 17 major linkage groups, corresponding to the species' haploid chromosome number. A consensus map based on 66 co-dominant shared loci (64 SSRs and two SNPs) assembled 694 loci, with a mean inter-marker spacing of 2.5 cM. When the maps were used to elucidate the pattern of inheritance of head production earliness, a key commercial trait, seven regions were shown to harbour relevant quantitative trait loci (QTL). Together, these QTL accounted for up to 74% of the overall phenotypic variance. The newly developed consensus as well as the parental genetic maps can accelerate the process of tagging and eventually isolating the genes underlying earliness in both the domesticated C. cardunculus forms. The largest single effect mapped to the same linkage group in each parental maps, and explained about one half of the phenotypic variance, thus representing a good candidate for marker assisted selection.
Applications of the 1000 Genomes Project resources.
Zheng-Bradley, Xiangqun; Flicek, Paul
2017-05-01
The 1000 Genomes Project created a valuable, worldwide reference for human genetic variation. Common uses of the 1000 Genomes dataset include genotype imputation supporting Genome-wide Association Studies, mapping expression Quantitative Trait Loci, filtering non-pathogenic variants from exome, whole genome and cancer genome sequencing projects, and genetic analysis of population structure and molecular evolution. In this article, we will highlight some of the multiple ways that the 1000 Genomes data can be and has been utilized for genetic studies. © The Author 2016. Published by Oxford University Press.
Niu, Yuze; Gao, Fengtao; Zhao, Yongwei; Zhang, Jing; Sun, Jian; Shao, Changwei; Liao, Xiaolin; Wang, Lei; Tian, Yongsheng; Chen, Songlin
2012-01-01
High-density genetic linkage maps were constructed for the Japanese flounder (Paralichthys olivaceus). A total of 1624 microsatellite markers were polymorphic in the reference family. Linkage analysis using JoinMap 4.0 resulted in the mapping of 1487 markers to 24 linkage groups, a result which was consistent with the 24 chromosomes seen in chromosome spreads. The female map was composed of 1257 markers, covering a total of 1663.8 cM with an average interval 1.35 cM between markers. The male map consisted of 1224 markers, spanning 1726.5 cM, with an average interval of 1.44 cM. The genome length in the Japanese flounder was estimated to be 1730.3 cM for the females and 1798.0 cM for the males, a coverage of 96.2% for the female and 96.0% for the male map. The mean recombination at common intervals throughout the genome revealed a slight difference between sexes, i.e. 1.07 times higher in the male than female. High-density genetic linkage maps are very useful for marker-assisted selection (MAS) programs for economically valuable traits in this species and for further evolutionary studies in flatfish and vertebrate species. Furthermore, four quantiative trait loci (QTL) associated with growth traits were mapped on the genetic map. One QTL was identified for body weight on LG 14 f, which explained 14.85% of the total variation of the body weight. Three QTL were identified for body width on LG14f and LG14m, accounting for 16.75%, 13.62% and 13.65% of the total variation in body width, respectively. The additive effects were evident as negative values. There were four QTL for growth traits clustered on LG14, which should prove to be very useful for improving growth traits using molecular MAS. PMID:23209734
Integrative analysis of 111 reference human epigenomes
Kundaje, Anshul; Meuleman, Wouter; Ernst, Jason; Bilenky, Misha; Yen, Angela; Kheradpour, Pouya; Zhang, Zhizhuo; Heravi-Moussavi, Alireza; Liu, Yaping; Amin, Viren; Ziller, Michael J; Whitaker, John W; Schultz, Matthew D; Sandstrom, Richard S; Eaton, Matthew L; Wu, Yi-Chieh; Wang, Jianrong; Ward, Lucas D; Sarkar, Abhishek; Quon, Gerald; Pfenning, Andreas; Wang, Xinchen; Claussnitzer, Melina; Coarfa, Cristian; Harris, R Alan; Shoresh, Noam; Epstein, Charles B; Gjoneska, Elizabeta; Leung, Danny; Xie, Wei; Hawkins, R David; Lister, Ryan; Hong, Chibo; Gascard, Philippe; Mungall, Andrew J; Moore, Richard; Chuah, Eric; Tam, Angela; Canfield, Theresa K; Hansen, R Scott; Kaul, Rajinder; Sabo, Peter J; Bansal, Mukul S; Carles, Annaick; Dixon, Jesse R; Farh, Kai-How; Feizi, Soheil; Karlic, Rosa; Kim, Ah-Ram; Kulkarni, Ashwinikumar; Li, Daofeng; Lowdon, Rebecca; Mercer, Tim R; Neph, Shane J; Onuchic, Vitor; Polak, Paz; Rajagopal, Nisha; Ray, Pradipta; Sallari, Richard C; Siebenthall, Kyle T; Sinnott-Armstrong, Nicholas; Stevens, Michael; Thurman, Robert E; Wu, Jie; Zhang, Bo; Zhou, Xin; Beaudet, Arthur E; Boyer, Laurie A; De Jager, Philip; Farnham, Peggy J; Fisher, Susan J; Haussler, David; Jones, Steven; Li, Wei; Marra, Marco; McManus, Michael T; Sunyaev, Shamil; Thomson, James A; Tlsty, Thea D; Tsai, Li-Huei; Wang, Wei; Waterland, Robert A; Zhang, Michael; Chadwick, Lisa H; Bernstein, Bradley E; Costello, Joseph F; Ecker, Joseph R; Hirst, Martin; Meissner, Alexander; Milosavljevic, Aleksandar; Ren, Bing; Stamatoyannopoulos, John A; Wang, Ting; Kellis, Manolis
2015-01-01
The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but a similar reference has lacked for epigenomic studies. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection to-date of human epigenomes for primary cells and tissues. Here, we describe the integrative analysis of 111 reference human epigenomes generated as part of the program, profiled for histone modification patterns, DNA accessibility, DNA methylation, and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically-relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation, and human disease. PMID:25693563
Integrative analysis of 111 reference human epigenomes.
Kundaje, Anshul; Meuleman, Wouter; Ernst, Jason; Bilenky, Misha; Yen, Angela; Heravi-Moussavi, Alireza; Kheradpour, Pouya; Zhang, Zhizhuo; Wang, Jianrong; Ziller, Michael J; Amin, Viren; Whitaker, John W; Schultz, Matthew D; Ward, Lucas D; Sarkar, Abhishek; Quon, Gerald; Sandstrom, Richard S; Eaton, Matthew L; Wu, Yi-Chieh; Pfenning, Andreas R; Wang, Xinchen; Claussnitzer, Melina; Liu, Yaping; Coarfa, Cristian; Harris, R Alan; Shoresh, Noam; Epstein, Charles B; Gjoneska, Elizabeta; Leung, Danny; Xie, Wei; Hawkins, R David; Lister, Ryan; Hong, Chibo; Gascard, Philippe; Mungall, Andrew J; Moore, Richard; Chuah, Eric; Tam, Angela; Canfield, Theresa K; Hansen, R Scott; Kaul, Rajinder; Sabo, Peter J; Bansal, Mukul S; Carles, Annaick; Dixon, Jesse R; Farh, Kai-How; Feizi, Soheil; Karlic, Rosa; Kim, Ah-Ram; Kulkarni, Ashwinikumar; Li, Daofeng; Lowdon, Rebecca; Elliott, GiNell; Mercer, Tim R; Neph, Shane J; Onuchic, Vitor; Polak, Paz; Rajagopal, Nisha; Ray, Pradipta; Sallari, Richard C; Siebenthall, Kyle T; Sinnott-Armstrong, Nicholas A; Stevens, Michael; Thurman, Robert E; Wu, Jie; Zhang, Bo; Zhou, Xin; Beaudet, Arthur E; Boyer, Laurie A; De Jager, Philip L; Farnham, Peggy J; Fisher, Susan J; Haussler, David; Jones, Steven J M; Li, Wei; Marra, Marco A; McManus, Michael T; Sunyaev, Shamil; Thomson, James A; Tlsty, Thea D; Tsai, Li-Huei; Wang, Wei; Waterland, Robert A; Zhang, Michael Q; Chadwick, Lisa H; Bernstein, Bradley E; Costello, Joseph F; Ecker, Joseph R; Hirst, Martin; Meissner, Alexander; Milosavljevic, Aleksandar; Ren, Bing; Stamatoyannopoulos, John A; Wang, Ting; Kellis, Manolis
2015-02-19
The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.
Improving the goat long-read assembly with optical mapping
USDA-ARS?s Scientific Manuscript database
Reference genome assemblies provide important context in genetics by standardizing the order of genes and providing a universal set of coordinates for individual nucleotides. Often due to the high complexity of genic regions and higher copy number of genes involved in immune function, immunity-relat...
Adapt-Mix: learning local genetic correlation structure improves summary statistics-based analyses
Park, Danny S.; Brown, Brielin; Eng, Celeste; Huntsman, Scott; Hu, Donglei; Torgerson, Dara G.; Burchard, Esteban G.; Zaitlen, Noah
2015-01-01
Motivation: Approaches to identifying new risk loci, training risk prediction models, imputing untyped variants and fine-mapping causal variants from summary statistics of genome-wide association studies are playing an increasingly important role in the human genetics community. Current summary statistics-based methods rely on global ‘best guess’ reference panels to model the genetic correlation structure of the dataset being studied. This approach, especially in admixed populations, has the potential to produce misleading results, ignores variation in local structure and is not feasible when appropriate reference panels are missing or small. Here, we develop a method, Adapt-Mix, that combines information across all available reference panels to produce estimates of local genetic correlation structure for summary statistics-based methods in arbitrary populations. Results: We applied Adapt-Mix to estimate the genetic correlation structure of both admixed and non-admixed individuals using simulated and real data. We evaluated our method by measuring the performance of two summary statistics-based methods: imputation and joint-testing. When using our method as opposed to the current standard of ‘best guess’ reference panels, we observed a 28% decrease in mean-squared error for imputation and a 73.7% decrease in mean-squared error for joint-testing. Availability and implementation: Our method is publicly available in a software package called ADAPT-Mix available at https://github.com/dpark27/adapt_mix. Contact: noah.zaitlen@ucsf.edu PMID:26072481
Wing, Rod A; Ammiraju, Jetty S S; Luo, Meizhong; Kim, Hyeran; Yu, Yeisoo; Kudrna, Dave; Goicoechea, Jose L; Wang, Wenming; Nelson, Will; Rao, Kiran; Brar, Darshan; Mackill, Dave J; Han, Bin; Soderlund, Cari; Stein, Lincoln; SanMiguel, Phillip; Jackson, Scott
2005-09-01
The wild species of the genus Oryza offer enormous potential to make a significant impact on agricultural productivity of the cultivated rice species Oryza sativa and Oryza glaberrima. To unlock the genetic potential of wild rice we have initiated a project entitled the 'Oryza Map Alignment Project' (OMAP) with the ultimate goal of constructing and aligning BAC/STC based physical maps of 11 wild and one cultivated rice species to the International Rice Genome Sequencing Project's finished reference genome--O. sativa ssp. japonica c. v. Nipponbare. The 11 wild rice species comprise nine different genome types and include six diploid genomes (AA, BB, CC, EE, FF and GG) and four tetrapliod genomes (BBCC, CCDD, HHKK and HHJJ) with broad geographical distribution and ecological adaptation. In this paper we describe our strategy to construct robust physical maps of all 12 rice species with an emphasis on the AA diploid O. nivara--thought to be the progenitor of modern cultivated rice.
The systematic sequencing of the cancer genome has led to the identification of numerous genetic alterations in cancer. However, a deeper understanding of the functional consequences of these alterations is necessary to guide appropriate therapeutic strategies. Here, we describe Onco-GPS (OncoGenic Positioning System), a data-driven analysis framework to organize individual tumor samples with shared oncogenic alterations onto a reference map defined by their underlying cellular states.
Hulse-Kemp, Amanda M.; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A.; Scheffler, Brian E.; Fang, David D.; Chen, Z. Jeffrey; Van Deynze, Allen; Stelly, David M.
2015-01-01
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. PMID:25858960
Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing
2014-01-01
Background Genetic linkage maps are useful tools for mapping quantitative trait loci (QTL) influencing variation in traits of interest in a population. Genotyping-by-sequencing approaches such as Restriction-site Associated DNA sequencing (RAD-Seq) now enable the rapid discovery and genotyping of genome-wide SNP markers suitable for the development of dense SNP linkage maps, including in non-model organisms such as Atlantic salmon (Salmo salar). This paper describes the development and characterisation of a high density SNP linkage map based on SbfI RAD-Seq SNP markers from two Atlantic salmon reference families. Results Approximately 6,000 SNPs were assigned to 29 linkage groups, utilising markers from known genomic locations as anchors. Linkage maps were then constructed for the four mapping parents separately. Overall map lengths were comparable between male and female parents, but the distribution of the SNPs showed sex-specific patterns with a greater degree of clustering of sire-segregating SNPs to single chromosome regions. The maps were integrated with the Atlantic salmon draft reference genome contigs, allowing the unique assignment of ~4,000 contigs to a linkage group. 112 genome contigs mapped to two or more linkage groups, highlighting regions of putative homeology within the salmon genome. A comparative genomics analysis with the stickleback reference genome identified putative genes closely linked to approximately half of the ordered SNPs and demonstrated blocks of orthology between the Atlantic salmon and stickleback genomes. A subset of 47 RAD-Seq SNPs were successfully validated using a high-throughput genotyping assay, with a correspondence of 97% between the two assays. Conclusions This Atlantic salmon RAD-Seq linkage map is a resource for salmonid genomics research as genotyping-by-sequencing becomes increasingly common. This is aided by the integration of the SbfI RAD-Seq SNPs with existing reference maps and the draft reference genome, as well as the identification of putative genes proximal to the SNPs. Differences in the distribution of recombination events between the sexes is evident, and regions of homeology have been identified which are reflective of the recent salmonid whole genome duplication. PMID:24571138
Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram P; Gupta, Deepak K; Singh, Sangeeta; Dogra, Vivek; Gaikwad, Kishor; Sharma, Tilak R; Raje, Ranjeet S; Bandhopadhya, Tapas K; Datta, Subhojit; Singh, Mahendra N; Bashasab, Fakrudin; Kulwal, Pawan; Wanjari, K B; K Varshney, Rajeev; Cook, Douglas R; Singh, Nagendra K
2011-01-20
Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥ 18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea.
2011-01-01
Background Pigeonpea [Cajanus cajan (L.) Millspaugh], one of the most important food legumes of semi-arid tropical and subtropical regions, has limited genomic resources, particularly expressed sequence based (genic) markers. We report a comprehensive set of validated genic simple sequence repeat (SSR) markers using deep transcriptome sequencing, and its application in genetic diversity analysis and mapping. Results In this study, 43,324 transcriptome shotgun assembly unigene contigs were assembled from 1.696 million 454 GS-FLX sequence reads of separate pooled cDNA libraries prepared from leaf, root, stem and immature seed of two pigeonpea varieties, Asha and UPAS 120. A total of 3,771 genic-SSR loci, excluding homopolymeric and compound repeats, were identified; of which 2,877 PCR primer pairs were designed for marker development. Dinucleotide was the most common repeat motif with a frequency of 60.41%, followed by tri- (34.52%), hexa- (2.62%), tetra- (1.67%) and pentanucleotide (0.76%) repeat motifs. Primers were synthesized and tested for 772 of these loci with repeat lengths of ≥18 bp. Of these, 550 markers were validated for consistent amplification in eight diverse pigeonpea varieties; 71 were found to be polymorphic on agarose gel electrophoresis. Genetic diversity analysis was done on 22 pigeonpea varieties and eight wild species using 20 highly polymorphic genic-SSR markers. The number of alleles at these loci ranged from 4-10 and the polymorphism information content values ranged from 0.46 to 0.72. Neighbor-joining dendrogram showed distinct separation of the different groups of pigeonpea cultivars and wild species. Deep transcriptome sequencing of the two parental lines helped in silico identification of polymorphic genic-SSR loci to facilitate the rapid development of an intra-species reference genetic map, a subset of which was validated for expected allelic segregation in the reference mapping population. Conclusion We developed 550 validated genic-SSR markers in pigeonpea using deep transcriptome sequencing. From these, 20 highly polymorphic markers were used to evaluate the genetic relationship among species of the genus Cajanus. A comprehensive set of genic-SSR markers was developed as an important genomic resource for diversity analysis and genetic mapping in pigeonpea. PMID:21251263
A linkage map for the B-genome of Arachis (Fabaceae) and its synteny to the A-genome
Moretzsohn, Márcio C; Barbosa, Andrea VG; Alves-Freitas, Dione MT; Teixeira, Cristiane; Leal-Bertioli, Soraya CM; Guimarães, Patrícia M; Pereira, Rinaldo W; Lopes, Catalina R; Cavallari, Marcelo M; Valls, José FM; Bertioli, David J; Gimenes, Marcos A
2009-01-01
Background Arachis hypogaea (peanut) is an important crop worldwide, being mostly used for edible oil production, direct consumption and animal feed. Cultivated peanut is an allotetraploid species with two different genome components, A and B. Genetic linkage maps can greatly assist molecular breeding and genomic studies. However, the development of linkage maps for A. hypogaea is difficult because it has very low levels of polymorphism. This can be overcome by the utilization of wild species of Arachis, which present the A- and B-genomes in the diploid state, and show high levels of genetic variability. Results In this work, we constructed a B-genome linkage map, which will complement the previously published map for the A-genome of Arachis, and produced an entire framework for the tetraploid genome. This map is based on an F2 population of 93 individuals obtained from the cross between the diploid A. ipaënsis (K30076) and the closely related A. magna (K30097), the former species being the most probable B genome donor to cultivated peanut. In spite of being classified as different species, the parents showed high crossability and relatively low polymorphism (22.3%), compared to other interspecific crosses. The map has 10 linkage groups, with 149 loci spanning a total map distance of 1,294 cM. The microsatellite markers utilized, developed for other Arachis species, showed high transferability (81.7%). Segregation distortion was 21.5%. This B-genome map was compared to the A-genome map using 51 common markers, revealing a high degree of synteny between both genomes. Conclusion The development of genetic maps for Arachis diploid wild species with A- and B-genomes effectively provides a genetic map for the tetraploid cultivated peanut in two separate diploid components and is a significant advance towards the construction of a transferable reference map for Arachis. Additionally, we were able to identify affinities of some Arachis linkage groups with Medicago truncatula, which will allow the transfer of information from the nearly-complete genome sequences of this model legume to the peanut crop. PMID:19351409
Staton, Margaret; Zhebentyayeva, Tetyana; Olukolu, Bode; Fang, Guang Chen; Nelson, Dana; Carlson, John E; Abbott, Albert G
2015-10-05
Chinese chestnut (Castanea mollissima) has emerged as a model species for the Fagaceae family with extensive genomic resources including a physical map, a dense genetic map and quantitative trait loci (QTLs) for chestnut blight resistance. These resources enable comparative genomics analyses relative to model plants. We assessed the degree of conservation between the chestnut genome and other well annotated and assembled plant genomic sequences, focusing on the QTL regions of most interest to the chestnut breeding community. The integrated physical and genetic map of Chinese chestnut has been improved to now include 858 shared sequence-based markers. The utility of the integrated map has also been improved through the addition of 42,970 BAC (bacterial artificial chromosome) end sequences spanning over 26 million bases of the estimated 800 Mb chestnut genome. Synteny between chestnut and ten model plant species was conducted on a macro-syntenic scale using sequences from both individual probes and BAC end sequences across the chestnut physical map. Blocks of synteny with chestnut were found in all ten reference species, with the percent of the chestnut physical map that could be aligned ranging from 10 to 39 %. The integrated genetic and physical map was utilized to identify BACs that spanned the three previously identified QTL regions conferring blight resistance. The clones were pooled and sequenced, yielding 396 sequence scaffolds covering 13.9 Mbp. Comparative genomic analysis on a microsytenic scale, using the QTL-associated genomic sequence, identified synteny from chestnut to other plant genomes ranging from 5.4 to 12.9 % of the genome sequences aligning. On both the macro- and micro-synteny levels, the peach, grape and poplar genomes were found to be the most structurally conserved with chestnut. Interestingly, these results did not strictly follow the expectation that decreased phylogenetic distance would correspond to increased levels of genome preservation, but rather suggest the additional influence of life-history traits on preservation of synteny. The regions of synteny that were detected provide an important tool for defining and cataloging genes in the QTL regions for advancing chestnut blight resistance research.
Wang, Xiaodong; Yu, Kunjiang; Li, Hongge; Peng, Qi; Chen, Feng; Zhang, Wei; Chen, Song; Hu, Maolong; Zhang, Jiefu
2015-01-01
The apetalous genotype is a morphological ideotype for increasing seed yield and should be of considerable agricultural use; however, only a few studies have focused on the genetic control of this trait in Brassica napus. In the present study, a recombinant inbred line, the AH population, containing 189 individuals was derived from a cross between an apetalous line ‘APL01’ and a normally petalled variety ‘Holly’. The Brassica 60 K Infinium BeadChip Array harboring 52,157 single nucleotide polymorphism (SNP) markers was used to genotype the AH individuals. A high-density genetic linkage map was constructed based on 2,755 bins involving 11,458 SNPs and 57 simple sequence repeats, and was used to identify loci associated with petalous degree (PDgr). The linkage map covered 2,027.53 cM, with an average marker interval of 0.72 cM. The AH map had good collinearity with the B. napus reference genome, indicating its high quality and accuracy. After phenotypic analyses across five different experiments, a total of 19 identified quantitative trait loci (QTLs) distributed across chromosomes A3, A5, A6, A9 and C8 were obtained, and these QTLs were further integrated into nine consensus QTLs by a meta-analysis. Interestingly, the major QTL qPD.C8-2 was consistently detected in all five experiments, and qPD.A9-2 and qPD.C8-3 were stably expressed in four experiments. Comparative mapping between the AH map and the B. napus reference genome suggested that there were 328 genes underlying the confidence intervals of the three steady QTLs. Based on the Gene Ontology assignments of 52 genes to the regulation of floral development in published studies, 146 genes were considered as potential candidate genes for PDgr. The current study carried out a QTL analysis for PDgr using a high-density SNP map in B. napus, providing novel targets for improving seed yield. These results advanced our understanding of the genetic control of PDgr regulation in B. napus. PMID:26779193
Wang, Xiaodong; Yu, Kunjiang; Li, Hongge; Peng, Qi; Chen, Feng; Zhang, Wei; Chen, Song; Hu, Maolong; Zhang, Jiefu
2015-01-01
The apetalous genotype is a morphological ideotype for increasing seed yield and should be of considerable agricultural use; however, only a few studies have focused on the genetic control of this trait in Brassica napus. In the present study, a recombinant inbred line, the AH population, containing 189 individuals was derived from a cross between an apetalous line 'APL01' and a normally petalled variety 'Holly'. The Brassica 60 K Infinium BeadChip Array harboring 52,157 single nucleotide polymorphism (SNP) markers was used to genotype the AH individuals. A high-density genetic linkage map was constructed based on 2,755 bins involving 11,458 SNPs and 57 simple sequence repeats, and was used to identify loci associated with petalous degree (PDgr). The linkage map covered 2,027.53 cM, with an average marker interval of 0.72 cM. The AH map had good collinearity with the B. napus reference genome, indicating its high quality and accuracy. After phenotypic analyses across five different experiments, a total of 19 identified quantitative trait loci (QTLs) distributed across chromosomes A3, A5, A6, A9 and C8 were obtained, and these QTLs were further integrated into nine consensus QTLs by a meta-analysis. Interestingly, the major QTL qPD.C8-2 was consistently detected in all five experiments, and qPD.A9-2 and qPD.C8-3 were stably expressed in four experiments. Comparative mapping between the AH map and the B. napus reference genome suggested that there were 328 genes underlying the confidence intervals of the three steady QTLs. Based on the Gene Ontology assignments of 52 genes to the regulation of floral development in published studies, 146 genes were considered as potential candidate genes for PDgr. The current study carried out a QTL analysis for PDgr using a high-density SNP map in B. napus, providing novel targets for improving seed yield. These results advanced our understanding of the genetic control of PDgr regulation in B. napus.
Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma Jj; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco Cam; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric
2016-01-01
Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple ( Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species.
Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma JJ; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco CAM; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric
2016-01-01
Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species. PMID:27917289
Linking the potato genome to the conserved ortholog set (COS) markers
2013-01-01
Background Conserved ortholog set (COS) markers are an important functional genomics resource that has greatly improved orthology detection in Asterid species. A comprehensive list of these markers is available at Sol Genomics Network (http://solgenomics.net/) and many of these have been placed on the genetic maps of a number of solanaceous species. Results We amplified over 300 COS markers from eight potato accessions involving two diploid landraces of Solanum tuberosum Andigenum group (formerly classified as S. goniocalyx, S. phureja), and a dihaploid clone derived from a modern tetraploid cultivar of S. tuberosum and the wild species S. berthaultii, S. chomatophilum, and S. paucissectum. By BLASTn (Basic Local Alignment Search Tool of the NCBI, National Center for Biotechnology Information) algorithm we mapped the DNA sequences of these markers into the potato genome sequence. Additionally, we mapped a subset of these markers genetically in potato and present a comparison between the physical and genetic locations of these markers in potato and in comparison with the genetic location in tomato. We found that most of the COS markers are single-copy in the reference genome of potato and that the genetic location in tomato and physical location in potato sequence are mostly in agreement. However, we did find some COS markers that are present in multiple copies and those that map in unexpected locations. Sequence comparisons between species show that some of these markers may be paralogs. Conclusions The sequence-based physical map becomes helpful in identification of markers for traits of interest thereby reducing the number of markers to be tested for applications like marker assisted selection, diversity, and phylogenetic studies. PMID:23758607
2012-01-01
Background The Asteraceae species Cynara cardunculus (2n = 2x = 34) includes the two fully cross-compatible domesticated taxa globe artichoke (var. scolymus L.) and cultivated cardoon (var. altilis DC). As both are out-pollinators and suffer from marked inbreeding depression, linkage analysis has focussed on the use of a two way pseudo-test cross approach. Results A set of 172 microsatellite (SSR) loci derived from expressed sequence tag DNA sequence were integrated into the reference C. cardunculus genetic maps, based on segregation among the F1 progeny of a cross between a globe artichoke and a cultivated cardoon. The resulting maps each detected 17 major linkage groups, corresponding to the species’ haploid chromosome number. A consensus map based on 66 co-dominant shared loci (64 SSRs and two SNPs) assembled 694 loci, with a mean inter-marker spacing of 2.5 cM. When the maps were used to elucidate the pattern of inheritance of head production earliness, a key commercial trait, seven regions were shown to harbour relevant quantitative trait loci (QTL). Together, these QTL accounted for up to 74% of the overall phenotypic variance. Conclusion The newly developed consensus as well as the parental genetic maps can accelerate the process of tagging and eventually isolating the genes underlying earliness in both the domesticated C. cardunculus forms. The largest single effect mapped to the same linkage group in each parental maps, and explained about one half of the phenotypic variance, thus representing a good candidate for marker assisted selection. PMID:22621324
The Release 6 reference sequence of the Drosophila melanogaster genome
Hoskins, Roger A.; Carlson, Joseph W.; Wan, Kenneth H.; ...
2015-01-14
Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy andmore » middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. In conclusion, further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.« less
The Release 6 reference sequence of the Drosophila melanogaster genome
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hoskins, Roger A.; Carlson, Joseph W.; Wan, Kenneth H.
Drosophila melanogaster plays an important role in molecular, genetic, and genomic studies of heredity, development, metabolism, behavior, and human disease. The initial reference genome sequence reported more than a decade ago had a profound impact on progress in Drosophila research, and improving the accuracy and completeness of this sequence continues to be important to further progress. We previously described improvement of the 117-Mb sequence in the euchromatic portion of the genome and 21 Mb in the heterochromatic portion, using a whole-genome shotgun assembly, BAC physical mapping, and clone-based finishing. Here, we report an improved reference sequence of the single-copy andmore » middle-repetitive regions of the genome, produced using cytogenetic mapping to mitotic and polytene chromosomes, clone-based finishing and BAC fingerprint verification, ordering of scaffolds by alignment to cDNA sequences, incorporation of other map and sequence data, and validation by whole-genome optical restriction mapping. These data substantially improve the accuracy and completeness of the reference sequence and the order and orientation of sequence scaffolds into chromosome arm assemblies. Representation of the Y chromosome and other heterochromatic regions is particularly improved. The new 143.9-Mb reference sequence, designated Release 6, effectively exhausts clone-based technologies for mapping and sequencing. Highly repeat-rich regions, including large satellite blocks and functional elements such as the ribosomal RNA genes and the centromeres, are largely inaccessible to current sequencing and assembly methods and remain poorly represented. In conclusion, further significant improvements will require sequencing technologies that do not depend on molecular cloning and that produce very long reads.« less
Grattapaglia, Dario; Mamani, Eva M C; Silva-Junior, Orzenil B; Faria, Danielle A
2015-03-01
Keystone species in their native ranges, eucalypts, are ecologically and genetically very diverse, growing naturally along extensive latitudinal and altitudinal ranges and variable environments. Besides their ecological importance, eucalypts are also the most widely planted trees for sustainable forestry in the world. We report the development of a novel collection of 535 microsatellites for species of Eucalyptus, 494 designed from ESTs and 41 from genomic libraries. A selected subset of 223 was evaluated for individual identification, parentage testing, and ancestral information content in the two most extensively studied species, Eucalyptus grandis and Eucalyptus globulus. Microsatellites showed high transferability and overlapping allele size range, suggesting they have arisen still in their common ancestor and confirming the extensive genome conservation between these two species. A consensus linkage map with 437 microsatellites, the most comprehensive microsatellite-only genetic map for Eucalyptus, was built by assembling segregation data from three mapping populations and anchored to the Eucalyptus genome. An overall colinearity between recombination-based and physical positioning of 84% of the mapped microsatellites was observed, with some ordering discrepancies and sporadic locus duplications, consistent with the recently described whole genome duplication events in Eucalyptus. The linkage map covered 95.2% of the 605.8-Mbp assembled genome sequence, placing one microsatellite every 1.55 Mbp on average, and an overall estimate of physical to recombination distance of 618 kbp/cM. The genetic parameters estimates together with linkage and physical position data for this large set of microsatellites should assist marker choice for genome-wide population genetics and comparative mapping in Eucalyptus. © 2014 John Wiley & Sons Ltd.
Benner, Christian; Havulinna, Aki S; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ripatti, Samuli; Pirinen, Matti
2017-10-05
During the past few years, various novel statistical methods have been developed for fine-mapping with the use of summary statistics from genome-wide association studies (GWASs). Although these approaches require information about the linkage disequilibrium (LD) between variants, there has not been a comprehensive evaluation of how estimation of the LD structure from reference genotype panels performs in comparison with that from the original individual-level GWAS data. Using population genotype data from Finland and the UK Biobank, we show here that a reference panel of 1,000 individuals from the target population is adequate for a GWAS cohort of up to 10,000 individuals, whereas smaller panels, such as those from the 1000 Genomes Project, should be avoided. We also show, both theoretically and empirically, that the size of the reference panel needs to scale with the GWAS sample size; this has important consequences for the application of these methods in ongoing GWAS meta-analyses and large biobank studies. We conclude by providing software tools and by recommending practices for sharing LD information to more efficiently exploit summary statistics in genetics research. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Hulse-Kemp, Amanda M; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A; Scheffler, Brian E; Fang, David D; Chen, Z Jeffrey; Van Deynze, Allen; Stelly, David M
2015-04-09
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. Copyright © 2015 Hulse-Kemp et al.
Silva, C; Garcia-Mas, J; Sánchez, A M; Arús, P; Oliveira, M M
2005-03-01
Blooming time is one of the most important agronomic traits in almond. Biochemical and molecular events underlying flowering regulation must be understood before methods to stimulate late flowering can be developed. Attempts to elucidate the genetic control of this process have led to the identification of a major gene (Lb) and quantitative trait loci (QTLs) linked to observed phenotypic differences, but although this gene and these QTLs have been placed on the Prunus reference genetic map, their sequences and specific functions remain unknown. The aim of our investigation was to associate these loci with known genes using a candidate gene approach. Two almond cDNAs and eight Prunus expressed sequence tags were selected as candidate genes (CGs) since their sequences were highly identical to those of flowering regulatory genes characterized in other species. The CGs were amplified from both parental lines of the mapping population using specific primers. Sequence comparison revealed DNA polymorphisms between the parental lines, mainly of the single nucleotide type. Polymorphisms were used to develop co-dominant cleaved amplified polymorphic sequence markers or length polymorphisms based on insertion/deletion events for mapping the candidate genes on the Prunus reference map. Ten candidate genes were assigned to six linkage groups in the Prunus genome. The positions of two of these were compatible with the regions where two QTLs for blooming time were detected. One additional candidate was localized close to the position of the Evergrowing gene, which determines a non-deciduous behaviour in peach.
Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.
2015-01-01
High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569
Poon, Anna; Goldowitz, Daniel
2014-03-19
Adult neurogenesis, which is the continual production of new neurons in the mature brain, demonstrates the strikingly plastic nature of the nervous system. Adult neural stem cells and their neural precursors, collectively referred to as neural progenitor cells (NPCs), are present in the subgranular zone (SGZ) of the dentate gyrus, the subventricular zone (SVZ), and rostral migratory stream (RMS). In order to harness the potential of NPCs to treat neurodegenerative diseases and brain injuries, it will be important to understand the molecules that regulate NPCs in the adult brain. The genetic basis underlying NPC proliferation is still not fully understood. From our previous quantitative trait locus (QTL) analysis, we had success in using a relatively small reference population of recombinant inbred strains of mice (AXBXA) to identify a genetic region that is significantly correlated with NPC proliferation in the RMS. In this study, we expanded our initial QTL mapping of RMS proliferation to a far richer genetic resource, the BXD RI mouse strains. A 3-fold difference in the number of proliferative, bromodeoxyuridine (BrdU)-labeled cells was quantified in the adult RMS of 61 BXD RI strains. RMS cell proliferation is highly dependent on the genetic background of the mice with an estimated heritability of 0.58. Genome-wide mapping revealed a significant QTL on chromosome (Chr) 6 and a suggestive QTL on Chr 11 regulating the number of NPCs in the RMS. Composite interval analysis further revealed secondary QTLs on Chr 14 and Chr 18. The loci regulating RMS cell proliferation did not overlap with the suggestive loci modulating cell proliferation in the SGZ. These mapped loci serve as starting points to identify genes important for this process. A subset of candidate genes in this region is associated with cell proliferation and neurogenesis. Interconnectivity of these candidate genes was demonstrated using pathway and transcriptional covariance analyses. Differences in RMS cell proliferation across the BXD RI strains identifies genetic loci that serve to provide insights into the interplay of underlying genes that may be important for regulating NPC proliferation in the adult mouse brain.
Construction of the third-generation Zea mays haplotype map.
Bukowski, Robert; Guo, Xiaosen; Lu, Yanli; Zou, Cheng; He, Bing; Rong, Zhengqin; Wang, Bo; Xu, Dawen; Yang, Bicheng; Xie, Chuanxiao; Fan, Longjiang; Gao, Shibin; Xu, Xun; Zhang, Gengyun; Li, Yingrui; Jiao, Yinping; Doebley, John F; Ross-Ibarra, Jeffrey; Lorant, Anne; Buffalo, Vince; Romay, M Cinta; Buckler, Edward S; Ware, Doreen; Lai, Jinsheng; Sun, Qi; Xu, Yunbi
2018-04-01
Characterization of genetic variations in maize has been challenging, mainly due to deterioration of collinearity between individual genomes in the species. An international consortium of maize research groups combined resources to develop the maize haplotype version 3 (HapMap 3), built from whole-genome sequencing data from 1218 maize lines, covering predomestication and domesticated Zea mays varieties across the world. A new computational pipeline was set up to process more than 12 trillion bp of sequencing data, and a set of population genetics filters was applied to identify more than 83 million variant sites. We identified polymorphisms in regions where collinearity is largely preserved in the maize species. However, the fact that the B73 genome used as the reference only represents a fraction of all haplotypes is still an important limiting factor.
A Single Molecule Scaffold for the Maize Genome
Zhou, Shiguo; Wei, Fusheng; Nguyen, John; Bechner, Mike; Potamousis, Konstantinos; Goldstein, Steve; Pape, Louise; Mehan, Michael R.; Churas, Chris; Pasternak, Shiran; Forrest, Dan K.; Wise, Roger; Ware, Doreen; Wing, Rod A.; Waterman, Michael S.; Livny, Miron; Schwartz, David C.
2009-01-01
About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome-wide, high-resolution optical map of the maize inbred line B73 genome containing >91,000 restriction sites (averaging 1 site/∼23 kb) accrued from mapping genomic DNA molecules. Our optical map comprises 66 contigs, averaging 31.88 Mb in size and spanning 91.5% (2,103.93 Mb/∼2,300 Mb) of the maize genome. A new algorithm was created that considered both optical map and unfinished BAC sequence data for placing 60/66 (2,032.42 Mb) optical map contigs onto the maize iMap. The alignment of optical maps against numerous data sources yielded comprehensive results that proved revealing and productive. For example, gaps were uncovered and characterized within the iMap, the FPC (fingerprinted contigs) map, and the chromosome-wide pseudomolecules. Such alignments also suggested amended placements of FPC contigs on the maize genetic map and proactively guided the assembly of chromosome-wide pseudomolecules, especially within complex genomic regions. Lastly, we think that the full integration of B73 optical maps with the maize iMap would greatly facilitate maize sequence finishing efforts that would make it a valuable reference for comparative studies among cereals, or other maize inbred lines and cultivars. PMID:19936062
Genetical genomics of Populus leaf shape variation
Drost, Derek R.; Puranik, Swati; Novaes, Evandro; ...
2015-06-30
Leaf morphology varies extensively among plant species and is under strong genetic control. Mutagenic screens in model systems have identified genes and established molecular mechanisms regulating leaf initiation, development, and shape. However, it is not known whether this diversity across plant species is related to naturally occurring variation at these genes. Quantitative trait locus (QTL) analysis has revealed a polygenic control for leaf shape variation in different species suggesting that loci discovered by mutagenesis may only explain part of the naturally occurring variation in leaf shape. Here we undertook a genetical genomics study in a poplar intersectional pseudo-backcross pedigree tomore » identify genetic factors controlling leaf shape. Here, the approach combined QTL discovery in a genetic linkage map anchored to the Populus trichocarpa reference genome sequence and transcriptome analysis.« less
A negative genetic interaction map in isogenic cancer cell lines reveals cancer cell vulnerabilities
Vizeacoumar, Franco J; Arnold, Roland; Vizeacoumar, Frederick S; Chandrashekhar, Megha; Buzina, Alla; Young, Jordan T F; Kwan, Julian H M; Sayad, Azin; Mero, Patricia; Lawo, Steffen; Tanaka, Hiromasa; Brown, Kevin R; Baryshnikova, Anastasia; Mak, Anthony B; Fedyshyn, Yaroslav; Wang, Yadong; Brito, Glauber C; Kasimer, Dahlia; Makhnevych, Taras; Ketela, Troy; Datti, Alessandro; Babu, Mohan; Emili, Andrew; Pelletier, Laurence; Wrana, Jeff; Wainberg, Zev; Kim, Philip M; Rottapel, Robert; O'Brien, Catherine A; Andrews, Brenda; Boone, Charles; Moffat, Jason
2013-01-01
Improved efforts are necessary to define the functional product of cancer mutations currently being revealed through large-scale sequencing efforts. Using genome-scale pooled shRNA screening technology, we mapped negative genetic interactions across a set of isogenic cancer cell lines and confirmed hundreds of these interactions in orthogonal co-culture competition assays to generate a high-confidence genetic interaction network of differentially essential or differential essentiality (DiE) genes. The network uncovered examples of conserved genetic interactions, densely connected functional modules derived from comparative genomics with model systems data, functions for uncharacterized genes in the human genome and targetable vulnerabilities. Finally, we demonstrate a general applicability of DiE gene signatures in determining genetic dependencies of other non-isogenic cancer cell lines. For example, the PTEN−/− DiE genes reveal a signature that can preferentially classify PTEN-dependent genotypes across a series of non-isogenic cell lines derived from the breast, pancreas and ovarian cancers. Our reference network suggests that many cancer vulnerabilities remain to be discovered through systematic derivation of a network of differentially essential genes in an isogenic cancer cell model. PMID:24104479
A comprehensive map of the porcine genome.
Rohrer, G A; Alexander, L J; Hu, Z; Smith, T P; Keele, J W; Beattie, C W
1996-05-01
We report the highest density genetic linkage map for a livestock species produced to date. Three published maps for Sus scrofa were merged by genotyping virtually every publicly available microsatellite across a single reference population to yield 1042 linked loci, 536 of which are novel assignments, spanning 2286.2 cM (average interval 2.23 cM) in 19 linkage groups (18 autosomal and X chromosomes, n = 19). Linkage groups were constructed de novo and mapped by locus content to avoid propagation of errors in older genotypes. The physical and genetic maps were integrated with 123 informative loci assigned previously by fluorescence in situ hybridization (FISH). Fourteen linkage groups span the entire length of each chromosome. Coverage of chromosomes 11, 12, 15, and 18 will be evaluated as more markers are physically assigned. Marker-deficient regions were identified only on 11q1.7-qter and 14 cen-q1.2. Recombination rates (cM/Mbp) varied between and within chromosomes. Short chromosomal arms recombined at higher rates than long arms, and recombination was more frequent in telomeric regions than in pericentric regions. The high-resolution comprehensive map has the marker density needed to identify quantitative trait loci (QTL), implement marker-assisted selection or introgression and YAC contig construction or chromosomal microdissection.
USDA-ARS?s Scientific Manuscript database
Research into the mineral contents of cereal grains and vegetables is motivated by interest in improving their nutritional value. Biofortification refers to natural enhancement of the grain/food product through traditional breeding. Since it does not require genetic engineering, it is acceptable t...
USDA-ARS?s Scientific Manuscript database
A searchable and publicly viewable set of mapped genomes from 96 rams from 9 US sheep breeds was created. The nine pure breeds were selected to represent genetic diversity for traits such as fertility, prolificacy, maternal ability, growth rate, carcass leanness, wool quality, mature weight, and lo...
Brown, Allan F; Yousef, Gad G; Chebrolu, Kranthi K; Byrd, Robert W; Everhart, Koyt W; Thomas, Aswathy; Reid, Robert W; Parkin, Isobel A P; Sharpe, Andrew G; Oliver, Rebekah; Guzman, Ivette; Jackson, Eric W
2014-09-01
A high-resolution genetic linkage map of B. oleracea was developed from a B. napus SNP array. The work will facilitate genetic and evolutionary studies in Brassicaceae. A broccoli population, VI-158 × BNC, consisting of 150 F2:3 families was used to create a saturated Brassica oleracea (diploid: CC) linkage map using a recently developed rapeseed (Brassica napus) (tetraploid: AACC) Illumina Infinium single nucleotide polymorphism (SNP) array. The map consisted of 547 non-redundant SNP markers spanning 948.1 cM across nine chromosomes with an average interval size of 1.7 cM. As the SNPs are anchored to the genomic reference sequence of the rapid cycling B. oleracea TO1000, we were able to estimate that the map provides 96 % coverage of the diploid genome. Carotenoid analysis of 2 years data identified 3 QTLs on two chromosomes that are associated with up to half of the phenotypic variation associated with the accumulation of total or individual compounds. By searching the genome sequences of the two related diploid species (B. oleracea and B. rapa), we further identified putative carotenoid candidate genes in the region of these QTLs. This is the first description of the use of a B. napus SNP array to rapidly construct high-density genetic linkage maps of one of the constituent diploid species. The unambiguous nature of these markers with regard to genomic sequences provides evidence to the nature of genes underlying the QTL, and demonstrates the value and impact this resource will have on Brassica research.
Zeng, Ping; Mukherjee, Sayan; Zhou, Xiang
2017-01-01
Epistasis, commonly defined as the interaction between multiple genes, is an important genetic component underlying phenotypic variation. Many statistical methods have been developed to model and identify epistatic interactions between genetic variants. However, because of the large combinatorial search space of interactions, most epistasis mapping methods face enormous computational challenges and often suffer from low statistical power due to multiple test correction. Here, we present a novel, alternative strategy for mapping epistasis: instead of directly identifying individual pairwise or higher-order interactions, we focus on mapping variants that have non-zero marginal epistatic effects—the combined pairwise interaction effects between a given variant and all other variants. By testing marginal epistatic effects, we can identify candidate variants that are involved in epistasis without the need to identify the exact partners with which the variants interact, thus potentially alleviating much of the statistical and computational burden associated with standard epistatic mapping procedures. Our method is based on a variance component model, and relies on a recently developed variance component estimation method for efficient parameter inference and p-value computation. We refer to our method as the “MArginal ePIstasis Test”, or MAPIT. With simulations, we show how MAPIT can be used to estimate and test marginal epistatic effects, produce calibrated test statistics under the null, and facilitate the detection of pairwise epistatic interactions. We further illustrate the benefits of MAPIT in a QTL mapping study by analyzing the gene expression data of over 400 individuals from the GEUVADIS consortium. PMID:28746338
Hsueh, Wen-Chi; He, Qimei; Willcox, D. Craig; Nievergelt, Caroline M.; Donlon, Timothy A.; Kwok, Pui-Yan; Suzuki, Makoto; Willcox, Bradley J.
2014-01-01
Isolated populations have advantages for genetic studies of longevity from decreased haplotype diversity and long-range linkage disequilibrium. This permits smaller sample sizes without loss of power, among other utilities. Little is known about the genome of the Okinawans, a potential population isolate, recognized for longevity. Therefore, we assessed genetic diversity, structure, and admixture in Okinawans, and compared this with Caucasians, Chinese, Japanese, and Africans from HapMap II, genotyped on the same Affymetrix GeneChip Human Mapping 500K array. Principal component analysis, haplotype coverage, and linkage disequilibrium decay revealed a distinct Okinawan genome—more homogeneity, less haplotype diversity, and longer range linkage disequilibrium. Population structure and admixture analyses utilizing 52 global reference populations from the Human Genome Diversity Cell Line Panel demonstrated that Okinawans clustered almost exclusively with East Asians. Sibling relative risk (λs) analysis revealed that siblings of Okinawan centenarians have 3.11 times (females) and 3.77 times (males) more likelihood of centenarianism. These findings suggest that Okinawans are genetically distinct and share several characteristics of a population isolate, which are prone to develop extreme phenotypes (eg, longevity) from genetic drift, natural selection, and population bottlenecks. These data support further exploration of genetic influence on longevity in the Okinawans. PMID:24444611
Shirasawa, Kenta; Hand, Melanie L.; Henderson, Steven T.; Okada, Takashi; Johnson, Susan D.; Taylor, Jennifer M.; Spriggs, Andrew; Siddons, Hayley; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Koltunow, Anna M. G.
2015-01-01
Background and Aims Apomixis in plants generates clonal progeny with a maternal genotype through asexual seed formation. Hieracium subgenus Pilosella (Asteraceae) contains polyploid, highly heterozygous apomictic and sexual species. Within apomictic Hieracium, dominant genetic loci independently regulate the qualitative developmental components of apomixis. In H. praealtum, LOSS OF APOMEIOSIS (LOA) enables formation of embryo sacs without meiosis and LOSS OF PARTHENOGENESIS (LOP) enables fertilization-independent seed formation. A locus required for fertilization-independent endosperm formation (AutE) has been identified in H. piloselloides. Additional quantitative loci appear to influence the penetrance of the qualitative loci, although the controlling genes remain unknown. This study aimed to develop the first genetic linkage maps for sexual and apomictic Hieracium species using simple sequence repeat (SSR) markers derived from expressed transcripts within the developing ovaries. Methods RNA from microdissected Hieracium ovule cell types and ovaries was sequenced and SSRs were identified. Two different F1 mapping populations were created to overcome difficulties associated with genome complexity and asexual reproduction. SSR markers were analysed within each mapping population to generate draft linkage maps for apomictic and sexual Hieracium species. Key Results A collection of 14 684 Hieracium expressed SSR markers were developed and linkage maps were constructed for Hieracium species using a subset of the SSR markers. Both the LOA and LOP loci were successfully assigned to linkage groups; however, AutE could not be mapped using the current populations. Comparisons with lettuce (Lactuca sativa) revealed partial macrosynteny between the two Asteraceae species. Conclusions A collection of SSR markers and draft linkage maps were developed for two apomictic and one sexual Hieracium species. These maps will support cloning of controlling genes at LOA and LOP loci in Hieracium and should also assist with identification of quantitative loci that affect the expressivity of apomixis. Future work will focus on mapping AutE using alternative populations. PMID:25538115
Li, Libei; Zhao, Shuqi; Su, Junji; Fan, Shuli; Pang, Chaoyou; Wei, Hengling; Wang, Hantao; Gu, Lijiao; Zhang, Chi; Liu, Guoyuan; Yu, Dingwei; Liu, Qibao; Zhang, Xianlong; Yu, Shuxun
2017-01-01
Due to China's rapidly increasing population, the total arable land area has dramatically decreased; as a consequence, the competition for farming land allocated for grain and cotton production has become fierce. Therefore, to overcome the existing contradiction between cotton grain and fiber production and the limited farming land, development of early-maturing cultivars is necessary. In this research, a high-density linkage map of upland cotton was constructed using genotyping by sequencing (GBS) to discover single nucleotide polymorphism (SNP) markers associated with early maturity in 170 F2 individuals derived from a cross between LU28 and ZHONG213. The high-density genetic map, which was composed of 3978 SNP markers across the 26 cotton chromosomes, spanned 2480 cM with an average genetic distance of 0.62 cM. Collinearity analysis showed that the genetic map was of high quality and accurate and agreed well with the Gossypium hirsutum reference genome. Based on this high-density linkage map, QTL analysis was performed on cotton early-maturity traits, including FT, FBP, WGP, NFFB, HNFFB and PH. A total 47 QTLs for the six traits were detected; each of these QTLs explained between 2.61% and 32.57% of the observed phenotypic variation. A major region controlling early-maturity traits in Gossypium hirsutum was identified for FT, FBP, WGP, NFFB and HNFFB on chromosome D03. QTL analyses revealed that phenotypic variation explained (PVE) ranged from 10.42% to 32.57%. Two potential candidate genes, Gh_D03G0885 and Gh_D03G0922, were predicted in a stable QTL region and had higher expression levels in the early-maturity variety ZHONG213 than in the late-maturity variety LU28. However, further evidence is required for functional validation. This study could provide useful information for the dissection of early-maturity traits and guide valuable genetic loci for molecular-assisted selection (MAS) in cotton breeding.
Li, Libei; Zhao, Shuqi; Su, Junji; Fan, Shuli; Pang, Chaoyou; Wei, Hengling; Wang, Hantao; Gu, Lijiao; Zhang, Chi; Liu, Guoyuan; Yu, Dingwei; Liu, Qibao; Zhang, Xianlong
2017-01-01
Due to China’s rapidly increasing population, the total arable land area has dramatically decreased; as a consequence, the competition for farming land allocated for grain and cotton production has become fierce. Therefore, to overcome the existing contradiction between cotton grain and fiber production and the limited farming land, development of early-maturing cultivars is necessary. In this research, a high-density linkage map of upland cotton was constructed using genotyping by sequencing (GBS) to discover single nucleotide polymorphism (SNP) markers associated with early maturity in 170 F2 individuals derived from a cross between LU28 and ZHONG213. The high-density genetic map, which was composed of 3978 SNP markers across the 26 cotton chromosomes, spanned 2480 cM with an average genetic distance of 0.62 cM. Collinearity analysis showed that the genetic map was of high quality and accurate and agreed well with the Gossypium hirsutum reference genome. Based on this high-density linkage map, QTL analysis was performed on cotton early-maturity traits, including FT, FBP, WGP, NFFB, HNFFB and PH. A total 47 QTLs for the six traits were detected; each of these QTLs explained between 2.61% and 32.57% of the observed phenotypic variation. A major region controlling early-maturity traits in Gossypium hirsutum was identified for FT, FBP, WGP, NFFB and HNFFB on chromosome D03. QTL analyses revealed that phenotypic variation explained (PVE) ranged from 10.42% to 32.57%. Two potential candidate genes, Gh_D03G0885 and Gh_D03G0922, were predicted in a stable QTL region and had higher expression levels in the early-maturity variety ZHONG213 than in the late-maturity variety LU28. However, further evidence is required for functional validation. This study could provide useful information for the dissection of early-maturity traits and guide valuable genetic loci for molecular-assisted selection (MAS) in cotton breeding. PMID:28809947
Genetic complexity of miscanthus cell wall composition and biomass quality for biofuels.
van der Weijde, Tim; Kamei, Claire L Alvim; Severing, Edouard I; Torres, Andres F; Gomez, Leonardo D; Dolstra, Oene; Maliepaard, Chris A; McQueen-Mason, Simon J; Visser, Richard G F; Trindade, Luisa M
2017-05-25
Miscanthus sinensis is a high yielding perennial grass species with great potential as a bioenergy feedstock. One of the challenges that currently impedes commercial cellulosic biofuel production is the technical difficulty to efficiently convert lignocellulosic biomass into biofuel. The development of feedstocks with better biomass quality will improve conversion efficiency and the sustainability of the value-chain. Progress in the genetic improvement of biomass quality may be substantially expedited by the development of genetic markers associated to quality traits, which can be used in a marker-assisted selection program. To this end, a mapping population was developed by crossing two parents of contrasting cell wall composition. The performance of 182 F1 offspring individuals along with the parents was evaluated in a field trial with a randomized block design with three replicates. Plants were phenotyped for cell wall composition and conversion efficiency characters in the second and third growth season after establishment. A new SNP-based genetic map for M. sinensis was built using a genotyping-by-sequencing (GBS) approach, which resulted in 464 short-sequence uniparental markers that formed 16 linkage groups in the male map and 17 linkage groups in the female map. A total of 86 QTLs for a variety of biomass quality characteristics were identified, 20 of which were detected in both growth seasons. Twenty QTLs were directly associated to different conversion efficiency characters. Marker sequences were aligned to the sorghum reference genome to facilitate cross-species comparisons. Analyses revealed that for some traits previously identified QTLs in sorghum occurred in homologous regions on the same chromosome. In this work we report for the first time the genetic mapping of cell wall composition and bioconversion traits in the bioenergy crop miscanthus. These results are a first step towards the development of marker-assisted selection programs in miscanthus to improve biomass quality and facilitate its use as feedstock for biofuel production.
Shriner, Daniel; Adeyemo, Adebowale; Gerry, Norman P.; Herbert, Alan; Chen, Guanjie; Doumatey, Ayo; Huang, Hanxia; Zhou, Jie; Christman, Michael F.; Rotimi, Charles N.
2009-01-01
Human height is the prototypical polygenic quantitative trait. Recently, several genetic variants influencing adult height were identified, primarily in individuals of East Asian (Chinese Han or Korean) or European ancestry. Here, we examined 152 genetic variants representing 107 independent loci previously associated with adult height for transferability in a well-powered sample of 1,016 unrelated African Americans. When we tested just the reported variants originally identified as associated with adult height in individuals of East Asian or European ancestry, only 8.3% of these loci transferred (p-values≤0.05 under an additive genetic model with directionally consistent effects) to our African American sample. However, when we comprehensively evaluated all HapMap variants in linkage disequilibrium (r 2≥0.3) with the reported variants, the transferability rate increased to 54.1%. The transferability rate was 70.8% for associations originally reported as genome-wide significant and 38.0% for associations originally reported as suggestive. An additional 23 loci were significantly associated but failed to transfer because of directionally inconsistent effects. Six loci were associated with adult height in all three groups. Using differences in linkage disequilibrium patterns between HapMap CEU or CHB reference data and our African American sample, we fine-mapped these six loci, improving both the localization and the annotation of these transferable associations. PMID:20027299
Liu, Shi; Gao, Peng; Zhu, Qianglong; Luan, Feishi; Davis, Angela R.; Wang, Xiaolu
2016-01-01
Cleaved amplified polymorphic sequence (CAPS) markers are useful tools for detecting single nucleotide polymorphisms (SNPs). This study detected and converted SNP sites into CAPS markers based on high-throughput re-sequencing data in watermelon, for linkage map construction and quantitative trait locus (QTL) analysis. Two inbred lines, Cream of Saskatchewan (COS) and LSW-177 had been re-sequenced and analyzed by Perl self-compiled script for CAPS marker development. 88.7% and 78.5% of the assembled sequences of the two parental materials could map to the reference watermelon genome, respectively. Comparative assembled genome data analysis provided 225,693 and 19,268 SNPs and indels between the two materials. 532 pairs of CAPS markers were designed with 16 restriction enzymes, among which 271 pairs of primers gave distinct bands of the expected length and polymorphic bands, via PCR and enzyme digestion, with a polymorphic rate of 50.94%. Using the new CAPS markers, an initial CAPS-based genetic linkage map was constructed with the F2 population, spanning 1836.51 cM with 11 linkage groups and 301 markers. 12 QTLs were detected related to fruit flesh color, length, width, shape index, and brix content. These newly CAPS markers will be a valuable resource for breeding programs and genetic studies of watermelon. PMID:27162496
Renaud, Karine M.; Tucker, Robert D.; Peters, Stephen G.; Stettner, Will R.; Masonic, Linda M.; Moran, Thomas W.
2011-01-01
This map is a modified version of Geological-structural map of Hajigak iron-ore deposit, scale 1:10,000, which was compiled by M.S. Smirnov and I.K. Kusov in 1965. (Refer to the References Cited section in the Map PDF for complete citations of the original map and a related report.) USGS scientists, in cooperation with the Afghan Geological Survey and the Task Force for Business and Stability Operations of the U.S. Department of Defense, studied the original documents and also visited the field area in November 2009. This modified map illustrates the geological structure of the Haji-Gak iron deposit and includes cross sections of the same area. The map reproduces the topology (contacts, faults, and so forth) of the original Soviet map and cross sections and includes modifications based on our examination of these documents. Elevations on the cross sections are derived from the original Soviet topography and may not match the newer topography used on the current map. We have attempted to translate the original Russian terminology and rock classification into modern English geologic usage as literally as possible without changing any genetic or process-oriented implications in the original descriptions. We also use the age designations from the original map. The unit colors on the map and cross sections differ from the colors shown on the original version. The units are colored according to the color and pattern scheme of the Commission for the Geological Map of the World (CGMW) (http://www.ccgm.org).
Pool, John E
2015-12-01
North American populations of Drosophila melanogaster derive from both European and African source populations, but despite their importance for genetic research, patterns of ancestry along their genomes are largely undocumented. Here, I infer geographic ancestry along genomes of the Drosophila Genetic Reference Panel (DGRP) and the D. melanogaster reference genome, which may have implications for reference alignment, association mapping, and population genomic studies in Drosophila. Overall, the proportion of African ancestry was estimated to be 20% for the DGRP and 9% for the reference genome. Combining my estimate of admixture timing with historical records, I provide the first estimate of natural generation time for this species (approximately 15 generations per year). Ancestry levels were found to vary strikingly across the genome, with less African introgression on the X chromosome, in regions of high recombination, and at genes involved in specific processes (e.g., circadian rhythm). An important role for natural selection during the admixture process was further supported by evidence that many unlinked pairs of loci showed a deficiency of Africa-Europe allele combinations between them. Numerous epistatic fitness interactions may therefore exist between African and European genotypes, leading to ongoing selection against incompatible variants. By focusing on hubs in this network of fitness interactions, I identified a set of interacting loci that include genes with roles in sensation and neuropeptide/hormone reception. These findings suggest that admixed D. melanogaster samples could become an important study system for the genetics of early-stage isolation between populations. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
HGDP and HapMap Analysis by Ancestry Mapper Reveals Local and Global Population Relationships
Magalhães, Tiago R.; Casey, Jillian P.; Conroy, Judith; Regan, Regina; Fitzpatrick, Darren J.; Shah, Naisha; Sobral, João; Ennis, Sean
2012-01-01
Knowledge of human origins, migrations, and expansions is greatly enhanced by the availability of large datasets of genetic information from different populations and by the development of bioinformatic tools used to analyze the data. We present Ancestry Mapper, which we believe improves on existing methods, for the assignment of genetic ancestry to an individual and to study the relationships between local and global populations. The principle function of the method, named Ancestry Mapper, is to give each individual analyzed a genetic identifier, made up of just 51 genetic coordinates, that corresponds to its relationship to the HGDP reference population. As a consequence, the Ancestry Mapper Id (AMid) has intrinsic biological meaning and provides a tool to measure similarity between world populations. We applied Ancestry Mapper to a dataset comprised of the HGDP and HapMap data. The results show distinctions at the continental level, while simultaneously giving details at the population level. We clustered AMids of HGDP/HapMap and observe a recapitulation of human migrations: for a small number of clusters, individuals are grouped according to continental origins; for a larger number of clusters, regional and population distinctions are evident. Calculating distances between AMids allows us to infer ancestry. The number of coordinates is expandable, increasing the power of Ancestry Mapper. An R package called Ancestry Mapper is available to apply this method to any high density genomic data set. PMID:23189146
HGDP and HapMap analysis by Ancestry Mapper reveals local and global population relationships.
Magalhães, Tiago R; Casey, Jillian P; Conroy, Judith; Regan, Regina; Fitzpatrick, Darren J; Shah, Naisha; Sobral, João; Ennis, Sean
2012-01-01
Knowledge of human origins, migrations, and expansions is greatly enhanced by the availability of large datasets of genetic information from different populations and by the development of bioinformatic tools used to analyze the data. We present Ancestry Mapper, which we believe improves on existing methods, for the assignment of genetic ancestry to an individual and to study the relationships between local and global populations. The principle function of the method, named Ancestry Mapper, is to give each individual analyzed a genetic identifier, made up of just 51 genetic coordinates, that corresponds to its relationship to the HGDP reference population. As a consequence, the Ancestry Mapper Id (AMid) has intrinsic biological meaning and provides a tool to measure similarity between world populations. We applied Ancestry Mapper to a dataset comprised of the HGDP and HapMap data. The results show distinctions at the continental level, while simultaneously giving details at the population level. We clustered AMids of HGDP/HapMap and observe a recapitulation of human migrations: for a small number of clusters, individuals are grouped according to continental origins; for a larger number of clusters, regional and population distinctions are evident. Calculating distances between AMids allows us to infer ancestry. The number of coordinates is expandable, increasing the power of Ancestry Mapper. An R package called Ancestry Mapper is available to apply this method to any high density genomic data set.
Construction of the third-generation Zea mays haplotype map
Bukowski, Robert; Guo, Xiaosen; Lu, Yanli; Zou, Cheng; He, Bing; Rong, Zhengqin; Wang, Bo; Xu, Dawen; Yang, Bicheng; Xie, Chuanxiao; Fan, Longjiang; Gao, Shibin; Xu, Xun; Zhang, Gengyun; Li, Yingrui; Jiao, Yinping; Doebley, John F; Ross-Ibarra, Jeffrey; Lorant, Anne; Buffalo, Vince; Romay, M Cinta; Buckler, Edward S; Ware, Doreen; Lai, Jinsheng; Sun, Qi
2017-01-01
Abstract Background Characterization of genetic variations in maize has been challenging, mainly due to deterioration of collinearity between individual genomes in the species. An international consortium of maize research groups combined resources to develop the maize haplotype version 3 (HapMap 3), built from whole-genome sequencing data from 1218 maize lines, covering predomestication and domesticated Zea mays varieties across the world. Results A new computational pipeline was set up to process more than 12 trillion bp of sequencing data, and a set of population genetics filters was applied to identify more than 83 million variant sites. Conclusions We identified polymorphisms in regions where collinearity is largely preserved in the maize species. However, the fact that the B73 genome used as the reference only represents a fraction of all haplotypes is still an important limiting factor. PMID:29300887
YouGenMap: a web platform for dynamic multi-comparative mapping and visualization of genetic maps
Keith Batesole; Kokulapalan Wimalanathan; Lin Liu; Fan Zhang; Craig S. Echt; Chun Liang
2014-01-01
Comparative genetic maps are used in examination of genome organization, detection of conserved gene order, and exploration of marker order variations. YouGenMap is an open-source web tool that offers dynamic comparative mapping capability of users' own genetic mapping between 2 or more map sets. Users' genetic map data and optional gene annotations are...
Raboanatahiry, Nadia; Chao, Hongbo; Guo, Liangxing; Gan, Jianping; Xiang, Jun; Yan, Mingli; Zhang, Libin; Yu, Longjiang; Li, Maoteng
2017-10-12
Deciphering the genetic architecture of a species is a good way to understand its evolutionary history, but also to tailor its profile for breeding elite cultivars with desirable traits. Aligning QTLs from diverse population in one map and utilizing it for comparison, but also as a basis for multiple analyses assure a stronger evidence to understand the genetic system related to a given phenotype. In this study, 439 genes involved in fatty acid (FA) and triacylglycerol (TAG) biosyntheses were identified in Brassica napus. B. napus genome showed mixed gene loss and insertion compared to B. rapa and B. oleracea, and C genome had more inserted genes. Identified QTLs for oil (OC-QTLs) and fatty acids (FA-QTLs) from nine reported populations were projected on the physical map of the reference genome "Darmor-bzh" to generate a map. Thus, 335 FA-QTLs and OC-QTLs could be highlighted and 82 QTLs were overlapping. Chromosome C3 contained 22 overlapping QTLs with all trait studied except for C18:3. In total, 218 candidate genes which were potentially involved in FA and TAG were identified in 162 QTLs confidence intervals and some of them might affect many traits. Also, 76 among these candidate genes were found inside 57 overlapping QTLs, and candidate genes for oil content were in majority (61/76 genes). Then, sixteen genes were found in overlapping QTLs involving three populations, and the remaining 60 genes were found in overlapping QTLs of two populations. Interaction network and pathway analysis of these candidate genes indicated ten genes that might have strong influence over the other genes that control fatty acids and oil formation. The present results provided new information for genetic basis of FA and TAG formation in B. napus. A map including QTLs from numerous populations was built, which could serve as reference to study the genome profile of B. napus, and new potential genes emerged which might affect seed oil. New useful tracks were showed for the selection of population or/and selection of interesting genes for breeding improvement purpose.
2014-01-01
Background The Bactrocera dorsalis species complex currently harbors approximately 90 different members. The species complex has undergone many revisions in the past decades, and there is still an ongoing debate about the species limits. The availability of a variety of tools and approaches, such as molecular-genomic and cytogenetic analyses, are expected to shed light on the rather complicated issues of species complexes and incipient speciation. The clarification of genetic relationships among the different members of this complex is a prerequisite for the rational application of sterile insect technique (SIT) approaches for population control. Results Colonies established in the Insect Pest Control Laboratory (IPCL) (Seibersdorf, Vienna), representing five of the main economic important members of the Bactrocera dorsalis complex were cytologically characterized. The taxa under study were B. dorsalis s.s., B. philippinensis, B. papayae, B. invadens and B. carambolae. Mitotic and polytene chromosome analyses did not reveal any chromosomal characteristics that could be used to distinguish between the investigated members of the B. dorsalis complex. Therefore, their polytene chromosomes can be regarded as homosequential with the reference maps of B. dorsalis s.s.. In situ hybridization of six genes further supported the proposed homosequentiallity of the chromosomes of these specific members of the complex. Conclusions The present analysis supports that the polytene chromosomes of the five taxa under study are homosequential. Therefore, the use of the available polytene chromosome maps for B. dorsalis s.s. as reference maps for all these five biological entities is proposed. Present data provide important insight in the genetic relationships among the different members of the B. dorsalis complex, and, along with other studies in the field, can facilitate SIT applications targeting this complex. Moreover, the availability of 'universal' reference polytene chromosome maps for members of the complex, along with the documented application of in situ hybridization, can facilitate ongoing and future genome projects in this complex. PMID:25471636
The Arab genome: Health and wealth.
Zayed, Hatem
2016-11-05
The 22 Arab nations have a unique genetic structure, which reflects both conserved and diverse gene pools due to the prevalent endogamous and consanguineous marriage culture and the long history of admixture among different ethnic subcultures descended from the Asian, European, and African continents. Human genome sequencing has enabled large-scale genomic studies of different populations and has become a powerful tool for studying disease predictions and diagnosis. Despite the importance of the Arab genome for better understanding the dynamics of the human genome, discovering rare genetic variations, and studying early human migration out of Africa, it is poorly represented in human genome databases, such as HapMap and the 1000 Genomes Project. In this review, I demonstrate the significance of sequencing the Arab genome and setting an Arab genome reference(s) for better understanding the molecular pathogenesis of genetic diseases, discovering novel/rare variants, and identifying a meaningful genotype-phenotype correlation for complex diseases. Copyright © 2016. Published by Elsevier B.V.
USDA-ARS?s Scientific Manuscript database
A bacterial artificial chromosome (BAC) library and BAC-end sequences for Gossypium hirsutum L. have recently been developed. Here we report on genomic-based genome-wide SNP mining utilizing re-sequencing data with a BAC-end sequence reference for twelve G. hirsutum L. lines, one G. barbadense L. li...
Brereton, Nicholas J. B.; Marleau, Julie; Nissim, Werther Guidi; Labrecque, Michel; Joly, Simon; Pitre, Frederic E.
2016-01-01
Metatranscriptomic study of nonmodel organisms requires strategies that retain the highly resolved genetic information generated from model organisms while allowing for identification of the unexpected. A real-world biological application of phytoremediation, the field growth of 10 Salix cultivars on polluted soils, was used as an exemplar nonmodel and multifaceted crop response well-disposed to the study of gene expression. Sequence reads were assembled de novo to create 10 independent transcriptomes, a global transcriptome, and were mapped against the Salix purpurea 94006 reference genome. Annotation of assembled contigs was performed without a priori assumption of the originating organism. Global transcriptome construction from 3.03 billion paired-end reads revealed 606,880 unique contigs annotated from 1588 species, often common in all 10 cultivars. Comparisons between transcriptomic and metatranscriptomic methodologies provide clear evidence that nonnative RNA can mistakenly map to reference genomes, especially to conserved regions of common housekeeping genes, such as actin, α/β-tubulin, and elongation factor 1-α. In Salix, Rubisco activase transcripts were down-regulated in contaminated trees across all 10 cultivars, whereas thiamine thizole synthase and CP12, a Calvin Cycle master regulator, were uniformly up-regulated. De novo assembly approaches, with unconstrained annotation, can improve data quality; care should be taken when exploring such plant genetics to reduce de facto data exclusion by mapping to a single reference genome alone. Salix gene expression patterns strongly suggest cultivar-wide alteration of specific photosynthetic apparatus and protection of the antenna complexes from oxidation damage in contaminated trees, providing an insight into common stress tolerance strategies in a real-world phytoremediation system. PMID:27002060
Tucker, Robert D.; Peters, Stephen G.; Schulz, Klaus J.; Renaud, Karine M.; Stettner, Will R.; Masonic, Linda M.; Packard, Patricia H.
2011-01-01
This map is a modified version of the Geological map of the Khanneshin carbonatite complex, scale 1:10,000, which was compiled by V.G. Cheremytsin in 1976. Scientists from the U.S. Geological Survey, in cooperation with the Afghan Geological Survey and the Task Force for Business and Stability Operations of the U.S. Department of Defense, studied the original map and also visited the field area in September 2009, August 2010, and February 2011. This modified map, which includes cross sections, illustrates the geologic structure of the Khanneshin carbonatite complex. The map reproduces the topology (contacts, faults, and so forth) of the original Soviet map and cross sections and includes modifications based on our examination of that map and a related report, and based on observations made during our field visits. (Refer to the References section in the Map PDF for complete citations of the original map and related report.) Elevations on the cross section are derived from the original Soviet topography and may not match the newer topography used on the current map. We have attempted to translate the original Russian terminology and rock classification into modern English geologic usage as literally as possible without changing any genetic or process-oriented implications in the original descriptions. We also use the age designations from the original map. The unit colors on the map and cross sections differ from the colors shown on the original version. The units are colored according to the color and pattern scheme of the Commission for the Geological Map of the World (CGMW) (http://www.ccgm.org).
Population differences in the rate of proliferation of international HapMap cell lines.
Stark, Amy L; Zhang, Wei; Zhou, Tong; O'Donnell, Peter H; Beiswanger, Christine M; Huang, R Stephanie; Cox, Nancy J; Dolan, M Eileen
2010-12-10
The International HapMap Project is a resource for researchers containing genotype, sequencing, and expression information for EBV-transformed lymphoblastoid cell lines derived from populations across the world. The expansion of the HapMap beyond the four initial populations of Phase 2, referred to as Phase 3, has increased the sample number and ethnic diversity available for investigation. However, differences in the rate of cellular proliferation between the populations can serve as confounders in phenotype-genotype studies using these cell lines. Within the Phase 2 populations, the JPT and CHB cell lines grow faster (p < 0.0001) than the CEU or YRI cell lines. Phase 3 YRI cell lines grow significantly slower than Phase 2 YRI lines (p < 0.0001), with no widespread genetic differences based on common SNPs. In addition, we found significant growth differences between the cell lines in the Phase 2 ASN populations and the Han Chinese from the Denver metropolitan area panel in Phase 3 (p < 0.0001). Therefore, studies that separate HapMap panels into discovery and replication sets must take this into consideration. Copyright © 2010 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Bink, Marco CAM; van Heerwaarden, Joost; Chancerel, Emilie; Boury, Christophe; Lesur, Isabelle; Isik, Fikret; Bouffier, Laurent; Plomion, Christophe
2016-01-01
Background Increasing our understanding of the genetic architecture of complex traits, through analyses of genotype-phenotype associations and of the genes/polymorphisms accounting for trait variation, is crucial, to improve the integration of molecular markers into forest tree breeding. In this study, two full-sib families and one breeding population of maritime pine were used to identify quantitative trait loci (QTLs) for height growth and stem straightness, through linkage analysis (LA) and linkage disequilibrium (LD) mapping approaches. Results The populations used for LA consisted of two unrelated three-generation full-sib families (n = 197 and n = 477). These populations were assessed for height growth or stem straightness and genotyped for 248 and 217 markers, respectively. The population used for LD mapping consisted of 661 founders of the first and second generations of the breeding program. This population was phenotyped for the same traits and genotyped for 2,498 single-nucleotide polymorphism (SNP) markers corresponding to 1,652 gene loci. The gene-based reference genetic map of maritime pine was used to localize and compare the QTLs detected by the two approaches, for both traits. LA identified three QTLs for stem straightness and two QTLs for height growth. The LD study yielded seven significant associations (P ≤ 0.001): four for stem straightness and three for height growth. No colocalisation was found between QTLs identified by LA and SNPs detected by LD mapping for the same trait. Conclusions This study provides the first comparison of LA and LD mapping approaches in maritime pine, highlighting the complementary nature of these two approaches for deciphering the genetic architecture of two mandatory traits of the breeding program. PMID:27806077
Bartholomé, Jérôme; Bink, Marco Cam; van Heerwaarden, Joost; Chancerel, Emilie; Boury, Christophe; Lesur, Isabelle; Isik, Fikret; Bouffier, Laurent; Plomion, Christophe
2016-01-01
Increasing our understanding of the genetic architecture of complex traits, through analyses of genotype-phenotype associations and of the genes/polymorphisms accounting for trait variation, is crucial, to improve the integration of molecular markers into forest tree breeding. In this study, two full-sib families and one breeding population of maritime pine were used to identify quantitative trait loci (QTLs) for height growth and stem straightness, through linkage analysis (LA) and linkage disequilibrium (LD) mapping approaches. The populations used for LA consisted of two unrelated three-generation full-sib families (n = 197 and n = 477). These populations were assessed for height growth or stem straightness and genotyped for 248 and 217 markers, respectively. The population used for LD mapping consisted of 661 founders of the first and second generations of the breeding program. This population was phenotyped for the same traits and genotyped for 2,498 single-nucleotide polymorphism (SNP) markers corresponding to 1,652 gene loci. The gene-based reference genetic map of maritime pine was used to localize and compare the QTLs detected by the two approaches, for both traits. LA identified three QTLs for stem straightness and two QTLs for height growth. The LD study yielded seven significant associations (P ≤ 0.001): four for stem straightness and three for height growth. No colocalisation was found between QTLs identified by LA and SNPs detected by LD mapping for the same trait. This study provides the first comparison of LA and LD mapping approaches in maritime pine, highlighting the complementary nature of these two approaches for deciphering the genetic architecture of two mandatory traits of the breeding program.
Sun, Lidan; Yang, Weiru; Zhang, Qixiang; Cheng, Tangren; Pan, Huitang; Xu, Zongda; Zhang, Jie; Chen, Chuguang
2013-01-01
Because of its popularity as an ornamental plant in East Asia, mei (Prunus mume Sieb. et Zucc.) has received increasing attention in genetic and genomic research with the recent shotgun sequencing of its genome. Here, we performed the genome-wide characterization of simple sequence repeats (SSRs) in the mei genome and detected a total of 188,149 SSRs occurring at a frequency of 794 SSR/Mb. Mononucleotide repeats were the most common type of SSR in genomic regions, followed by di- and tetranucleotide repeats. Most of the SSRs in coding sequences (CDS) were composed of tri- or hexanucleotide repeat motifs, but mononucleotide repeats were always the most common in intergenic regions. Genome-wide comparison of SSR patterns among the mei, strawberry (Fragaria vesca), and apple (Malus×domestica) genomes showed mei to have the highest density of SSRs, slightly higher than that of strawberry (608 SSR/Mb) and almost twice as high as that of apple (398 SSR/Mb). Mononucleotide repeats were the dominant SSR motifs in the three Rosaceae species. Using 144 SSR markers, we constructed a 670 cM-long linkage map of mei delimited into eight linkage groups (LGs), with an average marker distance of 5 cM. Seventy one scaffolds covering about 27.9% of the assembled mei genome were anchored to the genetic map, depending on which the macro-colinearity between the mei genome and Prunus T×E reference map was identified. The framework map of mei constructed provides a first step into subsequent high-resolution genetic mapping and marker-assisted selection for this ornamental species. PMID:23555708
Chen, J W; Wang, L; Pang, X F; Pan, Q H
2006-04-01
Genetic analysis and fine mapping of a resistance gene against brown planthopper (BPH) biotype 2 in rice was performed using two F(2) populations derived from two crosses between a resistant indica cultivar (cv.), AS20-1, and two susceptible japonica cvs., Aichi Asahi and Lijiangxintuanheigu. Insect resistance was evaluated using F(1) plants and the two F(2) populations. The results showed that a single recessive gene, tentatively designated as bph19(t), conditioned the resistance in AS20-1. A linkage analysis, mainly employing microsatellite markers, was carried out in the two F(2) populations through bulked segregant analysis and recessive class analysis (RCA), in combination with bioinformatics analysis (BIA). The resistance gene locus bph19(t) was finely mapped to a region of about 1.0 cM on the short arm of chromosome 3, flanked by markers RM6308 and RM3134, where one known marker RM1022, and four new markers, b1, b2, b3 and b4, developed in the present study were co-segregating with the locus. To physically map this locus, the bph19(t)-linked markers were landed on bacterial artificial chromosome or P1 artificial chromosome clones of the reference cv., Nipponbare, released by the International Rice Genome Sequencing Project. Sequence information of these clones was used to construct a physical map of the bph19(t) locus, in silico, by BIA. The bph19(t) locus was physically defined to an interval of about 60 kb. The detailed genetic and physical maps of the bph19(t) locus will facilitate marker-assisted gene pyramiding and cloning.
Tulpová, Zuzana; Luo, Ming-Cheng; Toegelová, Helena; Visendi, Paul; Hayashi, Satomi; Vojta, Petr; Paux, Etienne; Kilian, Andrzej; Abrouk, Michaël; Bartoš, Jan; Hajdúch, Marián; Batley, Jacqueline; Edwards, David; Doležel, Jaroslav; Šimková, Hana
2018-03-08
Bread wheat (Triticum aestivum L.) is a staple food for a significant part of the world's population. The growing demand on its production can be satisfied by improving yield and resistance to biotic and abiotic stress. Knowledge of the genome sequence would aid in discovering genes and QTLs underlying these traits and provide a basis for genomics-assisted breeding. Physical maps and BAC clones associated with them have been valuable resources from which to generate a reference genome of bread wheat and to assist map-based gene cloning. As a part of a joint effort coordinated by the International Wheat Genome Sequencing Consortium, we have constructed a BAC-based physical map of bread wheat chromosome arm 7DS consisting of 895 contigs and covering 94% of its estimated length. By anchoring BAC contigs to one radiation hybrid map and three high resolution genetic maps, we assigned 73% of the assembly to a distinct genomic position. This map integration, interconnecting a total of 1713 markers with ordered and sequenced BAC clones from a minimal tiling path, provides a tool to speed up gene cloning in wheat. The process of physical map assembly included the integration of the 7DS physical map with a whole-genome physical map of Aegilops tauschii and a 7DS Bionano genome map, which together enabled efficient scaffolding of physical-map contigs, even in the non-recombining region of the genetic centromere. Moreover, this approach facilitated a comparison of bread wheat and its ancestor at BAC-contig level and revealed a reconstructed region in the 7DS pericentromere. Copyright © 2018. Published by Elsevier B.V.
2014-01-01
Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applied to simulated and real datasets, we validate that our method achieves greater sensitivity while retaining low false discovery rates compared to previous methods. PMID:24708878
Mariette, Stéphanie; Wong Jun Tai, Fabienne; Roch, Guillaume; Barre, Aurélien; Chague, Aurélie; Decroocq, Stéphane; Groppi, Alexis; Laizet, Yec'han; Lambert, Patrick; Tricon, David; Nikolski, Macha; Audergon, Jean-Marc; Abbott, Albert G; Decroocq, Véronique
2016-01-01
In fruit tree species, many important traits have been characterized genetically by using single-family descent mapping in progenies segregating for the traits. However, most mapped loci have not been sufficiently resolved to the individual genes due to insufficient progeny sizes for high resolution mapping and the previous lack of whole-genome sequence resources of the study species. To address this problem for Plum Pox Virus (PPV) candidate resistance gene identification in Prunus species, we implemented a genome-wide association (GWA) approach in apricot. This study exploited the broad genetic diversity of the apricot (Prunus armeniaca) germplasm containing resistance to PPV, next-generation sequence-based genotyping, and the high-quality peach (Prunus persica) genome reference sequence for single nucleotide polymorphism (SNP) identification. The results of this GWA study validated previously reported PPV resistance quantitative trait loci (QTL) intervals, highlighted other potential resistance loci, and resolved each to a limited set of candidate genes for further study. This work substantiates the association genetics approach for resolution of QTL to candidate genes in apricot and suggests that this approach could simplify identification of other candidate genes for other marked trait intervals in this germplasm. © 2015 INRA, UMR 1332 BFP New Phytologist © 2015 New Phytologist Trust.
A Reference Proteomic Database of Lactobacillus plantarum CMCC-P0002
Tian, Wanhong; Yu, Gang; Liu, Xiankai; Wang, Jie; Feng, Erling; Zhang, Xuemin; Chen, Bei; Zeng, Ming; Wang, Hengliang
2011-01-01
Lactobacillus plantarum is a widespread probiotic bacteria found in many fermented food products. In this study, the whole-cell proteins and secretory proteins of L. plantarum were separated by two-dimensional electrophoresis method. A total of 434 proteins were identified by tandem mass spectrometry, including a plasmid-encoded hypothetical protein pLP9000_05. The information of first 20 highest abundance proteins was listed for the further genetic manipulation of L. plantarum, such as construction of high-level expressions system. Furthermore, the first interaction map of L. plantarum was established by Blue-Native/SDS-PAGE technique. A heterodimeric complex composed of maltose phosphorylase Map3 and Map2, and two homodimeric complexes composed of Map3 and Map2 respectively, were identified at the same time, indicating the important roles of these proteins. These findings provided valuable information for the further proteomic researches of L. plantarum. PMID:21998671
A reference proteomic database of Lactobacillus plantarum CMCC-P0002.
Zhu, Li; Hu, Wei; Liu, Datao; Tian, Wanhong; Yu, Gang; Liu, Xiankai; Wang, Jie; Feng, Erling; Zhang, Xuemin; Chen, Bei; Zeng, Ming; Wang, Hengliang
2011-01-01
Lactobacillus plantarum is a widespread probiotic bacteria found in many fermented food products. In this study, the whole-cell proteins and secretory proteins of L. plantarum were separated by two-dimensional electrophoresis method. A total of 434 proteins were identified by tandem mass spectrometry, including a plasmid-encoded hypothetical protein pLP9000_05. The information of first 20 highest abundance proteins was listed for the further genetic manipulation of L. plantarum, such as construction of high-level expressions system. Furthermore, the first interaction map of L. plantarum was established by Blue-Native/SDS-PAGE technique. A heterodimeric complex composed of maltose phosphorylase Map3 and Map2, and two homodimeric complexes composed of Map3 and Map2 respectively, were identified at the same time, indicating the important roles of these proteins. These findings provided valuable information for the further proteomic researches of L. plantarum.
Renaud, Karine M.; Tucker, Robert D.; Peters, Stephen G.; Stettner, Will R.; Masonic, Linda M.; Moran, Thomas W.
2011-01-01
This map is a modified version of Geologic-prospecting plan of western area of Hajigak iron-ore deposit, scale 1:2,000, which was compiled by V.V. Reshetniak and I.K. Kusov in 1965. (Refer to the References Cited section in the Map PDF for complete citations of the original map and related reports.) USGS scientists, in cooperation with the Afghan Geological Survey and the Task Force for Business and Stability Operations of the U.S. Department of Defense, studied the original documents and also visited the field area in November 2009. This modified map illustrates the geological structure of the western Haji-Gak iron deposit and includes cross sections of the same area. The map reproduces the topology (contacts, faults, and so forth) of the original Soviet map and includes modifications based on our examination of that document. We constructed the cross sections from data derived from the original map. Elevations on the cross sections are derived from the original Soviet topography and may not match the newer topography used on the current map. We have attempted to translate the original Russian terminology and rock classification into modern English geologic usage as literally as possible without changing any genetic or process-oriented implications in the original descriptions. We also use the age designations from the original map. The unit colors on the map and cross sections differ from the colors shown on the original version. The units are colored according to the color and pattern scheme of the Commission for the Geological Map of the World (CGMW) (http://www.ccgm.org).
Daware, Anurag; Das, Sweta; Srivastava, Rishi; Badoni, Saurabh; Singh, Ashok K.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.
2016-01-01
Development and use of genome-wide informative simple sequence repeat (SSR) markers and novel integrated genomic strategies are vital to drive genomics-assisted breeding applications and for efficient dissection of quantitative trait loci (QTLs) underlying complex traits in rice. The present study developed 6244 genome-wide informative SSR markers exhibiting in silico fragment length polymorphism based on repeat-unit variations among genomic sequences of 11 indica, japonica, aus, and wild rice accessions. These markers were mapped on diverse coding and non-coding sequence components of known cloned/candidate genes annotated from 12 chromosomes and revealed a much higher amplification (97%) and polymorphic potential (88%) along with wider genetic/functional diversity level (16–74% with a mean 53%) especially among accessions belonging to indica cultivar group, suggesting their utility in large-scale genomics-assisted breeding applications in rice. A high-density 3791 SSR markers-anchored genetic linkage map (IR 64 × Sonasal) spanning 2060 cM total map-length with an average inter-marker distance of 0.54 cM was generated. This reference genetic map identified six major genomic regions harboring robust QTLs (31% combined phenotypic variation explained with a 5.7–8.7 LOD) governing grain weight on six rice chromosomes. One strong grain weight major QTL region (OsqGW5.1) was narrowed-down by integrating traditional QTL mapping with high-resolution QTL region-specific integrated SSR and single nucleotide polymorphism markers-based QTL-seq analysis and differential expression profiling. This led us to delineate two natural allelic variants in two known cis-regulatory elements (RAV1AAT and CARGCW8GAT) of glycosyl hydrolase and serine carboxypeptidase genes exhibiting pronounced seed-specific differential regulation in low (Sonasal) and high (IR 64) grain weight mapping parental accessions. Our genome-wide SSR marker resource (polymorphic within/between diverse cultivar groups) and integrated genomic strategy can efficiently scan functionally relevant potential molecular tags (markers, candidate genes and alleles) regulating complex agronomic traits (grain weight) and expedite marker-assisted genetic enhancement in rice. PMID:27833617
2012-01-01
Background Cotton is the world’s most important natural textile fiber and a significant oilseed crop. Decoding cotton genomes will provide the ultimate reference and resource for research and utilization of the species. Integration of high-density genetic maps with genomic sequence information will largely accelerate the process of whole-genome assembly in cotton. Results In this paper, we update a high-density interspecific genetic linkage map of allotetraploid cultivated cotton. An additional 1,167 marker loci have been added to our previously published map of 2,247 loci. Three new marker types, InDel (insertion-deletion) and SNP (single nucleotide polymorphism) developed from gene information, and REMAP (retrotransposon-microsatellite amplified polymorphism), were used to increase map density. The updated map consists of 3,414 loci in 26 linkage groups covering 3,667.62 cM with an average inter-locus distance of 1.08 cM. Furthermore, genome-wide sequence analysis was finished using 3,324 informative sequence-based markers and publicly-available Gossypium DNA sequence information. A total of 413,113 EST and 195 BAC sequences were physically anchored and clustered by 3,324 sequence-based markers. Of these, 14,243 ESTs and 188 BACs from different species of Gossypium were clustered and specifically anchored to the high-density genetic map. A total of 2,748 candidate unigenes from 2,111 ESTs clusters and 63 BACs were mined for functional annotation and classification. The 337 ESTs/genes related to fiber quality traits were integrated with 132 previously reported cotton fiber quality quantitative trait loci, which demonstrated the important roles in fiber quality of these genes. Higher-level sequence conservation between different cotton species and between the A- and D-subgenomes in tetraploid cotton was found, indicating a common evolutionary origin for orthologous and paralogous loci in Gossypium. Conclusion This study will serve as a valuable genomic resource for tetraploid cotton genome assembly, for cloning genes related to superior agronomic traits, and for further comparative genomic analyses in Gossypium. PMID:23046547
Wang, Chao; Shi, Xue; Liu, Lin; Li, Haiyan; Ammiraju, Jetty S S; Kudrna, David A; Xiong, Wentao; Wang, Hao; Dai, Zhaozhao; Zheng, Yonglian; Lai, Jinsheng; Jin, Weiwei; Messing, Joachim; Bennetzen, Jeffrey L; Wing, Rod A; Luo, Meizhong
2013-11-01
Maize is one of the most important food crops and a key model for genetics and developmental biology. A genetically anchored and high-quality draft genome sequence of maize inbred B73 has been obtained to serve as a reference sequence. To facilitate evolutionary studies in maize and its close relatives, much like the Oryza Map Alignment Project (OMAP) (www.OMAP.org) bacterial artificial chromosome (BAC) resource did for the rice community, we constructed BAC libraries for maize inbred lines Zheng58, Chang7-2, and Mo17 and maize wild relatives Zea mays ssp. parviglumis and Tripsacum dactyloides. Furthermore, to extend functional genomic studies to maize and sorghum, we also constructed binary BAC (BIBAC) libraries for the maize inbred B73 and the sorghum landrace Nengsi-1. The BAC/BIBAC vectors facilitate transfer of large intact DNA inserts from BAC clones to the BIBAC vector and functional complementation of large DNA fragments. These seven Zea Map Alignment Project (ZMAP) BAC/BIBAC libraries have average insert sizes ranging from 92 to 148 kb, organellar DNA from 0.17 to 2.3%, empty vector rates between 0.35 and 5.56%, and genome equivalents of 4.7- to 8.4-fold. The usefulness of the Parviglumis and Tripsacum BAC libraries was demonstrated by mapping clones to the reference genome. Novel genes and alleles present in these ZMAP libraries can now be used for functional complementation studies and positional or homology-based cloning of genes for translational genomics.
Cadle-Davidson, Lance; Gadoury, David; Fresnedo-Ramírez, Jonathan; Yang, Shanshan; Barba, Paola; Sun, Qi; Demmings, Elizabeth M; Seem, Robert; Schaub, Michelle; Nowogrodzki, Anna; Kasinathan, Hema; Ledbetter, Craig; Reisch, Bruce I
2016-10-01
The genomics era brought unprecedented opportunities for genetic analysis of host resistance, but it came with the challenge that accurate and reproducible phenotypes are needed so that genomic results appropriately reflect biology. Phenotyping host resistance by natural infection in the field can produce variable results due to the uncontrolled environment, uneven distribution and genetics of the pathogen, and developmentally regulated resistance among other factors. To address these challenges, we developed highly controlled, standardized methodologies for phenotyping powdery mildew resistance in the context of a phenotyping center, receiving samples of up to 140 grapevine progeny per F 1 family. We applied these methodologies to F 1 families segregating for REN1- or REN2-mediated resistance and validated that some but not all bioassays identified the REN1 or REN2 locus. A point-intercept method (hyphal transects) to quantify colony density objectively at 8 or 9 days postinoculation proved to be the phenotypic response most reproducibly predicted by these resistance loci. Quantitative trait locus (QTL) mapping with genotyping-by-sequencing maps defined the REN1 and REN2 loci at relatively high resolution. In the reference PN40024 genome under each QTL, nucleotide-binding site-leucine-rich repeat candidate resistance genes were identified-one gene for REN1 and two genes for REN2. The methods described here for centralized resistance phenotyping and high-resolution genetic mapping can inform strategies for breeding resistance to powdery mildews and other pathogens on diverse, highly heterozygous hosts.
Roy, Neha Samir; Park, Kyong-Cheul; Lee, Sung-Il; Im, Min-Ji; Ramekar, Rahul Vasudeo; Kim, Nam-Soo
2018-02-01
Molecular marker technologies have proven to be an important breakthrough for genetic studies, construction of linkage maps and population genetics analysis. Transposable elements (TEs) constitute major fractions of repetitive sequences in plants and offer a wide range of possible areas to be explored as molecular markers. Sequence characterized amplified region (SCAR) marker development provides us with a simple and time saving alternative approach for marker development. We employed the CACTA-TD to develop SCARs and then integrated them into linkage map and used them for population structure and genetic diversity analysis of corn inbred population. A total of 108 dominant SCAR markers were designed out of which, 32 were successfully integrated in to the linkage map of maize RIL population and the remaining were added to a physical map for references to check the distribution throughout all chromosomes. Moreover, 76 polymorphic SCARs were used for diversity analysis of corn accessions being used in Korean corn breeding program. The overall average polymorphic information content (PIC) was 0.34, expected heterozygosity was 0.324 and Shannon's information index was 0.491 with a percentage of polymorphism of 98.67%. Further analysis by associating with desirable traits may also provide some accurate trait specific tagged SCAR markers. TE linked SCARs can provide an added level of polymorphism as well as improved discriminating ability and therefore can be useful in further breeding programs to develop high yielding germplasm.
Genetic analysis of arsenic accumulation in maize using QTL mapping
NASA Astrophysics Data System (ADS)
Fu, Zhongjun; Li, Weihua; Xing, Xiaolong; Xu, Mengmeng; Liu, Xiaoyang; Li, Haochuan; Xue, Yadong; Liu, Zonghua; Tang, Jihua
2016-02-01
Arsenic (As) is a toxic heavy metal that can accumulate in crops and poses a threat to human health. The genetic mechanism of As accumulation is unclear. Herein, we used quantitative trait locus (QTL) mapping to unravel the genetic basis of As accumulation in a maize recombinant inbred line population derived from the Chinese crossbred variety Yuyu22. The kernels had the lowest As content among the different maize tissues, followed by the axes, stems, bracts and leaves. Fourteen QTLs were identified at each location. Some of these QTLs were identified in different environments and were also detected by joint analysis. Compared with the B73 RefGen v2 reference genome, the distributions and effects of some QTLs were closely linked to those of QTLs detected in a previous study; the QTLs were likely in strong linkage disequilibrium. Our findings could be used to help maintain maize production to satisfy the demand for edible corn and to decrease the As content in As-contaminated soil through the selection and breeding of As pollution-safe cultivars.
Genetic analysis of arsenic accumulation in maize using QTL mapping.
Fu, Zhongjun; Li, Weihua; Xing, Xiaolong; Xu, Mengmeng; Liu, Xiaoyang; Li, Haochuan; Xue, Yadong; Liu, Zonghua; Tang, Jihua
2016-02-16
Arsenic (As) is a toxic heavy metal that can accumulate in crops and poses a threat to human health. The genetic mechanism of As accumulation is unclear. Herein, we used quantitative trait locus (QTL) mapping to unravel the genetic basis of As accumulation in a maize recombinant inbred line population derived from the Chinese crossbred variety Yuyu22. The kernels had the lowest As content among the different maize tissues, followed by the axes, stems, bracts and leaves. Fourteen QTLs were identified at each location. Some of these QTLs were identified in different environments and were also detected by joint analysis. Compared with the B73 RefGen v2 reference genome, the distributions and effects of some QTLs were closely linked to those of QTLs detected in a previous study; the QTLs were likely in strong linkage disequilibrium. Our findings could be used to help maintain maize production to satisfy the demand for edible corn and to decrease the As content in As-contaminated soil through the selection and breeding of As pollution-safe cultivars.
Recombination rate variation in mice from an isolated island.
Wang, Richard J; Gray, Melissa M; Parmenter, Michelle D; Broman, Karl W; Payseur, Bret A
2017-01-01
Recombination rate is a heritable trait that varies among individuals. Despite the major impact of recombination rate on patterns of genetic diversity and the efficacy of selection, natural variation in this phenotype remains poorly characterized. We present a comparison of genetic maps, sampling 1212 meioses, from a unique population of wild house mice (Mus musculus domesticus) that recently colonized remote Gough Island. Crosses to a mainland reference strain (WSB/EiJ) reveal pervasive variation in recombination rate among Gough Island mice, including subchromosomal intervals spanning up to 28% of the genome. In spite of this high level of polymorphism, the genomewide recombination rate does not significantly vary. In general, we find that recombination rate varies more when measured in smaller genomic intervals. Using the current standard genetic map of the laboratory mouse to polarize intervals with divergent recombination rates, we infer that the majority of evolutionary change occurred in one of the two tested lines of Gough Island mice. Our results confirm that natural populations harbour a high level of recombination rate polymorphism and highlight the disparities in recombination rate evolution across genomic scales. © 2016 John Wiley & Sons Ltd.
Recombination rate variation in mice from an isolated island
Wang, Richard J.; Gray, Melissa M.; Parmenter, Michelle D.; Broman, Karl W.; Payseur, Bret A.
2016-01-01
Recombination rate is a heritable trait that varies among individuals. Despite the major impact of recombination rate on patterns of genetic diversity and the efficacy of selection, natural variation in this phenotype remains poorly characterized. We present a comparison of genetic maps, sampling 1,212 meioses, from a unique population of wild house mice (Mus musculus domesticus) that recently colonized remote Gough Island. Crosses to a mainland reference strain (WSB/EiJ) reveal pervasive variation in recombination rate among Gough Island mice, including sub-chromosomal intervals spanning up to 28% of the genome. In spite of this high level of polymorphism, the genome-wide recombination rate does not significantly vary. In general, we find that recombination rate varies more when measured in smaller genomic intervals. Using the current standard genetic map of the laboratory mouse to polarize intervals with divergent recombination rates, we infer that the majority of evolutionary change occurred in one of the two tested lines of Gough Island mice. Our results confirm that natural populations harbor a high level of recombination rate polymorphism and highlight the disparities in recombination rate evolution across genomic scales. PMID:27864900
QTL mapping for fruit quality in Citrus using DArTseq markers.
Curtolo, Maiara; Cristofani-Yaly, Mariângela; Gazaffi, Rodrigo; Takita, Marco Aurélio; Figueira, Antonio; Machado, Marcos Antonio
2017-04-12
Citrus breeding programs have many limitations associated with the species biology and physiology, requiring the incorporation of new biotechnological tools to provide new breeding possibilities. Diversity Arrays Technology (DArT) markers, combined with next-generation sequencing, have wide applicability in the construction of high-resolution genetic maps and in quantitative trait locus (QTL) mapping. This study aimed to construct an integrated genetic map using full-sib progeny derived from Murcott tangor and Pera sweet orange and DArTseq™ molecular markers and to perform QTL mapping of twelve fruit quality traits. A controlled Murcott x Pera crossing was conducted at the Citrus Germplasm Repository at the Sylvio Moreira Citrus Centre of the Agronomic Institute (IAC) located in Cordeirópolis, SP, in 1997. In 2012, 278 F 1 individuals out of a family of 312 confirmed hybrid individuals were analyzed for fruit traits and genotyped using the DArTseq markers. Using OneMap software to obtain the integrated genetic map, we considered only the DArT loci that showed no segregation deviation. The likelihood ratio and the genomic information from the available Citrus sinensis L. Osbeck genome were used to determine the linkage groups (LGs). The resulting integrated map contained 661 markers in 13 LGs, with a genomic coverage of 2,774 cM and a mean density of 0.23 markers/cM. The groups were assigned to the nine Citrus haploid chromosomes; however, some of the chromosomes were represented by two LGs due the lack of information for a single integration, as in cases where markers segregated in a 3:1 fashion. A total of 19 QTLs were identified through composite interval mapping (CIM) of the 12 analyzed fruit characteristics: fruit diameter (cm), height (cm), height/diameter ratio, weight (g), rind thickness (cm), segments per fruit, total soluble solids (TSS, %), total titratable acidity (TTA, %), juice content (%), number of seeds, TSS/TTA ratio and number of fruits per box. The genomic sequence (pseudochromosomes) of C. sinensis was compared to the genetic map, and synteny was clearly identified. Further analysis of the map regions with the highest LOD scores enabled the identification of putative genes that could be associated with the fruit quality characteristics. An integrated linkage map of Murcott tangor and Pera sweet orange using DArTseq™ molecular markers was established and it was useful to perform QTL mapping of twelve fruit quality traits. The next generation sequences data allowed the comparison between the linkage map and the genomic sequence (pseudochromosomes) of C. sinensis and the identification of genes that may be responsible for phenotypic traits in Citrus. The obtained linkage map was used to assign sequences that had not been previously assigned to a position in the reference genome.
Abdelrahman, Hisham; ElHady, Mohamed; Alcivar-Warren, Acacia; Allen, Standish; Al-Tobasei, Rafet; Bao, Lisui; Beck, Ben; Blackburn, Harvey; Bosworth, Brian; Buchanan, John; Chappell, Jesse; Daniels, William; Dong, Sheng; Dunham, Rex; Durland, Evan; Elaswad, Ahmed; Gomez-Chiarri, Marta; Gosh, Kamal; Guo, Ximing; Hackett, Perry; Hanson, Terry; Hedgecock, Dennis; Howard, Tiffany; Holland, Leigh; Jackson, Molly; Jin, Yulin; Khalil, Karim; Kocher, Thomas; Leeds, Tim; Li, Ning; Lindsey, Lauren; Liu, Shikai; Liu, Zhanjiang; Martin, Kyle; Novriadi, Romi; Odin, Ramjie; Palti, Yniv; Peatman, Eric; Proestou, Dina; Qin, Guyu; Reading, Benjamin; Rexroad, Caird; Roberts, Steven; Salem, Mohamed; Severin, Andrew; Shi, Huitong; Shoemaker, Craig; Stiles, Sheila; Tan, Suxu; Tang, Kathy F J; Thongda, Wilawan; Tiersch, Terrence; Tomasso, Joseph; Prabowo, Wendy Tri; Vallejo, Roger; van der Steen, Hein; Vo, Khoi; Waldbieser, Geoff; Wang, Hanping; Wang, Xiaozhu; Xiang, Jianhai; Yang, Yujia; Yant, Roger; Yuan, Zihao; Zeng, Qifan; Zhou, Tao
2017-02-20
Advancing the production efficiency and profitability of aquaculture is dependent upon the ability to utilize a diverse array of genetic resources. The ultimate goals of aquaculture genomics, genetics and breeding research are to enhance aquaculture production efficiency, sustainability, product quality, and profitability in support of the commercial sector and for the benefit of consumers. In order to achieve these goals, it is important to understand the genomic structure and organization of aquaculture species, and their genomic and phenomic variations, as well as the genetic basis of traits and their interrelationships. In addition, it is also important to understand the mechanisms of regulation and evolutionary conservation at the levels of genome, transcriptome, proteome, epigenome, and systems biology. With genomic information and information between the genomes and phenomes, technologies for marker/causal mutation-assisted selection, genome selection, and genome editing can be developed for applications in aquaculture. A set of genomic tools and resources must be made available including reference genome sequences and their annotations (including coding and non-coding regulatory elements), genome-wide polymorphic markers, efficient genotyping platforms, high-density and high-resolution linkage maps, and transcriptome resources including non-coding transcripts. Genomic and genetic control of important performance and production traits, such as disease resistance, feed conversion efficiency, growth rate, processing yield, behaviour, reproductive characteristics, and tolerance to environmental stressors like low dissolved oxygen, high or low water temperature and salinity, must be understood. QTL need to be identified, validated across strains, lines and populations, and their mechanisms of control understood. Causal gene(s) need to be identified. Genetic and epigenetic regulation of important aquaculture traits need to be determined, and technologies for marker-assisted selection, causal gene/mutation-assisted selection, genome selection, and genome editing using CRISPR and other technologies must be developed, demonstrated with applicability, and application to aquaculture industries.Major progress has been made in aquaculture genomics for dozens of fish and shellfish species including the development of genetic linkage maps, physical maps, microarrays, single nucleotide polymorphism (SNP) arrays, transcriptome databases and various stages of genome reference sequences. This paper provides a general review of the current status, challenges and future research needs of aquaculture genomics, genetics, and breeding, with a focus on major aquaculture species in the United States: catfish, rainbow trout, Atlantic salmon, tilapia, striped bass, oysters, and shrimp. While the overall research priorities and the practical goals are similar across various aquaculture species, the current status in each species should dictate the next priority areas within the species. This paper is an output of the USDA Workshop for Aquaculture Genomics, Genetics, and Breeding held in late March 2016 in Auburn, Alabama, with participants from all parts of the United States.
Shirasawa, Kenta; Hand, Melanie L; Henderson, Steven T; Okada, Takashi; Johnson, Susan D; Taylor, Jennifer M; Spriggs, Andrew; Siddons, Hayley; Hirakawa, Hideki; Isobe, Sachiko; Tabata, Satoshi; Koltunow, Anna M G
2015-03-01
Apomixis in plants generates clonal progeny with a maternal genotype through asexual seed formation. Hieracium subgenus Pilosella (Asteraceae) contains polyploid, highly heterozygous apomictic and sexual species. Within apomictic Hieracium, dominant genetic loci independently regulate the qualitative developmental components of apomixis. In H. praealtum, LOSS OF APOMEIOSIS (LOA) enables formation of embryo sacs without meiosis and LOSS OF PARTHENOGENESIS (LOP) enables fertilization-independent seed formation. A locus required for fertilization-independent endosperm formation (AutE) has been identified in H. piloselloides. Additional quantitative loci appear to influence the penetrance of the qualitative loci, although the controlling genes remain unknown. This study aimed to develop the first genetic linkage maps for sexual and apomictic Hieracium species using simple sequence repeat (SSR) markers derived from expressed transcripts within the developing ovaries. RNA from microdissected Hieracium ovule cell types and ovaries was sequenced and SSRs were identified. Two different F1 mapping populations were created to overcome difficulties associated with genome complexity and asexual reproduction. SSR markers were analysed within each mapping population to generate draft linkage maps for apomictic and sexual Hieracium species. A collection of 14 684 Hieracium expressed SSR markers were developed and linkage maps were constructed for Hieracium species using a subset of the SSR markers. Both the LOA and LOP loci were successfully assigned to linkage groups; however, AutE could not be mapped using the current populations. Comparisons with lettuce (Lactuca sativa) revealed partial macrosynteny between the two Asteraceae species. A collection of SSR markers and draft linkage maps were developed for two apomictic and one sexual Hieracium species. These maps will support cloning of controlling genes at LOA and LOP loci in Hieracium and should also assist with identification of quantitative loci that affect the expressivity of apomixis. Future work will focus on mapping AutE using alternative populations. © The Author 2014. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms
Money, Daniel; Gardner, Kyle; Migicovsky, Zoë; Schwaninger, Heidi; Zhong, Gan-Yuan; Myles, Sean
2015-01-01
Obtaining genome-wide genotype data from a set of individuals is the first step in many genomic studies, including genome-wide association and genomic selection. All genotyping methods suffer from some level of missing data, and genotype imputation can be used to fill in the missing data and improve the power of downstream analyses. Model organisms like human and cattle benefit from high-quality reference genomes and panels of reference genotypes that aid in imputation accuracy. In nonmodel organisms, however, genetic and physical maps often are either of poor quality or are completely absent, and there are no panels of reference genotypes available. There is therefore a need for imputation methods designed specifically for nonmodel organisms in which genomic resources are poorly developed and marker order is unreliable or unknown. Here we introduce LinkImpute, a software package based on a k-nearest neighbor genotype imputation method, LD-kNNi, which is designed for unordered markers. No physical or genetic maps are required, and it is designed to work on unphased genotype data from heterozygous species. It exploits the fact that markers useful for imputation often are not physically close to the missing genotype but rather distributed throughout the genome. Using genotyping-by-sequencing data from diverse and heterozygous accessions of apples, grapes, and maize, we compare LD-kNNi with several genotype imputation methods and show that LD-kNNi is fast, comparable in accuracy to the best-existing methods, and exhibits the least bias in allele frequency estimates. PMID:26377960
Wheat Landrace Genome Diversity
Wingen, Luzie U.; West, Claire; Leverington-Waite, Michelle; Collier, Sarah; Orford, Simon; Goram, Richard; Yang, Cai-Yun; King, Julie; Allen, Alexandra M.; Burridge, Amanda; Edwards, Keith J.; Griffiths, Simon
2017-01-01
Understanding the genomic complexity of bread wheat (Triticum aestivum L.) is a cornerstone in the quest to unravel the processes of domestication and the following adaptation of domesticated wheat to a wide variety of environments across the globe. Additionally, it is of importance for future improvement of the crop, particularly in the light of climate change. Focusing on the adaptation after domestication, a nested association mapping (NAM) panel of 60 segregating biparental populations was developed, mainly involving landrace accessions from the core set of the Watkins hexaploid wheat collection optimized for genetic diversity. A modern spring elite variety, “Paragon,” was used as common reference parent. Genetic maps were constructed following identical rules to make them comparable. In total, 1611 linkage groups were identified, based on recombination from an estimated 126,300 crossover events over the whole NAM panel. A consensus map, named landrace consensus map (LRC), was constructed and contained 2498 genetic loci. These newly developed genetics tools were used to investigate the rules underlying genome fluidity or rigidity, e.g., by comparing marker distances and marker orders. In general, marker order was highly correlated, which provides support for strong synteny between bread wheat accessions. However, many exceptional cases of incongruent linkage groups and increased marker distances were also found. Segregation distortion was detected for many markers, sometimes as hot spots present in different populations. Furthermore, evidence for translocations in at least 36 of the maps was found. These translocations fell, in general, into many different translocation classes, but a few translocation classes were found in several accessions, the most frequent one being the well-known T5B:7B translocation. Loci involved in recombination rate, which is an interesting trait for plant breeding, were identified by QTL analyses using the crossover counts as a trait. In total, 114 significant QTL were detected, nearly half of them with increasing effect from the nonreference parents. PMID:28213475
Reference genome sequence of the model plant Setaria
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ~400-Mb assembly covers ~80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species thatmore » demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).« less
Reference genome sequence of the model plant Setaria.
Bennetzen, Jeffrey L; Schmutz, Jeremy; Wang, Hao; Percifield, Ryan; Hawkins, Jennifer; Pontaroli, Ana C; Estep, Matt; Feng, Liang; Vaughn, Justin N; Grimwood, Jane; Jenkins, Jerry; Barry, Kerrie; Lindquist, Erika; Hellsten, Uffe; Deshpande, Shweta; Wang, Xuewen; Wu, Xiaomei; Mitros, Therese; Triplett, Jimmy; Yang, Xiaohan; Ye, Chu-Yu; Mauro-Herrera, Margarita; Wang, Lin; Li, Pinghua; Sharma, Manoj; Sharma, Rita; Ronald, Pamela C; Panaud, Olivier; Kellogg, Elizabeth A; Brutnell, Thomas P; Doust, Andrew N; Tuskan, Gerald A; Rokhsar, Daniel; Devos, Katrien M
2012-05-13
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The ∼400-Mb assembly covers ∼80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species that demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).
Han, Yuepeng; Chagné, David; Gasic, Ksenija; Rikkerink, Erik H A; Beever, Jonathan E; Gardiner, Susan E; Korban, Schuyler S
2009-03-01
A genome-wide BAC physical map of the apple, Malus x domestica Borkh., has been recently developed. Here, we report on integrating the physical and genetic maps of the apple using a SNP-based approach in conjunction with bin mapping. Briefly, BAC clones located at ends of BAC contigs were selected, and sequenced at both ends. The BAC end sequences (BESs) were used to identify candidate SNPs. Subsequently, these candidate SNPs were genetically mapped using a bin mapping strategy for the purpose of mapping the physical onto the genetic map. Using this approach, 52 (23%) out of 228 BESs tested were successfully exploited to develop SNPs. These SNPs anchored 51 contigs, spanning approximately 37 Mb in cumulative physical length, onto 14 linkage groups. The reliability of the integration of the physical and genetic maps using this SNP-based strategy is described, and the results confirm the feasibility of this approach to construct an integrated physical and genetic maps for apple.
A genome-wide SNP scan accelerates trait-regulatory genomic loci identification in chickpea
Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We identified 44844 high-quality SNPs by sequencing 92 diverse chickpea accessions belonging to a seed and pod trait-specific association panel using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays. A GWAS (genome-wide association study) in an association panel of 211, including the 92 sequenced accessions, identified 22 major genomic loci showing significant association (explaining 23–47% phenotypic variation) with pod and seed number/plant and 100-seed weight. Eighteen trait-regulatory major genomic loci underlying 13 robust QTLs were validated and mapped on an intra-specific genetic linkage map by QTL mapping. A combinatorial approach of GWAS, QTL mapping and gene haplotype-specific LD mapping and transcript profiling uncovered one superior haplotype and favourable natural allelic variants in the upstream regulatory region of a CesA-type cellulose synthase (Ca_Kabuli_CesA3) gene regulating high pod and seed number/plant (explaining 47% phenotypic variation) in chickpea. The up-regulation of this superior gene haplotype correlated with increased transcript expression of Ca_Kabuli_CesA3 gene in the pollen and pod of high pod/seed number accession, resulting in higher cellulose accumulation for normal pollen and pollen tube growth. A rapid combinatorial genome-wide SNP genotyping-based approach has potential to dissect complex quantitative agronomic traits and delineate trait-regulatory genomic loci (candidate genes) for genetic enhancement in crop plants, including chickpea. PMID:26058368
Horikoshi, Momoko; Mӓgi, Reedik; van de Bunt, Martijn; Surakka, Ida; Sarin, Antti-Pekka; Mahajan, Anubha; Marullo, Letizia; Thorleifsson, Gudmar; Hӓgg, Sara; Hottenga, Jouke-Jan; Ladenvall, Claes; Ried, Janina S; Winkler, Thomas W; Willems, Sara M; Pervjakova, Natalia; Esko, Tõnu; Beekman, Marian; Nelson, Christopher P; Willenborg, Christina; Wiltshire, Steven; Ferreira, Teresa; Fernandez, Juan; Gaulton, Kyle J; Steinthorsdottir, Valgerdur; Hamsten, Anders; Magnusson, Patrik K E; Willemsen, Gonneke; Milaneschi, Yuri; Robertson, Neil R; Groves, Christopher J; Bennett, Amanda J; Lehtimӓki, Terho; Viikari, Jorma S; Rung, Johan; Lyssenko, Valeriya; Perola, Markus; Heid, Iris M; Herder, Christian; Grallert, Harald; Müller-Nurasyid, Martina; Roden, Michael; Hypponen, Elina; Isaacs, Aaron; van Leeuwen, Elisabeth M; Karssen, Lennart C; Mihailov, Evelin; Houwing-Duistermaat, Jeanine J; de Craen, Anton J M; Deelen, Joris; Havulinna, Aki S; Blades, Matthew; Hengstenberg, Christian; Erdmann, Jeanette; Schunkert, Heribert; Kaprio, Jaakko; Tobin, Martin D; Samani, Nilesh J; Lind, Lars; Salomaa, Veikko; Lindgren, Cecilia M; Slagboom, P Eline; Metspalu, Andres; van Duijn, Cornelia M; Eriksson, Johan G; Peters, Annette; Gieger, Christian; Jula, Antti; Groop, Leif; Raitakari, Olli T; Power, Chris; Penninx, Brenda W J H; de Geus, Eco; Smit, Johannes H; Boomsma, Dorret I; Pedersen, Nancy L; Ingelsson, Erik; Thorsteinsdottir, Unnur; Stefansson, Kari; Ripatti, Samuli; Prokopenko, Inga; McCarthy, Mark I; Morris, Andrew P
2015-07-01
Reference panels from the 1000 Genomes (1000G) Project Consortium provide near complete coverage of common and low-frequency genetic variation with minor allele frequency ≥0.5% across European ancestry populations. Within the European Network for Genetic and Genomic Epidemiology (ENGAGE) Consortium, we have undertaken the first large-scale meta-analysis of genome-wide association studies (GWAS), supplemented by 1000G imputation, for four quantitative glycaemic and obesity-related traits, in up to 87,048 individuals of European ancestry. We identified two loci for body mass index (BMI) at genome-wide significance, and two for fasting glucose (FG), none of which has been previously reported in larger meta-analysis efforts to combine GWAS of European ancestry. Through conditional analysis, we also detected multiple distinct signals of association mapping to established loci for waist-hip ratio adjusted for BMI (RSPO3) and FG (GCK and G6PC2). The index variant for one association signal at the G6PC2 locus is a low-frequency coding allele, H177Y, which has recently been demonstrated to have a functional role in glucose regulation. Fine-mapping analyses revealed that the non-coding variants most likely to drive association signals at established and novel loci were enriched for overlap with enhancer elements, which for FG mapped to promoter and transcription factor binding sites in pancreatic islets, in particular. Our study demonstrates that 1000G imputation and genetic fine-mapping of common and low-frequency variant association signals at GWAS loci, integrated with genomic annotation in relevant tissues, can provide insight into the functional and regulatory mechanisms through which their effects on glycaemic and obesity-related traits are mediated.
Discovery and Fine-Mapping of Glycaemic and Obesity-Related Trait Loci Using High-Density Imputation
van de Bunt, Martijn; Surakka, Ida; Sarin, Antti-Pekka; Mahajan, Anubha; Marullo, Letizia; Thorleifsson, Gudmar; Hӓgg, Sara; Hottenga, Jouke-Jan; Ladenvall, Claes; Ried, Janina S.; Winkler, Thomas W.; Willems, Sara M.; Pervjakova, Natalia; Esko, Tõnu; Beekman, Marian; Nelson, Christopher P.; Willenborg, Christina; Ferreira, Teresa; Fernandez, Juan; Gaulton, Kyle J.; Steinthorsdottir, Valgerdur; Hamsten, Anders; Magnusson, Patrik K. E.; Willemsen, Gonneke; Milaneschi, Yuri; Robertson, Neil R.; Groves, Christopher J.; Bennett, Amanda J.; Lehtimӓki, Terho; Viikari, Jorma S.; Rung, Johan; Lyssenko, Valeriya; Perola, Markus; Heid, Iris M.; Herder, Christian; Grallert, Harald; Müller-Nurasyid, Martina; Roden, Michael; Hypponen, Elina; Isaacs, Aaron; van Leeuwen, Elisabeth M.; Karssen, Lennart C.; Mihailov, Evelin; Houwing-Duistermaat, Jeanine J.; de Craen, Anton J. M.; Deelen, Joris; Havulinna, Aki S.; Blades, Matthew; Hengstenberg, Christian; Erdmann, Jeanette; Schunkert, Heribert; Kaprio, Jaakko; Tobin, Martin D.; Samani, Nilesh J.; Lind, Lars; Salomaa, Veikko; Lindgren, Cecilia M.; Slagboom, P. Eline; Metspalu, Andres; van Duijn, Cornelia M.; Eriksson, Johan G.; Peters, Annette; Gieger, Christian; Jula, Antti; Groop, Leif; Raitakari, Olli T.; Power, Chris; Penninx, Brenda W. J. H.; de Geus, Eco; Smit, Johannes H.; Boomsma, Dorret I.; Pedersen, Nancy L.; Ingelsson, Erik; Thorsteinsdottir, Unnur; Stefansson, Kari; Ripatti, Samuli; Prokopenko, Inga; McCarthy, Mark I.; Morris, Andrew P.
2015-01-01
Reference panels from the 1000 Genomes (1000G) Project Consortium provide near complete coverage of common and low-frequency genetic variation with minor allele frequency ≥0.5% across European ancestry populations. Within the European Network for Genetic and Genomic Epidemiology (ENGAGE) Consortium, we have undertaken the first large-scale meta-analysis of genome-wide association studies (GWAS), supplemented by 1000G imputation, for four quantitative glycaemic and obesity-related traits, in up to 87,048 individuals of European ancestry. We identified two loci for body mass index (BMI) at genome-wide significance, and two for fasting glucose (FG), none of which has been previously reported in larger meta-analysis efforts to combine GWAS of European ancestry. Through conditional analysis, we also detected multiple distinct signals of association mapping to established loci for waist-hip ratio adjusted for BMI (RSPO3) and FG (GCK and G6PC2). The index variant for one association signal at the G6PC2 locus is a low-frequency coding allele, H177Y, which has recently been demonstrated to have a functional role in glucose regulation. Fine-mapping analyses revealed that the non-coding variants most likely to drive association signals at established and novel loci were enriched for overlap with enhancer elements, which for FG mapped to promoter and transcription factor binding sites in pancreatic islets, in particular. Our study demonstrates that 1000G imputation and genetic fine-mapping of common and low-frequency variant association signals at GWAS loci, integrated with genomic annotation in relevant tissues, can provide insight into the functional and regulatory mechanisms through which their effects on glycaemic and obesity-related traits are mediated. PMID:26132169
Genotype harmonizer: automatic strand alignment and format conversion for genotype data integration.
Deelen, Patrick; Bonder, Marc Jan; van der Velde, K Joeri; Westra, Harm-Jan; Winder, Erwin; Hendriksen, Dennis; Franke, Lude; Swertz, Morris A
2014-12-11
To gain statistical power or to allow fine mapping, researchers typically want to pool data before meta-analyses or genotype imputation. However, the necessary harmonization of genetic datasets is currently error-prone because of many different file formats and lack of clarity about which genomic strand is used as reference. Genotype Harmonizer (GH) is a command-line tool to harmonize genetic datasets by automatically solving issues concerning genomic strand and file format. GH solves the unknown strand issue by aligning ambiguous A/T and G/C SNPs to a specified reference, using linkage disequilibrium patterns without prior knowledge of the used strands. GH supports many common GWAS/NGS genotype formats including PLINK, binary PLINK, VCF, SHAPEIT2 & Oxford GEN. GH is implemented in Java and a large part of the functionality can also be used as Java 'Genotype-IO' API. All software is open source under license LGPLv3 and available from http://www.molgenis.org/systemsgenetics. GH can be used to harmonize genetic datasets across different file formats and can be easily integrated as a step in routine meta-analysis and imputation pipelines.
Rousseau-Gueutin, Mathieu; Lerceteau-Köhler, Estelle; Barrot, Laure; Sargent, Daniel James; Monfort, Amparo; Simpson, David; Arús, Pere; Guérin, Guy; Denoyes-Rothan, Béatrice
2008-01-01
Macrosynteny and colinearity between Fragaria (strawberry) species showing extreme levels of ploidy have been studied through comparative genetic mapping between the octoploid cultivated strawberry (F. ×ananassa) and its diploid relatives. A comprehensive map of the octoploid strawberry, in which almost all linkage groups are ranged into the seven expected homoeologous groups was obtained, thus providing the first reference map for the octoploid Fragaria. High levels of conserved macrosynteny and colinearity were observed between homo(eo)logous linkage groups and between the octoploid homoeologous groups and their corresponding diploid linkage groups. These results reveal that the polyploidization events that took place along the evolution of the Fragaria genus and the more recent juxtaposition of two octoploid strawberry genomes in the cultivated strawberry did not trigger any major chromosomal rearrangements in genomes involved in F. ×ananassa. They further suggest the existence of a close relationship between the diploid Fragaria genomes. In addition, despite the possible existence of residual levels of polysomic segregation suggested by the observation of large linkage groups in coupling phase only, the prevalence of linkage groups in coupling/repulsion phase clearly demonstrates that the meiotic behavior is mainly disomic in the cultivated strawberry. PMID:18660542
USDA-ARS?s Scientific Manuscript database
Construction of genetic linkage map is essential for genetic and genomic studies. Recent advances in sequencing and genotyping technologies made it possible to generate high-density and high-resolution genetic linkage maps, especially for the organisms lacking extensive genomic resources. In the pre...
Toward a framework linkage map of the canine genome.
Langston, A A; Mellersh, C S; Wiegand, N A; Acland, G M; Ray, K; Aguirre, G D; Ostrander, E A
1999-01-01
Selective breeding to maintain specific physical and behavioral traits has made the modern dog one of the most physically diverse species on earth. One unfortunate consequence of the common breeding practices used to develop lines of dogs with the desired traits is amplification and propagation of genetic diseases within distinct breeds. To map disease loci we have constructed a first-generation framework map of the canine genome. We developed large numbers of highly polymorphic markers, constructed a panel of canine-rodent hybrid cell lines, and assigned those markers to chromosome groups using the hybrid cell lines. Finally, we determined the order and spacing of markers on individual canine chromosomes by linkage analysis using a reference panel of 17 outbred pedigrees. This article describes approaches and strategies to accomplish these goals.
The first genetic map of pigeon pea based on diversity arrays technology (DArT) markers.
Yang, Shi Ying; Saxena, Rachit K; Kulwal, Pawan L; Ash, Gavin J; Dubey, Anuja; Harper, John D I; Upadhyaya, Hari D; Gothalwal, Ragini; Kilian, Andrzej; Varshney, Rajeev K
2011-04-01
With an objective to develop a genetic map in pigeon pea (Cajanus spp.), a total of 554 diversity arrays technology (DArT) markers showed polymorphism in a pigeon pea F(2) mapping population of 72 progenies derived from an interspecific cross of ICP 28 (Cajanus cajan) and ICPW 94 (Cajanus scarabaeoides). Approximately 13% of markers did not conform to expected segregation ratio. The total number of DArT marker loci segregating in Mendelian manner was 405 with 73.1% (P > 0.001) of DArT markers having unique segregation patterns. Two groups of genetic maps were generated using DArT markers. While the maternal genetic linkage map had 122 unique DArT maternal marker loci, the paternal genetic linkage map has a total of 172 unique DArT paternal marker loci. The length of these two maps covered 270.0 cM and 451.6 cM, respectively. These are the first genetic linkage maps developed for pigeon pea, and this is the first report of genetic mapping in any grain legume using diversity arrays technology.
Genome-Wide Analysis Reveals Novel Regulators of Growth in Drosophila melanogaster
Vonesch, Sibylle Chantal; Lamparter, David; Mackay, Trudy F. C.; Bergmann, Sven; Hafen, Ernst
2016-01-01
Organismal size depends on the interplay between genetic and environmental factors. Genome-wide association (GWA) analyses in humans have implied many genes in the control of height but suffer from the inability to control the environment. Genetic analyses in Drosophila have identified conserved signaling pathways controlling size; however, how these pathways control phenotypic diversity is unclear. We performed GWA of size traits using the Drosophila Genetic Reference Panel of inbred, sequenced lines. We find that the top associated variants differ between traits and sexes; do not map to canonical growth pathway genes, but can be linked to these by epistasis analysis; and are enriched for genes and putative enhancers. Performing GWA on well-studied developmental traits under controlled conditions expands our understanding of developmental processes underlying phenotypic diversity. PMID:26751788
Serin, Elise A. R.; Snoek, L. B.; Nijveen, Harm; Willems, Leo A. J.; Jiménez-Gómez, Jose M.; Hilhorst, Henk W. M.; Ligterink, Wilco
2017-01-01
High-density genetic maps are essential for high resolution mapping of quantitative traits. Here, we present a new genetic map for an Arabidopsis Bayreuth × Shahdara recombinant inbred line (RIL) population, built on RNA-seq data. RNA-seq analysis on 160 RILs of this population identified 30,049 single-nucleotide polymorphisms (SNPs) covering the whole genome. Based on a 100-kbp window SNP binning method, 1059 bin-markers were identified, physically anchored on the genome. The total length of the RNA-seq genetic map spans 471.70 centimorgans (cM) with an average marker distance of 0.45 cM and a maximum marker distance of 4.81 cM. This high resolution genotyping revealed new recombination breakpoints in the population. To highlight the advantages of such high-density map, we compared it to two publicly available genetic maps for the same population, comprising 69 PCR-based markers and 497 gene expression markers derived from microarray data, respectively. In this study, we show that SNP markers can effectively be derived from RNA-seq data. The new RNA-seq map closes many existing gaps in marker coverage, saturating the previously available genetic maps. Quantitative trait locus (QTL) analysis for published phenotypes using the available genetic maps showed increased QTL mapping resolution and reduced QTL confidence interval using the RNA-seq map. The new high-density map is a valuable resource that facilitates the identification of candidate genes and map-based cloning approaches. PMID:29259624
2013-01-01
Background Rapid development of highly saturated genetic maps aids molecular breeding, which can accelerate gain per breeding cycle in woody perennial plants such as Rubus idaeus (red raspberry). Recently, robust genotyping methods based on high-throughput sequencing were developed, which provide high marker density, but result in some genotype errors and a large number of missing genotype values. Imputation can reduce the number of missing values and can correct genotyping errors, but current methods of imputation require a reference genome and thus are not an option for most species. Results Genotyping by Sequencing (GBS) was used to produce highly saturated maps for a R. idaeus pseudo-testcross progeny. While low coverage and high variance in sequencing resulted in a large number of missing values for some individuals, a novel method of imputation based on maximum likelihood marker ordering from initial marker segregation overcame the challenge of missing values, and made map construction computationally tractable. The two resulting parental maps contained 4521 and 2391 molecular markers spanning 462.7 and 376.6 cM respectively over seven linkage groups. Detection of precise genomic regions with segregation distortion was possible because of map saturation. Microsatellites (SSRs) linked these results to published maps for cross-validation and map comparison. Conclusions GBS together with genome-independent imputation provides a rapid method for genetic map construction in any pseudo-testcross progeny. Our method of imputation estimates the correct genotype call of missing values and corrects genotyping errors that lead to inflated map size and reduced precision in marker placement. Comparison of SSRs to published R. idaeus maps showed that the linkage maps constructed with GBS and our method of imputation were robust, and marker positioning reliable. The high marker density allowed identification of genomic regions with segregation distortion in R. idaeus, which may help to identify deleterious alleles that are the basis of inbreeding depression in the species. PMID:23324311
Xue, Angli; Wang, Hongcheng; Zhu, Jun
2017-09-28
Startle behavior is important for survival, and abnormal startle responses are related to several neurological diseases. Drosophila melanogaster provides a powerful system to investigate the genetic underpinnings of variation in startle behavior. Since mechanically induced, startle responses and environmental conditions can be readily quantified and precisely controlled. The 156 wild-derived fully sequenced lines of the Drosophila Genetic Reference Panel (DGRP) were used to identify SNPs and transcripts associated with variation in startle behavior. The results validated highly significant effects of 33 quantitative trait SNPs (QTSs) and 81 quantitative trait transcripts (QTTs) directly associated with phenotypic variation of startle response. We also detected QTT variation controlled by 20 QTSs (tQTSs) and 73 transcripts (tQTTs). Association mapping based on genomic and transcriptomic data enabled us to construct a complex genetic network that underlies variation in startle behavior. Based on principles of evolutionary conservation, human orthologous genes could be superimposed on this network. This study provided both genetic and biological insights into the variation of startle response behavior of Drosophila melanogaster, and highlighted the importance of genetic network to understand the genetic architecture of complex traits.
The Genetics of Chemoreception in the Labella and Tarsi of Aedes aegypti
2014-01-01
molecular pathway also involved in DEET perception in the dipteran relative Drosophila melanogaster (Ditzen et al., 2008; DeGennaro et al., 2013). Transgenic...Downloads/). Output Fastq Illumina files were map- ped to the reference genome with TopHat (Trapnell et al., 2009). The unambiguous sequence alignment...Q., Pikielny, C.W., 2002. Novel genes expressing in subsets of chemosensory sensilla on the front legs of male Drosophila melanogaster . Cell. Tissue
Liu, Ren-Hu; Meng, Jin-Ling
2003-05-01
MAPMAKER is one of the most widely used computer software package for constructing genetic linkage maps.However, the PC version, MAPMAKER 3.0 for PC, could not draw the genetic linkage maps that its Macintosh version, MAPMAKER 3.0 for Macintosh,was able to do. Especially in recent years, Macintosh computer is much less popular than PC. Most of the geneticists use PC to analyze their genetic linkage data. So a new computer software to draw the same genetic linkage maps on PC as the MAPMAKER for Macintosh to do on Macintosh has been crying for. Microsoft Excel,one component of Microsoft Office package, is one of the most popular software in laboratory data processing. Microsoft Visual Basic for Applications (VBA) is one of the most powerful functions of Microsoft Excel. Using this program language, we can take creative control of Excel, including genetic linkage map construction, automatic data processing and more. In this paper, a Microsoft Excel macro called MapDraw is constructed to draw genetic linkage maps on PC computer based on given genetic linkage data. Use this software,you can freely construct beautiful genetic linkage map in Excel and freely edit and copy it to Word or other application. This software is just an Excel format file. You can freely copy it from ftp://211.69.140.177 or ftp://brassica.hzau.edu.cn and the source code can be found in Excel's Visual Basic Editor.
Methods of analysis and resources available for genetic trait mapping.
Ott, J
1999-01-01
Methods of genetic linkage analysis are reviewed and put in context with other mapping techniques. Sources of information are outlined (books, web sites, computer programs). Special consideration is given to statistical problems in canine genetic mapping (heterozygosity, inbreeding, marker maps).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Greenspan, D.S.; Northrup, H.; Au, K.S.
1995-02-10
COL5A1, the gene for the {alpha}1 chain of type V collagen, has been considered a candidate gene for certain diseases based on chromosomal location and/or disease phenotype. We have employed 3{prime}-untranslated region RFLPs to exclude COL5A1 as a candidate gene in families with tuberous sclerosis 1, Ehlers-Danlos syndrome type H, and nail-patella syndrome. In addition, we describe a polymorphic simple sequence repeat (SSR) within a COL5A1 intron. This SSR is used to exclude COL5A1 as a candidate gene in hereditary hemorrhagic telangiectasia (Osler-Rendu-Weber disease) and to add COL5A1 to the existing map of {open_quotes}index{close_quotes} markers of chromosome 9 by evaluationmore » of the COL5A1 locus on the CEPH 40-family reference pedigree set. This genetic mapping places COL5A1 between markers D9S66 and D9S67. 14 refs., 1 fig., 2 tabs.« less
A hybrid BAC physical map of potato: a framework for sequencing a heterozygous genome
2011-01-01
Background Potato is the world's third most important food crop, yet cultivar improvement and genomic research in general remain difficult because of the heterozygous and tetraploid nature of its genome. The development of physical map resources that can facilitate genomic analyses in potato has so far been very limited. Here we present the methods of construction and the general statistics of the first two genome-wide BAC physical maps of potato, which were made from the heterozygous diploid clone RH89-039-16 (RH). Results First, a gel electrophoresis-based physical map was made by AFLP fingerprinting of 64478 BAC clones, which were aligned into 4150 contigs with an estimated total length of 1361 Mb. Screening of BAC pools, followed by the KeyMaps in silico anchoring procedure, identified 1725 AFLP markers in the physical map, and 1252 BAC contigs were anchored the ultradense potato genetic map. A second, sequence-tag-based physical map was constructed from 65919 whole genome profiling (WGP) BAC fingerprints and these were aligned into 3601 BAC contigs spanning 1396 Mb. The 39733 BAC clones that overlap between both physical maps provided anchors to 1127 contigs in the WGP physical map, and reduced the number of contigs to around 2800 in each map separately. Both physical maps were 1.64 times longer than the 850 Mb potato genome. Genome heterozygosity and incomplete merging of BAC contigs are two factors that can explain this map inflation. The contig information of both physical maps was united in a single table that describes hybrid potato physical map. Conclusions The AFLP physical map has already been used by the Potato Genome Sequencing Consortium for sequencing 10% of the heterozygous genome of clone RH on a BAC-by-BAC basis. By layering a new WGP physical map on top of the AFLP physical map, a genetically anchored genome-wide framework of 322434 sequence tags has been created. This reference framework can be used for anchoring and ordering of genomic sequences of clone RH (and other potato genotypes), and opens the possibility to finish sequencing of the RH genome in a more efficient way via high throughput next generation approaches. PMID:22142254
Gramene 2013: comparative plant genomics resources.
Monaco, Marcela K; Stein, Joshua; Naithani, Sushma; Wei, Sharon; Dharmawardhana, Palitha; Kumari, Sunita; Amarasinghe, Vindhya; Youens-Clark, Ken; Thomason, James; Preece, Justin; Pasternak, Shiran; Olson, Andrew; Jiao, Yinping; Lu, Zhenyuan; Bolser, Dan; Kerhornou, Arnaud; Staines, Dan; Walts, Brandon; Wu, Guanming; D'Eustachio, Peter; Haw, Robin; Croft, David; Kersey, Paul J; Stein, Lincoln; Jaiswal, Pankaj; Ware, Doreen
2014-01-01
Gramene (http://www.gramene.org) is a curated online resource for comparative functional genomics in crops and model plant species, currently hosting 27 fully and 10 partially sequenced reference genomes in its build number 38. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. Whole-genome alignments complemented by phylogenetic gene family trees help infer syntenic and orthologous relationships. Genetic variation data, sequences and genome mappings available for 10 species, including Arabidopsis, rice and maize, help infer putative variant effects on genes and transcripts. The pathways section also hosts 10 species-specific metabolic pathways databases developed in-house or by our collaborators using Pathway Tools software, which facilitates searches for pathway, reaction and metabolite annotations, and allows analyses of user-defined expression datasets. Recently, we released a Plant Reactome portal featuring 133 curated rice pathways. This portal will be expanded for Arabidopsis, maize and other plant species. We continue to provide genetic and QTL maps and marker datasets developed by crop researchers. The project provides a unique community platform to support scientific research in plant genomics including studies in evolution, genetics, plant breeding, molecular biology, biochemistry and systems biology.
Silady, Rebecca A; Effgen, Sigi; Koornneef, Maarten; Reymond, Matthieu
2011-01-01
A Quantitative Trait Locus (QTL) analysis was performed using two novel Recombinant Inbred Line (RIL) populations, derived from the progeny between two Arabidopsis thaliana genotypes collected at the same site in Kyoto (Japan) crossed with the reference laboratory strain Landsberg erecta (Ler). We used these two RIL populations to determine the genetic basis of seed dormancy and flowering time, which are assumed to be the main traits controlling life history variation in Arabidopsis. The analysis revealed quantitative variation for seed dormancy that is associated with allelic variation at the seed dormancy QTL DOG1 (for Delay Of Germination 1) in one population and at DOG6 in both. These DOG QTL have been previously identified using mapping populations derived from accessions collected at different sites around the world. Genetic variation within a population may enhance its ability to respond accurately to variation within and between seasons. In contrast, variation for flowering time, which also segregated within each mapping population, is mainly governed by the same QTL.
LinkImpute: Fast and Accurate Genotype Imputation for Nonmodel Organisms.
Money, Daniel; Gardner, Kyle; Migicovsky, Zoë; Schwaninger, Heidi; Zhong, Gan-Yuan; Myles, Sean
2015-09-15
Obtaining genome-wide genotype data from a set of individuals is the first step in many genomic studies, including genome-wide association and genomic selection. All genotyping methods suffer from some level of missing data, and genotype imputation can be used to fill in the missing data and improve the power of downstream analyses. Model organisms like human and cattle benefit from high-quality reference genomes and panels of reference genotypes that aid in imputation accuracy. In nonmodel organisms, however, genetic and physical maps often are either of poor quality or are completely absent, and there are no panels of reference genotypes available. There is therefore a need for imputation methods designed specifically for nonmodel organisms in which genomic resources are poorly developed and marker order is unreliable or unknown. Here we introduce LinkImpute, a software package based on a k-nearest neighbor genotype imputation method, LD-kNNi, which is designed for unordered markers. No physical or genetic maps are required, and it is designed to work on unphased genotype data from heterozygous species. It exploits the fact that markers useful for imputation often are not physically close to the missing genotype but rather distributed throughout the genome. Using genotyping-by-sequencing data from diverse and heterozygous accessions of apples, grapes, and maize, we compare LD-kNNi with several genotype imputation methods and show that LD-kNNi is fast, comparable in accuracy to the best-existing methods, and exhibits the least bias in allele frequency estimates. Copyright © 2015 Money et al.
Lessons from 25 years of genetic mapping in onion: where next?
USDA-ARS?s Scientific Manuscript database
Genetic maps are useful tools for both basic research and plant improvement. Close association of genetic markers with genes controlling economically important traits allows for indirect selection, avoiding often time-consuming and expensive phenotypic evaluations. As a result, detailed genetic maps...
Guo, Yinshan; Xing, Huiyang; Zhao, Yuhui; Liu, Zhendong; Li, Kun; Guo, Xiuwu
2017-01-01
Genetic maps are important tools in plant genomics and breeding. We report a large-scale discovery of single nucleotide polymorphisms (SNPs) using the specific length amplified fragment sequencing (SLAF-seq) technique for the construction of high-density genetic maps for two elite wine grape cultivars, ‘Chardonnay’ and ‘Beibinghong’, and their 130 F1 plants. A total of 372.53 M paired-end reads were obtained after preprocessing. The average sequencing depth was 33.81 for ‘Chardonnay’ (the female parent), 48.20 for ‘Beibinghong’ (the male parent), and 12.66 for the F1 offspring. We detected 202,349 high-quality SLAFs of which 144,972 were polymorphic; 10,042 SNPs were used to construct a genetic map that spanned 1,969.95 cM, with an average genetic distance of 0.23 cM between adjacent markers. This genetic map contains the largest molecular marker number of the grape maps so far reported. We thus demonstrate that SLAF-seq is a promising strategy for the construction of high-density genetic maps; the map that we report here is a good potential resource for QTL mapping of genes linked to major economic and agronomic traits, map-based cloning, and marker-assisted selection of grape. PMID:28746364
2014-01-01
Background Modern watermelon (Citrullus lanatus L.) cultivars share a narrow genetic base due to many years of selection for desirable horticultural qualities. Wild subspecies within C. lanatus are important potential sources of novel alleles for watermelon breeding, but successful trait introgression into elite cultivars has had limited success. The application of marker assisted selection (MAS) in watermelon is yet to be realized, mainly due to the past lack of high quality genetic maps. Recently, a number of useful maps have become available, however these maps have few common markers, and were constructed using different marker sets, thus, making integration and comparative analysis among maps difficult. The objective of this research was to use single-nucleotide polymorphism (SNP) anchor markers to construct an integrated genetic map for C. lanatus. Results Under the framework of the high density genetic map, an integrated genetic map was constructed by merging data from four independent mapping experiments using a genetically diverse array of parental lines, which included three subspecies of watermelon. The 698 simple sequence repeat (SSR), 219 insertion-deletion (InDel), 36 structure variation (SV) and 386 SNP markers from the four maps were used to construct an integrated map. This integrated map contained 1339 markers, spanning 798 cM with an average marker interval of 0.6 cM. Fifty-eight previously reported quantitative trait loci (QTL) for 12 traits in these populations were also integrated into the map. In addition, new QTL identified for brix, fructose, glucose and sucrose were added. Some QTL associated with economically important traits detected in different genetic backgrounds mapped to similar genomic regions of the integrated map, suggesting that such QTL are responsible for the phenotypic variability observed in a broad array of watermelon germplasm. Conclusions The integrated map described herein enhances the utility of genomic tools over previous watermelon genetic maps. A large proportion of the markers in the integrated map are SSRs, InDels and SNPs, which are easily transferable across laboratories. Moreover, the populations used to construct the integrated map include all three watermelon subspecies, making this integrated map useful for the selection of breeding traits, identification of QTL, MAS, analysis of germplasm and commercial hybrid seed detection. PMID:24443961
Li, Yun; Liu, Shikai; Qin, Zhenkui; Waldbieser, Geoff; Wang, Ruijia; Sun, Luyang; Bao, Lisui; Danzmann, Roy G.; Dunham, Rex; Liu, Zhanjiang
2015-01-01
Construction of genetic linkage map is essential for genetic and genomic studies. Recent advances in sequencing and genotyping technologies made it possible to generate high-density and high-resolution genetic linkage maps, especially for the organisms lacking extensive genomic resources. In the present work, we constructed a high-density and high-resolution genetic map for channel catfish with three large resource families genotyped using the catfish 250K single-nucleotide polymorphism (SNP) array. A total of 54,342 SNPs were placed on the linkage map, which to our knowledge had the highest marker density among aquaculture species. The estimated genetic size was 3,505.4 cM with a resolution of 0.22 cM for sex-averaged genetic map. The sex-specific linkage maps spanned a total of 4,495.1 cM in females and 2,593.7 cM in males, presenting a ratio of 1.7 : 1 between female and male in recombination fraction. After integration with the previously established physical map, over 87% of physical map contigs were anchored to the linkage groups that covered a physical length of 867 Mb, accounting for ∼90% of the catfish genome. The integrated map provides a valuable tool for validating and improving the catfish whole-genome assembly and facilitates fine-scale QTL mapping and positional cloning of genes responsible for economically important traits. PMID:25428894
P.H. Sisco; T.L. Kubisiak; M. Casasoli; T. Barreneche; A. Kremer; C. Clark; R.R. Sederoff; F.V. Hebard; F. Villani
2005-01-01
We have added 275 AFLP and 24 SSR markers and the 5SrDNA locus to a previously published genetic map based on a hybrid cross between Castanea mollissima and C. denata. The SSR markers, 5SrDNA locus, and one isozyme locus also permitted us to correlate the linkage groups in the published genetic map of C. sativa...
Spriggs, Andrew; Henderson, Steven T.; Hand, Melanie L.; Johnson, Susan D.; Taylor, Jennifer M.; Koltunow, Anna
2018-01-01
Cowpea ( Vigna unguiculata (L.) Walp) is an important legume crop for food security in areas of low-input and smallholder farming throughout Africa and Asia. Genetic improvements are required to increase yield and resilience to biotic and abiotic stress and to enhance cowpea crop performance. An integrated cowpea genomic and gene expression data resource has the potential to greatly accelerate breeding and the delivery of novel genetic traits for cowpea. Extensive genomic resources for cowpea have been absent from the public domain; however, a recent early release reference genome for IT97K-499-35 ( Vigna unguiculata v1.0, NSF, UCR, USAID, DOE-JGI, http://phytozome.jgi.doe.gov/) has now been established in a collaboration between the Joint Genome Institute (JGI) and University California (UC) Riverside. Here we release supporting genomic and transcriptomic data for IT97K-499-35 and a second transformable cowpea variety, IT86D-1010. The transcriptome resource includes six tissue-specific datasets for each variety, with particular emphasis on reproductive tissues that extend and support the V. unguiculata v1.0 reference. Annotations have been included in our resource to allow direct mapping to the v1.0 cowpea reference. Access to this resource provided here is supported by raw and assembled data downloads. PMID:29528046
Spriggs, Andrew; Henderson, Steven T; Hand, Melanie L; Johnson, Susan D; Taylor, Jennifer M; Koltunow, Anna
2018-02-09
Cowpea ( Vigna unguiculata (L.) Walp) is an important legume crop for food security in areas of low-input and smallholder farming throughout Africa and Asia. Genetic improvements are required to increase yield and resilience to biotic and abiotic stress and to enhance cowpea crop performance. An integrated cowpea genomic and gene expression data resource has the potential to greatly accelerate breeding and the delivery of novel genetic traits for cowpea. Extensive genomic resources for cowpea have been absent from the public domain; however, a recent early release reference genome for IT97K-499-35 ( Vigna unguiculata v1.0, NSF, UCR, USAID, DOE-JGI, http://phytozome.jgi.doe.gov/) has now been established in a collaboration between the Joint Genome Institute (JGI) and University California (UC) Riverside. Here we release supporting genomic and transcriptomic data for IT97K-499-35 and a second transformable cowpea variety, IT86D-1010. The transcriptome resource includes six tissue-specific datasets for each variety, with particular emphasis on reproductive tissues that extend and support the V. unguiculata v1.0 reference. Annotations have been included in our resource to allow direct mapping to the v1.0 cowpea reference. Access to this resource provided here is supported by raw and assembled data downloads.
He, Yanxia; Yuan, Wangjun; Dong, Meifang; Han, Yuanji; Shang, Fude
2017-01-01
Osmanthus fragrans is an ornamental plant of substantial commercial value, and no genetic linkage maps of this species have previously been reported. Specific-locus amplified fragment sequencing (SLAF-seq) is a recently developed technology that allows massive single nucleotide polymorphisms (SNPs) to be identified and high-resolution genotyping. In our current research, we generated the first genetic map of O. fragrans using SLAF-seq, which is composed with 206.92 M paired-end reads and 173,537 SLAF markers. Among total 90,715 polymorphic SLAF markers, 15,317 polymorphic SLAFs could be used for genetic map construction. The integrated map contained 14,189 high quality SLAFs that were grouped in 23 genetic linkage groups, with a total length of 2962.46 cM and an average distance of 0.21 cM between two adjacent markers. In addition, 23,664 SNPs were identified from the mapped markers. As far as we know, this is the first of the genetic map of O. fragrans. Our results are further demonstrate that SLAF-seq is a very effective method for developing markers and constructing high-density linkage maps. The SNP markers and the genetic map reported in this study should be valuable resource in future research. PMID:29018460
Genotype Imputation with Thousands of Genomes
Howie, Bryan; Marchini, Jonathan; Stephens, Matthew
2011-01-01
Genotype imputation is a statistical technique that is often used to increase the power and resolution of genetic association studies. Imputation methods work by using haplotype patterns in a reference panel to predict unobserved genotypes in a study dataset, and a number of approaches have been proposed for choosing subsets of reference haplotypes that will maximize accuracy in a given study population. These panel selection strategies become harder to apply and interpret as sequencing efforts like the 1000 Genomes Project produce larger and more diverse reference sets, which led us to develop an alternative framework. Our approach is built around a new approximation that uses local sequence similarity to choose a custom reference panel for each study haplotype in each region of the genome. This approximation makes it computationally efficient to use all available reference haplotypes, which allows us to bypass the panel selection step and to improve accuracy at low-frequency variants by capturing unexpected allele sharing among populations. Using data from HapMap 3, we show that our framework produces accurate results in a wide range of human populations. We also use data from the Malaria Genetic Epidemiology Network (MalariaGEN) to provide recommendations for imputation-based studies in Africa. We demonstrate that our approximation improves efficiency in large, sequence-based reference panels, and we discuss general computational strategies for modern reference datasets. Genome-wide association studies will soon be able to harness the power of thousands of reference genomes, and our work provides a practical way for investigators to use this rich information. New methodology from this study is implemented in the IMPUTE2 software package. PMID:22384356
Barba, Paola; Lillis, Jacquelyn; Luce, R Stephen; Travadon, Renaud; Osier, Michael; Baumgartner, Kendra; Wilcox, Wayne F; Reisch, Bruce I; Cadle-Davidson, Lance
2018-05-01
Rapid characterization of novel NB-LRR-associated resistance to Phomopsis cane spot on grapevine using high-throughput sampling and low-coverage sequencing for genotyping, locus mapping and transcriptome analysis provides insights into genetic resistance to a hemibiotrophic fungus. Phomopsis cane and leaf spot, caused by the hemibiotrophic fungus Diaporthe ampelina (syn = Phomopsis viticola), reduces the productivity in grapevines. Host resistance was studied on three F 1 families derived from crosses involving resistant genotypes 'Horizon', Illinois 547-1, Vitis cinerea B9 and V. vinifera 'Chardonnay'. All families had progeny with extremely susceptible phenotypes, developing lesions on both dormant canes and maturing fruit clusters. Segregation of symptoms was observed under natural levels of inoculum in the field, while phenotypes on green shoots were confirmed under controlled inoculations in greenhouse. High-density genetic maps were used to localize novel qualitative resistance loci named Rda1 and Rda2 from V. cinerea B9 and 'Horizon', respectively. Co-linearity between reference genetic and physical maps allowed localization of Rda2 locus between 1.5 and 2.4 Mbp on chromosome 7, and Rda1 locus between 19.3 and 19.6 Mbp of chromosome 15, which spans a cluster of five NB-LRR genes. Further dissection of this locus was obtained by QTL mapping of gene expression values 14 h after inoculation across a subset of the 'Chardonnay' × V. cinerea B9 progeny. This provided evidence for the association between transcript levels of two of these NB-LRR genes with Rda1, with increased NB-LRR expression among susceptible progeny. In resistant parent V. cinerea B9, inoculation with D. ampelina was characterized by up-regulation of SA-associated genes and down-regulation of ethylene pathways, suggesting an R-gene-mediated response. With dominant effects associated with disease-free berries and minimal symptoms on canes, Rda1 and Rda2 are promising loci for grapevine genetic improvement.
Rong, Junkang; Feltus, F. Alex; Waghmare, Vijay N.; Pierce, Gary J.; Chee, Peng W.; Draye, Xavier; Saranga, Yehoshua; Wright, Robert J.; Wilkins, Thea A.; May, O. Lloyd; Smith, C. Wayne; Gannaway, John R.; Wendel, Jonathan F.; Paterson, Andrew H.
2007-01-01
QTL mapping experiments yield heterogeneous results due to the use of different genotypes, environments, and sampling variation. Compilation of QTL mapping results yields a more complete picture of the genetic control of a trait and reveals patterns in organization of trait variation. A total of 432 QTL mapped in one diploid and 10 tetraploid interspecific cotton populations were aligned using a reference map and depicted in a CMap resource. Early demonstrations that genes from the non-fiber-producing diploid ancestor contribute to tetraploid lint fiber genetics gain further support from multiple populations and environments and advanced-generation studies detecting QTL of small phenotypic effect. Both tetraploid subgenomes contribute QTL at largely non-homeologous locations, suggesting divergent selection acting on many corresponding genes before and/or after polyploid formation. QTL correspondence across studies was only modest, suggesting that additional QTL for the target traits remain to be discovered. Crosses between closely-related genotypes differing by single-gene mutants yield profoundly different QTL landscapes, suggesting that fiber variation involves a complex network of interacting genes. Members of the lint fiber development network appear clustered, with cluster members showing heterogeneous phenotypic effects. Meta-analysis linked to synteny-based and expression-based information provides clues about specific genes and families involved in QTL networks. PMID:17565937
Rong, Junkang; Feltus, F Alex; Waghmare, Vijay N; Pierce, Gary J; Chee, Peng W; Draye, Xavier; Saranga, Yehoshua; Wright, Robert J; Wilkins, Thea A; May, O Lloyd; Smith, C Wayne; Gannaway, John R; Wendel, Jonathan F; Paterson, Andrew H
2007-08-01
QTL mapping experiments yield heterogeneous results due to the use of different genotypes, environments, and sampling variation. Compilation of QTL mapping results yields a more complete picture of the genetic control of a trait and reveals patterns in organization of trait variation. A total of 432 QTL mapped in one diploid and 10 tetraploid interspecific cotton populations were aligned using a reference map and depicted in a CMap resource. Early demonstrations that genes from the non-fiber-producing diploid ancestor contribute to tetraploid lint fiber genetics gain further support from multiple populations and environments and advanced-generation studies detecting QTL of small phenotypic effect. Both tetraploid subgenomes contribute QTL at largely non-homeologous locations, suggesting divergent selection acting on many corresponding genes before and/or after polyploid formation. QTL correspondence across studies was only modest, suggesting that additional QTL for the target traits remain to be discovered. Crosses between closely-related genotypes differing by single-gene mutants yield profoundly different QTL landscapes, suggesting that fiber variation involves a complex network of interacting genes. Members of the lint fiber development network appear clustered, with cluster members showing heterogeneous phenotypic effects. Meta-analysis linked to synteny-based and expression-based information provides clues about specific genes and families involved in QTL networks.
Mapping landscape friction to locate isolated tsetse populations that are candidates for elimination
Dicko, Ahmadou H.; Cecchi, Giuliano; Ravel, Sophie; Guerrini, Laure; Solano, Philippe; Vreysen, Marc J. B.; De Meeûs, Thierry; Lancelot, Renaud
2015-01-01
Tsetse flies are the cyclical vectors of deadly human and animal trypanosomes in sub-Saharan Africa. Tsetse control is a key component for the integrated management of both plagues, but local eradication successes have been limited to less than 2% of the infested area. This is attributed to either resurgence of residual populations that were omitted from the eradication campaign or reinvasion from neighboring infested areas. Here we focused on Glossina palpalis gambiensis, a riverine tsetse species representing the main vector of trypanosomoses in West Africa. We mapped landscape resistance to tsetse genetic flow, hereafter referred to as friction, to identify natural barriers that isolate tsetse populations. For this purpose, we fitted a statistical model of the genetic distance between 37 tsetse populations sampled in the region, using a set of remotely sensed environmental data as predictors. The least-cost path between these populations was then estimated using the predicted friction map. The method enabled us to avoid the subjectivity inherent in the expert-based weighting of environmental parameters. Finally, we identified potentially isolated clusters of G. p. gambiensis habitat based on a species distribution model and ranked them according to their predicted genetic distance to the main tsetse population. The methodology presented here will inform the choice on the most appropriate intervention strategies to be implemented against tsetse flies in different parts of Africa. It can also be used to control other pests and to support conservation of endangered species. PMID:26553973
Pottorff, Marti; Wanamaker, Steve; Ma, Yaqin Q; Ehlers, Jeffrey D; Roberts, Philip A; Close, Timothy J
2012-01-01
Fusarium oxysporum f.sp. tracheiphilum (Fot) is a soil-borne fungal pathogen that causes vascular wilt disease in cowpea. Fot race 3 is one of the major pathogens affecting cowpea production in California. Identification of Fot race 3 resistance determinants will expedite delivery of improved cultivars by replacing time-consuming phenotypic screening with selection based on perfect markers, thereby generating successful cultivars in a shorter time period. Resistance to Fot race 3 was studied in the RIL population California Blackeye 27 (resistant) x 24-125B-1 (susceptible). Biparental mapping identified a Fot race 3 resistance locus, Fot3-1, which spanned 3.56 cM on linkage group one of the CB27 x 24-125B-1 genetic map. A marker-trait association narrowed the resistance locus to a 1.2 cM region and identified SNP marker 1_1107 as co-segregating with Fot3-1 resistance. Macro and microsynteny was observed for the Fot3-1 locus region in Glycine max where six disease resistance genes were observed in the two syntenic regions of soybean chromosomes 9 and 15. Fot3-1 was identified on the cowpea physical map on BAC clone CH093L18, spanning approximately 208,868 bp on BAC contig250. The Fot3-1 locus was narrowed to 0.5 cM distance on the cowpea genetic map linkage group 6, flanked by SNP markers 1_0860 and 1_1107. BAC clone CH093L18 was sequenced and four cowpea sequences with similarity to leucine-rich repeat serine/threonine protein kinases were identified and are cowpea candidate genes for the Fot3-1 locus. This study has shown how readily candidate genes can be identified for simply inherited agronomic traits when appropriate genetic stocks and integrated genomic resources are available. High co-linearity between cowpea and soybean genomes illustrated that utilizing synteny can transfer knowledge from a reference legume to legumes with less complete genomic resources. Identification of Fot race 3 resistance genes will enable transfer into high yielding cowpea varieties using marker-assisted selection (MAS).
Pottorff, Marti; Wanamaker, Steve; Ma, Yaqin Q.; Ehlers, Jeffrey D.; Roberts, Philip A.; Close, Timothy J.
2012-01-01
Fusarium oxysporum f.sp. tracheiphilum (Fot) is a soil-borne fungal pathogen that causes vascular wilt disease in cowpea. Fot race 3 is one of the major pathogens affecting cowpea production in California. Identification of Fot race 3 resistance determinants will expedite delivery of improved cultivars by replacing time-consuming phenotypic screening with selection based on perfect markers, thereby generating successful cultivars in a shorter time period. Resistance to Fot race 3 was studied in the RIL population California Blackeye 27 (resistant) x 24-125B-1 (susceptible). Biparental mapping identified a Fot race 3 resistance locus, Fot3-1, which spanned 3.56 cM on linkage group one of the CB27 x 24-125B-1 genetic map. A marker-trait association narrowed the resistance locus to a 1.2 cM region and identified SNP marker 1_1107 as co-segregating with Fot3-1 resistance. Macro and microsynteny was observed for the Fot3-1 locus region in Glycine max where six disease resistance genes were observed in the two syntenic regions of soybean chromosomes 9 and 15. Fot3-1 was identified on the cowpea physical map on BAC clone CH093L18, spanning approximately 208,868 bp on BAC contig250. The Fot3-1 locus was narrowed to 0.5 cM distance on the cowpea genetic map linkage group 6, flanked by SNP markers 1_0860 and 1_1107. BAC clone CH093L18 was sequenced and four cowpea sequences with similarity to leucine-rich repeat serine/threonine protein kinases were identified and are cowpea candidate genes for the Fot3-1 locus. This study has shown how readily candidate genes can be identified for simply inherited agronomic traits when appropriate genetic stocks and integrated genomic resources are available. High co-linearity between cowpea and soybean genomes illustrated that utilizing synteny can transfer knowledge from a reference legume to legumes with less complete genomic resources. Identification of Fot race 3 resistance genes will enable transfer into high yielding cowpea varieties using marker-assisted selection (MAS). PMID:22860000
Conservatism and novelty in the genetic architecture of adaptation in Heliconius butterflies.
Huber, B; Whibley, A; Poul, Y L; Navarro, N; Martin, A; Baxter, S; Shah, A; Gilles, B; Wirth, T; McMillan, W O; Joron, M
2015-05-01
Understanding the genetic architecture of adaptive traits has been at the centre of modern evolutionary biology since Fisher; however, evaluating how the genetic architecture of ecologically important traits influences their diversification has been hampered by the scarcity of empirical data. Now, high-throughput genomics facilitates the detailed exploration of variation in the genome-to-phenotype map among closely related taxa. Here, we investigate the evolution of wing pattern diversity in Heliconius, a clade of neotropical butterflies that have undergone an adaptive radiation for wing-pattern mimicry and are influenced by distinct selection regimes. Using crosses between natural wing-pattern variants, we used genome-wide restriction site-associated DNA (RAD) genotyping, traditional linkage mapping and multivariate image analysis to study the evolution of the architecture of adaptive variation in two closely related species: Heliconius hecale and H. ismenius. We implemented a new morphometric procedure for the analysis of whole-wing pattern variation, which allows visualising spatial heatmaps of genotype-to-phenotype association for each quantitative trait locus separately. We used the H. melpomene reference genome to fine-map variation for each major wing-patterning region uncovered, evaluated the role of candidate genes and compared genetic architectures across the genus. Our results show that, although the loci responding to mimicry selection are highly conserved between species, their effect size and phenotypic action vary throughout the clade. Multilocus architecture is ancestral and maintained across species under directional selection, whereas the single-locus (supergene) inheritance controlling polymorphism in H. numata appears to have evolved only once. Nevertheless, the conservatism in the wing-patterning toolkit found throughout the genus does not appear to constrain phenotypic evolution towards local adaptive optima.
Scaglione, Davide; Acquadro, Alberto; Portis, Ezio; Taylor, Christopher A; Lanteri, Sergio; Knapp, Steven J
2009-01-01
Background The globe artichoke (Cynara cardunculus var. scolymus L.) is a significant crop in the Mediterranean basin. Despite its commercial importance and its both dietary and pharmaceutical value, knowledge of its genetics and genomics remains scant. Microsatellite markers have become a key tool in genetic and genomic analysis, and we have exploited recently acquired EST (expressed sequence tag) sequence data (Composite Genome Project - CGP) to develop an extensive set of microsatellite markers. Results A unigene assembly was created from over 36,000 globe artichoke EST sequences, containing 6,621 contigs and 12,434 singletons. Over 12,000 of these unigenes were functionally assigned on the basis of homology with Arabidopsis thaliana reference proteins. A total of 4,219 perfect repeats, located within 3,308 unigenes was identified and the gene ontology (GO) analysis highlighted some GO term's enrichments among different classes of microsatellites with respect to their position. Sufficient flanking sequence was available to enable the design of primers to amplify 2,311 of these microsatellites, and a set of 300 was tested against a DNA panel derived from 28 C. cardunculus genotypes. Consistent amplification and polymorphism was obtained from 236 of these assays. Their polymorphic information content (PIC) ranged from 0.04 to 0.90 (mean 0.66). Between 176 and 198 of the assays were informative in at least one of the three available mapping populations. Conclusion EST-based microsatellites have provided a large set of de novo genetic markers, which show significant amounts of polymorphism both between and within the three taxa of C. cardunculus. They are thus well suited as assays for phylogenetic analysis, the construction of genetic maps, marker-assisted breeding, transcript mapping and other genomic applications in the species. PMID:19785740
Kumar, Ajay; Seetan, Raed; Mergoum, Mohamed; Tiwari, Vijay K; Iqbal, Muhammad J; Wang, Yi; Al-Azzam, Omar; Šimková, Hana; Luo, Ming-Cheng; Dvorak, Jan; Gu, Yong Q; Denton, Anne; Kilian, Andrzej; Lazo, Gerard R; Kianian, Shahryar F
2015-10-16
The large and complex genome of bread wheat (Triticum aestivum L., ~17 Gb) requires high resolution genome maps with saturated marker scaffolds to anchor and orient BAC contigs/ sequence scaffolds for whole genome assembly. Radiation hybrid (RH) mapping has proven to be an excellent tool for the development of such maps for it offers much higher and more uniform marker resolution across the length of the chromosome compared to genetic mapping and does not require marker polymorphism per se, as it is based on presence (retention) vs. absence (deletion) marker assay. In this study, a 178 line RH panel was genotyped with SSRs and DArT markers to develop the first high resolution RH maps of the entire D-genome of Ae. tauschii accession AL8/78. To confirm map order accuracy, the AL8/78-RH maps were compared with:1) a DArT consensus genetic map constructed using more than 100 bi-parental populations, 2) a RH map of the D-genome of reference hexaploid wheat 'Chinese Spring', and 3) two SNP-based genetic maps, one with anchored D-genome BAC contigs and another with anchored D-genome sequence scaffolds. Using marker sequences, the RH maps were also anchored with a BAC contig based physical map and draft sequence of the D-genome of Ae. tauschii. A total of 609 markers were mapped to 503 unique positions on the seven D-genome chromosomes, with a total map length of 14,706.7 cR. The average distance between any two marker loci was 29.2 cR which corresponds to 2.1 cM or 9.8 Mb. The average mapping resolution across the D-genome was estimated to be 0.34 Mb (Mb/cR) or 0.07 cM (cM/cR). The RH maps showed almost perfect agreement with several published maps with regard to chromosome assignments of markers. The mean rank correlations between the position of markers on AL8/78 maps and the four published maps, ranged from 0.75 to 0.92, suggesting a good agreement in marker order. With 609 mapped markers, a total of 2481 deletions for the whole D-genome were detected with an average deletion size of 42.0 Mb. A total of 520 markers were anchored to 216 Ae. tauschii sequence scaffolds, 116 of which were not anchored earlier to the D-genome. This study reports the development of first high resolution RH maps for the D-genome of Ae. tauschii accession AL8/78, which were then used for the anchoring of unassigned sequence scaffolds. This study demonstrates how RH mapping, which offered high and uniform resolution across the length of the chromosome, can facilitate the complete sequence assembly of the large and complex plant genomes.
RefEx, a reference gene expression dataset as a web tool for the functional analysis of genes.
Ono, Hiromasa; Ogasawara, Osamu; Okubo, Kosaku; Bono, Hidemasa
2017-08-29
Gene expression data are exponentially accumulating; thus, the functional annotation of such sequence data from metadata is urgently required. However, life scientists have difficulty utilizing the available data due to its sheer magnitude and complicated access. We have developed a web tool for browsing reference gene expression pattern of mammalian tissues and cell lines measured using different methods, which should facilitate the reuse of the precious data archived in several public databases. The web tool is called Reference Expression dataset (RefEx), and RefEx allows users to search by the gene name, various types of IDs, chromosomal regions in genetic maps, gene family based on InterPro, gene expression patterns, or biological categories based on Gene Ontology. RefEx also provides information about genes with tissue-specific expression, and the relative gene expression values are shown as choropleth maps on 3D human body images from BodyParts3D. Combined with the newly incorporated Functional Annotation of Mammals (FANTOM) dataset, RefEx provides insight regarding the functional interpretation of unfamiliar genes. RefEx is publicly available at http://refex.dbcls.jp/.
RefEx, a reference gene expression dataset as a web tool for the functional analysis of genes
Ono, Hiromasa; Ogasawara, Osamu; Okubo, Kosaku; Bono, Hidemasa
2017-01-01
Gene expression data are exponentially accumulating; thus, the functional annotation of such sequence data from metadata is urgently required. However, life scientists have difficulty utilizing the available data due to its sheer magnitude and complicated access. We have developed a web tool for browsing reference gene expression pattern of mammalian tissues and cell lines measured using different methods, which should facilitate the reuse of the precious data archived in several public databases. The web tool is called Reference Expression dataset (RefEx), and RefEx allows users to search by the gene name, various types of IDs, chromosomal regions in genetic maps, gene family based on InterPro, gene expression patterns, or biological categories based on Gene Ontology. RefEx also provides information about genes with tissue-specific expression, and the relative gene expression values are shown as choropleth maps on 3D human body images from BodyParts3D. Combined with the newly incorporated Functional Annotation of Mammals (FANTOM) dataset, RefEx provides insight regarding the functional interpretation of unfamiliar genes. RefEx is publicly available at http://refex.dbcls.jp/. PMID:28850115
Han, Koeun; Jeong, Hee-Jin; Yang, Hee-Bum; Kang, Sung-Min; Kwon, Jin-Kyung; Kim, Seungill; Choi, Doil; Kang, Byoung-Cheorl
2016-04-01
Most agricultural traits are controlled by quantitative trait loci (QTLs); however, there are few studies on QTL mapping of horticultural traits in pepper (Capsicum spp.) due to the lack of high-density molecular maps and the sequence information. In this study, an ultra-high-density map and 120 recombinant inbred lines (RILs) derived from a cross between C. annuum'Perennial' and C. annuum'Dempsey' were used for QTL mapping of horticultural traits. Parental lines and RILs were resequenced at 18× and 1× coverage, respectively. Using a sliding window approach, an ultra-high-density bin map containing 2,578 bins was constructed. The total map length of the map was 1,372 cM, and the average interval between bins was 0.53 cM. A total of 86 significant QTLs controlling 17 horticultural traits were detected. Among these, 32 QTLs controlling 13 traits were major QTLs. Our research shows that the construction of bin maps using low-coverage sequence is a powerful method for QTL mapping, and that the short intervals between bins are helpful for fine-mapping of QTLs. Furthermore, bin maps can be used to improve the quality of reference genomes by elucidating the genetic order of unordered regions and anchoring unassigned scaffolds to linkage groups. © The Author 2016. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Genepleio software for effective estimation of gene pleiotropy from protein sequences.
Chen, Wenhai; Chen, Dandan; Zhao, Ming; Zou, Yangyun; Zeng, Yanwu; Gu, Xun
2015-01-01
Though pleiotropy, which refers to the phenomenon of a gene affecting multiple traits, has long played a central role in genetics, development, and evolution, estimation of the number of pleiotropy components remains a hard mission to accomplish. In this paper, we report a newly developed software package, Genepleio, to estimate the effective gene pleiotropy from phylogenetic analysis of protein sequences. Since this estimate can be interpreted as the minimum pleiotropy of a gene, it is used to play a role of reference for many empirical pleiotropy measures. This work would facilitate our understanding of how gene pleiotropy affects the pattern of genotype-phenotype map and the consequence of organismal evolution.
Lim, Haw Chuan; Braun, Michael J
2016-09-01
Sample availability limits population genetics research on many species, especially taxa from regions with high diversity. However, many such species are well represented in museum collections assembled before the molecular era. Development of techniques to recover genetic data from these invaluable specimens will benefit biodiversity science. Using a mixture of freshly preserved and historical tissue samples, and a sequence capture probe set targeting >5000 loci, we produced high-confidence genotype calls on thousands of single nucleotide polymorphisms (SNPs) in each of five South-East Asian bird species and their close relatives (N = 27-43). On average, 66.2% of the reads mapped to the pseudo-reference genome of each species. Of these mapped reads, an average of 52.7% was identified as PCR or optical duplicates. We achieved deeper effective sequencing for historical samples (122.7×) compared to modern samples (23.5×). The number of nucleotide sites with at least 8× sequencing depth was high, with averages ranging from 0.89 × 10(6) bp (Arachnothera, modern samples) to 1.98 × 10(6) bp (Stachyris, modern samples). Linear regression revealed that the amount of sequence data obtained from each historical sample (represented by per cent of the pseudo-reference genome recovered with ≥8× sequencing depth) was positively and significantly (P ≤ 0.013) related to how recently the sample was collected. We observed characteristic post-mortem damage in the DNA of historical samples. However, we were able to reduce the error rate significantly by truncating ends of reads during read mapping (local alignment) and conducting stringent SNP and genotype filtering. © 2016 John Wiley & Sons Ltd.
Yang, S; Chen, S; Geng, X X; Yan, G; Li, Z Y; Meng, J L; Cowling, W A; Zhou, W J
2016-04-01
We present the first genetic map of an allohexaploid Brassica species, based on segregating microsatellite markers in a doubled haploid mapping population generated from a hybrid between two hexaploid parents. This study reports the first genetic map of trigenomic Brassica. A doubled haploid mapping population consisting of 189 lines was obtained via microspore culture from a hybrid H16-1 derived from a cross between two allohexaploid Brassica lines (7H170-1 and Y54-2). Simple sequence repeat primer pairs specific to the A genome (107), B genome (44) and C genome (109) were used to construct a genetic linkage map of the population. Twenty-seven linkage groups were resolved from 274 polymorphic loci on the A genome (109), B genome (49) and C genome (116) covering a total genetic distance of 3178.8 cM with an average distance between markers of 11.60 cM. This is the first genetic framework map for the artificially synthesized Brassica allohexaploids. The linkage groups represent the expected complement of chromosomes in the A, B and C genomes from the original diploid and tetraploid parents. This framework linkage map will be valuable for QTL analysis and future genetic improvement of a new allohexaploid Brassica species, and in improving our understanding of the genetic control of meiosis in new polyploids.
Niederstätter, Harald; Rampl, Gerhard; Erhart, Daniel; Pitterl, Florian; Oberacher, Herbert; Neuhuber, Franz; Hausner, Isolde; Gassner, Christoph; Schennach, Harald; Berger, Burkhard; Parson, Walther
2012-01-01
The small alpine district of East Tyrol (Austria) has an exceptional demographic history. It was contemporaneously inhabited by members of the Romance, the Slavic and the Germanic language groups for centuries. Since the Late Middle Ages, however, the population of the principally agrarian-oriented area is solely Germanic speaking. Historic facts about East Tyrol's colonization are rare, but spatial density-distribution analysis based on the etymology of place-names has facilitated accurate spatial mapping of the various language groups' former settlement regions. To test for present-day Y chromosome population substructure, molecular genetic data were compared to the information attained by the linguistic analysis of pasture names. The linguistic data were used for subdividing East Tyrol into two regions of former Romance (A) and Slavic (B) settlement. Samples from 270 East Tyrolean men were genotyped for 17 Y-chromosomal microsatellites (Y-STRs) and 27 single nucleotide polymorphisms (Y-SNPs). Analysis of the probands' surnames revealed no evidence for spatial genetic structuring. Also, spatial autocorrelation analysis did not indicate significant correlation between genetic (Y-STR haplotypes) and geographic distance. Haplogroup R-M17 chromosomes, however, were absent in region A, but constituted one of the most frequent haplogroups in region B. The R-M343 (R1b) clade showed a marked and complementary frequency distribution pattern in these two regions. To further test East Tyrol's modern Y-chromosomal landscape for geographic patterning attributable to the early history of settlement in this alpine area, principal coordinates analysis was performed. The Y-STR haplotypes from region A clearly clustered with those of Romance reference populations and the samples from region B matched best with Germanic speaking reference populations. The combined use of onomastic and molecular genetic data revealed and mapped the marked structuring of the distribution of Y chromosomes in an alpine region that has been culturally homogeneous for centuries.
Niederstätter, Harald; Rampl, Gerhard; Erhart, Daniel; Pitterl, Florian; Oberacher, Herbert; Neuhuber, Franz; Hausner, Isolde; Gassner, Christoph; Schennach, Harald; Berger, Burkhard; Parson, Walther
2012-01-01
The small alpine district of East Tyrol (Austria) has an exceptional demographic history. It was contemporaneously inhabited by members of the Romance, the Slavic and the Germanic language groups for centuries. Since the Late Middle Ages, however, the population of the principally agrarian-oriented area is solely Germanic speaking. Historic facts about East Tyrol's colonization are rare, but spatial density-distribution analysis based on the etymology of place-names has facilitated accurate spatial mapping of the various language groups' former settlement regions. To test for present-day Y chromosome population substructure, molecular genetic data were compared to the information attained by the linguistic analysis of pasture names. The linguistic data were used for subdividing East Tyrol into two regions of former Romance (A) and Slavic (B) settlement. Samples from 270 East Tyrolean men were genotyped for 17 Y-chromosomal microsatellites (Y-STRs) and 27 single nucleotide polymorphisms (Y-SNPs). Analysis of the probands' surnames revealed no evidence for spatial genetic structuring. Also, spatial autocorrelation analysis did not indicate significant correlation between genetic (Y-STR haplotypes) and geographic distance. Haplogroup R-M17 chromosomes, however, were absent in region A, but constituted one of the most frequent haplogroups in region B. The R-M343 (R1b) clade showed a marked and complementary frequency distribution pattern in these two regions. To further test East Tyrol's modern Y-chromosomal landscape for geographic patterning attributable to the early history of settlement in this alpine area, principal coordinates analysis was performed. The Y-STR haplotypes from region A clearly clustered with those of Romance reference populations and the samples from region B matched best with Germanic speaking reference populations. The combined use of onomastic and molecular genetic data revealed and mapped the marked structuring of the distribution of Y chromosomes in an alpine region that has been culturally homogeneous for centuries. PMID:22848647
A high-density genetic map of Arachis duranensis, a diploid ancestor of cultivated peanut
2012-01-01
Background Cultivated peanut (Arachis hypogaea) is an allotetraploid species whose ancestral genomes are most likely derived from the A-genome species, A. duranensis, and the B-genome species, A. ipaensis. The very recent (several millennia) evolutionary origin of A. hypogaea has imposed a bottleneck for allelic and phenotypic diversity within the cultigen. However, wild diploid relatives are a rich source of alleles that could be used for crop improvement and their simpler genomes can be more easily analyzed while providing insight into the structure of the allotetraploid peanut genome. The objective of this research was to establish a high-density genetic map of the diploid species A. duranensis based on de novo generated EST databases. Arachis duranensis was chosen for mapping because it is the A-genome progenitor of cultivated peanut and also in order to circumvent the confounding effects of gene duplication associated with allopolyploidy in A. hypogaea. Results More than one million expressed sequence tag (EST) sequences generated from normalized cDNA libraries of A. duranensis were assembled into 81,116 unique transcripts. Mining this dataset, 1236 EST-SNP markers were developed between two A. duranensis accessions, PI 475887 and Grif 15036. An additional 300 SNP markers also were developed from genomic sequences representing conserved legume orthologs. Of the 1536 SNP markers, 1054 were placed on a genetic map. In addition, 598 EST-SSR markers identified in A. hypogaea assemblies were included in the map along with 37 disease resistance gene candidate (RGC) and 35 other previously published markers. In total, 1724 markers spanning 1081.3 cM over 10 linkage groups were mapped. Gene sequences that provided mapped markers were annotated using similarity searches in three different databases, and gene ontology descriptions were determined using the Medicago Gene Atlas and TAIR databases. Synteny analysis between A. duranensis, Medicago and Glycine revealed significant stretches of conserved gene clusters spread across the peanut genome. A higher level of colinearity was detected between A. duranensis and Glycine than with Medicago. Conclusions The first high-density, gene-based linkage map for A. duranensis was generated that can serve as a reference map for both wild and cultivated Arachis species. The markers developed here are valuable resources for the peanut, and more broadly, to the legume research community. The A-genome map will have utility for fine mapping in other peanut species and has already had application for mapping a nematode resistance gene that was introgressed into A. hypogaea from A. cardenasii. PMID:22967170
Wen, Weie; He, Zhonghu; Gao, Fengmei; Liu, Jindong; Jin, Hui; Zhai, Shengnan; Qu, Yanying; Xia, Xianchun
2017-01-01
A high-density consensus map is a powerful tool for gene mapping, cloning and molecular marker-assisted selection in wheat breeding. The objective of this study was to construct a high-density, single nucleotide polymorphism (SNP)-based consensus map of common wheat (Triticum aestivum L.) by integrating genetic maps from four recombinant inbred line populations. The populations were each genotyped using the wheat 90K Infinium iSelect SNP assay. A total of 29,692 SNP markers were mapped on 21 linkage groups corresponding to 21 hexaploid wheat chromosomes, covering 2,906.86 cM, with an overall marker density of 10.21 markers/cM. Compared with the previous maps based on the wheat 90K SNP chip detected 22,736 (76.6%) of the SNPs with consistent chromosomal locations, whereas 1,974 (6.7%) showed different chromosomal locations, and 4,982 (16.8%) were newly mapped. Alignment of the present consensus map and the wheat expressed sequence tags (ESTs) Chromosome Bin Map enabled assignment of 1,221 SNP markers to specific chromosome bins and 819 ESTs were integrated into the consensus map. The marker orders of the consensus map were validated based on physical positions on the wheat genome with Spearman rank correlation coefficients ranging from 0.69 (4D) to 0.97 (1A, 4B, 5B, and 6A), and were also confirmed by comparison with genetic position on the previously 40K SNP consensus map with Spearman rank correlation coefficients ranging from 0.84 (6D) to 0.99 (6A). Chromosomal rearrangements reported previously were confirmed in the present consensus map and new putative rearrangements were identified. In addition, an integrated consensus map was developed through the combination of five published maps with ours, containing 52,607 molecular markers. The consensus map described here provided a high-density SNP marker map and a reliable order of SNPs, representing a step forward in mapping and validation of chromosomal locations of SNPs on the wheat 90K array. Moreover, it can be used as a reference for quantitative trait loci (QTL) mapping to facilitate exploitation of genes and QTL in wheat breeding. PMID:28848588
Genome-Wide Structural Variation Detection by Genome Mapping on Nanochannel Arrays.
Mak, Angel C Y; Lai, Yvonne Y Y; Lam, Ernest T; Kwok, Tsz-Piu; Leung, Alden K Y; Poon, Annie; Mostovoy, Yulia; Hastie, Alex R; Stedman, William; Anantharaman, Thomas; Andrews, Warren; Zhou, Xiang; Pang, Andy W C; Dai, Heng; Chu, Catherine; Lin, Chin; Wu, Jacob J K; Li, Catherine M L; Li, Jing-Woei; Yim, Aldrin K Y; Chan, Saki; Sibert, Justin; Džakula, Željko; Cao, Han; Yiu, Siu-Ming; Chan, Ting-Fung; Yip, Kevin Y; Xiao, Ming; Kwok, Pui-Yan
2016-01-01
Comprehensive whole-genome structural variation detection is challenging with current approaches. With diploid cells as DNA source and the presence of numerous repetitive elements, short-read DNA sequencing cannot be used to detect structural variation efficiently. In this report, we show that genome mapping with long, fluorescently labeled DNA molecules imaged on nanochannel arrays can be used for whole-genome structural variation detection without sequencing. While whole-genome haplotyping is not achieved, local phasing (across >150-kb regions) is routine, as molecules from the parental chromosomes are examined separately. In one experiment, we generated genome maps from a trio from the 1000 Genomes Project, compared the maps against that derived from the reference human genome, and identified structural variations that are >5 kb in size. We find that these individuals have many more structural variants than those published, including some with the potential of disrupting gene function or regulation. Copyright © 2016 by the Genetics Society of America.
Bilton, Timothy P.; Schofield, Matthew R.; Black, Michael A.; Chagné, David; Wilcox, Phillip L.; Dodds, Ken G.
2018-01-01
Next-generation sequencing is an efficient method that allows for substantially more markers than previous technologies, providing opportunities for building high-density genetic linkage maps, which facilitate the development of nonmodel species’ genomic assemblies and the investigation of their genes. However, constructing genetic maps using data generated via high-throughput sequencing technology (e.g., genotyping-by-sequencing) is complicated by the presence of sequencing errors and genotyping errors resulting from missing parental alleles due to low sequencing depth. If unaccounted for, these errors lead to inflated genetic maps. In addition, map construction in many species is performed using full-sibling family populations derived from the outcrossing of two individuals, where unknown parental phase and varying segregation types further complicate construction. We present a new methodology for modeling low coverage sequencing data in the construction of genetic linkage maps using full-sibling populations of diploid species, implemented in a package called GUSMap. Our model is based on the Lander–Green hidden Markov model but extended to account for errors present in sequencing data. We were able to obtain accurate estimates of the recombination fractions and overall map distance using GUSMap, while most existing mapping packages produced inflated genetic maps in the presence of errors. Our results demonstrate the feasibility of using low coverage sequencing data to produce genetic maps without requiring extensive filtering of potentially erroneous genotypes, provided that the associated errors are correctly accounted for in the model. PMID:29487138
Bilton, Timothy P; Schofield, Matthew R; Black, Michael A; Chagné, David; Wilcox, Phillip L; Dodds, Ken G
2018-05-01
Next-generation sequencing is an efficient method that allows for substantially more markers than previous technologies, providing opportunities for building high-density genetic linkage maps, which facilitate the development of nonmodel species' genomic assemblies and the investigation of their genes. However, constructing genetic maps using data generated via high-throughput sequencing technology ( e.g. , genotyping-by-sequencing) is complicated by the presence of sequencing errors and genotyping errors resulting from missing parental alleles due to low sequencing depth. If unaccounted for, these errors lead to inflated genetic maps. In addition, map construction in many species is performed using full-sibling family populations derived from the outcrossing of two individuals, where unknown parental phase and varying segregation types further complicate construction. We present a new methodology for modeling low coverage sequencing data in the construction of genetic linkage maps using full-sibling populations of diploid species, implemented in a package called GUSMap. Our model is based on the Lander-Green hidden Markov model but extended to account for errors present in sequencing data. We were able to obtain accurate estimates of the recombination fractions and overall map distance using GUSMap, while most existing mapping packages produced inflated genetic maps in the presence of errors. Our results demonstrate the feasibility of using low coverage sequencing data to produce genetic maps without requiring extensive filtering of potentially erroneous genotypes, provided that the associated errors are correctly accounted for in the model. Copyright © 2018 Bilton et al.
HapMap tagSNP transferability in multiple populations: general guidelines
Xing, Jinchuan; Witherspoon, David J.; Watkins, W. Scott; Zhang, Yuhua; Tolpinrud, Whitney; Jorde, Lynn B.
2008-01-01
This PDF receipt will only be used as the basis for generating PubMed Central (PMC) documents. PMC documents will be made available for review after conversion (approx. 2–3 weeks time). Any corrections that need to be made will be done at that time. No materials will be released to PMC without the approval of an author. Only the PMC documents will appear on PubMed Central -- this PDF Receipt will not appear on PubMed Central. Linkage disequilibrium (LD) has received much recent attention because of its value in localizing disease-causing genes. Due to the extensive LD between neighboring loci in the human genome, it is believed that a subset of the single nucleotide polymorphisms in a region (tagSNPs) can be selected to capture most of the remaining SNP variants. In this study, we examined LD patterns and HapMap tagSNP transferability in more than 300 individuals. A South Indian and an African Mbuti Pygmy population sample were included to evaluate the performance of HapMap tagSNPs in geographically distinct and genetically isolated populations. Our results show that HapMap tagSNPs selected with r2 >= 0.8 can capture more than 85% of the SNPs in populations that are from the same continental group. Combined tagSNPs from HapMap CEU and CHB+JPT serve as the best reference for the Indian sample. The HapMap YRI are a sufficient reference for tagSNP selection in the Pygmy sample. In addition to our findings, we reviewed over 25 recent studies of tagSNP transferability and propose a general guideline for selecting tagSNPs from HapMap populations. PMID:18482828
Abdul-Wajid, Sarah; Veeman, Michael T; Chiba, Shota; Turner, Thomas L; Smith, William C
2014-05-01
Studies in tunicates such as Ciona have revealed new insights into the evolutionary origins of chordate development. Ciona populations are characterized by high levels of natural genetic variation, between 1 and 5%. This variation has provided abundant material for forward genetic studies. In the current study, we make use of deep sequencing and homozygosity mapping to map spontaneous mutations in outbred populations. With this method we have mapped two spontaneous developmental mutants. In Ciona intestinalis we mapped a short-tail mutation with strong phenotypic similarity to a previously identified mutant in the related species Ciona savignyi. Our bioinformatic approach mapped the mutation to a narrow interval containing a single mutated gene, α-laminin3,4,5, which is the gene previously implicated in C. savignyi. In addition, we mapped a novel genetic mutation disrupting neural tube closure in C. savignyi to a T-type Ca(2+) channel gene. The high efficiency and unprecedented mapping resolution of our study is a powerful advantage for developmental genetics in Ciona, and may find application in other outbred species.
The binary protein-protein interaction landscape of Escherichia coli
Rajagopala, Seesandra V.; Vlasblom, James; Arnold, Roland; Franca-Koh, Jonathan; Pakala, Suman B.; Phanse, Sadhna; Ceol, Arnaud; Häuser, Roman; Siszler, Gabriella; Wuchty, Stefan; Emili, Andrew; Babu, Mohan; Aloy, Patrick; Pieper, Rembert; Uetz, Peter
2014-01-01
Efforts to map the Escherichia coli interactome have identified several hundred macromolecular complexes, but direct binary protein-protein interactions (PPIs) have not been surveyed on a large scale. Here we performed yeast two-hybrid screens of 3,305 baits against 3,606 preys (~70% of the E. coli proteome) in duplicate to generate a map of 2,234 interactions, approximately doubling the number of known binary PPIs in E. coli. Integration of binary PPIs and genetic interactions revealed functional dependencies among components involved in cellular processes, including envelope integrity, flagellum assembly and protein quality control. Many of the binary interactions that could be mapped within multi-protein complexes were informative regarding internal topology and indicated that interactions within complexes are significantly more conserved than those interactions connecting different complexes. This resource will be useful for inferring bacterial gene function and provides a draft reference of the basic physical wiring network of this evolutionarily significant model microbe. PMID:24561554
A global interaction network maps a wiring diagram of cellular function
Costanzo, Michael; VanderSluis, Benjamin; Koch, Elizabeth N.; Baryshnikova, Anastasia; Pons, Carles; Tan, Guihong; Wang, Wen; Usaj, Matej; Hanchard, Julia; Lee, Susan D.; Pelechano, Vicent; Styles, Erin B.; Billmann, Maximilian; van Leeuwen, Jolanda; van Dyk, Nydia; Lin, Zhen-Yuan; Kuzmin, Elena; Nelson, Justin; Piotrowski, Jeff S.; Srikumar, Tharan; Bahr, Sondra; Chen, Yiqun; Deshpande, Raamesh; Kurat, Christoph F.; Li, Sheena C.; Li, Zhijian; Usaj, Mojca Mattiazzi; Okada, Hiroki; Pascoe, Natasha; Luis, Bryan-Joseph San; Sharifpoor, Sara; Shuteriqi, Emira; Simpkins, Scott W.; Snider, Jamie; Suresh, Harsha Garadi; Tan, Yizhao; Zhu, Hongwei; Malod-Dognin, Noel; Janjic, Vuk; Przulj, Natasa; Troyanskaya, Olga G.; Stagljar, Igor; Xia, Tian; Ohya, Yoshikazu; Gingras, Anne-Claude; Raught, Brian; Boutros, Michael; Steinmetz, Lars M.; Moore, Claire L.; Rosebrock, Adam P.; Caudy, Amy A.; Myers, Chad L.; Andrews, Brenda; Boone, Charles
2017-01-01
We generated a global genetic interaction network for Saccharomyces cerevisiae, constructing over 23 million double mutants, identifying ~550,000 negative and ~350,000 positive genetic interactions. This comprehensive network maps genetic interactions for essential gene pairs, highlighting essential genes as densely connected hubs. Genetic interaction profiles enabled assembly of a hierarchical model of cell function, including modules corresponding to protein complexes and pathways, biological processes, and cellular compartments. Negative interactions connected functionally related genes, mapped core bioprocesses, and identified pleiotropic genes, whereas positive interactions often mapped general regulatory connections among gene pairs, rather than shared functionality. The global network illustrates how coherent sets of genetic interactions connect protein complex and pathway modules to map a functional wiring diagram of the cell. PMID:27708008
Genetic mapping in the presence of genotyping errors.
Cartwright, Dustin A; Troggio, Michela; Velasco, Riccardo; Gutin, Alexander
2007-08-01
Genetic maps are built using the genotypes of many related individuals. Genotyping errors in these data sets can distort genetic maps, especially by inflating the distances. We have extended the traditional likelihood model used for genetic mapping to include the possibility of genotyping errors. Each individual marker is assigned an error rate, which is inferred from the data, just as the genetic distances are. We have developed a software package, called TMAP, which uses this model to find maximum-likelihood maps for phase-known pedigrees. We have tested our methods using a data set in Vitis and on simulated data and confirmed that our method dramatically reduces the inflationary effect caused by increasing the number of markers and leads to more accurate orders.
Genetic Mapping in the Presence of Genotyping Errors
Cartwright, Dustin A.; Troggio, Michela; Velasco, Riccardo; Gutin, Alexander
2007-01-01
Genetic maps are built using the genotypes of many related individuals. Genotyping errors in these data sets can distort genetic maps, especially by inflating the distances. We have extended the traditional likelihood model used for genetic mapping to include the possibility of genotyping errors. Each individual marker is assigned an error rate, which is inferred from the data, just as the genetic distances are. We have developed a software package, called TMAP, which uses this model to find maximum-likelihood maps for phase-known pedigrees. We have tested our methods using a data set in Vitis and on simulated data and confirmed that our method dramatically reduces the inflationary effect caused by increasing the number of markers and leads to more accurate orders. PMID:17277374
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hasin-Brumshtein, Yehudit; Khan, Arshad H.; Hormozdiari, Farhad
2016-09-13
Previous studies had shown that the integration of genome wide expression profiles, in metabolic tissues, with genetic and phenotypic variance, provided valuable insight into the underlying molecular mechanisms. We used RNA-Seq to characterize hypothalamic transcriptome in 99 inbred strains of mice from the Hybrid Mouse Diversity Panel (HMDP), a reference resource population for cardiovascular and metabolic traits. We report numerous novel transcripts supported by proteomic analyses, as well as novel non coding RNAs. High resolution genetic mapping of transcript levels in HMDP, reveals bothlocalandtransexpression Quantitative Trait Loci (eQTLs) demonstrating 2transeQTL 'hotspots' associated with expression of hundreds of genes. We alsomore » report thousands of alternative splicing events regulated by genetic variants. Finally, comparison with about 150 metabolic and cardiovascular traits revealed many highly significant associations. Our data provide a rich resource for understanding the many physiologic functions mediated by the hypothalamus and their genetic regulation.« less
GDR (Genome Database for Rosaceae): integrated web-database for Rosaceae genomics and genetics data
Jung, Sook; Staton, Margaret; Lee, Taein; Blenda, Anna; Svancara, Randall; Abbott, Albert; Main, Dorrie
2008-01-01
The Genome Database for Rosaceae (GDR) is a central repository of curated and integrated genetics and genomics data of Rosaceae, an economically important family which includes apple, cherry, peach, pear, raspberry, rose and strawberry. GDR contains annotated databases of all publicly available Rosaceae ESTs, the genetically anchored peach physical map, Rosaceae genetic maps and comprehensively annotated markers and traits. The ESTs are assembled to produce unigene sets of each genus and the entire Rosaceae. Other annotations include putative function, microsatellites, open reading frames, single nucleotide polymorphisms, gene ontology terms and anchored map position where applicable. Most of the published Rosaceae genetic maps can be viewed and compared through CMap, the comparative map viewer. The peach physical map can be viewed using WebFPC/WebChrom, and also through our integrated GDR map viewer, which serves as a portal to the combined genetic, transcriptome and physical mapping information. ESTs, BACs, markers and traits can be queried by various categories and the search result sites are linked to the mapping visualization tools. GDR also provides online analysis tools such as a batch BLAST/FASTA server for the GDR datasets, a sequence assembly server and microsatellite and primer detection tools. GDR is available at http://www.rosaceae.org. PMID:17932055
DOE Office of Scientific and Technical Information (OSTI.GOV)
Li, Hongqiang; Chen, Hao; Bao, Lei
2005-01-01
Genetic loci that regulate inherited traits are routinely identified using quantitative trait locus (QTL) mapping methods. However, the genotype-phenotype associations do not provide information on the gene expression program through which the genetic loci regulate the traits. Transcription modules are 'selfconsistent regulatory units' and are closely related to the modular components of gene regulatory network [Ihmels, J., Friedlander, G., Bergmann, S., Sarig, O., Ziv, Y. and Barkai, N. (2002) Revealing modular organization in the yeast transcriptional network. Nat. Genet., 31, 370-377; Segal, E., Shapira, M., Regev, A., Pe'er, D., Botstein, D., Koller, D. and Friedman, N. (2003) Module networks: identifyingmore » regulatory modules and their condition-specific regulators from gene expression data. Nat. Genet., 34, 166-176]. We used genome-wide genotype and gene expression data of a genetic reference population that consists of mice of 32 recombinant inbred strains to identify the transcription modules and the genetic loci regulating them. Twenty-nine transcription modules defined by genetic variations were identified. Statistically significant associations between the transcription modules and 18 classical physiological and behavioral traits were found. Genome-wide interval mapping showed that major QTLs regulating the transcription modules are often co-localized with the QTLs regulating the associated classical traits. The association and the possible co-regulation of the classical trait and transcription module indicate that the transcription module may be involved in the gene pathways connecting the QTL and the classical trait. Our results show that a transcription module may associate with multiple seemingly unrelated classical traits and a classical trait may associate with different modules. Literature mining results provided strong independent evidences for the relations among genes of the transcription modules, genes in the regions of the QTLs regulating the transcription modules and the keywords representing the classical traits.« less
Yi, Liuxi; Gao, Fengyun; Siqin, Bateer; Zhou, Yu; Li, Qiang; Zhao, Xiaoqing; Jia, Xiaoyun; Zhang, Hui
2017-01-01
Flax is an important crop for oil and fiber, however, no high-density genetic maps have been reported for this species. Specific length amplified fragment sequencing (SLAF-seq) is a high-resolution strategy for large scale de novo discovery and genotyping of single nucleotide polymorphisms. In this study, SLAF-seq was employed to develop SNP markers in an F2 population to construct a high-density genetic map for flax. In total, 196.29 million paired-end reads were obtained. The average sequencing depth was 25.08 in male parent, 32.17 in the female parent, and 9.64 in each F2 progeny. In total, 389,288 polymorphic SLAFs were detected, from which 260,380 polymorphic SNPs were developed. After filtering, 4,638 SNPs were found suitable for genetic map construction. The final genetic map included 4,145 SNP markers on 15 linkage groups and was 2,632.94 cM in length, with an average distance of 0.64 cM between adjacent markers. To our knowledge, this map is the densest SNP-based genetic map for flax. The SNP markers and genetic map reported in here will serve as a foundation for the fine mapping of quantitative trait loci (QTLs), map-based gene cloning and marker assisted selection (MAS) for flax.
2011-01-01
Background Comparative genome mapping studies in Rosaceae have been conducted until now by aligning genetic maps within the same genus, or closely related genera and using a limited number of common markers. The growing body of genomics resources and sequence data for both Prunus and Fragaria permits detailed comparisons between these genera and the recently released Malus × domestica genome sequence. Results We generated a comparative analysis using 806 molecular markers that are anchored genetically to the Prunus and/or Fragaria reference maps, and physically to the Malus genome sequence. Markers in common for Malus and Prunus, and Malus and Fragaria, respectively were 784 and 148. The correspondence between marker positions was high and conserved syntenic blocks were identified among the three genera in the Rosaceae. We reconstructed a proposed ancestral genome for the Rosaceae. Conclusions A genome containing nine chromosomes is the most likely candidate for the ancestral Rosaceae progenitor. The number of chromosomal translocations observed between the three genera investigated was low. However, the number of inversions identified among Malus and Prunus was much higher than any reported genome comparisons in plants, suggesting that small inversions have played an important role in the evolution of these two genera or of the Rosaceae. PMID:21226921
Haemonchus contortus: Genome Structure, Organization and Comparative Genomics.
Laing, R; Martinelli, A; Tracey, A; Holroyd, N; Gilleard, J S; Cotton, J A
2016-01-01
One of the first genome sequencing projects for a parasitic nematode was that for Haemonchus contortus. The open access data from the Wellcome Trust Sanger Institute provided a valuable early resource for the research community, particularly for the identification of specific genes and genetic markers. Later, a second sequencing project was initiated by the University of Melbourne, and the two draft genome sequences for H. contortus were published back-to-back in 2013. There is a pressing need for long-range genomic information for genetic mapping, population genetics and functional genomic studies, so we are continuing to improve the Wellcome Trust Sanger Institute assembly to provide a finished reference genome for H. contortus. This review describes this process, compares the H. contortus genome assemblies with draft genomes from other members of the strongylid group and discusses future directions for parasite genomics using the H. contortus model. Copyright © 2016 Elsevier Ltd. All rights reserved.
Variation in recombination rate may bias human genetic disease mapping studies.
Boyle, A Susannah; Noor, Mohamed A F
2004-11-01
The availability of the human genome sequence and variability information (as from the International HapMap project) will enhance our ability to map genetic disorders and choose targets for therapeutic intervention. However, several factors, such as regional variation in recombination rate, can bias conclusions from genetic mapping studies. Here, we examine the impact of regional variation in recombination rate across the human genome. Through computer simulations and literature surveys, we conclude that genetic disorders have been mapped to regions of low recombination more often than expected if such diseases were randomly distributed across the genome. This concentration in low recombination regions may be an artifact, and disorders appearing to be caused by a few genes of large effect may be polygenic. Future genetic mapping studies should be conscious of this potential complication by noting the regional recombination rate of regions implicated in diseases.
2012-01-01
Background Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. Results An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. Conclusions The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison. PMID:22908993
A microsatellite genetic linkage map of black rockfish ( Sebastes schlegeli)
NASA Astrophysics Data System (ADS)
Chu, Guannan; Jiang, Liming; He, Yan; Yu, Haiyang; Wang, Zhigang; Jiang, Haibin; Zhang, Quanqi
2014-12-01
Ovoviviparous black rockfish ( Sebastes schlegeli) is an important marine fish species for aquaculture and fisheries in China. Genetic information of this species is scarce because of the lack of microsatellite markers. In this study, a large number of microsatellite markers of black rockfish were isolated by constructing microsatellite-enriched libraries. Female- and male-specific genetic linkage maps were constructed using 435 microsatellite markers genotyped in a full-sib family of the fish species. The female linkage map contained 140 microsatellite markers, in which 23 linkage groups had a total genetic length of 1334.1 cM and average inter-marker space of 13.3 cM. The male linkage map contained 156 microsatellite markers, in which 25 linkage groups had a total genetic length of 1359.6 cM and average inter-marker distance of 12.4 cM. The genome coverage of the female and male linkage maps was 68.6% and 69.3%, respectively. The female-to-male ratio of the recombination rate was approximately 1.07:1 in adjacent microsatellite markers. This paper presents the first genetic linkage map of microsatellites in black rockfish. The collection of polymorphic markers and sex-specific linkage maps of black rockfish could be useful for further investigations on parental assignment, population genetics, quantitative trait loci mapping, and marker-assisted selection in related breeding programs.
Liu, Jie; Milne, Richard I; Möller, Michael; Zhu, Guang-Fu; Ye, Lin-Jiang; Luo, Ya-Huang; Yang, Jun-Bo; Wambulwa, Moses C; Wang, Chun-Neng; Li, De-Zhu; Gao, Lian-Ming
2018-05-22
Rapid and accurate identification of endangered species is a critical component of biosurveillance and conservation management, and potentially policing illegal trades. However, this is often not possible using traditional taxonomy, especially where only small or preprocessed parts of plants are available. Reliable identification can be achieved via a comprehensive DNA barcode reference library, accompanied by precise distribution data. However, these require extensive sampling at spatial and taxonomic scales, which has rarely been achieved for cosmopolitan taxa. Here, we construct a comprehensive DNA barcode reference library and generate distribution maps using species distribution modelling (SDM), for all 15 Taxus species worldwide. We find that trnL-trnF is the ideal barcode for Taxus: It can distinguish all Taxus species and in combination with ITS identify hybrids. Among five analysis methods tested, NJ was the most effective. Among 4,151 individuals screened for trnL-trnF, 73 haplotypes were detected, all species-specific and some population private. Taxonomical, geographical and genetic dimensions of sampling strategy were all found to affect the comprehensiveness of the resulting DNA barcode library. Maps from SDM showed that most species had allopatric distributions, except T. mairei in the Sino-Himalayan region. Using the barcode library and distribution map data, two unknown forensic samples were identified to species (and in one case, population) level and another was determined as a putative interspecific hybrid. This integrated species identification system for Taxus can be used for biosurveillance, conservation management and to monitor and prosecute illegal trade. Similar identification systems are recommended for other IUCN- and CITES-listed taxa. © 2018 John Wiley & Sons Ltd.
USDA-ARS?s Scientific Manuscript database
Comparative genetic mapping between clementine, pummelo and sweet orange and the interspecicic structure of the Clementine genome The availability of a saturated genetic map of Clementine was identified by the International Citrus Genome Consortium as an essential prerequisite to assist the assembly...
2012-01-01
Background Mycobacterium avium subspecies paratuberculosis (Map) is the aetiological agent of Johne’s disease or paratuberculosis and is included within the Mycobacterium avium complex (MAC). Map strains are of two major types often referred to as ‘Sheep’ or ‘S-type’ and ‘Cattle’ or ‘C-type’. With the advent of more discriminatory typing techniques it has been possible to further classify the S-type strains into two groups referred to as Type I and Type III. This study was undertaken to genotype a large panel of S-type small ruminant isolates from different hosts and geographical origins and to compare them with a large panel of well documented C-type isolates to assess the genetic diversity of these strain types. Methods used included Mycobacterial Interspersed Repetitive Units - Variable-Number Tandem Repeat analysis (MIRU-VNTR), analysis of Large Sequence Polymorphisms by PCR (LSP analysis), Single Nucleotide Polymorphism (SNP) analysis of gyr genes, Pulsed-Field Gel Electrophoresis (PFGE) and Restriction Fragment Length Polymorphism analysis coupled with hybridization to IS900 (IS900-RFLP) analysis. Results The presence of LSPA4 and absence of LSPA20 was confirmed in all 24 Map S-type strains analysed. SNPs within the gyr genes divided the S-type strains into types I and III. Twenty four PFGE multiplex profiles and eleven different IS900-RFLP profiles were identified among the S-type isolates, some of them not previously published. Both PFGE and IS900-RFLP segregated the S-type strains into types I and III and the results concurred with those of the gyr SNP analysis. Nine MIRU-VNTR genotypes were identified in these isolates. MIRU-VNTR analysis differentiated Map strains from other members of Mycobacterium avium Complex, and Map S-type from C-type but not type I from III. Pigmented Map isolates were found of type I or III. Conclusion This is the largest panel of S-type strains investigated to date. The S-type strains could be further divided into two subtypes, I and III by some of the typing techniques (IS900-RFLP, PFGE and SNP analysis of the gyr genes). MIRU-VNTR did not divide the strains into the subtypes I and III but did detect genetic differences between isolates within each of the subtypes. Pigmentation is not exclusively associated with type I strains. PMID:23164429
A random matrix approach to language acquisition
NASA Astrophysics Data System (ADS)
Nicolaidis, A.; Kosmidis, Kosmas; Argyrakis, Panos
2009-12-01
Since language is tied to cognition, we expect the linguistic structures to reflect patterns that we encounter in nature and are analyzed by physics. Within this realm we investigate the process of lexicon acquisition, using analytical and tractable methods developed within physics. A lexicon is a mapping between sounds and referents of the perceived world. This mapping is represented by a matrix and the linguistic interaction among individuals is described by a random matrix model. There are two essential parameters in our approach. The strength of the linguistic interaction β, which is considered as a genetically determined ability, and the number N of sounds employed (the lexicon size). Our model of linguistic interaction is analytically studied using methods of statistical physics and simulated by Monte Carlo techniques. The analysis reveals an intricate relationship between the innate propensity for language acquisition β and the lexicon size N, N~exp(β). Thus a small increase of the genetically determined β may lead to an incredible lexical explosion. Our approximate scheme offers an explanation for the biological affinity of different species and their simultaneous linguistic disparity.
Decomposing Oncogenic Transcriptional Signatures to Generate Maps of Divergent Cellular States.
Kim, Jong Wook; Abudayyeh, Omar O; Yeerna, Huwate; Yeang, Chen-Hsiang; Stewart, Michelle; Jenkins, Russell W; Kitajima, Shunsuke; Konieczkowski, David J; Medetgul-Ernar, Kate; Cavazos, Taylor; Mah, Clarence; Ting, Stephanie; Van Allen, Eliezer M; Cohen, Ofir; Mcdermott, John; Damato, Emily; Aguirre, Andrew J; Liang, Jonathan; Liberzon, Arthur; Alexe, Gabriella; Doench, John; Ghandi, Mahmoud; Vazquez, Francisca; Weir, Barbara A; Tsherniak, Aviad; Subramanian, Aravind; Meneses-Cime, Karina; Park, Jason; Clemons, Paul; Garraway, Levi A; Thomas, David; Boehm, Jesse S; Barbie, David A; Hahn, William C; Mesirov, Jill P; Tamayo, Pablo
2017-08-23
The systematic sequencing of the cancer genome has led to the identification of numerous genetic alterations in cancer. However, a deeper understanding of the functional consequences of these alterations is necessary to guide appropriate therapeutic strategies. Here, we describe Onco-GPS (OncoGenic Positioning System), a data-driven analysis framework to organize individual tumor samples with shared oncogenic alterations onto a reference map defined by their underlying cellular states. We applied the methodology to the RAS pathway and identified nine distinct components that reflect transcriptional activities downstream of RAS and defined several functional states associated with patterns of transcriptional component activation that associates with genomic hallmarks and response to genetic and pharmacological perturbations. These results show that the Onco-GPS is an effective approach to explore the complex landscape of oncogenic cellular states across cancers, and an analytic framework to summarize knowledge, establish relationships, and generate more effective disease models for research or as part of individualized precision medicine paradigms. Copyright © 2017 Elsevier Inc. All rights reserved.
A clone-free, single molecule map of the domestic cow (Bos taurus) genome.
Zhou, Shiguo; Goldstein, Steve; Place, Michael; Bechner, Michael; Patino, Diego; Potamousis, Konstantinos; Ravindran, Prabu; Pape, Louise; Rincon, Gonzalo; Hernandez-Ortiz, Juan; Medrano, Juan F; Schwartz, David C
2015-08-28
The cattle (Bos taurus) genome was originally selected for sequencing due to its economic importance and unique biology as a model organism for understanding other ruminants, or mammals. Currently, there are two cattle genome sequence assemblies (UMD3.1 and Btau4.6) from groups using dissimilar assembly algorithms, which were complemented by genetic and physical map resources. However, past comparisons between these assemblies revealed substantial differences. Consequently, such discordances have engendered ambiguities when using reference sequence data, impacting genomic studies in cattle and motivating construction of a new optical map resource--BtOM1.0--to guide comparisons and improvements to the current sequence builds. Accordingly, our comprehensive comparisons of BtOM1.0 against the UMD3.1 and Btau4.6 sequence builds tabulate large-to-immediate scale discordances requiring mediation. The optical map, BtOM1.0, spanning the B. taurus genome (Hereford breed, L1 Dominette 01449) was assembled from an optical map dataset consisting of 2,973,315 (439 X; raw dataset size before assembly) single molecule optical maps (Rmaps; 1 Rmap = 1 restriction mapped DNA molecule) generated by the Optical Mapping System. The BamHI map spans 2,575.30 Mb and comprises 78 optical contigs assembled by a combination of iterative (using the reference sequence: UMD3.1) and de novo assembly techniques. BtOM1.0 is a high-resolution physical map featuring an average restriction fragment size of 8.91 Kb. Comparisons of BtOM1.0 vs. UMD3.1, or Btau4.6, revealed that Btau4.6 presented far more discordances (7,463) vs. UMD3.1 (4,754). Overall, we found that Btau4.6 presented almost double the number of discordances than UMD3.1 across most of the 6 categories of sequence vs. map discrepancies, which are: COMPLEX (misassembly), DELs (extraneous sequences), INSs (missing sequences), ITs (Inverted/Translocated sequences), ECs (extra restriction cuts) and MCs (missing restriction cuts). Alignments of UMD3.1 and Btau4.6 to BtOM1.0 reveal discordances commensurate with previous reports, and affirm the NCBI's current designation of UMD3.1 sequence assembly as the "reference assembly" and the Btau4.6 as the "alternate assembly." The cattle genome optical map, BtOM1.0, when used as a comprehensive and largely independent guide, will greatly assist improvements to existing sequence builds, and later serve as an accurate physical scaffold for studies concerning the comparative genomics of cattle breeds.
Hinze, Lori L; Gazave, Elodie; Gore, Michael A; Fang, David D; Scheffler, Brian E; Yu, John Z; Jones, Don C; Frelichowski, James; Percy, Richard G
2016-05-01
A diversity reference set has been constructed for the Gossypium accessions in the US National Cotton Germplasm Collection to facilitate more extensive evaluation and utilization of accessions held in the Collection. A set of 105 mapped simple sequence repeat markers was used to study the allelic diversity of 1933 tetraploid Gossypium accessions representative of the range of diversity of the improved and wild accessions of G. hirsutum and G. barbadense. The reference set contained 410 G. barbadense accessions and 1523 G. hirsutum accessions. Observed numbers of polymorphic and private bands indicated a greater diversity in G. hirsutum as compared to G. barbadense as well as in wild-type accessions as compared to improved accessions in both species. The markers clearly differentiated the 2 species. Patterns of diversity within species were observed but not clearly delineated, with much overlap occurring between races and regions of origin for wild accessions and between historical and geographic breeding pools for cultivated accessions. Although the percentage of accessions showing introgression was higher among wild accessions than cultivars in both species, the average level of introgression within individual accessions, as indicated by species-specific bands, was much higher in wild accessions of G. hirsutum than in wild accessions of G. barbadense. The average level of introgression within individual accessions was higher in improved G. barbadense cultivars than in G. hirsutum cultivars. This molecular characterization reveals the levels and distributions of genetic diversity that will allow for better exploration and utilization of cotton genetic resources. Published by Oxford University Press on behalf of the American Genetic Association 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Druka, Arnis; Druka, Ilze; Centeno, Arthur G; Li, Hongqiang; Sun, Zhaohui; Thomas, William T B; Bonar, Nicola; Steffenson, Brian J; Ullrich, Steven E; Kleinhofs, Andris; Wise, Roger P; Close, Timothy J; Potokina, Elena; Luo, Zewei; Wagner, Carola; Schweizer, Günther F; Marshall, David F; Kearsey, Michael J; Williams, Robert W; Waugh, Robbie
2008-11-18
A typical genetical genomics experiment results in four separate data sets; genotype, gene expression, higher-order phenotypic data and metadata that describe the protocols, processing and the array platform. Used in concert, these data sets provide the opportunity to perform genetic analysis at a systems level. Their predictive power is largely determined by the gene expression dataset where tens of millions of data points can be generated using currently available mRNA profiling technologies. Such large, multidimensional data sets often have value beyond that extracted during their initial analysis and interpretation, particularly if conducted on widely distributed reference genetic materials. Besides quality and scale, access to the data is of primary importance as accessibility potentially allows the extraction of considerable added value from the same primary dataset by the wider research community. Although the number of genetical genomics experiments in different plant species is rapidly increasing, none to date has been presented in a form that allows quick and efficient on-line testing for possible associations between genes, loci and traits of interest by an entire research community. Using a reference population of 150 recombinant doubled haploid barley lines we generated novel phenotypic, mRNA abundance and SNP-based genotyping data sets, added them to a considerable volume of legacy trait data and entered them into the GeneNetwork http://www.genenetwork.org. GeneNetwork is a unified on-line analytical environment that enables the user to test genetic hypotheses about how component traits, such as mRNA abundance, may interact to condition more complex biological phenotypes (higher-order traits). Here we describe these barley data sets and demonstrate some of the functionalities GeneNetwork provides as an easily accessible and integrated analytical environment for exploring them. By integrating barley genotypic, phenotypic and mRNA abundance data sets directly within GeneNetwork's analytical environment we provide simple web access to the data for the research community. In this environment, a combination of correlation analysis and linkage mapping provides the potential to identify and substantiate gene targets for saturation mapping and positional cloning. By integrating datasets from an unsequenced crop plant (barley) in a database that has been designed for an animal model species (mouse) with a well established genome sequence, we prove the importance of the concept and practice of modular development and interoperability of software engineering for biological data sets.
Genetic Control of Fruit Vitamin C Contents1
Davey, Mark W.; Kenis, Katrien; Keulemans, Johan
2006-01-01
An F1 progeny derived from a cross between the apple (Malus x domestica) cultivars Telamon and Braeburn was used to identify quantitative trait loci (QTL) linked to the vitamin C (l-ascorbate [l-AA]) contents of fruit skin and flesh (cortex) tissues. We identified up to three highly significant QTLs for both the mean l-AA and the mean total l-AA contents of fruit flesh on both parental genetic linkage maps, confirming the quantitative nature of these traits. These QTLs account for up to a maximum of 60% of the total population variation observed in the progeny, and with a maximal individual contribution of 31% per QTL. QTLs common to both parents were identified on linkage groups (LGs) 6, 10, and 11 of the Malus reference map, while each parent also had additional unique QTLs on other LGs. Interestingly, one strong QTL on LG-17 of the Telamon linkage map colocalized with a highly significant QTL associated with flesh browning, and a minor QTL for dehydroascorbate content, supporting earlier work that links fruit l-AA contents with the susceptibility of hardfruit to postharvest browning. We also found significant minor QTLs for skin l-AA and total l-AA (l-AA + dehydroascorbate) contents in Telamon. Currently, little is known about the genetic determinants underlying tissue l-AA homeostasis, but the presence of major, highly significant QTL in both these apple genotypes under field conditions suggests the existence of common control mechanisms, allelic heterozygosity, and helps outline strategies and the potential for the molecular breeding of these traits. PMID:16844833
Characterization of a Thermo-Inducible Chlorophyll-Deficient Mutant in Barley.
Wang, Rong; Yang, Fei; Zhang, Xiao-Qi; Wu, Dianxin; Tan, Cong; Westcott, Sharon; Broughton, Sue; Li, Chengdao; Zhang, Wenying; Xu, Yanhao
2017-01-01
Leaf color is an important trait for not only controlling crop yield but also monitoring plant status under temperature stress. In this study, a thermo-inducible chlorophyll-deficient mutant, named V-V-Y, was identified from a gamma-radiated population of the barley variety Vlamingh. The leaves of the mutant were green under normal growing temperature but turned yellowish under high temperature in the glasshouse experiment. The ratio of chlorophyll a and chlorophyll b in the mutant declined much faster in the first 7-9 days under heat treatment. The leaves of V-V-Y turned yellowish but took longer to senesce under heat stress in the field experiment. Genetic analysis indicated that a single nuclear gene controlled the mutant trait. The mutant gene ( vvy ) was mapped to the long arm of chromosome 4H between SNP markers 1_0269 and 1_1531 with a genetic distance of 2.2 cM and a physical interval of 9.85 Mb. A QTL for grain yield was mapped to the same interval and explained 10.4% of the yield variation with a LOD score of 4. This QTL is coincident with the vvy gene interval that is responsible for the thermo-inducible chlorophyll-deficient trait. Fine mapping, based on the barley reference genome sequence, further narrowed the vvy gene to a physical interval of 0.428 Mb with 11 annotated genes. This is the first report of fine mapping a thermo-inducible chlorophyll-deficient gene in barley.
Chopra, Ratan; Burow, Gloria; Farmer, Andrew; Mudge, Joann; Simpson, Charles E; Wilkins, Thea A; Baring, Michael R; Puppala, Naveen; Chamberlin, Kelly D; Burow, Mark D
2015-06-01
Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker-trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. CopyDNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, compromising 43,108 contigs; 263,840 SNP and indel variants were identified among four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.
A spatial haplotype copying model with applications to genotype imputation.
Yang, Wen-Yun; Hormozdiari, Farhad; Eskin, Eleazar; Pasaniuc, Bogdan
2015-05-01
Ever since its introduction, the haplotype copy model has proven to be one of the most successful approaches for modeling genetic variation in human populations, with applications ranging from ancestry inference to genotype phasing and imputation. Motivated by coalescent theory, this approach assumes that any chromosome (haplotype) can be modeled as a mosaic of segments copied from a set of chromosomes sampled from the same population. At the core of the model is the assumption that any chromosome from the sample is equally likely to contribute a priori to the copying process. Motivated by recent works that model genetic variation in a geographic continuum, we propose a new spatial-aware haplotype copy model that jointly models geography and the haplotype copying process. We extend hidden Markov models of haplotype diversity such that at any given location, haplotypes that are closest in the genetic-geographic continuum map are a priori more likely to contribute to the copying process than distant ones. Through simulations starting from the 1000 Genomes data, we show that our model achieves superior accuracy in genotype imputation over the standard spatial-unaware haplotype copy model. In addition, we show the utility of our model in selecting a small personalized reference panel for imputation that leads to both improved accuracy as well as to a lower computational runtime than the standard approach. Finally, we show our proposed model can be used to localize individuals on the genetic-geographical map on the basis of their genotype data.
Thomas L. Kubisiak; C.Dana Nelson; W.L. Name; M. Stine
1996-01-01
Considerable concern has been voiced regarding the reproducibility/transferability of RAPD markers across different genetic backgrounds in genetic mapping experiments. Therefore, separate gametic subsets (mapping populations) were used to construct individual random amplified polymorphic DNA (RAPD) linkage maps for a single longleaf pine (Pinus palustris...
Zhao, Yuhui; Su, Kai; Wang, Gang; Zhang, Liping; Zhang, Jijun; Li, Junpeng; Guo, Yinshan
2017-07-14
Genetic linkage maps are an important tool in genetic and genomic research. In this study, two hawthorn cultivars, Qiujinxing and Damianqiu, and 107 progenies from a cross between them were used for constructing a high-density genetic linkage map using the 2b-restriction site-associated DNA (2b-RAD) sequencing method, as well as for mapping quantitative trait loci (QTL) for flavonoid content. In total, 206,411,693 single-end reads were obtained, with an average sequencing depth of 57× in the parents and 23× in the progeny. After quality trimming, 117,896 high-quality 2b-RAD tags were retained, of which 42,279 were polymorphic; of these, 12,951 markers were used for constructing the genetic linkage map. The map contained 17 linkage groups and 3,894 markers, with a total map length of 1,551.97 cM and an average marker interval of 0.40 cM. QTL mapping identified 21 QTLs associated with flavonoid content in 10 linkage groups, which explained 16.30-59.00% of the variance. This is the first high-density linkage map for hawthorn, which will serve as a basis for fine-scale QTL mapping and marker-assisted selection of important traits in hawthorn germplasm and will facilitate chromosome assignment for hawthorn whole-genome assemblies in the future.
2013-03-14
SUPPLEMENTARY NOTES 14. ABSTRACT Autism is an extremely common and heterogeneous neurodevelopmental disorder. While genetic factors are known to play...AFRL-SA-WP-TR-2013-0013 Comprehensive Clinical Phenotyping and Genetic Mapping for the Discovery of Autism Susceptibility Genes...Genetic Mapping for the Discovery of Autism Susceptibility Genes 5a. CONTRACT NUMBER N/A 5b. GRANT NUMBER N/A 5c. PROGRAM ELEMENT NUMBER N/A 6
Development of forward genetics in Toxoplasma gondii
Sibley, L. David
2009-01-01
The development of forward genetics as a functional system in Toxoplasma gondii spanned more than three decades from the mid-1970s until now. The initial demonstration of experimental genetics relied on chemically-induced drug resistant mutants that were crossed by co-infecting cats, collecting oocysts, sporulating and hatching progeny in vitro. To capitalize on this, genetic markers were employed to develop linkage maps by tracking inheritance through experimental crosses. In all, three generations of genetic maps were developed to define the chromosomes, estimate recombination rates, and provide a system for linkage analysis. Ultimately this genetic map would become the foundation for the assembly of the T. gondii genome, which was derived from whole genome shotgun sequencing, into a chromosome-centric view. Finally, application of forward genetics to multigenic biological traits showed the potential to map and identify specific genes that control complex phenotypes including virulence. PMID:19254720
The zebrafish reference genome sequence and its relationship to the human genome.
Howe, Kerstin; Clark, Matthew D; Torroja, Carlos F; Torrance, James; Berthelot, Camille; Muffato, Matthieu; Collins, John E; Humphray, Sean; McLaren, Karen; Matthews, Lucy; McLaren, Stuart; Sealy, Ian; Caccamo, Mario; Churcher, Carol; Scott, Carol; Barrett, Jeffrey C; Koch, Romke; Rauch, Gerd-Jörg; White, Simon; Chow, William; Kilian, Britt; Quintais, Leonor T; Guerra-Assunção, José A; Zhou, Yi; Gu, Yong; Yen, Jennifer; Vogel, Jan-Hinnerk; Eyre, Tina; Redmond, Seth; Banerjee, Ruby; Chi, Jianxiang; Fu, Beiyuan; Langley, Elizabeth; Maguire, Sean F; Laird, Gavin K; Lloyd, David; Kenyon, Emma; Donaldson, Sarah; Sehra, Harminder; Almeida-King, Jeff; Loveland, Jane; Trevanion, Stephen; Jones, Matt; Quail, Mike; Willey, Dave; Hunt, Adrienne; Burton, John; Sims, Sarah; McLay, Kirsten; Plumb, Bob; Davis, Joy; Clee, Chris; Oliver, Karen; Clark, Richard; Riddle, Clare; Elliot, David; Eliott, David; Threadgold, Glen; Harden, Glenn; Ware, Darren; Begum, Sharmin; Mortimore, Beverley; Mortimer, Beverly; Kerry, Giselle; Heath, Paul; Phillimore, Benjamin; Tracey, Alan; Corby, Nicole; Dunn, Matthew; Johnson, Christopher; Wood, Jonathan; Clark, Susan; Pelan, Sarah; Griffiths, Guy; Smith, Michelle; Glithero, Rebecca; Howden, Philip; Barker, Nicholas; Lloyd, Christine; Stevens, Christopher; Harley, Joanna; Holt, Karen; Panagiotidis, Georgios; Lovell, Jamieson; Beasley, Helen; Henderson, Carl; Gordon, Daria; Auger, Katherine; Wright, Deborah; Collins, Joanna; Raisen, Claire; Dyer, Lauren; Leung, Kenric; Robertson, Lauren; Ambridge, Kirsty; Leongamornlert, Daniel; McGuire, Sarah; Gilderthorp, Ruth; Griffiths, Coline; Manthravadi, Deepa; Nichol, Sarah; Barker, Gary; Whitehead, Siobhan; Kay, Michael; Brown, Jacqueline; Murnane, Clare; Gray, Emma; Humphries, Matthew; Sycamore, Neil; Barker, Darren; Saunders, David; Wallis, Justene; Babbage, Anne; Hammond, Sian; Mashreghi-Mohammadi, Maryam; Barr, Lucy; Martin, Sancha; Wray, Paul; Ellington, Andrew; Matthews, Nicholas; Ellwood, Matthew; Woodmansey, Rebecca; Clark, Graham; Cooper, James D; Cooper, James; Tromans, Anthony; Grafham, Darren; Skuce, Carl; Pandian, Richard; Andrews, Robert; Harrison, Elliot; Kimberley, Andrew; Garnett, Jane; Fosker, Nigel; Hall, Rebekah; Garner, Patrick; Kelly, Daniel; Bird, Christine; Palmer, Sophie; Gehring, Ines; Berger, Andrea; Dooley, Christopher M; Ersan-Ürün, Zübeyde; Eser, Cigdem; Geiger, Horst; Geisler, Maria; Karotki, Lena; Kirn, Anette; Konantz, Judith; Konantz, Martina; Oberländer, Martina; Rudolph-Geiger, Silke; Teucke, Mathias; Lanz, Christa; Raddatz, Günter; Osoegawa, Kazutoyo; Zhu, Baoli; Rapp, Amanda; Widaa, Sara; Langford, Cordelia; Yang, Fengtang; Schuster, Stephan C; Carter, Nigel P; Harrow, Jennifer; Ning, Zemin; Herrero, Javier; Searle, Steve M J; Enright, Anton; Geisler, Robert; Plasterk, Ronald H A; Lee, Charles; Westerfield, Monte; de Jong, Pieter J; Zon, Leonard I; Postlethwait, John H; Nüsslein-Volhard, Christiane; Hubbard, Tim J P; Roest Crollius, Hugues; Rogers, Jane; Stemple, Derek L
2013-04-25
Zebrafish have become a popular organism for the study of vertebrate gene function. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes. To examine this, we generated a high-quality sequence assembly of the zebrafish genome, made up of an overlapping set of completely sequenced large-insert clones that were ordered and oriented using a high-resolution high-density meiotic map. Detailed automatic and manual annotation provides evidence of more than 26,000 protein-coding genes, the largest gene set of any vertebrate so far sequenced. Comparison to the human reference genome shows that approximately 70% of human genes have at least one obvious zebrafish orthologue. In addition, the high quality of this genome assembly provides a clearer understanding of key genomic features such as a unique repeat content, a scarcity of pseudogenes, an enrichment of zebrafish-specific genes on chromosome 4 and chromosomal regions that influence sex determination.
The zebrafish reference genome sequence and its relationship to the human genome
Howe, Kerstin; Clark, Matthew D.; Torroja, Carlos F.; Torrance, James; Berthelot, Camille; Muffato, Matthieu; Collins, John E.; Humphray, Sean; McLaren, Karen; Matthews, Lucy; McLaren, Stuart; Sealy, Ian; Caccamo, Mario; Churcher, Carol; Scott, Carol; Barrett, Jeffrey C.; Koch, Romke; Rauch, Gerd-Jörg; White, Simon; Chow, William; Kilian, Britt; Quintais, Leonor T.; Guerra-Assunção, José A.; Zhou, Yi; Gu, Yong; Yen, Jennifer; Vogel, Jan-Hinnerk; Eyre, Tina; Redmond, Seth; Banerjee, Ruby; Chi, Jianxiang; Fu, Beiyuan; Langley, Elizabeth; Maguire, Sean F.; Laird, Gavin K.; Lloyd, David; Kenyon, Emma; Donaldson, Sarah; Sehra, Harminder; Almeida-King, Jeff; Loveland, Jane; Trevanion, Stephen; Jones, Matt; Quail, Mike; Willey, Dave; Hunt, Adrienne; Burton, John; Sims, Sarah; McLay, Kirsten; Plumb, Bob; Davis, Joy; Clee, Chris; Oliver, Karen; Clark, Richard; Riddle, Clare; Eliott, David; Threadgold, Glen; Harden, Glenn; Ware, Darren; Mortimer, Beverly; Kerry, Giselle; Heath, Paul; Phillimore, Benjamin; Tracey, Alan; Corby, Nicole; Dunn, Matthew; Johnson, Christopher; Wood, Jonathan; Clark, Susan; Pelan, Sarah; Griffiths, Guy; Smith, Michelle; Glithero, Rebecca; Howden, Philip; Barker, Nicholas; Stevens, Christopher; Harley, Joanna; Holt, Karen; Panagiotidis, Georgios; Lovell, Jamieson; Beasley, Helen; Henderson, Carl; Gordon, Daria; Auger, Katherine; Wright, Deborah; Collins, Joanna; Raisen, Claire; Dyer, Lauren; Leung, Kenric; Robertson, Lauren; Ambridge, Kirsty; Leongamornlert, Daniel; McGuire, Sarah; Gilderthorp, Ruth; Griffiths, Coline; Manthravadi, Deepa; Nichol, Sarah; Barker, Gary; Whitehead, Siobhan; Kay, Michael; Brown, Jacqueline; Murnane, Clare; Gray, Emma; Humphries, Matthew; Sycamore, Neil; Barker, Darren; Saunders, David; Wallis, Justene; Babbage, Anne; Hammond, Sian; Mashreghi-Mohammadi, Maryam; Barr, Lucy; Martin, Sancha; Wray, Paul; Ellington, Andrew; Matthews, Nicholas; Ellwood, Matthew; Woodmansey, Rebecca; Clark, Graham; Cooper, James; Tromans, Anthony; Grafham, Darren; Skuce, Carl; Pandian, Richard; Andrews, Robert; Harrison, Elliot; Kimberley, Andrew; Garnett, Jane; Fosker, Nigel; Hall, Rebekah; Garner, Patrick; Kelly, Daniel; Bird, Christine; Palmer, Sophie; Gehring, Ines; Berger, Andrea; Dooley, Christopher M.; Ersan-Ürün, Zübeyde; Eser, Cigdem; Geiger, Horst; Geisler, Maria; Karotki, Lena; Kirn, Anette; Konantz, Judith; Konantz, Martina; Oberländer, Martina; Rudolph-Geiger, Silke; Teucke, Mathias; Osoegawa, Kazutoyo; Zhu, Baoli; Rapp, Amanda; Widaa, Sara; Langford, Cordelia; Yang, Fengtang; Carter, Nigel P.; Harrow, Jennifer; Ning, Zemin; Herrero, Javier; Searle, Steve M. J.; Enright, Anton; Geisler, Robert; Plasterk, Ronald H. A.; Lee, Charles; Westerfield, Monte; de Jong, Pieter J.; Zon, Leonard I.; Postlethwait, John H.; Nüsslein-Volhard, Christiane; Hubbard, Tim J. P.; Crollius, Hugues Roest; Rogers, Jane; Stemple, Derek L.
2013-01-01
Zebrafish have become a popular organism for the study of vertebrate gene function1,2. The virtually transparent embryos of this species, and the ability to accelerate genetic studies by gene knockdown or overexpression, have led to the widespread use of zebrafish in the detailed investigation of vertebrate gene function and increasingly, the study of human genetic disease3–5. However, for effective modelling of human genetic disease it is important to understand the extent to which zebrafish genes and gene structures are related to orthologous human genes. To examine this, we generated a high-quality sequence assembly of the zebrafish genome, made up of an overlapping set of completely sequenced large-insert clones that were ordered and oriented using a high-resolution high-density meiotic map. Detailed automatic and manual annotation provides evidence of more than 26,000 protein-coding genes6, the largest gene set of any vertebrate so far sequenced. Comparison to the human reference genome shows that approximately 70% of human genes have at least one obvious zebrafish orthologue. In addition, the high quality of this genome assembly provides a clearer understanding of key genomic features such as a unique repeat content, a scarcity of pseudogenes, an enrichment of zebrafish-specific genes on chromosome 4 and chromosomal regions that influence sex determination. PMID:23594743
Allen, Upton D; Hu, Pingzhao; Pereira, Sergio L; Robinson, Joan L; Paton, Tara A; Beyene, Joseph; Khodai-Booran, Nasser; Dipchand, Anne; Hébert, Diane; Ng, Vicky; Nalpathamkalam, Thomas; Read, Stanley
2016-02-01
This study examines EBV strains from transplant patients and patients with IM by sequencing major EBV genes. We also used NGS to detect EBV DNA within total genomic DNA, and to evaluate its genetic variation. Sanger sequencing of major EBV genes was used to compare SNVs from samples taken from transplant patients vs. patients with IM. We sequenced EBV DNA from a healthy EBV-seropositive individual on a HiSeq 2000 instrument. Data were mapped to the EBV reference genomes (AG876 and B95-8). The number of EBNA2 SNVs was higher than for EBNA1 and the other genes sequenced within comparable reference coordinates. For EBNA2, there was a median of 15 SNV among transplant samples compared with 10 among IM samples (p = 0.036). EBNA1 showed little variation between samples. For NGS, we identified 640 and 892 variants at an unadjusted p value of 5 × 10(-8) for AG876 and B95-8 genomes, respectively. We used complementary sequence strategies to examine EBV genetic diversity and its application to transplantation. The results provide the framework for further characterization of EBV strains and related outcomes after organ transplantation. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Guo, Yinshan; Shi, Guangli; Liu, Zhendong; Zhao, Yuhui; Yang, Xiaoxu; Zhu, Junchi; Li, Kun; Guo, Xiuwu
2015-01-01
In this study, 149 F1 plants from the interspecific cross between 'Red Globe' (Vitis vinifera L.) and 'Shuangyou' (Vitis amurensis Rupr.) and the parent were used to construct a molecular genetic linkage map by using the specific length amplified fragment sequencing technique. DNA sequencing generated 41.282 Gb data consisting of 206,411,693 paired-end reads. The average sequencing depths were 68.35 for 'Red Globe,' 63.65 for 'Shuangyou,' and 8.01 for each progeny. In all, 115,629 high-quality specific length amplified fragments were detected, of which 42,279 were polymorphic. The genetic map was constructed using 7,199 of these polymorphic markers. These polymorphic markers were assigned to 19 linkage groups; the total length of the map was 1929.13 cm, with an average distance of 0.28 cm between each maker. To our knowledge, the genetic maps constructed in this study contain the largest number of molecular markers. These high-density genetic maps might form the basis for the fine quantitative trait loci mapping and molecular-assisted breeding of grape.
Portis, E; Mauromicale, G; Mauro, R; Acquadro, A; Scaglione, D; Lanteri, S
2009-12-01
The genome organization of globe artichoke (Cynara cardunculus var. scolymus), unlike other species belonging to Asteraceae (=Compositae) family (i.e. sunflower, lettuce and chicory), remains largely unexplored. The species is highly heterozygous and suffers marked inbreeding depression when forced to self-fertilize. Thus a two-way pseudo-testcross represents the optimal strategy for linkage analysis. Here, we report linkage maps based on the progeny of a cross between globe artichoke (C. cardunculus var. scolymus) and cultivated cardoon (C. cardunculus var. altilis). The population was genotyped using a variety of PCR-based marker platforms, resulting in the identification of 708 testcross markers suitable for map construction. The male map consisted of 177 loci arranged in 17 major linkage groups, spanning 1,015.5 cM, while female map was built with 326 loci arranged into 20 major linkage groups, spanning 1,486.8 cM. The presence of 84 loci shared between these maps and those previously developed from a cross within globe artichoke allowed for map alignment and the definition of 17 homologous linkage groups, corresponding to the haploid number of the species. This will provide a favourable property for QTL scanning; furthermore, as 25 mapped markers (8%) correspond to coding regions, it has an additional value as functional map and might represent an important genetic tool for candidate gene studies in globe artichoke.
Gramene 2016: comparative plant genomics and pathway resources
Tello-Ruiz, Marcela K.; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M.; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A.; Huerta, Laura; Keays, Maria; Tang, Y. Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J.; Jaiswal, Pankaj; Ware, Doreen
2016-01-01
Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials. PMID:26553803
Nelson, Matthew R.; Bryc, Katarzyna; King, Karen S.; Indap, Amit; Boyko, Adam R.; Novembre, John; Briley, Linda P.; Maruyama, Yuka; Waterworth, Dawn M.; Waeber, Gérard; Vollenweider, Peter; Oksenberg, Jorge R.; Hauser, Stephen L.; Stirnadel, Heide A.; Kooner, Jaspal S.; Chambers, John C.; Jones, Brendan; Mooser, Vincent; Bustamante, Carlos D.; Roses, Allen D.; Burns, Daniel K.; Ehm, Margaret G.; Lai, Eric H.
2008-01-01
Technological and scientific advances, stemming in large part from the Human Genome and HapMap projects, have made large-scale, genome-wide investigations feasible and cost effective. These advances have the potential to dramatically impact drug discovery and development by identifying genetic factors that contribute to variation in disease risk as well as drug pharmacokinetics, treatment efficacy, and adverse drug reactions. In spite of the technological advancements, successful application in biomedical research would be limited without access to suitable sample collections. To facilitate exploratory genetics research, we have assembled a DNA resource from a large number of subjects participating in multiple studies throughout the world. This growing resource was initially genotyped with a commercially available genome-wide 500,000 single-nucleotide polymorphism panel. This project includes nearly 6,000 subjects of African-American, East Asian, South Asian, Mexican, and European origin. Seven informative axes of variation identified via principal-component analysis (PCA) of these data confirm the overall integrity of the data and highlight important features of the genetic structure of diverse populations. The potential value of such extensively genotyped collections is illustrated by selection of genetically matched population controls in a genome-wide analysis of abacavir-associated hypersensitivity reaction. We find that matching based on country of origin, identity-by-state distance, and multidimensional PCA do similarly well to control the type I error rate. The genotype and demographic data from this reference sample are freely available through the NCBI database of Genotypes and Phenotypes (dbGaP). PMID:18760391
Lindén, Rolf O; Eronen, Ville-Pekka; Aittokallio, Tero
2011-03-24
High-throughput genetic screening approaches have enabled systematic means to study how interactions among gene mutations contribute to quantitative fitness phenotypes, with the aim of providing insights into the functional wiring diagrams of genetic interaction networks on a global scale. However, it is poorly known how well these quantitative interaction measurements agree across the screening approaches, which hinders their integrated use toward improving the coverage and quality of the genetic interaction maps in yeast and other organisms. Using large-scale data matrices from epistatic miniarray profiling (E-MAP), genetic interaction mapping (GIM), and synthetic genetic array (SGA) approaches, we carried out here a systematic comparative evaluation among these quantitative maps of genetic interactions in yeast. The relatively low association between the original interaction measurements or their customized scores could be improved using a matrix-based modelling framework, which enables the use of single- and double-mutant fitness estimates and measurements, respectively, when scoring genetic interactions. Toward an integrative analysis, we show how the detections from the different screening approaches can be combined to suggest novel positive and negative interactions which are complementary to those obtained using any single screening approach alone. The matrix approximation procedure has been made available to support the design and analysis of the future screening studies. We have shown here that even if the correlation between the currently available quantitative genetic interaction maps in yeast is relatively low, their comparability can be improved by means of our computational matrix approximation procedure, which will enable integrative analysis and detection of a wider spectrum of genetic interactions using data from the complementary screening approaches.
The Qatar genome project: translation of whole-genome sequencing into clinical practice.
Zayed, Hatem
2016-10-01
Qatar Genome Project was launched in 2013 with the intent to sequence the genome of each Qatari citizen in an effort to protect Qataris from the high rate of indigenous genetic diseases by allowing the mapping of disease-causing variants/rare variants and establishing a Qatari reference genome. Indeed, this project is expected to have numerous global benefits because the elevated homogeneity of the Qatari population, that will make Qatar an excellent genetic laboratory that will generate a wealth of data that will allow us to make sense of the genotype-phenotype correlations of many diseases, especially the complex multifactorial diseases, and will pave the way for changing the traditional medical practice of looking first at the phenotype rather than the genotype. © 2016 John Wiley & Sons Ltd.
Architecture of the wood-wide web: Rhizopogon spp. genets link multiple Douglas-fir cohorts.
Beiler, Kevin J; Durall, Daniel M; Simard, Suzanne W; Maxwell, Sheri A; Kretzer, Annette M
2010-01-01
*The role of mycorrhizal networks in forest dynamics is poorly understood because of the elusiveness of their spatial structure. We mapped the belowground distribution of the fungi Rhizopogon vesiculosus and Rhizopogon vinicolor and interior Douglas-fir trees (Pseudotsuga menziesii var. glauca) to determine the architecture of a mycorrhizal network in a multi-aged old-growth forest. *Rhizopogon spp. mycorrhizas were collected within a 30 x 30 m plot. Trees and fungal genets were identified using multi-locus microsatellite DNA analysis. Tree genotypes from mycorrhizas were matched to reference trees aboveground. Two trees were considered linked if they shared the same fungal genet(s). *The two Rhizopogon species each formed 13-14 genets, each colonizing up to 19 trees in the plot. Rhizopogon vesiculosus genets were larger, occurred at greater depths, and linked more trees than genets of R. vinicolor. Multiple tree cohorts were linked, with young saplings established within the mycorrhizal network of Douglas-fir veterans. A strong positive relationship was found between tree size and connectivity, resulting in a scale-free network architecture with small-world properties. *This mycorrhizal network architecture suggests an efficient and robust network, where large trees play a foundational role in facilitating conspecific regeneration and stabilizing the ecosystem.
Feng, Xiu; Yu, Xiaomu; Fu, Beide; Wang, Xinhua; Liu, Haiyang; Pang, Meixia; Tong, Jingou
2018-04-02
A high-density genetic linkage map is essential for QTL fine mapping, comparative genome analysis, identification of candidate genes and marker-assisted selection for economic traits in aquaculture species. The Yangtze River common carp (Cyprinus carpio haematopterus) is one of the most important aquacultured strains in China. However, quite limited genetics and genomics resources have been developed for genetic improvement of economic traits in such strain. A high-resolution genetic linkage map was constructed by using 7820 2b-RAD (2b-restriction site-associated DNA) and 295 microsatellite markers in a F2 family of the Yangtze River common carp (C. c. haematopterus). The length of the map was 4586.56 cM with an average marker interval of 0.57 cM. Comparative genome mapping revealed that a high proportion (70%) of markers with disagreed chromosome location was observed between C. c. haematopterus and another common carp strain (subspecies) C. c. carpio. A clear 2:1 relationship was observed between C. c. haematopterus linkage groups (LGs) and zebrafish (Danio rerio) chromosomes. Based on the genetic map, 21 QTLs for growth-related traits were detected on 12 LGs, and contributed values of phenotypic variance explained (PVE) ranging from 16.3 to 38.6%, with LOD scores ranging from 4.02 to 11.13. A genome-wide significant QTL (LOD = 10.83) and three chromosome-wide significant QTLs (mean LOD = 4.84) for sex were mapped on LG50 and LG24, respectively. A 1.4 cM confidence interval of QTL for all growth-related traits showed conserved synteny with a 2.06 M segment on chromosome 14 of D. rerio. Five potential candidate genes were identified by blast search in this genomic region, including a well-studied multi-functional growth related gene, Apelin. We mapped a set of suggestive and significant QTLs for growth-related traits and sex based on a high-density genetic linkage map using SNP and microsatellite markers for Yangtze River common carp. Several candidate growth genes were also identified from the QTL regions by comparative mapping. This genetic map would provide a basis for genome assembly and comparative genomics studies, and those QTL-derived candidate genes and genetic markers are useful genomic resources for marker-assisted selection (MAS) of growth-related traits in the Yangtze River common carp.
A circular genetic map of Erwinia carotovora subsp. atroseptica 3-2
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nikolaichik, E.A.; Pesnyakevich, A.G.
1995-08-01
A circular genetic map of Erwinia carotovora subsp. atroseptica 3-2 was constructed on the basis of the R471a plasmid and Tn5 and Tn9 using Hfr-like donors. Forty-six genes, including phytopathogenicity genes, were located on the basis of interrupted mating experiment results and analysis of coinheritance of markers on a map of 183 min in length. The similarity and differences of chromosomal genetic maps of Erwinia genus bacteria are discussed. 23 refs., 2 figs., 4 tabs.
IntegratedMap: a Web interface for integrating genetic map data.
Yang, Hongyu; Wang, Hongyu; Gingle, Alan R
2005-05-01
IntegratedMap is a Web application and database schema for storing and interactively displaying genetic map data. Its Web interface includes a menu for direct chromosome/linkage group selection, a search form for selection based on mapped object location and linkage group displays. An overview display provides convenient access to the full range of mapped and anchored object types with genetic locus details, such as numbers, types and names of mapped/anchored objects displayed in a compact scrollable list box that automatically updates based on selected map location and object type. Also, multilinkage group and localized map views are available along with links that can be configured for integration with other Web resources. IntegratedMap is implemented in C#/ASP.NET and the package, including a MySQL schema creation script, is available from http://cggc.agtec.uga.edu/Data/download.asp
Zych, Konrad; Li, Yang; van der Velde, Joeri K; Joosen, Ronny V L; Ligterink, Wilco; Jansen, Ritsert C; Arends, Danny
2015-02-19
Genetic markers and maps are instrumental in quantitative trait locus (QTL) mapping in segregating populations. The resolution of QTL localization depends on the number of informative recombinations in the population and how well they are tagged by markers. Larger populations and denser marker maps are better for detecting and locating QTLs. Marker maps that are initially too sparse can be saturated or derived de novo from high-throughput omics data, (e.g. gene expression, protein or metabolite abundance). If these molecular phenotypes are affected by genetic variation due to a major QTL they will show a clear multimodal distribution. Using this information, phenotypes can be converted into genetic markers. The Pheno2Geno tool uses mixture modeling to select phenotypes and transform them into genetic markers suitable for construction and/or saturation of a genetic map. Pheno2Geno excludes candidate genetic markers that show evidence for multiple possibly epistatically interacting QTL and/or interaction with the environment, in order to provide a set of robust markers for follow-up QTL mapping. We demonstrate the use of Pheno2Geno on gene expression data of 370,000 probes in 148 A. thaliana recombinant inbred lines. Pheno2Geno is able to saturate the existing genetic map, decreasing the average distance between markers from 7.1 cM to 0.89 cM, close to the theoretical limit of 0.68 cM (with 148 individuals we expect a recombination every 100/148=0.68 cM); this pinpointed almost all of the informative recombinations in the population. The Pheno2Geno package makes use of genome-wide molecular profiling and provides a tool for high-throughput de novo map construction and saturation of existing genetic maps. Processing of the showcase dataset takes less than 30 minutes on an average desktop PC. Pheno2Geno improves QTL mapping results at no additional laboratory cost and with minimum computational effort. Its results are formatted for direct use in R/qtl, the leading R package for QTL studies. Pheno2Geno is freely available on CRAN under "GNU GPL v3". The Pheno2Geno package as well as the tutorial can also be found at: http://pheno2geno.nl .
Jared W. Westbrook; Vikram E. Chhatre; Le-Shin Wu; Srikar Chamala; Leandro Gomide Neves; Patricio Munoz; Pedro J. Martinez-Garcia; David B. Neale; Matias Kirst; Keithanne Mockaitis; C. Dana Nelson; Gary F. Peter; John M. Davis; Craig S. Echt
2015-01-01
A consensus genetic map for Pinus taeda (loblolly pine) and Pinus elliottii (slash pine) was constructed by merging three previously published P. taeda maps with a map from a pseudo-backcross between P. elliottii and P. taeda. The consensus map positioned 3856 markers via...
High-density genetic map construction and comparative genome analysis in asparagus bean.
Huang, Haitao; Tan, Huaqiang; Xu, Dongmei; Tang, Yi; Niu, Yisong; Lai, Yunsong; Tie, Manman; Li, Huanxiu
2018-03-19
Genetic maps are a prerequisite for quantitative trait locus (QTL) analysis, marker-assisted selection (MAS), fine gene mapping, and assembly of genome sequences. So far, several asparagus bean linkage maps have been established using various kinds of molecular markers. However, these maps were all constructed by gel- or array-based markers. No maps based on sequencing method have been reported. In this study, an NGS-based strategy, SLAF-seq, was applied to create a high-density genetic map for asparagus bean. Through SLAF library construction and Illumina sequencing of two parents and 100 F2 individuals, a total of 55,437 polymorphic SLAF markers were developed and mined for SNP markers. The map consisted of 5,225 SNP markers in 11 LGs, spanning a total distance of 1,850.81 cM, with an average distance between markers of 0.35 cM. Comparative genome analysis with four other legume species, soybean, common bean, mung bean and adzuki bean showed that asparagus bean is genetically more related to adzuki bean. The results will provide a foundation for future genomic research, such as QTL fine mapping, comparative mapping in pulses, and offer support for assembling asparagus bean genome sequence.
Leitwein, M; Gagnaire, P-A; Desmarais, E; Guendouz, S; Rohmer, M; Berrebi, P; Guinand, B
2016-12-01
A genome-wide assessment of diversity is provided for wild Mediterranean brown trout Salmo trutta populations from headwater tributaries of the Orb River and from Atlantic and Mediterranean hatchery-reared strains that have been used for stocking. Double-digest restriction-site-associated DNA sequencing (dd-RADseq) was performed and the efficiency of de novo and reference-mapping approaches to obtain individual genotypes was compared. Large numbers of single nucleotide polymorphism (SNP) markers with similar genome-wide distributions were discovered using both approaches (196 639 v. 121 016 SNPs, respectively), with c. 80% of the loci detected de novo being also found with reference mapping, using the Atlantic salmon Salmo salar genome as a reference. Lower mapping density but larger nucleotide diversity (π) was generally observed near extremities of linkage groups, consistent with regions of residual tetrasomic inheritance observed in salmonids. Genome-wide diversity estimates revealed reduced polymorphism in hatchery strains (π = 0·0040 and π = 0·0029 in Atlantic and Mediterranean strains, respectively) compared to wild populations (π = 0·0049), a pattern that was congruent with allelic richness estimated from microsatellite markers. Finally, pronounced heterozygote deficiency was found in hatchery strains (Atlantic F IS = 0·18; Mediterranean F IS = 0·42), indicating that stocking practices may affect the genetic diversity in wild populations. These new genomic resources will provide important tools to define better conservation strategies in S. trutta. © 2016 The Fisheries Society of the British Isles.
Mapping X-Disease Phytoplasma Resistance in Prunus virginiana.
Lenz, Ryan R; Dai, Wenhao
2017-01-01
Phytoplasmas such as " Candidatus Phytoplasma pruni," the causal agent of X-disease of stone fruits, lack detailed biological analysis. This has limited the understanding of plant resistance mechanisms. Chokecherry ( Prunus virginiana L.) is a promising model to be used for the plant-phytoplasma interaction due to its documented ability to resist X-disease infection. A consensus chokecherry genetic map "Cho" was developed with JoinMap 4.0 by joining two parental maps. The new map contains a complete set of 16 linkage groups, spanning a genetic distance of 2,172 cM with an average marker density of 3.97 cM. Three significant quantitative trait loci (QTL) associated with X-disease resistance were identified contributing to a total of 45.9% of the phenotypic variation. This updated genetic linkage map and the identified QTL will provide the framework needed to facilitate molecular genetics, genomics, breeding, and biotechnology research concerning X-disease in chokecherry and other Prunus species.
Taranto, F; D'Agostino, N; Greco, B; Cardi, T; Tripodi, P
2016-11-21
Knowledge on population structure and genetic diversity in vegetable crops is essential for association mapping studies and genomic selection. Genotyping by sequencing (GBS) represents an innovative method for large scale SNP detection and genotyping of genetic resources. Herein we used the GBS approach for the genome-wide identification of SNPs in a collection of Capsicum spp. accessions and for the assessment of the level of genetic diversity in a subset of 222 cultivated pepper (Capsicum annum) genotypes. GBS analysis generated a total of 7,568,894 master tags, of which 43.4% uniquely aligned to the reference genome CM334. A total of 108,591 SNP markers were identified, of which 105,184 were in C. annuum accessions. In order to explore the genetic diversity of C. annuum and to select a minimal core set representing most of the total genetic variation with minimum redundancy, a subset of 222 C. annuum accessions were analysed using 32,950 high quality SNPs. Based on Bayesian and Hierarchical clustering it was possible to divide the collection into three clusters. Cluster I had the majority of varieties and landraces mainly from Southern and Northern Italy, and from Eastern Europe, whereas clusters II and III comprised accessions of different geographical origins. Considering the genome-wide genetic variation among the accessions included in cluster I, a second round of Bayesian (K = 3) and Hierarchical (K = 2) clustering was performed. These analysis showed that genotypes were grouped not only based on geographical origin, but also on fruit-related features. GBS data has proven useful to assess the genetic diversity in a collection of C. annuum accessions. The high number of SNP markers, uniformly distributed on the 12 chromosomes, allowed the accessions to be distinguished according to geographical origin and fruit-related features. SNP markers and information on population structure developed in this study will undoubtedly support genome-wide association mapping studies and marker-assisted selection programs.
Frahm, Jan-Michael; Pollefeys, Marc Andre Leon; Gallup, David Robert
2015-12-08
Methods of generating a three dimensional representation of an object in a reference plane from a depth map including distances from a reference point to pixels in an image of the object taken from a reference point. Weights are assigned to respective voxels in a three dimensional grid along rays extending from the reference point through the pixels in the image based on the distances in the depth map from the reference point to the respective pixels, and a height map including an array of height values in the reference plane is formed based on the assigned weights. An n-layer height map may be constructed by generating a probabilistic occupancy grid for the voxels and forming an n-dimensional height map comprising an array of layer height values in the reference plane based on the probabilistic occupancy grid.
Westbrook, Jared W.; Chhatre, Vikram E.; Wu, Le-Shin; Chamala, Srikar; Neves, Leandro Gomide; Muñoz, Patricio; Martínez-García, Pedro J.; Neale, David B.; Kirst, Matias; Mockaitis, Keithanne; Nelson, C. Dana; Peter, Gary F.; Echt, Craig S.
2015-01-01
A consensus genetic map for Pinus taeda (loblolly pine) and Pinus elliottii (slash pine) was constructed by merging three previously published P. taeda maps with a map from a pseudo-backcross between P. elliottii and P. taeda. The consensus map positioned 3856 markers via genotyping of 1251 individuals from four pedigrees. It is the densest linkage map for a conifer to date. Average marker spacing was 0.6 cM and total map length was 2305 cM. Functional predictions of mapped genes were improved by aligning expressed sequence tags used for marker discovery to full-length P. taeda transcripts. Alignments to the P. taeda genome mapped 3305 scaffold sequences onto 12 linkage groups. The consensus genetic map was used to compare the genome-wide linkage disequilibrium in a population of distantly related P. taeda individuals (ADEPT2) used for association genetic studies and a multiple-family pedigree used for genomic selection (CCLONES). The prevalence and extent of LD was greater in CCLONES as compared to ADEPT2; however, extended LD with LGs or between LGs was rare in both populations. The average squared correlations, r2, between SNP alleles less than 1 cM apart were less than 0.05 in both populations and r2 did not decay substantially with genetic distance. The consensus map and analysis of linkage disequilibrium establish a foundation for comparative association mapping and genomic selection in P. taeda and P. elliottii. PMID:26068575
Guo, Yinshan; Shi, Guangli; Liu, Zhendong; Zhao, Yuhui; Yang, Xiaoxu; Zhu, Junchi; Li, Kun; Guo, Xiuwu
2015-01-01
In this study, 149 F1 plants from the interspecific cross between ‘Red Globe’ (Vitis vinifera L.) and ‘Shuangyou’ (Vitis amurensis Rupr.) and the parent were used to construct a molecular genetic linkage map by using the specific length amplified fragment sequencing technique. DNA sequencing generated 41.282 Gb data consisting of 206,411,693 paired-end reads. The average sequencing depths were 68.35 for ‘Red Globe,’ 63.65 for ‘Shuangyou,’ and 8.01 for each progeny. In all, 115,629 high-quality specific length amplified fragments were detected, of which 42,279 were polymorphic. The genetic map was constructed using 7,199 of these polymorphic markers. These polymorphic markers were assigned to 19 linkage groups; the total length of the map was 1929.13 cm, with an average distance of 0.28 cm between each maker. To our knowledge, the genetic maps constructed in this study contain the largest number of molecular markers. These high-density genetic maps might form the basis for the fine quantitative trait loci mapping and molecular-assisted breeding of grape. PMID:26089826
Genetic linkage maps are valuable tools in evolutionary biology; however, their availability for wild populations is extremely limited. Fundulus heteroclitus (Atlantic killifish) is a non-migratory estuarine fish that exhibits high allelic and phenotypic diversity partitioned among subpopulations that reside in disparate environmental conditions. An ideal candidate model organism for studying gene-environment interactions, the molecular toolbox for F. heteroclitus is limited. We identified hundreds of novel microsatellites which, when combined with existing microsatellites and single nucleotide polymorphisms (SNPs), were used to construct the first genetic linkage map for this species. By integrating independent linkage maps from three genetic crosses, we developed a consensus map containing 24 linkage groups, consistent with the number of chromosomes reported for this species. These linkage groups span 2300 centimorgans (cM) of recombinant genomic space, intermediate in size relative to the current linkage maps for the teleosts, medaka and zebrafish. Comparisons between fish genomes support a high degree of synteny between the consensus F. heteroclitus linkage map and the medaka and (to a lesser extent) zebrafish physical genome assemblies.This dataset is associated with the following publication:Waits , E., J. Martinson , B. Rinner, S. Morris, D. Proestou, D. Champlin , and D. Nacci. Genetic linkage map and comparative genome analysis for the estuarine Atlanti
Marcus, Jeffrey M; Hughes, Tia M
2009-06-01
Structured inquiry approaches, in which students receive a Drosophila strain of unknown genotype to analyze and map the constituent mutations, are a common feature of many genetics teaching laboratories. The required crosses frustrate many students because they are aware that they are participating in a fundamentally trivial exercise, as the map locations of the genes are already established and have been recalculated thousands of times by generations of students. We modified the traditional structured inquiry approach to include a novel research experience for the students in our undergraduate genetics laboratories. Students conducted crosses with Drosophila strains carrying P[lacW] transposon insertions in genes without documented recombination map positions, representing a large number of unique, but equivalent genetic unknowns. Using the eye color phenotypes associated with the inserts as visible markers, it is straightforward to calculate recombination map positions for the interrupted loci. Collectively, our students mapped 95 genetic loci on chromosomes 2 and 3. In most cases, the calculated 95% confidence interval for meiotic map location overlapped with the predicted map position based on cytology. The research experience evoked positive student responses and helped students better understand the nature of scientific research for little additional cost or instructor effort.
Identifying gene networks underlying the neurobiology of ethanol and alcoholism.
Wolen, Aaron R; Miles, Michael F
2012-01-01
For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.
Shao, Changwei; Niu, Yongchao; Rastas, Pasi; Liu, Yang; Xie, Zhiyuan; Li, Hengde; Wang, Lei; Jiang, Yong; Tai, Shuaishuai; Tian, Yongsheng; Sakamoto, Takashi; Chen, Songlin
2015-01-01
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1–8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species. PMID:25762582
Bushakra, Jill M; Bryant, Douglas W; Dossett, Michael; Vining, Kelly J; VanBuren, Robert; Gilmore, Barbara S; Lee, Jungmin; Mockler, Todd C; Finn, Chad E; Bassil, Nahla V
2015-08-01
We have constructed a densely populated, saturated genetic linkage map of black raspberry and successfully placed a locus for aphid resistance. Black raspberry (Rubus occidentalis L.) is a high-value crop in the Pacific Northwest of North America with an international marketplace. Few genetic resources are readily available and little improvement has been achieved through breeding efforts to address production challenges involved in growing this crop. Contributing to its lack of improvement is low genetic diversity in elite cultivars and an untapped reservoir of genetic diversity from wild germplasm. In the Pacific Northwest, where most production is centered, the current standard commercial cultivar is highly susceptible to the aphid Amphorophora agathonica Hottes, which is a vector for the Raspberry mosaic virus complex. Infection with the virus complex leads to a rapid decline in plant health resulting in field replacement after only 3-4 growing seasons. Sources of aphid resistance have been identified in wild germplasm and are used to develop mapping populations to study the inheritance of these valuable traits. We have constructed a genetic linkage map using single-nucleotide polymorphism and transferable (primarily simple sequence repeat) markers for F1 population ORUS 4305 consisting of 115 progeny that segregate for aphid resistance. Our linkage map of seven linkage groups representing the seven haploid chromosomes of black raspberry consists of 274 markers on the maternal map and 292 markers on the paternal map including a morphological locus for aphid resistance. This is the first linkage map of black raspberry and will aid in developing markers for marker-assisted breeding, comparative mapping with other Rubus species, and enhancing the black raspberry genome assembly.
Background controlled QTL mapping in pure-line genetic populations derived from four-way crosses
Zhang, S; Meng, L; Wang, J; Zhang, L
2017-01-01
Pure lines derived from multiple parents are becoming more important because of the increased genetic diversity, the possibility to conduct replicated phenotyping trials in multiple environments and potentially high mapping resolution of quantitative trait loci (QTL). In this study, we proposed a new mapping method for QTL detection in pure-line populations derived from four-way crosses, which is able to control the background genetic variation through a two-stage mapping strategy. First, orthogonal variables were created for each marker and used in an inclusive linear model, so as to completely absorb the genetic variation in the mapping population. Second, inclusive composite interval mapping approach was implemented for one-dimensional scanning, during which the inclusive linear model was employed to control the background variation. Simulation studies using different genetic models demonstrated that the new method is efficient when considering high detection power, low false discovery rate and high accuracy in estimating quantitative trait loci locations and effects. For illustration, the proposed method was applied in a reported wheat four-way recombinant inbred line population. PMID:28722705
Background controlled QTL mapping in pure-line genetic populations derived from four-way crosses.
Zhang, S; Meng, L; Wang, J; Zhang, L
2017-10-01
Pure lines derived from multiple parents are becoming more important because of the increased genetic diversity, the possibility to conduct replicated phenotyping trials in multiple environments and potentially high mapping resolution of quantitative trait loci (QTL). In this study, we proposed a new mapping method for QTL detection in pure-line populations derived from four-way crosses, which is able to control the background genetic variation through a two-stage mapping strategy. First, orthogonal variables were created for each marker and used in an inclusive linear model, so as to completely absorb the genetic variation in the mapping population. Second, inclusive composite interval mapping approach was implemented for one-dimensional scanning, during which the inclusive linear model was employed to control the background variation. Simulation studies using different genetic models demonstrated that the new method is efficient when considering high detection power, low false discovery rate and high accuracy in estimating quantitative trait loci locations and effects. For illustration, the proposed method was applied in a reported wheat four-way recombinant inbred line population.
Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire
2012-01-01
Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
Genetic Architectures of Quantitative Variation in RNA Editing Pathways
Gu, Tongjun; Gatti, Daniel M.; Srivastava, Anuj; Snyder, Elizabeth M.; Raghupathy, Narayanan; Simecek, Petr; Svenson, Karen L.; Dotu, Ivan; Chuang, Jeffrey H.; Keller, Mark P.; Attie, Alan D.; Braun, Robert E.; Churchill, Gary A.
2016-01-01
RNA editing refers to post-transcriptional processes that alter the base sequence of RNA. Recently, hundreds of new RNA editing targets have been reported. However, the mechanisms that determine the specificity and degree of editing are not well understood. We examined quantitative variation of site-specific editing in a genetically diverse multiparent population, Diversity Outbred mice, and mapped polymorphic loci that alter editing ratios globally for C-to-U editing and at specific sites for A-to-I editing. An allelic series in the C-to-U editing enzyme Apobec1 influences the editing efficiency of Apob and 58 additional C-to-U editing targets. We identified 49 A-to-I editing sites with polymorphisms in the edited transcript that alter editing efficiency. In contrast to the shared genetic control of C-to-U editing, most of the variable A-to-I editing sites were determined by local nucleotide polymorphisms in proximity to the editing site in the RNA secondary structure. Our results indicate that RNA editing is a quantitative trait subject to genetic variation and that evolutionary constraints have given rise to distinct genetic architectures in the two canonical types of RNA editing. PMID:26614740
A second-generation anchored genetic linkage map of the tammar wallaby (Macropus eugenii)
2011-01-01
Background The tammar wallaby, Macropus eugenii, a small kangaroo used for decades for studies of reproduction and metabolism, is the model Australian marsupial for genome sequencing and genetic investigations. The production of a more comprehensive cytogenetically-anchored genetic linkage map will significantly contribute to the deciphering of the tammar wallaby genome. It has great value as a resource to identify novel genes and for comparative studies, and is vital for the ongoing genome sequence assembly and gene ordering in this species. Results A second-generation anchored tammar wallaby genetic linkage map has been constructed based on a total of 148 loci. The linkage map contains the original 64 loci included in the first-generation map, plus an additional 84 microsatellite loci that were chosen specifically to increase coverage and assist with the anchoring and orientation of linkage groups to chromosomes. These additional loci were derived from (a) sequenced BAC clones that had been previously mapped to tammar wallaby chromosomes by fluorescence in situ hybridization (FISH), (b) End sequence from BACs subsequently FISH-mapped to tammar wallaby chromosomes, and (c) tammar wallaby genes orthologous to opossum genes predicted to fill gaps in the tammar wallaby linkage map as well as three X-linked markers from a published study. Based on these 148 loci, eight linkage groups were formed. These linkage groups were assigned (via FISH-mapped markers) to all seven autosomes and the X chromosome. The sex-pooled map size is 1402.4 cM, which is estimated to provide 82.6% total coverage of the genome, with an average interval distance of 10.9 cM between adjacent markers. The overall ratio of female/male map length is 0.84, which is comparable to the ratio of 0.78 obtained for the first-generation map. Conclusions Construction of this second-generation genetic linkage map is a significant step towards complete coverage of the tammar wallaby genome and considerably extends that of the first-generation map. It will be a valuable resource for ongoing tammar wallaby genetic research and assembling the genome sequence. The sex-pooled map is available online at http://compldb.angis.org.au/. PMID:21854616
A second-generation anchored genetic linkage map of the tammar wallaby (Macropus eugenii).
Wang, Chenwei; Webley, Lee; Wei, Ke-jun; Wakefield, Matthew J; Patel, Hardip R; Deakin, Janine E; Alsop, Amber; Marshall Graves, Jennifer A; Cooper, Desmond W; Nicholas, Frank W; Zenger, Kyall R
2011-08-19
The tammar wallaby, Macropus eugenii, a small kangaroo used for decades for studies of reproduction and metabolism, is the model Australian marsupial for genome sequencing and genetic investigations. The production of a more comprehensive cytogenetically-anchored genetic linkage map will significantly contribute to the deciphering of the tammar wallaby genome. It has great value as a resource to identify novel genes and for comparative studies, and is vital for the ongoing genome sequence assembly and gene ordering in this species. A second-generation anchored tammar wallaby genetic linkage map has been constructed based on a total of 148 loci. The linkage map contains the original 64 loci included in the first-generation map, plus an additional 84 microsatellite loci that were chosen specifically to increase coverage and assist with the anchoring and orientation of linkage groups to chromosomes. These additional loci were derived from (a) sequenced BAC clones that had been previously mapped to tammar wallaby chromosomes by fluorescence in situ hybridization (FISH), (b) End sequence from BACs subsequently FISH-mapped to tammar wallaby chromosomes, and (c) tammar wallaby genes orthologous to opossum genes predicted to fill gaps in the tammar wallaby linkage map as well as three X-linked markers from a published study. Based on these 148 loci, eight linkage groups were formed. These linkage groups were assigned (via FISH-mapped markers) to all seven autosomes and the X chromosome. The sex-pooled map size is 1402.4 cM, which is estimated to provide 82.6% total coverage of the genome, with an average interval distance of 10.9 cM between adjacent markers. The overall ratio of female/male map length is 0.84, which is comparable to the ratio of 0.78 obtained for the first-generation map. Construction of this second-generation genetic linkage map is a significant step towards complete coverage of the tammar wallaby genome and considerably extends that of the first-generation map. It will be a valuable resource for ongoing tammar wallaby genetic research and assembling the genome sequence. The sex-pooled map is available online at http://compldb.angis.org.au/.
Cross-referencing yeast genetics and mammalian genomes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hieter, P.; Basset, D.; Boguski, M.
1994-09-01
We have initiated a project that will systematically transfer information about yeast genes onto the genetic maps of mice and human beings. Rapidly expanding human EST data will serve as a source of candidate human homologs that will be repeatedly searched using yeast protein sequence queries. Search results will be automatically reported to participating labs. Human cDNA sequences from which the ESTs are derived will be mapped at high resolution in the human and mouse genomes. The comparative mapping information cross-references the genomic position of novel human cDNAs with functional information known about the cognate yeast genes. This should facilitatemore » the initial identification of genes responsible for mammalian mutant phenotypes, including human disease. In addition, the identification of mammalian homologs of yeast genes provides reagents for determining evolutionary conservation and for performing direct experiments in multicellular eukaryotes to enhance study of the yeast protein`s function. For example, ESTs homologous to CDC27 and CDC16 were identified, and the corresponding cDNA clones were obtained from ATTC, completely sequenced, and mapped on human and mouse chromosomes. In addition, the CDC17hs cDNA has been used to raise antisera to the CDC27Hs protein and used in subcellular localization experiments and junctional studies in mammalian cells. We have received funding from the National Center for Human Genome Research to provide a community resource which will establish comprehensive cross-referencing among yeast, human, and mouse loci. The project is set up as a service and information on how to communicate with this effort will be provided.« less
An integrated molecular cytogenetic map of Cucumis sativus L. chromosome 2.
Han, Yonghua; Zhang, Zhonghua; Huang, Sanwen; Jin, Weiwei
2011-01-27
Integration of molecular, genetic and cytological maps is still a challenge for most plant species. Recent progress in molecular and cytogenetic studies created a basis for developing integrated maps in cucumber (Cucumis sativus L.). In this study, eleven fosmid clones and three plasmids containing 45S rDNA, the centromeric satellite repeat Type III and the pericentriomeric repeat CsRP1 sequences respectively were hybridized to cucumber metaphase chromosomes to assign their cytological location on chromosome 2. Moreover, an integrated molecular cytogenetic map of cucumber chromosomes 2 was constructed by fluorescence in situ hybridization (FISH) mapping of 11 fosmid clones together with the cucumber centromere-specific Type III sequence on meiotic pachytene chromosomes. The cytogenetic map was fully integrated with genetic linkage map since each fosmid clone was anchored by a genetically mapped simple sequence repeat marker (SSR). The relationship between the genetic and physical distances along chromosome was analyzed. Recombination was not evenly distributed along the physical length of chromosome 2. Suppression of recombination was found in centromeric and pericentromeric regions. Our results also indicated that the molecular markers composing the linkage map for chromosome 2 provided excellent coverage of the chromosome.
Fu, Beide; Liu, Haiyang; Yu, Xiaomu; Tong, Jingou
2016-01-01
Growth related traits in fish are controlled by quantitative trait loci (QTL), but no QTL for growth have been detected in bighead carp (Hypophthalmichthys nobilis) due to the lack of high-density genetic map. In this study, an ultra-high density genetic map was constructed with 3,121 SNP markers by sequencing 117 individuals in a F1 family using 2b-RAD technology. The total length of the map was 2341.27 cM, with an average marker interval of 0.75 cM. A high level of genomic synteny between our map and zebrafish was detected. Based on this genetic map, one genome-wide significant and 37 suggestive QTL for five growth-related traits were identified in 6 linkage groups (i.e. LG3, LG11, LG15, LG18, LG19, LG22). The phenotypic variance explained (PVE) by these QTL varied from 15.4% to 38.2%. Marker within the significant QTL region was surrounded by CRP1 and CRP2, which played an important role in muscle cell division. These high-density map and QTL information provided a solid base for QTL fine mapping and comparative genomics in bighead carp. PMID:27345016
Gu, Yu; Zhao, Qian-Cheng; Sun, De-Ling; Song, Wen-Qin
2007-06-01
Nucleotide binding site (NBS) profiling, a new method was used to map resistance gene analogues (RGAs) in cauliflower (Brassica oleracea var. botrytis). This method allows amplification and the mapping of genetic markers anchored in the conserved NBS encoding domain of plant disease resistance genes. AFLP was also performed to construct the cauliflower intervarietal genetic map. The aim of constructing genetic map was to identify potential molecular markers linked to important agronomic traits that would be particularly useful for development and improving the species. Using 17 AFLP primer combinations and two degeneration primer/enzyme combinations, a total of 234 AFLP markers and 21 NBS markers were mapped in the F2 population derived from self-pollinating a single F1 plant of the cross AD White Flower x C-8. The markers were mapped in 9 of major linkage groups spanning 668.4 cM, with an average distance of 2.9 cM between adjacent mapped markers. The AFLP markers were well distributed throughout the linkage groups. The linkage groups contained from 12 to 47 loci each and the distance between two consecutive loci ranged from 0 to 14.9 cM. NBS markers were mapped on 8 of the 9 linkage groups of the genetic map. Most of these markers were organized in clusters. This result demonstrates the feasibility of the NBS-profiling method for generating NBS markers for resistance loci in cauliflower. The clustering of the markers mapped in this study adds to the evidence that most of them could be real RGAs.
Sineokiĭ, S P; Pogosov, V Z; Iankovskiĭ, N K; Krylov, V N
1976-01-01
123 Amber mutants of lambdoid bacteriophage phi81 are isolated and distributed into 19 complementation groups. Deletion mapping made possible to locate 5 gene groups on the genetic map of bacteriophage phi81 and to determine a region of possible location of mm' sticky ends on the prophage genetic map. A gene of phage phi81 is localized, which controls the adsorption specificity, and which functional similarity to a respective gene of phage phi80 is demonstrated.
van Rheenen, Wouter; Shatunov, Aleksey; Dekker, Annelot M; McLaughlin, Russell L; Diekstra, Frank P; Pulit, Sara L; van der Spek, Rick A A; Võsa, Urmo; de Jong, Simone; Robinson, Matthew R; Yang, Jian; Fogh, Isabella; van Doormaal, Perry TC; Tazelaar, Gijs H P; Koppers, Max; Blokhuis, Anna M; Sproviero, William; Jones, Ashley R; Kenna, Kevin P; van Eijk, Kristel R; Harschnitz, Oliver; Schellevis, Raymond D; Brands, William J; Medic, Jelena; Menelaou, Androniki; Vajda, Alice; Ticozzi, Nicola; Lin, Kuang; Rogelj, Boris; Vrabec, Katarina; Ravnik-Glavač, Metka; Koritnik, Blaž; Zidar, Janez; Leonardis, Lea; Grošelj, Leja Dolenc; Millecamps, Stéphanie; Salachas, François; Meininger, Vincent; de Carvalho, Mamede; Pinto, Susana; Mora, Jesus S; Rojas-García, Ricardo; Polak, Meraida; Chandran, Siddharthan; Colville, Shuna; Swingler, Robert; Morrison, Karen E; Shaw, Pamela J; Hardy, John; Orrell, Richard W; Pittman, Alan; Sidle, Katie; Fratta, Pietro; Malaspina, Andrea; Topp, Simon; Petri, Susanne; Abdulla, Susanne; Drepper, Carsten; Sendtner, Michael; Meyer, Thomas; Ophoff, Roel A; Staats, Kim A; Wiedau-Pazos, Martina; Lomen-Hoerth, Catherine; Van Deerlin, Vivianna M; Trojanowski, John Q; Elman, Lauren; McCluskey, Leo; Basak, A Nazli; Tunca, Ceren; Hamzeiy, Hamid; Parman, Yesim; Meitinger, Thomas; Lichtner, Peter; Radivojkov-Blagojevic, Milena; Andres, Christian R; Maurel, Cindy; Bensimon, Gilbert; Landwehrmeyer, Bernhard; Brice, Alexis; Payan, Christine A M; Saker-Delye, Safaa; Dürr, Alexandra; Wood, Nicholas W; Tittmann, Lukas; Lieb, Wolfgang; Franke, Andre; Rietschel, Marcella; Cichon, Sven; Nöthen, Markus M; Amouyel, Philippe; Tzourio, Christophe; Dartigues, Jean-François; Uitterlinden, Andre G; Rivadeneira, Fernando; Estrada, Karol; Hofman, Albert; Curtis, Charles; Blauw, Hylke M; van der Kooi, Anneke J; de Visser, Marianne; Goris, An; Weber, Markus; Shaw, Christopher E; Smith, Bradley N; Pansarasa, Orietta; Cereda, Cristina; Bo, Roberto Del; Comi, Giacomo P; D’Alfonso, Sandra; Bertolin, Cinzia; Sorarù, Gianni; Mazzini, Letizia; Pensato, Viviana; Gellera, Cinzia; Tiloca, Cinzia; Ratti, Antonia; Calvo, Andrea; Moglia, Cristina; Brunetti, Maura; Arcuti, Simona; Capozzo, Rosa; Zecca, Chiara; Lunetta, Christian; Penco, Silvana; Riva, Nilo; Padovani, Alessandro; Filosto, Massimiliano; Muller, Bernard; Stuit, Robbert Jan; Blair, Ian; Zhang, Katharine; McCann, Emily P; Fifita, Jennifer A; Nicholson, Garth A; Rowe, Dominic B; Pamphlett, Roger; Kiernan, Matthew C; Grosskreutz, Julian; Witte, Otto W; Ringer, Thomas; Prell, Tino; Stubendorff, Beatrice; Kurth, Ingo; Hübner, Christian A; Leigh, P Nigel; Casale, Federico; Chio, Adriano; Beghi, Ettore; Pupillo, Elisabetta; Tortelli, Rosanna; Logroscino, Giancarlo; Powell, John; Ludolph, Albert C; Weishaupt, Jochen H; Robberecht, Wim; Van Damme, Philip; Franke, Lude; Pers, Tune H; Brown, Robert H; Glass, Jonathan D; Landers, John E; Hardiman, Orla; Andersen, Peter M; Corcia, Philippe; Vourc’h, Patrick; Silani, Vincenzo; Wray, Naomi R; Visscher, Peter M; de Bakker, Paul I W; van Es, Michael A; Pasterkamp, R Jeroen; Lewis, Cathryn M; Breen, Gerome; Al-Chalabi, Ammar; van den Berg, Leonard H; Veldink, Jan H
2017-01-01
To elucidate the genetic architecture of amyotrophic lateral sclerosis (ALS) and find associated loci, we assembled a custom imputation reference panel from whole-genome-sequenced patients with ALS and matched controls (n = 1,861). Through imputation and mixed-model association analysis in 12,577 cases and 23,475 controls, combined with 2,579 cases and 2,767 controls in an independent replication cohort, we fine-mapped a new risk locus on chromosome 21 and identified C21orf2 as a gene associated with ALS risk. In addition, we identified MOBP and SCFD1 as new associated risk loci. We established evidence of ALS being a complex genetic trait with a polygenic architecture. Furthermore, we estimated the SNP-based heritability at 8.5%, with a distinct and important role for low-frequency variants (frequency 1–10%). This study motivates the interrogation of larger samples with full genome coverage to identify rare causal variants that underpin ALS risk. PMID:27455348
Global Mapping of the Yeast Genetic Interaction Network
NASA Astrophysics Data System (ADS)
Tong, Amy Hin Yan; Lesage, Guillaume; Bader, Gary D.; Ding, Huiming; Xu, Hong; Xin, Xiaofeng; Young, James; Berriz, Gabriel F.; Brost, Renee L.; Chang, Michael; Chen, YiQun; Cheng, Xin; Chua, Gordon; Friesen, Helena; Goldberg, Debra S.; Haynes, Jennifer; Humphries, Christine; He, Grace; Hussein, Shamiza; Ke, Lizhu; Krogan, Nevan; Li, Zhijian; Levinson, Joshua N.; Lu, Hong; Ménard, Patrice; Munyana, Christella; Parsons, Ainslie B.; Ryan, Owen; Tonikian, Raffi; Roberts, Tania; Sdicu, Anne-Marie; Shapiro, Jesse; Sheikh, Bilal; Suter, Bernhard; Wong, Sharyl L.; Zhang, Lan V.; Zhu, Hongwei; Burd, Christopher G.; Munro, Sean; Sander, Chris; Rine, Jasper; Greenblatt, Jack; Peter, Matthias; Bretscher, Anthony; Bell, Graham; Roth, Frederick P.; Brown, Grant W.; Andrews, Brenda; Bussey, Howard; Boone, Charles
2004-02-01
A genetic interaction network containing ~1000 genes and ~4000 interactions was mapped by crossing mutations in 132 different query genes into a set of ~4700 viable gene yeast deletion mutants and scoring the double mutant progeny for fitness defects. Network connectivity was predictive of function because interactions often occurred among functionally related genes, and similar patterns of interactions tended to identify components of the same pathway. The genetic network exhibited dense local neighborhoods; therefore, the position of a gene on a partially mapped network is predictive of other genetic interactions. Because digenic interactions are common in yeast, similar networks may underlie the complex genetics associated with inherited phenotypes in other organisms.
Genetic ancestry is associated with colorectal adenomas and adenocarcinomas in Latino populations.
Hernandez-Suarez, Gustavo; Sanabria, Maria Carolina; Serrano, Marta; Herran, Oscar F; Perez, Jesus; Plata, Jose L; Zabaleta, Jovanny; Tenesa, Albert
2014-10-01
Colorectal cancer rates in Latin American countries are less than half of those observed in the United States. Latin Americans are the resultant of generations of an admixture of Native American, European, and African individuals. The potential role of genetic admixture in colorectal carcinogenesis has not been examined. We evaluate the association of genetic ancestry with colorectal neoplasms in 190 adenocarcinomas, 113 sporadic adenomas and 243 age- and sex-matched controls enrolled in a multicentric case-control study in Colombia. Individual ancestral genetic fractions were estimated using the STRUCTURE software, based on allele frequencies and assuming three distinct population origins. We used the Illumina Cancer Panel to genotype 1,421 sparse single-nucleotide polymorphisms (SNPs), and Northern and Western European ancestry, LWJ and Han Chinese in Beijing, China populations from the HapMap project as references. A total of 678 autosomal SNPs overlapped with the HapMap data set SNPs and were used for ancestry estimations. African mean ancestry fraction was higher in adenomas (0.13, 95% confidence interval (95% CI)=0.11-0.15) and cancer cases (0.14, 95% CI=0.12-0.16) compared with controls (0.11, 95% CI=0.10-0.12). Conditional logistic regression analysis, controlling for known risk factors, showed a positive association of African ancestry per 10% increase with both colorectal adenoma (odds ratio (OR)=1.12, 95% CI=0.97-1.30) and adenocarcinoma (OR=1.19, 95% CI=1.05-1.35). In conclusion, increased African ancestry (or variants linked to it) contributes to the increased susceptibility of colorectal cancer in admixed Latin American population.
Edwards, Stefan M.; Sørensen, Izel F.; Sarup, Pernille; Mackay, Trudy F. C.; Sørensen, Peter
2016-01-01
Predicting individual quantitative trait phenotypes from high-resolution genomic polymorphism data is important for personalized medicine in humans, plant and animal breeding, and adaptive evolution. However, this is difficult for populations of unrelated individuals when the number of causal variants is low relative to the total number of polymorphisms and causal variants individually have small effects on the traits. We hypothesized that mapping molecular polymorphisms to genomic features such as genes and their gene ontology categories could increase the accuracy of genomic prediction models. We developed a genomic feature best linear unbiased prediction (GFBLUP) model that implements this strategy and applied it to three quantitative traits (startle response, starvation resistance, and chill coma recovery) in the unrelated, sequenced inbred lines of the Drosophila melanogaster Genetic Reference Panel. Our results indicate that subsetting markers based on genomic features increases the predictive ability relative to the standard genomic best linear unbiased prediction (GBLUP) model. Both models use all markers, but GFBLUP allows differential weighting of the individual genetic marker relationships, whereas GBLUP weighs the genetic marker relationships equally. Simulation studies show that it is possible to further increase the accuracy of genomic prediction for complex traits using this model, provided the genomic features are enriched for causal variants. Our GFBLUP model using prior information on genomic features enriched for causal variants can increase the accuracy of genomic predictions in populations of unrelated individuals and provides a formal statistical framework for leveraging and evaluating information across multiple experimental studies to provide novel insights into the genetic architecture of complex traits. PMID:27235308
Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai
2014-01-01
The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047
Evaluation and application of summary statistic imputation to discover new height-associated loci.
Rüeger, Sina; McDaid, Aaron; Kutalik, Zoltán
2018-05-01
As most of the heritability of complex traits is attributed to common and low frequency genetic variants, imputing them by combining genotyping chips and large sequenced reference panels is the most cost-effective approach to discover the genetic basis of these traits. Association summary statistics from genome-wide meta-analyses are available for hundreds of traits. Updating these to ever-increasing reference panels is very cumbersome as it requires reimputation of the genetic data, rerunning the association scan, and meta-analysing the results. A much more efficient method is to directly impute the summary statistics, termed as summary statistics imputation, which we improved to accommodate variable sample size across SNVs. Its performance relative to genotype imputation and practical utility has not yet been fully investigated. To this end, we compared the two approaches on real (genotyped and imputed) data from 120K samples from the UK Biobank and show that, genotype imputation boasts a 3- to 5-fold lower root-mean-square error, and better distinguishes true associations from null ones: We observed the largest differences in power for variants with low minor allele frequency and low imputation quality. For fixed false positive rates of 0.001, 0.01, 0.05, using summary statistics imputation yielded a decrease in statistical power by 9, 43 and 35%, respectively. To test its capacity to discover novel associations, we applied summary statistics imputation to the GIANT height meta-analysis summary statistics covering HapMap variants, and identified 34 novel loci, 19 of which replicated using data in the UK Biobank. Additionally, we successfully replicated 55 out of the 111 variants published in an exome chip study. Our study demonstrates that summary statistics imputation is a very efficient and cost-effective way to identify and fine-map trait-associated loci. Moreover, the ability to impute summary statistics is important for follow-up analyses, such as Mendelian randomisation or LD-score regression.
Evaluation and application of summary statistic imputation to discover new height-associated loci
2018-01-01
As most of the heritability of complex traits is attributed to common and low frequency genetic variants, imputing them by combining genotyping chips and large sequenced reference panels is the most cost-effective approach to discover the genetic basis of these traits. Association summary statistics from genome-wide meta-analyses are available for hundreds of traits. Updating these to ever-increasing reference panels is very cumbersome as it requires reimputation of the genetic data, rerunning the association scan, and meta-analysing the results. A much more efficient method is to directly impute the summary statistics, termed as summary statistics imputation, which we improved to accommodate variable sample size across SNVs. Its performance relative to genotype imputation and practical utility has not yet been fully investigated. To this end, we compared the two approaches on real (genotyped and imputed) data from 120K samples from the UK Biobank and show that, genotype imputation boasts a 3- to 5-fold lower root-mean-square error, and better distinguishes true associations from null ones: We observed the largest differences in power for variants with low minor allele frequency and low imputation quality. For fixed false positive rates of 0.001, 0.01, 0.05, using summary statistics imputation yielded a decrease in statistical power by 9, 43 and 35%, respectively. To test its capacity to discover novel associations, we applied summary statistics imputation to the GIANT height meta-analysis summary statistics covering HapMap variants, and identified 34 novel loci, 19 of which replicated using data in the UK Biobank. Additionally, we successfully replicated 55 out of the 111 variants published in an exome chip study. Our study demonstrates that summary statistics imputation is a very efficient and cost-effective way to identify and fine-map trait-associated loci. Moreover, the ability to impute summary statistics is important for follow-up analyses, such as Mendelian randomisation or LD-score regression. PMID:29782485
Cui, Junjie; Luo, Shaobo; Niu, Yu; Huang, Rukui; Wen, Qingfang; Su, Jianwen; Miao, Nansheng; He, Weiming; Dong, Zhensheng; Cheng, Jiaowen; Hu, Kailin
2018-01-01
Genetic mapping is a basic tool necessary for anchoring assembled scaffold sequences and for identifying QTLs controlling important traits. Though bitter gourd (Momordica charantia) is both consumed and used as a medicinal, research on its genomics and genetic mapping is severely limited. Here, we report the construction of a restriction site associated DNA (RAD)-based genetic map for bitter gourd using an F2 mapping population comprising 423 individuals derived from two cultivated inbred lines, the gynoecious line ‘K44’ and the monoecious line ‘Dali-11.’ This map comprised 1,009 SNP markers and spanned a total genetic distance of 2,203.95 cM across the 11 linkage groups. It anchored a total of 113 assembled scaffolds that covered about 251.32 Mb (85.48%) of the 294.01 Mb assembled genome. In addition, three horticulturally important traits including sex expression, fruit epidermal structure, and immature fruit color were evaluated using a combination of qualitative and quantitative data. As a result, we identified three QTL/gene loci responsible for these traits in three environments. The QTL/gene gy/fffn/ffn, controlling sex expression involved in gynoecy, first female flower node, and female flower number was detected in the reported region. Particularly, two QTLs/genes, Fwa/Wr and w, were found to be responsible for fruit epidermal structure and white immature fruit color, respectively. This RAD-based genetic map promotes the assembly of the bitter gourd genome and the identified genetic loci will accelerate the cloning of relevant genes in the future. PMID:29706980
Mathew, Lisa S; Spannagl, Manuel; Al-Malki, Ameena; George, Binu; Torres, Maria F; Al-Dous, Eman K; Al-Azwani, Eman K; Hussein, Emad; Mathew, Sweety; Mayer, Klaus F X; Mohamoud, Yasmin Ali; Suhre, Karsten; Malek, Joel A
2014-04-15
The date palm is one of the oldest cultivated fruit trees. It is critical in many ways to cultures in arid lands by providing highly nutritious fruit while surviving extreme heat and environmental conditions. Despite its importance from antiquity, few genetic resources are available for improving the productivity and development of the dioecious date palm. To date there has been no genetic map and no sex chromosome has been identified. Here we present the first genetic map for date palm and identify the putative date palm sex chromosome. We placed ~4000 markers on the map using nearly 1200 framework markers spanning a total of 1293 cM. We have integrated the genetic map, derived from the Khalas cultivar, with the draft genome and placed up to 19% of the draft genome sequence scaffolds onto linkage groups for the first time. This analysis revealed approximately ~1.9 cM/Mb on the map. Comparison of the date palm linkage groups revealed significant long-range synteny to oil palm. Analysis of the date palm sex-determination region suggests it is telomeric on linkage group 12 and recombination is not suppressed in the full chromosome. Based on a modified genotyping-by-sequencing approach we have overcome challenges due to lack of genetic resources and provide the first genetic map for date palm. Combined with the recent draft genome sequence of the same cultivar, this resource offers a critical new tool for date palm biotechnology, palm comparative genomics and a better understanding of sex chromosome development in the palms.
Yadav, Anupama; Dhole, Kaustubh; Sinha, Himanshu
2016-12-01
Cryptic genetic variation (CGV) refers to genetic variants whose effects are buffered in most conditions but manifest phenotypically upon specific genetic and environmental perturbations. Despite having a central role in adaptation, contribution of CGV to regulation of quantitative traits is unclear. Instead, a relatively simplistic architecture of additive genetic loci is known to regulate phenotypic variation in most traits. In this paper, we investigate the regulation of CGV and its implication on the genetic architecture of quantitative traits at a genome-wide level. We use a previously published dataset of biparental recombinant population of Saccharomyces cerevisiae phenotyped in 34 diverse environments to perform single locus, two-locus, and covariance mapping. We identify loci that have independent additive effects as well as those which regulate the phenotypic manifestation of other genetic variants (variance QTL). We find that whereas additive genetic variance is predominant, a higher order genetic interaction network regulates variation in certain environments. Despite containing pleiotropic loci, with effects across environments, these genetic networks are highly environment specific. CGV is buffered under most allelic combinations of these networks and perturbed only in rare combinations resulting in high phenotypic variance. The presence of such environment specific genetic networks is the underlying cause of abundant gene–environment interactions. We demonstrate that overlaying identified molecular networks on such genetic networks can identify potential candidate genes and underlying mechanisms regulating phenotypic variation. Such an integrated approach applied to human disease datasets has the potential to improve the ability to predict disease predisposition and identify specific therapeutic targets.
Yadav, Anupama; Dhole, Kaustubh
2016-01-01
Cryptic genetic variation (CGV) refers to genetic variants whose effects are buffered in most conditions but manifest phenotypically upon specific genetic and environmental perturbations. Despite having a central role in adaptation, contribution of CGV to regulation of quantitative traits is unclear. Instead, a relatively simplistic architecture of additive genetic loci is known to regulate phenotypic variation in most traits. In this paper, we investigate the regulation of CGV and its implication on the genetic architecture of quantitative traits at a genome-wide level. We use a previously published dataset of biparental recombinant population of Saccharomyces cerevisiae phenotyped in 34 diverse environments to perform single locus, two-locus, and covariance mapping. We identify loci that have independent additive effects as well as those which regulate the phenotypic manifestation of other genetic variants (variance QTL). We find that whereas additive genetic variance is predominant, a higher order genetic interaction network regulates variation in certain environments. Despite containing pleiotropic loci, with effects across environments, these genetic networks are highly environment specific. CGV is buffered under most allelic combinations of these networks and perturbed only in rare combinations resulting in high phenotypic variance. The presence of such environment specific genetic networks is the underlying cause of abundant gene–environment interactions. We demonstrate that overlaying identified molecular networks on such genetic networks can identify potential candidate genes and underlying mechanisms regulating phenotypic variation. Such an integrated approach applied to human disease datasets has the potential to improve the ability to predict disease predisposition and identify specific therapeutic targets. PMID:28172852
A sequencing-based linkage map of cucumber
USDA-ARS?s Scientific Manuscript database
Genetic maps are important tools for molecular breeding, gene cloning, and study of meiotic recombination. In cucumber (Cucumis sativus L.), the marker density, resolution and genome coverage of previously developed genetic maps using PCR-based molecular markers are relatively low. In this study we ...
Kudapa, Himabindu; Bharti, Arvind K; Cannon, Steven B; Farmer, Andrew D; Mulaosmanovic, Benjamin; Kramer, Robin; Bohra, Abhishek; Weeks, Nathan T; Crow, John A; Tuteja, Reetu; Shah, Trushar; Dutta, Sutapa; Gupta, Deepak K; Singh, Archana; Gaikwad, Kishor; Sharma, Tilak R; May, Gregory D; Singh, Nagendra K; Varshney, Rajeev K
2012-09-01
A comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes. The resultant transcriptome assembly, referred to as CcTA v2, comprised 21 434 transcript assembly contigs (TACs) with an N50 of 1510 bp, the largest one being ~8 kb. Of the 21 434 TACs, 16 622 (77.5%) could be mapped on to the soybean genome build 1.0.9 under fairly stringent alignment parameters. Based on knowledge of intron junctions, 10 009 primer pairs were designed from 5033 TACs for amplifying intron spanning regions (ISRs). By using in silico mapping of BAC-end-derived SSR loci of pigeonpea on the soybean genome as a reference, putative mapping positions at the chromosome level were predicted for 6284 ISR markers, covering all 11 pigeonpea chromosomes. A subset of 128 ISR markers were analyzed on a set of eight genotypes. While 116 markers were validated, 70 markers showed one to three alleles, with an average of 0.16 polymorphism information content (PIC) value. In summary, the CcTA v2 transcript assembly and ISR markers will serve as a useful resource to accelerate genetic research and breeding applications in pigeonpea.
2013-01-01
Background Genetic linkage maps are important tools in breeding programmes and quantitative trait analyses. Traditional molecular markers used for genotyping are limited in throughput and efficiency. The advent of next-generation sequencing technologies has facilitated progeny genotyping and genetic linkage map construction in the major grains. However, the applicability of the approach remains untested in the fungal system. Findings Shiitake mushroom, Lentinula edodes, is a basidiomycetous fungus that represents one of the most popular cultivated edible mushrooms. Here, we developed a rapid genotyping method based on low-coverage (~0.5 to 1.5-fold) whole-genome resequencing. We used the approach to genotype 20 single-spore isolates derived from L. edodes strain L54 and constructed the first high-density sequence-based genetic linkage map of L. edodes. The accuracy of the proposed genotyping method was verified experimentally with results from mating compatibility tests and PCR-single-strand conformation polymorphism on a few known genes. The linkage map spanned a total genetic distance of 637.1 cM and contained 13 linkage groups. Two hundred sequence-based markers were placed on the map, with an average marker spacing of 3.4 cM. The accuracy of the map was confirmed by comparing with previous maps the locations of known genes such as matA and matB. Conclusions We used the shiitake mushroom as an example to provide a proof-of-principle that low-coverage resequencing could allow rapid genotyping of basidiospore-derived progenies, which could in turn facilitate the construction of high-density genetic linkage maps of basidiomycetous fungi for quantitative trait analyses and improvement of genome assembly. PMID:23915543
Tan, Shu; Cheng, Jiao-Wen; Zhang, Li; Qin, Cheng; Nong, Ding-Guo; Li, Wei-Peng; Tang, Xin; Wu, Zhi-Ming; Hu, Kai-Lin
2015-01-01
Re-sequencing permits the mining of genome-wide variations on a large scale and provides excellent resources for the research community. To accelerate the development and application of molecular markers and identify the QTLs affecting the flowering time-related trait in pepper, a total of 1,038 pairs of InDel and 674 SSR primers from different sources were used for genetic mapping using the F2 population (n = 154) derived from a cross between BA3 (C. annuum) and YNXML (C. frutescens). Of these, a total of 224 simple PCR-based markers, including 129 InDels and 95 SSRs, were validated and integrated into a map, which was designated as the BY map. The BY map consisted of 13 linkage groups (LGs) and spanned a total genetic distance of 1,249.77 cM with an average marker distance of 5.60 cM. Comparative analysis of the genetic and physical map based on the anchored markers showed that the BY map covered nearly the whole pepper genome. Based on the BY map, one major and five minor QTLs affecting the number of leaves on the primary axis (Nle) were detected on chromosomes P2, P7, P10 and P11 in 2012. The major QTL on P2 was confirmed based on another subset of the same F2 population (n = 147) in 2014 with selective genotyping of markers from the BY map. With the accomplishment of pepper whole genome sequencing and annotations (release 2.0), 153 candidate genes were predicted to embed in the Nle2.2 region, of which 12 important flowering related genes were obtained. The InDel/SSR-based interspecific genetic map, QTLs and candidate genes obtained by the present study will be useful for the downstream isolation of flowering time-related gene and other genetic applications for pepper.
Blake, Damer P; Oakes, Richard; Smith, Adrian L
2011-02-01
Eimeria maxima is one of the seven Eimeria spp. that infect the chicken and cause the disease coccidiosis. The well characterised immunogenicity and genetic diversity associated with E. maxima promote its use in genetics-led studies on avian coccidiosis. The development of a genetic map for E. maxima, presented here based upon 647 amplified fragment length polymorphism markers typed from 22 clonal hybrid lines and assembled into 13 major linkage groups, is a major new resource for work with this parasite. Comparison with genetic maps produced for other coccidial parasites indicates relatively high levels of genetic recombination. Conversion of ∼14% of the markers representing the major linkage groups to sequence characterised amplified region markers can provide a scaffold for the assembly of future genomic sequences as well as providing a foundation for more detailed genetic maps. Comparison with the Eimeria tenella genetic map produced 10years ago has revealed a less biased marker distribution, with no more than nine markers mapped within any unresolved heritable unit. Nonetheless, preliminary bioinformatic characterisation of the three largest publicly available genomic E. maxima sequences suggest that the feature-poor/feature-rich structure which has previously been found to define the first sequenced E. tenella chromosome also defines the E. maxima genome. The significance of such a segmented genome and the apparent potential for variation in genetic recombination will be relevant to haplotype stability and the longevity of future anticoccidial strategies based upon multiple loci targeted by novel chemotherapeutic drugs or recombinant subunit vaccines. Copyright © 2010 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Nayak, Spurthi N.; Varghese, Nicy; Shah, Trushar M.; Penmetsa, R. Varma; Thirunavukkarasu, Nepolean; Gudipati, Srivani; Gaur, Pooran M.; Kulwal, Pawan L.; Upadhyaya, Hari D.; KaviKishor, Polavarapu B.; Winter, Peter; Kahl, Günter; Town, Christopher D.; Kilian, Andrzej; Cook, Douglas R.; Varshney, Rajeev K.
2011-01-01
Chickpea (Cicer arietinum L.) is the third most important cool season food legume, cultivated in arid and semi-arid regions of the world. The goal of this study was to develop novel molecular markers such as microsatellite or simple sequence repeat (SSR) markers from bacterial artificial chromosome (BAC)-end sequences (BESs) and diversity arrays technology (DArT) markers, and to construct a high-density genetic map based on recombinant inbred line (RIL) population ICC 4958 (C. arietinum)×PI 489777 (C. reticulatum). A BAC-library comprising 55,680 clones was constructed and 46,270 BESs were generated. Mining of these BESs provided 6,845 SSRs, and primer pairs were designed for 1,344 SSRs. In parallel, DArT arrays with ca. 15,000 clones were developed, and 5,397 clones were found polymorphic among 94 genotypes tested. Screening of newly developed BES-SSR markers and DArT arrays on the parental genotypes of the RIL mapping population showed polymorphism with 253 BES-SSR markers and 675 DArT markers. Segregation data obtained for these polymorphic markers and 494 markers data compiled from published reports or collaborators were used for constructing the genetic map. As a result, a comprehensive genetic map comprising 1,291 markers on eight linkage groups (LGs) spanning a total of 845.56 cM distance was developed (http://cmap.icrisat.ac.in/cmap/sm/cp/thudi/). The number of markers per linkage group ranged from 68 (LG 8) to 218 (LG 3) with an average inter-marker distance of 0.65 cM. While the developed resource of molecular markers will be useful for genetic diversity, genetic mapping and molecular breeding applications, the comprehensive genetic map with integrated BES-SSR markers will facilitate its anchoring to the physical map (under construction) to accelerate map-based cloning of genes in chickpea and comparative genome evolution studies in legumes. PMID:22102885
Dukić, Marinela; Berner, Daniel; Roesti, Marius; Haag, Christoph R; Ebert, Dieter
2016-10-13
Recombination rate is an essential parameter for many genetic analyses. Recombination rates are highly variable across species, populations, individuals and different genomic regions. Due to the profound influence that recombination can have on intraspecific diversity and interspecific divergence, characterization of recombination rate variation emerges as a key resource for population genomic studies and emphasises the importance of high-density genetic maps as tools for studying genome biology. Here we present such a high-density genetic map for Daphnia magna, and analyse patterns of recombination rate across the genome. A F2 intercross panel was genotyped by Restriction-site Associated DNA sequencing to construct the third-generation linkage map of D. magna. The resulting high-density map included 4037 markers covering 813 scaffolds and contigs that sum up to 77 % of the currently available genome draft sequence (v2.4) and 55 % of the estimated genome size (238 Mb). Total genetic length of the map presented here is 1614.5 cM and the genome-wide recombination rate is estimated to 6.78 cM/Mb. Merging genetic and physical information we consistently found that recombination rate estimates are high towards the peripheral parts of the chromosomes, while chromosome centres, harbouring centromeres in D. magna, show very low recombination rate estimates. Due to its high-density, the third-generation linkage map for D. magna can be coupled with the draft genome assembly, providing an essential tool for genome investigation in this model organism. Thus, our linkage map can be used for the on-going improvements of the genome assembly, but more importantly, it has enabled us to characterize variation in recombination rate across the genome of D. magna for the first time. These new insights can provide a valuable assistance in future studies of the genome evolution, mapping of quantitative traits and population genetic studies.
Marancik, David; Gao, Guangtu; Paneru, Bam; Ma, Hao; Hernandez, Alvaro G.; Salem, Mohamed; Yao, Jianbo; Palti, Yniv; Wiens, Gregory D.
2014-01-01
Genetic improvement for enhanced disease resistance in fish is an increasingly utilized approach to mitigate endemic infectious disease in aquaculture. In domesticated salmonid populations, large phenotypic variation in disease resistance has been identified but the genetic basis for altered responsiveness remains unclear. We previously reported three generations of selection and phenotypic validation of a bacterial cold water disease (BCWD) resistant line of rainbow trout, designated ARS-Fp-R. This line has higher survival after infection by either standardized laboratory challenge or natural challenge as compared to two reference lines, designated ARS-Fp-C (control) and ARS-Fp-S (susceptible). In this study, we utilized 1.1 g fry from the three genetic lines and performed RNA-seq to measure transcript abundance from the whole body of naive and Flavobacterium psychrophilum infected fish at day 1 (early time-point) and at day 5 post-challenge (onset of mortality). Sequences from 24 libraries were mapped onto the rainbow trout genome reference transcriptome of 46,585 predicted protein coding mRNAs that included 2633 putative immune-relevant gene transcripts. A total of 1884 genes (4.0% genome) exhibited differential transcript abundance between infected and mock-challenged fish (FDR < 0.05) that included chemokines, complement components, tnf receptor superfamily members, interleukins, nod-like receptor family members, and genes involved in metabolism and wound healing. The largest number of differentially expressed genes occurred on day 5 post-infection between naive and challenged ARS-Fp-S line fish correlating with high bacterial load. After excluding the effect of infection, we identified 21 differentially expressed genes between the three genetic lines. In summary, these data indicate global transcriptome differences between genetic lines of naive animals as well as differentially regulated transcriptional responses to infection. PMID:25620978
Mapping X-Disease Phytoplasma Resistance in Prunus virginiana
Lenz, Ryan R.; Dai, Wenhao
2017-01-01
Phytoplasmas such as “Candidatus Phytoplasma pruni,” the causal agent of X-disease of stone fruits, lack detailed biological analysis. This has limited the understanding of plant resistance mechanisms. Chokecherry (Prunus virginiana L.) is a promising model to be used for the plant-phytoplasma interaction due to its documented ability to resist X-disease infection. A consensus chokecherry genetic map “Cho” was developed with JoinMap 4.0 by joining two parental maps. The new map contains a complete set of 16 linkage groups, spanning a genetic distance of 2,172 cM with an average marker density of 3.97 cM. Three significant quantitative trait loci (QTL) associated with X-disease resistance were identified contributing to a total of 45.9% of the phenotypic variation. This updated genetic linkage map and the identified QTL will provide the framework needed to facilitate molecular genetics, genomics, breeding, and biotechnology research concerning X-disease in chokecherry and other Prunus species. PMID:29238359
Comparative mapping and rapid karyotypic evolution in the genus helianthus.
Burke, John M; Lai, Zhao; Salmaso, Marzia; Nakazato, Takuya; Tang, Shunxue; Heesacker, Adam; Knapp, Steven J; Rieseberg, Loren H
2004-01-01
Comparative genetic linkage maps provide a powerful tool for the study of karyotypic evolution. We constructed a joint SSR/RAPD genetic linkage map of the Helianthus petiolaris genome and used it, along with an integrated SSR genetic linkage map derived from four independent H. annuus mapping populations, to examine the evolution of genome structure between these two annual sunflower species. The results of this work indicate the presence of 27 colinear segments resulting from a minimum of eight translocations and three inversions. These 11 rearrangements are more than previously suspected on the basis of either cytological or genetic map-based analyses. Taken together, these rearrangements required a minimum of 20 chromosomal breakages/fusions. On the basis of estimates of the time since divergence of these two species (750,000-1,000,000 years), this translates into an estimated rate of 5.5-7.3 chromosomal rearrangements per million years of evolution, the highest rate reported for any taxonomic group to date. PMID:15166168
Genetic Map of Bacteriophage φX174
Benbow, R. M.; Hutchison, C. A.; Fabricant, J. D.; Sinsheimer, R. L.
1971-01-01
Bacteriophage φX174 temperature-sensitive and nonsense mutations in eight cistrons were mapped by using two-, three-, and four-factor genetic crosses. The genetic map is circular with a total length of 24 × 10−4wt recombinants per progeny phage. The cistron order is D-E-F-G-H-A-B-C. High negative interference is seen, consistent with a small closed circular deoxyribonucleic acid molecule as a genome. PMID:16789129
Identification of 29 Rat Genetic Markers by Arbitrarily Primed Polymerase Chain Reaction
Canzian, Federico; Toyota, Minoru; Hosoya, Yoko; Sugimura, Takashi; Nagao, Minako
1996-01-01
The number of genetic markers for the rat is still limited, in spite of its wide use in cancer research. To facilitate accurate mapping of both established and novel rat genetic markers, we constructed a linkage map by genotyping 105 F2 rats from ACI/N (ACI) and BUF/Nac (BUF) crosses. This map consists of 120 genetic markers that had been previously reported, mainly by two research groups, but had not been integrated. To find new genetic markers, the arbitrarily primed polymerase chain reaction (AP‐PCR) was applied to detect polymorphic bands between ACI and BUF rats. After testing 56 single primers and 12 combinations of primers, we found 36 bands produced by 16 single primers and two combinations to be reliably polymorphic between ACI and BUF rats. The 36 bands were typed in the 105 F2 rats, and 29 of them could be linkage‐mapped. AP‐PCR is thus useful to detect new genetic markers in laboratory strains of rats. PMID:8698613
Mok, Calvin A; Au, Vinci; Thompson, Owen A; Edgley, Mark L; Gevirtzman, Louis; Yochem, John; Lowry, Joshua; Memar, Nadin; Wallenfang, Matthew R; Rasoloson, Dominique; Bowerman, Bruce; Schnabel, Ralf; Seydoux, Geraldine; Moerman, Donald G; Waterston, Robert H
2017-10-01
Mutants remain a powerful means for dissecting gene function in model organisms such as Caenorhabditis elegans Massively parallel sequencing has simplified the detection of variants after mutagenesis but determining precisely which change is responsible for phenotypic perturbation remains a key step. Genetic mapping paradigms in C . elegans rely on bulk segregant populations produced by crosses with the problematic Hawaiian wild isolate and an excess of redundant information from whole-genome sequencing (WGS). To increase the repertoire of available mutants and to simplify identification of the causal change, we performed WGS on 173 temperature-sensitive (TS) lethal mutants and devised a novel mapping method. The mapping method uses molecular inversion probes (MIP-MAP) in a targeted sequencing approach to genetic mapping, and replaces the Hawaiian strain with a Million Mutation Project strain with high genomic and phenotypic similarity to the laboratory wild-type strain N2 We validated MIP-MAP on a subset of the TS mutants using a competitive selection approach to produce TS candidate mapping intervals with a mean size < 3 Mb. MIP-MAP successfully uses a non-Hawaiian mapping strain and multiplexed libraries are sequenced at a fraction of the cost of WGS mapping approaches. Our mapping results suggest that the collection of TS mutants contains a diverse library of TS alleles for genes essential to development and reproduction. MIP-MAP is a robust method to genetically map mutations in both viable and essential genes and should be adaptable to other organisms. It may also simplify tracking of individual genotypes within population mixtures. Copyright © 2017 by the Genetics Society of America.
Druka, Arnis; Druka, Ilze; Centeno, Arthur G; Li, Hongqiang; Sun, Zhaohui; Thomas, William TB; Bonar, Nicola; Steffenson, Brian J; Ullrich, Steven E; Kleinhofs, Andris; Wise, Roger P; Close, Timothy J; Potokina, Elena; Luo, Zewei; Wagner, Carola; Schweizer, Günther F; Marshall, David F; Kearsey, Michael J; Williams, Robert W; Waugh, Robbie
2008-01-01
Background A typical genetical genomics experiment results in four separate data sets; genotype, gene expression, higher-order phenotypic data and metadata that describe the protocols, processing and the array platform. Used in concert, these data sets provide the opportunity to perform genetic analysis at a systems level. Their predictive power is largely determined by the gene expression dataset where tens of millions of data points can be generated using currently available mRNA profiling technologies. Such large, multidimensional data sets often have value beyond that extracted during their initial analysis and interpretation, particularly if conducted on widely distributed reference genetic materials. Besides quality and scale, access to the data is of primary importance as accessibility potentially allows the extraction of considerable added value from the same primary dataset by the wider research community. Although the number of genetical genomics experiments in different plant species is rapidly increasing, none to date has been presented in a form that allows quick and efficient on-line testing for possible associations between genes, loci and traits of interest by an entire research community. Description Using a reference population of 150 recombinant doubled haploid barley lines we generated novel phenotypic, mRNA abundance and SNP-based genotyping data sets, added them to a considerable volume of legacy trait data and entered them into the GeneNetwork . GeneNetwork is a unified on-line analytical environment that enables the user to test genetic hypotheses about how component traits, such as mRNA abundance, may interact to condition more complex biological phenotypes (higher-order traits). Here we describe these barley data sets and demonstrate some of the functionalities GeneNetwork provides as an easily accessible and integrated analytical environment for exploring them. Conclusion By integrating barley genotypic, phenotypic and mRNA abundance data sets directly within GeneNetwork's analytical environment we provide simple web access to the data for the research community. In this environment, a combination of correlation analysis and linkage mapping provides the potential to identify and substantiate gene targets for saturation mapping and positional cloning. By integrating datasets from an unsequenced crop plant (barley) in a database that has been designed for an animal model species (mouse) with a well established genome sequence, we prove the importance of the concept and practice of modular development and interoperability of software engineering for biological data sets. PMID:19017390
USDA-ARS?s Scientific Manuscript database
High-density genetic linkage maps are essential for fine mapping QTLs controlling disease resistance traits, such as early leaf spot (ELS), late leaf spot (LLS), and Tomato spotted wilt virus (TSWV). With completion of the genome sequences of two diploid ancestors of cultivated peanut, we could use ...
Wang, Baohua; Liu, Limei; Zhang, Dong; Zhuang, Zhimin; Guo, Hui; Qiao, Xin; Wei, Lijuan; Rong, Junkang; May, O. Lloyd; Paterson, Andrew H.; Chee, Peng W.
2016-01-01
Among the seven tetraploid cotton species, little is known about transmission genetics and genome organization in Gossypium mustelinum, the species most distant from the source of most cultivated cotton, G. hirsutum. In this research, an F2 population was developed from an interspecific cross between G. hirsutum and G. mustelinum (HM). A genetic linkage map was constructed mainly using simple sequence repeat (SSRs) and restriction fragment length polymorphism (RFLP) DNA markers. The arrangements of most genetic loci along the HM chromosomes were identical to those of other tetraploid cotton species. However, both major and minor structural rearrangements were also observed, for which we propose a parsimony-based model for structural divergence of tetraploid cottons from common ancestors. Sequences of mapped markers were used for alignment with the 26 scaffolds of the G. hirsutum draft genome, and showed high consistency. Quantitative trait locus (QTL) mapping of fiber elongation in advanced backcross populations derived from the same parents demonstrated the value of the HM map. The HM map will serve as a valuable resource for QTL mapping and introgression of G. mustelinum alleles into G. hirsutum, and help clarify evolutionary relationships between the tetraploid cotton genomes. PMID:27172208
A second-generation genetic linkage map of the domestic dog, Canis familiaris.
Neff, M W; Broman, K W; Mellersh, C S; Ray, K; Acland, G M; Aguirre, G D; Ziegle, J S; Ostrander, E A; Rine, J
1999-01-01
Purebred strains, pronounced phenotypic variation, and a high incidence of heritable disease make the domestic dog uniquely suited to complement genetic analyses in humans and mice. A comprehensive genetic linkage map would afford many opportunities in dogs, ranging from the positional cloning of disease genes to the dissection of quantitative differences in size, shape, and behavior. Here we report a canine linkage map with the number of mapped loci expanded to 276 and 10-cM coverage extended to 75-90% of the genome. Most of the 38 canine autosomes are likely represented in the collection of 39 autosomal linkage groups. Eight markers were sufficiently informative to detect linkage at distances of 10-13 cM, yet remained unlinked to any other marker. Taken together, the results suggested a genome size of about 27 M. As in other species, the genetic length varied between sexes, with the female autosomal distance being approximately 1.4-fold greater than that of male meioses. Fifteen markers anchored well-described genes on the map, thereby serving as landmarks for comparative mapping in dogs. We discuss the utility of the current map and outline steps necessary for future map improvement. PMID:9927471
A Genetic Linkage Map for Cattle
Bishop, M. D.; Kappes, S. M.; Keele, J. W.; Stone, R. T.; Sunden, SLF.; Hawkins, G. A.; Toldo, S. S.; Fries, R.; Grosz, M. D.; Yoo, J.; Beattie, C. W.
1994-01-01
We report the most extensive physically anchored linkage map for cattle produced to date. Three-hundred thirteen genetic markers ordered in 30 linkage groups, anchored to 24 autosomal chromosomes (n = 29), the X and Y chromosomes, four unanchored syntenic groups and two unassigned linkage groups spanning 2464 cM of the bovine genome are summarized. The map also assigns 19 type I loci to specific chromosomes and/or syntenic groups and four cosmid clones containing informative microsatellites to chromosomes 13, 25 and 29 anchoring syntenic groups U11, U7 and U8, respectively. This map provides the skeletal framework prerequisite to development of a comprehensive genetic map for cattle and analysis of economic trait loci (ETL). PMID:7908653
Ollitrault, Frédérique; Terol, Javier; Pina, Jose Antonio; Navarro, Luis; Talon, Manuel; Ollitrault, Patrick
2010-11-01
Microsatellite primers were developed from bacterial artificial chromosome (BAC) end sequences of Citrus clementina and their transferability and polymorphism tested in the genus Citrus for future anchorage of physical and genetic maps and comparative interspecific genetic mapping. • Using PAGE and DNA silver staining, 79 primer pairs were selected for their transferability and polymorphism among 526 microsatellites mined in BES. A preliminary diversity study in Citrus was conducted with 18 of them, in C. reticulata, C. maxima, C. medica, C. sinensis, C. aurantium, C. paradisi, C. lemon, C. aurantifolia, and some papedas (wild citrus), using a capillary electrophoresis fragment analyzer. Intra- and interspecific polymorphism was observed, and heterozygous markers were identified for the different genotypes to be used for genetic mapping. • These results indicate the utility of the developed primers for comparative mapping studies and the integration of physical and genetic maps.
Mercenaro, Luca; Nieddu, Giovanni; Porceddu, Andrea; Pezzotti, Mario; Camiolo, Salvatore
2017-01-01
The genetic diversity among grapevine (Vitis vinifera L.) cultivars that underlies differences in agronomic performance and wine quality reflects the accumulation of single nucleotide polymorphisms (SNPs) and small indels as well as larger genomic variations. A combination of high throughput sequencing and mapping against the grapevine reference genome allows the creation of comprehensive sequence variation maps. We used next generation sequencing and bioinformatics to generate an inventory of SNPs and small indels in four widely cultivated Sardinian grape cultivars (Bovale sardo, Cannonau, Carignano and Vermentino). More than 3,200,000 SNPs were identified with high statistical confidence. Some of the SNPs caused the appearance of premature stop codons and thus identified putative pseudogenes. The analysis of SNP distribution along chromosomes led to the identification of large genomic regions with uninterrupted series of homozygous SNPs. We used a digital comparative genomic hybridization approach to identify 6526 genomic regions with significant differences in copy number among the four cultivars compared to the reference sequence, including 81 regions shared between all four cultivars and 4953 specific to single cultivars (representing 1.2 and 75.9% of total copy number variation, respectively). Reads mapping at a distance that was not compatible with the insert size were used to identify a dataset of putative large deletions with cultivar Cannonau revealing the highest number. The analysis of genes mapping to these regions provided a list of candidates that may explain some of the phenotypic differences among the Bovale sardo, Cannonau, Carignano and Vermentino cultivars. PMID:28775732
Tennessen, Jacob A.; Govindarajulu, Rajanikanth; Ashman, Tia-Lynn; Liston, Aaron
2014-01-01
Whole-genome duplications are radical evolutionary events that have driven speciation and adaptation in many taxa. Higher-order polyploids have complex histories often including interspecific hybridization and dynamic genomic changes. This chromosomal reshuffling is poorly understood for most polyploid species, despite their evolutionary and agricultural importance, due to the challenge of distinguishing homologous sequences from each other. Here, we use dense linkage maps generated with targeted sequence capture to improve the diploid strawberry (Fragaria vesca) reference genome and to disentangle the subgenomes of the wild octoploid progenitors of cultivated strawberry, Fragaria virginiana and Fragaria chiloensis. Our novel approach, POLiMAPS (Phylogenetics Of Linkage-Map-Anchored Polyploid Subgenomes), leverages sequence reads to associate informative interhomeolog phylogenetic markers with linkage groups and reference genome positions. In contrast to a widely accepted model, we find that one of the four subgenomes originates with the diploid cytoplasm donor F. vesca, one with the diploid Fragaria iinumae, and two with an unknown ancestor close to F. iinumae. Extensive unidirectional introgression has converted F. iinumae-like subgenomes to be more F. vesca-like, but never the reverse, due either to homoploid hybridization in the F. iinumae-like diploid ancestors or else strong selection spreading F. vesca-like sequence among subgenomes through homeologous exchange. In addition, divergence between homeologous chromosomes has been substantially augmented by interchromosomal rearrangements. Our phylogenetic approach reveals novel aspects of the complicated web of genetic exchanges that occur during polyploid evolution and suggests a path forward for unraveling other agriculturally and ecologically important polyploid genomes. PMID:25477420
Eynard, Sonia E; Croiseau, Pascal; Laloë, Denis; Fritz, Sebastien; Calus, Mario P L; Restoux, Gwendal
2018-01-04
Genomic selection (GS) is commonly used in livestock and increasingly in plant breeding. Relying on phenotypes and genotypes of a reference population, GS allows performance prediction for young individuals having only genotypes. This is expected to achieve fast high genetic gain but with a potential loss of genetic diversity. Existing methods to conserve genetic diversity depend mostly on the choice of the breeding individuals. In this study, we propose a modification of the reference population composition to mitigate diversity loss. Since the high cost of phenotyping is the limiting factor for GS, our findings are of major economic interest. This study aims to answer the following questions: how would decisions on the reference population affect the breeding population, and how to best select individuals to update the reference population and balance maximizing genetic gain and minimizing loss of genetic diversity? We investigated three updating strategies for the reference population: random, truncation, and optimal contribution (OC) strategies. OC maximizes genetic merit for a fixed loss of genetic diversity. A French Montbéliarde dairy cattle population with 50K SNP chip genotypes and simulations over 10 generations were used to compare these different strategies using milk production as the trait of interest. Candidates were selected to update the reference population. Prediction bias and both genetic merit and diversity were measured. Changes in the reference population composition slightly affected the breeding population. Optimal contribution strategy appeared to be an acceptable compromise to maintain both genetic gain and diversity in the reference and the breeding populations. Copyright © 2018 Eynard et al.
Xia, Zhiqiang; Zhang, Shengkui; Wen, Mingfu; Lu, Cheng; Sun, Yufang; Zou, Meiling; Wang, Wenquan
2018-01-01
As an important biofuel plant, the demand for higher yield Jatropha curcas L. is rapidly increasing. However, genetic analysis of Jatropha and molecular breeding for higher yield have been hampered by the limited number of molecular markers available. An ultrahigh-density linkage map for a Jatropha mapping population of 153 individuals was constructed and covered 1380.58 cM of the Jatropha genome, with average marker density of 0.403 cM. The genetic linkage map consisted of 3422 SNP and indel markers, which clustered into 11 linkage groups. With this map, 13 repeatable QTLs (reQTLs) for fruit yield traits were identified. Ten reQTLs, qNF - 1 , qNF - 2a , qNF - 2b , qNF - 2c , qNF - 3 , qNF - 4 , qNF - 6 , qNF - 7a , qNF - 7b and qNF - 8, that control the number of fruits (NF) mapped to LGs 1, 2, 3, 4, 6, 7 and 8, whereas three reQTLs, qTWF - 1 , qTWF - 2 and qTWF - 3, that control the total weight of fruits (TWF) mapped to LGs 1, 2 and 3, respectively. It is interesting that there are two candidate critical genes, which may regulate Jatropha fruit yield. We also identified three pleiotropic reQTL pairs associated with both the NF and TWF traits. This study is the first to report an ultrahigh-density Jatropha genetic linkage map construction, and the markers used in this study showed great potential for QTL mapping. Thirteen fruit-yield reQTLs and two important candidate genes were identified based on this linkage map. This genetic linkage map will be a useful tool for the localization of other economically important QTLs and candidate genes for Jatropha .
Feltus, F Alex
2014-06-01
Understanding the control of any trait optimally requires the detection of causal genes, gene interaction, and mechanism of action to discover and model the biochemical pathways underlying the expressed phenotype. Functional genomics techniques, including RNA expression profiling via microarray and high-throughput DNA sequencing, allow for the precise genome localization of biological information. Powerful genetic approaches, including quantitative trait locus (QTL) and genome-wide association study mapping, link phenotype with genome positions, yet genetics is less precise in localizing the relevant mechanistic information encoded in DNA. The coupling of salient functional genomic signals with genetically mapped positions is an appealing approach to discover meaningful gene-phenotype relationships. Techniques used to define this genetic-genomic convergence comprise the field of systems genetics. This short review will address an application of systems genetics where RNA profiles are associated with genetically mapped genome positions of individual genes (eQTL mapping) or as gene sets (co-expression network modules). Both approaches can be applied for knowledge independent selection of candidate genes (and possible control mechanisms) underlying complex traits where multiple, likely unlinked, genomic regions might control specific complex traits. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Usaj, Matej; Tan, Yizhao; Wang, Wen; VanderSluis, Benjamin; Zou, Albert; Myers, Chad L.; Costanzo, Michael; Andrews, Brenda; Boone, Charles
2017-01-01
Providing access to quantitative genomic data is key to ensure large-scale data validation and promote new discoveries. TheCellMap.org serves as a central repository for storing and analyzing quantitative genetic interaction data produced by genome-scale Synthetic Genetic Array (SGA) experiments with the budding yeast Saccharomyces cerevisiae. In particular, TheCellMap.org allows users to easily access, visualize, explore, and functionally annotate genetic interactions, or to extract and reorganize subnetworks, using data-driven network layouts in an intuitive and interactive manner. PMID:28325812
Usaj, Matej; Tan, Yizhao; Wang, Wen; VanderSluis, Benjamin; Zou, Albert; Myers, Chad L; Costanzo, Michael; Andrews, Brenda; Boone, Charles
2017-05-05
Providing access to quantitative genomic data is key to ensure large-scale data validation and promote new discoveries. TheCellMap.org serves as a central repository for storing and analyzing quantitative genetic interaction data produced by genome-scale Synthetic Genetic Array (SGA) experiments with the budding yeast Saccharomyces cerevisiae In particular, TheCellMap.org allows users to easily access, visualize, explore, and functionally annotate genetic interactions, or to extract and reorganize subnetworks, using data-driven network layouts in an intuitive and interactive manner. Copyright © 2017 Usaj et al.
Choi, Hong-Kyu; Kim, Dongjin; Uhm, Taesik; Limpens, Eric; Lim, Hyunju; Mun, Jeong-Hwan; Kalo, Peter; Penmetsa, R Varma; Seres, Andrea; Kulikova, Olga; Roe, Bruce A; Bisseling, Ton; Kiss, Gyorgy B; Cook, Douglas R
2004-01-01
A core genetic map of the legume Medicago truncatula has been established by analyzing the segregation of 288 sequence-characterized genetic markers in an F(2) population composed of 93 individuals. These molecular markers correspond to 141 ESTs, 80 BAC end sequence tags, and 67 resistance gene analogs, covering 513 cM. In the case of EST-based markers we used an intron-targeted marker strategy with primers designed to anneal in conserved exon regions and to amplify across intron regions. Polymorphisms were significantly more frequent in intron vs. exon regions, thus providing an efficient mechanism to map transcribed genes. Genetic and cytogenetic analysis produced eight well-resolved linkage groups, which have been previously correlated with eight chromosomes by means of FISH with mapped BAC clones. We anticipated that mapping of conserved coding regions would have utility for comparative mapping among legumes; thus 60 of the EST-based primer pairs were designed to amplify orthologous sequences across a range of legume species. As an initial test of this strategy, we used primers designed against M. truncatula exon sequences to rapidly map genes in M. sativa. The resulting comparative map, which includes 68 bridging markers, indicates that the two Medicago genomes are highly similar and establishes the basis for a Medicago composite map. PMID:15082563
Julier, Bernadette; Flajoulot, Sandrine; Barre, Philippe; Cardinet, Gaëlle; Santoni, Sylvain; Huguet, Thierry; Huyghe, Christian
2003-01-01
Background Alfalfa (Medicago sativa) is a major forage crop. The genetic progress is slow in this legume species because of its autotetraploidy and allogamy. The genetic structure of this species makes the construction of genetic maps difficult. To reach this objective, and to be able to detect QTLs in segregating populations, we used the available codominant microsatellite markers (SSRs), most of them identified in the model legume Medicago truncatula from EST database. A genetic map was constructed with AFLP and SSR markers using specific mapping procedures for autotetraploids. The tetrasomic inheritance was analysed in an alfalfa mapping population. Results We have demonstrated that 80% of primer pairs defined on each side of SSR motifs in M. truncatula EST database amplify with the alfalfa DNA. Using a F1 mapping population of 168 individuals produced from the cross of 2 heterozygous parental plants from Magali and Mercedes cultivars, we obtained 599 AFLP markers and 107 SSR loci. All but 3 SSR loci showed a clear tetrasomic inheritance. For most of the SSR loci, the double-reduction was not significant. For the other loci no specific genotypes were produced, so the significant double-reduction could arise from segregation distortion. For each parent, the genetic map contained 8 groups of four homologous chromosomes. The lengths of the maps were 2649 and 3045 cM, with an average distance of 7.6 and 9.0 cM between markers, for Magali and Mercedes parents, respectively. Using only the SSR markers, we built a composite map covering 709 cM. Conclusions Compared to diploid alfalfa genetic maps, our maps cover about 88–100% of the genome and are close to saturation. The inheritance of the codominant markers (SSR) and the pattern of linkage repulsions between markers within each homology group are consistent with the hypothesis of a tetrasomic meiosis in alfalfa. Except for 2 out of 107 SSR markers, we found a similar order of markers on the chromosomes between the tetraploid alfalfa and M. truncatula genomes indicating a high level of colinearity between these two species. These maps will be a valuable tool for alfalfa breeding and are being used to locate QTLs. PMID:14683527
2014-01-01
Background The date palm is one of the oldest cultivated fruit trees. It is critical in many ways to cultures in arid lands by providing highly nutritious fruit while surviving extreme heat and environmental conditions. Despite its importance from antiquity, few genetic resources are available for improving the productivity and development of the dioecious date palm. To date there has been no genetic map and no sex chromosome has been identified. Results Here we present the first genetic map for date palm and identify the putative date palm sex chromosome. We placed ~4000 markers on the map using nearly 1200 framework markers spanning a total of 1293 cM. We have integrated the genetic map, derived from the Khalas cultivar, with the draft genome and placed up to 19% of the draft genome sequence scaffolds onto linkage groups for the first time. This analysis revealed approximately ~1.9 cM/Mb on the map. Comparison of the date palm linkage groups revealed significant long-range synteny to oil palm. Analysis of the date palm sex-determination region suggests it is telomeric on linkage group 12 and recombination is not suppressed in the full chromosome. Conclusions Based on a modified gentoyping-by-sequencing approach we have overcome challenges due to lack of genetic resources and provide the first genetic map for date palm. Combined with the recent draft genome sequence of the same cultivar, this resource offers a critical new tool for date palm biotechnology, palm comparative genomics and a better understanding of sex chromosome development in the palms. PMID:24735434
Gong, Wen-Bing; Li, Lei; Zhou, Yan; Bian, Yin-Bing; Kwan, Hoi-Shan; Cheung, Man-Kit; Xiao, Yang
2016-06-01
To provide a better understanding of the genetic architecture of fruiting body formation of Lentinula edodes, quantitative trait loci (QTLs) mapping was employed to uncover the loci underlying seven fruiting body-related traits (FBRTs). An improved L. edodes genetic linkage map, comprising 572 markers on 12 linkage groups with a total map length of 983.7 cM, was constructed by integrating 82 genomic sequence-based insertion-deletion (InDel) markers into a previously published map. We then detected a total of 62 QTLs for seven target traits across two segregating testcross populations, with individual QTLs contributing 5.5 %-30.2 % of the phenotypic variation. Fifty-three out of the 62 QTLs were clustered in six QTL hotspots, suggesting the existence of main genomic regions regulating the morphological characteristics of fruiting bodies in L. edodes. A stable QTL hotspot on MLG2, containing QTLs for all investigated traits, was identified in both testcross populations. QTLs for related traits were frequently co-located on the linkage groups, demonstrating the genetic basis for phenotypic correlation of traits. Meta-QTL (mQTL) analysis was performed and identified 16 mQTLs with refined positions and narrow confidence intervals (CIs). Nine genes, including those encoding MAP kinase, blue-light photoreceptor, riboflavin-aldehyde-forming enzyme and cyclopropane-fatty-acyl-phospholipid synthase, and cytochrome P450s, were likely to be candidate genes controlling the shape of fruiting bodies. The study has improved our understanding of the genetic architecture of fruiting body formation in L. edodes. To our knowledge, this is the first genome-wide QTL detection of FBRTs in L. edodes. The improved genetic map, InDel markers and QTL hotspot regions revealed here will assist considerably in the conduct of future genetic and breeding studies of L. edodes.
Kamphuis, Lars G; Hane, James K; Nelson, Matthew N; Gao, Lingling; Atkins, Craig A; Singh, Karam B
2015-01-01
Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map. PMID:25060816
Behnke, Michael S; Khan, Asis; Sibley, L David
2015-02-01
Quantitative trait locus (QTL) mapping studies have been integral in identifying and understanding virulence mechanisms in the parasite Toxoplasma gondii. In this study, we interrogated a different phenotype by mapping sinefungin (SNF) resistance in the genetic cross between type 2 ME49-FUDR(r) and type 10 VAND-SNF(r). The genetic map of this cross was generated by whole-genome sequencing of the progeny and subsequent identification of single nucleotide polymorphisms (SNPs) inherited from the parents. Based on this high-density genetic map, we were able to pinpoint the sinefungin resistance phenotype to one significant locus on chromosome IX. Within this locus, a single nonsynonymous SNP (nsSNP) resulting in an early stop codon in the TGVAND_290860 gene was identified, occurring only in the sinefungin-resistant progeny. Using CRISPR/CAS9, we were able to confirm that targeted disruption of TGVAND_290860 renders parasites sinefungin resistant. Because disruption of the SNR1 gene confers resistance, we also show that it can be used as a negative selectable marker to insert either a positive drug selection cassette or a heterologous reporter. These data demonstrate the power of combining classical genetic mapping, whole-genome sequencing, and CRISPR-mediated gene disruption for combined forward and reverse genetic strategies in T. gondii. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Nunes, José de Ribamar da Silva; Liu, Shikai; Pértille, Fábio; Perazza, Caio Augusto; Villela, Priscilla Marqui Schmidt; de Almeida-Val, Vera Maria Fonseca; Hilsdorf, Alexandre Wagner Silva; Liu, Zhanjiang; Coutinho, Luiz Lehmann
2017-01-01
Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping by sequencing method. In all, 68,584 SNPs were initially identified using minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups which allowed for functional SNP annotations. The large-scale SNP discovery obtained here, allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs. PMID:28387238
Shao, Changwei; Niu, Yongchao; Rastas, Pasi; Liu, Yang; Xie, Zhiyuan; Li, Hengde; Wang, Lei; Jiang, Yong; Tai, Shuaishuai; Tian, Yongsheng; Sakamoto, Takashi; Chen, Songlin
2015-04-01
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1-8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Mutation Scanning in Wheat by Exon Capture and Next-Generation Sequencing.
King, Robert; Bird, Nicholas; Ramirez-Gonzalez, Ricardo; Coghill, Jane A; Patil, Archana; Hassani-Pak, Keywan; Uauy, Cristobal; Phillips, Andrew L
2015-01-01
Targeted Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach to identify novel sequence variation in genomes, with the aims of investigating gene function and/or developing useful alleles for breeding. Despite recent advances in wheat genomics, most current TILLING methods are low to medium in throughput, being based on PCR amplification of the target genes. We performed a pilot-scale evaluation of TILLING in wheat by next-generation sequencing through exon capture. An oligonucleotide-based enrichment array covering ~2 Mbp of wheat coding sequence was used to carry out exon capture and sequencing on three mutagenised lines of wheat containing previously-identified mutations in the TaGA20ox1 homoeologous genes. After testing different mapping algorithms and settings, candidate SNPs were identified by mapping to the IWGSC wheat Chromosome Survey Sequences. Where sequence data for all three homoeologues were found in the reference, mutant calls were unambiguous; however, where the reference lacked one or two of the homoeologues, captured reads from these genes were mis-mapped to other homoeologues, resulting either in dilution of the variant allele frequency or assignment of mutations to the wrong homoeologue. Competitive PCR assays were used to validate the putative SNPs and estimate cut-off levels for SNP filtering. At least 464 high-confidence SNPs were detected across the three mutagenized lines, including the three known alleles in TaGA20ox1, indicating a mutation rate of ~35 SNPs per Mb, similar to that estimated by PCR-based TILLING. This demonstrates the feasibility of using exon capture for genome re-sequencing as a method of mutation detection in polyploid wheat, but accurate mutation calling will require an improved genomic reference with more comprehensive coverage of homoeologues.
Gaber, Richard F.; Mathison, Lorilee; Edelman, Irv; Culbertson, Michael R.
1983-01-01
Five previously unmapped frameshift suppressor genes have been located on the yeast genetic map. In addition, we have further characterized the map positions of two suppressors whose approximate locations were determined in an earlier study. These results represent the completion of genetic mapping studies on all 25 of the known frameshift suppressor genes in yeast.—The approximate location of each suppressor gene was initially determined through the use of a set of mapping strains containing 61 signal markers distributed throughout the yeast genome. Standard meiotic linkage was assayed in crosses between strains carrying the suppressors and the mapping strains. Subsequent to these approximate linkage determinations, each suppressor gene was more precisely located in multi-point crosses. The implications of these mapping results for the genomic distribution of frameshift suppressor genes, which include both glycine and proline tRNA genes, are discussed. PMID:17246112
A maximum likelihood map of chromosome 1.
Rao, D C; Keats, B J; Lalouel, J M; Morton, N E; Yee, S
1979-01-01
Thirteen loci are mapped on chromosome 1 from genetic evidence. The maximum likelihood map presented permits confirmation that Scianna (SC) and a fourteenth locus, phenylketonuria (PKU), are on chromosome 1, although the location of the latter on the PGM1-AMY segment is uncertain. Eight other controversial genetic assignments are rejected, providing a practical demonstration of the resolution which maximum likelihood theory brings to mapping. PMID:293128
2012-01-01
Background Brassica oleracea encompass a family of vegetables and cabbage that are among the most widely cultivated crops. In 2009, the B. oleracea Genome Sequencing Project was launched using next generation sequencing technology. None of the available maps were detailed enough to anchor the sequence scaffolds for the Genome Sequencing Project. This report describes the development of a large number of SSR and SNP markers from the whole genome shotgun sequence data of B. oleracea, and the construction of a high-density genetic linkage map using a double haploid mapping population. Results The B. oleracea high-density genetic linkage map that was constructed includes 1,227 markers in nine linkage groups spanning a total of 1197.9 cM with an average of 0.98 cM between adjacent loci. There were 602 SSR markers and 625 SNP markers on the map. The chromosome with the highest number of markers (186) was C03, and the chromosome with smallest number of markers (99) was C09. Conclusions This first high-density map allowed the assembled scaffolds to be anchored to pseudochromosomes. The map also provides useful information for positional cloning, molecular breeding, and integration of information of genes and traits in B. oleracea. All the markers on the map will be transferable and could be used for the construction of other genetic maps. PMID:23033896
Genetic landscapes GIS Toolbox: tools to map patterns of genetic divergence and diversity.
Vandergast, Amy G.; Perry, William M.; Lugo, Roberto V.; Hathaway, Stacie A.
2011-01-01
The Landscape Genetics GIS Toolbox contains tools that run in the Geographic Information System software, ArcGIS, to map genetic landscapes and to summarize multiple genetic landscapes as average and variance surfaces. These tools can be used to visualize the distribution of genetic diversity across geographic space and to study associations between patterns of genetic diversity and geographic features or other geo-referenced environmental data sets. Together, these tools create genetic landscape surfaces directly from tables containing genetic distance or diversity data and sample location coordinates, greatly reducing the complexity of building and analyzing these raster surfaces in a Geographic Information System.
A second generation genetic linkage map of Japanese flounder (Paralichthys olivaceus)
2010-01-01
Background Japanese flounder (Paralichthys olivaceus) is one of the most economically important marine species in Northeast Asia. Information on genetic markers associated with quantitative trait loci (QTL) can be used in breeding programs to identify and select individuals carrying desired traits. Commercial production of Japanese flounder could be increased by developing disease-resistant fish and improving commercially important traits. Previous maps have been constructed with AFLP markers and a limited number of microsatellite markers. In this study, improved genetic linkage maps are presented. In contrast with previous studies, these maps were built mainly with a large number of codominant markers so they can potentially be used to analyze different families and populations. Results Sex-specific genetic linkage maps were constructed for the Japanese flounder including a total of 1,375 markers [1,268 microsatellites, 105 single nucleotide polymorphisms (SNPs) and two genes]; 1,167 markers are linked to the male map and 1,067 markers are linked to the female map. The lengths of the male and female maps are 1,147.7 cM and 833.8 cM, respectively. Based on estimations of map lengths, the female and male maps covered 79 and 82% of the genome, respectively. Recombination ratio in the new maps revealed F:M of 1:0.7. All linkage groups in the maps presented large differences in the location of sex-specific recombination hot-spots. Conclusions The improved genetic linkage maps are very useful for QTL analyses and marker-assisted selection (MAS) breeding programs for economically important traits in Japanese flounder. In addition, SNP flanking sequences were blasted against Tetraodon nigroviridis (puffer fish) and Danio rerio (zebrafish), and synteny analysis has been carried out. The ability to detect synteny among species or genera based on homology analysis of SNP flanking sequences may provide opportunities to complement initial QTL experiments with candidate gene approaches from homologous chromosomal locations identified in related model organisms. PMID:20937088
A saturated SSR/DArT linkage map of Musa acuminata addressing genome rearrangements among bananas.
Hippolyte, Isabelle; Bakry, Frederic; Seguin, Marc; Gardes, Laetitia; Rivallan, Ronan; Risterucci, Ange-Marie; Jenny, Christophe; Perrier, Xavier; Carreel, Françoise; Argout, Xavier; Piffanelli, Pietro; Khan, Imtiaz A; Miller, Robert N G; Pappas, Georgios J; Mbéguié-A-Mbéguié, Didier; Matsumoto, Takashi; De Bernardinis, Veronique; Huttner, Eric; Kilian, Andrzej; Baurens, Franc-Christophe; D'Hont, Angélique; Cote, François; Courtois, Brigitte; Glaszmann, Jean-Christophe
2010-04-13
The genus Musa is a large species complex which includes cultivars at diploid and triploid levels. These sterile and vegetatively propagated cultivars are based on the A genome from Musa acuminata, exclusively for sweet bananas such as Cavendish, or associated with the B genome (Musa balbisiana) in cooking bananas such as Plantain varieties. In M. acuminata cultivars, structural heterozygosity is thought to be one of the main causes of sterility, which is essential for obtaining seedless fruits but hampers breeding. Only partial genetic maps are presently available due to chromosomal rearrangements within the parents of the mapping populations. This causes large segregation distortions inducing pseudo-linkages and difficulties in ordering markers in the linkage groups. The present study aims at producing a saturated linkage map of M. acuminata, taking into account hypotheses on the structural heterozygosity of the parents. An F1 progeny of 180 individuals was obtained from a cross between two genetically distant accessions of M. acuminata, 'Borneo' and 'Pisang Lilin' (P. Lilin). Based on the gametic recombination of each parent, two parental maps composed of SSR and DArT markers were established. A significant proportion of the markers (21.7%) deviated (p < 0.05) from the expected Mendelian ratios. These skewed markers were distributed in different linkage groups for each parent. To solve some complex ordering of the markers on linkage groups, we associated tools such as tree-like graphic representations, recombination frequency statistics and cytogenetical studies to identify structural rearrangements and build parsimonious linkage group order. An illustration of such an approach is given for the P. Lilin parent. We propose a synthetic map with 11 linkage groups containing 489 markers (167 SSRs and 322 DArTs) covering 1197 cM. This first saturated map is proposed as a "reference Musa map" for further analyses. We also propose two complete parental maps with interpretations of structural rearrangements localized on the linkage groups. The structural heterozygosity in P. Lilin is hypothesized to result from a duplication likely accompanied by an inversion on another chromosome. This paper also illustrates a methodological approach, transferable to other species, to investigate the mapping of structural rearrangements and determine their consequences on marker segregation.
Ajmal, M; Zafar, S; Hameed, A
2016-01-01
ABSTRACT Clinical anophthalmia is a rare inherited disease of the eye and phenotype refers to the absence of ocular tissue in the orbit of eye. Patients may have unilateral or bilateral anophthalmia, and generally have short palpebral fissures and small orbits. Anophthalmia may be isolated or associated with a broader syndrome and may have genetic or environmental causes. However, genetic cause has been defined in only a small proportion of cases, therefore, a consanguineous Pakistani family of the Pashtoon ethnic group, with isolated clinical anophthalmia was investigated using linkage mapping. A family pedigree was created to trace the possible mode of inheritance of the disease. Blood samples were collected from affected as well as normal members of this family, and screened for disease-associated mutations. This family was analyzed for linkage to all the known loci of clinical anophthalmia, using microsatellite short tandem repeat (STR) markers. Direct sequencing was performed to find out disease-associated mutations in the candidate gene. This family with isolated clinical anophthalmia, was mapped to the SOX2 gene that is located at chromosome 3q26.3-q27. However, on exonic and regulatory regions mutation screening of the SOX2 gene, the disease-associated mutation was not identified. It showed that another gene responsible for development of the eye might be present at chromosome 3q26.3-q27 and needs to be identified and screened for the disease-associated mutation in this family. PMID:27785411
Quantitative genetic-interaction mapping in mammalian cells
Roguev, Assen; Talbot, Dale; Negri, Gian Luca; Shales, Michael; Cagney, Gerard; Bandyopadhyay, Sourav; Panning, Barbara; Krogan, Nevan J
2013-01-01
Mapping genetic interactions (GIs) by simultaneously perturbing pairs of genes is a powerful tool for understanding complex biological phenomena. Here we describe an experimental platform for generating quantitative GI maps in mammalian cells using a combinatorial RNA interference strategy. We performed ~11,000 pairwise knockdowns in mouse fibroblasts, focusing on 130 factors involved in chromatin regulation to create a GI map. Comparison of the GI and protein-protein interaction (PPI) data revealed that pairs of genes exhibiting positive GIs and/or similar genetic profiles were predictive of the corresponding proteins being physically associated. The mammalian GI map identified pathways and complexes but also resolved functionally distinct submodules within larger protein complexes. By integrating GI and PPI data, we created a functional map of chromatin complexes in mouse fibroblasts, revealing that the PAF complex is a central player in the mammalian chromatin landscape. PMID:23407553
Wu, Zhikun; Wang, Bo; Chen, Xun; Wu, Jiangsheng; King, Graham J; Xiao, Yingjie; Liu, Kede
2016-01-01
High-density genetic markers are the prerequisite for understanding linkage disequilibrium (LD) and genome-wide association studies (GWASs) of complex traits in crops. To evaluate the LD pattern in oilseed rape, we sequenced a previous association panel containing 189 B. napus inbred lines using double-digested restriction-site associated DNA (ddRAD) and genotyped 19,327 RAD tags. A total of 15,921 RAD tags were assigned to a published genetic linkage map and the majority (71.1%) of these tags was uniquely mapped to the draft reference genome "Darmor-bzh." The distance of LD decay was 1,214 kb across the genome at the background level (r2 = 0.26), with the distances of LD decay being 405 kb and 2,111 kb in the A and C subgenomes, respectively. A total of 361 haplotype blocks with length > 100 kb were identified in the entire genome. The association panel could be classified into two groups, P1 and P2, which are essentially consistent with the geographical origins of varieties. A large number of group-specific haplotypes were identified, reflecting that varieties in the P1 and P2 groups experienced distinct selection in breeding programs to adapt their different growth habitats. GWAS repeatedly detected two loci significantly associated with oil content of seeds based on the developed SNPs, suggesting that the high-density SNPs were useful for understanding the genetic determinants of complex traits in GWAS.
Single Nucleotide Polymorphism Markers for Genetic Mapping in Drosophila melanogaster
Hoskins, Roger A.; Phan, Alexander C.; Naeemuddin, Mohammed; Mapa, Felipa A.; Ruddy, David A.; Ryan, Jessica J.; Young, Lynn M.; Wells, Trent; Kopczynski, Casey; Ellis, Michael C.
2001-01-01
For nearly a century, genetic analysis in Drosophila melanogaster has been a powerful tool for analyzing gene function, yet Drosophila lacks the molecular genetic mapping tools that recently have revolutionized human, mouse, and plant genetics. Here, we describe the systematic characterization of a dense set of molecular markers in Drosophila by using a sequence tagged site-based physical map of the genome. We identify 474 biallelic markers in standard laboratory strains of Drosophila that span the genome. Most of these markers are single nucleotide polymorphisms and sequences for these variants are provided in an accessible format. The average density of the new markers is one per 225 kb on the autosomes and one per megabase on the X chromosome. We include in this survey a set of P-element strains that provide additional use for high-resolution mapping. We show one application of the new markers in a simple set of crosses to map a mutation in the hedgehog gene to an interval of <1 Mb. This new map resource significantly increases the efficiency and resolution of recombination mapping and will be of immediate value to the Drosophila research community. PMID:11381036
Single nucleotide polymorphism markers for genetic mapping in Drosophila melanogaster
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hoskins, Roger A.; Phan, Alexander C.; Naeemuddin, Mohammed
2001-04-16
For nearly a century, genetic analysis in Drosophila melanogaster has been a powerful tool for analyzing gene function, yet Drosophila lacks the molecular genetic mapping tools that have recently revolutionized human, mouse and plant genetics. Here, we describe the systematic characterization of a dense set of molecular markers in Drosophila using an STS-based physical map of the genome. We identify 474 biallelic markers in standard laboratory strains of Drosophila that the genome. The majority of these markers are single nucleotide polymorphisms (SNPs) and sequences for these variants are provided in an accessible format. The average density of the new markersmore » is 1 marker per 225 kb on the autosomes and 1 marker per 1 Mb on the X chromosome. We include in this survey a set of P-element strains that provide additional utility for high-resolution mapping. We demonstrate one application of the new markers in a simple set of crosses to map a mutation in the hedgehog gene to an interval of <1 Mb. This new map resource significantly increases the efficiency and resolution of recombination mapping and will be of immediate value to the Drosophila research community.« less
2011-01-01
Background Over recent years, a growing effort has been made to develop microsatellite markers for the genomic analysis of the common bean (Phaseolus vulgaris) to broaden the knowledge of the molecular genetic basis of this species. The availability of large sets of expressed sequence tags (ESTs) in public databases has given rise to an expedient approach for the identification of SSRs (Simple Sequence Repeats), specifically EST-derived SSRs. In the present work, a battery of new microsatellite markers was obtained from a search of the Phaseolus vulgaris EST database. The diversity, degree of transferability and polymorphism of these markers were tested. Results From 9,583 valid ESTs, 4,764 had microsatellite motifs, from which 377 were used to design primers, and 302 (80.11%) showed good amplification quality. To analyze transferability, a group of 167 SSRs were tested, and the results showed that they were 82% transferable across at least one species. The highest amplification rates were observed between the species from the Phaseolus (63.7%), Vigna (25.9%), Glycine (19.8%), Medicago (10.2%), Dipterix (6%) and Arachis (1.8%) genera. The average PIC (Polymorphism Information Content) varied from 0.53 for genomic SSRs to 0.47 for EST-SSRs, and the average number of alleles per locus was 4 and 3, respectively. Among the 315 newly tested SSRs in the BJ (BAT93 X Jalo EEP558) population, 24% (76) were polymorphic. The integration of these segregant loci into a framework map composed of 123 previously obtained SSR markers yielded a total of 199 segregant loci, of which 182 (91.5%) were mapped to 14 linkage groups, resulting in a map length of 1,157 cM. Conclusions A total of 302 newly developed EST-SSR markers, showing good amplification quality, are available for the genetic analysis of Phaseolus vulgaris. These markers showed satisfactory rates of transferability, especially between species that have great economic and genomic values. Their diversity was comparable to genomic SSRs, and they were incorporated in the common bean reference genetic map, which constitutes an important contribution to and advance in Phaseolus vulgaris genomic research. PMID:21554695
Ge, Y; Li, X; Yang, X X; Cui, C S; Qu, S P
2015-05-22
Cucurbita maxima is one of the most widely cultivated vegetables in China and exhibits distinct morphological characteristics. In this study, genetic linkage analysis with 57 simple-sequence repeats, 21 amplified fragment length polymorphisms, 3 random-amplified polymorphic DNA, and one morphological marker revealed 20 genetic linkage groups of C. maxima covering a genetic distance of 991.5 cM with an average of 12.1 cM between adjacent markers. Genetic linkage analysis identified the simple-sequence repeat marker 'PU078072' 5.9 cM away from the locus 'Rc', which controls rind color. The genetic map in the present study will be useful for better mapping, tagging, and cloning of quantitative trait loci/gene(s) affecting economically important traits and for breeding new varieties of C. maxima through marker-assisted selection.
Chen, Jiayu; Calhoun, Vince D.; Pearlson, Godfrey D.; Perrone-Bizzozero, Nora; Sui, Jing; Turner, Jessica A.; Bustillo, Juan R; Ehrlich, Stefan; Sponheim, Scott R.; Cañive, José M.; Ho, Beng-Choon; Liu, Jingyu
2013-01-01
One application of imaging genomics is to explore genetic variants associated with brain structure and function, presenting a new means of mapping genetic influences on mental disorders. While there is growing interest in performing genome-wide searches for determinants, it remains challenging to identify genetic factors of small effect size, especially in limited sample sizes. In an attempt to address this issue, we propose to take advantage of a priori knowledge, specifically to extend parallel independent component analysis (pICA) to incorporate a reference (pICA-R), aiming to better reveal relationships between hidden factors of a particular attribute. The new approach was first evaluated on simulated data for its performance under different configurations of effect size and dimensionality. Then pICA-R was applied to a 300-participant (140 schizophrenia (SZ) patients versus 160 healthy controls) dataset consisting of structural magnetic resonance imaging (sMRI) and single nucleotide polymorphism (SNP) data. Guided by a reference SNP set derived from ANK3, a gene implicated by the Psychiatric Genomic Consortium SZ study, pICA-R identified one pair of SNP and sMRI components with a significant loading correlation of 0.27 (p = 1.64×10−6). The sMRI component showed a significant group difference in loading parameters between patients and controls (p = 1.33×10−15), indicating SZ-related reduction in gray matter concentration in prefrontal and temporal regions. The linked SNP component also showed a group difference (p = 0.04) and was predominantly contributed to by 1,030 SNPs. The effect of these top contributing SNPs was verified using association test results of the Psychiatric Genomic Consortium SZ study, where the 1,030 SNPs exhibited significant SZ enrichment compared to the whole genome. In addition, pathway analyses indicated the genetic component majorly relating to neurotransmitter and nervous system signaling pathways. Given the simulation and experiment results, pICA-R may prove a promising multivariate approach for use in imaging genomics to discover reliable genetic risk factors under a scenario of relatively high dimensionality and small effect size. PMID:23727316
Genetic architecture of natural variation in Drosophila melanogaster aggressive behavior
Shorter, John; Couch, Charlene; Huang, Wen; Carbone, Mary Anna; Peiffer, Jason; Anholt, Robert R. H.; Mackay, Trudy F. C.
2015-01-01
Aggression is an evolutionarily conserved complex behavior essential for survival and the organization of social hierarchies. With the exception of genetic variants associated with bioamine signaling, which have been implicated in aggression in many species, the genetic basis of natural variation in aggression is largely unknown. Drosophila melanogaster is a favorable model system for exploring the genetic basis of natural variation in aggression. Here, we performed genome-wide association analyses using the inbred, sequenced lines of the Drosophila melanogaster Genetic Reference Panel (DGRP) and replicate advanced intercross populations derived from the most and least aggressive DGRP lines. We identified genes that have been previously implicated in aggressive behavior as well as many novel loci, including gustatory receptor 63a (Gr63a), which encodes a subunit of the receptor for CO2, and genes associated with development and function of the nervous system. Although genes from the two association analyses were largely nonoverlapping, they mapped onto a genetic interaction network inferred from an analysis of pairwise epistasis in the DGRP. We used mutations and RNAi knock-down alleles to functionally validate 79% of the candidate genes and 75% of the candidate epistatic interactions tested. Epistasis for aggressive behavior causes cryptic genetic variation in the DGRP that is revealed by changing allele frequencies in the outbred populations derived from extreme DGRP lines. This phenomenon may pertain to other fitness traits and species, with implications for evolution, applied breeding, and human genetics. PMID:26100892
USDA-ARS?s Scientific Manuscript database
Elymus L. is the largest and most complex genus in the Triticeae with approximately 150 polyploid perennial grass species occurring worldwide. We report here the first genetic linkage map for Elymus. Backcross mapping populations were created by crossing caespitose Elymus wawawaiensis (EW) (Snake ...
Monroe, J Grey; Allen, Zachariah A; Tanger, Paul; Mullen, Jack L; Lovell, John T; Moyers, Brook T; Whitley, Darrell; McKay, John K
2017-01-01
Recent advances in nucleic acid sequencing technologies have led to a dramatic increase in the number of markers available to generate genetic linkage maps. This increased marker density can be used to improve genome assemblies as well as add much needed resolution for loci controlling variation in ecologically and agriculturally important traits. However, traditional genetic map construction methods from these large marker datasets can be computationally prohibitive and highly error prone. We present TSPmap , a method which implements both approximate and exact Traveling Salesperson Problem solvers to generate linkage maps. We demonstrate that for datasets with large numbers of genomic markers (e.g. 10,000) and in multiple population types generated from inbred parents, TSPmap can rapidly produce high quality linkage maps with low sensitivity to missing and erroneous genotyping data compared to two other benchmark methods, JoinMap and MSTmap . TSPmap is open source and freely available as an R package. With the advancement of low cost sequencing technologies, the number of markers used in the generation of genetic maps is expected to continue to rise. TSPmap will be a useful tool to handle such large datasets into the future, quickly producing high quality maps using a large number of genomic markers.
THREaD Mapper Studio: a novel, visual web server for the estimation of genetic linkage maps
Cheema, Jitender; Ellis, T. H. Noel; Dicks, Jo
2010-01-01
The estimation of genetic linkage maps is a key component in plant and animal research, providing both an indication of the genetic structure of an organism and a mechanism for identifying candidate genes associated with traits of interest. Because of this importance, several computational solutions to genetic map estimation exist, mostly implemented as stand-alone software packages. However, the estimation process is often largely hidden from the user. Consequently, problems such as a program crashing may occur that leave a user baffled. THREaD Mapper Studio (http://cbr.jic.ac.uk/threadmapper) is a new web site that implements a novel, visual and interactive method for the estimation of genetic linkage maps from DNA markers. The rationale behind the web site is to make the estimation process as transparent and robust as possible, while also allowing users to use their expert knowledge during analysis. Indeed, the 3D visual nature of the tool allows users to spot features in a data set, such as outlying markers and potential structural rearrangements that could cause problems with the estimation procedure and to account for them in their analysis. Furthermore, THREaD Mapper Studio facilitates the visual comparison of genetic map solutions from third party software, aiding users in developing robust solutions for their data sets. PMID:20494977
N'Diaye, Amidou; Haile, Jemanesh K; Fowler, D Brian; Ammar, Karim; Pozniak, Curtis J
2017-01-01
Advances in sequencing and genotyping methods have enable cost-effective production of high throughput single nucleotide polymorphism (SNP) markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called 'large p, small n' problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that excess of co-segregating markers can lead to map expansion, but has little effect on markers order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers. With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase making map expansion unavoidable. Therefore, we suggest developers improve linkage mapping algorithms for efficient analysis of high-throughput data. This study outlines a practical strategy to estimate the IF due to the proportion of co-segregating markers and outlines a method to scale the length of the map accordingly.
N’Diaye, Amidou; Haile, Jemanesh K.; Fowler, D. Brian; Ammar, Karim; Pozniak, Curtis J.
2017-01-01
Advances in sequencing and genotyping methods have enable cost-effective production of high throughput single nucleotide polymorphism (SNP) markers, making them the choice for linkage mapping. As a result, many laboratories have developed high-throughput SNP assays and built high-density genetic maps. However, the number of markers may, by orders of magnitude, exceed the resolution of recombination for a given population size so that only a minority of markers can accurately be ordered. Another issue attached to the so-called ‘large p, small n’ problem is that high-density genetic maps inevitably result in many markers clustering at the same position (co-segregating markers). While there are a number of related papers, none have addressed the impact of co-segregating markers on genetic maps. In the present study, we investigated the effects of co-segregating markers on high-density genetic map length and marker order using empirical data from two populations of wheat, Mohawk × Cocorit (durum wheat) and Norstar × Cappelle Desprez (bread wheat). The maps of both populations consisted of 85% co-segregating markers. Our study clearly showed that excess of co-segregating markers can lead to map expansion, but has little effect on markers order. To estimate the inflation factor (IF), we generated a total of 24,473 linkage maps (8,203 maps for Mohawk × Cocorit and 16,270 maps for Norstar × Cappelle Desprez). Using seven machine learning algorithms, we were able to predict with an accuracy of 0.7 the map expansion due to the proportion of co-segregating markers. For example in Mohawk × Cocorit, with 10 and 80% co-segregating markers the length of the map inflated by 4.5 and 16.6%, respectively. Similarly, the map of Norstar × Cappelle Desprez expanded by 3.8 and 11.7% with 10 and 80% co-segregating markers. With the increasing number of markers on SNP-chips, the proportion of co-segregating markers in high-density maps will continue to increase making map expansion unavoidable. Therefore, we suggest developers improve linkage mapping algorithms for efficient analysis of high-throughput data. This study outlines a practical strategy to estimate the IF due to the proportion of co-segregating markers and outlines a method to scale the length of the map accordingly. PMID:28878789
2010-01-01
Background Genetic markers and linkage mapping are basic prerequisites for marker-assisted selection and map-based cloning. In the case of the key grassland species Lolium spp., numerous mapping populations have been developed and characterised for various traits. Although some genetic linkage maps of these populations have been aligned with each other using publicly available DNA markers, the number of common markers among genetic maps is still low, limiting the ability to compare candidate gene and QTL locations across germplasm. Results A set of 204 expressed sequence tag (EST)-derived simple sequence repeat (SSR) markers has been assigned to map positions using eight different ryegrass mapping populations. Marker properties of a subset of 64 EST-SSRs were assessed in six to eight individuals of each mapping population and revealed 83% of the markers to be polymorphic in at least one population and an average number of alleles of 4.88. EST-SSR markers polymorphic in multiple populations served as anchor markers and allowed the construction of the first comprehensive consensus map for ryegrass. The integrated map was complemented with 97 SSRs from previously published linkage maps and finally contained 284 EST-derived and genomic SSR markers. The total map length was 742 centiMorgan (cM), ranging for individual chromosomes from 70 cM of linkage group (LG) 6 to 171 cM of LG 2. Conclusions The consensus linkage map for ryegrass based on eight mapping populations and constructed using a large set of publicly available Lolium EST-SSRs mapped for the first time together with previously mapped SSR markers will allow for consolidating existing mapping and QTL information in ryegrass. Map and markers presented here will prove to be an asset in the development for both molecular breeding of ryegrass as well as comparative genetics and genomics within grass species. PMID:20712870
Berdugo-Cely, Jhon; Valbuena, Raúl Iván; Sánchez-Betancourt, Erika; Barrero, Luz Stella; Yockteng, Roxana
2017-01-01
The potato (Solanum tuberosum L.) is the fourth most important crop food in the world and Colombia has one of the most important collections of potato germplasm in the world (the Colombian Central Collection-CCC). Little is known about its potential as a source of genetic diversity for molecular breeding programs. In this study, we analyzed 809 Andigenum group accessions from the CCC using 5968 SNPs to determine: 1) the genetic diversity and population structure of the Andigenum germplasm and 2) the usefulness of this collection to map qualitative traits across the potato genome. The genetic structure analysis based on principal components, cluster analyses, and Bayesian inference revealed that the CCC can be subdivided into two main groups associated with their ploidy level: Phureja (diploid) and Andigena (tetraploid). The Andigena population was more genetically diverse but less genetically substructured than the Phureja population (three vs. five subpopulations, respectively). The association mapping analysis of qualitative morphological data using 4666 SNPs showed 23 markers significantly associated with nine morphological traits. The present study showed that the CCC is a highly diverse germplasm collection genetically and phenotypically, useful to implement association mapping in order to identify genes related to traits of interest and to assist future potato genetic breeding programs.
Berdugo-Cely, Jhon; Valbuena, Raúl Iván; Sánchez-Betancourt, Erika; Barrero, Luz Stella
2017-01-01
The potato (Solanum tuberosum L.) is the fourth most important crop food in the world and Colombia has one of the most important collections of potato germplasm in the world (the Colombian Central Collection-CCC). Little is known about its potential as a source of genetic diversity for molecular breeding programs. In this study, we analyzed 809 Andigenum group accessions from the CCC using 5968 SNPs to determine: 1) the genetic diversity and population structure of the Andigenum germplasm and 2) the usefulness of this collection to map qualitative traits across the potato genome. The genetic structure analysis based on principal components, cluster analyses, and Bayesian inference revealed that the CCC can be subdivided into two main groups associated with their ploidy level: Phureja (diploid) and Andigena (tetraploid). The Andigena population was more genetically diverse but less genetically substructured than the Phureja population (three vs. five subpopulations, respectively). The association mapping analysis of qualitative morphological data using 4666 SNPs showed 23 markers significantly associated with nine morphological traits. The present study showed that the CCC is a highly diverse germplasm collection genetically and phenotypically, useful to implement association mapping in order to identify genes related to traits of interest and to assist future potato genetic breeding programs. PMID:28257509
Chicken microsatellite markers isolated from libraries enriched for simple tandem repeats.
Gibbs, M; Dawson, D A; McCamley, C; Wardle, A F; Armour, J A; Burke, T
1997-12-01
The total number of microsatellite loci is considered to be at least 10-fold lower in avian species than in mammalian species. Therefore, efficient large-scale cloning of chicken microsatellites, as required for the construction of a high-resolution linkage map, is facilitated by the construction of libraries using an enrichment strategy. In this study, a plasmid library enriched for tandem repeats was constructed from chicken genomic DNA by hybridization selection. Using this technique the proportion of recombinant clones that cross-hybridized to probes containing simple tandem repeats was raised to 16%, compared with < 0.1% in a non-enriched library. Primers were designed from 121 different sequences. Polymerase chain reaction (PCR) analysis of two chicken reference pedigrees enabled 72 loci to be localized within the collaborative chicken genetic map, and at least 30 of the remaining loci have been shown to be informative in these or other crosses.
Mapping the genetic diversity of HLA haplotypes in the Japanese populations
Saw, Woei-Yuh; Liu, Xuanyao; Khor, Chiea-Chuen; Takeuchi, Fumihiko; Katsuya, Tomohiro; Kimura, Ryosuke; Nabika, Toru; Ohkubo, Takayoshi; Tabara, Yasuharu; Yamamoto, Ken; Yokota, Mitsuhiro; Akiyama, Koichi; Asano, Hiroyuki; Asayama, Kei; Haga, Toshikazu; Hara, Azusa; Hirose, Takuo; Hosaka, Miki; Ichihara, Sahoko; Imai, Yutaka; Inoue, Ryusuke; Ishiguro, Aya; Isomura, Minoru; Isono, Masato; Kamide, Kei; Kato, Norihiro; Katsuya, Tomohiro; Kikuya, Masahiro; Kohara, Katsuhiko; Matsubara, Tatsuaki; Matsuda, Ayako; Metoki, Hirohito; Miki, Tetsuro; Murakami, Keiko; Nabika, Toru; Nakatochi, Masahiro; Ogihara, Toshio; Ohnaka, Keizo; Ohkubo, Takayoshi; Rakugi, Hiromi; Satoh, Michihiro; Shiwaku, Kunihiro; Sugimoto, Ken; Tabara, Yasuharu; Takami, Yoichi; Takayanagi, Ryoichi; Takeuchi, Fumihiko; Tsubota-Utsugi, Megumi; Yamamoto, Ken; Yamamoto, Koichi; Yamasaki, Masayuki; Yasui, Daisaku; Yokota, Mitsuhiro; Teo, Yik-Ying; Kato, Norihiro
2015-01-01
Japan has often been viewed as an Asian country that possesses a genetically homogenous community. The basis for partitioning the country into prefectures has largely been geographical, although cultural and linguistic differences still exist between some of the districts/prefectures, especially between Okinawa and the mainland prefectures. The Major Histocompatibility Complex (MHC) region has consistently emerged as the most polymorphic region in the human genome, harbouring numerous biologically important variants; nevertheless the presence of population-specific long haplotypes hinders the imputation of SNPs and classical HLA alleles. Here, we examined the extent of genetic variation at the MHC between eight Japanese populations sampled from Okinawa, and six other prefectures located in or close to the mainland of Japan, specifically focusing at the haplotypes observed within each population, and what the impact of any variation has on imputation. Our results indicated that Okinawa was genetically farther to the mainland Japanese than were Gujarati Indians from Tamil Indians, while the mainland Japanese from six prefectures were more homogeneous than between northern and southern Han Chinese. The distribution of haplotypes across Japan was similar, although imputation was most accurate for Okinawa and several mainland prefectures when population-specific panels were used as reference. PMID:26648100
Knoll, A T; Jiang, K; Levitt, P
2018-06-01
Humans exhibit broad heterogeneity in affiliative social behavior. Twin and family studies show that individual differences in core dimensions of social behavior are heritable, yet there are knowledge gaps in understanding the underlying genetic and neurobiological mechanisms. Animal genetic reference panels (GRPs) provide a tractable strategy for examining the behavioral and genetic architecture of complex traits. Here, using males from 50 mouse strains from the BXD GRP, 4 domains of affiliative social behavior-social approach, social recognition, direct social interaction (DSI) (partner sniffing) and vocal communication-were examined in 2 widely used behavioral tasks-the 3-chamber and DSI tasks. There was continuous and broad variation in social and nonsocial traits, with moderate to high heritability of social approach sniff preference (0.31), ultrasonic vocalization (USV) count (0.39), partner sniffing (0.51), locomotor activity (0.54-0.66) and anxiety-like behavior (0.36). Principal component analysis shows that variation in social and nonsocial traits are attributable to 5 independent factors. Genome-wide mapping identified significant quantitative trait loci for USV count on chromosome (Chr) 18 and locomotor activity on Chr X, with suggestive loci and candidate quantitative trait genes identified for all traits with one notable exception-partner sniffing in the DSI task. The results show heritable variation in sociability, which is independent of variation in activity and anxiety-like traits. In addition, a highly heritable and ethological domain of affiliative sociability-partner sniffing-appears highly polygenic. These findings establish a basis for identifying functional natural variants, leading to a new understanding typical and atypical sociability. © 2017 The Authors. Genes, Brain and Behavior published by International Behavioural and Neural Genetics Society and John Wiley & Sons Ltd.
Genetic ancestry is associated with colorectal adenomas and adenocarcinomas in Latino populations
Hernandez-Suarez, Gustavo; Sanabria, Maria Carolina; Serrano, Marta; Herran, Oscar F; Perez, Jesus; Plata, Jose L; Zabaleta, Jovanny; Tenesa, Albert
2014-01-01
Colorectal cancer rates in Latin American countries are less than half of those observed in the United States. Latin Americans are the resultant of generations of an admixture of Native American, European, and African individuals. The potential role of genetic admixture in colorectal carcinogenesis has not been examined. We evaluate the association of genetic ancestry with colorectal neoplasms in 190 adenocarcinomas, 113 sporadic adenomas and 243 age- and sex-matched controls enrolled in a multicentric case–control study in Colombia. Individual ancestral genetic fractions were estimated using the STRUCTURE software, based on allele frequencies and assuming three distinct population origins. We used the Illumina Cancer Panel to genotype 1,421 sparse single-nucleotide polymorphisms (SNPs), and Northern and Western European ancestry, LWJ and Han Chinese in Beijing, China populations from the HapMap project as references. A total of 678 autosomal SNPs overlapped with the HapMap data set SNPs and were used for ancestry estimations. African mean ancestry fraction was higher in adenomas (0.13, 95% confidence interval (95% CI)=0.11–0.15) and cancer cases (0.14, 95% CI=0.12–0.16) compared with controls (0.11, 95% CI=0.10–0.12). Conditional logistic regression analysis, controlling for known risk factors, showed a positive association of African ancestry per 10% increase with both colorectal adenoma (odds ratio (OR)=1.12, 95% CI=0.97–1.30) and adenocarcinoma (OR=1.19, 95% CI=1.05–1.35). In conclusion, increased African ancestry (or variants linked to it) contributes to the increased susceptibility of colorectal cancer in admixed Latin American population. PMID:24518838
Turner, Leslie M; Harr, Bettina
2014-12-09
Mapping hybrid defects in contact zones between incipient species can identify genomic regions contributing to reproductive isolation and reveal genetic mechanisms of speciation. The house mouse features a rare combination of sophisticated genetic tools and natural hybrid zones between subspecies. Male hybrids often show reduced fertility, a common reproductive barrier between incipient species. Laboratory crosses have identified sterility loci, but each encompasses hundreds of genes. We map genetic determinants of testis weight and testis gene expression using offspring of mice captured in a hybrid zone between M. musculus musculus and M. m. domesticus. Many generations of admixture enables high-resolution mapping of loci contributing to these sterility-related phenotypes. We identify complex interactions among sterility loci, suggesting multiple, non-independent genetic incompatibilities contribute to barriers to gene flow in the hybrid zone.
Febrer, Melanie; Goicoechea, Jose Luis; Wright, Jonathan; McKenzie, Neil; Song, Xiang; Lin, Jinke; Collura, Kristi; Wissotski, Marina; Yu, Yeisoo; Ammiraju, Jetty S. S.; Wolny, Elzbieta; Idziak, Dominika; Betekhtin, Alexander; Kudrna, Dave; Hasterok, Robert; Wing, Rod A.; Bevan, Michael W.
2010-01-01
The pooid subfamily of grasses includes some of the most important crop, forage and turf species, such as wheat, barley and Lolium. Developing genomic resources, such as whole-genome physical maps, for analysing the large and complex genomes of these crops and for facilitating biological research in grasses is an important goal in plant biology. We describe a bacterial artificial chromosome (BAC)-based physical map of the wild pooid grass Brachypodium distachyon and integrate this with whole genome shotgun sequence (WGS) assemblies using BAC end sequences (BES). The resulting physical map contains 26 contigs spanning the 272 Mb genome. BES from the physical map were also used to integrate a genetic map. This provides an independent vaildation and confirmation of the published WGS assembly. Mapped BACs were used in Fluorescence In Situ Hybridisation (FISH) experiments to align the integrated physical map and sequence assemblies to chromosomes with high resolution. The physical, genetic and cytogenetic maps, integrated with whole genome shotgun sequence assemblies, enhance the accuracy and durability of this important genome sequence and will directly facilitate gene isolation. PMID:20976139
A Genetic Linkage Map of the Male Goat Genome
Vaiman, D.; Schibler, L.; Bourgeois, F.; Oustry, A.; Amigues, Y.; Cribiu, E. P.
1996-01-01
This paper presents a first genetic linkage map of the goat genome. Primers derived from the flanking sequences of 612 bovine, ovine and goat microsatellite markers were gathered and tested for amplification with goat DNA under standardized PCR conditions. This screen made it possible to choose a set of 55 polymorphic markers that can be used in the three species and to define a panel of 223 microsatellites suitable for the goat. Twelve half-sib paternal goat families were then used to build a linkage map of the goat genome. The linkage analysis made it possible to construct a meiotic map covering 2300 cM, i.e., >80% of the total estimated length of the goat genome. Moreover, eight cosmids containing microsatellites were mapped by fluorescence in situ hybridization in goat and sheep. Together with 11 microsatellite-containing cosmids previously mapped in cattle (and supposing conservation of the banding pattern between this species and the goat) and data from the sheep map, these results made the orientation of 15 linkage groups possible. Furthermore, 12 coding sequences were mapped either genetically or physically, providing useful data for comparative mapping. PMID:8878693
Kim, Jin-Hee; Chung, Il Kyung; Kim, Kyung-Min
2017-01-01
The Sweet potato, Ipomoea batatas (L.) Lam, is difficult to study in genetics and genomics because it is a hexaploid. The sweet potato study not have been performed domestically or internationally. In this study was performed to construct genetic map and quantitative trait loci (QTL) analysis. A total of 245 EST-SSR markers were developed, and the map was constructed by using 210 of those markers. The total map length was 1508.1 cM, and the mean distance between markers was 7.2 cM. Fifteen characteristics were investigated for QTLs analysis. According to those, the Four QTLs were identified, and The LOD score was 3.0. Further studies need to develop molecular markers in terms of EST-SSR markers for doing to be capable of efficient breeding. The genetic map created here using EST-SSR markers will facilitate planned breeding of sweet potato cultivars with various desirable traits.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bennetzen, Jeffrey L; Yang, Xiaohan; Ye, Chuyu
We generated a high-quality reference genome sequence for foxtail millet (Setaria italica). The {approx}400-Mb assembly covers {approx}80% of the genome and >95% of the gene space. The assembly was anchored to a 992-locus genetic map and was annotated by comparison with >1.3 million expressed sequence tag reads. We produced more than 580 million RNA-Seq reads to facilitate expression analyses. We also sequenced Setaria viridis, the ancestral wild relative of S. italica, and identified regions of differential single-nucleotide polymorphism density, distribution of transposable elements, small RNA content, chromosomal rearrangement and segregation distortion. The genus Setaria includes natural and cultivated species thatmore » demonstrate a wide capacity for adaptation. The genetic basis of this adaptation was investigated by comparing five sequenced grass genomes. We also used the diploid Setaria genome to evaluate the ongoing genome assembly of a related polyploid, switchgrass (Panicum virgatum).« less
Software for Quantifying and Simulating Microsatellite Genotyping Error
Johnson, Paul C.D.; Haydon, Daniel T.
2007-01-01
Microsatellite genetic marker data are exploited in a variety of fields, including forensics, gene mapping, kinship inference and population genetics. In all of these fields, inference can be thwarted by failure to quantify and account for data errors, and kinship inference in particular can benefit from separating errors into two distinct classes: allelic dropout and false alleles. Pedant is MS Windows software for estimating locus-specific maximum likelihood rates of these two classes of error. Estimation is based on comparison of duplicate error-prone genotypes: neither reference genotypes nor pedigree data are required. Other functions include: plotting of error rate estimates and confidence intervals; simulations for performing power analysis and for testing the robustness of error rate estimates to violation of the underlying assumptions; and estimation of expected heterozygosity, which is a required input. The program, documentation and source code are available from http://www.stats.gla.ac.uk/~paulj/pedant.html. PMID:20066126
Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content.
Tran, Hue T M; Ramaraj, Thiruvarangan; Furtado, Agnelo; Lee, Leonard Slade; Henry, Robert J
2018-03-07
Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high- or low-caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long-read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCOs were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms (SNPs) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway-based analysis, 65 caffeine-associated SNPs were discovered, among which 11 SNPs were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Ensemble Learning of QTL Models Improves Prediction of Complex Traits
Bian, Yang; Holland, James B.
2015-01-01
Quantitative trait locus (QTL) models can provide useful insights into trait genetic architecture because of their straightforward interpretability but are less useful for genetic prediction because of the difficulty in including the effects of numerous small effect loci without overfitting. Tight linkage between markers introduces near collinearity among marker genotypes, complicating the detection of QTL and estimation of QTL effects in linkage mapping, and this problem is exacerbated by very high density linkage maps. Here we developed a thinning and aggregating (TAGGING) method as a new ensemble learning approach to QTL mapping. TAGGING reduces collinearity problems by thinning dense linkage maps, maintains aspects of marker selection that characterize standard QTL mapping, and by ensembling, incorporates information from many more markers-trait associations than traditional QTL mapping. The objective of TAGGING was to improve prediction power compared with QTL mapping while also providing more specific insights into genetic architecture than genome-wide prediction models. TAGGING was compared with standard QTL mapping using cross validation of empirical data from the maize (Zea mays L.) nested association mapping population. TAGGING-assisted QTL mapping substantially improved prediction ability for both biparental and multifamily populations by reducing both the variance and bias in prediction. Furthermore, an ensemble model combining predictions from TAGGING-assisted QTL and infinitesimal models improved prediction abilities over the component models, indicating some complementarity between model assumptions and suggesting that some trait genetic architectures involve a mixture of a few major QTL and polygenic effects. PMID:26276383
Spigler, R B; Lewers, K S; Main, D S; Ashman, T-L
2008-12-01
The evolution of separate sexes (dioecy) from hermaphroditism is one of the major evolutionary transitions in plants, and this transition can be accompanied by the development of sex chromosomes. Studies in species with intermediate sexual systems are providing unprecedented insight into the initial stages of sex chromosome evolution. Here, we describe the genetic mechanism of sex determination in the octoploid, subdioecious wild strawberry, Fragaria virginiana Mill., based on a whole-genome simple sequence repeat (SSR)-based genetic map and on mapping sex determination as two qualitative traits, male and female function. The resultant total map length is 2373 cM and includes 212 markers on 42 linkage groups (mean marker spacing: 14 cM). We estimated that approximately 70 and 90% of the total F. virginiana genetic map resides within 10 and 20 cM of a marker on this map, respectively. Both sex expression traits mapped to the same linkage group, separated by approximately 6 cM, along with two SSR markers. Together, our phenotypic and genetic mapping results support a model of gender determination in subdioecious F. virginiana with at least two linked loci (or gene regions) with major effects. Reconstruction of parental genotypes at these loci reveals that both female and hermaphrodite heterogamety exist in this species. Evidence of recombination between the sex-determining loci, an important hallmark of incipient sex chromosomes, suggest that F. virginiana is an example of the youngest sex chromosome in plants and thus a novel model system for the study of sex chromosome evolution.
Liu, Dexin; Liu, Fang; Shan, Xiaoru; Zhang, Jian; Tang, Shiyi; Fang, Xiaomei; Liu, Xueying; Wang, Wenwen; Tan, Zhaoyun; Teng, Zhonghua; Zhang, Zhengsheng; Liu, Dajun
2015-10-01
Upland cotton plays a critical role not only in the textile industry, but also in the production of important secondary metabolites, such as oil and proteins. Construction of a high-density linkage map and identifying yield and seed trait quantitative trail loci (QTL) are prerequisites for molecular marker-assisted selective breeding projects. Here, we update a high-density upland cotton genetic map from recombinant inbred lines. A total of 25,313 SSR primer pairs were screened for polymorphism between Yumian 1 and T586, and 1712 SSR primer pairs were used to genotype the mapping population and construct a map. An additional 1166 loci have been added to our previously published map with 509 SSR markers. The updated genetic map spans a total recombinant length of 3338.2 cM and contains 1675 SSR loci and nine morphological markers, with an average interval of 1.98 cM between adjacent markers. Green lint (Lg) mapped on chromosome 15 in a previous report is mapped in an interval of 2.6 cM on chromosome 21. Based on the map and phenotypic data from multiple environments, 79 lint percentage and seed nutrient trait QTL are detected. These include 8 lint percentage, 13 crude protein, 15 crude oil, 8 linoleic, 10 oleic, 13 palmitic, and 12 stearic acid content QTL. They explain 3.5-62.7 % of the phenotypic variation observed. Four morphological markers identified have a major impact on lint percentage and cottonseed nutrients traits. In this study, our genetic map provides new sights into the tetraploid cotton genome. Furthermore, the stable QTL and morphological markers could be used for fine-mapping and map-based cloning.
An annotated genetic map of loblolly pine based on microsatellite and cDNA markers
USDA-ARS?s Scientific Manuscript database
Previous loblolly pine (Pinus taeda L.) genetic linkage maps have been based on a variety of DNA polymorphisms, such as AFLPs, RAPDs, RFLPs, and ESTPs, but only a few SSRs (simple sequence repeats), also known as simple tandem repeats or microsatellites, have been mapped in P. taeda. The objective o...
USDA-ARS?s Scientific Manuscript database
Genomics applications in durum (Triticum durum Desf.) wheat have the potential to boost exploitation of genetic resources and to advance understanding of the genetics of important complex traits (e.g. resilience to environmental and biotic stresses). A dense and accurate consensus map specific for ...
Comparative mapping in the Fagaceae and beyond with EST-SSRs
2012-01-01
Background Genetic markers and linkage mapping are basic prerequisites for comparative genetic analyses, QTL detection and map-based cloning. A large number of mapping populations have been developed for oak, but few gene-based markers are available for constructing integrated genetic linkage maps and comparing gene order and QTL location across related species. Results We developed a set of 573 expressed sequence tag-derived simple sequence repeats (EST-SSRs) and located 397 markers (EST-SSRs and genomic SSRs) on the 12 oak chromosomes (2n = 2x = 24) on the basis of Mendelian segregation patterns in 5 full-sib mapping pedigrees of two species: Quercus robur (pedunculate oak) and Quercus petraea (sessile oak). Consensus maps for the two species were constructed and aligned. They showed a high degree of macrosynteny between these two sympatric European oaks. We assessed the transferability of EST-SSRs to other Fagaceae genera and a subset of these markers was mapped in Castanea sativa, the European chestnut. Reasonably high levels of macrosynteny were observed between oak and chestnut. We also obtained diversity statistics for a subset of EST-SSRs, to support further population genetic analyses with gene-based markers. Finally, based on the orthologous relationships between the oak, Arabidopsis, grape, poplar, Medicago, and soybean genomes and the paralogous relationships between the 12 oak chromosomes, we propose an evolutionary scenario of the 12 oak chromosomes from the eudicot ancestral karyotype. Conclusions This study provides map locations for a large set of EST-SSRs in two oak species of recognized biological importance in natural ecosystems. This first step toward the construction of a gene-based linkage map will facilitate the assignment of future genome scaffolds to pseudo-chromosomes. This study also provides an indication of the potential utility of new gene-based markers for population genetics and comparative mapping within and beyond the Fagaceae. PMID:22931513
Blenda, Anna; Fang, David D.; Rami, Jean-François; Garsmeur, Olivier; Luo, Feng; Lacape, Jean-Marc
2012-01-01
A consensus genetic map of tetraploid cotton was constructed using six high-density maps and after the integration of a sequence-based marker redundancy check. Public cotton SSR libraries (17,343 markers) were curated for sequence redundancy using 90% as a similarity cutoff. As a result, 20% of the markers (3,410) could be considered as redundant with some other markers. The marker redundancy information had been a crucial part of the map integration process, in which the six most informative interspecific Gossypium hirsutum×G. barbadense genetic maps were used for assembling a high density consensus (HDC) map for tetraploid cotton. With redundant markers being removed, the HDC map could be constructed thanks to the sufficient number of collinear non-redundant markers in common between the component maps. The HDC map consists of 8,254 loci, originating from 6,669 markers, and spans 4,070 cM, with an average of 2 loci per cM. The HDC map presents a high rate of locus duplications, as 1,292 markers among the 6,669 were mapped in more than one locus. Two thirds of the duplications are bridging homoeologous AT and DT chromosomes constitutive of allopolyploid cotton genome, with an average of 64 duplications per AT/DT chromosome pair. Sequences of 4,744 mapped markers were used for a mutual blast alignment (BBMH) with the 13 major scaffolds of the recently released Gossypium raimondii genome indicating high level of homology between the diploid D genome and the tetraploid cotton genetic map, with only a few minor possible structural rearrangements. Overall, the HDC map will serve as a valuable resource for trait QTL comparative mapping, map-based cloning of important genes, and better understanding of the genome structure and evolution of tetraploid cotton. PMID:23029214
Solignac, Michel; Mougel, Florence; Vautrin, Dominique; Monnerot, Monique; Cornuet, Jean-Marie
2007-01-01
The honey bee is a key model for social behavior and this feature led to the selection of the species for genome sequencing. A genetic map is a necessary companion to the sequence. In addition, because there was originally no physical map for the honey bee genome project, a meiotic map was the only resource for organizing the sequence assembly on the chromosomes. We present the genetic (meiotic) map here and describe the main features that emerged from comparison with the sequence-based physical map. The genetic map of the honey bee is saturated and the chromosomes are oriented from the centromeric to the telomeric regions. The map is based on 2,008 markers and is about 40 Morgans (M) long, resulting in a marker density of one every 2.05 centiMorgans (cM). For the 186 megabases (Mb) of the genome mapped and assembled, this corresponds to a very high average recombination rate of 22.04 cM/Mb. Honey bee meiosis shows a relatively homogeneous recombination rate along and across chromosomes, as well as within and between individuals. Interference is higher than inferred from the Kosambi function of distance. In addition, numerous recombination hotspots are dispersed over the genome. The very large genetic length of the honey bee genome, its small physical size and an almost complete genome sequence with a relatively low number of genes suggest a very promising future for association mapping in the honey bee, particularly as the existence of haploid males allows easy bulk segregant analysis.
Taylor, Natalie; Long, Janet C; Debono, Deborah; Williams, Rachel; Salisbury, Elizabeth; O'Neill, Sharron; Eykman, Elizabeth; Braithwaite, Jeffrey; Chin, Melvin
2016-03-12
Lynch syndrome is an inherited disorder associated with a range of cancers, and found in 2-5 % of colorectal cancers. Lynch syndrome is diagnosed through a combination of significant family and clinical history and pathology. The definitive diagnostic germline test requires formal patient consent after genetic counselling. If diagnosed early, carriers of Lynch syndrome can undergo increased surveillance for cancers, which in turn can prevent late stage cancers, optimise treatment and decrease mortality for themselves and their relatives. However, over the past decade, international studies have reported that only a small proportion of individuals with suspected Lynch syndrome were referred for genetic consultation and possible genetic testing. The aim of this project is to use behaviour change theory and implementation science approaches to increase the number and speed of healthcare professional referrals of colorectal cancer patients with a high-likelihood risk of Lynch syndrome to appropriate genetic counselling services. The six-step Theoretical Domains Framework Implementation (TDFI) approach will be used at two large, metropolitan hospitals treating colorectal cancer patients. Steps are: 1) form local multidisciplinary teams to map current referral processes; 2) identify target behaviours that may lead to increased referrals using discussion supported by a retrospective audit; 3) identify barriers to those behaviours using the validated Influences on Patient Safety Behaviours Questionnaire and TDFI guided focus groups; 4) co-design interventions to address barriers using focus groups; 5) co-implement interventions; and 6) evaluate intervention impact. Chi square analysis will be used to test the difference in the proportion of high-likelihood risk Lynch syndrome patients being referred for genetic testing before and after intervention implementation. A paired t-test will be used to assess the mean time from the pathology test results to referral for high-likelihood Lynch syndrome patients pre-post intervention. Run charts will be used to continuously monitor change in referrals over time, based on scheduled monthly audits. This project is based on a tested and refined implementation strategy (TDFI approach). Enhancing the process of identifying and referring people at high-likelihood risk of Lynch syndrome for genetic counselling will improve outcomes for patients and their relatives, and potentially save public money.
Mapping Children--Mapping Space.
ERIC Educational Resources Information Center
Pick, Herbert L., Jr.
Research is underway concerning the way the perception, conception, and representation of spatial layout develops. Three concepts are important here--space itself, frame of reference, and cognitive map. Cognitive map refers to a form of representation of the behavioral space, not paired associate or serial response learning. Other criteria…
2013-01-01
Background Cucumber is an important vegetable crop that is susceptible to many pathogens, but no disease resistance (R) genes have been cloned. The availability of whole genome sequences provides an excellent opportunity for systematic identification and characterization of the nucleotide binding and leucine-rich repeat (NB-LRR) type R gene homolog (RGH) sequences in the genome. Cucumber has a very narrow genetic base making it difficult to construct high-density genetic maps. Development of a consensus map by synthesizing information from multiple segregating populations is a method of choice to increase marker density. As such, the objectives of the present study were to identify and characterize NB-LRR type RGHs, and to develop a high-density, integrated cucumber genetic-physical map anchored with RGH loci. Results From the Gy14 draft genome, 70 NB-containing RGHs were identified and characterized. Most RGHs were in clusters with uneven distribution across seven chromosomes. In silico analysis indicated that all 70 RGHs had EST support for gene expression. Phylogenetic analysis classified 58 RGHs into two clades: CNL and TNL. Comparative analysis revealed high-degree sequence homology and synteny in chromosomal locations of these RGH members between the cucumber and melon genomes. Fifty-four molecular markers were developed to delimit 67 of the 70 RGHs, which were integrated into a genetic map through linkage analysis. A 1,681-locus cucumber consensus map including 10 gene loci and spanning 730.0 cM in seven linkage groups was developed by integrating three component maps with a bin-mapping strategy. Physically, 308 scaffolds with 193.2 Mbp total DNA sequences were anchored onto this consensus map that covered 52.6% of the 367 Mbp cucumber genome. Conclusions Cucumber contains relatively few NB-LRR RGHs that are clustered and unevenly distributed in the genome. All RGHs seem to be transcribed and shared significant sequence homology and synteny with the melon genome suggesting conservation of these RGHs in the Cucumis lineage. The 1,681-locus consensus genetic-physical map developed and the RGHs identified and characterized herein are valuable genomics resources that may have many applications such as quantitative trait loci identification, map-based gene cloning, association mapping, marker-assisted selection, as well as assembly of a more complete cucumber genome. PMID:23531125
AD-LIBS: inferring ancestry across hybrid genomes using low-coverage sequence data.
Schaefer, Nathan K; Shapiro, Beth; Green, Richard E
2017-04-04
Inferring the ancestry of each region of admixed individuals' genomes is useful in studies ranging from disease gene mapping to speciation genetics. Current methods require high-coverage genotype data and phased reference panels, and are therefore inappropriate for many data sets. We present a software application, AD-LIBS, that uses a hidden Markov model to infer ancestry across hybrid genomes without requiring variant calling or phasing. This approach is useful for non-model organisms and in cases of low-coverage data, such as ancient DNA. We demonstrate the utility of AD-LIBS with synthetic data. We then use AD-LIBS to infer ancestry in two published data sets: European human genomes with Neanderthal ancestry and brown bear genomes with polar bear ancestry. AD-LIBS correctly infers 87-91% of ancestry in simulations and produces ancestry maps that agree with published results and global ancestry estimates in humans. In brown bears, we find more polar bear ancestry than has been published previously, using both AD-LIBS and an existing software application for local ancestry inference, HAPMIX. We validate AD-LIBS polar bear ancestry maps by recovering a geographic signal within bears that mirrors what is seen in SNP data. Finally, we demonstrate that AD-LIBS is more effective than HAPMIX at inferring ancestry when preexisting phased reference data are unavailable and genomes are sequenced to low coverage. AD-LIBS is an effective tool for ancestry inference that can be used even when few individuals are available for comparison or when genomes are sequenced to low coverage. AD-LIBS is therefore likely to be useful in studies of non-model or ancient organisms that lack large amounts of genomic DNA. AD-LIBS can therefore expand the range of studies in which admixture mapping is a viable tool.
Lack, Justin B; Cardeno, Charis M; Crepeau, Marc W; Taylor, William; Corbett-Detig, Russell B; Stevens, Kristian A; Langley, Charles H; Pool, John E
2015-04-01
Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets. Copyright © 2015 by the Genetics Society of America.
García-Arias, Francy L; Osorio-Guarín, Jaime A; Núñez Zarantes, Victor M
2018-01-01
Association mapping has been proposed as an efficient approach to assist plant breeding programs to investigate the genetic basis of agronomic traits. In this study, we evaluated 18 traits related to yield, (FWP, NF, FWI, and FWII), fruit size-shape (FP, FA, MW, WMH, MH, HMW, DI, FSI, FSII, OVO, OBO), and fruit quality (FIR, CF, and SST), in a diverse collection of 100 accessions of Physalis peruviana including wild, landraces, and anther culture derived lines. We identified seven accessions with suitable traits: fruit weight per plant (FWP) > 7,000 g/plant and cracked fruits (CF) < 4%, to be used as parents in cape gooseberry breeding program. In addition, the accessions were also characterized using Genotyping By Sequencing (GBS). We discovered 27,982 and 36,142 informative SNP markers based on the alignment against the two cape gooseberry references transcriptomes. Besides, 30,344 SNPs were identified based on alignment to the tomato reference genome. Genetic structure analysis showed that the population could be divided into two or three sub-groups, corresponding to landraces-anther culture and wild accessions for K = 2 and wild, landraces, and anther culture plants for K = 3. Association analysis was carried out using a Mixed Linear Model (MLM) and 34 SNP markers were significantly associated. These results reveal the basis of the genetic control of important agronomic traits and may facilitate marker-based breeding in P. peruviana .
García-Arias, Francy L.; Osorio-Guarín, Jaime A.; Núñez Zarantes, Victor M.
2018-01-01
Association mapping has been proposed as an efficient approach to assist plant breeding programs to investigate the genetic basis of agronomic traits. In this study, we evaluated 18 traits related to yield, (FWP, NF, FWI, and FWII), fruit size-shape (FP, FA, MW, WMH, MH, HMW, DI, FSI, FSII, OVO, OBO), and fruit quality (FIR, CF, and SST), in a diverse collection of 100 accessions of Physalis peruviana including wild, landraces, and anther culture derived lines. We identified seven accessions with suitable traits: fruit weight per plant (FWP) > 7,000 g/plant and cracked fruits (CF) < 4%, to be used as parents in cape gooseberry breeding program. In addition, the accessions were also characterized using Genotyping By Sequencing (GBS). We discovered 27,982 and 36,142 informative SNP markers based on the alignment against the two cape gooseberry references transcriptomes. Besides, 30,344 SNPs were identified based on alignment to the tomato reference genome. Genetic structure analysis showed that the population could be divided into two or three sub-groups, corresponding to landraces-anther culture and wild accessions for K = 2 and wild, landraces, and anther culture plants for K = 3. Association analysis was carried out using a Mixed Linear Model (MLM) and 34 SNP markers were significantly associated. These results reveal the basis of the genetic control of important agronomic traits and may facilitate marker-based breeding in P. peruviana. PMID:29616069
Too Many Referrals of Low-risk Women for BRCA1/2 Genetic Services by Family Physicians
White, Della Brown; Bonham, Vence L.; Jenkins, Jean; Stevens, Nancy; McBride, Colleen M.
2009-01-01
Increasing availability and public awareness of BRCA1/2 genetic testing will increase women’s self-referrals to genetic services. The objective of this study was to examine whether patient characteristics influence family physicians’ (FPs’) referral decisions when a patient requests BRCA1/2 genetic testing. FPs (n = 284) completed a web-based survey in 2006 to assess their attitudes and practices related to using genetics in their clinical practice. Using a 2×2×2 factorial design we tested the effects of a hypothetical patient’s race, level of worry and insurance status on FPs’ decisions to refer her for BRCA1/2 testing. The patient was not appropriate for referral based on USPSTF guidelines. No patient characteristics were associated with FPs’ referral decisions. Although referral was not indicated, only 8% did not refer to genetic services, 92% referred for genetic services, and 50% referred to genetic counseling. FPs regarded it unlikely that the patient carried a mutation. However, 65% of FPs believed if they refused to refer for genetic services it would harm their relationship with the patient. Despite scarce and costly genetic services FPs were likely to inappropriately refer a low-risk patient who requested BRCA1/2 testing. The implications of this inappropriate referral on women’s screening behavior, genetic services, and health care costs are unknown. Clinicians and patients could benefit from education about appropriate use of genetic services so that both are more comfortable with a decision against referral. PMID:18990739
Turner, Leslie M; Harr, Bettina
2014-01-01
Mapping hybrid defects in contact zones between incipient species can identify genomic regions contributing to reproductive isolation and reveal genetic mechanisms of speciation. The house mouse features a rare combination of sophisticated genetic tools and natural hybrid zones between subspecies. Male hybrids often show reduced fertility, a common reproductive barrier between incipient species. Laboratory crosses have identified sterility loci, but each encompasses hundreds of genes. We map genetic determinants of testis weight and testis gene expression using offspring of mice captured in a hybrid zone between M. musculus musculus and M. m. domesticus. Many generations of admixture enables high-resolution mapping of loci contributing to these sterility-related phenotypes. We identify complex interactions among sterility loci, suggesting multiple, non-independent genetic incompatibilities contribute to barriers to gene flow in the hybrid zone. DOI: http://dx.doi.org/10.7554/eLife.02504.001 PMID:25487987
Chen, Yingnan; Wang, Tiantian; Fang, Lecheng; Li, Xiaoping; Yin, Tongming
2016-01-01
In this study, we constructed high-density genetic maps of Salix suchowensis and mapped the gender locus with an F1 pedigree. Genetic maps were separately constructed for the maternal and paternal parents by using amplified fragment length polymorphism (AFLP) markers and the pseudo-testcross strategy. The maternal map consisted of 20 linkage groups that spanned a genetic distance of 2333.3 cM; whereas the paternal map contained 21 linkage groups that covered 2260 cM. Based on the established genetic maps, it was found that the gender of willow was determined by a single locus on linkage group LG_03, and the female was the heterogametic gender. Aligned with mapped SSR markers, linkage group LG_03 was found to be associated with chromosome XV in willow. It is noteworthy that marker density in the vicinity of the gender locus was significantly higher than that expected by chance alone, which indicates severe recombination suppression around the gender locus. In conclusion, this study confirmed the findings on the single-locus sex determination and female heterogamety in willow. It also provided additional evidence that validated the previous studies, which found that different autosomes evolved into sex chromosomes between the sister genera of Salix (willow) and Populus (poplar).
Fang, Lecheng; Li, Xiaoping; Yin, Tongming
2016-01-01
In this study, we constructed high-density genetic maps of Salix suchowensis and mapped the gender locus with an F1 pedigree. Genetic maps were separately constructed for the maternal and paternal parents by using amplified fragment length polymorphism (AFLP) markers and the pseudo-testcross strategy. The maternal map consisted of 20 linkage groups that spanned a genetic distance of 2333.3 cM; whereas the paternal map contained 21 linkage groups that covered 2260 cM. Based on the established genetic maps, it was found that the gender of willow was determined by a single locus on linkage group LG_03, and the female was the heterogametic gender. Aligned with mapped SSR markers, linkage group LG_03 was found to be associated with chromosome XV in willow. It is noteworthy that marker density in the vicinity of the gender locus was significantly higher than that expected by chance alone, which indicates severe recombination suppression around the gender locus. In conclusion, this study confirmed the findings on the single-locus sex determination and female heterogamety in willow. It also provided additional evidence that validated the previous studies, which found that different autosomes evolved into sex chromosomes between the sister genera of Salix (willow) and Populus (poplar). PMID:26828940
Zhang, Yanxin; Wang, Linhai; Gao, Yuan; Li, Donghua; Yu, Jingyin; Zhou, Rong; Zhang, Xiurong
2018-06-14
As an important oil crop, growth habit of sesame (Sesamum indicum L.) is naturally indeterminate, which brings about asynchronous maturity of capsules and causes loss of yield. The genetic basis of determinate growth habit in sesame was investigated by classical genetic analysis through multiple populations, results revealed that it was controlled by an unique recessive gene. The genotyping by sequencing (GBS) approach was employed for high-throughput SNP identification and genotyping in the F 2 population, then a high density bin map was constructed, the map was 1086.403 cM in length, which consisted of 1184 bins (13,679 SNPs), with an average of 0.918 cM between adjacent bins. Based on bin mapping in conjunction with SSR markers analysis in targeted region, the novel sesame determinacy gene was mapped on LG09 in a genome region of 41 kb. This study dissected genetic basis of determinate growth habit in sesame, constructed a new high-density bin map and mapped a novel determinacy gene. Results of this study demonstrate that we employed an optimized approach to get fine-accuracy, high-resolution and high-efficiency mapping result in sesame. The findings provided important foundation for sesame determinacy gene cloning and were expected to be applied in breeding for cultivars suited to mechanized production.
Genetics Home Reference: ornithine translocase deficiency
... Diagnosis of Japanese patients with HHH syndrome by molecular genetic analysis: a common mutation, R179X. J Hum Genet. ... M, Fariello G, Dionisi-Vici C. Clinical and molecular findings in hyperornithinemia-hyperammonemia-homocitrullinuria ... Bulletins Genetics Home Reference Celebrates Its ...
Innate immunity and the new forward genetics.
Beutler, Bruce
2016-12-01
As it is a hard-wired system for responses to microbes, innate immunity is particularly susceptible to classical genetic analysis. Mutations led the way to the discovery of many of the molecular elements of innate immune sensing and signaling pathways. In turn, the need for a faster way to find the molecular causes of mutation-induced phenotypes triggered a huge transformation in forward genetics. During the 1980s and 1990s, many heritable phenotypes were ascribed to mutations through positional cloning. In mice, this required three steps. First, a genetic mapping step was used to show that a given phenotype emanated from a circumscribed region of the genome. Second, a physical mapping step was undertaken, in which all of the region was cloned and its gene content determined. Finally, a concerted search for the mutation was performed. Such projects usually lasted for several years, but could produce breakthroughs in our understanding of biological processes. Publication of the annotated mouse genome sequence in 2002 made physical mapping unnecessary. More recently we devised a new technology for automated genetic mapping, which eliminated both genetic mapping and the search for mutations among candidate genes. The cause of phenotype can now be determined instantaneously. We have created more than 100,000 coding/splicing mutations. And by screening for defects of innate and adaptive immunity we have discovered many "new" proteins needed for innate immune function. Copyright © 2016 Elsevier Ltd. All rights reserved.
Innate immunity and the new forward genetics
Beutler, Bruce
2016-01-01
As it is a hard-wired system for responses to microbes, innate immunity is particularly susceptible to classical genetic analysis. Mutations led the way to the discovery of many of the molecular elements of innate immune sensing and signaling pathways. In turn, the need for a faster way to find the molecular causes of mutation-induced phenotypes triggered a huge transformation in forward genetics. During the 1980s and 1990s, many heritable phenotypes were ascribed to mutations through positional cloning. In mice, this required three steps. First, a genetic mapping step was used to show that a given phenotype emanated from a circumscribed region of the genome. Second, a physical mapping step was undertaken, in which all of the region was cloned and its gene content determined. Finally, a concerted search for the mutation was performed. Such projects usually lasted for several years, but could produce breakthroughs in our understanding of biological processes. Publication of the annotated mouse genome sequence in 2002 made physical mapping unnecessary. More recently we devised a new technology for automated genetic mapping, which eliminated both genetic mapping and the search for mutations among candidate genes. The cause of phenotype can now be determined instantaneously. We have created more than 100,000 coding/splicing mutations. And by screening for defects of innate and adaptive immunity we have discovered many “new” proteins needed for innate immune function. PMID:27890263
CloudAligner: A fast and full-featured MapReduce based tool for sequence mapping.
Nguyen, Tung; Shi, Weisong; Ruden, Douglas
2011-06-06
Research in genetics has developed rapidly recently due to the aid of next generation sequencing (NGS). However, massively-parallel NGS produces enormous amounts of data, which leads to storage, compatibility, scalability, and performance issues. The Cloud Computing and MapReduce framework, which utilizes hundreds or thousands of shared computers to map sequencing reads quickly and efficiently to reference genome sequences, appears to be a very promising solution for these issues. Consequently, it has been adopted by many organizations recently, and the initial results are very promising. However, since these are only initial steps toward this trend, the developed software does not provide adequate primary functions like bisulfite, pair-end mapping, etc., in on-site software such as RMAP or BS Seeker. In addition, existing MapReduce-based applications were not designed to process the long reads produced by the most recent second-generation and third-generation NGS instruments and, therefore, are inefficient. Last, it is difficult for a majority of biologists untrained in programming skills to use these tools because most were developed on Linux with a command line interface. To urge the trend of using Cloud technologies in genomics and prepare for advances in second- and third-generation DNA sequencing, we have built a Hadoop MapReduce-based application, CloudAligner, which achieves higher performance, covers most primary features, is more accurate, and has a user-friendly interface. It was also designed to be able to deal with long sequences. The performance gain of CloudAligner over Cloud-based counterparts (35 to 80%) mainly comes from the omission of the reduce phase. In comparison to local-based approaches, the performance gain of CloudAligner is from the partition and parallel processing of the huge reference genome as well as the reads. The source code of CloudAligner is available at http://cloudaligner.sourceforge.net/ and its web version is at http://mine.cs.wayne.edu:8080/CloudAligner/. Our results show that CloudAligner is faster than CloudBurst, provides more accurate results than RMAP, and supports various input as well as output formats. In addition, with the web-based interface, it is easier to use than its counterparts.
K.D. Jermstad; D.L. Bassoni; N.C. Wheeler; D.B. Neale
1998-01-01
We have constructed a sex-averaged genetic linkage map in coastal Douglas-fir ( Pseudotsuga menziesii [Mirb.] Franco var menziesii) using a three-generation outcrossed pedigree and molecular markers. Our research objectives are to learn about genome organization and to identify markers associated with adaptive traits. The map...
Kathleen D. Jermstad; Andrew J. Eckert; Jill L. Wegrzyn; Annette Delfino-Mix; Dean A Davis; Deems C. Burton; David B. Neale
2011-01-01
The majority of genomic research in conifers has been conducted in the Pinus subgenus Pinus mostly due to the high economic importance of the species within this taxon. Genetic maps have been constructed for several of these pines and comparative mapping analyses have consistently revealed notable synteny. In contrast,...
USDA-ARS?s Scientific Manuscript database
High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement; high-density linkage maps are especially important in paleopolyploids with exce...
9. international mouse genome conference
DOE Office of Scientific and Technical Information (OSTI.GOV)
NONE
This conference was held November 12--16, 1995 in Ann Arbor, Michigan. The purpose of this conference was to provide a multidisciplinary forum for exchange of state-of-the-art information on genetic mapping in mice. This report contains abstracts of presentations, focusing on the following areas: mutation identification; comparative mapping; informatics and complex traits; mutagenesis; gene identification and new technology; and genetic and physical mapping.
Newborn Screening: MedlinePlus Health Topic
... deficiency (National Library of Medicine) Genetics Home Reference: glutaric acidemia type I (National Library of Medicine) Genetics Home Reference: glutaric acidemia type II (National Library of Medicine) Genetics ...
Hehir-Kwa, Jayne Y; Marschall, Tobias; Kloosterman, Wigard P; Francioli, Laurent C; Baaijens, Jasmijn A; Dijkstra, Louis J; Abdellaoui, Abdel; Koval, Vyacheslav; Thung, Djie Tjwan; Wardenaar, René; Renkens, Ivo; Coe, Bradley P; Deelen, Patrick; de Ligt, Joep; Lameijer, Eric-Wubbo; van Dijk, Freerk; Hormozdiari, Fereydoun; Uitterlinden, André G; van Duijn, Cornelia M; Eichler, Evan E; de Bakker, Paul I W; Swertz, Morris A; Wijmenga, Cisca; van Ommen, Gert-Jan B; Slagboom, P Eline; Boomsma, Dorret I; Schönhuth, Alexander; Ye, Kai; Guryev, Victor
2016-10-06
Structural variation (SV) represents a major source of differences between individual human genomes and has been linked to disease phenotypes. However, the majority of studies provide neither a global view of the full spectrum of these variants nor integrate them into reference panels of genetic variation. Here, we analyse whole genome sequencing data of 769 individuals from 250 Dutch families, and provide a haplotype-resolved map of 1.9 million genome variants across 9 different variant classes, including novel forms of complex indels, and retrotransposition-mediated insertions of mobile elements and processed RNAs. A large proportion are previously under reported variants sized between 21 and 100 bp. We detect 4 megabases of novel sequence, encoding 11 new transcripts. Finally, we show 191 known, trait-associated SNPs to be in strong linkage disequilibrium with SVs and demonstrate that our panel facilitates accurate imputation of SVs in unrelated individuals.
USDA-ARS?s Scientific Manuscript database
Sorghum is the second cereal crop to have a full genome completely sequenced (Nature (2009), 457:551). This achievement is widely recognized as a scientific milestone for grass genetics and genomics in general. However, the true worth of genetic information lies in translating the sequence informa...
Gramene 2016: comparative plant genomics and pathway resources.
Tello-Ruiz, Marcela K; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A; Huerta, Laura; Keays, Maria; Tang, Y Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J; Jaiswal, Pankaj; Ware, Doreen
2016-01-04
Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼ 200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials. Published by Oxford University Press on behalf of Nucleic Acids Research 2015. This work is written by (a) US Government employee(s) and is in the public domain in the US.
A mapping closure for turbulent scalar mixing using a time-evolving reference field
NASA Technical Reports Server (NTRS)
Girimaji, Sharath S.
1992-01-01
A general mapping-closure approach for modeling scalar mixing in homogeneous turbulence is developed. This approach is different from the previous methods in that the reference field also evolves according to the same equations as the physical scalar field. The use of a time-evolving Gaussian reference field results in a model that is similar to the mapping closure model of Pope (1991), which is based on the methodology of Chen et al. (1989). Both models yield identical relationships between the scalar variance and higher-order moments, which are in good agreement with heat conduction simulation data and can be consistent with any type of epsilon(phi) evolution. The present methodology can be extended to any reference field whose behavior is known. The possibility of a beta-pdf reference field is explored. The shortcomings of the mapping closure methods are discussed, and the limit at which the mapping becomes invalid is identified.
NASA Technical Reports Server (NTRS)
Huang, Dong; Yang, Wenze; Tan, Bin; Rautiainen, Miina; Zhang, Ping; Hu, Jiannan; Shabanov, Nikolay V.; Linder, Sune; Knyazikhin, Yuri; Myneni, Ranga B.
2006-01-01
The validation of moderate-resolution satellite leaf area index (LAI) products such as those operationally generated from the Moderate Resolution Imaging Spectroradiometer (MODIS) sensor data requires reference LAI maps developed from field LAI measurements and fine-resolution satellite data. Errors in field measurements and satellite data determine the accuracy of the reference LAI maps. This paper describes a method by which reference maps of known accuracy can be generated with knowledge of errors in fine-resolution satellite data. The method is demonstrated with data from an international field campaign in a boreal coniferous forest in northern Sweden, and Enhanced Thematic Mapper Plus images. The reference LAI map thus generated is used to assess modifications to the MODIS LAI/fPAR algorithm recently implemented to derive the next generation of the MODIS LAI/fPAR product for this important biome type.
Bossolini, Eligio; Klahre, Ulrich; Brandenburg, Anna; Reinhardt, Didier; Kuhlemeier, Cris
2011-04-01
Two linkage maps were constructed for the model plant Petunia. Mapping populations were obtained by crossing the wild species Petunia axillaris subsp. axillaris with Petunia inflata, and Petunia axillaris subsp. parodii with Petunia exserta. Both maps cover the seven chromosomes of Petunia, and span 970 centimorgans (cM) and 700 cM of the genomes, respectively. In total, 207 markers were mapped. Of these, 28 are multilocus amplified fragment length polymorphism (AFLP) markers and 179 are gene-derived markers. For the first time we report on the development and mapping of 83 Petunia microsatellites. The two maps retain the same marker order, but display significant differences of recombination frequencies at orthologous mapping intervals. A complex pattern of genomic rearrangements was detected with the related genome of tomato (Solanum lycopersicum), indicating that synteny between Petunia and other Solanaceae crops has been considerably disrupted. The newly developed markers will facilitate the genetic characterization of mutants and ecological studies on genetic diversity and speciation within the genus Petunia. The maps will provide a powerful tool to link genetic and genomic information and will be useful to support sequence assembly of the Petunia genome.
Sim, Sheina B.; Geib, Scott M.
2017-01-01
Genetic sexing strains (GSS) used in sterile insect technique (SIT) programs are textbook examples of how classical Mendelian genetics can be directly implemented in the management of agricultural insect pests. Although the foundation of traditionally developed GSS are single locus, autosomal recessive traits, their genetic basis are largely unknown. With the advent of modern genomic techniques, the genetic basis of sexing traits in GSS can now be further investigated. This study is the first of its kind to integrate traditional genetic techniques with emerging genomics to characterize a GSS using the tephritid fruit fly pest Bactrocera cucurbitae as a model. These techniques include whole-genome sequencing, the development of a mapping population and linkage map, and quantitative trait analysis. The experiment designed to map the genetic sexing trait in B. cucurbitae, white pupae (wp), also enabled the generation of a chromosome-scale genome assembly by integrating the linkage map with the assembly. Quantitative trait loci analysis revealed SNP loci near position 42 MB on chromosome 3 to be tightly linked to wp. Gene annotation and synteny analysis show a near perfect relationship between chromosomes in B. cucurbitae and Muller elements A–E in Drosophila melanogaster. This chromosome-scale genome assembly is complete, has high contiguity, was generated using a minimal input DNA, and will be used to further characterize the genetic mechanisms underlying wp. Knowledge of the genetic basis of genetic sexing traits can be used to improve SIT in this species and expand it to other economically important Diptera. PMID:28450369
Casjens, S.; Eppler, K.; Sampson, L.; Parr, R.; Wyckoff, E.
1991-01-01
The mechanism by which dsDNA is packaged by viruses is not yet understood in any system. Bacteriophage P22 has been a productive system in which to study the molecular genetics of virus particle assembly and DNA packaging. Only five phage encoded proteins, the products of genes 3, 2, 1, 8 and 5, are required for packaging the virus chromosome inside the coat protein shell. We report here the construction of a detailed genetic and physical map of these genes, the neighboring gene 4 and a portion of gene 10, in which 289 conditional lethal amber, opal, temperature sensitive and cold sensitive mutations are mapped into 44 small (several hundred base pair) intervals of known sequence. Knowledge of missense mutant phenotypes and information on the location of these mutations allows us to begin the assignment of partial protein functions to portions of these genes. The map and mapping strains will be of use in the further genetic dissection of the P22 DNA packaging and prohead assembly processes. PMID:2029965
Cloud computing-based TagSNP selection algorithm for human genome data.
Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling
2015-01-05
Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.
Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data
Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling
2015-01-01
Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used. PMID:25569088
Sánchez-Sevilla, José F.; Horvath, Aniko; Botella, Miguel A.; Gaston, Amèlia; Folta, Kevin; Kilian, Andrzej; Denoyes, Beatrice; Amaya, Iraida
2015-01-01
Cultivated strawberry (Fragaria × ananassa) is a genetically complex allo-octoploid crop with 28 pairs of chromosomes (2n = 8x = 56) for which a genome sequence is not yet available. The diploid Fragaria vesca is considered the donor species of one of the octoploid sub-genomes and its available genome sequence can be used as a reference for genomic studies. A wide number of strawberry cultivars are stored in ex situ germplasm collections world-wide but a number of previous studies have addressed the genetic diversity present within a limited number of these collections. Here, we report the development and application of two platforms based on the implementation of Diversity Array Technology (DArT) markers for high-throughput genotyping in strawberry. The first DArT microarray was used to evaluate the genetic diversity of 62 strawberry cultivars that represent a wide range of variation based on phenotype, geographical and temporal origin and pedigrees. A total of 603 DArT markers were used to evaluate the diversity and structure of the population and their cluster analyses revealed that these markers were highly efficient in classifying the accessions in groups based on historical, geographical and pedigree-based cues. The second DArTseq platform took benefit of the complexity reduction method optimized for strawberry and the development of next generation sequencing technologies. The strawberry DArTseq was used to generate a total of 9,386 SNP markers in the previously developed ‘232’ × ‘1392’ mapping population, of which, 4,242 high quality markers were further selected to saturate this map after several filtering steps. The high-throughput platforms here developed for genotyping strawberry will facilitate genome-wide characterizations of large accessions sets and complement other available options. PMID:26675207
CrowdPhase: crowdsourcing the phase problem
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jorda, Julien; Sawaya, Michael R.; Yeates, Todd O., E-mail: yeates@mbi.ucla.edu
The idea of attacking the phase problem by crowdsourcing is introduced. Using an interactive, multi-player, web-based system, participants work simultaneously to select phase sets that correspond to better electron-density maps in order to solve low-resolution phasing problems. The human mind innately excels at some complex tasks that are difficult to solve using computers alone. For complex problems amenable to parallelization, strategies can be developed to exploit human intelligence in a collective form: such approaches are sometimes referred to as ‘crowdsourcing’. Here, a first attempt at a crowdsourced approach for low-resolution ab initio phasing in macromolecular crystallography is proposed. A collaborativemore » online game named CrowdPhase was designed, which relies on a human-powered genetic algorithm, where players control the selection mechanism during the evolutionary process. The algorithm starts from a population of ‘individuals’, each with a random genetic makeup, in this case a map prepared from a random set of phases, and tries to cause the population to evolve towards individuals with better phases based on Darwinian survival of the fittest. Players apply their pattern-recognition capabilities to evaluate the electron-density maps generated from these sets of phases and to select the fittest individuals. A user-friendly interface, a training stage and a competitive scoring system foster a network of well trained players who can guide the genetic algorithm towards better solutions from generation to generation via gameplay. CrowdPhase was applied to two synthetic low-resolution phasing puzzles and it was shown that players could successfully obtain phase sets in the 30° phase error range and corresponding molecular envelopes showing agreement with the low-resolution models. The successful preliminary studies suggest that with further development the crowdsourcing approach could fill a gap in current crystallographic methods by making it possible to extract meaningful information in cases where limited resolution might otherwise prevent initial phasing.« less
Szpirer, C; Szpirer, J; Tissir, F; Stephanova, E; Vanvooren, P; Kurtz, T W; Iwai, N; Inagami, T; Pravenec, M; Kren, V; Klinga-Levan, K; Levan, G
1997-09-01
Seven genes were regionally localized on rat Chromosome (Chr) 1, from 1p11 to 1q42, and two of these genes were also included in a linkage map. This mapping work integrates the genetic linkage map and the cytogenetic map, and allows us to orient the linkage map with respect to the centromere, and to deduce the approximate position of the centromere in the linkage map. These mapping data also indicate that the Slc9a3 gene, encoding the Na+/H+ exchanger 3, is an unlikely candidate for the blood pressure loci assigned to rat Chr 1. These new localizations expand comparative mapping between rat Chr 1 and mouse or human chromosomes.
A Semiparametric Approach for Composite Functional Mapping of Dynamic Quantitative Traits
Yang, Runqing; Gao, Huijiang; Wang, Xin; Zhang, Ji; Zeng, Zhao-Bang; Wu, Rongling
2007-01-01
Functional mapping has emerged as a powerful tool for mapping quantitative trait loci (QTL) that control developmental patterns of complex dynamic traits. Original functional mapping has been constructed within the context of simple interval mapping, without consideration of separate multiple linked QTL for a dynamic trait. In this article, we present a statistical framework for mapping QTL that affect dynamic traits by capitalizing on the strengths of functional mapping and composite interval mapping. Within this so-called composite functional-mapping framework, functional mapping models the time-dependent genetic effects of a QTL tested within a marker interval using a biologically meaningful parametric function, whereas composite interval mapping models the time-dependent genetic effects of the markers outside the test interval to control the genome background using a flexible nonparametric approach based on Legendre polynomials. Such a semiparametric framework was formulated by a maximum-likelihood model and implemented with the EM algorithm, allowing for the estimation and the test of the mathematical parameters that define the QTL effects and the regression coefficients of the Legendre polynomials that describe the marker effects. Simulation studies were performed to investigate the statistical behavior of composite functional mapping and compare its advantage in separating multiple linked QTL as compared to functional mapping. We used the new mapping approach to analyze a genetic mapping example in rice, leading to the identification of multiple QTL, some of which are linked on the same chromosome, that control the developmental trajectory of leaf age. PMID:17947431
Genetic mapping of resistance to Fusarium oxysporum f. sp. tulipae in tulip.
Tang, Nan; van der Lee, Theo; Shahin, Arwa; Holdinga, Maarten; Bijman, Paul; Caser, Matteo; Visser, Richard G F; van Tuyl, Jaap M; Arens, Paul
Fusarium oxysporum is a major problem in the production of tulip bulbs. Breeding for resistant cultivars through a conventional approach is a slow process due to the long life cycle of tulip. Until now, marker-assisted selection (MAS) has been hampered by the large genome size and the absence of a genetic map. This study is aimed at construction of the first genetic map for tulip and at the identification of loci associated with resistance to F. oxysporum . A cross-pollinated population of 125 individuals segregating for Fusarium resistance was obtained from Tulipa gesneriana "Kees Nelis" and T. fosteriana "Cantata." Fusarium resistance of the mapping population was evaluated through a soil infection test in two consecutive years, and a spot inoculation test in which a green fluorescent protein tagged Fusarium strain was used for inoculation. The genetic maps have been constructed for the parents separately. The genetic map of "Kees Nelis" comprised 342 markers on 27 linkage groups covering 1707 cM, while the map of "Cantata" comprised 300 markers on 21 linkage groups covering 1201 cM. Median distance between markers was 3.9 cM for "Kees Nelis" and 3.1 cM for "Cantata." Six putative quantitative trait loci (QTLs) for Fusarium resistance were identified, derived from both parents. QTL2, QTL3, and QTL6 were significant in all disease tests. For the flanking markers of the QTLs, phenotypic means of the two allelic groups, segregating from a parent for such a marker, were significantly different. These markers will be useful for the development of MAS in tulip breeding.
Xu, Zhenzhen; Zhang, Chaojun; Ge, Xiaoyang; Wang, Ni; Zhou, Kehai; Yang, Xiaojie; Wu, Zhixia; Zhang, Xueyan; Liu, Chuanliang; Yang, Zuoren; Li, Changfeng; Liu, Kun; Yang, Zhaoen; Qian, Yuyuan; Li, Fuguang
2015-07-01
The first high-density linkage map was constructed to identify quantitative trait loci (QTLs) for somatic embryogenesis (SE) in cotton ( Gossypium hirsutum L.) using leaf petioles as explants. Cotton transformation is highly limited by only a few regenerable genotypes and the lack of understanding of the genetic and molecular basis of somatic embryogenesis (SE) in cotton (Gossypium hirsutum L.). To construct a more saturated linkage map and further identify quantitative trait loci (QTLs) for SE using leaf petioles as explants, a high embryogenesis frequency line (W10) from the commercial Chinese cotton cultivar CRI24 was crossed with TM-1, a genetic standard upland cotton with no embryogenesis frequency. The genetic map spanned 2300.41 cM in genetic distance and contained 411 polymorphic simple sequence repeat (SSR) loci. Of the 411 mapped loci, 25 were developed from unigenes identified for SE in our previous study. Six QTLs for SE were detected by composite interval mapping method, each explaining 6.88-37.07% of the phenotypic variance. Single marker analysis was also performed to verify the reliability of QTLs detection, and the SSR markers NAU3325 and DPL0209 were detected by the two methods. Further studies on the relatively stable and anchoring QTLs/markers for SE in an advanced population of W10 × TM-1 and other cross combinations with different SE abilities may shed light on the genetic and molecular mechanism of SE in cotton.
USDA-ARS?s Scientific Manuscript database
In this study, we generated a linkage map containing 1,151,856 high quality SNPs between Mo17 and B73, which were verified in the maize intermated B73'×'Mo17 (IBM) Syn10 population. This resource is an excellent complement to existing maize genetic maps available in an online database (iPlant, http:...
USDA-ARS?s Scientific Manuscript database
Genotyping by sequencing (GBS) technology was used to identify a set of 9,933 single nucleotide polymorphism (SNP) markers for constructing a high-resolution genetic map of 1,087 cM for watermelon. The genome-wide variation of recombination rate (GWRR) across the map was evaluated and a positive co...
Construction of Red Fox Chromosomal Fragments from the Short-Read Genome Assembly.
Rando, Halie M; Farré, Marta; Robson, Michael P; Won, Naomi B; Johnson, Jennifer L; Buch, Ronak; Bastounes, Estelle R; Xiang, Xueyan; Feng, Shaohong; Liu, Shiping; Xiong, Zijun; Kim, Jaebum; Zhang, Guojie; Trut, Lyudmila N; Larkin, Denis M; Kukekova, Anna V
2018-06-20
The genome of a red fox ( Vulpes vulpes ) was recently sequenced and assembled using next-generation sequencing (NGS). The assembly is of high quality, with 94X coverage and a scaffold N50 of 11.8 Mbp, but is split into 676,878 scaffolds, some of which are likely to contain assembly errors. Fragmentation and misassembly hinder accurate gene prediction and downstream analysis such as the identification of loci under selection. Therefore, assembly of the genome into chromosome-scale fragments was an important step towards developing this genomic model. Scaffolds from the assembly were aligned to the dog reference genome and compared to the alignment of an outgroup genome (cat) against the dog to identify syntenic sequences among species. The program Reference-Assisted Chromosome Assembly (RACA) then integrated the comparative alignment with the mapping of the raw sequencing reads generated during assembly against the fox scaffolds. The 128 sequence fragments RACA assembled were compared to the fox meiotic linkage map to guide the construction of 40 chromosomal fragments. This computational approach to assembly was facilitated by prior research in comparative mammalian genomics, and the continued improvement of the red fox genome can in turn offer insight into canid and carnivore chromosome evolution. This assembly is also necessary for advancing genetic research in foxes and other canids.
Thayer, Edward C.; Olson, Maynard V.; Karp, Richard M.
1999-01-01
Genetic and physical maps display the relative positions of objects or markers occurring within a target DNA molecule. In constructing maps, the primary objective is to determine the ordering of these objects. A further objective is to assign a coordinate to each object, indicating its distance from a reference end of the target molecule. This paper describes a computational method and a body of software for assigning coordinates to map objects, given a solution or partial solution to the ordering problem. We describe our method in the context of multiple–complete–digest (MCD) mapping, but it should be applicable to a variety of other mapping problems. Because of errors in the data or insufficient clone coverage to uniquely identify the true ordering of the map objects, a partial ordering is typically the best one can hope for. Once a partial ordering has been established, one often seeks to overlay a metric along the map to assess the distances between the map objects. This problem often proves intractable because of data errors such as erroneous local length measurements (e.g., large clone lengths on low-resolution physical maps). We present a solution to the coordinate assignment problem for MCD restriction-fragment mapping, in which a coordinated set of single-enzyme restriction maps are simultaneously constructed. We show that the coordinate assignment problem can be expressed as the solution of a system of linear constraints. If the linear system is free of inconsistencies, it can be solved using the standard Bellman–Ford algorithm. In the more typical case where the system is inconsistent, our program perturbs it to find a new consistent system of linear constraints, close to those of the given inconsistent system, using a modified Bellman–Ford algorithm. Examples are provided of simple map inconsistencies and the methods by which our program detects candidate data errors and directs the user to potential suspect regions of the map. PMID:9927487
Error and Uncertainty in the Accuracy Assessment of Land Cover Maps
NASA Astrophysics Data System (ADS)
Sarmento, Pedro Alexandre Reis
Traditionally the accuracy assessment of land cover maps is performed through the comparison of these maps with a reference database, which is intended to represent the "real" land cover, being this comparison reported with the thematic accuracy measures through confusion matrixes. Although, these reference databases are also a representation of reality, containing errors due to the human uncertainty in the assignment of the land cover class that best characterizes a certain area, causing bias in the thematic accuracy measures that are reported to the end users of these maps. The main goal of this dissertation is to develop a methodology that allows the integration of human uncertainty present in reference databases in the accuracy assessment of land cover maps, and analyse the impacts that uncertainty may have in the thematic accuracy measures reported to the end users of land cover maps. The utility of the inclusion of human uncertainty in the accuracy assessment of land cover maps is investigated. Specifically we studied the utility of fuzzy sets theory, more precisely of fuzzy arithmetic, for a better understanding of human uncertainty associated to the elaboration of reference databases, and their impacts in the thematic accuracy measures that are derived from confusion matrixes. For this purpose linguistic values transformed in fuzzy intervals that address the uncertainty in the elaboration of reference databases were used to compute fuzzy confusion matrixes. The proposed methodology is illustrated using a case study in which the accuracy assessment of a land cover map for Continental Portugal derived from Medium Resolution Imaging Spectrometer (MERIS) is made. The obtained results demonstrate that the inclusion of human uncertainty in reference databases provides much more information about the quality of land cover maps, when compared with the traditional approach of accuracy assessment of land cover maps. None
Zhao, Chuanzhi; Qiu, Jingjing; Agarwal, Gaurav; Wang, Jiangshan; Ren, Xuezhen; Xia, Han; Guo, Baozhu; Ma, Changle; Wan, Shubo; Bertioli, David J.; Varshney, Rajeev K.; Pandey, Manish K.; Wang, Xingjun
2017-01-01
Despite several efforts in the last decade toward development of simple sequence repeat (SSR) markers in peanut, there is still a need for more markers for conducting different genetic and breeding studies. With the effort of the International Peanut Genome Initiative, the availability of reference genome for both the diploid progenitors of cultivated peanut allowed us to identify 135,529 and 199,957 SSRs from the A (Arachis duranensis) and B genomes (Arachis ipaensis), respectively. Genome sequence analysis showed uneven distribution of the SSR motifs across genomes with variation in parameters such as SSR type, repeat number, and SSR length. Using the flanking sequences of identified SSRs, primers were designed for 51,354 and 60,893 SSRs with densities of 49 and 45 SSRs per Mb in A. duranensis and A. ipaensis, respectively. In silico PCR analysis of these SSR markers showed high transferability between wild and cultivated Arachis species. Two physical maps were developed for the A genome and the B genome using these SSR markers, and two reported disease resistance quantitative trait loci (QTLs), qF2TSWV5 for tomato spotted wilt virus (TSWV) and qF2LS6 for leaf spot (LS), were mapped in the 8.135 Mb region of chromosome A04 of A. duranensis. From this genomic region, 719 novel SSR markers were developed, which provide the possibility for fine mapping of these QTLs. In addition, this region also harbors 652 genes and 49 of these are defense related genes, including two NB-ARC genes, three LRR receptor-like genes and three WRKY transcription factors. These disease resistance related genes could contribute to resistance to viral (such as TSWV) and fungal (such as LS) diseases in peanut. In summary, this study not only provides a large number of molecular markers for potential use in peanut genetic map development and QTL mapping but also for map-based gene cloning and molecular breeding. PMID:28769940
Attitudes and Practices Among Internists Concerning Genetic Testing
Chung, Wendy; Marder, Karen; Shanmugham, Anita; Chin, Lisa J.; Stark, Meredith; Leu, Cheng-Shiun; Appelbaum, Paul S.
2012-01-01
Many questions remain concerning whether, when, and how physicians order genetic tests, and what factors are involved in their decisions. We surveyed 220 internists from two academic medical centers about their utilization of genetic testing. Rates of genetic utilizations varied widely by disease. Respondents were most likely to have ordered tests for Factor V Leiden (16.8%), followed by Breast/Ovarian Cancer (15.0%). In the past 6 months, 65% had counseled patients on genetic issues, 44% had ordered genetic tests, 38.5% had referred patients to a genetic counselor or geneticist, and 27.5% had received ads from commercial labs for genetic testing. Only 4.5% had tried to hide or disguise genetic information, and <2% have had patients report genetic discrimination. Only 53.4% knew of a geneticist/genetic counselor to whom to refer patients. Most rated their knowledge as very/somewhat poor concerning genetics (73.7%) and guidelines for genetic testing (87.1%). Most felt needs for more training on when to order tests (79%), and how to counsel patients (82%), interpret results (77.3%), and maintain privacy (80.6%). Physicians were more likely to have ordered a genetic test if patients inquired about genetic testing (p<.001), and if physicians had a geneticist/genetic counselor to whom to refer patients (p<.002), had referred patients to a geneticist/genetic counselor in the past 6 months, had more comfort counseling patients about testing (p<.019), counseled patients about genetics, larger practices (p<.032), fewer African-American patients (p<.027), and patients who had reported genetic discrimination (p<.044). In a multiple logistic regression, ordering a genetic test was associated with patients inquiring about testing, having referred patients to a geneticist/genetic counselor and knowing how to order tests., These data suggest that physicians recognize their knowledge deficits, and are interested in training. These findings have important implications for future medical practice, research, and education. PMID:22585186
Jairin, Jirapong; Kobayashi, Tetsuya; Yamagata, Yoshiyuki; Sanada-Morimura, Sachiyo; Mori, Kazuki; Tashiro, Kosuke; Kuhara, Satoru; Kuwazaki, Seigo; Urio, Masahiro; Suetsugu, Yoshitaka; Yamamoto, Kimiko; Matsumura, Masaya; Yasui, Hideshi
2013-01-01
In this study, we developed the first genetic linkage map for the major rice insect pest, the brown planthopper (BPH, Nilaparvata lugens). The linkage map was constructed by integrating linkage data from two backcross populations derived from three inbred BPH strains. The consensus map consists of 474 simple sequence repeats, 43 single-nucleotide polymorphisms, and 1 sequence-tagged site, for a total of 518 markers at 472 unique positions in 17 linkage groups. The linkage groups cover 1093.9 cM, with an average distance of 2.3 cM between loci. The average number of marker loci per linkage group was 27.8. The sex-linkage group was identified by exploiting X-linked and Y-specific markers. Our linkage map and the newly developed markers used to create it constitute an essential resource and a useful framework for future genetic analyses in BPH. PMID:23204257
Zhou, Gaofeng; Jian, Jianbo; Wang, Penghao; Li, Chengdao; Tao, Ye; Li, Xuan; Renshaw, Daniel; Clements, Jonathan; Sweetingham, Mark; Yang, Huaan
2018-01-01
An ultra-high density genetic map containing 34,574 sequence-defined markers was developed in Lupinus angustifolius. Markers closely linked to nine genes of agronomic traits were identified. A physical map was improved to cover 560.5 Mb genome sequence. Lupin (Lupinus angustifolius L.) is a recently domesticated legume grain crop. In this study, we applied the restriction-site associated DNA sequencing (RADseq) method to genotype an F 9 recombinant inbred line population derived from a wild type × domesticated cultivar (W × D) cross. A high density linkage map was developed based on the W × D population. By integrating sequence-defined DNA markers reported in previous mapping studies, we established an ultra-high density consensus genetic map, which contains 34,574 markers consisting of 3508 loci covering 2399 cM on 20 linkage groups. The largest gap in the entire consensus map was 4.73 cM. The high density W × D map and the consensus map were used to develop an improved physical map, which covered 560.5 Mb of genome sequence data. The ultra-high density consensus linkage map, the improved physical map and the markers linked to genes of breeding interest reported in this study provide a common tool for genome sequence assembly, structural genomics, comparative genomics, functional genomics, QTL mapping, and molecular plant breeding in lupin.
Genetic diversity in Monoporeia affinis at polluted and reference sites of the Baltic Bothnian Bay.
Guban, Peter; Wennerström, Lovisa; Elfwing, Tina; Sundelin, Brita; Laikre, Linda
2015-04-15
The amphipod Monoporeia affinis plays an important role in the Baltic Sea ecosystem as prey and as detritivore. The species is monitored for contaminant effects, but almost nothing is known about its genetics in this region. A pilot screening for genetic variation at the mitochondrial COI gene was performed in 113 individuals collected at six sites in the northern Baltic. Three coastal sites were polluted by pulp mill effluents, PAHs, and trace metals, and two coastal reference sites were without obvious connection to pollution sources. An off-coastal reference site was also included. Contaminated sites showed lower levels of genetic diversity than the coastal reference ones although the difference was not statistically significant. Divergence patterns measured as ΦST showed no significant differentiation within reference and polluted groups, but there was significant genetic divergence between them. The off-coastal sample differed significantly from all coastal sites and also showed lower genetic variation. Copyright © 2015 Elsevier Ltd. All rights reserved.
Zhu, Yufeng; Yin, Yanfei; Yang, Keqiang; Li, Jihong; Sang, Yalin; Huang, Long; Fan, Shu
2015-08-18
Walnut (Juglans regia, 2n = 32, approximately 606 Mb per 1C genome) is an economically important tree crop. Resistance to anthracnose, caused by Colletotrichum gloeosporioides, is a major objective of walnut genetic improvement in China. The recently developed specific length amplified fragment sequencing (SLAF-seq) is an efficient strategy that can obtain large numbers of markers with sufficient sequence information to construct high-density genetic maps and permits detection of quantitative trait loci (QTLs) for molecular breeding. SLAF-seq generated 161.64 M paired-end reads. 153,820 SLAF markers were obtained, of which 49,174 were polymorphic. 13,635 polymorphic markers were sorted into five segregation types and 2,577 markers of them were used to construct genetic linkage maps: 2,395 of these fell into 16 linkage groups (LGs) for the female map, 448 markers for the male map, and 2,577 markers for the integrated map. Taking into account the size of all LGs, the marker coverage was 2,664.36 cM for the female map, 1,305.58 cM for the male map, and 2,457.82 cM for the integrated map. The average intervals between two adjacent mapped markers were 1.11 cM, 2.91 cM and 0.95 cM for three maps, respectively. 'SNP_only' markers accounted for 89.25% of the markers on the integrated map. Mapping markers contained 5,043 single nucleotide polymorphisms (SNPs) loci, which corresponded to two SNP loci per SLAF marker. According to the integrated map, we used interval mapping (Logarithm of odds, LOD > 3.0) to detect our quantitative trait. One QTL was detected for anthracnose resistance. The interval of this QTL ranged from 165.51 cM to 176.33 cM on LG14, and ten markers in this interval that were above the threshold value were considered to be linked markers to the anthracnose resistance trait. The phenotypic variance explained by each marker ranged from 16.2 to 19.9%, and their LOD scores varied from 3.22 to 4.04. High-density genetic maps for walnut containing 16 LGs were constructed using the SLAF-seq method with an F1 population. One QTL for walnut anthracnose resistance was identified based on the map. The results will aid molecular marker-assisted breeding and walnut resistance genes identification.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Willard, H.F.; Cremers, F.; Mandel, J.L.
A high-quality integrated genetic and physical map of the X chromosome from telomere to telomere, based primarily on YACs formatted with probes and STSs, is increasingly close to reality. At the Fifth International X Chromosome Workshop, organized by A.M. Poustka and D. Schlessinger in Heidelberg, Germany, April 24--27, 1994, substantial progress was recorded on extension and refinement of the physical map, on the integration of genetic and cytogenetic data, on attempts to use the map to direct gene searches, and on nascent large-scale sequencing efforts. This report summarizes physical and genetic mapping information presented at the workshop and/or published sincemore » the reports of the fourth International X Chromosome Workshop. The principle aim of the workshop was to derive a consensus map of the chromosome, in terms of physical contigs emphasizing the location of genes and microsatellite markers. The resulting map is presented and updates previous versions. This report also updates the list of highly informative microsatellites. The text highlights the working state of the map, the genes known to reside on the X, and the progress toward integration of various types of data.« less
An Ultra-High-Density, Transcript-Based, Genetic Map of Lettuce
Truco, Maria José; Ashrafi, Hamid; Kozik, Alexander; van Leeuwen, Hans; Bowers, John; Wo, Sebastian Reyes Chin; Stoffel, Kevin; Xu, Huaqin; Hill, Theresa; Van Deynze, Allen; Michelmore, Richard W.
2013-01-01
We have generated an ultra-high-density genetic map for lettuce, an economically important member of the Compositae, consisting of 12,842 unigenes (13,943 markers) mapped in 3696 genetic bins distributed over nine chromosomal linkage groups. Genomic DNA was hybridized to a custom Affymetrix oligonucleotide array containing 6.4 million features representing 35,628 unigenes of Lactuca spp. Segregation of single-position polymorphisms was analyzed using 213 F7:8 recombinant inbred lines that had been generated by crossing cultivated Lactuca sativa cv. Salinas and L. serriola acc. US96UC23, the wild progenitor species of L. sativa. The high level of replication of each allele in the recombinant inbred lines was exploited to identify single-position polymorphisms that were assigned to parental haplotypes. Marker information has been made available using GBrowse to facilitate access to the map. This map has been anchored to the previously published integrated map of lettuce providing candidate genes for multiple phenotypes. The high density of markers achieved in this ultradense map allowed syntenic studies between lettuce and Vitis vinifera as well as other plant species. PMID:23550116
An Ultra-High-Density, Transcript-Based, Genetic Map of Lettuce.
Truco, Maria José; Ashrafi, Hamid; Kozik, Alexander; van Leeuwen, Hans; Bowers, John; Wo, Sebastian Reyes Chin; Stoffel, Kevin; Xu, Huaqin; Hill, Theresa; Van Deynze, Allen; Michelmore, Richard W
2013-04-09
We have generated an ultra-high-density genetic map for lettuce, an economically important member of the Compositae, consisting of 12,842 unigenes (13,943 markers) mapped in 3696 genetic bins distributed over nine chromosomal linkage groups. Genomic DNA was hybridized to a custom Affymetrix oligonucleotide array containing 6.4 million features representing 35,628 unigenes of Lactuca spp. Segregation of single-position polymorphisms was analyzed using 213 F 7:8 recombinant inbred lines that had been generated by crossing cultivated Lactuca sativa cv. Salinas and L. serriola acc. US96UC23, the wild progenitor species of L. sativa The high level of replication of each allele in the recombinant inbred lines was exploited to identify single-position polymorphisms that were assigned to parental haplotypes. Marker information has been made available using GBrowse to facilitate access to the map. This map has been anchored to the previously published integrated map of lettuce providing candidate genes for multiple phenotypes. The high density of markers achieved in this ultradense map allowed syntenic studies between lettuce and Vitis vinifera as well as other plant species. Copyright © 2013 Truco et al.